AI Agent Platformfor SRE

Specialized AI agents for SRE. Deep investigation by default — read-only, open source.

Multi-AgentRead-Only by DefaultHypothesis-DrivenLearns From Every Incident

How It Works

Investigate, learn, repeat

A continuous loop that gets smarter with every incident

STEP 01

Describe

Tell Siclaw what's wrong — any channel

STEP 02

Investigate

4-phase engine: collect, hypothesize, validate in parallel, conclude

3 parallel sub-agents
STEP 03

Report

Structured root cause with evidence and confidence scores

STEP 04

Learn

Every investigation feeds memory — next time is smarter

Repeat — each time smarter

Multi-agent

One workspace, many specialists

Each agent owns a domain — Kubernetes, networking, system. Its own scope, skills, knowledge, and memory. They collaborate in Deep Investigation mode to solve issues that span layers.

active

k8s-agent

Kubernetes domain

Investigates pods, deployments, and cluster events.

Scoped to

K8s API surface
kubectl toolkit
K8s playbooks
active

network-agent

Networking domain

Traces latency, packet loss, DNS and routing issues.

Scoped to

Pod + Service network
Network probes
Network runbooks
active

system-agent

System / OS domain

Inspects CPU, memory, disk, and kernel-level faults.

Scoped to

Node + kernel
System diagnostics
OS runbooks

Isolated by default, collaborative on demand. A pod restart in k8s-agent can hand off to network-agent for latency checks, then to system-agent for kernel signals — one investigation, three specialists.

Capabilities

Built for production

Multi-Agent Workspace

Specialized agents per domain. Each has its own scope, skills, knowledge, and memory.

Skill System

Per-agent diagnostic scripts with mandatory review. Fork and share across teams.

Knowledge Library

Per-agent wiki of runbooks and docs. Each specialist reads only what its domain needs.

Security First

Read-only investigation by default, with controlled execution when needed.

Alert-Driven Channels

Connect Siclaw to team channels and trigger investigations from alerts or operator input.

Cron Patrols

Schedule health checks in natural language. "Check GPU every 6h" just works.

Integrations

Connects to your stack

Built-in channels and Kubernetes. Extend to anything via Model Context Protocol.

Infrastructure
Kubernetes
Messaging
SlackDiscordTelegramLark
Observability
Prometheusvia MCPGrafanavia MCPElasticsearchvia MCPLokivia MCP
Alerting
PagerDutyvia MCPAlertmanagervia MCP
Dev
GitHubvia MCPGitLabvia MCP

Get Started

Start in seconds, scale to enterprise

$npm install -g siclaw
$siclaw local