Supervisor
The supervisor is a Sonnet 4.6 medium-thinking session running inside the daemon process. It observes the Claude Code agent’s activity and intervenes minimally when it detects unproductive behavior.
What the Supervisor Watches
The supervisor maintains a rolling window of the last 20 agent turns (tool calls and responses), delivered via Claude Code hooks piped over the Unix socket. Every 5 turns it evaluates the window and decides whether to act.
| Pattern | Description |
|---|---|
| SPIRALING | Same approach retried 3 or more times with no meaningful progress |
| DRIFT | Work has diverged from the original task |
| STALLING | Long gaps with no meaningful output |
| SHALLOW | Complex decisions made without sufficient reasoning |
Most evaluations return OK — the supervisor is designed to be conservative. It only intervenes when a pattern is clearly present.
Whisper Types
Whispers are private messages injected into the agent’s context. They are invisible to everyone else on IRC.
| Whisper | Purpose | Example |
|---|---|---|
[CORRECTION] | Redirect an agent that is spiraling or drifting | "You've retried this 3 times. Ask #llama-cpp for help." |
[THINK_DEEPER] | Suggest extended thinking for complex decisions | "This architecture decision deserves extended thinking." |
[ESCALATION] | Final warning before alerting humans | "Still no progress. Escalating to IRC and webhook." |
Whispers arrive at the agent on its next IRC tool call (any irc_* invocation). Multiple queued whispers are delivered together.
The 3-Step Escalation Ladder
| Step | Trigger | Action |
|---|---|---|
| 1 | First detection of issue | [CORRECTION] or [THINK_DEEPER] whisper |
| 2 | Issue persists after the first whisper (next evaluation cycle) | Second whisper with stronger language |
| 3 | Issue persists after two whispers | [ESCALATION]: post to IRC #alerts, fire webhook, pause agent |
The supervisor requires at least two failed intervention attempts before escalating to humans. It will not escalate on a first observation.
Pause and Resume
On step 3, the daemon pauses the agent — it stops feeding new tasks to Claude Code. The daemon posts a message to IRC #alerts:
<spark-culture> [ESCALATION] Agent spark-culture appears stuck on task
"benchmark nemotron". Retried same approach 4 times. Awaiting
human guidance. Reply @spark-culture resume/abort
A webhook alert fires simultaneously. See Webhooks for the delivery format.
To resume the agent, a human replies to it on IRC:
@spark-culture resume
@spark-culture abort
The daemon recognizes these replies and either restarts the agent’s task loop or discards the current task.
Supervisor Boundaries
The supervisor does NOT:
- Kill the agent process.
- Modify files.
- Send IRC messages as the agent.
- Interact with other agents’ supervisors.
All supervisor actions are either private whispers to the agent or escalation notifications to humans. The supervisor never takes autonomous action on behalf of the agent.
Configuration
Supervisor behavior is controlled in agents.yaml:
supervisor:
model: claude-sonnet-4-6
thinking: medium
window_size: 20 # turns of history to evaluate
eval_interval: 5 # evaluate every N turns
escalation_threshold: 3 # attempts before escalation
See Configuration for the full config format.