Claude Code Kit

Features · Senior-dev agents

Senior-dev agents

Scoped subagents that Claude delegates to. Each has its own tool allowlist, system prompt, and fresh context window per invocation. Stateless — continuity lives in artifacts, not agent memory.

Installed agents

One job each.

code-reviewer

Reviews changes for the Next.js + Supabase failure modes. Knows to check RLS on new tables, credential shapes in diffs, Server Action return types.

When to reach for it
Before committing a multi-file change. Especially anything touching the database, authentication, or server-side code.
Tools available
Read, Grep, Glob, Bash

planner

Drafts implementation plans before code. Sole writer to __plan/ so plan state has one canonical home.

When to reach for it
When a task touches more than two files, or when scope is unclear. Planning before coding is not optional for complex work.
Tools available
Read, Edit, Write (in __plan/ only), Grep, Glob

test-runner

Runs the test suite, interprets failures, suggests minimal fixes. Doesn't write code — returns a verdict plus the relevant failure block.

When to reach for it
After any change you want confidence on before merging. The test-runner reads failures honestly rather than rationalising around them.
Tools available
Read, Bash

security-reviewer

Read-only auditor for license, URL, hash, and consent surfaces. No memory across invocations — fresh eyes every time.

When to reach for it
Before merging anything that touches authentication, secrets handling, or session logic. Intentionally stateless so prior approvals don't bias it.
Tools available
Read, Grep, Glob, Bash

The 7-agent default set

The full framework, not just the installed four.

The kit installs four agents for the Next.js + Supabase stack. The underlying framework ships seven standard roles. Any project can instantiate the full set — or add its own specialists.

planner

Sole writer to __plan/. Enforces Protocol 0 audit before any epic. Tools: Read, Edit/Write (in __plan/ only), Grep, Glob.

docs-writer

Drafts task files and source-of-truth from planner outlines. Never decides scope. Tools: Read, Edit, Write, Grep, Glob. Model: Sonnet.

explore-codebase

Fast read-only search. Tools: Read, Grep, Glob, Bash. Model: Haiku — cost optimisation.

domain-specialist

Implementation in a specific domain. One instance per domain area. e.g. mcp-specialist (server), web-specialist (landing + app), claude-code-specialist (kit content). Prevents two agents touching the same file.

quality-auditor

Read-only review of license, URL, hash, and consent surfaces. e.g. security-reviewer in claudekit. No Write tool.

claude-setup-curator

Optional. Only when the project consumes __claude_setup/ directly. Not relevant for installed kits.

+ your own

Tester, planner alternates, data-pipeline specialists. Add by dropping an agent markdown file in .claude/agents/.

Agent frontmatter fields

What goes in the agent markdown file.

name

Agent identifier. Used when dispatching from chat.

description

When to delegate to this agent. Read by Claude when choosing which agent to invoke.

model

haiku / sonnet / opus. Default: sonnet. Haiku for search-heavy tasks; Opus for multi-step planning.

tools

Explicit tool allowlist. Defaults to * if omitted. Restrict here — not in CLAUDE.md.

memory: project

Optional. Gives the agent an append-only JSONL memory file scoped to the project.

isolation: worktree

Optional. Agent works in a temporary git worktree. Edits do not collide with the main checkout.

Model-cost principle

Pre-assigned models. You don't pay Opus rates for grep.

Read-only search (grep, glob, read)

Haiku

Fast and cheap. No generation needed.

Drafting prose or code

Sonnet

Good quality/cost ratio for most implementation.

Architecture, multi-step planning

Opus

Reserved for high-stakes decisions that benefit from deeper reasoning.

Domain-vs-stack scoping

Agents are scoped by domain, not by tech stack.

A web-specialist owns the landing page and dashboard — all web surfaces, regardless of stack. A mcp-specialist owns the server and tool schemas. Scoping by domain, not by framework version or file extension, prevents the "two agents touching the same file" race condition.

The standard forwarding convention is "pass to the planner: <finding>". Any dialog can hand the planner a discovery from its domain. Only the planner writes to __plan/. Cross-agent communication routes through a written artifact — never through agent-to-agent calls.

Agent memory

Append-only JSONL. AST-enforced.

Agents with memory: project write to an append-only JSONL file, one per agent, scoped to the project. An AST-level test enforces that no delete, update, clear, or truncate operations appear in the memory code path. Memory entries can accumulate; they cannot be removed programmatically.

On read, entries are recency-weighted. A baseline_anchor field lets the agent mark a recurring decision so it is loaded on every session without counting against the recency budget.

Pattern-reinforcement defense: if memory keeps returning the same answer, the kit alerts the human. A memory that always says the same thing is either a baseline anchor working correctly — or a poisoning signal. The alert surfaces the distinction.

Worktree isolation

Parallel agents. No file conflicts.

isolation: worktree creates a temporary git worktree per agent invocation. Two agents editing the same file in parallel work in separate directory trees. Their edits do not collide.

The worktree is cleaned up automatically if the agent makes no changes. If it does make changes, the worktree is kept for explicit merge or discard — not silently merged.

Why scoped agents

Specialization is the whole point.

A reviewer that also writes code drifts into justifying its own changes. A planner that also runs tests loses focus on the plan. Tight scope means tight accountability — each agent can be evaluated on exactly one outcome.

The tool allowlist enforces the scope at the runtime level. security-reviewer has no Write tool. It cannot make changes even if asked. That's not a limitation — it's the spec.