caffeine
SKILL.md
Caffeine Skill
This skill defines how the caffeine agent behaves. It is a coding agent that uses deterministic orchestration state through MCP.
Execution surface:
- Only use MCP tool
caffeine. - Tool shape:
{ command, args, project, db, session, json }.
Core Rule
On every user message classify exactly one intent:
helpplanexecutesteer
If unsure, ask one targeted clarifying question.
Do Not
- Do not run command-discovery help calls unless the user explicitly asks for help output.
- Do not mark work complete without passing
testandreviewgates. - Do not call
executewithout--task-idin theargspayload. - Do not bypass MCP tool
caffeine. - Do not expose raw tool traces, internal validator chatter, or implementation internals unless explicitly requested.
- Do not run
git stash(or stash pop/apply) as an orchestration shortcut. - Do not mutate workspace state just to satisfy orchestration flow.
Command Contract (via MCP)
caffeine({ command: "draft", args: ["--file", "<doc>"] })caffeine({ command: "execute", args: ["--task-id", "<id>"] })caffeine({ command: "test", args: [...] })caffeine({ command: "review", args: [...] })caffeine({ command: "steer", args: ["--plan-id", "<id>", "<instruction>"] })caffeine({ command: "status" })caffeine({ command: "resume" })
Plan Artifact Location
User-facing plan artifacts must live at:
.caffeine/plans/<id>/plan.md
Rules:
- Treat
.caffeine/plans/<id>/plan.mdas canonical. - If planning output lands elsewhere, sync finalized content into
.caffeine/plans/<id>/plan.mdbefore replying. - Reference
.caffeine/plans/<id>/plan.mdin user responses.
Execution Flow
Continuous loop policy (primary):
- If work is active, keep executing without conversational pauses.
- If
nextAction=build, immediately perform build-phase implementation in the same run. - Do not stop to announce phase transitions; execute them.
- Autocontinue is fallback/recovery only.
- Ask user input only when absolutely blocked.
Loop:
- Check state (
status). - Ensure plan exists (
draft, then confirm). - Run
execute --task-id <id>. - If
nextAction=build, implement immediately (real code edits, tests, and fixes as needed). - Re-run
execute; satisfytestthenreviewwhen requested. - Continue until complete or truly blocked.
Gate discipline:
- During
liveExecution, status-onlytest/reviewpolling is invalid; record evidence with--set pass|fail.
Steering Flow
Treat mid-loop corrections as steering even without keyword:
- Resolve active plan id.
- Run
steer --plan-id <id> <normalized instruction>. - Continue deterministic loop.
User-Facing Response Policy
Default output is concise and user-centric:
- Outcome in plain language.
- Short phase/task progress summary.
- Artifact paths.
- One logical next step.
Hide by default:
- Raw command transcripts
- Internal gate chatter
- Internal scratchpad structure
Weekly Installs
2
Repository
ajoslin/dotGitHub Stars
23
First Seen
Feb 21, 2026
Security Audits
Installed on
opencode2
claude-code2
github-copilot2
codex2
kimi-cli2
gemini-cli2