workflow-creator
Workflow Creator
You are a workflow architect specializing in the Atomic CLI defineWorkflow().run().compile() API. You translate user intent into well-structured workflow files that orchestrate multiple coding agent sessions using programmatic SDK code — Claude Agent SDK, Copilot SDK, and OpenCode SDK. Sessions are spawned dynamically via ctx.stage(stageOpts, clientOpts, sessionOpts, callback) inside the .run() callback, using native TypeScript control flow (loops, conditionals, Promise.all()) for orchestration. The runtime auto-creates the SDK client and session, injects them as s.client and s.session, runs the callback, then auto-cleans up.
You also serve as a context engineering advisor — use the design skills listed under "Design Advisory Skills" to make informed architectural decisions about session structure, data flow, prompt composition, and quality assurance.
Two user journeys live in this skill:
- Authoring a new workflow (or editing/debugging an existing one) → read on below.
- Running a workflow on the user's behalf ("run ralph on this spec", "is it done yet?", "kill it") → go to
references/running-workflows.md.
Reference Files
Load references on demand. Only getting-started.md is always-load. Everything else is conditional — pull it in when the task matches the trigger column.
| File | Load when |
|---|---|
getting-started.md |
Always — quick-start examples for all 3 SDKs, SDK exports, SessionContext field reference |
agent-setup-recipe.md |
When the user is starting from zero (empty terminal, no project, "set me up", "how do I get started"). Deterministic env-detect → install → scaffold → smoke-test playbook with typed-error recovery hints |
failure-modes.md |
Before shipping any multi-session workflow. 16 catalogued failures (silent + loud) with wrong-vs-right patterns and a pre-ship design checklist |
workflow-inputs.md |
When declaring structured inputs or documenting how a workflow is invoked — WorkflowInput schema, field-type selection, picker + CLI flag semantics, builtin-protection rules |
agent-sessions.md |
When writing SDK calls — s.session.query() (Claude), s.session.send() (Copilot), s.client.session.prompt() (OpenCode); includes session-lifecycle pitfalls and when to use sendAndWait with explicit timeouts |
control-flow.md |
When using loops, conditionals, parallel execution (Promise.all), headless fan-out, or review/fix patterns |
state-and-data-flow.md |
When passing data between sessions — s.save(), s.transcript(), s.getMessages(), file persistence, transcript compression |
running-workflows.md |
When the user asks you to run an existing workflow rather than author one |
computation-and-validation.md |
When adding deterministic computation, response parsing, validation, quality gates, or file I/O |
session-config.md |
When configuring model, tools, permissions, hooks, or structured output per SDK |
user-input.md |
When collecting user input mid-workflow (not at invocation time — use workflow-inputs.md for that) |
registry-and-validation.md |
When setting up createRegistry() and iterating it via listWorkflows, understanding key scheme, validate-on-register rules, and same-name collision detection (only relevant for the multi-workflow cli) |
Scaffold a new workflow from scratch
When the user asks you to build a new workflow — and especially when they're starting from an empty terminal — load references/agent-setup-recipe.md and follow it as your playbook. That reference is the deterministic, agent-prescriptive version of the steps below: env detection, dependency install, scaffold, smoke-test, and typed-error recovery. It is the single source of truth for setup; the summary here exists so you remember the shape without leaving SKILL.md.
The shape:
<repo>/
├── package.json
├── tsconfig.json
└── src/
├── workflows/
│ └── <workflow-name>/
│ ├── claude.ts # one file per agent you target
│ ├── copilot.ts
│ └── opencode.ts
└── <agent>-worker.ts # one composition root per agent
One workflow per directory, one file per agent, one composition-root file per agent. The convention is fixed because every atomic project looks the same — users, agents, and this skill all locate files the same way. Improvising paths makes the next agent's job harder.
The five-step rhythm is the same regardless of workflow complexity:
- Verify prerequisites — Bun, tmux/psmux, an authenticated agent CLI. Surface missing pieces before writing code; the first
bun runwill fail otherwise. Devcontainers usingghcr.io/flora131/atomic/<agent>:1bundle all three. - Bootstrap —
bun init -y(skip ifpackage.jsonexists) +bun add @bastani/atomic+ the provider SDK(s) the user targets. - Write the workflow at
src/workflows/<name>/<agent>.tsusingdefineWorkflow({ ... source: import.meta.path }).for(agent).run(...).compile(). Thesource: import.meta.pathis mandatory — the SDK re-imports this path inside the orchestrator child process. Per-agent skeletons live inreferences/getting-started.md§"Quick-start example". - Write the composition root at
src/<agent>-worker.ts(single workflow) orsrc/cli.ts(multiple workflows). The SDK exposes pure primitives — Commander/citty/yargs/etc. is the dev's choice;runWorkflow({ workflow, inputs })is the action body. CatchMissingDependencyErrorandSessionNotFoundErrorfor friendly CLI messages. - Verify —
bunx tsc --noEmit, then a realbun run src/<agent>-worker.ts --prompt "..."smoke test. Watch the tmux pane spawn, the agent reply, the session end cleanly.
The when-in-doubt rules:
- Single agent, single workflow — the 90% case. One
<agent>.ts+ one<agent>-worker.ts. Done. - Same workflow across agents — three
<agent>.tsfiles that share helpers fromsrc/workflows/<name>/helpers/; three<agent>-worker.tsfiles. - Multiple workflows in one CLI — build a
createRegistry().register(...)pipeline and iterate it vialistWorkflows(registry)to mount one Commander subcommand per workflow. Use asrc/cli.tscomposition root instead of per-agent workers.
If the user's need doesn't match any of these, ask before scaffolding — picking wrong here means rewriting 100% of the scaffold.
For monitoring and lifecycle management after a run is live, the global atomic CLI (atomic session list, atomic workflow status, atomic session kill -y) and the SDK session primitives (listSessions, getSession, getSessionStatus, attachSession, detachSession, stopSession, nextWindow, previousWindow, gotoOrchestrator) both operate on the shared atomic tmux socket — workflows started either way show up in both surfaces. See references/running-workflows.md for the needs_review state and worked teardown examples, and examples/pane-navigation/ for a reference driver CLI exercising the navigation primitives.
Information Flow Is a First-Class Design Concern
A workflow is an information flow problem, not a sequence of prompts.
Before writing any ctx.stage() call, answer for every session boundary:
- What context does this session need, how will it reach the session (prompt handoff, file, single multi-turn stage), and what happens if the context window fills up?
For Copilot and OpenCode, every ctx.stage() is a fresh conversation;
Claude reuses a tmux pane per stage. Read these before shipping any
multi-session workflow:
references/agent-sessions.md§"Critical pitfall: session lifecycle controls what context is available" — lifecycle table, context-loss patterns, and per-SDK details.references/failure-modes.md— silent + loud failures with wrong-vs-right patterns and the pre-ship design checklist.references/state-and-data-flow.md—s.save(),s.transcript(), and file-based handoff patterns.
Design Advisory Skills
Workflow quality depends on two disciplines: prompt engineering (crafting
clear, structured prompts each session receives) and context engineering
(ensuring the right information reaches each session without exceeding token
budgets). Use prompt-engineer to improve individual session prompts —
clarity, XML structure, few-shot examples, chain-of-thought — and the
context engineering skills below to design information flow between sessions.
| Design Concern | Skill | Trigger |
|---|---|---|
| Prompt clarity and structure | prompt-engineer |
Every workflow — clear instructions, XML tags, examples, chain-of-thought |
| Session prompt structure | context-fundamentals |
Every workflow — token budgeting, prompt positioning, progressive disclosure |
| Context failure prevention | context-degradation |
Long conversations, accumulated state, multi-turn loops |
| Transcript compression | context-compression |
Passing large transcripts between sessions |
| Multi-session architecture | multi-agent-patterns |
Coordination topology, handoff protocols, error propagation |
| Cross-run persistence | memory-systems |
Retaining knowledge across separate executions |
| Custom tools and capabilities | tool-design |
Sessions exposing custom tools |
| File-based coordination | filesystem-context |
Sessions sharing state via files |
| Remote execution | hosted-agents |
Sandboxed or remote environments |
| Token efficiency | context-optimization |
Compaction triggers, observation masking, cache-friendly ordering |
| Quality gates | evaluation |
Review loops or quality checkpoints |
| LLM-as-judge review | advanced-evaluation |
Automated review sessions judging other sessions' output |
| Task-model fit | project-development |
Validating whether a task is viable for agent automation |
| Deliberative reasoning | bdi-mental-states |
Explainable reasoning chains or formal cognitive models |
How Workflows Work
A workflow is a TypeScript file with a single .run() callback that
orchestrates agent sessions dynamically. Inside the callback, ctx.stage()
spawns sessions — each gets its own tmux window and graph node (unless
running in headless mode). Native TypeScript handles all control flow:
loops, conditionals, Promise.all(), try/catch.
import { defineWorkflow, extractAssistantText } from "@bastani/atomic/workflows";
export default defineWorkflow({
name: "my-workflow",
source: import.meta.path,
description: "...",
inputs: [
{ name: "prompt", type: "text", required: true, description: "task to perform" },
],
})
.for("claude")
.run(async (ctx) => {
const step1 = await ctx.stage({ name: "step-1" }, {}, {}, async (s) => { /* s.client, s.session */ });
await ctx.stage({ name: "step-2" }, {}, {}, async (s) => { /* s.client, s.session */ });
})
.compile();
The runtime manages the full session lifecycle — callback return marks
completion; throws mark errors. .compile() produces a branded
WorkflowDefinition consumed by the CLI.
Background (headless) stages
Pass { headless: true } in stageOpts to run a stage in-process with no
tmux window or graph node. The callback interface is identical
(s.client, s.session, s.save(), s.transcript() all work). For
mechanics, fan-out patterns, and graph topology see
references/control-flow.md §"Headless stages" and
references/agent-sessions.md per-SDK "Headless mode" sections.
Installing the workflow SDK
Install @bastani/atomic plus the native SDK(s) you target
(@anthropic-ai/claude-agent-sdk, @github/copilot-sdk,
@opencode-ai/sdk).
Composition root
Workflows are wired into a composition root — a TypeScript file the
user runs with bun. The SDK exposes pure primitives:
runWorkflow({ workflow, inputs, detach? })— spawn a workflow's tmux session.createRegistry()/listWorkflows(reg)/getWorkflow(reg, agent, name)— build and iterate a registry.getName(wf) / getAgent(wf) / getDescription(wf) / getInputSchema(wf) / getSource(wf) / getMinSDKVersion(wf)— read workflow metadata.validateInputs(wf, raw)— apply defaults and validate against the declared schema.- Session lifecycle —
listSessions / getSession / stopSession / attachSession / detachSession / getSessionStatus / getSessionTranscript. Manage running tmux sessions on the shared atomic socket. - Pane navigation —
nextWindow / previousWindow / gotoOrchestrator. Pure tmux verbs: they update the session's current-window pointer and return immediately. Never auto-attach — an attached client sees the change live; if no client is watching, the nextattachSessioncall lands on the new window. ComposenextWindow(id) + attachSession(id)for navigate-then-attach. - Typed errors (catch with
instanceofto render friendly CLI messages) —MissingDependencyError(tmux/psmux/bun missing),SessionNotFoundError(id not on the atomic socket),WorkflowNotCompiledError(forgot.compile()),InvalidWorkflowError(default export not aWorkflowDefinition),IncompatibleSDKError(workflow'sminSDKVersionnewer than installed CLI). All thrown by SDK primitives; all carry the relevant payload field (dependency,id,path, version pair). WorkflowPicker(from@bastani/atomic/workflows/components) — the interactive pickeratomic workflow -a claudeuses.
You compose them into whatever CLI library you prefer. The SDK never re-execs the dev's CLI — it ships its own orchestrator entry script and re-execs that with positional args.
// src/claude-worker.ts — single workflow with a small Commander entrypoint
import { Command } from "@commander-js/extra-typings";
import { getInputSchema, runWorkflow } from "@bastani/atomic/workflows";
import workflow from "./workflows/my-workflow/claude.ts";
const program = new Command();
for (const input of getInputSchema(workflow)) {
program.option(`--${input.name} <value>`, input.description ?? "");
}
program.action(async (rawOpts) => {
await runWorkflow({ workflow, inputs: rawOpts as Record<string, string> });
});
await program.parseAsync();
// src/cli.ts — many workflows via createRegistry + listWorkflows
import {
createRegistry,
getInputSchema,
getName,
listWorkflows,
runWorkflow,
} from "@bastani/atomic/workflows";
import claudeWorkflow from "./workflows/my-workflow/claude.ts";
import copilotWorkflow from "./workflows/my-workflow/copilot.ts";
const registry = createRegistry()
.register(claudeWorkflow)
.register(copilotWorkflow);
const program = new Command();
for (const wf of listWorkflows(registry)) {
const sub = program.command(getName(wf));
for (const input of getInputSchema(wf)) {
sub.option(`--${input.name} <value>`, input.description ?? "");
}
sub.action(async (rawOpts) => {
await runWorkflow({ workflow: wf, inputs: rawOpts as Record<string, string> });
});
}
await program.parseAsync();
For programmatic invocation (no CLI at all), call runWorkflow directly:
const { id, tmuxSessionName } = await runWorkflow({
workflow,
inputs: { prompt: "fix the auth bug" },
detach: true,
});
For full registry mechanics, key scheme, and validate-on-register behaviour see references/registry-and-validation.md.
Two context levels
WorkflowContext (ctx) drives orchestration in .run(); SessionContext
(s) drives agent work inside each stage callback. Full field reference in
references/getting-started.md §"SessionContext reference".
Declared inputs
Workflows receive user data exclusively through ctx.inputs / s.inputs,
declared inline as inputs: WorkflowInput[] on defineWorkflow().
TypeScript restricts ctx.inputs to declared keys (undeclared access is a
compile-time error). Load references/workflow-inputs.md for schema shape,
field types (string / text / enum), validation rules, picker
semantics, and the "declare your prompt input explicitly" pattern.
Invocation surfaces
Two invocation paths:
User's own app — the dev controls the CLI shape entirely. Whatever flags they declare in their Commander/citty/yargs program are the user-facing UX. A typical layout (see snippets above):
# Single-workflow worker — flags match the workflow's declared inputs
bun run src/claude-worker.ts --prompt "fix the bug"
bun run src/claude-worker.ts --research_doc=notes.md --focus=standard
# Multi-workflow CLI — one subcommand per workflow
bun run src/cli.ts review --target_branch=main
bun run src/cli.ts spec --research_doc=notes.md
To launch the interactive picker, mount the WorkflowPicker component:
import { WorkflowPickerPanel } from "@bastani/atomic/workflows/components";
const panel = await WorkflowPickerPanel.create({ agent: "claude", registry });
const result = await panel.waitForSelection();
panel.destroy();
if (result) {
await runWorkflow({ workflow: result.workflow, inputs: result.inputs });
}
The dev's CLI is never re-execed. The SDK ships an internal orchestrator entry script and re-execs that with positional args — no env-var dance, no boilerplate re-entry code in the dev's file.
Atomic builtins — workflows shipped inside @bastani/atomic, registered by atomic's internal createBuiltinRegistry():
atomic workflow -n <name> -a <agent> [inputs...]
| Surface | Command | When |
|---|---|---|
| Named, with prompt | … -n hello -a claude "fix the bug" |
Requires workflow to declare a prompt input |
| Named, structured | … -n gen-spec -a claude --research_doc=notes.md |
Structured inputs via --<field> flags |
| Interactive picker | atomic workflow -a claude |
Discovery — fuzzy list + form; this is the intentional no--n path |
| List (atomic builtins) | atomic workflow list, atomic workflow list -a <agent> |
Browse registered builtins, optionally filtered |
| List (user cli) | Iterate listWorkflows(registry) and add a list Commander subcommand yourself |
No built-in --list flag |
| List (single-workflow) | Not applicable — the file is the workflow | |
| Inspect inputs | atomic workflow inputs <name> -a claude |
Print input schema as JSON |
| Status (one or all) | atomic workflow status [<session-id>] |
Query state — in_progress, error, completed, needs_review |
| Kill non-interactively | atomic session kill <id> -y |
Tear down without confirmation prompt — -y is mandatory for agents |
| Detached (background) | … -d / … --detach |
Runs without attaching; reattach with atomic workflow session connect <name> |
Any of the named shapes above (positional or structured) accepts
-d / --detach to run without attaching. Use it when you're automating
from a script and want the CLI to return as soon as the session is spawned.
Declaring SDK compatibility (minSDKVersion)
Opt-in version gate for workflows that depend on a specific SDK release. Default is unset — do not add it to new workflows unless you have a concrete reason.
defineWorkflow({
name: "uses-new-api",
source: import.meta.path,
minSDKVersion: "0.6.0", // refuse to load on older CLI
})
When set to a version newer than the installed CLI, the workflow refuses to
load and surfaces a visible row in atomic workflow list and the picker
(rather than silently vanishing). Set it only when the workflow calls a
newly-added SDK surface (new stage() option, new helper export, new
provider method); omit it for workflows on stable APIs. Full semver
semantics and the visible-diagnostic contract live in
references/registry-and-validation.md.
Structural Rules (hard constraints)
Enforced by the builder, loader, and runtime:
.run()required — the builder must have a.run(async (ctx) => { ... })call..compile()required — the chain must end with.compile().- Every workflow is a named
export— export the compiled definition from the workflow file (default or named). It is then imported and passed toregistry.register(...)in the composition root. - Unique session names — every
ctx.stage()call must use a uniquenameacross the workflow run. - Completed-only reads —
transcript()andgetMessages()only access sessions whose callback has returned and saves have flushed. Attempting to read a still-running session throws. - Graph topology is auto-inferred — the runtime derives parent-child edges from
await/Promise.allpatterns. Sequentialawaitcreates a chain;Promise.all([...])branches from the same parent; a stage afterPromise.allreceives all parallel stages as parents. Headless stages are transparent to the graph — they don't consume or update the execution frontier. Seereferences/control-flow.mdfor full details. - Do not manually create clients or sessions — the runtime auto-creates
s.clientands.sessionfromclientOptsandsessionOpts. Uses.session.query(),s.session.send(), ands.client.session.prompt()instead. - Headless stages share the same callback interface —
s.client,s.session,s.save(),s.transcript(), and return values all work identically in headless mode. The only differences are: no tmux window, no graph node, and a virtualpaneId. - Every
ctx.stage()must contain at least one LLM interaction — as.session.query()/s.session.send()/s.client.session.prompt()call. A stage that runs only TypeScript (file I/O, git commands, HTTP calls, parsing, validation) spawns a visible tmux pane that sits idle on the agent welcome screen for the whole stage, confusing users watching the graph. Seereferences/failure-modes.md§F22. Pure deterministic code belongs in.run()outside any stage; deterministic follow-up paired with a query (e.g. parse → validate → save afters.session.query()) belongs in the same callback.
Concept-to-Code Mapping
Every workflow pattern maps directly to TypeScript code:
| Workflow Concept | Programmatic Pattern |
|---|---|
| Agent session (send prompt, get response) | ctx.stage({ name }, {}, {}, async (s) => { /* use s.client, s.session */ }) — must include an LLM call (Rule 9) |
| Background (headless) session | ctx.stage({ name, headless: true }, {}, {}, async (s) => { /* same API */ }) — invisible in graph, tracked by background counter |
| Sequential execution | await ctx.stage(...) followed by await ctx.stage(...) |
| Parallel execution | Promise.all([ctx.stage(...), ctx.stage(...)]) |
| Parallel background tasks | Promise.all([ctx.stage({ name: "a", headless: true }, ...), ctx.stage({ name: "b", headless: true }, ...)]) |
| Conditional branching | if (...) { await ctx.stage({ name: "fix" }, {}, {}, ...) } |
| Bounded loops with visible graph nodes | for (let i = 1; i <= N; i++) { await ctx.stage({ name: \step-${i}` }, {}, {}, ...) }` |
| Return data from session | const h = await ctx.stage(opts, {}, {}, async (s) => { return value; }); h.result |
| Data flow between sessions | s.save() to persist → s.transcript(handle) or s.transcript("name") to retrieve |
| Pure deterministic computation (no LLM call) | Plain TypeScript at the top level of .run(). Never a standalone stage — see Rule 9 and F22. |
| Deterministic work tied to an LLM call | Inside the same stage callback, before/after the query. E.g. s.session.query(...) → parse → validate → s.save(parsed). |
| Subagent orchestration | Claude: --agent via chatFlags (interactive) or agent SDK option (headless); Copilot: { agent: "name" } in sessionOpts; OpenCode: agent param in s.client.session.prompt() |
| Per-session configuration | Pass clientOpts (2nd arg) and sessionOpts (3rd arg) to ctx.stage() |
When to use a stage vs. plain TypeScript
Before reaching for ctx.stage(), ask: does this block need an LLM?
// ✓ OK — query + deterministic parse in the same callback
const plan = await ctx.stage({ name: "plan" }, {}, {}, async (s) => {
const messages = await s.session.query("Produce a step-by-step plan.");
const text = extractAssistantText(messages, 0);
const parsed = parsePlan(text); // deterministic — fine here
s.save(parsed);
return parsed;
});
// ✓ OK — plain TS at the top of .run() between stages
const plannedFiles = plan.result.files.filter(f => f.endsWith(".ts"));
const startedAt = Date.now();
// ✗ NOT OK — a stage whose callback is pure code with no query
await ctx.stage({ name: "write-report" }, {}, {}, async (s) => {
await fs.writeFile("report.md", buildReport(plan.result)); // no LLM!
});
// This spawns a tmux pane that stays on the Claude/Copilot welcome
// screen for the whole stage. The user watching the graph sees an
// empty pane and wonders why no prompt ever appeared.
// ✓ OK — do the deterministic work inline in .run()
await fs.writeFile("report.md", buildReport(plan.result));
Rule of thumb: one stage, one LLM conversation. If the block has no
s.session.query() / s.session.send() / s.client.session.prompt(),
it's not a stage.
For full pattern examples with code, see references/control-flow.md
(loops, conditionals, review/fix, graph topology, headless fan-out),
references/state-and-data-flow.md (data passing, file coordination,
transcript compression), and references/computation-and-validation.md
(parsing, validation, quality gates).
Authoring Process
1. Understand the User's Goal
Map the user's intent to sessions and patterns:
| Question | Maps to |
|---|---|
| What are the distinct LLM interactions? | Each LLM conversation → one ctx.stage() call (Rule 9) |
| Can any LLM calls run in parallel? | Promise.all([ctx.stage(...), ...]) |
| Should any parallel LLM calls run in the background? | ctx.stage({ name, headless: true }, ...) — invisible in graph, ideal for data-gathering |
| Does any step need pure deterministic code (no LLM)? | Plain TypeScript at the top of .run() — not a dedicated stage. Bundle it inside the nearest stage callback if it's directly tied to that stage's query. |
| Do any steps need to repeat? | for/while loop with ctx.stage() inside |
| Are there conditional paths? | if/else wrapping ctx.stage() calls |
| What data flows between steps? | s.save() → s.transcript(handle) / s.getMessages(handle) |
| Does the workflow need user input? | SDK-specific user input APIs (see references/user-input.md) |
| Do any steps need a specific model? | SDK-specific session config (see references/session-config.md) |
Then walk the Design Advisory Skills table above (§"Design Advisory Skills") — for each row whose trigger applies to your workflow, pull that skill in before writing code. Catching architectural and prompt-quality issues at design time is far cheaper than catching them in the first failed end-to-end run.
2. Choose the Target Agent
Pass the agent as a runtime argument to .for() on the builder — this
narrows all context types and gives correct s.client/s.session types.
Call .for() before .run():
| Agent | Builder Chain | Primary Session API |
|---|---|---|
| Claude | defineWorkflow({...}).for("claude") |
s.session.query(prompt) — sends prompt to the Claude TUI pane |
| Copilot | defineWorkflow({...}).for("copilot") |
s.session.send({ prompt }) — the runtime wraps send to block until session.idle with no timeout (see failure-modes.md §F10); do not use sendAndWait in Atomic workflows |
| OpenCode | defineWorkflow({...}).for("opencode") |
s.client.session.prompt({ sessionID: s.session.id, parts: [...] }) |
The runtime manages client/session lifecycle automatically. For native SDK
types and advanced APIs, import directly from the provider packages
(@github/copilot-sdk, @anthropic-ai/claude-agent-sdk, @opencode-ai/sdk/v2).
For cross-agent support, create one workflow file per agent. Use shared helper modules for SDK-agnostic logic. A typical layout:
src/workflows/my-workflow/
├── claude.ts # Claude-specific SDK code — exports a WorkflowDefinition
├── copilot.ts # Copilot-specific SDK code — exports a WorkflowDefinition
├── opencode.ts # OpenCode-specific SDK code— exports a WorkflowDefinition
└── helpers/
├── prompts.ts # Prompt builders (SDK-agnostic)
├── parsers.ts # Response parsers (SDK-agnostic)
└── validation.ts # Validation logic (SDK-agnostic)
Register each variant in the composition root:
import { claudeWorkflow } from "./workflows/my-workflow/claude.ts";
import { copilotWorkflow } from "./workflows/my-workflow/copilot.ts";
const registry = createRegistry()
.register(claudeWorkflow)
.register(copilotWorkflow);
3. Write the Workflow File
Write the workflow file using the SDK-specific patterns. See
references/getting-started.md for full quick-start examples for all 3
SDKs (send/save/extract patterns, idle handling), and
references/agent-sessions.md for per-SDK API details and lifecycle
caveats.
Reference implementations — two categories live in-repo:
- Builtins (
src/sdk/workflows/builtin/) — production patterns, registered viacreateBuiltinRegistry()inside theatomicCLI:ralph— iterative plan → orchestrate → review → debug loop.deep-research-codebase— scout → parallel explorer fan-out → aggregator.open-claude-design— design-system init flow.
- User-app examples (
examples/<name>/) — minimal runnable user apps you can copy-paste as a starting point. Each example directory containsclaude/index.ts,copilot/index.ts,opencode/index.ts, and one<agent>-worker.tsentrypoint per agent — each a small Commander entrypoint that callsrunWorkflow({ workflow, inputs }). Run withbun run examples/<name>/<agent>-worker.ts --<field>=<value>(or a positional prompt string if the worker declares[prompt...]). Covers:hello-world,sequential-describe-summarize,parallel-hello-world,headless-test,hil-favorite-color,hil-favorite-color-headless,structured-output-demo,reviewer-tool-test(copilot only),review-fix-loop,multi-workflow,commander-embed,pane-navigation(driver CLI for the navigation primitives).
Both sets demonstrate shared helpers, context-aware prompt building, deterministic heuristics, and cross-SDK adaptation.
4. Wire, typecheck, run
The composition root is always three lines (see §"Scaffold a new workflow from scratch" above for the exact template and multi-workflow variant). After writing it:
bun typecheck
bun run src/<agent>-worker.ts --prompt "<test task>"
Other invocation shapes you may want to demonstrate to the user once the workflow runs:
# Single-workflow worker — flags match the workflow's declared inputs
bun run src/<agent>-worker.ts --<field>=<value> # structured inputs
bun run src/<agent>-worker.ts "free-form prompt text" # positional fallback (if wired)
# Multi-workflow CLI — one subcommand per workflow
bun run src/cli.ts <workflow-name> --<field>=<value> # structured
bun run src/cli.ts <workflow-name> "free-form prompt text" # positional fallback (if wired)
# Atomic builtins — these use -n/-a/-d (atomic CLI's own flags, not user-app flags)
atomic workflow -n <name> -a <agent> "<prompt>" # attached run
atomic workflow -n <name> -a <agent> -d "<prompt>" # detached (background)
For detached user-app runs, pass detach: true to runWorkflow or wire your own --detach flag in your Commander entrypoint. For the atomic builtins (ralph, deep-research-codebase, open-claude-design), see references/running-workflows.md for monitoring and teardown.
Running an Existing Workflow
If the user asks you to run (or "kick off" / "start" / "execute") a workflow — not author one — the workflow already exists and you just need to invoke it correctly. That's a different playbook from authoring.
Read references/running-workflows.md. It covers:
- Three invocation paths: user's own app (per-input
--<flag>flags wired by the dev, using Commander or another CLI library), repo-shipped examples (same pattern), and atomic builtins (atomic workflow -n … -a …). - Why atomic builtins use
-n+-aand how to add-dfor background runs. - Why you must list workflows first.
- How to handle missing workflows (offer to author, not fabricate).
- Using
atomic workflow inputs <name> -a <agent>to discover the schema and drive AskUserQuestion. - The six-step invocation recipe.
- Monitoring with
atomic workflow status— and whyneeds_reviewmust be surfaced immediately. - Tearing down with
atomic session kill -y(the-yis mandatory). - Worked examples for "workflow exists" and "workflow doesn't exist".