team
Team Skill
$team is the tmux-based parallel execution mode for OMX. It starts real worker Codex and/or Claude CLI sessions in split panes and coordinates them through .omx/state/team/... files plus CLI team interop (omx team api ...) and state files.
This skill is operationally sensitive. Treat it as an operator workflow, not a generic prompt pattern.
What This Skill Must Do
GPT-5.4 Guidance Alignment
- Default to concise, evidence-dense progress and completion reporting unless the user or risk level requires more detail.
- Treat newer user task updates as local overrides for the active workflow branch while preserving earlier non-conflicting constraints.
- If correctness depends on additional inspection, retrieval, execution, or verification, keep using the relevant tools until the team workflow is grounded.
- Continue through clear, low-risk, reversible next steps automatically; ask only when the next step is materially branching, destructive, or preference-dependent.
When user triggers $team, the agent must:
- Invoke OMX runtime directly with
omx team ... - Avoid replacing the flow with in-process
spawn_agentfanout - Verify startup and surface concrete state/pane evidence
- Keep team state alive until workers are terminal (unless explicit abort)
- Handle cleanup and stale-pane recovery when needed
If omx team is unavailable, stop with a hard error.
Invocation Contract
omx team [ralph] [N:agent-type] "<task description>"
Examples:
omx team 3:executor "analyze feature X and report flaws"
omx team "debug flaky integration tests"
omx team ralph "ship end-to-end fix with verification"
Why team ralph exists as a linked launch path
omx team ralph ... is not just shorthand for "run team now, decide on Ralph
later." It creates a linked team+Ralph lifecycle from launch time while still
using the normal omx team runtime startup path.
- Linked lifecycle/state: launch marks team state with
linked_ralph, creates/updates Ralph state withlinked_team, and team terminal phases can propagate into Ralph state. - Cleanup/shutdown: linked cancellation and shutdown happen in order: team cleanup first, then Ralph terminalization/cleanup metadata.
- Operator choice:
- use plain
omx team ...when you only want coordinated workers - use
omx team ralph ...when persistent Ralph verification/cleanup is part of the plan from the start - use plain
teamand start Ralph later only when you want a deliberately separate manual follow-up after reviewing output or changing scope
- use plain
Claude teammates (v0.6.0+)
Important: N:agent-type (for example 2:executor) selects the worker role prompt, not the worker CLI (codex vs claude).
To launch Claude teammates, use the team worker CLI env vars:
# Force all teammates to Claude CLI
OMX_TEAM_WORKER_CLI=claude omx team 2:executor "update docs and report"
# Mixed team (worker 1 = Codex, worker 2 = Claude)
OMX_TEAM_WORKER_CLI_MAP=codex,claude omx team 2:executor "split doc/code tasks"
# Auto mode: Claude is selected when worker launch args/model contains 'claude'
OMX_TEAM_WORKER_CLI=auto OMX_TEAM_WORKER_LAUNCH_ARGS="--model claude-..." omx team 2:executor "run mixed validation"
Preconditions
Before running $team, confirm:
tmuxinstalled (tmux -V)- Current leader session is inside tmux (
$TMUXis set) omxcommand resolves to the intended install/build- If running repo-local
node bin/omx.js ..., runnpm run buildaftersrcchanges - Check HUD pane count in the leader window and avoid duplicate
hud --watchpanes before split
Suggested preflight:
tmux list-panes -F '#{pane_id}\t#{pane_start_command}' | rg 'hud --watch' || true
If duplicates exist, remove extras before omx team to prevent HUD ending up in worker stack.
Pre-context Intake Gate
Before launching omx team, require a grounded context snapshot:
- Derive a task slug from the request.
- Reuse the latest relevant snapshot in
.omx/context/{slug}-*.mdwhen available. - If none exists, create
.omx/context/{slug}-{timestamp}.md(UTCYYYYMMDDTHHMMSSZ) with:- task statement
- desired outcome
- known facts/evidence
- constraints
- unknowns/open questions
- likely codebase touchpoints
- If ambiguity remains high, run
explorefirst for brownfield facts, then run$deep-interview --quick <task>before team launch.
Do not start worker panes until this gate is satisfied; if forced to proceed quickly, state explicit scope/risk limitations in the launch report.
For simple read-only brownfield lookups during intake, follow active session guidance: when USE_OMX_EXPLORE_CMD is enabled, prefer omx explore with narrow, concrete prompts; otherwise use the richer normal explore path and fall back normally if omx explore is unavailable.
Follow-up Staffing Contract
When $team is used as a follow-up mode from ralplan, carry forward the approved plan's explicit available-agent-types roster and convert it into concrete staffing guidance before launch:
- keep worker-role choices inside the known roster
- state the recommended headcount and role counts
- state the suggested reasoning level for each lane when available
- explain why each lane exists (delivery, verification, specialist support)
- include an explicit launch hint (
omx team ralph N "<task>"/$team ralph N "<task>") when the plan expects Ralph to verify after team delivery - if the ideal role is unavailable, choose the closest role from the roster and say so
Current Runtime Behavior (As Implemented)
omx team currently performs:
- Parse args (
ralph,N,agent-type, task) - Sanitize team name from task text
- Initialize team state:
.omx/state/team/<team>/config.json.omx/state/team/<team>/manifest.v2.json.omx/state/team/<team>/tasks/task-<id>.json
- Compose team-scoped worker instructions file at:
.omx/state/team/<team>/worker-agents.md- Uses project
AGENTS.mdcontent (if present) + worker overlay, without mutating projectAGENTS.md
- Resolve canonical shared state root from leader cwd (
<leader-cwd>/.omx/state) - Split current tmux window into worker panes
- Launch workers with:
OMX_TEAM_WORKER=<team>/worker-<n>OMX_TEAM_STATE_ROOT=<leader-cwd>/.omx/stateOMX_TEAM_LEADER_CWD=<leader-cwd>- worker CLI selected by
OMX_TEAM_WORKER_CLI/OMX_TEAM_WORKER_CLI_MAP(codexorclaude) - optional worktree metadata envs when
--worktreeis used
- Wait for worker readiness (
capture-panepolling) - Write per-worker
inbox.mdand trigger viatmux send-keys - Return control to leader; follow-up uses
status/resume/shutdown
Important:
- Leader remains in existing pane
- Worker panes are independent full Codex/Claude CLI sessions
- Workers may run in separate git worktrees (
omx team --worktree[=<name>]) while sharing one team state root - Worker ACKs go to
mailbox/leader-fixed.json - Notify hook updates worker heartbeat and nudges leader during active team mode
- Submit routing uses this CLI resolution order per worker trigger:
- explicit worker CLI provided by runtime state (persisted on worker identity/config),
OMX_TEAM_WORKER_CLI_MAPentry for that worker index,- fallback
OMX_TEAM_WORKER_CLI/ auto detection.
- Mixed CLI-map teams are supported for both startup and trigger submit behavior.
- Trigger submit differs by CLI:
- Codex may use queue-first
Tabon busy panes (strategy-dependent). - Claude always uses direct Enter-only (
C-m) rounds (never queue-firstTab).
- Codex may use queue-first
Team worker model + thinking resolution (current contract)
Team mode resolves worker model flags from one shared launch-arg set (not per-worker model selection).
Model precedence (highest to lowest):
- Explicit worker model in
OMX_TEAM_WORKER_LAUNCH_ARGS - Inherited leader
--modelflag - Low-complexity default from
OMX_DEFAULT_SPARK_MODEL(legacy alias:OMX_SPARK_MODEL) when 1+2 are absent and teamagentTypeis low-complexity
Default-model rule:
- Do not assume a frontier or spark model from recency or model-family heuristics.
- Use
OMX_DEFAULT_FRONTIER_MODELfor frontier-default guidance. - Use
OMX_DEFAULT_SPARK_MODELfor spark/low-complexity worker-default guidance.
Thinking-level rule (critical):
- No model-name heuristic mapping.
- Team runtime must not infer
model_reasoning_effortfrom model-name substrings (e.g.,spark,high-capability,mini). - When the leader assigns teammate roles/tasks, OMX allocates per-worker reasoning effort dynamically from the resolved worker role (
low,medium,high). - Explicit launch args still win: if
OMX_TEAM_WORKER_LAUNCH_ARGSalready includes-c model_reasoning_effort=..., that explicit value overrides dynamic allocation for every worker.
Normalization requirements:
- Parse both
--model <value>and--model=<value> - Remove duplicate/conflicting model flags
- Emit exactly one final canonical flag:
--model <value> - Preserve unrelated args in worker launch config
- If explicit reasoning exists, preserve canonical
-c model_reasoning_effort="<level>"; otherwise inject the worker role's default reasoning level
Required Lifecycle (Operator Contract)
Follow this exact lifecycle when running $team:
- Start team and verify startup evidence (team line, tmux target, panes, ACK mailbox)
- Monitor task and worker progress with runtime/state tools first (
omx team status <team>,omx team resume <team>, mailbox/state files) - Wait for terminal task state before shutdown:
pending=0in_progress=0failed=0(or explicitly acknowledged failure path)
- Only then run
omx team shutdown <team> - Verify shutdown evidence and state cleanup
Do not run shutdown while workers are actively writing updates unless user explicitly requested abort/cancel.
Do not treat ad-hoc pane typing as primary control flow when runtime/state evidence is available.
Active leader monitoring rule
While a team is ON/running, the leader must not go blind. Keep checking live team state until terminal completion.
Minimum acceptable loop:
sleep 30 && omx team status <team-name>
Repeat that check while the team stays active, or use omx team await <team-name> --timeout-ms 30000 --json when event-driven waiting is a better fit.
If the leader gets a stale/team-stalled nudge, immediately run omx team status <team-name> before taking any manual intervention.
Message Dispatch Policy (CLI-first, state-first)
To avoid brittle behavior, message/task delivery must not be driven by ad-hoc tmux typing.
Required default path:
- Use
omx team ...runtime lifecycle commands for orchestration. - Use
omx team api ... --jsonfor mailbox/task mutations. - Verify delivery via mailbox/state evidence (
mailbox/*.json, task status,omx team status).
Strict rules:
- MUST NOT use direct
tmux send-keysas the primary mechanism to deliver instructions/messages. - MUST NOT spam Enter/trigger keys without first checking runtime/state evidence.
- MUST prefer durable state writes + runtime dispatch (
dispatch/requests.json, mailbox, inbox). - Direct tmux interaction is fallback-only and only after failure checks (for example
worker_notify_failed:<worker>) or explicit user request (for example “press enter”).
Operational Commands
omx team status <team-name>
omx team resume <team-name>
omx team shutdown <team-name>
Semantics:
status: reads team snapshot (task counts, dead/non-reporting workers)resume: reconnects to live team session if presentshutdown: graceful shutdown request, then cleanup (deletes.omx/state/team/<team>)
Data Plane and Control Plane
Control Plane
- tmux panes/processes (
OMX_TEAM_WORKERper worker) - leader notifications via
tmux display-message
Data Plane
.omx/state/team/<team>/...files- Team mailbox files:
.omx/state/team/<team>/mailbox/leader-fixed.json.omx/state/team/<team>/mailbox/worker-<n>.json.omx/state/team/<team>/dispatch/requests.json(durable dispatch queue; hook-preferred, fallback-aware)
Key Files
.omx/state/team/<team>/config.json.omx/state/team/<team>/manifest.v2.json.omx/state/team/<team>/tasks/task-<id>.json.omx/state/team/<team>/workers/worker-<n>/identity.json.omx/state/team/<team>/workers/worker-<n>/inbox.md.omx/state/team/<team>/workers/worker-<n>/heartbeat.json.omx/state/team/<team>/workers/worker-<n>/status.json.omx/state/team-leader-nudge.json
Team Mutation Interop (CLI-first)
Use omx team api for machine-readable mutation/reads instead of legacy team_* MCP tools.
omx team api <operation> --input '{"team_name":"my-team",...}' --json
Examples:
omx team api send-message --input '{"team_name":"my-team","from_worker":"worker-1","to_worker":"leader-fixed","body":"ACK"}' --json
omx team api claim-task --input '{"team_name":"my-team","task_id":"1","worker":"worker-1"}' --json
omx team api transition-task-status --input '{"team_name":"my-team","task_id":"1","from":"in_progress","to":"completed","claim_token":"<token>"}' --json
--json responses include stable metadata for automation:
schema_versiontimestampcommandokoperationdataorerror
Team + Worker Protocol Notes
Leader-to-worker:
- Write full assignment to worker
inbox.md - Send short trigger (<200 chars) with
tmux send-keys
Worker-to-leader:
- Send ACK to
leader-fixedmailbox viaomx team api send-message --json - Claim/transition/release task lifecycle via
omx team api <operation> --json
Worker commit protocol (critical for incremental integration):
- After completing task work and before reporting completion, workers MUST commit:
git add -A && git commit -m "task: <task-subject>" - This ensures changes are available for incremental integration into the leader branch
- If a worker forgets to commit, the runtime auto-commits as a fallback, but explicit commits are preferred
Task ID rule (critical):
- File path uses
task-<id>.json(exampletask-1.json) - MCP API
task_iduses bare id (example"1", not"task-1") - Never instruct workers to read
tasks/{id}.json
Environment Knobs
Useful runtime env vars:
OMX_TEAM_READY_TIMEOUT_MS- Worker readiness timeout (default 45000)
OMX_TEAM_SKIP_READY_WAIT=1- Skip readiness wait (debug only)
OMX_TEAM_AUTO_TRUST=0- Disable auto-advance for trust prompt (default behavior auto-advances)
OMX_TEAM_WORKER_LAUNCH_ARGS- Extra args passed to worker launch command
OMX_TEAM_WORKER_CLI- Worker CLI selector:
auto|codex|claude(default:auto) autochoosesclaudewhen worker--modelcontainsclaude, otherwisecodex- In
claudemode, workers launch with exactly one--dangerously-skip-permissionsand ignore explicit model/config/effort launch overrides (uses defaultsettings.json)
- Worker CLI selector:
OMX_TEAM_WORKER_CLI_MAP- Per-worker CLI selector (comma-separated
auto|codex|claude) - Length must be
1(broadcast) or exactly the team worker count - Example:
OMX_TEAM_WORKER_CLI_MAP=codex,codex,claude,claude - When present, overrides
OMX_TEAM_WORKER_CLI
- Per-worker CLI selector (comma-separated
OMX_TEAM_AUTO_INTERRUPT_RETRY- Trigger submit fallback (default: enabled)
0disables adaptive queue->resend escalation
OMX_TEAM_LEADER_NUDGE_MS- Leader nudge interval in ms (default 120000)
OMX_TEAM_STRICT_SUBMIT=1- Force strict send-keys submit failure behavior
Failure Modes and Diagnosis
Operator note (important for Claude panes):
- Manual Enter injection (
tmux send-keys ... C-m) can appear to "do nothing" when a worker is actively processing; Enter may be queued by the pane/task flow. - This is not necessarily a runtime bug. Confirm worker/team state before diagnosing dispatch failure.
- Avoid repeated blind Enter spam; it can create noisy duplicate submits once the pane becomes idle.
Safe Manual Intervention (last resort)
Use only after checking omx team status <team> and mailbox/state evidence:
- Capture pane tail to confirm current worker state:
tmux capture-pane -t %<worker-pane> -p -S -120- If a larger-tail read or bounded summary would help, prefer explicit opt-in inspection via
omx sparkshell --tmux-pane %<worker-pane> --tail-lines 400before improvising extra tmux commands.
- If the pane is stuck in an interactive state, safely return to idle prompt first:
- optional interrupt
C-cor escape flow (CLI-specific) once, then re-check pane capture
- optional interrupt
- Send one concise trigger (single line) and wait for evidence:
tmux send-keys -t %<worker-pane> "ack + continue current task; report status" C-m
- Re-check:
- pane output via
capture-pane - mailbox updates (
mailbox/leader-fixed.jsonor worker mailbox) omx team status <team>
- pane output via
worker_notify_failed:<worker>
Meaning:
- Leader wrote inbox but trigger submit path failed
Checks:
tmux list-panes -F '#{pane_id}\t#{pane_start_command}'tmux capture-pane -t %<worker-pane> -p -S -120- Verify worker process alive and not stuck on trust prompt
- Rebuild if running repo-local (
npm run build)
Team starts but leader gets no ACK
Checks:
- Worker pane capture shows inbox processing
.omx/state/team/<team>/mailbox/leader-fixed.jsonexists- Worker skill loaded and
omx team api send-message --jsoncalled - Task-id mismatch not blocking worker flow
Worker logs omx team api ... ENOENT (or legacy team_send_message ENOENT / team_update_task ENOENT)
Meaning:
- Team state path no longer exists while worker is still running.
- Typical cause: leader/manual flow ran
omx team shutdown <team>(or removed.omx/state/team/<team>) before worker finished.
Checks:
omx team status <team>and confirm whether tasks were stillin_progresswhen shutdown occurred- Verify whether
.omx/state/team/<team>/exists - Inspect worker pane tail for post-shutdown writes
- Confirm no external cleanup (
rm -rf .omx/state/team/<team>) happened during execution
Prevention:
- Enforce completion gate (no in-progress tasks) before shutdown
- Use
shutdownonly for terminal completion or explicit abort - If aborting, expect late worker writes to fail and treat ENOENT as expected teardown artifact
Shutdown reports success but stale worker panes remain
Cause:
- stale pane outside config tracking or previous failed run
Fix:
- manual pane cleanup (see clean-slate commands)
Clean-Slate Recovery
Run from leader pane:
# 1) Inspect panes
tmux list-panes -F '#{pane_id}\t#{pane_current_command}\t#{pane_start_command}'
# 2) Kill stale worker panes only (examples)
tmux kill-pane -t %450
tmux kill-pane -t %451
# 3) Remove stale team state (example)
rm -rf .omx/state/team/<team-name>
# 4) Retry
omx team 1:executor "fresh retry"
Guidelines:
- Do not kill leader pane
- Do not kill HUD pane (
omx hud --watch) unless intentionally restarting HUD
Required Reporting During Execution
When operating this skill, provide concrete progress evidence:
- Team started line (
Team started: <name>) - tmux target and worker pane presence
- leader mailbox ACK path/content check
- status/shutdown outcomes
Do not claim success without file/pane evidence.
Do not claim clean completion if shutdown occurred with in_progress>0.
Use omx sparkshell --tmux-pane ... as an explicit opt-in operator aid for pane inspection and summaries; keep raw tmux capture-pane evidence available for manual intervention and proof.
MCP Job Lifecycle Tools
For programmatic or agent-driven team spawning (as opposed to interactive CLI use), OMX exposes four MCP tools via the team-server:
| Tool | Description |
|---|---|
omx_run_team_start |
Spawn tmux CLI workers in the background; returns a jobId immediately |
omx_run_team_status |
Non-blocking status check for a running job |
omx_run_team_wait |
Block until the job completes, with automatic idle-pane nudging |
omx_run_team_cleanup |
Kill worker tmux panes for a job (early stop only) |
CLI vs MCP Tools
omx team ...CLI — Primary method for interactive team orchestration. Use this when you are operating inside a live tmux session and want direct pane visibility.omx_run_team_*MCP tools — For programmatic or agent-driven team spawning (analogous to OMC'somc_run_team_*tools). Use these when an agent needs to launch workers, poll status, and collect results without manual intervention.
Naming Distinction
Two cleanup tools exist and must not be confused:
team_cleanup(state-server): Deletes team state files on disk (.omx/state/team/<team>/). Use after a team run is fully complete.omx_run_team_cleanup(team-server): Kills tmux worker panes for a job. Use only when stopping workers early; otherwiseomx_run_team_waithandles natural termination.
Basic Usage Example
1. omx_run_team_start({
teamName: "fix-bugs",
agentTypes: ["codex"],
tasks: [{ subject: "Fix bug", description: "..." }],
cwd: "/path/to/project"
})
→ Returns { jobId: "omx-abc123" }
2. omx_run_team_wait({ job_id: "omx-abc123", timeout_ms: 300000 })
→ Blocks until done, auto-nudges idle panes
3. omx_run_team_cleanup({ job_id: "omx-abc123" })
→ Only needed if stopping workers early
omx_run_team_status can be called between steps 1 and 2 for a non-blocking poll if you need to interleave other work while workers run.
Limitations
- Worktree provisioning requires a git repository and can fail on branch/path collisions
- send-keys interactions can be timing-sensitive under load
- stale panes from prior runs can interfere until manually cleaned
Scenario Examples
Good: The user says continue after the workflow already has a clear next step. Continue the current branch of work instead of restarting or re-asking the same question.
Good: The user changes only the output shape or downstream delivery step (for example make a PR). Preserve earlier non-conflicting workflow constraints and apply the update locally.
Bad: The user says continue, and the workflow restarts discovery or stops before the missing verification/evidence is gathered.