skill-pipeline
Skill Pipeline
The conductor, not a player. This skill classifies tasks, selects the pipeline variant, calibrates depth, and orchestrates handoffs between skills. It produces no artifacts of its own — its output is routing decisions that activate other skills.
Task Classification
On every coding task, classify before acting. Evaluate scope signals and map to a task class.
Input signals: file count, task description, existing plan/handoff files, batch indicators, CI environment.
Task received
│
├─ Trivial (typo, rename, version bump)
│ → No skills. Just do it.
│
├─ Small (isolated fix, single-file, <10 logic lines)
│ → simplify-and-harden only (post-completion)
│
├─ Medium (feature in known area, 2-5 files)
│ → intent-framed-agent + simplify-and-harden
│
├─ Large (complex refactor, new architecture, unfamiliar codebase, high-risk logic)
│ → Full standard pipeline
│ → Recommend /plan-interview before starting
│
├─ Long-running (multi-session, high context pressure, prior handoff exists)
│ → Full standard pipeline with context-surfing as critical skill
│
└─ Batch (multiple features from spec, 5+ discrete tasks, issue triage)
→ Team-based pipeline (agent-teams-simplify-and-harden)
When uncertain, start with Medium. Add skills if drift or quality issues appear mid-task.
For detailed heuristics, edge cases, and examples: read references/classification-rules.md.
Pipeline Selection
Route task class to the right variant:
| Task Class | Variant | Rationale |
|---|---|---|
| Trivial | None | No overhead needed |
| Small | Standard (minimal) | S&H only |
| Medium | Standard (partial) | Scope monitoring + review |
| Large | Standard (full) | Full depth with planning |
| Long-running | Standard (full) | Context-surfing is critical |
| Batch | Team-based | Breadth over depth |
| CI environment | CI | Headless review |
Heuristic: Standard pipeline for depth (single complex feature). Team-based pipeline for breadth (batch of tasks). CI pipeline when CI=true or GITHUB_ACTIONS=true.
Activation Sequences
Standard Pipeline
classify
→ (recommend /plan-interview if Large or Long-running)
→ intent-framed-agent (at planning-to-execution transition)
→ context-surfing (auto-activates when intent frame + plan exist; concurrent with intent monitoring)
→ [IMPLEMENTATION]
→ simplify-and-harden (post-completion, if non-trivial diff)
→ self-improvement (on errors, corrections, or S&H learning candidates)
Skill-by-class activation:
| Skill | Trivial | Small | Medium | Large | Long-running |
|---|---|---|---|---|---|
| plan-interview | - | - | - | Recommend | Recommend |
| intent-framed-agent | - | - | Activate | Activate | Activate |
| context-surfing | - | - | - | Activate | Critical |
| simplify-and-harden | - | If non-trivial | If non-trivial | If non-trivial | If non-trivial |
| self-improvement | On error only | On error only | On error/completion | On error/completion | On error/completion |
Team-Based Pipeline
classify (Batch)
→ (recommend /plan-interview if no spec exists)
→ agent-teams-simplify-and-harden
├─ Team lead emits Intent Frame #1
├─ Phase 1: parallel implementation agents
├─ Compile + test verification
├─ Phase 2: parallel audit agents (simplify, harden, spec)
├─ Fix loop (up to 3 audit rounds)
└─ Learning loop output
→ self-improvement
CI Pipeline
classify (CI detected)
→ simplify-and-harden-ci (headless scan, PR changed files only)
→ self-improvement-ci (pattern aggregation, promotion recommendations)
Depth Calibration
Not just which skills — how deep each goes:
| Dimension | Small | Medium | Large | Long-running | Batch |
|---|---|---|---|---|---|
| Planning passes | 0 | 0-1 | 1-2 | Deep iterative | Per-task or umbrella |
| Intent frame | - | Single frame | Full frame + monitoring | Full + handoff | Team lead frame |
| Context-surfing | - | - | Active | Critical (exit protocol ready) | Lightweight drift checks |
| S&H budget | 20% diff, 60s | 20% diff, 60s | 20% diff, 60s | 20% diff, 60s | 30% team growth cap |
| Audit rounds (teams) | - | - | - | - | Up to 3 |
| Self-improvement | Error-triggered | Error-triggered | Error + S&H feed | Error + S&H feed | Error + teams feed |
Handoff Rules
Artifacts flow between skills. The orchestrator ensures each skill receives what it needs.
Key handoffs:
-
Plan file (
docs/plans/plan-NNN-<slug>.md) — produced byplan-interview, consumed byintent-framed-agent(context),context-surfing(wave anchor),agent-teams(task extraction). -
Intent Frame — produced by
intent-framed-agent, consumed bycontext-surfing(wave anchor strengthening). Copied into handoff files on drift exit. -
Handoff file (
.context-surfing/handoff-[slug]-[timestamp].md) — produced bycontext-surfingon drift exit, consumed by next session for resume. -
Learning candidates (
learning_loop.candidates) — produced bysimplify-and-hardenandagent-teams, consumed byself-improvementfor pattern tracking.
Precedence: If context-surfing and intent-framed-agent both fire simultaneously, context-surfing's exit takes precedence. Degraded context makes scope checks unreliable.
For the full artifact/signal/budget table: read references/handoff-matrix.md.
Decision Points
The orchestrator intervenes at these moments:
Task Arrival
Classify the task. Select pipeline variant and depth. Emit routing decision. If Large/Long-running, recommend /plan-interview. If Batch, recommend team-based variant.
Plan Approval
When user approves a plan from plan-interview, flow directly into the execution stage — no separate "should I proceed?" prompt. This means activating intent-framed-agent to emit an Intent Frame. The intent frame itself still requires user confirmation before coding begins (that confirmation is part of intent-framed-agent, not an extra gate). Populate task tracking with checklist items.
Planning-to-Execution Transition
When no plan-interview was used and the user signals readiness ("go ahead", "implement this", "let's start"), activate intent-framed-agent. Emit Intent Frame. Wait for user confirmation of the frame before coding.
Implementation Complete
Check if diff meets the non-trivial threshold (see references/classification-rules.md). If yes, activate simplify-and-harden. If no, signal completion directly.
Drift Detected
If context-surfing fires a drift exit, stop execution. Write handoff file. If the task was classified below Large, consider re-classifying upward for the next session.
Session Resume
Check for handoff files in .context-surfing/. If found, read completely. Re-establish context from handoff. Re-classify if needed. Resume from recommended re-entry point.
Overrides
Users can override any routing decision:
- Force depth:
depth=small/depth=large— override classification - Force variant:
variant=teams/variant=standard— override pipeline selection - Skip review:
--no-review— skipsimplify-and-harden - Force planning: invoke
/plan-interviewon any task regardless of classification - Skip all skills: user says "just do it" on a non-trivial task — respect the override
Re-classification
Tasks can change class mid-execution. Watch for:
- Escalation signals: scope expanded beyond original estimate, many more files affected than expected,
context-surfingdrift exit,intent-framed-agentdetects significant scope change - De-escalation signals: task turns out simpler than planned, plan reveals minimal changes needed
When re-classification is warranted:
- Note the signal that triggered re-classification
- Adjust active skills (add or remove pipeline stages)
- If escalating to Large: recommend
/plan-interviewif no plan exists - If de-escalating: drop unnecessary stages, proceed with lighter pipeline
Anti-Patterns
- Do NOT re-implement skill logic. This skill classifies and dispatches. Each individual skill owns its own procedure.
- Do NOT auto-invoke plan-interview. It is a human gate. Recommend it; let the user decide.
- Do NOT create a third monitoring layer. During execution,
intent-framed-agentmonitors scope andcontext-surfingmonitors quality. The orchestrator dispatches at decision points, then gets out of the way. - Do NOT override individual skill guardrails. If
simplify-and-hardenhas a 20% budget cap, the orchestrator respects it. - Do NOT force skills on trivial tasks. The pipeline exists to help, not to slow down simple work.
Pipeline Variant Details
For complete step-by-step walkthroughs of each variant including hybrid scenarios and session resume: read references/pipeline-variants.md.