workflow-orchestrator by scientiacapital/skills

<quick_start> Start session:

pwd && git status && git log --oneline -5
cat PROJECT_CONTEXT.md 2>/dev/null

→ Auto-spawns Observer team (non-negotiable)

Feature development: Contract → Builder Team → Observer monitors → Security gate → Ship

End session:

Observer final report
Security sweep: gitleaks detect --source .
Update PROJECT_CONTEXT.md
Log costs to costs/daily-YYYY-MM-DD.json </quick_start>

<success_criteria> Workflow is successful when:

Context scan completed at session start
Observer team spawned and monitoring (non-negotiable)
Contract defined before any feature implementation
Observer BLOCKER gate checked before phase transitions
Security sweep passes before any commits
Cost tracking updated (daily.json, mtd.json)
Observer final report generated at end of day
PROJECT_CONTEXT.md updated at session end
All security gates passed before shipping </success_criteria>

Session Management: "start day", "begin session", "what's the status", "end day", "wrap up", "done for today"
Feature Development: "new feature", "build", "implement"
Dual-Team: "dual team", "observer team", "builder team", "spawn observers", "devil's advocate"
Debugging: "debug", "investigate", "why is this broken"
Research: "research", "evaluate", "should we use"

DUAL-TEAM ARCHITECTURE

Every session runs two concurrent teams under the Orchestrator:

              ORCHESTRATOR
              ┌────┴────┐
        BUILDER TEAM   OBSERVER TEAM
        (ships fast)   (watches quality)
        ├ Lead Builder  ├ Code Quality (haiku)
        ├ Builder(s)    └ Architecture (sonnet)
        └ Devil's Adv.    └─ (Devil's Advocate)

Observer team is non-negotiable. It always runs, even for "small" changes.

Builders optimize for velocity — ship features via worktrees or subagents
Observers optimize for correctness — detect drift, debt, gaps, scope creep
Devil's advocate on each team prevents groupthink and blind spots

Observers write findings to .claude/OBSERVER_*.md with severity levels:

🔴 BLOCKER — stop work, fix immediately
🟡 WARNING — fix before merge or log
🔵 INFO — backlog

Deep dive: See reference/dual-team-architecture.md

START DAY

Pre-Flight Checks

git status --short | head -5
[ -f package.json ] && [ ! -d node_modules ] && echo "Run npm install"
[ -f requirements.txt ] && [ ! -d .venv ] && echo "Run pip install"
[ -f .env.example ] && [ ! -f .env ] && echo "Copy .env.example to .env"

Context Scan (Mandatory)

pwd && git status && git log --oneline -5
cat PROJECT_CONTEXT.md 2>/dev/null || echo "No context file"
cat CLAUDE.md 2>/dev/null
cat PLANNING.md 2>/dev/null

Observer Team Spawn (Automatic)

Spawn Observer team concurrent with standup:

Code Quality Observer (haiku) — tech debt, test gaps, imports
Architecture Observer (sonnet) — contract drift, scope creep, design violations

# Detect Native Agent Teams support
if [ "${CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS}" = "1" ]; then
  echo "Native Agent Teams enabled — using DAG task system"
  # Use TeamCreate + SendMessage for Observer coordination
else
  echo "Native Agent Teams not enabled — falling back to manual worktree spawn"
  # Use subagent-teams-skill Task tool pattern
fi

Sprint Plan

Include Observer BLOCKER gate in sprint plan — no phase transitions if active BLOCKERs in .claude/OBSERVER_ALERTS.md.

Output Format

## Session Start: [PROJECT_NAME]
### Completed (Last Session)
- [x] Task 1
### In Progress
| Task | Branch/Worktree | Status |
|------|-----------------|--------|
| API endpoint | feature/api | 70% |
### Observer Status
- Team: ACTIVE (Code Quality + Architecture)
- Active blockers: 0
### Today's Priority Queue
1. [BUILDER] Feature implementation
2. [OBSERVER] Continuous monitoring
### Cost Context
- Today: $0.00 | MTD: $12.34 | Budget: $100

Deep dive: See reference/start-day-protocol.md

RESEARCH PHASE

Trigger: Before ANY feature development involving new frameworks, APIs, or architectural decisions.

Scan → Evaluate → Decide

Check existing solutions in your repos and MCP cookbook
Use research-skill checklist for framework selection
Cost projection before building
Create RESEARCH.md → FINDINGS.md with GO/NO-GO recommendation

⛔ Gate: Human checkpoint required before proceeding

Deep dive: See reference/research-workflow.md

FEATURE DEVELOPMENT

Phase 0: CONTRACT DEFINITION (Before Code)

## Feature Contract: [NAME]
### Endpoints / Interfaces
- POST /api/widgets → { id, name, created_at }
### Scope Boundaries
- IN SCOPE: [list]
- OUT OF SCOPE: [list]
### Observer Checkpoints
- [ ] Architecture Observer approves contract
- [ ] Code Quality Observer runs after each merge

⛔ Gate: Architecture Observer must approve contract before Phase 1

Phase 1: BUILDER TEAM SPAWN

# Option A: Worktree sessions (long tasks, 30+ min)
git worktree add -b feature/api ~/tmp/worktrees/$(basename $(pwd))/api
# Option B: Native Agent Teams (short tasks, <20 min)
# Use TeamCreate + Task tool subagents
# Option C: Hybrid — worktrees for Builders, Task tool for Observers

Each builder gets a WORKTREE_TASK.md with:

Scope boundary (what they CAN and CANNOT touch)
Contract reference
Devil's advocate mandate (for one builder per cycle)

Phase 2: OBSERVER MONITORING (Concurrent)

Observers run parallel to builders on a 5-10 minute loop:

Pull latest from builder branches
Run 7 drift detection patterns (see reference/observer-patterns.md)
Write findings to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md
Escalate BLOCKERs to .claude/OBSERVER_ALERTS.md

Phase 3: SECURITY + QUALITY GATE

# Security scans
semgrep --config auto .
gitleaks detect --source .
npm audit --audit-level=critical || pip-audit
pytest --cov=src || npm test -- --coverage

⛔ Gate: ALL must pass + no active Observer BLOCKERs

gate = (
    sast_clean AND
    secrets_found == 0 AND
    critical_vulns == 0 AND
    test_coverage >= 80 AND
    observer_blockers == 0  # NEW: Observer gate
)

Phase 4: SHIP

git diff main...HEAD
git add . && git commit -m "feat: [description]"
git push
echo '{"feature": "X", "cost": 1.23}' >> costs/by-feature.jsonl

Deep dive: See reference/feature-development.md

DEBUG MODE

Trigger: When standard troubleshooting fails or issue is complex.

Evidence → Hypothesize → Test → Verify

Evidence gathering — exact error, reproduction steps, expected vs actual
Hypothesis formation — 3+ hypotheses with evidence for each
Systematic testing — one variable at a time
Verification — root cause confirmed before committing fix

Critical Rules

❌ NO DRIVE-BY FIXES — explain WHY before committing
❌ NO GUESSING — verify everything
✅ Use all tools: MCP servers, web search, extended thinking
✅ One variable at a time

Deep dive: See reference/debug-methodology.md

END DAY

Phase 1: Observer Final Report (Before Security Sweep)

Observers generate final reports:

Summary of all findings (resolved + open)
Metrics: debt items, test coverage delta, contract compliance
Write to .claude/OBSERVER_QUALITY.md and .claude/OBSERVER_ARCH.md

⛔ Gate: Review Observer report before proceeding

Phase 2: Security Sweep (Mandatory)

gitleaks detect --source . --verbose
git log -p | grep -E "(password|secret|api.?key|token)" || echo "Clean"
npm audit --audit-level=critical 2>/dev/null || pip-audit 2>/dev/null

⛔ Gate: ALL must pass before any commits

Phase 3: Context + Metrics + Cleanup

## PROJECT_CONTEXT.md Update
### Completed This Session
- [x] Feature X
### Observer Summary
- Blockers resolved: N | Warnings logged: N | Debt delta: +/-N
### Tomorrow's Priorities
1. Next task

Archive observer reports to .claude/OBSERVER_HISTORY/, reset for next session.

# Cost tracking
echo '{"total": 0.47}' >> costs/daily-$(date +%Y-%m-%d).json
# Portfolio metrics
git diff --stat $(git log --since="today 00:00" --format="%H" | tail -1)..HEAD 2>/dev/null

Deep dive: See reference/end-day-protocol.md

COST GATE

Pre-Flight Budget Check

COST_FILE=~/.claude/daily-cost.json
SPENT=$(jq '.spent' "$COST_FILE" 2>/dev/null || echo 0)
BUDGET=$(jq '.budget_monthly' "$COST_FILE" 2>/dev/null || echo 100)
echo "MTD: \$$SPENT / \$$BUDGET"

% of Budget	Action
< 50%	Proceed normally
50-80%	Cost warning, suggest model downgrade
80-95%	WARN — Ask user before proceeding
> 95%	BLOCK — Require explicit override

Deep dive: See reference/cost-tracking.md

ROLLBACK / RECOVERY

Stash current work → find last known good → selective rollback or full revert → verify tests → investigate root cause
Use debug-like-expert-skill for root cause analysis

Deep dive: See reference/rollback-recovery.md

SKILL INVOCATION QUICK REFERENCE

Need	Invoke	Model
Spawn builder team	`agent-teams-skill` or Task tool with `team_name`	sonnet
Spawn observer team	Auto at START DAY, or `workflow-orchestrator`	haiku/sonnet
Debug a failing test	`debug-like-expert-skill`	sonnet
Review code quality	`superpowers:requesting-code-review`	sonnet
Run security sweep	`security-skill`	sonnet
Track costs	`cost-metering-skill`	haiku
Write tests	`testing-skill`	sonnet
Design API contract	`api-design-skill`	sonnet
Plan architecture	`planning-prompts-skill`	opus
Capture metrics	`portfolio-artifact-skill`	haiku
Parallel build (worktrees)	`agent-teams-skill`	sonnet
Parallel build (in-session)	`subagent-teams-skill`	sonnet
Map task → best agent	`agent-capability-matrix-skill`	—

Native Agent Teams (Experimental)

export CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS=1
claude --worktree  # auto-creates worktree branch

Features: DAG task system, peer-to-peer messaging, TeammateIdle/TaskCompleted hooks, shared task lists, isolation: "worktree" for subagents.

Deep dive: See reference/agent-routing.md for complete 70+ agent catalog

PROGRESS RENDERING

Use TaskCreate with activeForm for live UI spinners:

TaskCreate({ subject: "Plan architecture", activeForm: "Planning architecture" })
TaskUpdate({ taskId: "1", status: "in_progress" })  // → live spinner
TaskUpdate({ taskId: "1", status: "completed" })     // → checkmark

Use addBlockedBy for phase sequencing. Markdown tables for summaries.

CLAUDE CODE COMMANDS

Command	Workflow
`/start-day`	Start Day — context scan, Observer spawn, cost status
`/build-feature <name>`	Feature Dev — contract → build → observe → ship
`/end-day`	End Day — Observer report, security sweep, context save
`/quick-fix <issue>`	Debug — evidence → hypothesis → fix
`/cost-check`	Display daily/MTD spend and budget status

GTME PERSPECTIVE

This dual-team workflow demonstrates advanced GTME capabilities:

Process Engineering — Quality control built into lifecycle, not bolted on
Parallel Orchestration — Managing concurrent teams with different objectives
Adversarial Thinking — Devil's advocate prevents "happy path only" trap
Cost Awareness — Observers use cheaper models; unit economics thinking
Measurable Output — Observer reports quantify debt, coverage, compliance

Portfolio Value: Produces auditable artifacts demonstrating engineering discipline + process thinking + cost optimization.

Emit Outcome Sidecar

As the final step, write to ~/.claude/skill-analytics/last-outcome-workflow-orchestrator.json:

{"ts":"[UTC ISO8601]","skill":"workflow-orchestrator","version":"2.0.0","variant":"default",
 "status":"[success|partial|error]","runtime_ms":[estimated ms from start],
 "metrics":{"workflows_run":[n],"stages_completed":[n],"observers_spawned":[n]},
 "error":null,"session_id":"[YYYY-MM-DD]"}

Use status "partial" if some stages failed but results were produced. Use "error" only if no output was generated.