Multi-AI Planning

Overview

multi-ai-planning creates comprehensive, agent-executable plans for complex objectives using verification-first approach, hierarchical task decomposition, and rigorous quality validation.

Purpose: Transform objectives into structured, validated plans that agents can execute reliably

Pattern: Workflow-based (6-step sequential process)

Key Innovation: Plans are schema-validated, quality-scored (≥90/100), and designed for multi-agent parallel execution with built-in verification and checkpoint systems.

Core Principles (validated by Claude + Gemini + Codex):

Verification-First: Define success criteria before implementation
Hierarchical Decomposition: Break down to atomic, executable steps
Explicit Dependencies: Map all dependencies, identify parallel opportunities
Quality Gates: Multi-layer validation before execution
Safe Checkpoints: Git tags + patches for resumption and rollback

When to Use

Use multi-ai-planning when:

Planning complex features (>8 tasks, multiple components)
Multi-step migrations or refactorings
Security implementations requiring verification
System integrations with many touch points
Projects requiring multi-agent coordination
When parallel execution optimization is important
Quality and verification are critical

When NOT to Use:

Simple tasks (<3 steps)
Ad-hoc exploratory work
Quick prototyping without formal structure

Prerequisites

Required

Clear objective (know what you want to accomplish)
Time for planning (1-3 hours depending on complexity)

Optional (for enhanced planning)

multi-ai-research skill (for research phase)
Gemini CLI (npm install -g @google/gemini-cli) - web research
Codex CLI (npm install -g @openai/codex) - code pattern research

Understanding

Familiarity with the objective domain
Basic understanding of agent coordination (see references/task-tool-coordination.md)

Planning Workflow

Step 1: Objective Analysis & Research

Analyze the objective and optionally research best practices, patterns, and approaches before creating the plan.

Purpose: Ground plan in proven patterns and understand problem space

When to Use: Always (research optional but recommended for unfamiliar domains)

Inputs:

Clear objective statement
Context about current system (if applicable)
Success criteria (what "done" looks like)

Process:

Clarify Objective:

Objective: [Specific statement of what needs to be accomplished]

Context:
- Current state: [What exists now]
- Desired state: [What should exist]
- Constraints: [Time, resources, compatibility]

Success Criteria:
1. [Specific, measurable criterion]
2. [Criterion 2]
3. [Criterion 3]

Determine Plan Type:
- Feature development (new functionality)
- Refactoring (improve existing code)
- Migration (move to new technology/pattern)
- Security implementation (add security features)
- Integration (connect systems)
- Research project (investigate and document)
Optional: Research Domain (recommended for complex/unfamiliar):

If using multi-ai-research skill:
```
Use multi-ai-research for "Research [domain] best practices and implementation patterns"
```
This will provide:
- Official documentation insights (Claude)
- Latest web best practices (Gemini)
- GitHub implementation patterns (Codex)
- Multi-source validation
Manual research (if multi-ai-research not available):
- Search web for "[domain] best practices 2024-2025"
- Review official documentation
- Find 3-5 example implementations
- Document findings in research-notes.md

Analyze Approach Options: Based on research (if done), identify 2-3 possible approaches:

## Approach Options

### Option A: [Approach name]
- **Description**: [How it works]
- **Pros**: [Benefits]
- **Cons**: [Drawbacks]
- **Complexity**: Low/Medium/High
- **Risk**: Low/Medium/High

### Option B: [Alternative]
[Same format]

### Recommended: [Choice] because [rationale]

Define Initial Scope:

## Scope Definition

**Included**:
- [Component/area in scope]
- [Component 2]

**Excluded**:
- [What we're NOT doing]
- [Out of scope item 2]

**Rationale**: [Why this scope is appropriate]

Outputs:

Objective clarification document
Plan type identified
Research findings (if research conducted)
Approach recommendation with rationale
Initial scope definition

Validation:

Time Estimate: 30-60 minutes (research adds 30-60 min if used)

Next: Proceed to Step 2

Step 2: Hierarchical Task Decomposition

Break down the objective into hierarchical tasks using recursive decomposition until reaching atomic, executable steps.

Purpose: Create complete task tree with appropriate granularity

Inputs:

Objective and scope (from Step 1)
Chosen approach
Research findings (if available)

Process:

Identify Large Tasks (8-15 tasks):

Break objective into major phases or components:

## Large Task Breakdown

1. **Database Schema** - Create data models and migrations
2. **OAuth Integration** - Implement OAuth provider
3. **API Endpoints** - Create authentication endpoints
4. **Frontend Components** - Build UI for auth
5. **Testing Suite** - Comprehensive tests
6. **Documentation** - User and technical docs
7. **Security Audit** - Verify security best practices
8. **Deployment** - Production deployment

Total large tasks: 8 (target: 8-15)

Granularity Check:

Too few (<5): Tasks are too big, decompose more
Just right (8-15): Good granularity
Too many (>15): Tasks too small, group related tasks

Decompose Large → Medium (3-8 tasks each):

For each large task, break into medium-sized tasks:

### Large Task 2: OAuth Integration

Medium tasks:
2.1. Provider Configuration (setup OAuth provider)
2.2. Token Management (generate, refresh, validate tokens)
2.3. User Profile Sync (fetch and store user data)
2.4. Error Handling (handle OAuth failures)
2.5. Integration Tests (verify OAuth flow)

Total: 5 medium tasks (target: 3-8)

For Complex Projects: Can use Task tool to parallelize decomposition:

// Spawn parallel decomposition agents
const decompositions = await Promise.all([
  task({
    description: "Decompose database schema task",
    prompt: `Break down "Database Schema" large task into 3-8 medium tasks.
    For each: description, dependencies, estimate.
    Write to decomp-task1.json`
  }),
  task({
    description: "Decompose OAuth task",
    prompt: `Break down "OAuth Integration"...
    Write to decomp-task2.json`
  })
]);
// Integrate results into plan

Decompose Medium → Small/Atomic (for complex medium tasks):

Continue breaking down until tasks are atomic (cannot be meaningfully subdivided):

### Medium Task 2.2: Token Management

Small/Atomic tasks:
2.2.1. Create token schema (database)
2.2.2. Implement token generation function
2.2.3. Implement token refresh logic
2.2.4. Add token validation middleware

Total: 4 atomic tasks (can't break down further)

Granularity Validation:

Check each task level:

## Granularity Check

**Huge** (if objective is massive): 15-25 large tasks
**Large**: 8-15 medium tasks each
**Medium**: 3-8 small tasks each
**Small**: Atomic (single focused operation)

✅ All tasks at appropriate granularity
✅ No task is too big (>15 subtasks)
✅ No unnecessary subdivision

Outputs:

Complete hierarchical task tree
All tasks with clear descriptions
Appropriate granularity at each level
Task IDs in hierarchical format (1, 1.2, 1.2.3)

Validation:

Large tasks (8-15) identified
Complex large tasks decomposed to medium (3-8 each)
Complex medium tasks decomposed to small/atomic
All tasks have clear, actionable descriptions
No task is too vague or too big
Granularity appropriate at each level

Time Estimate: 45-90 minutes (varies with complexity)

Next: Proceed to Step 3

Step 3: Dependency Mapping & Parallel Identification

Map dependencies between all tasks and identify opportunities for parallel execution.

Purpose: Enable optimal execution order and parallel agent coordination

Inputs:

Complete task tree (from Step 2)

Process:

Identify Dependencies:

For each task, ask:

What must be complete before this can start?
What information/artifacts does this task need?
What can this task be done in parallel with?

## Dependency Analysis

### Task 1: Database Schema
- **Depends on**: None (can start immediately)
- **Provides for**: Tasks 2, 3, 4 (need schema to exist)

### Task 2: OAuth Integration
- **Depends on**: Task 1 (needs user table schema)
- **Provides for**: Task 3 (API needs OAuth)

### Task 3: API Endpoints
- **Depends on**: Task 1 (schema), Task 2 (OAuth)
- **Can parallel with**: Task 4 (frontend independent)

### Task 4: Frontend Components
- **Depends on**: None initially (can mock API)
- **Integration depends on**: Task 3 (real API)

Build Dependency Graph:

Create visual or structured representation:

## Dependency Graph

```mermaid
graph TD
  T1[Task 1: Schema]
  T2[Task 2: OAuth]
  T3[Task 3: API]
  T4[Task 4: Frontend]
  T5[Task 5: Tests]

  T1 --> T2
  T1 --> T3
  T2 --> T3
  T3 --> T5
  T4 --> T5

JSON Format (for schema):

{
  "dependencies": {
    "2": ["1"],
    "3": ["1", "2"],
    "5": ["3", "4"]
  }
}

Validate No Circular Dependencies:

Check for cycles:
- Task A depends on Task B
- Task B depends on Task C
- Task C depends on Task A ❌ CIRCULAR!
Validation Method:
```
# Use scripts/visualize-deps.sh to generate graph
# Visual inspection will show cycles
# Or use topological sort algorithm
```

Identify Parallel Execution Groups:

Group tasks that can run simultaneously:

## Parallel Execution Opportunities

### Phase 1 (Immediate Start)
- Task 1: Database Schema
**Agents**: 1
**Estimated**: 3h

### Phase 2 (After Task 1)
- Task 2: OAuth Integration
- Task 4: Frontend (can start with mocks)
**Agents**: 2 parallel
**Estimated**: 5h (max of the two)

### Phase 3 (After Tasks 2, 3)
- Task 3: API Endpoints
**Agents**: 1
**Estimated**: 4h

### Phase 4 (After Tasks 3, 4)
- Task 5: Tests (integration)
**Agents**: 1
**Estimated**: 3h

**Total Sequential**: 15h
**With Parallelization**: 12h (20% savings)

Calculate Critical Path:

Find longest chain of dependent tasks:

## Critical Path Analysis

**Critical Path**: Task 1 → Task 2 → Task 3 → Task 5
**Duration**: 3h + 5h + 4h + 3h = 15h

**Optimization**: Task 4 can run in parallel, not on critical path

**Focus**: Critical path tasks are priority for on-time completion

Outputs:

Dependency map (JSON format)
Dependency graph (visual/mermaid)
Parallel execution groups identified
Critical path calculated
Optimized execution phases

Validation:

Time Estimate: 30-45 minutes

Next: Proceed to Step 4

Step 4: Verification Planning

For each task, define verification-first: specify how to verify it's done correctly BEFORE planning implementation.

Purpose: Ensure every task is testable and has clear success criteria

Inputs:

Complete task tree with dependencies (from Steps 2-3)

Process:

For Each Task, Define Success Criteria:

Use SMART criteria (Specific, Measurable, Achievable, Relevant, Time-bound):

### Task 2.2.2: Implement Token Generation

**Success Criteria** (defined FIRST):
- [ ] Function `generateToken(user)` exists in src/auth/tokens.ts
- [ ] Function returns JWT string (format: xxx.yyy.zzz)
- [ ] Token includes userId claim in payload
- [ ] Token expires in 24 hours (exp claim)
- [ ] Token signed with process.env.JWT_SECRET
- [ ] Function throws error for invalid user
- [ ] Function handles missing user gracefully

**NOT** vague criteria like:
❌ "Token generation works"
❌ "Function is implemented"

Choose Verification Method:

Select appropriate verification for each task:
- automated_test: Unit/integration tests (preferred)
- manual_check: Manual verification (when automation not feasible)
- visual_inspection: UI/UX verification (screenshots)
- integration_test: End-to-end workflow tests
- code_review: Review for quality/security

Define Verification Commands (for automated):

### Verification Method: automated_test

**Commands**:
```bash
# Run specific test file
npm test -- src/auth/tokens.test.ts

# Verify function exists
grep -q "function generateToken" src/auth/tokens.ts || exit 1

# Check test coverage
npm run coverage -- --file=src/auth/tokens.ts

Expected: All tests pass, coverage ≥80% Timeout: 30 seconds

Plan Quality Gates:

Define gates at major milestones:

## Quality Gates

### Gate 1: Foundation Complete (after Tasks 1-2)
- [ ] Database schema created and migrated
- [ ] OAuth provider configured
- [ ] All tests pass
- [ ] No linting errors

### Gate 2: Core Features Complete (after Tasks 3-4)
- [ ] API endpoints functional
- [ ] Frontend integrated
- [ ] Integration tests pass
- [ ] Manual testing complete

### Gate 3: Production Ready (after all tasks)
- [ ] All functionality verified
- [ ] Security audit passed
- [ ] Documentation complete
- [ ] Deployment tested

Add to Task Definitions:

Update each task with verification:

{
  "id": "2.2.2",
  "description": "Implement token generation function",
  "verification": {
    "success_criteria": [
      "Function generateToken(user) exists",
      "Returns JWT string format xxx.yyy.zzz",
      "Includes userId claim",
      "Expires in 24 hours",
      "Throws on invalid user"
    ],
    "method": "automated_test",
    "commands": [
      "npm test -- src/auth/tokens.test.ts",
      "npm run coverage -- --file=src/auth/tokens.ts"
    ],
    "timeout_seconds": 30
  }
}

Outputs:

All tasks have success criteria (minimum 1, target 3-5)
All tasks have verification method
Automated verification commands (where applicable)
Quality gates defined at milestones

Validation:

Every task has specific success criteria
Success criteria are measurable
Verification method appropriate for task
Automated commands provided (where feasible)
Quality gates defined
No vague criteria ("it works", "done")

Time Estimate: 45-75 minutes

Next: Proceed to Step 5

Step 5: Quality Verification & Scoring

Validate the plan for completeness, feasibility, clarity, and executability. Score 0-100 with ≥90 threshold for approval.

Purpose: Ensure plan quality before execution begins

Inputs:

Complete plan with tasks, dependencies, verification (from Steps 1-4)

Process:

Run Schema Validation:

# Validate plan structure
bash .claude/skills/multi-ai-planning/scripts/validate-schema.sh plan.json

# Expected output:
# ✅ plan.json validates against plan.schema.json
# ✅ All tasks validate
# ✅ All checkpoints validate

Check Completeness (scored /20):

### Completeness Assessment

- All requirements mapped to tasks: [X]/5
  - Check: Every requirement from Step 1 has task(s)
  - Missing: [List any unmapped requirements]

- All tasks have success criteria: [X]/5
  - Check: Every task has ≥1 criterion
  - Missing: [List tasks without criteria]

- All dependencies documented: [X]/5
  - Check: All task dependencies in dependency graph
  - Missing: [Any undocumented dependencies]

- Resource requirements clear: [X]/5
  - Check: Agents, tools, permissions documented
  - Missing: [Any unclear requirements]

**Completeness Score**: [X]/20

Assess Feasibility (scored /20):

### Feasibility Assessment

- Resources available: [X]/5
  - Required agents: Available ✅ / Missing ❌
  - Required tools: Available ✅ / Missing ❌

- Timeline realistic: [X]/5
  - Estimated: [X] hours
  - Available time: [Y] hours
  - Realistic ✅ / Optimistic ⚠️ / Unrealistic ❌

- No blocking constraints: [X]/5
  - API access: ✅
  - Permissions: ✅
  - External dependencies: ✅

- Dependencies manageable: [X]/5
  - Critical path length: Reasonable ✅ / Too long ❌
  - Parallel opportunities: Identified ✅

**Feasibility Score**: [X]/20

Evaluate Clarity (scored /20):

### Clarity Assessment

- All steps unambiguous: [X]/5
  - Check each task description
  - Vague tasks: [List]

- Terms defined: [X]/5
  - Technical terms explained ✅
  - Acronyms defined ✅

- Examples provided: [X]/5
  - Complex tasks have examples ✅
  - Verification examples present ✅

- Agent assignments clear: [X]/5
  - Each task suggests agent type ✅
  - Coordination documented ✅

**Clarity Score**: [X]/20

Score Executability (scored /20):

### Executability Assessment

- Success criteria measurable: [X]/5
  - All criteria are objective ✅
  - Can determine pass/fail ✅

- Verification methods defined: [X]/5
  - All tasks have method ✅
  - Methods appropriate ✅

- Failure handling planned: [X]/5
  - Failure scenarios documented ✅
  - Rollback procedures defined ✅

- Rollback procedures clear: [X]/5
  - Checkpoints defined ✅
  - Recovery steps documented ✅

**Executability Score**: [X]/20

Assess Integration (scored /20):

### Integration Assessment

- Integration points identified: [X]/5
  - All system touch points mapped ✅
  - APIs, databases, services documented ✅

- Backward compatibility checked: [X]/5
  - Compatibility plan exists ✅
  - Migration strategy defined ✅

- Testing strategy defined: [X]/5
  - Unit tests planned ✅
  - Integration tests planned ✅
  - E2E tests planned ✅

- Documentation plan clear: [X]/5
  - User docs planned ✅
  - Technical docs planned ✅
  - API docs planned ✅

**Integration Score**: [X]/20

Calculate Total Quality Score:

## Quality Score Summary

- Comprehensiveness: [X]/20
- Feasibility: [X]/20
- Clarity: [X]/20
- Executability: [X]/20
- Integration: [X]/20

**Total Score**: [X]/100

**Threshold**: ≥90 for approval

**Result**:
- ≥90: ✅ APPROVED (proceed to execution)
- 80-89: ⚠️ NEEDS MINOR REVISION (address gaps)
- 70-79: ❌ NEEDS MAJOR REVISION (significant issues)
- <70: ❌ REJECTED (fundamental problems, re-plan)

If Score <90, Identify Gaps:

## Gap Analysis

### Critical Gaps (Blocking Approval)
1. [Gap description]
   - **Impact**: Why this matters
   - **Fix**: Specific action to address
   - **Effort**: [time to fix]

### Medium Gaps (Should Fix)
1. [Gap]

**Estimated Effort to Reach ≥90**: [X] hours
**Recommendation**: [Apply fixes and re-verify / Re-plan from scratch]

Generate Verification Report:

Use templates/VERIFICATION_TEMPLATE.md to create detailed report.

Outputs:

Quality score (0-100) with dimension breakdown
Verification report with gap analysis
Approval decision (approved / needs revision / rejected)
If needs revision: specific fixes required

Validation:

All 5 quality dimensions scored
Total score calculated correctly
Threshold check performed (≥90)
Gaps identified if score <90
Verification report generated

Time Estimate: 30-45 minutes

Decision Point:

If ≥90: Proceed to Step 6 (Finalization)
If <90: Apply fixes, return to relevant step, re-verify

Next: If approved, proceed to Step 6

Step 6: Finalization & Output Generation

Generate final plan outputs in both machine-readable (JSON) and human-readable (Markdown) formats.

Purpose: Create production-ready plan artifacts ready for agent execution

Inputs:

Approved plan (quality score ≥90 from Step 5)

Process:

Generate plan.json (machine-readable):

Compile all plan data into schema-validated JSON:

# Use templates and gathered data to create plan.json
# Validate against schema
bash scripts/validate-schema.sh plan.json

# Expected output:
# ✅ Valid JSON
# ✅ Validates against plan.schema.json
# ✅ All required fields present

Generate PLAN.md (human-readable):

Create markdown version from plan.json or use template:
```
# Generate markdown from JSON
bash scripts/generate-plan-md.sh plan.json > PLAN.md

# Or use template and fill in manually
cp templates/PLAN_TEMPLATE.md PLAN.md
# Fill in all sections with plan data
```
Ensures:
- All information from JSON present
- Formatted for human readability
- Includes visual dependency graph
- Quality score displayed
- Clear next steps
Generate COORDINATION.md (agent execution guide):

Document how agents should execute this plan:
```
# Execution Coordination Guide

## Phase 1: Immediate Start
**Task 1**: Database Schema
**Agent**: Use Task tool to spawn code-writer agent
**Prompt**:
```
Create database schema for user authentication.

Requirements:
- Users table with id, email, password_hash
- OAuth_tokens table with user_id, provider, tokens
- Migrations using [framework]
Success criteria:
- Migration runs successfully
- Tables created with correct schema
- Indexes added for performance
Write migrations to: migrations/001_auth_schema.sql
```
**Verification**: Run migration, check tables exist
**Next**: Task 2 (after verification passes)

## Phase 2: Parallel Execution
[Similar format for parallel tasks]
```

Create Checkpoint Plan:

## Checkpoint Strategy

### CP-001: After Tasks 1-2
- Foundation complete
- Safe rollback point
```bash
git commit -m "Checkpoint 1: Foundation"
git tag -a cp-001 -m "Schema and OAuth"

CP-002: After Tasks 3-4

Core features complete
Integration point

git commit -m "Checkpoint 2: Core"
git tag -a cp-002 -m "API and Frontend"

Generate Verification Summary:

Include verification report in final package:

## Plan Quality Verification

- Schema Validation: ✅ PASS
- Quality Score: 94/100 ✅
- Completeness: 95% ✅
- All Quality Gates: ✅ PASS

**Status**: APPROVED FOR EXECUTION

Package Complete Plan:

Create plan package with all artifacts:

plans/[plan-id]/
├── plan.json (machine-readable)
├── PLAN.md (human-readable)
├── COORDINATION.md (execution guide)
├── VERIFICATION.md (quality report)
├── dependencies.mermaid (visual graph)
└── checkpoints/ (checkpoint data)

Outputs:

plan.json (validated against schema)
PLAN.md (human-readable version)
COORDINATION.md (agent execution guide)
VERIFICATION.md (quality assurance report)
Visual dependency graph
Complete plan package

Validation:

plan.json validates against schema
PLAN.md is comprehensive and clear
COORDINATION.md provides clear agent instructions
VERIFICATION.md shows quality ≥90
All artifacts in plan package
Ready for execution

Time Estimate: 20-30 minutes

Result: Production-ready plan, approved and ready for agent execution

Post-Workflow: Plan Execution

After completing the 6-step workflow, you have a production-ready plan. Now execute it:

Execution Pattern

Using the Plan:

Read COORDINATION.md for execution guidance
Follow tasks in dependency order
Execute parallel groups simultaneously where possible
Verify each task before proceeding
Create checkpoints at specified points
Update progress.json in real-time

Task Execution Template:

// For each task in execution order:

// 1. Read task from plan.json
const task = plan.tasks.find(t => t.id === currentTaskId);

// 2. Check dependencies satisfied
const depsSatisfied = task.dependencies.every(dep =>
  plan.tasks.find(t => t.id === dep).status === 'complete'
);

// 3. If dependencies met, execute
if (depsSatisfied) {
  // Execute via Task tool or direct implementation
  const result = await executeTask(task);

  // 4. Verify
  const verified = await verifyTask(task);

  // 5. Update status
  task.status = verified ? 'complete' : 'failed';
  updateProgress(task);

  // 6. Create checkpoint if specified
  if (isCheckpointAfterTask(task.id)) {
    createCheckpoint(task.id);
  }
}

Monitoring Progress:

# Check current status
cat plans/[plan-id]/progress.json

# View execution log
cat plans/[plan-id]/execution-log.md

# See what's next
grep "pending" plans/[plan-id]/plan.json

Integration with Multi-AI Research (Optional)

For enhanced planning with multi-perspective research:

Step 1 Enhancement: Research Phase

Instead of manual research, use multi-ai-research:

# Invoke multi-ai-research skill
Use multi-ai-research for "Research [domain] implementation patterns and best practices"

This provides:

Claude research: Official docs, codebase patterns
Gemini research: Web best practices, latest trends
Codex research: GitHub patterns, code examples
Cross-validated: All findings verified across sources
Quality: ≥95/100 with 100% citations

Outputs:

.analysis/ANALYSIS_FINAL.md - Comprehensive research
Patterns identified
Best practices extracted
Implementation recommendations

Use in Planning:

Informs approach selection (Step 1)
Guides task breakdown (Step 2)
Suggests verification methods (Step 4)
Improves plan quality

Time Added: +30-60 minutes Quality Improvement: +5-10 points on plan quality score

Best Practices

1. Research Before Planning (Step 1)

Practice: Invest time in research for unfamiliar domains

Rationale: Grounded plans based on proven patterns

2. Decompose to Atomic (Step 2)

Practice: Break down until tasks cannot be meaningfully subdivided

Rationale: Atomic tasks are executable and verifiable

3. Map Dependencies Explicitly (Step 3)

Practice: Document all dependencies, even obvious ones

Rationale: Prevents execution errors, enables parallelization

4. Verification Before Implementation (Step 4)

Practice: Define how to verify BEFORE defining how to implement

Rationale: Prevents ambiguity, enables test-driven development

5. Quality Threshold (Step 5)

Practice: Require ≥90 quality score before approval

Rationale: High-quality plans have higher execution success rates

6. Use Checkpoints (Execution)

Practice: Create checkpoints at major milestones

Rationale: Enables safe rollback and resumption

Common Mistakes

Mistake 1: Skipping Research

Problem: Plans based on assumptions, not proven patterns Fix: Always research unfamiliar domains (use multi-ai-research)

Mistake 2: Tasks Too Big

Problem: Tasks with >15 subtasks, vague descriptions Fix: Decompose until 8-15 tasks per level, atomic at bottom

Mistake 3: Missing Dependencies

Problem: Parallel execution fails due to undocumented dependencies Fix: Explicitly map all dependencies, even obvious ones

Mistake 4: Vague Success Criteria

Problem: "Feature works", "Looks good" - not measurable Fix: SMART criteria - specific, measurable, testable

Mistake 5: No Quality Verification

Problem: Executing low-quality plans leads to failures Fix: Always score quality, require ≥90 before execution

Mistake 6: Unsafe Rollback

Problem: Using git reset or destructive operations Fix: Git tags + patch bundles, safe checkpoint procedures

Quick Reference

The 6-Step Planning Workflow

Step	Purpose	Time	Key Output
1	Objective Analysis & Research	30-120m	Approach, scope, research findings
2	Hierarchical Decomposition	45-90m	Complete task tree
3	Dependency Mapping	30-45m	Dependency graph, parallel groups
4	Verification Planning	45-75m	Success criteria, verification methods
5	Quality Verification	30-45m	Quality score (≥90), approval
6	Finalization	20-30m	plan.json, PLAN.md, COORDINATION.md

Total Time: 3.5-6.5 hours (varies by complexity)

Plan Quality Rubric

Score	Meaning	Action
90-100	Excellent	✅ Approve and execute
80-89	Good	⚠️ Minor revisions recommended
70-79	Needs Work	❌ Major revisions required
<70	Poor	❌ Re-plan from scratch

Task Granularity Guide

Level	Subtasks	Example
Huge	15-25	"Build complete authentication system"
Large	8-15	"Implement OAuth integration"
Medium	3-8	"Add token management"
Small	1-3	"Create token generation function"
Atomic	1	"Add userId to JWT payload"

Schemas

plan.schema.json: Overall plan structure
task.schema.json: Individual task definition
checkpoint.schema.json: Checkpoint metadata

All plans must validate against schemas before execution.

multi-ai-planning creates rigorous, agent-executable plans with hierarchical decomposition, dependency mapping, verification-first approach, and quality gates - ensuring reliable multi-agent execution.

For detailed patterns, see references/. For examples, see examples/.