council-review
Council Review Skill
Domain knowledge for orchestrating multi-agent review councils with structured deliberation protocols.
Use For
- Running
/cc-councilreviews with appropriate protocol selection - Configuring council members, weights, and voting thresholds
- Understanding when to use each deliberation protocol
- Interpreting council scores and making go/no-go decisions
- Setting up auto-fix pipelines with confidence thresholds
- Integrating council reviews into CI/CD pipelines
Protocol Selection Guide
Decision Tree
What are you reviewing?
├── Security-sensitive code (auth, payments, secrets)
│ └── Use: red-blue-team --preset security
├── Architecture or design decision
│ └── Use: six-thinking-hats --preset architecture
├── Small PR (<100 lines)
│ └── Use: rapid-fire --preset quick
├── Large PR (>500 lines, multi-file)
│ └── Use: blackboard --preset full
├── Contentious change (team disagreement)
│ └── Use: delphi --preset standard
└── Regular code change
└── Use: expert-panel --preset standard
Protocol Comparison
| Protocol | Rounds | Agent Interaction | Token Cost | Quality | Speed |
|---|---|---|---|---|---|
| rapid-fire | 1 | None | Low | Good | Fast |
| expert-panel | 1-2 | After analysis | Medium | High | Medium |
| blackboard | Async | Shared space | Medium | High | Medium |
| red-blue-team | 2 | Adversarial | High | Very High | Slow |
| six-thinking-hats | 6 views | Structured | High | Very High | Slow |
| delphi | 2-3 | Anonymous | Highest | Highest | Slowest |
Orchestration Best Practices
Fan-Out / Fan-In
- Always spawn all agents in a single message for true parallelism
- Each agent gets scoped context — only the files/info they need
- Use
run_in_background: falseso results come back synchronously - Handle partial failures: if 3/4 agents respond, proceed with 3
Context Scoping
Giving each agent the minimum viable context:
- Reduces token cost by 40-60%
- Improves finding quality (less noise to filter)
- Prevents agents from commenting outside their specialty
Weight Calibration
Default weights reflect review importance:
code-reviewer: 1.0 (always relevant)
security-reviewer: 0.9 (high impact, veto power)
architecture-reviewer: 0.9 (structural decisions matter)
test-strategist: 0.8 (coverage critical for confidence)
performance-analyst: 0.7 (important but often subjective)
accessibility-reviewer: 0.6 (important for frontend)
api-reviewer: 0.6 (important for API changes)
dependency-auditor: 0.6 (important for supply chain)
docs-reviewer: 0.5 (lower weight, rarely blocks)
Veto Power
- Only
security-reviewerandsecrets-scannerhave default veto - Veto triggers on: critical finding + confidence >= 0.8
- Veto overrides weighted voting — always results in
changes-requested - Rationale: security issues must never be approved by majority vote
Consensus Detection
When 2+ agents flag the same file+line range:
- Boost confidence by 1.2x
- Mark as "consensus" in report (stronger signal)
- These findings are almost always valid
Conflict Resolution
When agents disagree on severity:
- Lead agent (if designated) has tie-breaking authority
- Otherwise: higher-weight agent's assessment wins
- Always report the conflict with both perspectives
Scoring System
Scope Independence
Each scope (security, quality, performance, etc.) is scored independently on a 0-100 scale. This prevents a strong quality score from masking a weak security score.
Deduction Tables
Findings map to deductions via category lookup tables. The deduction is multiplied by the finding's confidence score, so low-confidence findings have proportionally less impact.
Scoring Modes
Weighted (default): Scope scores are combined using configurable weights. Good for overall quality assessment where trade-offs are acceptable.
Pass-fail: Each scope must independently meet its threshold. Good for compliance and gating — no scope can compensate for another.
Highest-concern: Overall score equals the weakest scope. Most conservative mode — forces attention to the weakest area.
Auto-Fix Guidelines
When to Auto-Fix
- Formatting and style issues (confidence typically 0.95+)
- Import organization (high confidence, mechanical)
- Type annotations (when TypeScript can infer)
- Simple null checks (optional chaining additions)
When NOT to Auto-Fix
- Business logic changes (too context-dependent)
- Architecture refactoring (requires human judgment)
- Test modifications (risk of masking real failures)
- Migration files (must be append-only)
- Generated files (will be overwritten)
Safety Checks
After auto-fix:
- Run
--fix-dry-runfirst to preview changes - Auto-fix respects
skip_patternsin config - Post-fix validation: lint + typecheck
- If validation fails: revert the fix, report as "fix-failed"
CI/CD Integration
Pre-Merge Gate
# In CI pipeline:
/cc-council . --preset pre-merge --format json --changed-only > council-result.json
# Check exit code: 0=approved, 1=changes-requested, 2=error
Quality Gate Thresholds
Recommended thresholds by environment:
Development: --threshold 0.5 (permissive, speed over safety)
Staging: --threshold 0.7 (balanced)
Production: --threshold 0.85 (strict)
Compliance: --threshold 0.9 --scoring pass-fail (audit-grade)
State Machine & Resume
The council saves state at each phase boundary. If a phase fails (e.g., network timeout during fan-out), you can resume from the last checkpoint:
/cc-council --resume <session-id>
State includes:
- Council plan (members, protocol, scopes)
- Raw agent outputs (findings + votes)
- Deliberation results (consensus, conflicts)
- Score calculations
This means you never lose work from a partially completed council.
More from lobbi-docs/claude
vision-multimodal
Vision and multimodal capabilities for Claude including image analysis, PDF processing, and document understanding. Activate for image input, base64 encoding, multiple images, and visual analysis.
243design-system
Apply and manage the AI-powered design system with 50+ curated styles
126complex-reasoning
Multi-step reasoning patterns and frameworks for systematic problem solving. Activate for Chain-of-Thought, Tree-of-Thought, hypothesis-driven debugging, and structured analytical approaches that leverage extended thinking.
105gcp
Google Cloud Platform services including GKE, Cloud Run, Cloud Storage, BigQuery, and Pub/Sub. Activate for GCP infrastructure, Google Cloud deployment, and GCP integration.
73kanban
Kanban methodology including boards, WIP limits, flow metrics, and continuous delivery. Activate for Kanban boards, workflow visualization, and lean project management.
63debugging
Debugging techniques for Python, JavaScript, and distributed systems. Activate for troubleshooting, error analysis, log investigation, and performance debugging. Includes extended thinking integration for complex debugging scenarios.
59