agent-workflow
Agent Development Workflow
Complete Standard Operating Procedure (SOP) for building production-ready goal-driven agents.
Overview
This workflow orchestrates specialized skills to take you from initial concept to production-ready agent:
- Understand Concepts →
/building-agents-core(optional) - Build Structure →
/building-agents-construction - Optimize Design →
/building-agents-patterns(optional) - Setup Credentials →
/setup-credentials(if agent uses tools requiring API keys) - Test & Validate →
/testing-agent
When to Use This Workflow
Use this meta-skill when:
- Starting a new agent from scratch
- Unclear which skill to use first
- Need end-to-end guidance for agent development
- Want consistent, repeatable agent builds
Skip this workflow if:
- You only need to test an existing agent → use
/testing-agentdirectly - You know exactly which phase you're in → use specific skill directly
Quick Decision Tree
"Need to understand agent concepts" → building-agents-core
"Build a new agent" → building-agents-construction
"Optimize my agent design" → building-agents-patterns
"Set up API keys for my agent" → setup-credentials
"Test my agent" → testing-agent
"Not sure what I need" → Read phases below, then decide
"Agent has structure but needs implementation" → See agent directory STATUS.md
Phase 0: Understand Concepts (Optional)
Duration: 5-10 minutes
Skill: /building-agents-core
Input: Questions about agent architecture
When to Use
- First time building an agent
- Need to understand node types, edges, goals
- Want to validate tool availability
- Learning about pause/resume architecture
What This Phase Provides
- Architecture overview (Python packages, not JSON)
- Core concepts (Goal, Node, Edge, Pause/Resume)
- Tool discovery and validation procedures
- Workflow overview
Skip this phase if you already understand agent fundamentals.
Phase 1: Build Agent Structure
Duration: 15-30 minutes
Skill: /building-agents-construction
Input: User requirements ("Build an agent that...")
What This Phase Does
Creates the complete agent architecture:
- Package structure (
exports/agent_name/) - Goal with success criteria and constraints
- Workflow graph (nodes and edges)
- Node specifications
- CLI interface
- Documentation
Process
- Create package - Directory structure with skeleton files
- Define goal - Success criteria and constraints written to agent.py
- Design nodes - Each node approved and written incrementally
- Connect edges - Workflow graph with conditional routing
- Finalize - Agent class, exports, and documentation
Outputs
- ✅
exports/agent_name/package created - ✅ Goal defined in agent.py
- ✅ 3-5 success criteria defined
- ✅ 1-5 constraints defined
- ✅ 5-10 nodes specified in nodes/init.py
- ✅ 8-15 edges connecting workflow
- ✅ Validated structure (passes
python -m agent_name validate) - ✅ README.md with usage instructions
- ✅ CLI commands (info, validate, run, shell)
Success Criteria
You're ready for Phase 2 when:
- Agent structure validates without errors
- All nodes and edges are defined
- CLI commands work (info, validate)
- You see: "Agent complete: exports/agent_name/"
Common Outputs
The building-agents-construction skill produces:
exports/agent_name/
├── __init__.py (package exports)
├── __main__.py (CLI interface)
├── agent.py (goal, graph, agent class)
├── nodes/__init__.py (node specifications)
├── config.py (configuration)
├── implementations.py (may be created for Python functions)
└── README.md (documentation)
Next Steps
If structure complete and validated:
→ Check exports/agent_name/STATUS.md or IMPLEMENTATION_GUIDE.md
→ These files explain implementation options
→ You may need to add Python functions or MCP tools (not covered by current skills)
If want to optimize design: → Proceed to Phase 1.5 (building-agents-patterns)
If ready to test: → Proceed to Phase 2
Phase 1.5: Optimize Design (Optional)
Duration: 10-15 minutes
Skill: /building-agents-patterns
Input: Completed agent structure
When to Use
- Want to add pause/resume functionality
- Need error handling patterns
- Want to optimize performance
- Need examples of complex routing
- Want best practices guidance
What This Phase Provides
- Practical examples and patterns
- Pause/resume architecture
- Error handling strategies
- Anti-patterns to avoid
- Performance optimization techniques
Skip this phase if your agent design is straightforward.
Phase 2: Test & Validate
Duration: 20-40 minutes
Skill: /testing-agent
Input: Working agent from Phase 1
What This Phase Does
Creates comprehensive test suite:
- Constraint tests (verify hard requirements)
- Success criteria tests (measure goal achievement)
- Edge case tests (handle failures gracefully)
- Integration tests (end-to-end workflows)
Process
- Analyze agent - Read goal, constraints, success criteria
- Generate tests - Create pytest files in
exports/agent_name/tests/ - User approval - Review and approve each test
- Run evaluation - Execute tests and collect results
- Debug failures - Identify and fix issues
- Iterate - Repeat until all tests pass
Outputs
- ✅ Test files in
exports/agent_name/tests/ - ✅ Test report with pass/fail metrics
- ✅ Coverage of all success criteria
- ✅ Coverage of all constraints
- ✅ Edge case handling verified
Success Criteria
You're done when:
- All tests pass
- All success criteria validated
- All constraints verified
- Agent handles edge cases
- Test coverage is comprehensive
Next Steps
Agent ready for:
- Production deployment
- Integration into larger systems
- Documentation and handoff
- Continuous monitoring
Phase Transitions
From Phase 1 to Phase 2
Trigger signals:
- "Agent complete: exports/..."
- Structure validation passes
- README indicates implementation complete
Before proceeding:
- Verify agent can be imported:
from exports.agent_name import default_agent - Check if implementation is needed (see STATUS.md or IMPLEMENTATION_GUIDE.md)
- Confirm agent executes without import errors
Skipping Phases
When to skip Phase 1:
- Agent structure already exists
- Only need to add tests
- Modifying existing agent
When to skip Phase 2:
- Prototyping or exploring
- Agent not production-bound
- Manual testing sufficient
Common Patterns
Pattern 1: Complete New Build (Simple)
User: "Build an agent that monitors files"
→ Use /building-agents-construction
→ Agent structure created
→ Use /testing-agent
→ Tests created and passing
→ Done: Production-ready agent
Pattern 1b: Complete New Build (With Learning)
User: "Build an agent (first time)"
→ Use /building-agents-core (understand concepts)
→ Use /building-agents-construction (build structure)
→ Use /building-agents-patterns (optimize design)
→ Use /testing-agent (validate)
→ Done: Production-ready agent
Pattern 2: Test Existing Agent
User: "Test my agent at exports/my_agent"
→ Skip Phase 1
→ Use /testing-agent directly
→ Tests created
→ Done: Validated agent
Pattern 3: Iterative Development
User: "Build an agent"
→ Use /building-agents-construction (Phase 1)
→ Implementation needed (see STATUS.md)
→ [User implements functions]
→ Use /testing-agent (Phase 2)
→ Tests reveal bugs
→ [Fix bugs manually]
→ Re-run tests
→ Done: Working agent
Pattern 4: Complex Agent with Patterns
User: "Build an agent with multi-turn conversations"
→ Use /building-agents-core (learn pause/resume)
→ Use /building-agents-construction (build structure)
→ Use /building-agents-patterns (implement pause/resume pattern)
→ Use /testing-agent (validate conversation flows)
→ Done: Complex conversational agent
Skill Dependencies
agent-workflow (meta-skill)
│
├── building-agents-core (foundational)
│ ├── Architecture concepts
│ ├── Node/Edge/Goal definitions
│ ├── Tool discovery procedures
│ └── Workflow overview
│
├── building-agents-construction (procedural)
│ ├── Creates package structure
│ ├── Defines goal
│ ├── Adds nodes incrementally
│ ├── Connects edges
│ ├── Finalizes agent class
│ └── Requires: building-agents-core
│
├── building-agents-patterns (reference)
│ ├── Best practices
│ ├── Pause/resume patterns
│ ├── Error handling
│ ├── Anti-patterns
│ └── Performance optimization
│
└── testing-agent
├── Reads agent goal
├── Generates tests
├── Runs evaluation
└── Reports results
Troubleshooting
"Agent structure won't validate"
- Check node IDs match between nodes/init.py and agent.py
- Verify all edges reference valid node IDs
- Ensure entry_node exists in nodes list
- Run:
PYTHONPATH=core:exports python -m agent_name validate
"Agent has structure but won't run"
- Check for STATUS.md or IMPLEMENTATION_GUIDE.md in agent directory
- Implementation may be needed (Python functions or MCP tools)
- This is expected - building-agents-construction creates structure, not implementation
- See implementation guide for completion options
"Tests are failing"
- Review test output for specific failures
- Check agent goal and success criteria
- Verify constraints are met
- Use
/testing-agentto debug and iterate - Fix agent code and re-run tests
"Not sure which phase I'm in"
Run these checks:
# Check if agent structure exists
ls exports/my_agent/agent.py
# Check if it validates
PYTHONPATH=core:exports python -m my_agent validate
# Check if tests exist
ls exports/my_agent/tests/
# If structure exists and validates → Phase 2 (testing)
# If structure doesn't exist → Phase 1 (building)
# If tests exist but failing → Debug phase
Best Practices
For Phase 1 (Building)
- Start with clear requirements - Know what the agent should do
- Define success criteria early - Measurable goals drive design
- Keep nodes focused - One responsibility per node
- Use descriptive names - Node IDs should explain purpose
- Validate incrementally - Check structure after each major addition
For Phase 2 (Testing)
- Test constraints first - Hard requirements must pass
- Mock external dependencies - Use mock mode for LLMs/APIs
- Cover edge cases - Test failures, not just success paths
- Iterate quickly - Fix one test at a time
- Document test patterns - Future tests follow same structure
General Workflow
- Use version control - Git commit after each phase
- Document decisions - Update README with changes
- Keep iterations small - Build → Test → Fix → Repeat
- Preserve working states - Tag successful iterations
- Learn from failures - Failed tests reveal design issues
Exit Criteria
You're done with the workflow when:
✅ Agent structure validates ✅ All tests pass ✅ Success criteria met ✅ Constraints verified ✅ Documentation complete ✅ Agent ready for deployment
Additional Resources
- building-agents-core: See
.claude/skills/building-agents-core/SKILL.md - building-agents-construction: See
.claude/skills/building-agents-construction/SKILL.md - building-agents-patterns: See
.claude/skills/building-agents-patterns/SKILL.md - testing-agent: See
.claude/skills/testing-agent/SKILL.md - Agent framework docs: See
core/README.md - Example agents: See
exports/directory
Summary
This workflow provides a proven path from concept to production-ready agent:
- Learn with
/building-agents-core→ Understand fundamentals (optional) - Build with
/building-agents-construction→ Get validated structure - Optimize with
/building-agents-patterns→ Apply best practices (optional) - Test with
/testing-agent→ Get verified functionality
The workflow is flexible - skip phases as needed, iterate freely, and adapt to your specific requirements. The goal is production-ready agents built with consistent, repeatable processes.
Skill Selection Guide
Choose building-agents-core when:
- First time building agents
- Need to understand architecture
- Validating tool availability
- Learning about node types and edges
Choose building-agents-construction when:
- Actually building an agent
- Have clear requirements
- Ready to write code
- Want step-by-step guidance
Choose building-agents-patterns when:
- Agent structure complete
- Need advanced patterns
- Implementing pause/resume
- Optimizing performance
- Want best practices
Choose testing-agent when:
- Agent structure complete
- Ready to validate functionality
- Need comprehensive test coverage
- Debugging agent behavior