skill-debugging-assistant
Skill Debugging Assistant
Overview
This skill helps diagnose why skills aren't triggering or performing as expected. It systematically analyzes trigger patterns, parameter issues, prompt conflicts, and structural problems to identify root causes and recommend fixes.
When to Use This Skill
Use this skill when encountering any of these issues:
- Skill doesn't trigger when expected
- Skill triggers incorrectly or at wrong times
- Skill behavior doesn't match description
- Conflicts between skill instructions and system prompts
- Unclear when to load references vs. include in SKILL.md
- Validation errors during packaging
- Skill works inconsistently across similar queries
Diagnostic Workflow
Follow this decision tree to diagnose skill issues:
1. Identify the Problem Type
Skill not triggering? → Go to "Trigger Failure Diagnostics"
Skill triggering incorrectly? → Go to "False Positive Diagnostics"
Skill behavior unexpected? → Go to "Instruction Conflict Diagnostics"
Packaging/validation errors? → Go to "Structure Validation"
General review needed? → Go to "Comprehensive Audit"
2. Trigger Failure Diagnostics
When a skill should trigger but doesn't, analyze in this order:
Step 1: Analyze the description field
- Read the skill's frontmatter
description - Check if description mentions the user's query terms or conceptual triggers
- Verify description includes WHEN to use the skill, not just WHAT it does
- Confirm description is specific enough to differentiate from other skills
Step 2: Check description quality
- Does it include key terms the user would naturally use?
- Does it specify triggers (file types, tasks, scenarios)?
- Is it comprehensive enough for selection among 100+ skills?
- Are the trigger scenarios clear and unambiguous?
Step 3: Review competing skills
- Identify other skills with overlapping descriptions
- Determine if another skill's description better matches the query
- Check if trigger patterns are too similar between skills
Step 4: Test edge cases
- Would the skill trigger for paraphrased versions of the query?
- Does it cover related terminology and synonyms?
- Are there implicit assumptions about when it should trigger?
Common fixes:
- Add specific trigger terms to description
- Include file type indicators (.docx, .pdf, .json)
- Specify task types (create, edit, analyze, debug)
- Add domain indicators (finance, legal, technical)
- Include synonym terms users might naturally use
3. False Positive Diagnostics
When a skill triggers when it shouldn't:
Step 1: Check description over-breadth
- Is the description too general?
- Does it use broad terms that match many queries?
- Are there missing qualifiers or constraints?
Step 2: Review instruction conflicts
- Do instructions apply too broadly within SKILL.md?
- Are there "always" or "never" statements that override context?
- Does the skill assume it should handle something beyond its scope?
Common fixes:
- Narrow description scope with specific qualifiers
- Add exclusion indicators (e.g., "not for X")
- Move broad utility functions to scripts rather than main workflow
- Add conditional logic: "Only when..." or "If and only if..."
4. Instruction Conflict Diagnostics
When skill behavior contradicts expected results:
Step 1: Read SKILL.md completely
- Check for absolute statements (always, never, must, required)
- Identify instructions that might conflict with system prompt
- Look for contradictory instructions within the skill
Step 2: Analyze instruction priority
- Are there competing instructions without clear precedence?
- Do examples contradict written rules?
- Is the desired behavior stated clearly vs. implied?
Step 3: Check progressive disclosure structure
- Is critical information buried in references that weren't loaded?
- Should certain instructions be in SKILL.md instead of references?
- Are references clearly indicated when they're needed?
Common fixes:
- Replace absolutes with conditionals
- Add explicit precedence rules
- Move critical instructions from references to SKILL.md
- Clarify when to load each reference file
- Use "Prefer X, unless Y" instead of "Always X"
5. Structure Validation
Run automated and manual checks:
Automated validation:
python3 scripts/validate_skill.py path/to/skill-folder
Manual checks:
- YAML frontmatter properly formatted (name and description present)
- Name follows kebab-case convention
- Description is comprehensive (>50 chars) and specific
- SKILL.md under 500 lines (split to references if needed)
- References clearly indicated in SKILL.md when needed
- No extraneous files (README.md, CHANGELOG.md, etc.)
- Assets/scripts tested and functional
Common structural issues:
- Missing or malformed YAML frontmatter
- Description too vague or too brief
- SKILL.md exceeds token budget (>500 lines)
- References not mentioned in SKILL.md
- Unused example files not deleted
- Scripts with syntax errors or missing dependencies
6. Comprehensive Audit
For general skill review or quality improvement:
Trigger analysis:
- List 5-10 queries that should trigger this skill
- For each query, verify the description contains relevant terms
- Test paraphrased versions of each query
- Identify gaps in trigger coverage
Instruction clarity:
- Read SKILL.md start to finish
- Flag any ambiguous or conflicting statements
- Verify examples align with instructions
- Check if workflow steps are clear and sequential
Progressive disclosure:
- Ensure SKILL.md contains only essential procedural knowledge
- Verify detailed reference material is in separate files
- Confirm references are clearly indicated when needed
- Check that SKILL.md describes when to load each reference
Quality checklist:
- Description includes specific trigger terms and scenarios
- SKILL.md uses imperative/infinitive form throughout
- No conflicting instructions or absolute statements without qualifiers
- Examples provided for non-obvious operations
- References clearly indicated and purposefully separated
- Scripts tested and functional
- Token budget respected (<500 lines in SKILL.md)
- No extraneous files included
Quick Diagnostics Checklist
For rapid troubleshooting, check these common issues first:
Trigger failures (skill not activating):
- Description mentions user's query terms
- Description includes "when to use" indicators
- Description differentiates from similar skills
- Key terms are specific, not generic
False positives (skill triggers incorrectly):
- Description isn't too broad or generic
- No absolute statements without context limits
- Scope clearly defined with boundaries
Behavior issues (skill does unexpected things):
- No conflicting "always/never" statements
- Critical instructions in SKILL.md, not buried in references
- Examples align with stated rules
- Conditional logic uses "prefer" vs "always"
Validation errors:
- YAML frontmatter properly formatted
- Name uses kebab-case
- Description >50 characters
- No extraneous documentation files
Deep Analysis Methods
Description Analysis Template
For any skill with trigger issues, analyze the description systematically:
**Current description:**
[paste description here]
**Analysis:**
1. Specificity: Does it include concrete trigger terms?
2. Differentiation: How does it differ from similar skills?
3. Completeness: Does it mention when/how to use it?
4. Key terms: List the main terms that would trigger selection
**Test queries:**
[List 5 queries that should trigger this skill]
[For each, note if description contains matching terms]
**Recommended improvements:**
[Specific additions or changes to description]
Instruction Conflict Analysis
For skills with behavioral issues:
- Extract all imperative statements from SKILL.md
- Flag statements using: always, never, must, required, CRITICAL, NEVER
- Check each flagged statement for potential conflicts
- Identify statements that could override user intent
- Recommend conditional rephrasing
Token Budget Analysis
For skills approaching context limits:
- Count lines in SKILL.md (target: <500 lines)
- Identify sections >100 lines that could move to references
- Check for repetitive examples or verbose explanations
- Verify references are actually being used (not duplicated in SKILL.md)
- Recommend splits: what stays in SKILL.md vs. what moves to references
Resources
references/common-issues.md
Detailed examples of common skill problems with before/after fixes. Load this when debugging complex or recurring issues.
scripts/validate_skill.py
Automated validation script that checks:
- YAML frontmatter format and completeness
- File structure and naming conventions
- Description quality metrics
- Common structural problems
Run before packaging any skill to catch issues early.
More from auldsyababua/instructor-workflow
side-hustle-maker
Active coordinator for building AI-powered side-gigs in 2025. Use when users want to build micro-niche products, validate business ideas, create MVPs, or launch profitable side businesses. This skill orchestrates sub-agents to execute market research, product design, business validation, and launch planning. Triggers include "help me build a side hustle," "validate my business idea," "find market opportunities," "build an AI product," or "launch a side-gig.
33travel-planner
This skill should be used whenever users need help planning trips, creating travel itineraries, managing travel budgets, or seeking destination advice. On first use, collects comprehensive travel preferences including budget level, travel style, interests, and dietary restrictions. Generates detailed travel plans with day-by-day itineraries, budget breakdowns, packing checklists, cultural do's and don'ts, and region-specific schedules. Maintains database of preferences and past trips for personalized recommendations.
14csv-data-visualizer
This skill should be used when working with CSV files to create interactive data visualizations, generate statistical plots, analyze data distributions, create dashboards, or perform automatic data profiling. It provides comprehensive tools for exploratory data analysis using Plotly for interactive visualizations.
6finance-manager
Comprehensive personal finance management system for analyzing transaction data, generating insights, creating visualizations, and providing actionable financial recommendations. Use when users need to analyze spending patterns, track budgets, visualize financial data, extract transactions from PDFs, calculate savings rates, identify spending trends, generate financial reports, or receive personalized budget recommendations. Triggers include requests like "analyze my finances", "track my spending", "create a financial report", "extract transactions from PDF", "visualize my budget", "where is my money going", "financial insights", "spending breakdown", or any finance-related analysis tasks.
5startup-validator
Comprehensive startup idea validation and market analysis tool. Use when users need to evaluate a startup idea, assess market fit, analyze competition, validate problem-solution fit, or determine market positioning. Triggers include requests to "validate my startup idea", "analyze market opportunity", "check if there's demand for", "research competition for", "evaluate business idea", or "see if my idea is viable". Provides data-driven analysis using web search, market frameworks, competitive research, and positioning recommendations.
5token-budget-advisor
Proactive token budget assessment and task chunking strategy. Use this skill when queries involve multiple large file uploads, requests for comprehensive multi-document analysis, complex multi-step workflows with heavy research (10+ tool calls), phrases like "complete analysis", "full audit", "thorough review", "deep dive", or tasks combining extensive research with large output artifacts. This skill helps assess token consumption risk early and recommend chunking strategies before beginning work.
5