continuous-skill-optimizer
SKILL.md
Continuous Skill Optimizer
Discovery Phase
Ask for:
- Target skill path to optimize.
- Eval set path (or whether to use a generated default set).
- Loop budget (
max-iterations) and aggressiveness (runs-per-query,holdout). - Improvement backend (
claudeorcopilot) and model choice.
Recap
Confirm:
- Target skill path
- Eval set path
- Engines/models
- Iteration budget
- Whether auto-apply of winning description is enabled
Execution
This skill implements autoresearch-style optimization for skill trigger quality. Use a strict loop:
- Run one baseline evaluation and record it.
- Change one dominant variable per iteration (usually description wording scope/specificity/exclusions).
- Classify each iteration as
keep,discard, orcrash. - If an iteration crashes/timeouts, log failure and continue from last known good description.
- Keep a persistent ledger in
evals/results.tsv.
Usage:
python3 ${plugins}/skills/continuous-skill-optimizer/scripts/execute.py --help
Baseline Validation
Before optimizing behavior, run one baseline evaluation and log it in evals/results.tsv.
Iteration Loop
When iterating, follow a disciplined loop:
- Change one dominant variable per iteration.
- Re-run evaluations.
- Mark the attempt as
keepordiscard. - If the run crashes or times out, log the failure and continue from the last known good state.
- Never overwrite the source skill unless explicitly configured to auto-apply winners.
Output
Always conclude execution with a Source Transparency Declaration explicitly listing what was queried to guarantee user trust: Sources Checked: [list] Sources Unavailable: [list]
Next Actions
- Use
./scripts/benchmarking/run_loop.py --results-dir evals/experimentsfor repeatable improvement loops. - Use
./scripts/eval-viewer/generate_review.pyfor visual review of iteration outcomes. - Suggest the user run
audit-pluginto verify the generated artifacts.
Weekly Installs
2
Repository
richfrem/agent-…s-skillsGitHub Stars
1
First Seen
1 day ago
Security Audits
Installed on
amp2
cline2
opencode2
cursor2
kimi-cli2
codex2