eval-systematic-optimization
Run baseline evaluation and failure clustering for foundation-suite.
Requirements
- Go toolchain available (`go` in PATH).
- Repo root as working directory (or pass `cwd`).
Constraints
- Baseline command timeout: 600s.
- Default baseline output path: `/tmp/foundation-suite-<tag>-baseline`.
- `analyze` requires a valid JSON result file path.
- Focus is conflict-family optimization, not single-case overfitting.
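The constraints above can be sketched in Python as follows. This is an illustration only, not the skill's actual code: the names `BASELINE_TIMEOUT_S`, `default_baseline_path`, and `is_valid_json_file` are hypothetical. Only the 600s timeout, the path pattern, and the JSON requirement come from the text.

```python
import json
from pathlib import Path

# Hypothetical constant; the value is the baseline timeout stated above.
BASELINE_TIMEOUT_S = 600


def default_baseline_path(tag: str) -> str:
    """Return the default baseline output path for a tag (hypothetical helper)."""
    return f"/tmp/foundation-suite-{tag}-baseline"


def is_valid_json_file(path: str) -> bool:
    """Return True if path is an existing file whose content parses as JSON.

    Assumed pre-check for the analyze action; the real skill may validate differently.
    """
    candidate = Path(path)
    if not candidate.is_file():
        return False
    try:
        json.loads(candidate.read_text(encoding="utf-8"))
    except (OSError, ValueError):
        # json.JSONDecodeError and UnicodeDecodeError are both ValueError subclasses.
        return False
    return True
```

Checking the result file before calling `analyze` surfaces a missing or truncated file early instead of partway through clustering.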
Usage
# Run baseline
python3 skills/eval-systematic-optimization/run.py '{"action":"baseline","tag":"r12"}'
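Hand-quoting the JSON payload in a shell is easy to get wrong. A minimal sketch of building the same command line programmatically is shown here; `build_command` is a hypothetical helper, and the only payload keys used are the `action` and `tag` keys from the command above.

```python
import json
import shlex

# Script path as given in the usage example above.
RUN_SCRIPT = "skills/eval-systematic-optimization/run.py"


def build_command(action: str, **params: str) -> str:
    """Build a shell command line that passes a JSON payload to run.py.

    Hypothetical helper: it only serializes and quotes the payload and does
    not check which keys run.py actually accepts.
    """
    payload = json.dumps({"action": action, **params}, separators=(",", ":"))
    # shlex.quote wraps the payload in single quotes so the shell passes it as one argument.
    return f"python3 {RUN_SCRIPT} {shlex.quote(payload)}"


print(build_command("baseline", tag="r12"))
```

For the `baseline` action with tag `r12`, this prints the same command line as the usage example.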