eval-systematic-optimization
SKILL.md
eval-systematic-optimization
Run baseline evaluation and failure clustering for foundation-suite.
Requirements
- Go toolchain available (
goin PATH). - Repo root as working directory (or pass
cwd).
Constraints
- Baseline command timeout: 600s.
- Default baseline output path:
/tmp/foundation-suite-<tag>-baseline. analyzerequires a valid JSON result file path.- Focus is conflict-family optimization, not single-case overfitting.
Usage
# Run baseline
python3 skills/eval-systematic-optimization/run.py '{"action":"baseline","tag":"r12"}'
# Analyze failures
python3 skills/eval-systematic-optimization/run.py '{"action":"analyze","result_file":"/tmp/foundation-suite-r12-baseline/foundation_suite_cases.json"}'
Weekly Installs
9
Repository
cklxx/elephant.aiGitHub Stars
8
First Seen
13 days ago
Security Audits
Installed on
gemini-cli9
opencode9
codebuddy9
github-copilot9
codex9
kimi-cli9