autoresearch
SKILL.md
Autoresearch Skill
Autoresearch means one bounded target, one external metric, one keep/revert loop, one append-only log.
When to use
- prompt optimization
- worker routing
- tool behavior
- evaluation harnesses
- any repo-local target that can be measured honestly
Process
- Read
atris/experiments/<slug>/program.md - Confirm the target is bounded
- Run the baseline with
measure.py - Apply one candidate change
- Rerun the metric
- Keep only if the score improves
- Write the outcome to
results.tsv - Revert losses
Rules
- external metric only
- no unlogged keeps
- no broad refactors inside an experiment
- one experiment pack = one target
- if variance exists, define the keep margin first
Commands
atris experiments init <slug>
atris experiments validate
atris experiments benchmark
Good output
- short
program.md - honest
measure.py - deterministic
loop.py - append-only
results.tsv
Bad output
- "felt better"
- changed three things at once
- kept a patch without a measured win
- no reset/revert path
Weekly Installs
1
Source
www.modelscope.…researchFirst Seen
4 days ago
Installed on
amp1
cline1
opencode1
cursor1
kimi-cli1
warp1