examples-auto-run
What it does
- Runs uv run examples/run_examples.py with:
  - EXAMPLES_INTERACTIVE_MODE=auto (auto-input/auto-approve).
  - Per-example logs under .tmp/examples-start-logs/.
  - Main summary log path passed via --main-log (also under .tmp/examples-start-logs/).
  - Generates a rerun list of failures at .tmp/examples-rerun.txt when --write-rerun is set.
- Provides start/stop/status/logs/tail/collect/rerun helpers via run.sh.
- Background option keeps the process running with a pidfile; stop cleans it up.
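The pidfile pattern mentioned above can be sketched roughly as follows. This is an illustration only, not the actual contents of run.sh: the PIDFILE location and the function names start_bg, is_running, and stop_bg are assumptions made for this example.

```shell
# Hypothetical sketch of pidfile-based job control; NOT the real run.sh.
# PIDFILE is an assumed location chosen for this example.
PIDFILE="${PIDFILE:-.tmp/examples-auto-run.pid}"

# Start a command in the background and record its PID.
start_bg() {
  mkdir -p "$(dirname "$PIDFILE")"
  "$@" >/dev/null 2>&1 &
  echo "$!" > "$PIDFILE"
}

# Succeed only if the pidfile exists and names a live process.
is_running() {
  [ -f "$PIDFILE" ] && kill -0 "$(cat "$PIDFILE")" 2>/dev/null
}

# Terminate the recorded process (if still alive) and remove the pidfile.
stop_bg() {
  if is_running; then
    kill "$(cat "$PIDFILE")"
  fi
  rm -f "$PIDFILE"
}
```

Removing the pidfile even when the process has already exited keeps a stale file from making a later status check report a job that is no longer there.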
Usage
# Start (auto mode; interactive included by default)
.agents/skills/examples-auto-run/scripts/run.sh start [extra args to run_examples.py]
# Examples:
.agents/skills/examples-auto-run/scripts/run.sh start --filter basic
.agents/skills/examples-auto-run/scripts/run.sh start --include-server --include-audio
# Check status
.agents/skills/examples-auto-run/scripts/run.sh status
# Stop running job
.agents/skills/examples-auto-run/scripts/run.sh stop
# List logs
.agents/skills/examples-auto-run/scripts/run.sh logs
# Tail latest log (or specify one)
.agents/skills/examples-auto-run/scripts/run.sh tail
.agents/skills/examples-auto-run/scripts/run.sh tail main_20260113-123000.log
# Collect rerun list from a main log (defaults to latest main_*.log)
.agents/skills/examples-auto-run/scripts/run.sh collect
# Rerun only failed entries from rerun file (auto mode)
.agents/skills/examples-auto-run/scripts/run.sh rerun
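Because every command above repeats the same long script path, a small wrapper can shorten interactive use. The names RUN_SH and examples_run are invented for this sketch and are not part of the skill.

```shell
# Hypothetical convenience wrapper; RUN_SH and examples_run are names
# made up for this example, not something the skill provides.
RUN_SH="${RUN_SH:-.agents/skills/examples-auto-run/scripts/run.sh}"

# Forward all arguments unchanged to the skill's run.sh.
examples_run() {
  "$RUN_SH" "$@"
}
```

With this in place, examples_run start --filter basic and examples_run status are equivalent to the full-path forms shown above.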
Defaults (overridable via env)
- EXAMPLES_INTERACTIVE_MODE=auto
- EXAMPLES_INCLUDE_INTERACTIVE=1
- EXAMPLES_INCLUDE_SERVER=0
- EXAMPLES_INCLUDE_AUDIO=0
- EXAMPLES_INCLUDE_EXTERNAL=0
- Auto-approvals in auto mode: APPLY_PATCH_AUTO_APPROVE=1, SHELL_AUTO_APPROVE=1, AUTO_APPROVE_MCP=1
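One common way to implement "defaults overridable via env" in a shell script is the ${VAR:=default} expansion, which assigns a value only when the variable is unset or empty. The sketch below applies the defaults listed above in that style; apply_defaults is a hypothetical name, and the real run.sh may set these values differently.

```shell
# Sketch of applying the documented defaults without clobbering values
# the caller already set. apply_defaults is a hypothetical helper name.
apply_defaults() {
  : "${EXAMPLES_INTERACTIVE_MODE:=auto}"
  : "${EXAMPLES_INCLUDE_INTERACTIVE:=1}"
  : "${EXAMPLES_INCLUDE_SERVER:=0}"
  : "${EXAMPLES_INCLUDE_AUDIO:=0}"
  : "${EXAMPLES_INCLUDE_EXTERNAL:=0}"
  export EXAMPLES_INTERACTIVE_MODE EXAMPLES_INCLUDE_INTERACTIVE \
         EXAMPLES_INCLUDE_SERVER EXAMPLES_INCLUDE_AUDIO EXAMPLES_INCLUDE_EXTERNAL

  # Auto-approvals are defaulted on only when running in auto mode.
  if [ "$EXAMPLES_INTERACTIVE_MODE" = "auto" ]; then
    : "${APPLY_PATCH_AUTO_APPROVE:=1}"
    : "${SHELL_AUTO_APPROVE:=1}"
    : "${AUTO_APPROVE_MCP:=1}"
    export APPLY_PATCH_AUTO_APPROVE SHELL_AUTO_APPROVE AUTO_APPROVE_MCP
  fi
}
```

From the caller's side, an override is just an environment assignment in front of the command, for example EXAMPLES_INCLUDE_SERVER=1 .agents/skills/examples-auto-run/scripts/run.sh start.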
Log locations
- Main logs: .tmp/examples-start-logs/main_*.log
- Per-example logs (from run_examples.py): .tmp/examples-start-logs/<module_path>.log
- Rerun list: .tmp/examples-rerun.txt
- Stdout logs: .tmp/examples-start-logs/stdout_*.log
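When working with these paths by hand, it can help to resolve the newest main log yourself. The sketch below assumes main logs carry a sortable timestamp in the file name, as in the main_20260113-123000.log example under Usage, so the lexically last match is the latest. latest_main_log is a hypothetical helper, not part of run.sh.

```shell
# Hypothetical helper: print the newest main_*.log in a directory.
# Assumes timestamped names (main_YYYYMMDD-HHMMSS.log), so the glob's
# sorted order is also chronological order.
latest_main_log() {
  log_dir="${1:-.tmp/examples-start-logs}"
  latest=""
  for candidate in "$log_dir"/main_*.log; do
    [ -e "$candidate" ] || continue   # the glob matched nothing
    latest="$candidate"
  done
  # Return non-zero when no main log exists.
  [ -n "$latest" ] && printf '%s\n' "$latest"
}
```

For example, less "$(latest_main_log)" opens the most recent summary log from the default directory.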
Notes
- The runner delegates to uv run examples/run_examples.py, which already writes per-example logs and supports --collect, --rerun-file, and --print-auto-skip.
- start uses --write-rerun so failures are captured automatically.
- If .tmp/examples-rerun.txt exists and is non-empty, invoking the skill with no args runs rerun by default.
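The no-argument behavior described in the last note amounts to an "exists and is non-empty" file test, which in shell is [ -s FILE ]. A minimal sketch follows; default_action is a hypothetical name, and falling back to start when the file is missing or empty is an assumption, since the note only states the rerun case.

```shell
# Hypothetical sketch of choosing the default subcommand.
# Prints "rerun" when the rerun list exists and is non-empty;
# the "start" fallback is an assumption made for this example.
default_action() {
  rerun_file="${1:-.tmp/examples-rerun.txt}"
  if [ -s "$rerun_file" ]; then
    echo rerun
  else
    echo start
  fi
}
```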
Behavioral validation (Codex/LLM responsibility)
The runner does not perform any automated behavioral validation. After every foreground start or rerun, Codex must manually validate all exit-0 entries:
- Read the example source (and comments) to infer intended flow, tools used, and expected key outputs.
- Open the matching per-example log under .tmp/examples-start-logs/.
- Confirm the intended actions/results occurred; flag omissions or divergences.
- Do this for all passed examples, not just a sample.
- Report immediately after the run with concise citations to the exact log lines that justify the validation.
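For the citation step, a small helper that prints matches as path:line:text makes it easy to quote exact log lines in the report. This is only a convenience sketch: cite_lines is a hypothetical name, and the search pattern depends entirely on what each example is expected to print.

```shell
# Hypothetical helper: print matching log lines as path:line:text.
cite_lines() {
  pattern="$1"
  log_file="$2"
  # Passing /dev/null as a second file makes grep prefix each hit
  # with the file name even though only one real file is searched.
  grep -n -- "$pattern" "$log_file" /dev/null
}
```

A call such as cite_lines 'some expected output' .tmp/examples-start-logs/<module_path>.log (with the pattern and path filled in for the example under review) produces lines that can be pasted directly into the validation report.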
Repository
openai/openai-agents-python
First Seen
Jan 24, 2026