bench

SKILL.md

/bench

Builds Java and runs the full benchmark suite, then analyzes results.

Execution Steps

  1. Run ./scripts/bench.sh
  2. Extract metrics from tests/benchmark/prediction/opendataloader/evaluation.json using:
    jq '{summary, metrics, table_detection, speed}' tests/benchmark/prediction/opendataloader/evaluation.json
    
  3. Output summary:
    • NID (reading order)
    • TEDS (table structure)
    • MHS (heading structure)
    • Table Detection F1/Precision/Recall
    • Speed (s/doc, total time)
  4. Compare with thresholds in tests/benchmark/thresholds.json
  5. Warn if regression detected

Options

  • /bench --doc-id 01030000000189 - Run for a specific document only

Notes

  • Benchmark won't run if Java build fails
  • First run installs Python dependencies via uv sync
Weekly Installs
7
GitHub Stars
2.0K
First Seen
9 days ago
Installed on
codex7
gemini-cli6
opencode6
antigravity6
openhands6
github-copilot6