Paper Evidence Gap Miner

Find the cheapest way to complete paper evidence. This skill starts from paper claims and writing gaps, searches existing results first, and only routes to new experiments when existing CSVs and derived assets cannot support the claim.

Use this skill when:

writing reveals that a claim lacks enough evidence
a paper section needs a missing table, figure, ablation, slice, variance estimate, or baseline comparison
existing CSV results may already contain the needed evidence
a claim needs result completion rather than new algorithm or method design
the user wants to avoid unnecessary compute before submission
reviewer simulation or consistency editing finds an evidence gap

Do not use this skill to generate final paper figures or tables. Use paper-result-asset-builder after this skill identifies reusable CSV evidence. Do not use this skill to design a full new experiment matrix unless existing results are insufficient; route to experiment-design-planner only as the last step.

Skill Directory Layout

<installed-skill-dir>/
├── SKILL.md
├── references/
│   ├── gap-triage.md
│   └── result-reuse-patterns.md
└── templates/
    └── evidence-completion-plan.md

Progressive Loading

Always read references/gap-triage.md and references/result-reuse-patterns.md.
Use templates/evidence-completion-plan.md when creating paper/.agent/evidence-completion-plan.md.
Read local paper/.agent/writing-contract.md, paper/.agent/paper-evidence-board.md, paper/.agent/writing-memory/, paper/.agent/result-inventory.md, paper/.agent/result-asset-provenance.md, paper/.agent/experiment-story-plan.md, and paper/.agent/provisional-results.md when present.
Read paper draft files and result CSV inventories before recommending new runs.
Pair with paper-result-asset-builder when existing CSVs can produce a needed table or figure.

Core Principles

Missing evidence is not automatically a missing experiment.
First ask whether the result already exists, can be re-aggregated, can be sliced differently, or can be turned into an appendix asset.
Treat "补实验" as "补结果" until proven otherwise.
Prefer the minimum evidence that defends the paper claim and reviewer risk.
Do not propose expensive compute before checking existing CSVs, logs, reports, tables, figures, and paper evidence memory.
If existing evidence only supports a narrower claim, recommend claim narrowing before new compute.
Record whether a gap is solved by reuse, derivation, writing, diagnosis, or new experiment.

Step 1 - Locate the Writing Gap

Extract the gap from:

paper/.agent/writing-contract.md
paper/.agent/paper-evidence-board.md
paper/.agent/consistency-report.md
reviewer simulation output
draft sections such as abstract, introduction, results, limitations, and conclusion
user-stated claim or missing evidence

Write:

- Claim:
- Paper location:
- Current wording:
- Evidence expected:
- Current evidence:
- Why current evidence is insufficient:
- Reviewer risk:

Step 2 - Search Existing Result Sources

Search in this order:

paper/.agent/result-inventory.md
paper/.agent/result-asset-provenance.md
paper/.agent/paper-evidence-board.md
code/docs/results/**/*.csv
code/docs/runs/**/*.csv
code/docs/reports/**/*.csv
code/outputs/**/*.csv, results/**/*.csv, outputs/**/*.csv
existing tables/*.tex, figures/*.tex, rendered figures, and experiment reports

If no result inventory exists, route to paper-result-asset-builder inventory mode or run its inventory script.

Step 3 - Classify the Gap

Read references/gap-triage.md.

Classify each gap as:

already-supported: evidence exists; prose or board needs update
supportable-from-existing-csv: CSV contains the needed result
needs-reaggregation: existing runs need a new aggregation, uncertainty estimate, or rounding rule
needs-slice: existing results need a dataset/task/group/seed/baseline slice
needs-asset: evidence exists but no paper-facing table/figure exists
needs-diagnosis: existing result is ambiguous, unstable, suspicious, or contradictory
needs-claim-narrowing: evidence supports a smaller claim
needs-new-compute: no existing evidence can answer the claim
cut-or-defer: claim is not worth supporting

Step 4 - Mine Reusable Evidence

Read references/result-reuse-patterns.md.

Try reuse paths before new compute:

build a main or appendix table from existing CSVs
add variance or confidence intervals from existing seeds
create a slice analysis from existing columns
compare against a baseline already present in CSVs
derive a trend, scaling, or sensitivity figure from stored runs
convert a diagnostic result into a limitation or scope claim
update result prose to use evidence that is already present

For each reuse path, identify exact source files and transformation rules.

Step 5 - Create Evidence Completion Plan

Create or update:

paper/.agent/evidence-completion-plan.md

If there is no paper/ directory and the current directory is the paper repo, save to:

.agent/evidence-completion-plan.md

Use templates/evidence-completion-plan.md.

Every gap needs:

status
reusable result source, if any
table/figure asset to build
prose or claim edit
next skill
whether new compute is required

Step 6 - Route the Minimal Next Action

Route by classification:

already-supported -> paper-evidence-board or paper-writing-assistant
writing-state-update -> paper-writing-memory-manager when the gap changes section status, stale locations, or open writing threads
supportable-from-existing-csv, needs-reaggregation, needs-slice, needs-asset -> paper-result-asset-builder
needs-diagnosis -> result-diagnosis
needs-claim-narrowing -> paper-writing-contract-planner, limitations-scope-writer, or paper-writing-assistant
needs-new-compute -> experiment-design-planner, then baseline-selection-audit and run-experiment if needed
cut-or-defer -> paper-writing-contract-planner and paper-draft-consistency-editor

When project memory exists, update:

memory/provenance-board.md with checked sources, missing source classes, and provisional-result replacement needs
memory/handoff-board.md with a paper-evidence-gap-miner -> paper-result-asset-builder handoff for reusable CSV/report evidence, or a paper-evidence-gap-miner -> experiment-design-planner handoff only after existing sources cannot fill the gap
memory/phase-dashboard.md if the writing phase regresses to evidence production or paper asset building

Final Sanity Check

Before finalizing:

every gap has been checked against existing result sources
no new compute is proposed when CSV reuse can answer the claim
the proposed evidence asset has a clear claim job
claim narrowing is considered when evidence is weaker than wording
new experiments, if needed, are minimal and reviewer-relevant
next skills and output files are explicit

paper-evidence-gap-miner

Paper Evidence Gap Miner

Skill Directory Layout

Progressive Loading

Core Principles

Step 1 - Locate the Writing Gap

Step 2 - Search Existing Result Sources

Step 3 - Classify the Gap

Step 4 - Mine Reusable Evidence

Step 5 - Create Evidence Completion Plan

Step 6 - Route the Minimal Next Action

Final Sanity Check

More from a-green-hand-jack/ml-research-skills

project-init

project-sync

add-git-tag

update-docs

init-latex-project

new-workspace