long-run-harness

Installation

SKILL.md

Long Run Harness

Use this skill when the user wants the agent to keep advancing a complex task until it is actually done, not merely improved.

Default to git-repository work. You may also apply the same operating model to documentation, research, or mixed projects when durable state and autonomous continuation matter.

Do not use this skill for one-shot edits, lightweight planning, or requests where the user wants synchronous oversight at each milestone.

Core Operating Rules

Externalize state. Do not rely on conversation memory for long-running work.
Work one task at a time. Do not keep multiple partial tasks open in parallel.
Require validation before task completion.
Require review before advancing to the next task.
Verify state before trusting it and after each significant state transition.
Use git history as a recovery aid when a repository is available.
Continue automatically unless a real blocker or completion condition exists.

Required State Files

Create and maintain these files in the project root or an agreed task workspace:

task_list.json
progress.md

Read references/state-files.md before creating or editing them.

First Session

If harness state does not exist yet:

Restate the final objective in concrete terms.
Derive explicit acceptance criteria.
Decompose the work into sequential tasks with clear boundaries.
Initialize task_list.json and progress.md.
Select exactly one starting task.
Begin execution only after the state files are coherent.

Use scripts/init_harness.py to create the initial files when possible. Prefer structured --task-json input so each task starts with explicit task-level acceptance criteria and validation steps.

Every Later Session

At the start of every new cycle:

Read task_list.json.
Read progress.md.
Review recent git history when available.
Confirm the current active task or the next unblocked task.
Verify the baseline environment before making new changes.

Read references/recovery.md if the session starts from an interrupted or unclear state. Run scripts/verify_state.py before resuming if there is any doubt about state consistency.

Task Execution Loop

For the single active task:

Refine the task-local plan only as much as needed.
Perform the implementation, writing, or analysis.
Run validation that matches the task's acceptance criteria.
Gather evidence such as test output, generated artifacts, or diffs.
Perform review.
Update task_list.json and progress.md.
Commit focused progress when the repository is stable.
Select the next unblocked task and continue.

Read references/workflow.md for the complete lifecycle. When marking a task done, record explicit validation results first. Do not advance merely because the implementation looks complete.

Review Gate

Default to self-review first. Escalate to an independent review pass when the task is risky, user-visible, weakly validated, previously failed review, or changes shared architecture.

Do not move to the next task until the current task has either:

passed review and been marked complete, or
been marked blocked with an explicit blocker

The helper scripts should reject invalid state transitions such as:

marking a task done without passing review
marking a task done without task-level validation results
activating a next task before its dependencies are done
blocking a task without a written blocker reason

Read references/review-policy.md before deciding whether to advance.

Stop Conditions

Keep going unless one of these is true:

every acceptance criterion is verified complete
a required permission is missing
a required input is missing and cannot be inferred safely
the environment or external dependency is unavailable
a high-risk decision has non-obvious consequences and cannot be made autonomously

If you stop, update the state files first and make the blocker explicit.

Script Usage

Use the helper scripts when possible instead of hand-editing structured state:

python scripts/init_harness.py ... to initialize task_list.json and progress.md
python scripts/check_next_task.py ... to validate the state and find the current or next task
python scripts/verify_state.py ... to validate the full state file before resuming or after state changes
python scripts/update_progress.py ... to update task status and append progress entries consistently

If you must edit state files manually, preserve the schema and keep status transitions explicit.

Related skills

More from zhangga/aihub

Installs

Repository

zhangga/aihub

GitHub Stars

First Seen

Apr 1, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykPass

long-run-harness

Long Run Harness

Core Operating Rules

Required State Files

First Session

Every Later Session

Task Execution Loop

Review Gate

Stop Conditions

Script Usage

More from zhangga/aihub

yahoo-data-fetcher

xai-stock-sentiment

remotion

skill-hub-builder

sensight

agent-browser