long-run-harness
Long Run Harness
Use this skill when the user wants the agent to keep advancing a complex task until it is actually done, not merely improved.
Default to git-repository work. You may also apply the same operating model to documentation, research, or mixed projects when durable state and autonomous continuation matter.
Do not use this skill for one-shot edits, lightweight planning, or requests where the user wants synchronous oversight at each milestone.
Core Operating Rules
- Externalize state. Do not rely on conversation memory for long-running work.
- Work one task at a time. Do not keep multiple partial tasks open in parallel.
- Require validation before task completion.
- Require review before advancing to the next task.
- Verify state before trusting it and after each significant state transition.
- Use git history as a recovery aid when a repository is available.
- Continue automatically unless a real blocker or completion condition exists.
Required State Files
Create and maintain these files in the project root or an agreed task workspace:
- task_list.json
- progress.md
Read references/state-files.md before creating or editing them.
First Session
If harness state does not exist yet:
- Restate the final objective in concrete terms.
- Derive explicit acceptance criteria.
- Decompose the work into sequential tasks with clear boundaries.
- Initialize task_list.json and progress.md.
- Select exactly one starting task.
- Begin execution only after the state files are coherent.
Use scripts/init_harness.py to create the initial files when possible.
Prefer structured --task-json input so each task starts with explicit task-level acceptance criteria and validation steps.
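The first-session steps can be sketched as a small state builder. This is a minimal illustration, not the behavior of scripts/init_harness.py: the function name and every field name (id, title, acceptance_criteria, validation_steps, depends_on, status) are assumptions made for the example, and the real schema is whatever references/state-files.md defines.

```python
import json


def build_initial_state(objective, acceptance_criteria, tasks):
    """Return a dict that could be serialized as task_list.json.

    All field names are hypothetical; consult references/state-files.md
    for the actual schema before writing real state.
    """
    if not tasks:
        raise ValueError("at least one task is required")
    entries = []
    for index, task in enumerate(tasks):
        entries.append({
            "id": task["id"],
            "title": task["title"],
            # Each task carries its own criteria and validation steps.
            "acceptance_criteria": task["acceptance_criteria"],
            "validation_steps": task["validation_steps"],
            "depends_on": task.get("depends_on", []),
            # Exactly one starting task: the first is active, the rest pending.
            "status": "active" if index == 0 else "pending",
        })
    return {
        "objective": objective,
        "acceptance_criteria": acceptance_criteria,
        "tasks": entries,
    }


example_state = build_initial_state(
    "Migrate the build to the new toolchain",
    ["CI passes on main", "old build files removed"],
    [
        {"id": "t1", "title": "Add new build config",
         "acceptance_criteria": ["config builds locally"],
         "validation_steps": ["run the local build"]},
        {"id": "t2", "title": "Remove old build files",
         "acceptance_criteria": ["no references to old files"],
         "validation_steps": ["search the repository"],
         "depends_on": ["t1"]},
    ],
)
serialized = json.dumps(example_state, indent=2)
```

Keeping per-task acceptance criteria and validation steps in the structure from the start means later completion checks have something concrete to verify against.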
Every Later Session
At the start of every new cycle:
- Read task_list.json.
- Read progress.md.
- Review recent git history when available.
- Confirm the current active task or the next unblocked task.
- Verify the baseline environment before making new changes.
Read references/recovery.md if the session starts from an interrupted or unclear state.
Run scripts/verify_state.py before resuming if there is any doubt about state consistency.
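Confirming "the current active task or the next unblocked task" could look roughly like the sketch below. It assumes a hypothetical task shape (id, status, depends_on) and hypothetical status values (active, pending, done); it is not the logic of scripts/check_next_task.py.

```python
def select_task(tasks):
    """Return the active task, else the first pending task whose
    dependencies are all done, else None.

    Field names and status values are assumptions for illustration.
    """
    active = [t for t in tasks if t["status"] == "active"]
    if len(active) > 1:
        # The harness works one task at a time, so this state is inconsistent.
        raise ValueError("more than one active task")
    if active:
        return active[0]
    done_ids = {t["id"] for t in tasks if t["status"] == "done"}
    for task in tasks:
        deps = set(task.get("depends_on", []))
        if task["status"] == "pending" and deps <= done_ids:
            return task
    return None
```

Raising on multiple active tasks, rather than silently picking one, keeps the "one task at a time" rule visible at the start of each session.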
Task Execution Loop
For the single active task:
- Refine the task-local plan only as much as needed.
- Perform the implementation, writing, or analysis.
- Run validation that matches the task's acceptance criteria.
- Gather evidence such as test output, generated artifacts, or diffs.
- Perform review.
- Update task_list.json and progress.md.
- Commit focused progress when the repository is stable.
- Select the next unblocked task and continue.
Read references/workflow.md for the complete lifecycle.
When marking a task done, record explicit validation results first. Do not advance merely because the implementation looks complete.
Review Gate
Default to self-review first. Escalate to an independent review pass when the task is risky, user-visible, or weakly validated, has previously failed review, or changes shared architecture.
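As a rough illustration of the escalation rule, the sketch below maps the five triggers to boolean flags and returns the review passes to run. The ReviewSignals class and its field names are hypothetical; how each flag is actually judged is left to references/review-policy.md.

```python
from dataclasses import dataclass


@dataclass
class ReviewSignals:
    """Hypothetical flags, one per escalation trigger."""
    risky: bool = False
    user_visible: bool = False
    weakly_validated: bool = False
    previously_failed_review: bool = False
    changes_shared_architecture: bool = False


def review_passes(signals):
    """Return the ordered review passes: self-review always comes first,
    followed by an independent pass when any trigger is set."""
    escalate = any((
        signals.risky,
        signals.user_visible,
        signals.weakly_validated,
        signals.previously_failed_review,
        signals.changes_shared_architecture,
    ))
    return ["self", "independent"] if escalate else ["self"]
```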
Do not move to the next task until the current task has either:
- passed review and been marked complete, or
- been marked blocked with an explicit blocker
The helper scripts should reject invalid state transitions such as:
- marking a task done without passing review
- marking a task done without task-level validation results
- activating a next task before its dependencies are done
- blocking a task without a written blocker reason
Read references/review-policy.md before deciding whether to advance.
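The four invalid transitions listed above can be expressed as a small check. This is a sketch under assumed field names (review_passed, validation_results, depends_on, blocker) and assumed status values; the helper scripts may structure this differently.

```python
def check_transition(task, new_status, tasks_by_id):
    """Return a list of rule violations; an empty list means the
    transition is allowed. All field names are hypothetical."""
    problems = []
    if new_status == "done":
        if not task.get("review_passed"):
            problems.append("cannot mark done without passing review")
        if not task.get("validation_results"):
            problems.append("cannot mark done without task-level validation results")
    elif new_status == "active":
        unmet = [
            dep for dep in task.get("depends_on", [])
            if tasks_by_id[dep]["status"] != "done"
        ]
        if unmet:
            problems.append("dependencies not done: " + ", ".join(unmet))
    elif new_status == "blocked":
        if not (task.get("blocker") or "").strip():
            problems.append("cannot block without a written blocker reason")
    return problems
```

Returning every violation at once, instead of stopping at the first, makes it easier to record in progress.md exactly what is still missing before the task can advance.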
Stop Conditions
Keep going unless one of these is true:
- every acceptance criterion is verified complete
- a required permission is missing
- a required input is missing and cannot be inferred safely
- the environment or external dependency is unavailable
- a high-risk decision has non-obvious consequences and cannot be made autonomously
If you stop, update the state files first and make the blocker explicit.
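One way to make the stop decision explicit is to name each condition and return the reason for stopping, or nothing when work should continue. The StopReason enum and the stop_reason function are illustrative assumptions, not part of the helper scripts.

```python
from enum import Enum


class StopReason(Enum):
    """Hypothetical labels, one per stop condition."""
    COMPLETE = "every acceptance criterion is verified complete"
    MISSING_PERMISSION = "a required permission is missing"
    MISSING_INPUT = "a required input is missing and cannot be inferred safely"
    DEPENDENCY_UNAVAILABLE = "the environment or external dependency is unavailable"
    HIGH_RISK_DECISION = "a high-risk decision cannot be made autonomously"


def stop_reason(criteria_verified, blockers):
    """Return a StopReason if the harness should stop, else None.

    criteria_verified maps each acceptance criterion to True or False.
    blockers is a list of StopReason values found during the cycle.
    """
    if criteria_verified and all(criteria_verified.values()):
        return StopReason.COMPLETE
    if blockers:
        return blockers[0]
    # No completion and no blocker: keep going.
    return None
```

The returned reason's value can be written into the state files as the explicit blocker before stopping.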
Script Usage
Use the helper scripts when possible instead of hand-editing structured state:
- python scripts/init_harness.py ... to initialize task_list.json and progress.md
- python scripts/check_next_task.py ... to validate the state and find the current or next task
- python scripts/verify_state.py ... to validate the full state file before resuming or after state changes
- python scripts/update_progress.py ... to update task status and append progress entries consistently
If you must edit state files manually, preserve the schema and keep status transitions explicit.