aios-long-running-harness

Installation

SKILL.md

AIOS Long-Running Harness

Overview

Use this harness to keep long tasks stable under UI drift, model variability, and partial failures. It maps Anthropic's long-running-agent harness ideas into this repository's file-based workflow.

Harness Loop

Preflight: lock objective, stop conditions, budgets, and required artifacts.
Plan: split into idempotent steps with explicit success/failure evidence.
Execute: run one step at a time with tool output capture.
Verify: assert completion from page evidence, not assumptions.
Checkpoint: persist current state, artifacts, and next action.
Recover: on failure, classify and retry only with a changed hypothesis.
Complete: run final verification and write summary doc.

Pairing with Superpowers Skills

Plan step should be produced through superpowers:writing-plans (or superpowers:brainstorming first when scope is unclear).
For 2+ independent domains, use superpowers:dispatching-parallel-agents; for coupled domains, run sequentially.
If the runtime has no true subagent tool, emulate dispatch with explicit per-domain task queues and only parallelize safe independent reads/checks.
Always finish with superpowers:verification-before-completion before claiming run success.

Orchestrate Live Notes

aios orchestrate --execute live currently supports AIOS_SUBAGENT_CLIENT=codex-cli only.
Codex CLI v0.114+ structured exec outputs (--output-schema, --output-last-message, stdin) are required for handoff parsing; schema fallback to raw stdout is rejected.
Transient upstream_error/server_error failures are retried with exponential backoff via AIOS_SUBAGENT_UPSTREAM_MAX_ATTEMPTS and AIOS_SUBAGENT_UPSTREAM_BACKOFF_MS.

Required Controls

Time budget per step and per run.
Retry budget per failure class.
Human-gate checkpoints for login, payment, or policy-sensitive actions.
Structured logs for every major transition.

Failure Classes

Selector/UI drift.
Authentication/session loss.
Policy rejection/content moderation.
Network/transient failures.
Tool/runtime errors.

Completion Gate

Declare success only when all are true:

Target action succeeded.
Expected artifact exists.
Evidence snapshot/log exists.
Updated runbook reflects newly observed drift.

Resources

references/harness-checklist.md: operational checklist template.
references/anthropic-mapping.md: principle-to-project mapping.

Related skills

More from rexleimo/rex-cli

Installs

Repository

rexleimo/rex-cli

GitHub Stars

First Seen

Apr 15, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykWarn

aios-long-running-harness

AIOS Long-Running Harness

Overview

Harness Loop

Pairing with Superpowers Skills

Orchestrate Live Notes

Required Controls

Failure Classes

Completion Gate

Resources

More from rexleimo/rex-cli

skill-creator

contextdb-autopilot

seed2-manga-drama

debug

find-skills

xhs-ops-methods