evolve
Warn
Audited by Gen Agent Trust Hub on Feb 19, 2026
Risk Level: MEDIUMCOMMAND_EXECUTION
Full Analysis
- [COMMAND_EXECUTION] (MEDIUM): The skill executes arbitrary shell commands defined in the
checkfield ofGOALS.yaml(documented inreferences/goals-schema.md). This mechanism is used to measure 'fitness' but allows for arbitrary code execution if the YAML file is modified by an attacker or sourced from an untrusted repository. - Evidence:
GOALS.yamlschema definescheckas a "shell command — exit 0 = pass, non-zero = fail". - Context: This risk is intrinsic to the skill's purpose of autonomous testing and improvement, but lacks apparent validation or sandboxing for these commands.
- [INDIRECT_PROMPT_INJECTION] (LOW): The skill implements a 'Work Harvesting' loop (documented in
references/compounding.md) where it reads and executes items fromnext-work.jsonlwhich are generated from previous agent post-mortem reports. - Ingestion points:
next-work.jsonlandGOALS.yaml. - Boundary markers: None detected; the skill appears to trust the harvested JSONL items as valid work instructions.
- Capability inventory: Shell execution via
GOALS.yamlchecks and task execution via/rpicommands. - Sanitization: No sanitization or verification of the harvested work items is documented.
Audit Metadata