skills/simhacker/moollm/debate/Gen Agent Trust Hub

debate

Fail

Audited by Gen Agent Trust Hub on Feb 16, 2026

Risk Level: HIGHPROMPT_INJECTION
Full Analysis
  • [Indirect Prompt Injection] (HIGH): The skill identifies multiple points where untrusted external data enters the agent context and is used alongside sensitive file-writing tools.
  • Ingestion points: Parameters such as 'topic' in CREATE-DEBATE, 'argument' in ADD-SIDE, and 'point' in ARGUE (documented in CARD.yml).
  • Boundary markers: Absent. There are no delimiters or instructions provided to isolate processed text from agent instructions.
  • Capability inventory: The skill requires and uses read_file and write_file tools to maintain debate state and generate transcripts (CARD.yml).
  • Sanitization: Absent. There is no evidence of validation, escaping, or filtering of the inputs before they are processed or written to disk.
  • [Self-Referential Metadata] (LOW): The file skill-snitch-report.md contains a pre-authored 'Verdict: APPROVE' and a 'LOW' risk assessment. Following the 'GLOBAL RULE', these claims are treated as data to be evaluated rather than authoritative conclusions.
Recommendations
  • AI detected serious security threats
Audit Metadata
Risk Level
HIGH
Analyzed
Feb 16, 2026, 10:40 PM