verification-before-completion

Pass

Audited by Gen Agent Trust Hub on Feb 22, 2026

Risk Level: SAFE
Full Analysis
  • [Prompt Injection] (SAFE): The instructions use strong imperative language ('The Iron Law', 'Non-negotiable') to enforce operational honesty and verification. These do not constitute a safety bypass or an attempt to override global AI constraints.
  • [Data Exposure & Exfiltration] (SAFE): No sensitive file paths, credentials, or network transmission commands (e.g., curl, wget) are present.
  • [Remote Code Execution] (SAFE): While the skill references running 'verification commands', it provides no specific commands or scripts to execute. It describes a logical framework for the agent to apply to its existing environment.
  • [Obfuscation] (SAFE): The content is entirely plain text, with no evidence of encoded payloads (e.g., Base64), zero-width characters, or homoglyphs.
  • [Indirect Prompt Injection] (SAFE): The skill does not process external or untrusted data; it is an internal instruction set for the agent's reasoning process.
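
The checks above name a few concrete signals: network commands such as curl and wget, Base64-style encoding, zero-width characters, and homoglyphs. The report does not say how the audit detects them, so the following is only a minimal sketch of how such signals could be scanned for in a skill's text. The function name `scan_skill_text`, the patterns, and the 40-character threshold are all hypothetical and are not taken from Gen Agent Trust Hub.

```python
import re

# Hypothetical patterns for illustration only; the audit's real rules are not
# described in the report.
ZERO_WIDTH = {"\u200b", "\u200c", "\u200d", "\u2060", "\ufeff"}
# Assumed threshold: 40 or more consecutive Base64-alphabet characters.
BASE64_RUN = re.compile(r"[A-Za-z0-9+/]{40,}={0,2}")
NETWORK_CMD = re.compile(r"\b(curl|wget)\b")


def scan_skill_text(text: str) -> dict[str, bool]:
    """Return one flag per check; True means a suspicious signal was found."""
    return {
        "zero_width": any(ch in ZERO_WIDTH for ch in text),
        "base64_like": bool(BASE64_RUN.search(text)),
        "network_command": bool(NETWORK_CMD.search(text)),
        # Crude homoglyph proxy: any non-ASCII letter. This also flags
        # legitimate accented or non-Latin text, so treat it as a hint only.
        "non_ascii_letter": any(ch.isalpha() and ord(ch) > 127 for ch in text),
    }


if __name__ == "__main__":
    sample = "Run the verification command before claiming completion."
    print(scan_skill_text(sample))
```

A result with every flag set to False would be consistent with the SAFE findings above, though a simple scan like this cannot replace a manual review.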
Audit Metadata
Risk Level: SAFE
Analyzed: Feb 22, 2026, 12:08 PM