reflect

Pass

Audited by Gen Agent Trust Hub on Feb 13, 2026

Risk Level: LOWNO_CODE
Full Analysis

The SKILL.md file contains only markdown text, outlining a 'Self-Refinement and Iterative Improvement Framework' for the AI. It defines a 'ruthless quality gatekeeper' persona and provides extensive guidelines for assessing task complexity, performing initial assessments, planning refinements, evaluating code-specific criteria, fact-checking, and reporting.

  1. Prompt Injection: The skill uses strong, directive language (e.g., 'Your Identity (NON-NEGOTIABLE)', 'CRITICAL WARNING', 'HARD RULE', 'STOP') to establish the AI's persona and guide its internal thought process. However, these are internal instructions aimed at making the AI more critical and rigorous, not at bypassing its safety guidelines, exfiltrating data, or performing any malicious actions. No patterns like 'Ignore previous instructions', 'You are now unrestricted', 'DAN', or system prompt extraction were found.
  2. Data Exfiltration: There are no commands or references to sensitive file paths (e.g., ~/.aws/credentials, ~/.ssh/id_rsa) nor any network operations (e.g., curl, wget, fetch, requests). The skill does not attempt to read or send any data.
  3. Obfuscation: No Base64 encoding, zero-width characters, Unicode homoglyphs, URL/hex/HTML encoding, or other obfuscation techniques were detected in the markdown content.
  4. Unverifiable Dependencies: The skill is purely instructional and does not involve installing any packages (npm install, pip install) or downloading external scripts (git clone, curl | bash). It mentions searching for existing libraries as part of its reflection process but does not execute such actions itself.
  5. Privilege Escalation: No commands like sudo, doas, chmod, or any attempts to modify system files or install services were found.
  6. Persistence Mechanisms: There are no instructions to modify shell configuration files (.bashrc, .zshrc), create cron jobs, or establish any other persistence mechanisms.
  7. Metadata Poisoning: The YAML front matter (name, description, argument-hint) is clean and contains no malicious instructions.
  8. Indirect Prompt Injection: The skill's purpose is to guide the AI's internal reflection, not to process external, untrusted user input in a way that would make it susceptible to indirect injection.
  9. Time-Delayed / Conditional Attacks: No conditional logic based on dates, times, usage counts, or environment variables that could trigger malicious behavior was found.

In summary, this skill is a 'NO_CODE' skill, meaning it consists solely of natural language instructions for the AI's internal reasoning. It poses no security risks.

Audit Metadata
Risk Level
LOW
Analyzed
Feb 13, 2026, 07:45 AM