Model Evaluator

Pass

Audited by Gen Agent Trust Hub on Feb 16, 2026

Risk Level: LOWPROMPT_INJECTION
Full Analysis
  • [Indirect Prompt Injection] (INFO): The skill includes an LLM-as-judge protocol that processes untrusted model responses. Ingestion points: prompt templates in SKILL.md. Boundary markers: Absent. Capability inventory: Display and reasoning only (Tier: INFO). Sanitization: None specified.
  • [Data Exposure & Exfiltration] (SAFE): No hardcoded secrets, sensitive file paths, or network exfiltration patterns were detected in the skill content.
  • [Remote Code Execution] (SAFE): The skill consists of documentation and code templates; it does not contain executable scripts or external dependency requirements.
Audit Metadata
Risk Level
LOW
Analyzed
Feb 16, 2026, 08:24 AM