run-experiment
Pass
Audited by Gen Agent Trust Hub on Apr 10, 2026
Risk Level: SAFEPROMPT_INJECTION
Full Analysis
- [PROMPT_INJECTION]: The skill exhibits a surface for indirect prompt injection as it processes untrusted data from external sources.
- Ingestion points: The workflow involves reading and analyzing production traces, conversation histories, and external datasets (referenced in SKILL.md, conversation-evaluation.md, and rag-evaluation.md).
- Boundary markers: The instructions lack explicit boundary markers or 'ignore' directives that would prevent the agent from obeying instructions embedded within the data it is tasked with analyzing.
- Capability inventory: The skill is granted access to high-privilege tools such as
Bash,Write, andEdit, in addition to theorq*platform tools, which increases the potential impact of a successful injection. - Sanitization: There is no evidence of sanitization or validation logic applied to the external data before it is presented to the agent's context.
Audit Metadata