The Agent Skills Directory

[Indirect Prompt Injection] (LOW): Inherent vulnerability surfaces identified in the evaluation snippets.
Ingestion points: The 'conversation' parameter in 'judge_conversation' (line 62) and 'transcript' data in 'TranscriptReplayer._replay_single' (line 98).
Boundary markers: Absent from the prompt templates and processing logic.
Capability inventory: Limited to LLM generation for quality assessment and agent execution within a test harness; no unauthorized file access, network exfiltration, or command execution capabilities were detected.
Sanitization: None; external conversation content is interpolated directly into prompts.

context-engineering