The Agent Skills Directory

[Prompt Injection] (SAFE): No instructions to override behavior or bypass safety filters were found in the reference content.
[Data Exposure & Exfiltration] (SAFE): No hardcoded credentials, sensitive file access, or unauthorized network operations were detected.
[Unverifiable Dependencies] (SAFE): The skill snippets reference standard data science libraries for metrics. No unverified package installations or remote script executions are present.
[Indirect Prompt Injection] (LOW): This category is flagged because the skill processes untrusted model responses. 1. Ingestion points: response, response_a, and response_b variables in references/bias-mitigation.md and references/implementation-patterns.md. 2. Boundary markers: Absent in the provided logic snippets. 3. Capability inventory: Restricted to evaluation logic and abstract LLM generation calls; no shell or file system access. 4. Sanitization: Includes an anonymize_response function for model attribution removal.

advanced-evaluation