The Agent Skills Directory

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

Third-party content exposure detected (high risk: 0.90). The skill explicitly fetches and ingests untrusted third-party content: pretrained models and datasets from HuggingFace and external APIs, as shown in SKILL.md and references/api-evaluation.md (model args such as pretrained=meta-llama/..., and dataset_path: squad or local/HuggingFace datasets). It also runs and evaluates model-generated code (see "HumanEval not executing code" / --allow_code_execution and references/custom-tasks.md execute_code). As a result, external or user-generated content can be executed or can otherwise materially influence runtime behavior.

evaluating-llms-harness