red-team-frameworks

Warn

Audited by Snyk on Feb 16, 2026

Risk Level: MEDIUM
Full Analysis

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

  • Third-party content exposure detected (high risk: 0.80). The skill explicitly shows fetching and interacting with public/untrusted sources—e.g., garak supports custom REST endpoints (--model_name "https://api.example.com/v1/chat"), generators include HuggingFace/REST API, and TextAttack loads HuggingFaceDataset("sst2")—so the agent would fetch and interpret untrusted third‑party content as part of its workflow.
Audit Metadata
Risk Level
MEDIUM
Analyzed
Feb 16, 2026, 01:35 AM