pyvene-interventions

Warn

Audited by Snyk on Mar 28, 2026

Risk Level: MEDIUM
Full Analysis

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

  • Third-party content exposure detected (high risk: 1.00). The skill explicitly loads models and interventions from public hubs in its workflows, e.g., IntervenableModel.load("username/my-intervention"), model.from_pretrained("meta-llama/..."), and "zhengxuanzenwu/intervenable_honest_llama2_chat_7B". These calls fetch untrusted, user-hosted content from Hugging Face/GitHub. Because the agent applies that content to generation and interventions, it can materially change the agent's behavior.
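
One way to narrow this exposure is to gate hub loads behind an operator-reviewed allowlist and to require a pinned commit hash rather than a mutable branch name. The following is a minimal sketch of such a check, not part of the audited skill or of pyvene. The names TRUSTED_REPOS and is_pinned_and_trusted, and the repo ID in the allowlist, are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical allowlist of repo IDs the operator has reviewed (illustrative value only).
TRUSTED_REPOS = frozenset({"example-org/reviewed-intervention"})

# A full 40-character hexadecimal commit hash, as opposed to a mutable branch or tag name.
_COMMIT_HASH = re.compile(r"[0-9a-f]{40}")


def is_pinned_and_trusted(repo_id, revision, trusted=TRUSTED_REPOS):
    """Return True only if repo_id is allowlisted and revision is a full commit hash."""
    if repo_id not in trusted:
        return False
    if revision is None:
        return False
    return _COMMIT_HASH.fullmatch(revision) is not None
```

A caller would run this check before any hub load and refuse to proceed when it returns False. The check only restricts which sources are fetched; it does not inspect the downloaded content itself.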

Issues (1)

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

Audit Metadata
Risk Level: MEDIUM
Analyzed: Mar 28, 2026, 06:07 PM
Issues: 1