blip-2-vision-language

Warn

Audited by Snyk on Feb 16, 2026

Risk Level: MEDIUM
Full Analysis

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

  • Third-party content exposure detected (high risk: 0.80). The skill accepts and processes arbitrary user-provided images and text through its Gradio image upload and textbox and its FastAPI /caption and /batch_caption endpoints. It also fetches images from URLs (load_image_from_url in the troubleshooting section). As a result, the agent reads and interprets untrusted third-party content as part of its captioning and VQA workflows.
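
One way to narrow the URL-fetching part of this finding is to validate a URL before anything like load_image_from_url is called. The sketch below is an illustration only, not part of the audited skill: the function name is_safe_image_url, the resolve parameter, and the allow-list of schemes are all assumptions, and the real signature of load_image_from_url is not known from this report. It uses only the Python standard library.

```python
import ipaddress
import socket
from urllib.parse import urlparse

# Assumption: only plain web URLs should ever be fetched for captioning.
ALLOWED_SCHEMES = {"http", "https"}


def _default_resolve(host):
    """Resolve a hostname to a list of IP address strings via the system resolver."""
    infos = socket.getaddrinfo(host, None)
    return [info[4][0] for info in infos]


def _is_public_ip(ip_text):
    """Return True only for globally routable addresses."""
    try:
        return ipaddress.ip_address(ip_text).is_global
    except ValueError:
        # Unparseable address text is treated as unsafe.
        return False


def is_safe_image_url(url, resolve=None):
    """Hypothetical pre-fetch check: http(s) only, and no private,
    loopback, or link-local destination addresses."""
    try:
        parsed = urlparse(url)
        host = parsed.hostname
    except ValueError:
        return False
    if parsed.scheme not in ALLOWED_SCHEMES or not host:
        return False
    try:
        # Host is already a literal IP address.
        ipaddress.ip_address(host)
        addresses = [host]
    except ValueError:
        # Host is a name: resolve it (the resolver is injectable for testing).
        try:
            addresses = (resolve or _default_resolve)(host)
        except OSError:
            return False
    return bool(addresses) and all(_is_public_ip(a) for a in addresses)
```

A caller would run this check first and refuse to fetch when it returns False. It does not address the prompt-injection risk from the image or text content itself, and it does not pin the resolved address, so a fetch that resolves the name again could still reach a different host.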
Audit Metadata
Risk Level: MEDIUM
Analyzed: Feb 16, 2026, 12:31 AM