sentencepiece

Warn

Audited by Snyk on Mar 28, 2026

Risk Level: MEDIUM
Full Analysis

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

  • Third-party content exposure detected (high risk: 0.70). The skill's training guide (references/training.md) explicitly shows loading a public Hugging Face dataset via datasets.load_dataset('wikitext', ...) and training from that iterator, which ingests untrusted, user-generated web text as part of the training workflow and can materially influence tokenization/model behavior.

Issues (1)

W011
MEDIUM

Third-party content exposure detected (indirect prompt injection risk).

Audit Metadata
Risk Level
MEDIUM
Analyzed
Mar 28, 2026, 06:06 PM
Issues
1