caption-clip
Pass
Audited by Gen Agent Trust Hub on Mar 4, 2026
Risk Level: SAFECOMMAND_EXECUTIONEXTERNAL_DOWNLOADSPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill executes several subprocesses to perform its tasks:
- Invokes
yt-dlpto download specific sections of YouTube videos. - Uses
ffmpegfor converting media files to MP4 and burning SRT subtitles into the video stream. - Employs shell commands (
grep,cut,tr) to extract the Deepgram API key from a local.envfile. - Executes a bundled Python script
json-to-srt.pyto transform API responses into subtitle files. - [EXTERNAL_DOWNLOADS]: Communicates with external services to fulfill its primary functions:
- Downloads video and audio streams from YouTube.
- Sends audio data to
api.deepgram.com(a well-known service) for transcription. - [PROMPT_INJECTION]: An indirect prompt injection surface is present during the subtitle cleanup workflow (Step 5):
- Ingestion points: The agent reads the content of
clip.srt, which is generated from external YouTube audio transcribed by a third-party API. - Boundary markers: There are no explicit delimiters or safety instructions provided to the agent to treat the subtitle content as untrusted data or to ignore potential instructions within it.
- Capability inventory: The skill has the ability to execute system commands (
ffmpeg,yt-dlp,python3) and perform file system writes. - Sanitization: The transcription data is not sanitized or validated for malicious prompts before being processed by the agent.
Audit Metadata