ai-multimodal
Warn
Audited by Gen Agent Trust Hub on Mar 5, 2026
Risk Level: MEDIUMCOMMAND_EXECUTIONDATA_EXFILTRATIONPROMPT_INJECTIONREMOTE_CODE_EXECUTION
Full Analysis
- [DYNAMIC_EXECUTION]: The script
scripts/media_optimizer.pyuses theeval()function to parse ther_frame_ratestring obtained fromffprobe. Because this string is derived from the metadata of external media files, a maliciously crafted file could potentially execute arbitrary code when processed by this script. - [COMMAND_EXECUTION]: The skill relies on
subprocess.runto execute external binariesffmpegandffprobe. While these are used for the legitimate purpose of media optimization, the interaction with system-level utilities increases the attack surface if input filenames or metadata are not strictly validated. - [DATA_EXPOSURE]: The
find_api_keyfunction inscripts/gemini_batch_process.pyandscripts/document_converter.pyimplements a recursive search for.envfiles in parent directories (up to the project root). This behavior could lead to the unintended exposure of sensitive environment variables or credentials stored in higher-level directories. - [INDIRECT_PROMPT_INJECTION]: The skill is designed to ingest and process untrusted external data from various media formats (PDF, audio, video, images) and relay that content to an LLM. It lacks explicit boundary markers or 'ignore' instructions in its prompts to prevent the model from executing commands embedded within the analyzed documents.
- Ingestion points: Files provided through the
--inputor--filesCLI arguments ingemini_batch_process.pyanddocument_converter.py. - Boundary markers: None identified; the prompts instruct the model on formatting but do not explicitly warn against instructions contained within the media.
- Capability inventory: The skill has the ability to write to the filesystem, execute shell commands (via ffmpeg), and perform network operations (Gemini API).
- Sanitization: The skill does not perform sanitization of text extracted from OCR or transcription before passing it to the next stage of the LLM pipeline.
Audit Metadata