AudioEditor
Pass
Audited by Gen Agent Trust Hub on May 2, 2026
Risk Level: SAFECOMMAND_EXECUTIONDATA_EXFILTRATIONEXTERNAL_DOWNLOADSPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill executes various CLI tools for media processing, including
ffmpeg,ffprobe,whisper, andinsanely-fast-whisper. It also performs backgroundcurlrequests to a local notification server (localhost:8888) to provide status updates. - [DATA_EXFILTRATION]: The skill transmits audio and transcript data to third-party services (Anthropic and Cleanvoice) to perform AI-based analysis and processing. This is a functional requirement of the skill.
- [EXTERNAL_DOWNLOADS]: The
Polish.tstool downloads processed media files from Cleanvoice's infrastructure after cloud-based processing is complete. - [PROMPT_INJECTION]: The skill processes untrusted audio transcripts and uses an LLM to identify edit points. The system prompt explicitly instructs the LLM to follow 'edit markers' spoken by the user in the audio, creating an indirect prompt injection surface where the processed data can influence the tool's behavior.
- Ingestion points: Audio transcripts processed in
Analyze.ts. - Boundary markers: None identified; transcript segments are interpolated directly into the LLM prompt.
- Capability inventory: Analysis results drive the
Edit.tstool, which executes file-cutting operations usingffmpeg. - Sanitization: No filtering or sanitization is performed on the transcript text to distinguish between intended content and potentially malicious embedded instructions.
Audit Metadata