markitdown
Pass
Audited by Gen Agent Trust Hub on Mar 3, 2026
Risk Level: SAFEPROMPT_INJECTIONCOMMAND_EXECUTIONEXTERNAL_DOWNLOADSDATA_EXFILTRATION
Full Analysis
- [PROMPT_INJECTION]: The skill is vulnerable to indirect prompt injection through the processing of external files.
- Ingestion points: The skill ingests data from untrusted external sources, including PDF, DOCX, XLSX, and images, as well as transcripts from YouTube URLs via the
markitdownlibrary and OpenRouter API. - Boundary markers: No delimiters or "ignore embedded instructions" warnings are present in the scripts or the prompts used for content extraction.
- Capability inventory: The skill has the
Bashtool enabled and includes multiple Python scripts for batch processing and AI-enhanced conversion. It also has permissions toRead,Write, andEditfiles. - Sanitization: There is no evidence of sanitization, validation, or filtering of the extracted text content before it is presented to the agent.
- [COMMAND_EXECUTION]: The skill contains instructions and examples for executing local Python scripts.
- Evidence:
SKILL.mdprovides a command-line example forscripts/generate_schematic.py. However, this specific script is missing from the skill files, although other scripts likebatch_convert.pyandconvert_literature.pyare provided. - [EXTERNAL_DOWNLOADS]: The skill references and installs external software from well-known sources.
- Evidence: The installation instructions in
SKILL.mddirect users to install themarkitdownpackage from PyPI or clone it from Microsoft's official GitHub repository. - [DATA_EXFILTRATION]: The skill performs network operations to fetch data and process content via external APIs.
- Evidence: The
markitdowntool performs network requests to fetch YouTube transcripts. Additionally,scripts/convert_with_ai.pysends document data to the OpenRouter API (openrouter.ai) for image description generation.
Audit Metadata