ml-paper-writing
Pass
Audited by Gen Agent Trust Hub on Mar 28, 2026
Risk Level: SAFEPROMPT_INJECTIONEXTERNAL_DOWNLOADSDATA_EXFILTRATION
Full Analysis
- [PROMPT_INJECTION]: The skill's primary workflow involves reading and summarizing data from a user's research repository, including README files, experimental logs, and artifacts. This creates a surface for indirect prompt injection (Category 8) where malicious content in those files could attempt to influence the agent's behavior during the drafting process.
- Ingestion points: Workflow 0 in
SKILL.mdinstructs the agent to explore repositories and read file contents using shell commands likels,find, andgrep. - Boundary markers: The skill does not explicitly instruct the agent to use delimiters or to ignore potential instructions embedded in the project files.
- Capability inventory: The skill has access to shell commands (
ls,find,grep,cp), Python script execution (via dependencies), and network access for citation verification. - Sanitization: External data is used directly for drafting without a specific sanitization or filtering step described in the instructions.
- Ingestion points: Workflow 0 in
- [EXTERNAL_DOWNLOADS]: The skill involves downloading resources from official and well-known academic domains.
- The
Makefileintemplates/neurips2025/includes an upgrade target that usescurlto fetch LaTeX style files from the official NeurIPS domain (media.neurips.cc). - The main instructions in
SKILL.mdrecommend installing the Exa MCP tool for academic search from its official domain (mcp.exa.ai).
- The
- [DATA_EXFILTRATION]: The skill performs network operations to fetch academic metadata, which is standard for its intended purpose.
- It uses Python libraries and
requeststo query established academic APIs includingdoi.org,api.semanticscholar.org,api.crossref.org, andexport.arxiv.org. These requests are used to retrieve verified BibTeX entries and paper abstracts.
- It uses Python libraries and
Audit Metadata