ctf-ai-ml
Pass
Audited by Gen Agent Trust Hub on Apr 5, 2026
Risk Level: SAFEPROMPT_INJECTIONCOMMAND_EXECUTIONEXTERNAL_DOWNLOADSDATA_EXFILTRATION
Full Analysis
- [PROMPT_INJECTION]: The skill documentation includes numerous examples of direct and indirect prompt injection payloads.
- Evidence: Payloads like 'Ignore all previous instructions', 'DAN' jailbreak templates, and token smuggling techniques are documented in
llm-attacks.mdandSKILL.md. - Context: These are explicitly labeled as attack vectors for the user to study or use against CTF targets.
- [PROMPT_INJECTION]: Documentation reveals an attack surface for indirect prompt injection where the agent might ingest untrusted data.
- Ingestion points:
llm-attacks.mddescribes scenarios where the agent processes external web pages or retrieved documents (e.g., viaWebFetch). - Boundary markers: The provided exploit examples demonstrate the absence of boundary markers or instructions to ignore embedded data in target systems.
- Capability inventory: The skill environment provides
Bash,Write, andWebFetchcapabilities (SKILL.md). - Sanitization: No sanitization or validation of the ingested untrusted content is present in the documented exploitation patterns.
- [COMMAND_EXECUTION]: The skill provides various Python snippets for model analysis and weight manipulation, including the use of
torch.loadandsafetensorsinSKILL.mdandmodel-attacks.md. - Potential Risk:
torch.loadperforms deserialization which can execute arbitrary code if used on a malicious model file provided by a challenge author. - [DATA_EXFILTRATION]: The skill contains functional code for interacting with remote endpoints, which is used to send attack payloads and retrieve results.
- Evidence: Scripts in
llm-attacks.mdandmodel-attacks.mduse therequestslibrary andcurlto communicate with targets likehttp://challenge:8080/api/chat. - [EXTERNAL_DOWNLOADS]: The skill performs installations of common third-party machine learning and security libraries.
- Evidence:
pip installcommands inSKILL.mdtarget well-known packages includingtorch,transformers,scikit-learn, andpeft.
Audit Metadata