codex
Fail
Audited by Gen Agent Trust Hub on Feb 16, 2026
Risk Level: HIGHREMOTE_CODE_EXECUTIONCOMMAND_EXECUTIONDATA_EXFILTRATIONPROMPT_INJECTION
Full Analysis
- Indirect Prompt Injection (HIGH): The skill is designed to ingest external code and problem statements for analysis and 'automated editing'. It lacks any boundary markers or sanitization for this untrusted data.
- Ingestion points:
codex -p "<DETAILED CONTEXT>"inSKILL.mdusesRead,Grep, andGlobto gather context. - Boundary markers: Absent. Instructions are interpolated directly into the CLI prompt.
- Capability inventory: Use of
Bashfor command execution,WebFetchfor network access, and thecodexCLI's native ability to modify files (bypassing the agent'sdisallowedToolsrestriction onWrite/Edit). - Sanitization: Absent. Untrusted code snippets are passed directly to the model/CLI.
- Command Execution & Privilege Escalation (HIGH): The skill uses
Bashto execute an externalcodexbinary with dangerous configurations. - Evidence: Usage of
--sandbox danger-full-accessand--full-autoflags which permit network access and autonomous execution. - Privilege Bypass:
codex-agent.mdexplicitly states 'All file modifications must go through codex CLI' to circumvent the fact thatWriteandEdittools are disallowed for the agent itself. - Metadata Poisoning & Misleading Claims (MEDIUM): The skill references 'gpt-5.2' and 'gpt-5.2-codex' models which do not currently exist, suggesting either hallucinated instructions or a specialized/potentially malicious environment.
- Dynamic Execution / Self-Modification (MEDIUM): The 'Heal-Skill' integration allows the skill to modify its own
SKILL.mddocumentation based on the output ofcodex --help. This could be exploited to inject new malicious instructions if the CLI output is compromised. - Remote Code Execution (MEDIUM): While it doesn't download a script and pipe it to bash directly, it relies on an external, unverified binary (
codex) and provides it with full-auto capabilities over the local workspace.
Recommendations
- AI detected serious security threats
Audit Metadata