workflow-enforcer-skill
Warn
Audited by Gen Agent Trust Hub on Feb 16, 2026
Risk Level: MEDIUMPROMPT_INJECTION
Full Analysis
- Prompt Injection (MEDIUM): The skill uses imperative 'no-bypass' language designed to override the agent's internal decision-making processes.
- Evidence: The instructions include phrases like 'No exceptions, no rationalizations, no shortcuts' and 'Stop and use the appropriate agent' when the agent identifies a reason to skip a tool.
- Risk: This mirrors 'Ignore previous instructions' patterns and may pressure the agent to disregard safety protocols or ethical constraints if a malicious request is framed as a standard workflow task (e.g., a 'bug fix').
- Indirect Prompt Injection (MEDIUM): The skill serves as a dispatcher that ingests untrusted data and routes it to higher-capability tools.
- Ingestion points:
SKILL.mdmandates processing 'ANY user request' before responding. - Boundary markers: Absent. The skill does not use delimiters or protective wrappers when processing or announcing the task derived from user input.
- Capability inventory: The skill utilizes
TaskCreateandTodoWritefor state management and delegates execution to a catalog of 70+ specialized agents (including security auditors and deployment engineers) listed inreference/agents-catalog.md. - Sanitization: Absent. There is no logic provided to sanitize, validate, or filter user input before it is passed to the next agent in the chain.
- Persistence & Global Triggering (LOW): The
config.jsonandSKILL.mdspecify that the skill is 'automatic on all sessions' and 'applies to EVERY project.' This enforces a specific set of instructions globally without a per-session opt-in mechanism, increasing the impact of any contained injection vulnerabilities.
Audit Metadata