The Agent Skills Directory

[PROMPT_INJECTION]: The skill exhibits a vulnerability to indirect prompt injection due to its core function of collecting and evaluating natural language rationales from multiple potentially untrusted agents or external sources.
Ingestion points: Data enters the agent's context during the 'Collect Votes' step in SKILL.md, where rationales and confidence scores are gathered.
Boundary markers: There are no specific delimiters or instructional barriers (e.g., 'ignore embedded instructions') defined to prevent the agent from acting upon instructions hidden within the rationales.
Capability inventory: The skill is configured with high-privilege tools including Bash, Write, and Edit, which increases the risk if an injection successfully steers the agent's actions.
Sanitization: No validation or sanitization logic exists in the provided scripts or instructions to filter out executable commands or prompt overrides from the gathered input.

consensus-voting