ai-building-chatbots
Fail
Audited by Gen Agent Trust Hub on Feb 16, 2026
Risk Level: HIGHPROMPT_INJECTION
Full Analysis
- [PROMPT_INJECTION] (HIGH): The skill is highly susceptible to indirect prompt injection due to the way it handles untrusted data.
- Ingestion points: Untrusted data enters the agent via
user_messagein thehandle_messagefunction (examples.md) and through thedocsretrieved by theretrieverinSupportBot.forwardandFAQBot.forward. - Boundary markers: There are no delimiters or boundary markers (e.g., XML tags, triple backticks) used in the
dspy.Signaturedefinitions to isolate user-provided text from instructions. - Capability inventory: The skill uses
dspy.Predictanddspy.ChainOfThoughtto classify intent and generate responses. While no direct file-system or network-write capabilities are shown, theintentclassification directly controls the logic flow (e.g., triggeringescalate_node). - Sanitization: No input validation or sanitization is performed on
user_messageordocsbefore they are interpolated into the prompt signatures. - Risk: An attacker could provide a
user_messageor poison the retrieveddocsto override theClassifyIntentsignature, forcing an unearned escalation or manipulating theSupportResponseto leak previousconversation_historyor internal retrieval logic.
Recommendations
- AI detected serious security threats
Audit Metadata