The Agent Skills Directory

EXTERNAL_DOWNLOADS (LOW): The PerplexityDetector class in references/gibberish-detection.md uses the transformers library to download the GPT-2 model from Hugging Face. While Hugging Face is a trusted organization, the download of external model weights is a significant network operation.
PROMPT_INJECTION (LOW): The detect_gibberish function in SKILL.md implements an 'LLM-as-judge' pattern that is vulnerable to indirect prompt injection. Untrusted data from the agent's response is placed directly into a prompt template used for scoring.
Ingestion points: The response argument in the detect_gibberish function in SKILL.md.
Boundary markers: Absent; the response text is concatenated directly into the judge_prompt string.
Capability inventory: The generated prompt is processed by llm.generate(), which influences the monitoring system's alerting and scoring decisions.
Sanitization: No sanitization or escaping of the input response is performed before interpolation.

silent-failure-detection