The Agent Skills Directory

Prompt Injection (SAFE): The skill includes behavioral constraints intended to maintain technical rigor (e.g., forbidding 'performative agreement'). These do not constitute a safety bypass or an attempt to override core agent protections.
Data Exposure & Exfiltration (SAFE): No access to sensitive local files (e.g., SSH keys, credentials) was detected. Network activity is limited to the GitHub API for pull request interactions, which is consistent with the skill's stated purpose. GitHub is a trusted source.
Indirect Prompt Injection (SAFE): The skill's primary function is to process external, untrusted data (GitHub code review comments). This represents a vulnerability surface. However, the skill explicitly mandates a verification workflow that includes restating requirements and checking suggestions against the actual codebase before implementation. Mandatory evidence chain for Category 8: 1. Ingestion points: GitHub PR comments (external). 2. Boundary markers: The skill instructs the agent to restate the technical requirement in its own words. 3. Capability inventory: File reading (grep), file writing (code implementation), and network writing (GitHub API replies). 4. Sanitization: No formal sanitization, but the logic requires manual technical verification.
Command Execution (SAFE): The skill utilizes grep for codebase searches and gh api for responding to threads. These are standard operations for a coding-focused agent and are used safely within the context of the workflow.

receiving-code-review