scientific-critical-thinking

Pass

Audited by Gen Agent Trust Hub on Mar 3, 2026

Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill uses the Bash tool to run a local Python script scripts/generate_schematic.py for creating visual diagrams. While this is a vendor-provided tool, the script's content is not included in the skill package for security verification.
  • [PROMPT_INJECTION]: The skill processes untrusted external scientific literature which introduces an indirect prompt injection surface.
  • Mandatory Evidence Chain for [PROMPT_INJECTION]:
  • Ingestion points: Scientific claims, research papers, and document text provided by the user as described in SKILL.md and processed via analysis tools.
  • Boundary markers: No specific delimiters or "ignore embedded instructions" warnings are defined to separate untrusted data from the agent's operational prompt.
  • Capability inventory: The skill is granted access to the Bash tool (for local script execution), as well as Write, Edit, and Read capabilities for interacting with the file system.
  • Sanitization: There is no documentation of input sanitization or filtering to prevent the agent from obeying instructions embedded within the scientific text it is analyzing.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 3, 2026, 08:48 PM