langsmith-evaluator
Pass
Audited by Gen Agent Trust Hub on Mar 10, 2026
Risk Level: SAFEEXTERNAL_DOWNLOADSREMOTE_CODE_EXECUTIONPROMPT_INJECTION
Full Analysis
- External Script Installation: The skill includes a command to download and execute a setup script for a CLI tool from an official GitHub repository. This is a standard procedure for configuring the developer environment required for LangSmith evaluations.
- Indirect Prompt Injection Surface: The evaluator functions in
SKILL.md(e.g.,accuracy_evaluator) ingest data fromrun.outputsandexample.outputsand interpolate them directly into LLM prompts. Since this data originates from external agent runs, it presents a surface for indirect prompt injection. Ingestion points:run.outputsandexample.outputsinaccuracy_evaluatorandtrajectory_evaluator. Boundary markers: Absent. Capability inventory: Invokes LLM-as-judge calls (gpt-4o-mini). Sanitization: Absent. It is recommended to use structured output and delimiters to clearly separate instructions from data.
Audit Metadata