langsmith-evaluator

Pass

Audited by Gen Agent Trust Hub on Mar 5, 2026

Risk Level: SAFE
Full Analysis
  • [Data Processing Surface]: The skill facilitates the creation of evaluators that ingest and process data from LangSmith run outputs and dataset examples. While this introduces a surface where untrusted data could interact with evaluation logic, it is the fundamental purpose of the skill and is managed through recommended practices like inspecting output structures before implementation.
  • [Service Integration]: The included Python and TypeScript scripts interact with the official LangSmith API to manage evaluation rules. These scripts use standard authentication via environment variables and communicate with the vendor's legitimate infrastructure at api.smith.langchain.com.
  • [Local Module Loading]: The Python upload utility dynamically loads local evaluator files specified by the user to extract function source code. This is a standard approach for developer tools that bridge local code to cloud services.
  • [Remote Evaluator Execution]: Evaluators uploaded via this skill are designed to run within the LangSmith cloud environment. The skill explicitly notes the limited execution environment and lack of external package access for these remote evaluators, which aligns with standard security sandboxing practices.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 5, 2026, 01:18 AM