langsmith-evaluator

Installation

SKILL.md

LangSmith Evaluator

Create evaluators to measure agent performance on your datasets. LangSmith supports two types: LLM as Judge (uses LLM to grade outputs) and Custom Code (deterministic logic).

Setup

Environment Variables

LANGSMITH_API_KEY=lsv2_pt_your_api_key_here          # Required
LANGSMITH_WORKSPACE_ID=your-workspace-id              # Optional: for org-scoped keys
OPENAI_API_KEY=your_openai_key                        # For LLM as Judge

Dependencies

pip install langsmith langchain-openai python-dotenv

Installs

Repository

jackjin1997/clawforge

GitHub Stars

First Seen

Feb 16, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykPass

langsmith-evaluator — jackjin1997/clawforge