llm-evaluation

Fail

Audited by Socket on Mar 8, 2026

1 alert found:

Obfuscated File
Obfuscated FileHIGH
SKILL.md

The skill presents a coherent, well-scoped framework for evaluating LLM applications via automated metrics, human judgments, and LLM-based assessments. It does not require dangerous downloads, credential access, or remote action, and its data flows are contained to evaluated inputs and produced metrics/annotations. Overall risk is low with respect to security and data privacy; the footprint is proportionate to its stated purpose as an evaluation framework.

Confidence: 98%
Audit Metadata
Analyzed At
Mar 8, 2026, 12:11 AM
Package URL
pkg:socket/skills-sh/ckorhonen%2Fclaude-skills%2Fllm-evaluation%2F@71507f513b0a9d7ed6d79f6af809156481c75a40