llm-evaluation
Fail
Audited by Socket on Mar 7, 2026
1 alert found:
Obfuscated FileObfuscated FileSKILL.md
HIGHObfuscated FileHIGH
SKILL.md
The skill's documented footprint is coherent with its stated purpose: it focuses on evaluation methodology for LLMs without introducing suspicious or invasive data handling. There are no evident supply-chain or credential-exfiltration risks in the provided content. Minor risk considerations include secure use of external evaluation services and dependencies when implemented, but the conceptual design remains benign and proportionate to its described goal.
Confidence: 98%
Audit Metadata