genie-benchmark-evaluator
Fail
Audited by Socket on Mar 8, 2026
1 alert found:
Obfuscated FileObfuscated FileSKILL.md
HIGHObfuscated FileHIGH
SKILL.md
Benign overall: the skill remains coherent with its described purpose as a Genie Space benchmark evaluator. It defines a comprehensive, multi-layer evaluation workflow with structured ASI, UC-trace storage, and MLflow-backed tracking. While the architecture is complex and carries risk related to data flow integrity and process rigidity, there is no clear evidence of malicious or credential-harvesting behavior. Security risk isMedium due to reliance on multiple external services and data sinks, with no explicit credential leakage identified.
Confidence: 98%
Audit Metadata