benchmark-agents
Warn
Audited by Socket on Mar 17, 2026
1 alert found:
SecuritySecuritySKILL.md
MEDIUMSecurityMEDIUM
SKILL.md
SUSPICIOUS: the skill's benchmarking purpose mostly matches its actions, but it requires broad execution powers, installs a behavior-changing plugin from an unpinned GitHub source, and orchestrates multiple autonomous interactive agent sessions that can modify projects and perform account-linked setup. The main concern is high operational and transitive-trust risk, not confirmed malware.
Confidence: 84%Severity: 79%
Audit Metadata