agent-performance-benchmarker

Pass

Audited by Gen Agent Trust Hub on Mar 1, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: The skill implements benchmarking logic in JavaScript and shell scripts for monitoring performance metrics. The implementation focuses on measuring throughput, latency, and resource utilization.
  • [COMMAND_EXECUTION]: Pre and post shell hooks are used for status logging. The scripts reference environment variables such as $TASK inside double quotes, which prevents word splitting and glob expansion of the variable's value and so reduces the risk of unintended expansion or injection.
  • [DATA_EXPOSURE]: The skill collects system-level metrics such as CPU, memory, and disk usage, which its primary function as a benchmarker requires. The collected data is stored locally or through the Model Context Protocol (MCP) memory tools, and there is no evidence of data exfiltration to external destinations.
  • [REMOTE_CODE_EXECUTION]: No remote code downloads or dynamic execution of untrusted scripts were found. All logic is contained within the skill's implementation or refers to internal monitoring components.
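
The quoting behavior noted under [COMMAND_EXECUTION] can be illustrated with a minimal sketch. It is not taken from the audited hooks: the value assigned to TASK and the helper count_args are hypothetical, chosen only to show how a quoted and an unquoted expansion differ.

```shell
#!/bin/sh
# Hypothetical value for illustration only; the audited hooks' real inputs are not shown in this report.
TASK='benchmark run; echo INJECTED *'

# Hypothetical helper: prints how many arguments it received.
count_args() {
  echo "$#"
}

# Quoted: the whole value is passed as a single argument,
# and ';' and '*' remain literal characters.
quoted_count=$(count_args "$TASK")

# Unquoted: the value is split on whitespace and '*' is subject to glob
# expansion, so the count depends on the value and the current directory.
unquoted_count=$(count_args $TASK)

echo "quoted: $quoted_count argument(s)"
echo "unquoted: $unquoted_count argument(s)"
```

In both cases the `;` is not run as a command separator, because the shell does not re-parse the result of a variable expansion. Quoting keeps the value as a single piece of data; it would not make it safe to pass the same value to `eval` or `sh -c`.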
Audit Metadata
  • Risk Level: SAFE
  • Analyzed: Mar 1, 2026, 04:32 PM