agent-performance-benchmarker
Pass
Audited by Gen Agent Trust Hub on Mar 1, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: The skill implements benchmarking logic in JavaScript and shell scripts for monitoring performance metrics. The implementation focuses on measuring throughput, latency, and resource utilization.
- [COMMAND_EXECUTION]: Shell hooks (pre and post) are used for status logging. The scripts use environment variables like $TASK within double quotes, which is a safe practice to prevent unintended shell expansion or injection.
- [DATA_EXPOSURE]: The skill collects system-level metrics such as CPU, memory, and disk usage. This is required for its primary function as a benchmarker. The collected data is stored locally or via the Model Context Protocol (MCP) memory tools, with no evidence of unauthorized external data exfiltration.
- [REMOTE_CODE_EXECUTION]: No remote code downloads or dynamic execution of untrusted scripts were found. All logic is contained within the skill's implementation or refers to internal monitoring components.
Audit Metadata