agent-regression-sentry

Pass

Audited by Gen Agent Trust Hub on Mar 5, 2026

Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill instructions include shell commands for running tests (npm test -- --json) and performance benchmarks (npm run perf:test). It also uses Git commands (git checkout, git diff) to switch between codebase versions and identify changed files. These operations are appropriate for the skill's purpose of detecting regressions.
  • [SAFE]: No malicious obfuscation, unauthorized data exfiltration, or hardcoded credentials were detected. The skill's behavior aligns with its stated role as a system stability guardian.
  • [PROMPT_INJECTION]: The skill ingests data from external sources such as test output files (baseline_tests.json, current_tests.json) and git diff results. While these could potentially contain malicious content if the project being tested is compromised, the skill's logic is focused on metric comparison and reporting, posing a low risk.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 5, 2026, 01:13 AM