verify-tests-fail-without-fix

Pass

Audited by Gen Agent Trust Hub on Mar 18, 2026

Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
  • Command Execution: The skill utilizes PowerShell (pwsh) to orchestrate test runs and repository management. It executes scripts located within the repository's .github/scripts/ directory, which is a standard pattern for repository-level automation and CI/CD tasks.
  • Repository State Management: The verification process involves using git checkout to temporarily revert 'fix' files to a previous state and then restore them. This technique is used to prove that tests fail when the bug is present and pass when it is resolved, which is the primary intended function of the skill.
  • GitHub Integration: The skill uses the GitHub CLI (gh) to retrieve Pull Request metadata and manage labels. These interactions are focused on providing feedback within the development workflow and are restricted to standard PR management operations.
  • Input Processing: The script auto-detects configuration values by analyzing branch names and git diffs. It includes safety measures, such as regex matching for PR numbers, to ensure that data ingested from the environment is processed correctly before being used in commands.
  • File System Interaction: The skill generates diagnostic reports and logs in the CustomAgentLogsTmp directory. These files provide visibility into the test results and are used to inform the user or PR agent of the verification status.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 18, 2026, 12:15 PM