alignment-values-north-star

Pass

Audited by Gen Agent Trust Hub on Feb 17, 2026

Risk Level: SAFE
Full Analysis
  • Prompt Injection (SAFE): No instructions were found that attempt to override AI safety constraints or extract system prompts.
  • Data Exposure & Exfiltration (SAFE): There are no hardcoded credentials, sensitive file paths (e.g., .ssh, .aws), or network-related commands like curl or wget.
  • Obfuscation (SAFE): No Base64 encoding, zero-width characters, or homoglyphs were detected.
  • Unverifiable Dependencies & Remote Code Execution (SAFE): The skill does not install third-party packages or execute remote scripts.
  • Privilege Escalation (SAFE): No administrative commands such as sudo or chmod are present.
  • Indirect Prompt Injection (SAFE): While the skill processes user input to create alignment artifacts, it lacks high-privilege capabilities (like shell access or file-system writing) that could be exploited via data injection.
Audit Metadata
Risk Level
SAFE
Analyzed
Feb 17, 2026, 06:06 PM