openrlhf-training
Audited by Socket on Feb 15, 2026
2 alerts found:
MalwareAnomaly
This skill/documentation appears legitimate and aligned with its stated purpose of distributed RLHF training; no direct malicious code patterns are present in the provided text. The main security concerns are operational: examples that run Docker with the SYS_ADMIN capability, `sudo` uninstall commands, and implicit trust in externally hosted model weights and datasets. Users should avoid blindly running privileged commands, verify the sources of model checkpoints and datasets, and run the training stack in isolated environments. Overall risk is moderate on the operational side but not indicative of malware.
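A minimal sketch of the kind of isolated invocation the alert recommends. The image name and entrypoint here are placeholders, not taken from the audited skill; the point is the hardening flags, which are standard `docker run` options: drop capabilities rather than adding SYS_ADMIN, and cut off network and write access while evaluating untrusted code.

```shell
# Hypothetical hardened invocation (image/command are illustrative):
# drop all capabilities, disable networking, and mount the root
# filesystem read-only so untrusted code cannot modify the host image.
docker run --rm \
  --cap-drop=ALL \
  --network=none \
  --read-only \
  some-rlhf-image:latest \
  python train.py
```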
This file is documentation and examples for implementing reward functions and agent logic. It contains one high-risk pattern: executing model-generated code by writing it to a tempfile and running it via `subprocess.run` with pytest, which enables arbitrary code execution on the host and potential data exfiltration or system modification. The other examples are benign algorithmic reward computations or uses of evaluation models, though logging and model loading can still leak data or cause network activity. No signs of obfuscated or intentionally malicious code were found, but the code-execution example constitutes a significant security hazard if used without sandboxing and careful privilege, network, and logging controls.
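A minimal sketch of the pattern this alert describes, with a hard-coded stand-in for the model's output and a plain Python invocation in place of pytest so the sketch stays self-contained. In the audited examples the code comes from the policy model, so whatever it emits runs with the host's full privileges: it can read files, open sockets, or modify the system.

```python
import os
import subprocess
import sys
import tempfile
import textwrap

# Stand-in for model-generated code; in the audited examples this
# string comes from the policy model, not a trusted source.
generated_code = textwrap.dedent("""
    def test_add():
        assert 1 + 1 == 2
""")

# Write the untrusted code to a tempfile on the host.
with tempfile.NamedTemporaryFile(
    "w", suffix="_test.py", delete=False
) as f:
    f.write(generated_code)
    path = f.name

# The audited pattern runs pytest on the file, e.g.:
#   subprocess.run([sys.executable, "-m", "pytest", path])
# Here we execute it with the interpreter directly to keep the
# sketch dependency-free; the hazard is identical either way.
result = subprocess.run(
    [sys.executable, path], capture_output=True, text=True
)
reward = 1.0 if result.returncode == 0 else 0.0
os.unlink(path)
```

Without a sandbox (container, seccomp profile, network cutoff, unprivileged user), this grants the model arbitrary code execution on the host; the exit-code-to-reward mapping is the usual reason this pattern appears in reward functions.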