skills/mlflow/skills/agent-evaluation/Gen Agent Trust Hub

agent-evaluation

Pass

Audited by Gen Agent Trust Hub on Mar 7, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: No security issues detected. The skill's behavior is consistent with its stated purpose of agent evaluation using official MLflow and Databricks tools.\n- [COMMAND_EXECUTION]: Utilizes subprocess.run to execute local MLflow CLI commands and Python utilities for environment introspection and automation, which is standard for local development tools.\n- [EXTERNAL_DOWNLOADS]: Accesses documentation and guidelines from mlflow.org, which is a well-known and trusted technology provider.\n- [REMOTE_CODE_EXECUTION]: Generates and executes local Python scripts from templates to facilitate evaluation workflows. This behavior is restricted to the local environment and handles user-provided data for the purpose of automation.
Audit Metadata
Risk Level
SAFE
Analyzed
Mar 7, 2026, 11:39 PM