The Agent Skills Directory

[COMMAND_EXECUTION]: The skill facilitates the execution of local scripts and user-defined evaluation commands to measure experiment performance. This is the core functionality of the tool.
Ingestion points: User input collected via the interactive setup or command arguments, specifically the 'eval' command string (SKILL.md).
Boundary markers: None present in the command interpolation.
Capability inventory: Execution of local Python scripts within the skill directory (scripts/setup_experiment.py) and subsequent execution of the user-provided evaluation command (SKILL.md).
Sanitization: No explicit sanitization or validation of the evaluation command is described in the skill definition.
[DATA_EXPOSURE]: The skill requests access to a user-specified 'target file' to perform optimization, which is consistent with its stated purpose of experiment management.

setup