The Agent Skills Directory

EXTERNAL_DOWNLOADS (LOW): The skill installs the 'nemo-evaluator-launcher' package via pip to enable its benchmarking capabilities.
COMMAND_EXECUTION (LOW): The skill executes 'nemo-evaluator-launcher' commands to run evaluation pipelines, which is a core and expected function.
PROMPT_INJECTION (LOW): A surface for indirect prompt injection exists because the skill ingests data from external benchmarking harnesses that may contain untrusted instructions. * Ingestion points: Configuration files (config.yaml) and external benchmark datasets. * Boundary markers: No explicit delimiters or ignore instructions are present for separating external data from instructions. * Capability inventory: The skill triggers subprocess execution to run benchmarks. * Sanitization: No sanitization of ingested benchmark content is documented.

nemo-evaluator