The Agent Skills Directory

[SAFE]: The skill is purely informational and contains no instructions that could lead to security vulnerabilities. It focuses on benchmarks, evaluation criteria, and model selection workflows without implementing any automated actions.- [NO_CODE]: There are no executable scripts, shell commands, or external dependencies. The Python code block provided is a static data structure definition (dataclass) used for illustrative purposes and does not perform any file system or network operations.

ai-system-evaluation