The Agent Skills Directory

[SAFE]: The skill provides a structured methodology for A/B testing AI cognitive patterns such as Tree of Thoughts (ToT) and Breadth of Thought (BoT).
[DATA_EXPOSURE]: The framework includes logic for persisting benchmark results to a local database using ChromaDB. It does not access sensitive system files or include hardcoded credentials.
[REMOTE_CODE_EXECUTION]: Python code snippets are provided for statistical calculations and data management. There are no patterns involving the download or execution of remote scripts from untrusted sources.
[PROMPT_INJECTION]: No instructions were found that attempt to override system prompts or bypass safety guidelines. The 'adversarial reasoning' section is clearly defined as a category for performance testing rather than an exploit.

benchmark-framework