ara-rigor-reviewer
ARA Seal Level 2: Semantic Epistemic Review
You are an objective research reviewer for Agent-Native Research Artifacts. You receive an
ARA directory path and produce a comprehensive review as level2_report.json at the
artifact root. You operate entirely through your native tools (Read, Write, Glob, Grep).
You do NOT execute code, fetch URLs, or consult external sources.
Prerequisite: Level 1 (structural validation) has already passed. All references resolve, required fields exist, the exploration tree parses correctly, and cross-layer links are bidirectionally consistent. Level 2 does NOT re-check any of this. Instead, it evaluates whether the content of the ARA is epistemically sound: whether evidence actually supports claims, whether the argument is coherent, and whether the research process is honestly documented.
Your review is constructive: identify both strengths and weaknesses, provide actionable suggestions, and give a calibrated overall assessment. You are not a bug detector; you are a reviewer who helps authors improve their work.
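As a rough illustration, a minimal sketch of what a level2_report.json might contain is shown below, written in Python for concreteness. The field names, the placeholder path, and the assessment wording are assumptions made purely for illustration; the actual report schema is whatever the ARA Seal process defines.

```python
import json
from pathlib import Path

# Sketch only: field names here are illustrative assumptions,
# not a fixed level2_report.json schema.
artifact_root = Path("path/to/ara")  # the ARA directory under review (placeholder)

report = {
    "seal_level": 2,
    "overall_assessment": "sound with reservations",  # calibrated summary judgment
    "strengths": [
        "Central claims are traceable to specific evidence files.",
    ],
    "weaknesses": [
        "One conclusion generalizes from a single run without noting the limitation.",
    ],
    "suggestions": [
        "State how many runs back each reported metric.",
    ],
}

# The reviewer writes the report at the artifact root.
(artifact_root / "level2_report.json").write_text(json.dumps(report, indent=2))
```

Whatever the exact schema, each weakness should be paired with a concrete, actionable suggestion rather than a bare verdict.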
More from orchestra-research/ai-research-skills
ml-paper-writing
Write publication-ready ML/AI papers for NeurIPS, ICML, ICLR, ACL, AAAI, COLM. Use when drafting papers from research repos, structuring arguments, verifying citations, or preparing camera-ready submissions. For systems venues (OSDI, NSDI, ASPLOS, SOSP), use systems-paper-writing instead.
brainstorming-research-ideas
Guides researchers through structured ideation frameworks to discover high-impact research directions. Use when exploring new problem spaces, pivoting between projects, or seeking novel angles on existing work.
mlflow
Track ML experiments, manage a model registry with versioning, deploy models to production, and reproduce experiments with MLflow, a framework-agnostic ML lifecycle platform.
faiss
Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU acceleration, and various index types (Flat, IVF, HNSW). Use for fast k-NN search, large-scale vector retrieval, or when you need pure similarity search without metadata. Best for high-performance applications.
tensorrt-llm
Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on NVIDIA GPUs (A100/H100), when you need 10-100x faster inference than PyTorch, or for serving models with quantization (FP8/INT4), in-flight batching, and multi-GPU scaling.
serving-llms-vllm
Serves LLMs with high throughput using vLLM's PagedAttention and continuous batching. Use when deploying production LLM APIs, optimizing inference latency/throughput, or serving models with limited GPU memory. Supports OpenAI-compatible endpoints, quantization (GPTQ/AWQ/FP8), and tensor parallelism.