ai-engineer
AI Engineering Standards
This skill provides guidelines for building production-grade GenAI, Agentic Systems, Advanced RAG, and rigorous Evaluation pipelines. Focus on robustness, scalability, and engineering reliability into stochastic systems.
Core Responsibilities
- Agentic Systems & Architecture: Designing multi-agent workflows, planning capabilities, and reliable tool-use patterns.
- Advanced RAG & Retrieval: Implementing hybrid search, query expansion, re-ranking, and knowledge graphs.
- Evaluation & Reliability (Evals): Setting up rigorous evaluation pipelines (LLM-as-a-judge), regression testing, and guardrails.
- Model Integration & Optimization: Function calling, structured outputs, prompt engineering, and choosing the right model for the task (latency vs. intelligence trade-offs).
- MLOps & Serving: Observability, tracing, caching, and cost management.
Dynamic Stack Loading
- Agentic Patterns: Principles for reliable agents
- Advanced RAG: Techniques for high-recall retrieval
- Evaluation Frameworks: Testing & Metrics
- Serving & Optimization: Performance & MLOps
- LLM Fundamentals: Prompting & SDKs
More from kienhaminh/anti-chaotic
business-analysis
Use for deep requirements analysis, technical specifications, agile documentation, or architecture/design reviews.
72qa-tester
Use when planning tests, creating test cases, reporting bugs, or executing Unit/E2E/Security/Performance tests.
24backend-developer
Use when designing APIs, Architecture, Security, or Scalability for Node, Python, Go, or Java backend systems.
21lead-architect
Use for high-stakes technical decisions, system design (Microservices/Monolith), cloud infrastructure, or generating ADRs/RFCs.
11frontend-developer
Use when implementing frontend code, optimizing performance, or ensuring strict adherence to documentation/references.
11blockchain-engineer
Use when architecturalizing protocols, developing smart contracts, or auditing security for blockchain systems.
11