ml-systems
SKILL.md
ML Systems
Building production-ready machine learning systems.
Overview
This skill category covers the complete ML system lifecycle:
- Foundations - Core concepts, architectures, paradigms
- Data Engineering - Data collection, quality, feature engineering
- Model Development - Training, evaluation, frameworks
- Performance - Optimization, acceleration, efficiency
- Deployment - Serving, edge deployment, scaling
- Operations - MLOps, monitoring, reliability
Categories
Foundations
ml-systems-fundamentals- Core ML systems conceptsdeep-learning-primer- Deep learning foundationsdnn-architectures- Neural network architecturesdeployment-paradigms- Deployment patterns
Data Engineering
data-engineering- Data pipelines and qualitytraining-data- Training data managementfeature-engineering- Feature creation and stores
Model Development
ml-workflow- ML development workflowmodel-development- Model training and selectionml-frameworks- Framework best practices
Performance
efficient-ai- Efficiency techniquesmodel-optimization- Quantization, pruning, distillationai-accelerators- Hardware acceleration
Deployment
model-deployment- Production deploymentinference-optimization- Inference optimizationedge-deployment- Edge and mobile deployment
Operations
mlops- ML operations and lifecyclerobust-ai- Reliability and robustness
Key Principles
- Data-Centric AI - Focus on data quality over model complexity
- Iterative Development - Start simple, iterate based on metrics
- Production-First - Design for deployment from the start
- Monitoring - Continuous monitoring and improvement
- Reproducibility - Version everything (data, code, models)
References
- Harvard CS 329S: Machine Learning Systems Design
- Designing Machine Learning Systems by Chip Huyen
- MLOps: Continuous Delivery and Automation Pipelines
Weekly Installs
3
Repository
doanchienthangdev/omgkitGitHub Stars
3
First Seen
Feb 20, 2026
Security Audits
Installed on
opencode3
antigravity3
claude-code3
github-copilot3
codex3
amp3