cv-pipeline-builder
Computer Vision Pipeline Builder
Overview
Builds specialized ML pipelines for computer vision tasks. It handles image preprocessing, data augmentation, model architectures (CNNs and vision transformers), transfer learning, and deployment for production CV systems.
CV Tasks Supported
1. Image Classification
from specweave import CVPipeline
# Binary or multi-class classification
pipeline = CVPipeline(
    task="classification",
    num_classes=10,
    increment="0042"
)
# Automatically configures:
# - Image preprocessing (resize, normalize)
# - Data augmentation (rotation, flip, color jitter)
# - Model architecture (ResNet, EfficientNet, ViT)
# - Transfer learning from ImageNet
# - Training loop with validation
# - Inference pipeline
pipeline.fit(train_images, train_labels)
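The "resize, normalize" step above usually means scaling 8-bit pixel values to [0, 1] and standardizing each channel, commonly with the ImageNet mean and standard deviation when ImageNet-pretrained weights are used. The sketch below shows only the normalization part in plain Python. `normalize_pixel` and `normalize_image` are hypothetical helpers, not part of the specweave API, and whether `CVPipeline` uses these exact statistics is an assumption.

```python
# Widely used ImageNet channel statistics in RGB order.
# Assumption: the pipeline normalizes with these values.
IMAGENET_MEAN = (0.485, 0.456, 0.406)
IMAGENET_STD = (0.229, 0.224, 0.225)


def normalize_pixel(rgb):
    """Scale an 8-bit RGB pixel to [0, 1], then standardize each channel."""
    return tuple(
        (value / 255.0 - mean) / std
        for value, mean, std in zip(rgb, IMAGENET_MEAN, IMAGENET_STD)
    )


def normalize_image(image):
    """Normalize an image given as rows of (R, G, B) tuples."""
    return [[normalize_pixel(pixel) for pixel in row] for row in image]


if __name__ == "__main__":
    # One row with a black and a white pixel.
    image = [[(0, 0, 0), (255, 255, 255)]]
    for pixel in normalize_image(image)[0]:
        print([round(channel, 3) for channel in pixel])
```

Real pipelines would apply the same arithmetic to whole tensors rather than nested lists; the per-pixel form is used here only to keep the example dependency-free.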
2. Object Detection
# Detect multiple objects in images
pipeline = CVPipeline(
    task="object_detection",
    classes=["person", "car", "dog", "cat"],
    increment="0042"
)
# Uses: YOLO, Faster R-CNN, or RetinaNet
# Returns: Bounding boxes + class labels + confidence scores
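Detector output of the kind described above (boxes, labels, confidence scores) is typically post-processed by dropping low-confidence detections and suppressing overlapping boxes of the same class. A minimal sketch of that step follows. The dict layout (`box`, `label`, `confidence`), the `(x1, y1, x2, y2)` box format, and the function names are assumptions for illustration, not the documented return format of `CVPipeline`.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def filter_detections(detections, min_confidence=0.5, iou_threshold=0.5):
    """Drop low-confidence detections, then apply greedy per-class NMS.

    Each detection is a dict with "box", "label", and "confidence" keys
    (a hypothetical layout chosen for this example).
    """
    candidates = sorted(
        (d for d in detections if d["confidence"] >= min_confidence),
        key=lambda d: d["confidence"],
        reverse=True,
    )
    kept = []
    for det in candidates:
        # Suppress a box if a higher-scoring box of the same class overlaps it.
        overlaps = any(
            k["label"] == det["label"] and iou(k["box"], det["box"]) > iou_threshold
            for k in kept
        )
        if not overlaps:
            kept.append(det)
    return kept
```

Both thresholds default to 0.5 here purely as placeholders; suitable values depend on the detector and the application.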
3. Semantic Segmentation
# Pixel-level classification
pipeline = CVPipeline(
    task="segmentation",
    num_classes=21,
    increment="0042"
)
# Uses: U-Net, DeepLab, or SegFormer
# Returns: a segmentation mask with one class label per pixel
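A segmentation model generally produces one score per class at every pixel, and the mask is the index of the highest score at each position. The sketch below shows that conversion plus a simple pixel-accuracy check, using nested lists to stay dependency-free. `scores_to_mask` and `pixel_accuracy` are hypothetical names, and the `scores[y][x][class]` layout is an assumption, not the pipeline's documented output.

```python
def scores_to_mask(scores):
    """Convert per-pixel class scores into a mask of class indices.

    scores[y][x] is a list with one score per class; the mask stores the
    index of the highest score at each pixel (first index wins on ties).
    """
    return [
        [max(range(len(pixel)), key=pixel.__getitem__) for pixel in row]
        for row in scores
    ]


def pixel_accuracy(predicted, target):
    """Fraction of pixels whose predicted class matches the target mask."""
    pairs = [
        (p, t)
        for pred_row, target_row in zip(predicted, target)
        for p, t in zip(pred_row, target_row)
    ]
    return sum(p == t for p, t in pairs) / len(pairs)


if __name__ == "__main__":
    # A 2x2 image with three classes.
    scores = [
        [[0.1, 0.7, 0.2], [0.9, 0.05, 0.05]],
        [[0.2, 0.2, 0.6], [0.3, 0.4, 0.3]],
    ]
    mask = scores_to_mask(scores)
    print(mask)
    print(pixel_accuracy(mask, [[1, 0], [2, 2]]))
```

Pixel accuracy is only one possible metric; per-class intersection over union is often preferred when classes are imbalanced.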
Best Practices for CV
Data Augmentation
from specweave import ImageAugmentation
aug = ImageAugmentation(increment="0042")
# Standard augmentations
aug.add_transforms([
    "random_rotation",  # ±15 degrees
    "random_flip_horizontal",
    "random_brightness",  # ±20%
    "random_contrast",  # ±20%
    "random_crop"
])
# Advanced augmentations
aug.add_advanced([
    "mixup",  # Mix two images
    "cutout",  # Random erasing
    "autoaugment"  # Learned augmentation
])
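To make two of the advanced augmentations above concrete, here is a rough sketch of mixup (a convex blend of two images and their labels) and cutout (zeroing a random square). It operates on single-channel images stored as nested lists of floats. The function signatures, the default `alpha=0.2`, and the `rng` parameter are assumptions for illustration and say nothing about how `ImageAugmentation` implements these transforms.

```python
import random


def mixup(image_a, label_a, image_b, label_b, alpha=0.2, rng=random):
    """Blend two images and their one-hot labels with a Beta(alpha, alpha) weight."""
    lam = rng.betavariate(alpha, alpha)
    image = [
        [lam * a + (1.0 - lam) * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]
    label = [lam * a + (1.0 - lam) * b for a, b in zip(label_a, label_b)]
    return image, label, lam


def cutout(image, size, rng=random):
    """Zero out a random size x size square, clipped at the image border."""
    height, width = len(image), len(image[0])
    top = rng.randrange(height)
    left = rng.randrange(width)
    result = [row[:] for row in image]  # copy so the input is left untouched
    for y in range(top, min(top + size, height)):
        for x in range(left, min(left + size, width)):
            result[y][x] = 0.0
    return result
```

Passing a seeded `random.Random` instance as `rng` makes the augmentations reproducible, which can help when debugging a training run.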
Transfer Learning
# Start from pre-trained ImageNet models
pipeline = CVPipeline(task="classification")
# Option 1: Feature extraction (freeze backbone)
pipeline.use_pretrained(
    model="resnet50",
    freeze_backbone=True
)
# Option 2: Fine-tuning (unfreeze the backbone after a few epochs)
pipeline.use_pretrained(
    model="resnet50",
    freeze_backbone=False,
    fine_tune_after_epoch=3
)
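The two options above differ in when the backbone weights are allowed to update. The sketch below spells out one plausible reading as a per-epoch plan. The semantics are an assumption: with `freeze_backbone=True` the backbone never trains, and with `fine_tune_after_epoch=N` only the head trains for the first N epochs. `backbone_trainable` and `training_plan` are hypothetical helpers, not specweave functions.

```python
def backbone_trainable(epoch, freeze_backbone, fine_tune_after_epoch=None):
    """Return True if backbone weights should receive gradient updates.

    Assumed semantics (not confirmed by the source):
    - freeze_backbone=True: the backbone stays frozen for the whole run.
    - fine_tune_after_epoch=N: only the head trains during 0-indexed
      epochs 0..N-1, then the backbone is unfrozen.
    """
    if freeze_backbone:
        return False
    if fine_tune_after_epoch is None:
        return True
    return epoch >= fine_tune_after_epoch


def training_plan(num_epochs, freeze_backbone, fine_tune_after_epoch=None):
    """List which parameter groups are trained in each epoch."""
    plan = []
    for epoch in range(num_epochs):
        groups = ["head"]
        if backbone_trainable(epoch, freeze_backbone, fine_tune_after_epoch):
            groups.append("backbone")
        plan.append(groups)
    return plan


if __name__ == "__main__":
    # Option 2 from the text: head only at first, then head plus backbone.
    for epoch, groups in enumerate(training_plan(5, False, 3)):
        print(epoch, groups)
```

In a framework such as PyTorch, the frozen state is usually implemented by setting `requires_grad = False` on the backbone parameters and flipping it back when fine-tuning starts.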
Model Selection
Image Classification:
- Small datasets (<10K images): ResNet18, MobileNetV2
- Medium datasets (10K-100K images): ResNet50, EfficientNet-B0
- Large datasets (>100K images): EfficientNet-B3, Vision Transformer
Object Detection:
- Real-time (>30 FPS): YOLOv8, SSDLite
- High accuracy: Faster R-CNN, RetinaNet
Segmentation:
- Medical imaging: U-Net
- Scene segmentation: DeepLabV3, SegFormer
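The selection guidance above can be encoded as a small lookup, which is handy for setting defaults in scripts. This is only a sketch: the function names are hypothetical, and treating exactly 10K and 100K images as "medium" is an assumption, since the list leaves the boundaries open.

```python
def suggest_classifiers(num_images):
    """Map dataset size to the classification backbones listed above."""
    if num_images < 10_000:
        return ["ResNet18", "MobileNetV2"]
    if num_images <= 100_000:
        return ["ResNet50", "EfficientNet-B0"]
    return ["EfficientNet-B3", "Vision Transformer"]


def suggest_detectors(realtime):
    """Pick detectors by whether real-time (>30 FPS) inference is required."""
    return ["YOLOv8", "SSDLite"] if realtime else ["Faster R-CNN", "RetinaNet"]


def suggest_segmenters(domain):
    """Pick segmentation models by domain: "medical" or "scene"."""
    options = {"medical": ["U-Net"], "scene": ["DeepLabV3", "SegFormer"]}
    return options[domain]


if __name__ == "__main__":
    print(suggest_classifiers(25_000))
    print(suggest_detectors(realtime=True))
    print(suggest_segmenters("medical"))
```

These are starting points rather than rules; validation metrics on the actual dataset should drive the final choice.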
Integration with SpecWeave
# CV increment structure
.specweave/increments/0042-image-classifier/
├── spec.md
├── data/
│   ├── train/
│   ├── val/
│   └── test/
├── models/
│   ├── model-v1.pth
│   └── model-v2.pth
├── experiments/
│   ├── baseline-resnet18/
│   ├── resnet50-augmented/
│   └── efficientnet-b0/
└── deployment/
    ├── onnx_model.onnx
    └── inference.py
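For experimenting with this layout outside SpecWeave, the directory skeleton can be created with `pathlib`. The sketch below is an illustration only: `scaffold_increment` is a hypothetical helper, and SpecWeave's own tooling may create or manage these folders differently.

```python
from pathlib import Path
import tempfile

# Directory names copied from the layout above.
SUBDIRS = (
    "data/train",
    "data/val",
    "data/test",
    "models",
    "experiments",
    "deployment",
)


def scaffold_increment(root, increment_id, name):
    """Create the increment folder, its subdirectories, and an empty spec.md."""
    base = Path(root) / ".specweave" / "increments" / f"{increment_id}-{name}"
    for subdir in SUBDIRS:
        (base / subdir).mkdir(parents=True, exist_ok=True)
    (base / "spec.md").touch()
    return base


if __name__ == "__main__":
    # Build the skeleton in a throwaway directory and list what was created.
    with tempfile.TemporaryDirectory() as tmp:
        base = scaffold_increment(tmp, "0042", "image-classifier")
        for path in sorted(base.rglob("*")):
            print(path.relative_to(tmp))
```

Model checkpoints, experiment folders, and the ONNX export are left out on purpose, since those are produced by training and deployment rather than by scaffolding.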
Commands
/ml:cv-pipeline --task classification --model resnet50
/ml:cv-evaluate 0042 # Evaluate on test set
/ml:cv-deploy 0042 # Export to ONNX
Quick setup for CV projects with production-ready pipelines.