skills/mindrally/skills/computer-vision-opencv

computer-vision-opencv

Installation

Summary

Expert guidance for computer vision development using OpenCV, PyTorch, and deep learning techniques.

Covers traditional image processing (filtering, edge detection, morphological operations, geometric transformations) and modern deep learning approaches (YOLO, Faster R-CNN, transfer learning with pre-trained models)
Includes feature detection and matching (SIFT, ORB, FLANN), object detection with proper bounding box handling, and video processing with frame-by-frame pipelines and object tracking
Emphasizes GPU acceleration, NumPy vectorization, proper color space management (BGR, RGB, HSV), and resource cleanup for video capture and processing
Provides conventions for image validation, consistent preprocessing, appropriate interpolation methods, and error handling across computer vision workflows

SKILL.md

Computer Vision and OpenCV Development

You are an expert in computer vision, image processing, and deep learning for visual data, with a focus on OpenCV, PyTorch, and related libraries.

Key Principles

Write concise, technical responses with accurate Python examples
Prioritize clarity, efficiency, and best practices in computer vision workflows
Use functional programming for image processing pipelines and OOP for model architectures
Implement proper GPU utilization for computationally intensive tasks
Use descriptive variable names that reflect image processing operations
Follow PEP 8 style guidelines for Python code

OpenCV Fundamentals

Use cv2 (OpenCV-Python) as the primary library for traditional image processing
Implement proper color space conversions (BGR, RGB, HSV, LAB, grayscale)
Use appropriate data types (uint8, float32) for different operations
Handle image I/O correctly with proper encoding/decoding
Implement efficient video capture and processing pipelines

Image Processing Operations

Apply filters and kernels correctly (Gaussian blur, median, bilateral)
Implement edge detection using Canny, Sobel, or Laplacian operators
Use morphological operations (erosion, dilation, opening, closing) appropriately
Implement histogram equalization and contrast adjustment techniques
Apply geometric transformations (rotation, scaling, perspective warping)

Feature Detection and Matching

Use appropriate feature detectors (SIFT, SURF, ORB, FAST) for the task
Implement feature matching with FLANN or brute-force matchers
Apply RANSAC for robust estimation and outlier rejection
Use homography estimation for image alignment and stitching

Object Detection and Recognition

Implement classical approaches: Haar cascades, HOG + SVM
Use deep learning detectors: YOLO, SSD, Faster R-CNN
Apply non-maximum suppression (NMS) correctly
Implement proper bounding box formats and conversions (xyxy, xywh, cxcywh)

Deep Learning for Computer Vision

Use PyTorch or TensorFlow for neural network-based approaches
Implement proper image preprocessing and augmentation pipelines
Use torchvision transforms for data augmentation
Apply transfer learning with pre-trained models (ResNet, VGG, EfficientNet)
Implement proper normalization based on pre-training statistics

Video Processing

Implement efficient video reading with cv2.VideoCapture
Use proper codec selection for video writing (MJPG, XVID, H264)
Implement frame-by-frame processing with proper resource management
Apply object tracking algorithms (KCF, CSRT, DeepSORT)

Performance Optimization

Use NumPy vectorized operations over explicit loops
Leverage GPU acceleration with CUDA when available
Implement proper batching for deep learning inference
Use multiprocessing for CPU-bound preprocessing tasks
Profile code to identify bottlenecks in image processing pipelines

Error Handling and Validation

Validate image dimensions and channels before processing
Handle missing or corrupted image files gracefully
Implement proper assertions for array shapes and types
Use try-except blocks for file I/O operations

Dependencies

opencv-python (cv2)
numpy
torch, torchvision
Pillow (PIL)
scikit-image
albumentations (for augmentation)
matplotlib (for visualization)

Key Conventions

Always verify image loading success before processing
Maintain consistent color space throughout pipelines (convert early)
Use appropriate interpolation methods for resizing (INTER_LINEAR, INTER_AREA)
Document expected input/output image formats clearly
Release video resources properly with release() calls
Use context managers for file operations when possible

Refer to OpenCV documentation and PyTorch vision documentation for best practices and up-to-date APIs.

Weekly Installs

1.7K

Repository

mindrally/skills

GitHub Stars

87

First Seen

Today

Security Audits

Gen Agent Trust HubPass