---
name: engineering-ml-features
description: Use this skill for creating, transforming, and selecting features that improve model performance. Covers categorical encoding, numeric scaling, datetime engineering, text features, and building leakage-safe pipelines.
---

# Engineering ML Features
## When to use this skill
- Categorical variables need encoding for ML algorithms
- Numeric features require scaling or transformation
- Datetime columns need conversion to meaningful features
- Text data needs to be converted to numerical representations
- Preventing data leakage during feature engineering
- Selecting the most predictive features from a large set
- Building reusable, production-ready preprocessing pipelines
## When NOT to use this skill

- General data exploration → use @analyzing-data
- Model evaluation and selection → use @evaluating-ml-models
- Building interactive data apps → use @building-data-apps
- Notebook setup and workflows → use @working-in-notebooks
## Quick tool selection
| Task | Default choice | Notes |
|---|---|---|
| Categorical encoding | category_encoders | Beyond sklearn's limited options |
| Feature scaling | sklearn.preprocessing | Standard, Robust, Power transforms |
| Pipeline composition | sklearn.pipeline + ColumnTransformer | Reproducible, CV-safe |
| Text vectorization | sklearn.feature_extraction.text | TF-IDF, CountVectorizer |
| Text embeddings | sentence-transformers | Pre-trained semantic embeddings |
| Feature selection | sklearn.feature_selection | Mutual info, RFE, SelectFromModel |
## Feature engineering workflows

### 1. Categorical encoding
- Low cardinality (< 10-15 categories): one-hot encoding
- High cardinality (15-100+ categories): target encoding or frequency encoding
- Ordinal: ordinal encoding with explicit category order
```python
from sklearn.preprocessing import OneHotEncoder, OrdinalEncoder
from category_encoders import TargetEncoder

# One-hot for low cardinality
ohe = OneHotEncoder(handle_unknown='ignore', sparse_output=False)

# Target encoding for high cardinality
te = TargetEncoder(smoothing=10)

# Ordinal for ordered categories
ord_enc = OrdinalEncoder(categories=[['low', 'medium', 'high']])
```
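A minimal sketch of how explicit category order plays out, using a hypothetical `priority` column (the data and column name are illustrative):

```python
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder

# Toy data with an inherently ordered category
df = pd.DataFrame({"priority": ["low", "high", "medium", "low"]})

# Passing the category order explicitly maps low=0, medium=1, high=2
ord_enc = OrdinalEncoder(categories=[["low", "medium", "high"]])
encoded = ord_enc.fit_transform(df[["priority"]])
print(encoded.ravel().tolist())  # [0.0, 2.0, 1.0, 0.0]
```

Without the explicit `categories` list, sklearn would assign codes alphabetically (high=0, low=1, medium=2), which scrambles the ordering a downstream model sees.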
### 2. Numeric scaling and transformation
| Method | Use When | Algorithm Impact |
|---|---|---|
| StandardScaler | Features normally distributed, outliers rare | Required for SVM, neural nets, PCA |
| RobustScaler | Outliers present, want median/IQR centering | Same as Standard, more robust |
| MinMaxScaler | Need bounded range [0,1] or [-1,1] | Neural nets, image data |
| PowerTransformer | Skewed distributions, want normality | Improves linear model performance |
| QuantileTransformer | Heavy tails, want uniform/normal | Tree models unaffected, linear improves |
```python
from sklearn.preprocessing import StandardScaler, RobustScaler, PowerTransformer

# Standard scaling
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X_train)

# Power transform for skewness
pt = PowerTransformer(method='yeo-johnson')
X_transformed = pt.fit_transform(X_train)
```
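To see why the table recommends RobustScaler when outliers are present, here is a small sketch on made-up data with one extreme value:

```python
import numpy as np
from sklearn.preprocessing import StandardScaler, RobustScaler

# Hypothetical 1-D feature with one large outlier
X = np.array([[1.0], [2.0], [3.0], [4.0], [100.0]])

std = StandardScaler().fit_transform(X)
rob = RobustScaler().fit_transform(X)

# The outlier drags the mean and inflates the std, so standard scaling
# squashes the four inliers into a narrow band near -0.5; robust scaling
# centers on the median (3) and divides by the IQR (2), leaving the
# inliers well spread out
print(std[:4].ravel())  # all close to -0.5
print(rob[:4].ravel())  # [-1.0, -0.5, 0.0, 0.5]
```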
### 3. Datetime feature engineering
Extract components and encode cyclical patterns:
```python
import numpy as np

# Assumes df has a datetime64 'timestamp' column

# Component extraction
df['year'] = df['timestamp'].dt.year
df['month'] = df['timestamp'].dt.month
df['dayofweek'] = df['timestamp'].dt.dayofweek
df['hour'] = df['timestamp'].dt.hour

# Cyclical encoding (preserves circular nature)
df['month_sin'] = np.sin(2 * np.pi * df['month'] / 12)
df['month_cos'] = np.cos(2 * np.pi * df['month'] / 12)

# Duration features
df['days_since_start'] = (df['timestamp'] - df['timestamp'].min()).dt.days
```
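A quick sketch of what the cyclical encoding buys you: hours 23 and 0 are one hour apart in time but 23 apart numerically, and the sin/cos pair makes them neighbors again (the helper function here is illustrative):

```python
import numpy as np

def hour_to_cyclic(hour):
    # Map an hour [0, 24) onto a point on the unit circle
    angle = 2 * np.pi * hour / 24
    return np.array([np.sin(angle), np.cos(angle)])

h23, h0, h12 = hour_to_cyclic(23), hour_to_cyclic(0), hour_to_cyclic(12)

# Adjacent hours end up close; opposite hours end up far apart
print(np.linalg.norm(h23 - h0))  # small (~0.26)
print(np.linalg.norm(h12 - h0))  # maximal (2.0)
```

A model using the raw `hour` column would treat 23 → 0 as a jump of 23; in sin/cos space the distance matches the actual one-hour gap.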
### 4. Text feature engineering
```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sentence_transformers import SentenceTransformer

# TF-IDF for classical NLP
vectorizer = TfidfVectorizer(max_features=1000, ngram_range=(1, 2))
X_tfidf = vectorizer.fit_transform(texts)

# Embeddings for semantic similarity
model = SentenceTransformer('all-MiniLM-L6-v2')
embeddings = model.encode(texts, show_progress_bar=True)

# Basic text statistics
df['text_length'] = df['text'].str.len()
df['word_count'] = df['text'].str.split().str.len()
```
### 5. Leakage-safe pipelines
Critical rule: Always fit on training data only, transform on all data.
```python
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler, OneHotEncoder

# Define preprocessing for each column type
preprocessor = ColumnTransformer([
    ('num', StandardScaler(), numerical_features),
    ('cat', OneHotEncoder(handle_unknown='ignore'), categorical_features)
])

# Full pipeline
pipeline = Pipeline([
    ('prep', preprocessor),
    ('model', RandomForestClassifier())
])

# Correct: fit on train only
pipeline.fit(X_train, y_train)

# The fitted pipeline transforms test data internally
y_pred = pipeline.predict(X_test)  # No manual transform needed
```
CV-safe cross-validation:
```python
from sklearn.model_selection import cross_val_score

# Pipeline ensures preprocessing happens within each CV fold
scores = cross_val_score(pipeline, X, y, cv=5)
```
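A runnable end-to-end sketch of the pattern above, using synthetic data and a logistic regression in place of a real dataset and model:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic data stands in for a real feature matrix
X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# Because scaling lives inside the pipeline, each CV fold refits the
# scaler on its own training split only -- the validation fold never
# leaks into the scaler's statistics
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("model", LogisticRegression(max_iter=1000)),
])
scores = cross_val_score(pipe, X, y, cv=5)
print(scores.mean())
```

Contrast this with calling `StandardScaler().fit_transform(X)` before `cross_val_score`: that fits the scaler on all rows, so every fold's validation data has already influenced the preprocessing.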
### 6. Feature selection
| Method | Description | Best For |
|---|---|---|
| Filter (mutual_info) | Statistical measure vs target | Quick screening, many features |
| Filter (correlation) | Linear correlation with target | Linear models, fast baseline |
| Wrapper (RFE) | Recursive feature elimination | Small-medium feature sets |
| Embedded (L1) | Lasso zeroes out features | Linear models with sparsity |
| Embedded (tree) | Feature importance from trees | Tree-based models |
```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectKBest, mutual_info_classif, RFE
from sklearn.linear_model import Lasso

# Mutual information filter
selector = SelectKBest(mutual_info_classif, k=20)
X_selected = selector.fit_transform(X_train, y_train)

# Recursive feature elimination
rfe = RFE(estimator=RandomForestClassifier(), n_features_to_select=20)
X_rfe = rfe.fit_transform(X_train, y_train)

# L1 regularization (embedded)
lasso = Lasso(alpha=0.01)
lasso.fit(X_train, y_train)
selected_features = X_train.columns[lasso.coef_ != 0]
```
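Selectors return a plain array, so recovering *which* columns survived takes one extra step. A minimal sketch with synthetic data and illustrative column names:

```python
import pandas as pd
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, mutual_info_classif

# Synthetic frame with named columns (names are illustrative)
X, y = make_classification(n_samples=100, n_features=6, random_state=0)
X_train = pd.DataFrame(X, columns=[f"f{i}" for i in range(6)])

selector = SelectKBest(mutual_info_classif, k=3)
selector.fit(X_train, y)

# get_support() yields a boolean mask aligned with the input columns,
# which maps the selection back to human-readable names
selected = X_train.columns[selector.get_support()].tolist()
print(selected)  # 3 of the 6 column names
```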
## Core implementation rules

### 1. Prevent data leakage
❌ Wrong: fitting encoders/scalers on the full dataset
✅ Right: `fit_transform()` on train, `transform()` on test

```python
# Train: fit and transform
X_train_scaled = scaler.fit_transform(X_train)

# Test: ONLY transform!
X_test_scaled = scaler.transform(X_test)
```
### 2. Handle unknown categories
```python
# Unknown categories become all zeros
OneHotEncoder(handle_unknown='ignore')

# Unknown categories grouped with rare ones
OneHotEncoder(handle_unknown='infrequent_if_exist', min_frequency=0.01)
```
### 3. Track feature names through pipelines
```python
# Get feature names after ColumnTransformer
feature_names = preprocessor.get_feature_names_out()
```
### 4. Document feature importance
Track which features were created, why, and their expected impact on model performance.
## Common anti-patterns
| Anti-pattern | Solution |
|---|---|
| ❌ Fitting preprocessors on full dataset | Use train/test split before any fitting |
| ❌ One-hot encoding high-cardinality features (>100 categories) | Use target encoding or frequency encoding |
| ❌ Ignoring scaling for distance-based models | Always scale for SVM, k-NN, neural nets, PCA |
| ❌ Creating features without domain reasoning | Validate features make business sense |
| ❌ Not validating feature distributions match between train/test | Use distribution tests or visual comparison |
| ❌ Target encoding without smoothing | Use smoothing parameter to handle rare categories |
| ❌ Forgetting cyclical encoding for time | Use sin/cos for hour, dayofweek, month |
## Progressive disclosure

Reference guides for detailed implementations:

- references/categorical-encoding.md — Comprehensive encoding strategies and selection guidance
- references/datetime-features.md — Time-based feature patterns and cyclical encoding
- references/text-features.md — NLP feature engineering with TF-IDF and embeddings
- references/feature-selection.md — Selection strategies and implementation patterns
## Related skills

- @analyzing-data — Understand data before engineering features
- @evaluating-ml-models — Validate feature impact on model performance
- @building-data-pipelines — Data processing fundamentals and pipeline patterns