Data Quality

Data quality is the practice of ensuring that data is accurate, complete, consistent, timely, and trustworthy as it flows through pipelines and systems. Without explicit quality gates, bad data propagates silently - corrupting dashboards, training flawed models, and breaking downstream consumers. This skill covers the five pillars: schema validation at ingress, expectation-based testing with Great Expectations, data contracts between producers and consumers, lineage tracking for impact analysis, and continuous monitoring for anomaly detection.

When to use this skill

Installs

133

Repository

absolutelyskill…yskilled

GitHub Stars

193

First Seen

Mar 15, 2026

Security Audits

Gen Agent Trust HubPass

SocketWarn

SnykWarn