skills/pluginagentmarketplace/custom-plugin-data-engineer

pluginagentmarketplace/custom-plugin-data-engineer

21 skills91 total installsGithubGithubGitHub

python-programming

Master Python fundamentals, OOP, data structures, async programming, and production-grade scripting for data engineering

6

big-data

Apache Spark, Hadoop, distributed computing, and large-scale data processing for petabyte-scale workloads

5

etl-tools

Apache Airflow, dbt, Prefect, Dagster, and modern data orchestration for production data pipelines

5

api-development

FastAPI, REST APIs, GraphQL, data service design, and API best practices

5

containerization

Docker, Kubernetes, container orchestration, and cloud-native deployment for data applications

5

sql-databases

SQL query optimization, schema design, indexing strategies, and relational database mastery for production data systems

5

nosql-databases

MongoDB, Redis, Cassandra, DynamoDB, and distributed database patterns for scalable applications

5

data-engineering

Data pipeline architecture, ETL/ELT patterns, data modeling, and production data platform design

5

mlops

MLflow, model versioning, experiment tracking, model registry, and production ML systems

4

iac-automation

Terraform, Pulumi, CloudFormation, and infrastructure as code for data platforms

4

testing-quality

pytest, data validation, Great Expectations, and quality assurance for data systems

4

cloud-platforms

AWS, GCP, Azure data platforms, infrastructure as code, and cloud-native data solutions

4

statistics-math

Statistics, probability, linear algebra, and mathematical foundations for data science

4

deep-learning

PyTorch, TensorFlow, neural networks, CNNs, transformers, and deep learning for production

4

data-warehousing

Snowflake, BigQuery, Redshift, dimensional modeling, and modern data warehouse architecture

4

cicd-pipelines

GitHub Actions, GitLab CI, Jenkins, and automated deployment pipelines

4

career-growth

Portfolio building, technical interviews, job search strategies, and continuous learning

4

llms-generative-ai

LLMs, prompt engineering, RAG systems, LangChain, and AI application development

4

git-version-control

Git workflows, branching strategies, collaboration, and code management

4

machine-learning

Python machine learning with scikit-learn, PyTorch, and TensorFlow

3

monitoring-observability

Monitoring and observability strategy, implementation, and troubleshooting. Use for designing metrics/logs/traces systems, setting up Prometheus/Grafana/Loki, creating alerts and dashboards, calculating SLOs and error budgets, analyzing performance issues, and comparing monitoring tools (Datadog, ELK, CloudWatch). Covers the Four Golden Signals, RED/USE methods, OpenTelemetry instrumentation, log aggregation patterns, and distributed tracing.

3