The Agent Skills Directory

AWS Service Integration: The skill interacts with various AWS services (Glue, S3, Athena, DynamoDB, STS) using standard CLI commands and boto3. It confirms identity via sts get-caller-identity and manages resources through established service APIs, which is typical for data engineering operations.
Secure Credential Management: The skill emphasizes using existing Glue connections and AWS Secrets Manager for database authentication. It specifically warns against hardcoding credentials in PySpark scripts, adhering to AWS security best practices.
Data Ingestion Surface Analysis: As a tool that processes external data from S3 and remote databases, the skill includes extensive guidance on data quality validation and type transformation. These measures, documented in references/data-quality-validation.md and references/type-transformations.md, help mitigate risks associated with processing untrusted input.
ETL Automation via Templates: The skill generates AWS Glue PySpark scripts using predefined templates provided in the reference files. This automation is confined to the intended purpose of data migration and relies on local templates rather than external code sources.
Validated Execution Environment: The instructions prioritize the use of AWS MCP server tools for command execution, which provide sandboxed environments and audit logging to ensure operations are performed safely and transparently.

ingesting-into-data-lake