The Agent Skills Directory

[Unverifiable Dependencies] (MEDIUM): The skill specifies ray[data] and pandas as dependencies. These are hosted by organizations (ray-project, pandas-dev) not included in the pre-approved trusted list defined in the analysis guidelines.
[Indirect Prompt Injection] (LOW): The skill describes processes for ingesting data from cloud storage (S3, GS, Parquet) which could contain malicious instructions.
Ingestion points: ray.data.read_parquet, read_csv, and read_json in SKILL.md.
Boundary markers: Absent from provided transformation and loading examples.
Capability inventory: Support for arbitrary distributed Python execution across compute clusters via the map_batches and map methods.
Sanitization: No data validation, schema enforcement, or content sanitization logic is shown for data entering the pipeline.

ray-data