synthetic-data-generation

Fail

Audited by Socket on Feb 16, 2026

1 alert found:

Malware
MalwareHIGH
SKILL.md

[Skill Scanner] Installation of third-party script detected All findings: [CRITICAL] command_injection: Installation of third-party script detected (SC006) [AITech 9.1.4] [CRITICAL] command_injection: Installation of third-party script detected (SC006) [AITech 9.1.4] No evidence of malicious code or credential harvesting. The script is coherent with its stated purpose (synthetic data generation for Databricks). Primary security concern is operational: the script runs metastore DDL and uses overwrite writes to volumes, which can alter or erase real data if the provided catalog/schema/volume point to production resources. Recommend: run in an isolated/testing workspace, require explicit confirmation of target catalog/schema, and avoid overwrite mode on unknown targets. LLM verification: This SKILL.md appears functionally consistent with its stated purpose (generating synthetic data for Databricks). It is not itself malicious, but it describes an execution pattern that, if misused, enables high-risk actions: arbitrary code execution on Databricks clusters and unpinned dependency installation from PyPI. The highest concerns are unpinned package installs (dependency-supply-chain risk), persistence of installed libraries across contexts, and constructing SQL/paths via unvalidated i

Confidence: 95%Severity: 90%
Audit Metadata
Analyzed At
Feb 16, 2026, 04:20 PM
Package URL
pkg:socket/skills-sh/databricks-solutions%2Fai-dev-kit%2Fsynthetic-data-generation%2F@75f1d85848203d03cab9c7bf4a78ff749b89cf21