pii-sanitizer

SKILL.md

PII Sanitizer

Purpose and Intent

The pii-sanitizer is a data protection tool designed to identify and mask Personally Identifiable Information (PII) from datasets, logs, or communications to comply with privacy regulations like GDPR and CCPA.

When to Use

  • Log Scrubbing: Clean application logs before sending them to centralized logging platforms (e.g., ELK, Datadog).
  • Dataset Preparation: Sanitize production data before using it in staging or training environments.
  • Customer Support: Mask sensitive info in support tickets before sharing them with engineering teams.

When NOT to Use

  • Encryption: This is a redaction tool, not an encryption tool. It is for removing data, not securing it for later retrieval.
  • Structured Database Migration: While it handles some structure, specialized ETL tools are better for massive DB sanitization.

Error Conditions and Edge Cases

  • False Positives: Strings that resemble PII (like internal serial numbers) might be accidentally redacted.
  • Ambiguous Context: "Rose" could be a name (PII) or a flower; the tool may err on the side of caution.
  • Encoding Issues: Ensure input text is UTF-8 to avoid detection failures on special characters.

Security and Data-Handling Considerations

  • Zero Retention: Input data must never be saved to disk.
  • Local Processing: Highly recommended to run this within a secure perimeter so sensitive raw data never leaves the local environment.
Weekly Installs
64
First Seen
Jan 30, 2026
Installed on
opencode60
github-copilot59
codex50
amp17
cursor15
gemini-cli14