skills/founderjourney/claude-skills/digitaliza-data-extractor

digitaliza-data-extractor

SKILL.md

Digitaliza Data Extractor

Extract client data from folders to generate digital business cards for digitalizaweb.vercel.app.

Workflow

Client Folder → Extract Data → Extract Colors → Generate JSON → Review

Step 1: Scan Client Folder

ls -la <client_folder>/

Expected files:

  • datos_extraier.md - Scraped HTML from LinkTree/profile
  • *.png/*.jpg - Screenshots, logo images
  • logo.* - Brand logo

Step 2: Extract Data

Run extraction:

python scripts/extract_client_data.py <client_folder> --pretty

Batch all clients:

python scripts/extract_client_data.py <base_folder> --scan-all --output all_clients.json

Extracts: business name, WhatsApp, links (with icons), locations.

Step 3: Extract Brand Colors

python scripts/extract_colors.py <image_path> --num-colors 5 --output json

Returns: customPrimaryColor, customSecondaryColor, customAccentColor, suggestedTheme.

Step 4: Generate Final JSON

Combine into Digitaliza format:

{
  "slug": "doomo-saltado",
  "name": "Doomo Saltado",
  "phone": "+51014711000",
  "whatsapp": "51014711000",
  "address": "Local en Surco, Lima",
  "logoUrl": "logo.png",
  "theme": "custom",
  "customPrimaryColor": "#dc2626",
  "customSecondaryColor": "#b91c1c",
  "backgroundStyle": "mesh",
  "links": [
    {"title": "Reservar", "url": "https://wa.me/51014711000", "icon": "whatsapp", "order": 0, "isActive": true},
    {"title": "Instagram", "url": "https://instagram.com/doomo", "icon": "instagram", "order": 1, "isActive": true}
  ]
}

Manual Completion

Verify after extraction:

Field Source
name Screenshots or website
whatsapp Country code + number
address Google Maps or screenshot
description 1-2 sentences about business
theme general, italian, mexican, japanese, coffee, hamburguesa, barber, spa, salon

Link Icons

Service Icon
WhatsApp whatsapp
Instagram instagram
Facebook facebook
TikTok tiktok
Google Maps location
UberEats ubereats
Rappi rappi
Menu/Carta menu

Batch Checklist

  1. Scan all: python scripts/extract_client_data.py . --scan-all
  2. Review extraction notes
  3. Extract colors for folders with logos
  4. Flag incomplete data for manual review

Schema Reference

See references/digitaliza_schema.md for complete field definitions.

Weekly Installs
7
GitHub Stars
3
First Seen
Feb 8, 2026
Installed on
opencode5
claude-code5
gemini-cli4
codex4
github-copilot3
amp3