ai-data-foundation-audit
AI Data Foundation Audit
Use when
- Audit and prepare client data for AI marketing readiness. Produces a completed data hygiene checklist, a data mapping framework, and a 30-day remediation plan so that the client's data is clean and structured before any AI marketing tool is deployed. Invoke before any AI marketing tool deployment — this skill establishes the data quality foundation that all AI marketing activities depend on. Also invoke when a client reports inconsistent AI outputs, poor personalisation results, or chatbot errors.
- Use this skill when it is the closest match to the requested deliverable or workflow.
Do not use when
- Do not use this skill for graphic design, video production, software development, or legal advice beyond the repository's stated scope.
- Do not use it when another skill in this repository is clearly more specific to the requested deliverable.
Workflow
- Collect the required inputs or source material before drafting, unless this skill explicitly generates the intake itself.
- Follow the section order and decision rules in this
SKILL.md; do not skip mandatory steps or required fields. - Review the draft against the quality criteria, then deliver the final output in markdown unless the skill specifies another format.
Anti-Patterns
- Do not invent client facts, performance data, budgets, or approvals that were not provided or clearly inferred from evidence.
- Do not skip required inputs, mandatory sections, or quality checks just to make the output shorter.
- Do not drift into out-of-scope work such as code implementation, design production, or unsupported legal conclusions.
Outputs
- An AI-focused strategy, audit, system design, or prompt asset in markdown with human review and control points.
References
- Use the inline instructions in this skill now. If a
references/directory is added later, treat its files as the deeper source material and keep thisSKILL.mdexecution-focused.
Required Input
Ask for the following before generating any deliverable:
- Client business name — trading name as it appears to customers
- Industry — e.g. retail, hospitality, healthcare, professional services
- Country/city — default: Uganda / East Africa if not specified
- All customer data sources currently in use — tick all that apply: WhatsApp chat history | Facebook Page DMs and comments | Email (Gmail / Outlook) | CRM software (name it) | Excel / Google Sheets | Accounting software (e.g. QuickBooks, Wave) | Other (describe)
- Primary AI use case the data will support — select one: Content personalisation | Chatbot knowledge base | Predictive analytics | Audience segmentation
- Approximate number of customer records — total across all systems (even a rough estimate is useful)
Why Data Quality is Step 1
Three independent books converge on the same conclusion: data quality is the single most important prerequisite for any AI marketing investment (Venkatesan and Lecinski, 2026; Lamplugh, 2024; Ltifi, 2025). AI tools amplify what they receive — clean data produces better outputs; dirty data produces worse outputs at speed.
For East African clients specifically, the data challenge is structural. Customer data is fragmented across WhatsApp chat histories, Facebook Page DMs, manual Excel spreadsheets, phone contacts, and verbal records. Before any AI tool can add value, this fragmentation must be addressed.
The investment in data hygiene is not a technical exercise — it is the commercial prerequisite for every AI use case. Skipping this step does not save time; it wastes the cost of the AI tool.
The EA Data Reality
Map the typical Ugandan SME data landscape honestly before proceeding:
- WhatsApp — customer enquiries, orders, and complaints are stored in chat history; rarely exported or structured; impossible to query or analyse at scale
- Facebook Page — post comments, DM threads, and page insights are exportable via Meta Business Suite but rarely exported; valuable for sentiment and engagement data
- Email — enquiries and newsletter replies sit in Gmail or Outlook, rarely in a CRM; no linkage to purchase data
- Excel spreadsheets — customer names, phone numbers, and purchase history recorded manually; often incomplete, inconsistent formatting, and not regularly updated
- Verbal / in-person — walk-in customers and cash transactions frequently go unrecorded entirely
- Result — the same customer may appear in three or four separate systems with different name spellings, no unique identifier, and no purchase history linkage. AI tools cannot perform matching, personalisation, or segmentation on this data without prior consolidation.
Data Hygiene Checklist (20 Items)
Complete this checklist for the client. Score each item Yes / No / Partial. Flag any No or Partial item as a remediation priority.
Completeness
- Are all customer records complete — name, contact detail, and transaction history at minimum?
- Are there records with missing phone numbers or email addresses?
- Are product and service names consistently recorded — not "cake" in some rows and "birthday cake" in others?
- Are dates recorded in a consistent format (DD/MM/YYYY throughout)?
Accuracy
- Are phone numbers in the correct format for Uganda (+256 or 0)?
- Are prices recorded in UGX consistently — not a mix of UGX and USD?
- Are customer names spelled consistently across all systems?
- Are inactive or churned customers flagged and separated from active customer records?
Consistency
- Is each customer recorded once across all systems — not duplicated across WhatsApp, Excel, and email?
- Are product categories consistent — not "electronics" in one place and "gadgets" in another?
- Are campaign names consistent across Meta, email, and CRM records?
- Are staff names or salesperson records consistent for attribution purposes?
Accessibility
- Is data stored in a format the AI tool can read — CSV, Excel, or plain text — not only in printed records or chat messages?
- Can the data be exported from its current system? Some WhatsApp and Facebook data requires manual export steps.
- Is the data accessible to the team members who will operate the AI tools?
- Are data access credentials documented and stored securely?
Governance
- Is there a named data owner — one person responsible for data quality?
- Is there a documented process for adding new customer records?
- Is customer data held in compliance with Uganda's Data Protection and Privacy Act (2019)? Confirm that consent has been obtained and data is not shared without permission.
- Is there a data retention policy — documenting how long customer records are kept and when they are deleted?
Data Mapping Framework
Follow this step-by-step process to consolidate fragmented data into a single master customer list:
- List all data sources — WhatsApp, Facebook, email, Excel, accounting software, and any others identified in the Required Input.
- For each source, record: what data it contains; its format; estimated record count; date last updated.
- Identify the primary identifier — the field that will link records across systems. Phone number is the most reliable identifier in Uganda, because it is more consistent than email across both urban and peri-urban customers.
- Export all sources to Excel or CSV — request exports from WhatsApp (chat export), Meta Business Suite (contacts and insights), Gmail (contacts), and any CRM or accounting software in use.
- Standardise the phone number field across all exports — remove spaces, remove hyphens, ensure all numbers use the +256 international format.
- Use VLOOKUP or a de-duplication tool to identify records appearing in multiple sources. Google Sheets has a built-in Remove Duplicates function; Excel has a Remove Duplicates tool under the Data tab.
- Build a master customer list — one row per customer, linking all data points: name, phone number, email (if available), source systems, last transaction date, and any segment tags.
30-Day Remediation Plan Template
Apply this plan after the hygiene checklist is complete and the top three issues are identified:
Week 1 — Map and assess Complete the data mapping framework. Run the hygiene checklist for all 20 items. Identify the top three issues (typically: missing contact details, duplicate records, and inconsistent product naming). Assign a named owner for each issue.
Week 2 — Fix completeness Fill missing contact details by following up with customers via WhatsApp or in-person. Add missing transaction dates from accounting records or receipts. Ensure all product and service names follow a single agreed naming convention.
Week 3 — Fix accuracy Standardise all phone numbers to +256 format. Correct misspellings in customer names. Merge duplicate records into the master list. Flag and separate inactive customers.
Week 4 — Implement governance Assign the named data owner formally. Document the new record process: how new customers are added, which fields are required, and who is responsible. Confirm Uganda Data Protection and Privacy Act (2019) compliance — consent records in place, retention policy documented. Export the final master customer list and confirm it is ready for AI tool import.
Output: a clean master customer list ready for import into the AI tool of choice, plus a documented data governance process that prevents regression.
Tool Options
| Tool | Use | EA accessibility | Approx. cost |
|---|---|---|---|
| Google Sheets | Data consolidation and de-duplication | Yes — free | Free |
| Airtable | Structured database with easy import/export | Yes — free tier | Free tier available |
| HubSpot Free CRM | Customer database with basic automation | Yes — free | Free |
| Notion | Flexible database and knowledge base | Yes — free tier | Free tier available |
| Excel | Standard spreadsheet de-duplication | Yes | Included in Office 365 |
For most Ugandan SMEs, Google Sheets is the recommended starting point: it is free, collaborative, accessible on mobile, and sufficient for up to several thousand customer records.
Handoff — Connecting Clean Data to AI Tools
Once the audit and remediation are complete, connect the master customer list to the relevant AI tools:
- RAG knowledge base (see
ai-rag-brand-knowledge-base) — upload the product catalogue and FAQs derived from the clean data - Chatbot knowledge base (see
ai-whatsapp-chatbot-design) — upload approved response templates and policies built from the structured data - Segmentation tool — import the master customer list with segment tags for audience targeting
- Predictive analytics tool (see
ai-predictive-analytics-social) — import transaction history and engagement history for churn and upsell modelling
Do not connect any AI tool until the Week 4 governance step is complete. A tool connected to unclean data will produce unreliable outputs and undermine client confidence in AI marketing.
Quality Criteria
Good output from this skill meets all of the following standards:
- All customer data sources mapped — no source overlooked, including informal sources such as verbal records and WhatsApp chat histories
- Data hygiene checklist completed in full — all 20 items scored Yes, No, or Partial with priority flags applied to every No and Partial item
- Data mapping framework completed — master customer list built using phone number as the primary identifier, with all sources consolidated
- Top three hygiene issues identified with a specific, actionable remediation step for each — not generic advice, but concrete next actions for this client
- 30-day remediation plan documented with named owner and weekly milestones — not a suggestion but a committed plan
- Data governance framework in place — named data owner, new record process documented, and responsibility assigned
- Uganda Data Protection and Privacy Act (2019) compliance confirmed — consent documented for all records, data sharing restrictions understood, retention policy set
- Clean data handed off to at least one AI tool with a verification step confirming that outputs have improved compared to pre-audit baseline
References
- Venkatesan, R. and Lecinski, J. (2026) The AI Marketing Canvas, 2nd edn. Stanford University Press.
- Lamplugh, M. (2024) The AI Marketing Playbook, 2nd edn. Mercury Learning.
- Ltifi, M. (ed.) (2025) Advances in Digital Marketing in the Era of Artificial Intelligence. CRC Press.
More from peterbamuhigire/social-media-skills
meta-ai-tools-audit
Produces a structured evaluation of AI marketing tools for a specific client, mapped by function (content creation, SEO, social media management, email marketing, automation, analytics, paid advertising, influencer marketing) with East African market accessibility, cost, and capability ratings. Outputs a recommended AI tool stack calibrated to the client's budget profile in UGX. Invoke when a client asks which AI tools to adopt, wants to assess their current AI tool usage, needs to build an AI-powered martech stack, or is evaluating AI capabilities against their marketing goals.
3platform-instagram-visual-system
>
3caption-writer
Writes social media captions for any platform from a brief. Generates 3 variations — short, medium, and long — with a hashtag set for each. Invoke when the user says "write a caption", "write captions for", "I need post copy for", "draft some caption options", or when a content brief is provided and the user needs caption text. Also invoke when working through a content calendar and post copy is needed for specific items.
3playbook-instagram-dm-sales
>
3training-social-media-fundamentals
Generates a foundational social media training guide for clients and their teams who are completely new to social media marketing, or who have been posting without any strategic understanding. Invoke when the user says "write a social media basics guide", "create a beginner training document", "the client doesn't understand social media", "start-here training", or when a client needs to understand social media before any strategy or content work begins. Distinct from training-client-team (operational handover of an existing strategy) and training-diy-content (content creation for self-managing clients). This skill covers what social media is, how it works, and how to approach it intelligently — the conceptual foundation that makes all downstream strategy work land.
3training-ai-prompt-writing
Produces a practical training guide for client teams on prompt engineering for marketing tasks — covering the Alpha-Beta-Gamma-Delta-Epsilon prompt structure, 10 prompt components, 5 prompting approaches, and 7 copywriting frameworks with worked East African examples. Invoke when the user says "create a prompt writing training guide", "teach my team how to use AI for marketing", "write a prompt engineering workshop", "AI copywriting training for staff", or needs a structured training document for client employees who use AI tools to produce marketing content.
3