rfm-customer-segmentation
RFM Customer Segmentation Analysis
A comprehensive customer segmentation skill that automatically analyzes e-commerce transaction data to identify customer value segments using RFM (Recency, Frequency, Monetary) analysis with K-means clustering.
Instructions
1. Data Analysis
When users provide e-commerce data or ask about customer segmentation:
- Load and validate the transaction data
- Clean data by removing invalid orders (negative quantities, zero prices)
- Calculate RFM metrics for each customer:
- Recency: Days since last purchase
- Frequency: Number of purchases
- Monetary: Total purchase amount
- Use K-means clustering on RFM dimensions
- Automatically determine optimal number of clusters using elbow method
2. Customer Segmentation
- Create customer value segments: High, Medium, Low value customers
- Score each customer on RFM dimensions (1-3 scale)
- Calculate overall customer value scores
- Identify and rank VIP customers for marketing campaigns
3. Visualization and Reporting
- Generate comprehensive customer segmentation dashboard
- Create pie charts for segment distribution and revenue share
- Build RFM scatter plots to visualize customer patterns
- Generate box plots showing value distribution by segment
- Export detailed CSV reports with VIP customer lists
4. Marketing Insights
- Provide actionable marketing recommendations for each segment
- Generate executive summary with key findings
- Create customer activation strategies for different value tiers
- Export VIP customer lists for targeted marketing campaigns
Usage Examples
Basic Customer Segmentation
Analyze these e-commerce orders and segment customers by value:
[CSV data with order_id, user_id, purchase_date, quantity, unit_price]
VIP Customer Identification
Find the top 100 most valuable customers from our sales data for marketing campaign
Customer Value Analysis
Create a customer segmentation report showing revenue contribution by customer segment
Key Features
- Automatic Data Cleaning: Handles Chinese e-commerce data formats, removes invalid orders
- Intelligent Clustering: Uses elbow method to determine optimal cluster count
- Chinese Language Support: Full support for Chinese field names and visualizations
- Comprehensive Reports: Generates HTML reports, PNG dashboards, and CSV exports
- Marketing Ready: Provides VIP customer lists and actionable insights
File Requirements
The skill works with e-commerce transaction data containing:
- user_id: Customer identification code (用户码)
- order_date: Purchase date (消费日期)
- quantity: Order quantity (数量)
- unit_price: Item unit price (单价)
- product_info: Product details (optional)
Output Files Generated
customer_segments.csv: Complete customer segmentation datavip_customers_list.csv: Ranked VIP customer list for marketingsegment_summary_statistics.csv: Detailed statistics by segmentcustomer_segmentation_dashboard.png: Visual analytics dashboarddata_validation_report.txt: Data quality and analysis validation
Dependencies
- pandas, numpy for data processing
- scikit-learn for K-means clustering
- matplotlib, seaborn for visualization (with Chinese font support)
- Standard Python libraries for file operations
Best Practices
- Ensure date fields are in consistent format (YYYY-MM-DD recommended)
- Remove or handle missing values before analysis
- Use sufficient data volume (1000+ orders recommended for reliable clustering)
- Consider business context when interpreting segment results
- Validate results with domain knowledge when possible
More from liangdabiao/claude-data-analysis-ultra-main
data-exploration-visualization
自动化数据探索和可视化工具,提供从数据加载到专业报告生成的完整EDA解决方案。支持多种图表类型、智能数据诊断、建模评估和HTML报告生成。适用于医疗、金融、电商等领域的数据分析项目。
129content-analysis
Analyze text content using both traditional NLP and LLM-enhanced methods. Extract sentiment, topics, keywords, and insights from various content types including social media posts, articles, reviews, and video content. Use when working with text analysis, sentiment detection, topic modeling, or content optimization.
61funnel-analysis
Analyze user conversion funnels, calculate step-by-step conversion rates, create interactive visualizations, and identify optimization opportunities. Use when working with multi-step user journey data, conversion analysis, or when user mentions funnels, conversion rates, or user flow analysis.
58retention-analysis
Analyze user retention and churn using survival analysis, cohort analysis, and machine learning. Calculate retention rates, build survival curves, predict churn risk, and generate retention optimization strategies. Use when working with user subscription data, membership information, or when user mentions retention, churn, survival analysis, or customer lifetime value.
31ab-testing-analyzer
全面的AB测试分析工具,支持实验设计、统计检验、用户分群分析和可视化报告生成。用于分析产品改版、营销活动、功能优化等AB测试结果,提供统计显著性检验和深度洞察。
25recommender-system
智能推荐系统分析工具,提供多种推荐算法实现、评估框架和可视化分析。使用时需要用户行为数据、商品信息或评分数据,支持协同过滤、矩阵分解等推荐算法,生成个性化推荐结果和评估报告。
24