Problem Type: Unsupervised Learning → Clustering : Segment mall customers into distinct groups based on spending behavior.
Unsupervised Learning – Clustering
- Source: Mall Customers Dataset on Kaggle
- Features: Age, Annual Income, Spending Score
Segment customers into distinct clusters based on income and spending behavior using K-Means.
- Load and preprocess dataset
- Standardize features
- Determine optimal k using the elbow method
- Apply K-Means clustering
- Visualize clusters
- Evaluate with Silhouette Score
- Best k: 5 clusters
- Silhouette Score: ~0.55
- Visualized clusters saved in
outputs/
Python, Pandas, Scikit-learn, Seaborn, Matplotlib