Deployed link : https://customersegmentation-6s99huyf9ifkafk9p9cuyy.streamlit.app/
This project analyzes customer segmentation using K-Means and Hierarchical Clustering. The focus is on identifying mismatched customersβthose assigned to different clusters by the two models.
- Mismatched Customer Analysis: Identifies customers grouped differently by clustering models.
- Interactive Dashboard: Built with Streamlit for real-time insights.
- Data Exploration: View mismatched customers' spending behavior, income levels, and age distributions.
- Model Performance Comparison: Compare K-Means vs Hierarchical Clustering results.
git clone https://github.com/AdityaSareen06/Customer_Segmentation.git
cd Customer_Segmentation
# Set up Virtual Env
python -m venv venv
source venv/bin/activate # macOS/Linux
venv\Scripts\activate # Windows
#Install dependencies
pip install -r requirements.txt
#Run dashboard
streamlit run src/dashboard.py
π Dashboard Overview
The interactive dashboard allows users to:
Explore mismatched clusters visually.
Filter customers based on spending, income, and age.
View key insights & model comparisons.
π Insights from Analysis
Most mismatched customers fall into the low spending category.
Age distribution shows significant variation, with a mix of middle-aged and senior customers.
Some outliers in spending scores indicate high-potential customers for premium offers.
Hierarchical clustering tends to group more customers into a single cluster compared to K-Means.
π Technologies Used
Python π
Streamlit π
Scikit-learn π€
Pandas, NumPy ποΈ
Plotly π
π License
This project is licensed under the MIT License.
β¨ Author
π€ Aditya Sareen
π§ Email: [email protected]
π GitHub: AdityaSareen06