This project analyzes 3,900 customer transactions to understand how users shop, what influences their purchases, and which customer groups drive the most revenue. It includes data preparation in Python, SQL-based analysis, and a Power BI dashboard supported by a final business report and presentation.
The goal of this project is to uncover insights from customer transaction data and provide meaningful recommendations for business growth.
The workflow includes:
- Loading and exploring the dataset in Python
- Cleaning and preparing the data
- Conducting in-depth SQL analysis
- Building an interactive Power BI dashboard
- Creating a business-focused report using Gamma AI
Rows: 3,900
Columns: 18
Missing Values: 37 missing review ratings (handled during cleaning)
Customer details: Age, Gender, Location, Subscription Status
Purchase information: Item Purchased, Category, Amount, Season, Size, Color
Shopping behavior: Discounts, Promo Codes, Review Rating, Shipping Type
Behavioral patterns: Previous Purchases, Purchase Frequency
📂 Dataset Used: Dataset
💻 Project Code: View Notebook
Dashboard File: View Dashboard File
- Python: Pandas, NumPy, Matplotlib/Seaborn
- MySQL Server: Segmentation and behavioral SQL analysis
- Power BI: Dashboard creation and KPI visualizations
- Gamma AI: Final report and presentation design
- Jupyter Notebook / VS Code: Development environment
- Imported dataset with Pandas
- Reviewed structure using:
- df.info()
- summary statistics
- value distributions
- Imputed 37 missing review ratings using median per product category
- Removed duplicates and fixed inconsistent values
- Standardized categorical fields
- Created new columns for improved insights:
- age_group (18–25, 26–35, 36–50, 50+)
- purchase_frequency_days
- Customer segments:
- New, Returning, Loyal
- Performed deeper analysis including:
- Gender-wise revenue
- Subscriber vs non-subscriber purchases
- Top products
- Discount dependency analysis
- Customer segment insights
- Age-group revenue comparison
- Developed an interactive dashboard featuring:
- Key KPIs and real-time metrics
- Revenue breakdowns
- Product performance
- Subscription insights
- Filters for category, season, segments, shipping type
- Created a structured business report summarizing:
- Key findings
- Customer segmentation
- Discount impact
- High-performing products
- Actionable recommendations
- Male and female customers contribute almost equally to revenue.
-
Similar average spend (~$60)
-
But subscribers contribute 60% more total revenue
-
Subscription benefits strongly influence engagement
- This is the primary target segment for marketing and promotions.
-
Backpack – 4.8★
-
Sneakers – 4.7★
-
Jacket – 4.6★
- Some items rely heavily on discounts (e.g., Handbag 68%, Sweater 65%).
-
Boost Subscriptions: Highlight exclusive perks to convert more users
-
Loyalty Programs: Reward repeat customers to grow the “Loyal” segment
-
Smart Discounting: Offer discounts strategically to protect margins
-
Targeted Marketing: Focus on ages 26–35 and express-shipping users
-
Product Positioning: Promote top-rated and best-selling items
Email: vipinsuryavanshi.vs@gmail.com
LinkedIn: Vipin Suryavanshi
GitHub: Vipin-s27