This project involves a deep dive into Zepto's operational data to analyze delivery performance, product category popularity, and customer purchasing trends using advanced SQL queries.
The goal of this analysis is to extract actionable insights from the Zepto dataset, focusing on:
- Order Efficiency: Analyzing delivery times and bottleneck periods.
- Product Performance: Identifying top-selling categories and revenue contributors.
- Customer Behavior: Segmenting users based on order frequency and average order value (AOV).
- Top Category: Grocery & Staples contributed the highest revenue among all product segments.
- Delivery Lag: A measurable gap was found between expected and actual delivery times during peak hours.
- Customer Segmentation: High-frequency buyers placed significantly more orders with a higher AOV compared to one-time users.
- Revenue Trends: Weekly sales showed consistent growth patterns with spikes during weekends.
- Database: PostgreSQL
- Tools: pgAdmin/PostgreSQL
- Data Source:
zepto_v2.csv(included in repository)
The Zepto.sql file contains queries addressing:
- Total Revenue Trends: Monthly and weekly sales growth.
- Category Deep-dive: Which grocery segments have the highest turnover?
- Delivery Lag: Calculation of the difference between "Expected" vs "Actual" delivery times.
SELECT
category_name,
SUM(order_amount) AS total_revenue,
COUNT(order_id) AS total_orders
FROM zepto_sales
GROUP BY category_name
ORDER BY total_revenue DESC
LIMIT 5;- Clone the Repository
git clone https://github.com/bsanghvi78/Zepto.git
cd Zepto-
Import the Dataset
- Use the
zepto_v2.csvfile to populate your SQL database table. - Ensure the table schema matches the dataset structure before importing.
- Use the
-
Run the SQL Queries
- Execute the scripts available in
Zepto.sql. - These queries contain the business logic and analytical insights for the project.
- Execute the scripts available in
Zepto.sql— All SQL queries including business logic and analytical insightszepto_v2.csv— Primary dataset containing sales and delivery-related records
📌 Dataset included in this repository.