Skip to content

indu-explores-data/Restaurant-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

55 Commits
 
 
 
 
 
 
 
 

Repository files navigation

🍽️ Restaurant Data Analysis — Exploratory Insights

This project performs an end-to-end Exploratory Data Analysis (EDA) on restaurant data to uncover patterns in customer preferences, restaurant performance, and cuisine trends. The goal is to derive meaningful insights that can help food businesses understand what drives ratings, pricing, and popularity.


🧪 Objectives

  • Understand the structure and quality of restaurant data.
  • Perform detailed data cleaning and preprocessing.
  • Explore how ratings vary by location, cuisine, and cost.
  • Identify trends in online delivery and pricing strategies.
  • Visualize customer engagement patterns through votes and reviews.

📌 Key Methods

  • Data Cleaning: Handle missing values and inconsistent records.
  • Data Transformation: Encode categorical data and standardize numeric fields.
  • Exploratory Analysis: Examine distribution and correlation between variables.
  • Visualization: Use advanced plots to illustrate restaurant and cuisine trends.

📷 Visualizations

▶️ Aggregate Rates vs Votes

Shows how customer engagement (votes) relates to overall ratings.
Aggregate Rates vs Votes


▶️ Has Online Delivery (Yes/No)

Distribution of restaurants offering online delivery.
Has Online Delivery (Yes/No)


▶️ Rating Averages

Displays average ratings across restaurants and cuisines.
Rating Averages


▶️ Density Plot of Cost in INR

Visualizes the distribution of restaurant costs for two people.
Density Plot of Cost in INR


▶️ Outlier Analysis for Votes

Highlights extreme values in customer votes.
Outlier Analysis for Votes


▶️ Top Cities with Most Expensive Restaurants

Shows which cities have higher restaurant pricing trends.
Top Cities with Most Expensive Restaurants


▶️ Aggregate Rating vs Rating Colour

Explores how ratings are categorized visually by color codes.
Aggregate Rating vs Rating Colour


▶️ Moving Average of Average Cost for Two

Tracks pricing trends over time using moving averages.
Moving Average of Average Cost for Two


🔍 Key Insights & Outcomes

  • Most restaurants fall in the mid-rating range (3.0–4.0), indicating generally satisfactory but not exceptional experiences.
  • Moderately priced restaurants tend to receive higher ratings, showing that value-for-money strongly influences customer perception.
  • Votes and ratings have a strong positive correlation — restaurants with higher engagement are typically better rated.
  • Online delivery availability impacts satisfaction; restaurants offering it tend to have slightly lower ratings, possibly due to delivery quality issues.
  • Table booking is associated with higher ratings, reflecting the importance of structured service and customer experience.
  • High-cost restaurants are concentrated in metro cities like New Delhi, Jakarta, and Colombo, highlighting market segmentation and luxury dining trends.
  • Top cuisines include North Indian, Chinese, and Fast Food, while niche cuisines often show lower ratings.
  • Tier 2 cities have more low-rated restaurants, suggesting a gap and opportunity for quality-driven expansion.

💻 Technologies Used

  • Python 3
  • pandas, NumPy — Data manipulation and preprocessing
  • matplotlib, seaborn — Data visualization
  • Environment: Jupyter Notebook

💻 Setup & Installation Instructions

Follow these steps to set up the project locally and reproduce the analysis:

1. Clone the Repository:

git clone https://github.com/indu-explores-data/Restaurant-Data-Analysis.git

2. Navigate to the Project Directory:

cd Restaurant-Data-Analysis

3. Create and Activate a Virtual Environment (Recommended):

python -m venv venv

Windows:

venv\Scripts\activate

Mac/Linux:

source venv/bin/activate

4. Install Required Libraries:

pip install pandas numpy matplotlib seaborn jupyter

5. Launch Jupyter Notebook:

jupyter notebook

6. Open Restaurant Data Analysis.ipynb and run all cells to reproduce the analysis.


▶️ Usage / How to Run

  • Open Restaurant Data Analysis.ipynb in Jupyter Notebook.
  • Run all cells sequentially to view insights and visualizations.
  • Results and images are available inside the images/ folder.

🔗 Connect with Me

Let’s connect on LinkedIn for project discussions or data-driven collaborations:

LinkedIn


🙌 Feedback & Support

If you found this project helpful, please ⭐ star the repository and share your thoughts. Suggestions and contributions are always welcome!

About

Analyzed restaurant data to uncover insights on ratings, cuisines, and pricing. Used Python (Pandas, Seaborn, Matplotlib) for EDA and visualizations. Highlights include top-rated cuisines, pricing trends, and location-based analysis to support business decisions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors