🍽️ Restaurant Data Analysis — Exploratory Insights

This project performs an end-to-end Exploratory Data Analysis (EDA) on restaurant data to uncover patterns in customer preferences, restaurant performance, and cuisine trends. The goal is to derive meaningful insights that can help food businesses understand what drives ratings, pricing, and popularity.

🧪 Objectives

Understand the structure and quality of restaurant data.
Perform detailed data cleaning and preprocessing.
Explore how ratings vary by location, cuisine, and cost.
Identify trends in online delivery and pricing strategies.
Visualize customer engagement patterns through votes and reviews.

📌 Key Methods

Data Cleaning: Handle missing values and inconsistent records.
Data Transformation: Encode categorical data and standardize numeric fields.
Exploratory Analysis: Examine distribution and correlation between variables.
Visualization: Use advanced plots to illustrate restaurant and cuisine trends.

📷 Visualizations

▶️ Aggregate Rates vs Votes

Shows how customer engagement (votes) relates to overall ratings.

▶️ Has Online Delivery (Yes/No)

Distribution of restaurants offering online delivery.

▶️ Rating Averages

Displays average ratings across restaurants and cuisines.

▶️ Density Plot of Cost in INR

Visualizes the distribution of restaurant costs for two people.

▶️ Outlier Analysis for Votes

Highlights extreme values in customer votes.

▶️ Top Cities with Most Expensive Restaurants

Shows which cities have higher restaurant pricing trends.

▶️ Aggregate Rating vs Rating Colour

Explores how ratings are categorized visually by color codes.

▶️ Moving Average of Average Cost for Two

Tracks pricing trends over time using moving averages.

🔍 Key Insights & Outcomes

Most restaurants fall in the mid-rating range (3.0–4.0), indicating generally satisfactory but not exceptional experiences.
Moderately priced restaurants tend to receive higher ratings, showing that value-for-money strongly influences customer perception.
Votes and ratings have a strong positive correlation — restaurants with higher engagement are typically better rated.
Online delivery availability impacts satisfaction; restaurants offering it tend to have slightly lower ratings, possibly due to delivery quality issues.
Table booking is associated with higher ratings, reflecting the importance of structured service and customer experience.
High-cost restaurants are concentrated in metro cities like New Delhi, Jakarta, and Colombo, highlighting market segmentation and luxury dining trends.
Top cuisines include North Indian, Chinese, and Fast Food, while niche cuisines often show lower ratings.
Tier 2 cities have more low-rated restaurants, suggesting a gap and opportunity for quality-driven expansion.

💻 Technologies Used

Python 3
pandas, NumPy — Data manipulation and preprocessing
matplotlib, seaborn — Data visualization
Environment: Jupyter Notebook

💻 Setup & Installation Instructions

Follow these steps to set up the project locally and reproduce the analysis:

1. Clone the Repository:

git clone https://github.com/indu-explores-data/Restaurant-Data-Analysis.git

2. Navigate to the Project Directory:

cd Restaurant-Data-Analysis

3. Create and Activate a Virtual Environment (Recommended):

python -m venv venv

Windows:

venv\Scripts\activate

Mac/Linux:

source venv/bin/activate

4. Install Required Libraries:

pip install pandas numpy matplotlib seaborn jupyter

5. Launch Jupyter Notebook:

jupyter notebook

6. Open Restaurant Data Analysis.ipynb and run all cells to reproduce the analysis.

▶️ Usage / How to Run

Open Restaurant Data Analysis.ipynb in Jupyter Notebook.
Run all cells sequentially to view insights and visualizations.
Results and images are available inside the images/ folder.

🔗 Connect with Me

Let’s connect on LinkedIn for project discussions or data-driven collaborations:

🙌 Feedback & Support

If you found this project helpful, please ⭐ star the repository and share your thoughts. Suggestions and contributions are always welcome!

Name		Name	Last commit message	Last commit date
Latest commit History 55 Commits
images		images
README.md		README.md
Restaurant Data Analysis.ipynb		Restaurant Data Analysis.ipynb
Restaurant Dataset.csv		Restaurant Dataset.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🍽️ Restaurant Data Analysis — Exploratory Insights

🧪 Objectives

📌 Key Methods

📷 Visualizations

▶️ Aggregate Rates vs Votes

▶️ Has Online Delivery (Yes/No)

▶️ Rating Averages

▶️ Density Plot of Cost in INR

▶️ Outlier Analysis for Votes

▶️ Top Cities with Most Expensive Restaurants

▶️ Aggregate Rating vs Rating Colour

▶️ Moving Average of Average Cost for Two

🔍 Key Insights & Outcomes

💻 Technologies Used

💻 Setup & Installation Instructions

▶️ Usage / How to Run

🔗 Connect with Me

🙌 Feedback & Support

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🍽️ Restaurant Data Analysis — Exploratory Insights

🧪 Objectives

📌 Key Methods

📷 Visualizations

▶️ Aggregate Rates vs Votes

▶️ Has Online Delivery (Yes/No)

▶️ Rating Averages

▶️ Density Plot of Cost in INR

▶️ Outlier Analysis for Votes

▶️ Top Cities with Most Expensive Restaurants

▶️ Aggregate Rating vs Rating Colour

▶️ Moving Average of Average Cost for Two

🔍 Key Insights & Outcomes

💻 Technologies Used

💻 Setup & Installation Instructions

▶️ Usage / How to Run

🔗 Connect with Me

🙌 Feedback & Support

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages