This project analyzes Airbnb listings across various neighborhoods to predict Review Activity and perform Unsupervised Segmentation. By integrating external datasets—including Crime, Population, and Walkability indices—this pipeline explores how local environment and property features influence guest engagement and listing performance.
To maintain a lightweight repository, this project includes sample datasets (~100 rows each). These allow you to test the code logic without the overhead of the full 1.6 GB dataset.
Due to GitHub size limitations, the complete dataset is hosted externally.
Download full structured dataset (Google Drive):
Download Full Dataset
The Google Drive folder already mirrors the expected project structure:
data/
├── raw/
├── preprocessed/
- Download the Google Drive folder.
- Place the entire
data/folder into the project root directory. - Ensure it sits alongside
src/,notebooks/, andREADME.md. - Run the notebooks as usual.
├── data/
│ ├── raw_sample/ # SAMPLE DATA (100 rows) - Pushed to GitHub
│ │ ├── airbnb/ # Airbnb listings sample
│ │ ├── census_tract/ # Census tracts shapefiles sample
│ │ ├── crime/ # Local crime statistics sample
│ │ ├── population/ # Population density sample
│ │ └── walkability/ # National Walkability Index sample
│ └── preprocessed_sample/ # SAMPLE DATA - Feature-engineered outputs
│
├── src/ # Python scripts for data processing
├── notebooks/ # Jupyter notebooks for analysis
│ ├── Data Preprocessing + EDA.ipynb # Data Cleaning and Merging
│ ├── Part A Supervised Learning.ipynb # Linear Models & Regularization
│ ├── Part A2 KNN Trees.ipynb # KNN and Tree-based Models
│ └── Part B Unsupervised Learning.ipynb # Clustering and Segmentation
│
├── requirements.txt # Python dependencies
├── README.md # Project documentation
└── Milestone 2 Final Report Airbnb Success.pdf # Final project report / analysis
-
Python 3.12
-
Install required Python packages:
pip install -r requirements.txt
- Divya Andem - divyaand@umich.edu
- Jordan Huang - jordanhu@umich.edu
- Sophia Settle - sosettle@umich.edu