🚕 Uber Trip Forecasting (Jan–June 2015)

🔗 Repository: https://github.com/anasrobo/Uber_Trip_Analysis

💡 Project Overview

An interactive Streamlit app that analyzes NYC Uber pickups (Jan–Jun 2015) and forecasts hourly demand using an ensemble of three ML regressors.

📊 Key Features

🔍 Time-Series Feature Engineering: hour, day-of-week, month, weekend flag
📈 Rolling Stats: 24-hour rolling mean & standard deviation
⏳ Lag Features: previous 1–24 hour trip counts
🤖 Ensemble Learning: XGBoost, Random Forest, Gradient Boosting combined via weighted average
🌐 Streamlit Dashboard: Interactive UI with Plotly charts
🔮 24-Hour Forecast: Recursive forecasting for the next day

🗂️ Project Structure

Uber_Trip_Analysis/
├── streamlit_app/
│   ├── app/                   # 📊 Streamlit application source code
│   ├── train_and_save_models/ # 🧠 Training scripts & model saving
├── assets/
│   └── newplot.png            # 📈 Sample visualization
├── notebooks/
│   └── Uber_Trip_Analysis     # 📓 Jupyter notebook (EDA, modeling)
├── models/
│   ├── xgb_model.pkl          # 🌲 XGBoost saved model
│   ├── rf_model.pkl           # 🌳 Random Forest saved model
│   ├── gbr_model.pkl          # 📉 Gradient Boosting model
│   └── ensemble_weights.pkl   # 🧩 Custom ensemble weights
├── uber-raw-data-janjune-15.csv  # 📁 Raw Uber data (~512 MB)
├── requirements.txt           # 📦 Required Python packages
└── README.md                  # 📘 You're here

🗃️ Dataset

Uber NYC pickups (Jan–Jun 2015), timestamped at the ride level. Resampled to hourly counts for forecasting.

📥 Download (512 MB)
Google Drive Link

After downloading, place uber-raw-data-janjune-15.csv into the project root.

⚙️ Installation & Setup

#bash git clone https://github.com/anasrobo/Uber_Trip_Analysis.git cd Uber_Trip_Analysis

(Optional) Virtual environment

python -m venv venv

Windows

.\venv\Scripts\activate

macOS/Linux

source venv/bin/activate

pip install -r requirements.txt

🚀 Usage streamlit run app.py Opens at http://localhost:8501

Explore:

Actual vs Predicted trip curves (XGB, RF, GBR, Ensemble)

24-Hour Forecast dashed line

Performance Metrics: MAPE, RMSE, R²

🤖 ML Pipeline

Load & Preprocess

Read CSV, parse Pickup_date as datetime

Resample to hourly counts, set index

Feature Engineering

Time features: hour, dayofweek, month, is_weekend

Rolling stats & lag features

Model Training

Train XGB, RF, GBR on training split

Ensemble

Weighted avg of model predictions (weights learned via CV)

Evaluation

MAPE, RMSE, R² on test split

Forecasting

Recursive 24-hour ahead using latest data & features

📸 Screenshots

Actual vs Predictions & 24-Hour Forecast
Input Form UI

📈 Example Metrics

Model MAPE RMSE R² XGBoost 12.3% 450.2 0.842 RandomForest 13.1% 478.9 0.817 GradBoost 12.8% 462.8 0.831 Ensemble 11.5% 432.1 0.858

(Actual values may vary.)

📦 requirements.txt

streamlit pandas numpy plotly scikit-learn xgboost joblib

🤝 Contributing

Fork & clone

Create a branch (git checkout -b feature/your-feature)

Commit & push

📄 License

Licensed under MIT. See LICENSE for details.

“The best way to predict the future is to create it.” – Peter Drucker

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚕 Uber Trip Forecasting (Jan–June 2015)

💡 Project Overview

📊 Key Features

🗂️ Project Structure

🗃️ Dataset

⚙️ Installation & Setup

(Optional) Virtual environment

Windows

macOS/Linux

pip install -r requirements.txt

Explore:

🤖 ML Pipeline

📸 Screenshots

📈 Example Metrics

📦 requirements.txt

🤝 Contributing

📄 License

Made with ❤️ by Anas

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
assets		assets
dataset		dataset
models		models
notebooks		notebooks
streamlit_app		streamlit_app
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

🚕 Uber Trip Forecasting (Jan–June 2015)

💡 Project Overview

📊 Key Features

🗂️ Project Structure

🗃️ Dataset

⚙️ Installation & Setup

(Optional) Virtual environment

Windows

macOS/Linux

pip install -r requirements.txt

Explore:

🤖 ML Pipeline

📸 Screenshots

📈 Example Metrics

📦 requirements.txt

🤝 Contributing

📄 License

Made with ❤️ by Anas

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages