This repository contains the complete codebase, datasets, and analysis for our NBA analytics research paper examining team performance and ranking methodologies across multiple seasons (2016-2024).
Our research applies advanced statistical modeling and machine learning techniques to analyze NBA team performance, with a focus on logistic regression models for ranking and classification tasks. The study encompasses comprehensive data from regular seasons, playoffs, and play-in tournaments.
README.md- This documentation
Game Logs (gamelogs/)
nba_games_cleansed.csv- Cleaned NBA game datanba_games.csv- Raw NBA game data
Standings (standings/)
standing_2016.csvtostanding_2024.csv- Season standings by year
Data Cleaning (cleaning_code/)
parse.ipynb- Data parsing notebookpreprocessing.ipynb- Data preprocessingscrape.ipynb- Web scraping scriptssort_by_year.ipynb- Temporal data organization
Statistical Models (models/ and tables/)
- Web Scraping: Automated data collection from NBA sources
- Data Cleaning: Comprehensive preprocessing and validation
- Temporal Organization: Multi-season data structuring (2016-2024)
- Logistic Regression: Team ranking and classification models
- Pseudo R-squared Analysis: Model performance evaluation
- Group-based Analysis: Hierarchical team performance modeling
- Regular Season: Complete game logs and standings
All analysis is fully reproducible:
- Clone this repository
- Install requirements
- Run Jupyter notebooks in specified order (need to change the directories accordingly)
- Results will match published findings
- NBA official statistics
- Historical game logs (2016-2024)
- Regular season standings
Our analysis provides:
- Team ranking algorithms with statistical validation
- Performance prediction models
- Comprehensive statistical tables
- Visualization of trends and patterns
If you use this research or data in your work, please cite:
[Li & Jabbari] (2025). NBA Analytics Research: Advanced Statistical Modeling
for Team Performance and Ranking. GitHub Repository.
https://github.com/JerryLi24/nba-analytics-research