A web scraper designed to extract and organize Northwestern University soccer team data from official team websites.
The app currently offers 16 teams available for selection:
NorthwesternIndianaOhio StateMarylandWashingtonUCLAMichigan StateMichiganRutgersWisconsinPenn StateUICLoyola ChicagoDePaulNorthern IllinoisChicago State
To select a team, begin typing the name as seen above and the app will attempt to autofill the selection.
Choose between downloading the team's roster, schedule, box scores, stats, and/or articles. Or click on the Select all toggle to quickly select all options.
The app currently defaults to downloading stats for 2024 and 2023. Though any combination of years may be selected if needed.
Options for years will be added at the start of every Fall NCAA season. However, please be aware that depending on the time of the current season, there might not be stats available to download yet.
The app currently defaults to downloading five of the most recent box scores or a given team. Any number ranging from 1-10 will be accepted.
Box scores are downloaded in order from newest to oldest. And only the current season will be searched. If there are not enough box scores available, the app will attempt to download as many as possible.
The app will default to ranging from the start of the most recent Fall NCAA season to the current date. Any range of dates will be accepted as long as the start date is before the end date.
During the download process, the app will search for and display articles that were published within the given date range.
Select which articles you would like to download and click on the Download Select Articles button to submit your choices.
Once everything has finished downloading, a Download PDFs button will appear. Clicking on this button will download the ZIP file containing all of the relevant PDFs to your local machine.
Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change. Follow the instructions below to get an instance running on your local machine.
Clone the repository on to your local machine.
git clone https://github.com/LarryLing/NU-Soccer-Web-Scraper.git
cd NU-Soccer-Web-ScraperCreate a virtual environment.
python -m venv .venvInstall the required packages.
pip install -r requirements.txt