Practical Part of the Master's Thesis: Data Science in Football

This repository contains the practical part of a master's thesis focused on the use of Data Science methods in football. The project is divided into separate analytical blocks that are methodologically related, but each of them can also be read on its own.

Repository Overview

xG/
Exploratory analysis of expected-value metrics in football. This section works with xG, xGA, and xPts, compares leagues, identifies long-term overperformance and underperformance, studies Leicester City's 2015/16 Premier League season, and explores team styles through clustering.
match_prediction/
A prediction-focused section aimed at building the strongest possible workflow for forecasting matches in the current German Bundesliga season. It covers data preparation, feature engineering, machine learning models, a double Poisson approach, market benchmarking, and next-matchday predictions.

Project Structure

DP/
|-- README.md
|-- xG/
|   |-- README.md
|   |-- xG_analysis.ipynb
|   |-- Data/
|   |-- Plots/
|   `-- src/
`-- match_prediction/
    |-- README.md
    |-- notebooks/
    |-- src/
    |-- data/
    `-- outputs/

How to Approach the Repository

The repository is primarily notebook-driven: the main analytical narrative is developed in .ipynb files.
Shared logic is moved into src/ modules to keep the work reproducible and reusable.
Data folders are separated by pipeline stage into raw, interim, and processed.
Outputs intended for interpretation and presentation are stored separately in outputs.
README files inside subfolders act as local guides that explain what the folder contains, why it exists, and when it matters in the workflow.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
match_prediction		match_prediction
xG		xG
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Practical Part of the Master's Thesis: Data Science in Football

Repository Overview

Recommended Reading Path

Project Structure

How to Approach the Repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Practical Part of the Master's Thesis: Data Science in Football

Repository Overview

Recommended Reading Path

Project Structure

How to Approach the Repository

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages