EE-452: Network Machine Learning

Graph-based EEG Analysis for Seizure Detection

About • Data • Method • How To Use • Code Structure • Results • Credits • License

Team

This project was carried out by:

Igor Pavlovic - @Igzi

Strahinja Nikolic - @strajdzsha

Milica Vukasinovic - @milicavukaa

Marija Rakonjac - @marijarakonjac

About

This repository contains the work done for our Network Machine Learning (EE-452) project at EPFL. The goal of the project is to explore the use of graph-based methods for EEG signal processing, with a focus on seizure detection. EEG recordings are represented as time series captured from multiple electrodes placed on the scalp, and we evaluate both graph-based models and non-graph-based baselines. We use a subset of the Temple University Hospital EEG Seizure Corpus (TUSZ), and compare different architectures on their ability to detect seizures from 12-second EEG segments. Our work includes extensive experimentation, performance analysis, and participation in a Kaggle competition hosted for the course.

Data

We use a subset of the Temple University Hospital EEG Seizure Corpus (TUSZ), one of the largest publicly available EEG datasets for seizure detection.

Patients: The dataset includes recordings from 50 patients for training and 25 for testing.
Sampling: EEG signals are recorded from 19 electrodes at 250 Hz using the standard 10–20 system.
Segments: The recordings are divided into non-overlapping 12-second windows.
Labels: Each window in the training set is labeled as either normal brain activity or seizure activity.
Graphs: For graph-based approaches, electrodes are modeled as nodes, and edges are constructed based on 3D distances between electrodes (provided via distances_3d.csv).
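
As a rough illustration of the graph construction described above, the sketch below turns a pairwise electrode distance matrix into a weighted adjacency matrix using a thresholded Gaussian kernel. The kernel choice, the `sigma` and `threshold` parameters, and the helper name `distances_to_adjacency` are assumptions made for illustration; the notebooks may build edges differently. The file layout of distances_3d.csv is not assumed here, so the example uses a small synthetic matrix.

```python
import numpy as np


def distances_to_adjacency(dist, sigma, threshold=0.1):
    """Map a symmetric (n, n) distance matrix to a weighted adjacency matrix.

    Hypothetical helper: applies a Gaussian kernel exp(-d^2 / sigma^2) and
    zeroes out weights below `threshold`. Not necessarily what the notebooks do.
    """
    dist = np.asarray(dist, dtype=float)
    if dist.ndim != 2 or dist.shape[0] != dist.shape[1]:
        raise ValueError("dist must be a square matrix")
    adj = np.exp(-(dist ** 2) / (sigma ** 2))
    adj[adj < threshold] = 0.0   # sparsify: keep only nearby electrodes
    np.fill_diagonal(adj, 0.0)   # no self-loops
    return adj


# Synthetic example: 4 electrodes on a line, 1 unit apart (not real data).
positions = np.arange(4, dtype=float)
dist = np.abs(positions[:, None] - positions[None, :])
adj = distances_to_adjacency(dist, sigma=1.0, threshold=0.1)
```

In this toy case only immediate neighbours stay connected, because the kernel weight for a distance of 2 units falls below the threshold.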

📥 Download Instructions

⚠️ The dataset is not included in this repository due to licensing restrictions and its large size.

To use this project, download the dataset manually and organize it as follows:

data/
├── train/
│   ├── signals/
│   └── segments.parquet
├── test/
│   ├── signals/
│   └── segments.parquet
└── distances_3d.csv

Make sure the data/ directory is placed at the root level of the project or adjust the notebook paths accordingly.
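
A quick way to confirm the layout before opening a notebook is to check that each expected entry exists. The sketch below is a hypothetical helper, not part of the repository; the name `missing_data_paths` and the default `data` root are assumptions.

```python
from pathlib import Path

# Expected entries relative to the data/ directory, as listed in the layout above.
EXPECTED = [
    "train/signals",
    "train/segments.parquet",
    "test/signals",
    "test/segments.parquet",
    "distances_3d.csv",
]


def missing_data_paths(data_root="data"):
    """Return the expected entries that do not exist under `data_root`."""
    root = Path(data_root)
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_data_paths("data")
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Data layout looks complete.")
```

Run it from the project root; if the data lives elsewhere, pass that path to `missing_data_paths` instead.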

📜 Usage of the dataset is subject to the TUSZ data license agreement. Redistribution is not permitted, and any attempt to re-identify anonymized subjects is strictly prohibited.

Method

The goal of this project is to compare graph-based and non-graph-based methods for seizure detection from EEG signals. We evaluate multiple deep learning architectures to understand the benefits of modeling spatial relationships between electrodes.

Approaches:

Graph Neural Networks (GNNs):
- We use architectures such as Graph Attention Networks (GAT) and Spatio-Temporal Graph Convolutional Networks (ST-GCN).
- EEG signals are modeled as graphs, where each node represents an electrode and edges represent physical proximity or learned functional connectivity.
- Graph structures are based on 3D distances between electrodes (distances_3d.csv) and visualized using the 10–20 EEG layout.
Non-graph-based Baselines:
- We implement standard models such as 1D CNNs and LSTMs, treating EEG signals as multivariate time series.
- These models serve as a baseline for comparing performance and interpretability against GNNs.
Feature Extraction:
- Hjorth parameters (Activity, Mobility, Complexity) are computed as classical EEG features.
- Experiments are conducted with and without feature extraction to test its influence on model performance.
Training & Evaluation:
- All models are trained on labeled 12-second EEG windows.
- Evaluation is done via cross-validation on the training set and submission to a private Kaggle leaderboard.
- Models are implemented using PyTorch and PyTorch Geometric.
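
To make the feature-extraction step concrete, here is a minimal NumPy sketch of the three Hjorth parameters computed per electrode on one 12-second window (19 channels × 3000 samples at 250 Hz). It follows the textbook definitions with finite differences as derivatives. It is not taken from hjorth_features.py, and the function name and the synthetic input are assumptions.

```python
import numpy as np

FS = 250          # sampling rate in Hz (from the Data section)
WINDOW_SEC = 12   # window length in seconds
N_CHANNELS = 19   # electrodes in the 10-20 montage


def hjorth_parameters(window):
    """Compute Hjorth Activity, Mobility and Complexity per channel.

    window: array of shape (n_channels, n_samples).
    Returns an array of shape (n_channels, 3).
    """
    x = np.asarray(window, dtype=float)
    dx = np.diff(x, axis=1)    # first derivative (finite difference)
    ddx = np.diff(dx, axis=1)  # second derivative

    var_x = x.var(axis=1)
    var_dx = dx.var(axis=1)
    var_ddx = ddx.var(axis=1)

    activity = var_x                                   # signal power
    mobility = np.sqrt(var_dx / var_x)                 # proxy for mean frequency
    complexity = np.sqrt(var_ddx / var_dx) / mobility  # deviation from a pure sine
    return np.stack([activity, mobility, complexity], axis=1)


# Synthetic window standing in for one 12-second segment (not real EEG).
rng = np.random.default_rng(0)
window = rng.standard_normal((N_CHANNELS, FS * WINDOW_SEC))  # shape (19, 3000)
features = hjorth_parameters(window)                          # shape (19, 3)
```

Flattening `features` gives a 57-dimensional vector per window that a classical classifier can consume. A constant (zero-variance) channel would produce a division by zero here, so real recordings may need a guard for flat channels.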

Objective:

By comparing these approaches, we aim to highlight the potential of graph-based methods in capturing spatial dependencies and improving classification accuracy for seizure detection.

How To Use

Each model (e.g., CNN, GAT, STGCN) is implemented and trained through a dedicated Jupyter notebook located in its corresponding folder. These notebooks handle training, evaluation, and submission file generation.

Running the Models

To run an experiment:

Navigate to the model directory (e.g., CNN/, GAT/, STGCN/).
Open the notebook (e.g., cnn.ipynb) in Jupyter or VS Code.
Run all cells to:
- Train the model
- Evaluate it using validation data
- Generate a CSV file for Kaggle submission

Example (for CNN model):

cd CNN
jupyter notebook cnn.ipynb

Code Structure

│   .gitignore                # Specifies files and directories to be ignored by Git
│   adjacency_correlation.csv # Correlation-based adjacency matrix
│   evaluation.py             # Script to evaluate trained models
│   hjorth_features.py        # Script for computing Hjorth parameters
│   initial_sem_embs.txt      # Initial semantic embeddings (not actively used in this project)
│   requirements.txt          # List of required Python packages for running the notebooks
│   README.md                 # Project documentation
│   Report.pdf                # Project report
│   start_jupyter_server.bash # Bash script to launch a local Jupyter server (optional)
│
├───CNN                       # Convolutional Neural Network (time-domain and frequency-domain approaches)
│       cnn.ipynb
│       cnn_submission_fft.csv
│       cnn_submission_time.csv
│
├───ChebNet                   # Spectral GNN using Chebyshev polynomial filters
│       chebnet.ipynb
│       submission_seed5.csv
│
├───DCRNN                     # Diffusion Convolutional Recurrent Neural Network for spatio-temporal modeling
│       dcrnn.ipynb
│       submission_dcrnn.csv
│
├───GAT                       # Graph Attention Network for seizure detection using EEG graphs
│       GAT.ipynb
│
├───Hjorth                    # Models using Hjorth parameters (Activity, Mobility, Complexity)
│       hjorth.ipynb
│       hjorth.py
│       submission_hjorth_rf.csv
│
├───Other                     # Miscellaneous models and experiments (e.g., MLP, Transformer)
│       MLP.ipynb
│       Transformer.ipynb
│       example.ipynb
│       example.md
│
├───RNN                       # Recurrent Neural Network models (LSTM, BiLSTM)
│       BiLSTM copy.ipynb
│       BiLSTM.ipynb
│       RNN-full-sequence.ipynb
│       RNN.ipynb
│       submission_seed1.csv
│
├───STGCN                     # Spatio-Temporal Graph Convolutional Network
│       STGCN.ipynb
│       submission_full_sequence_256.csv
│       submission_full_sequence_512.csv
│       submission_seed1.csv
│       submission_smote_full_sequence.csv
│
├───Spectrogram               # Model based on spectrogram features of EEG signal
│       spectrogram.ipynb
│
├───data                      # EEG graph metadata and resources
│       distances_3d.csv
│

Results

We evaluated both sequence-based and graph-based models using 5-fold cross-validation. The primary metric was the F1 score, which is more informative than accuracy when the classes are imbalanced.
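
The toy example below illustrates why F1 is preferred over accuracy in this setting. The labels are made up (this README does not state the actual class ratio), and the helper names are hypothetical. Returning 0.0 when there are no true positives, false positives or false negatives is a convention chosen for this sketch.

```python
def f1_score_binary(y_true, y_pred, positive=1):
    """F1 = 2 * TP / (2 * TP + FP + FN) for the positive (seizure) class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def accuracy(y_true, y_pred):
    """Fraction of windows whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


# Toy labels (not project data): 1 seizure window out of 10.
y_true = [0, 0, 0, 0, 0, 0, 0, 0, 0, 1]
always_normal = [0] * 10  # a classifier that never predicts a seizure

acc = accuracy(y_true, always_normal)        # 0.9
f1 = f1_score_binary(y_true, always_normal)  # 0.0
```

A model that never flags a seizure looks strong on accuracy but scores zero on F1, which is the behaviour the metric is meant to expose.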

F1 Scores – Sequence-Based Models

Fold	BiLSTM	Hjorth	CNN	Spectrogram
Fold 1	0.7421	0.8004	0.6907	0.7365
Fold 2	0.6791	0.6865	0.7035	0.6900
Fold 3	0.6545	0.6936	0.7314	0.7423
Fold 4	0.6801	0.7198	0.6975	0.7039
Fold 5	0.7525	0.7393	0.7174	0.6863
Avg	0.7017	0.7279	0.7081	0.7118
Std	0.0385	0.0409	0.0146	0.0234

F1 Scores – Graph-Based Models

Fold	ST-GCN	ChebNet	GAT
Fold 1	0.6951	0.6426	0.6897
Fold 2	0.6570	0.5855	0.5750
Fold 3	0.6596	0.6187	0.6335
Fold 4	0.6140	0.5828	0.6922
Fold 5	0.6743	0.5679	0.6355
Avg	0.6600	0.5995	0.6452
Std	0.0267	0.0272	0.0432
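
The Avg and Std rows can be recomputed from the per-fold scores. The sketch below does this for the BiLSTM column of the sequence-based table using only the standard library. For this column the reported Std matches the population standard deviation (dividing by 5) rather than the sample standard deviation (dividing by 4). Whether the same convention was applied to every column is an assumption.

```python
from statistics import mean, pstdev, stdev

# Fold scores for BiLSTM, copied from the sequence-based table.
bilstm_f1 = [0.7421, 0.6791, 0.6545, 0.6801, 0.7525]

avg = round(mean(bilstm_f1), 4)               # 0.7017, matches the Avg row
std_population = round(pstdev(bilstm_f1), 4)  # 0.0385, matches the Std row
std_sample = round(stdev(bilstm_f1), 4)       # 0.0431, divides by n - 1 instead
```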

📌 Note: No graph-based model outperformed the best sequence-based model (Hjorth + RF, average F1 of 0.7279 versus 0.6600 for ST-GCN, the strongest graph-based model). Graph-based models nonetheless offer interpretability advantages and could be extended in future work through dynamic graph construction and semantic embeddings.

Credits

This project was carried out as part of the EE-452: Network Machine Learning course at EPFL. We would like to thank the course staff for their guidance and support throughout the project.

We also acknowledge the creators of the Temple University Hospital EEG Seizure Corpus (TUSZ) for providing access to the dataset used in this work. Use of the dataset complies with the terms outlined in the official TUSZ agreement.

All models were implemented using open-source libraries including PyTorch, PyTorch Geometric, and scikit-learn.

Igzi/nml_project
