This project investigates the Transformer architecture, focusing on GPT-2 small, from a geometrical perspective. Our team dissected the model to inspect its algorithm and parameters, studying the dimensionality of the embedding space. We observed that words, although embedded in a 768-dimensional space, generally lie on a lower-dimensional manifold, reflecting the complex semantic structure of meaningful text. This analysis was conducted layer by layer, decoder by decoder, using both global and local methods such as intrinsic dimensionality estimation.
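The layer-by-layer intrinsic-dimension measurement can be illustrated in a few lines. The sketch below is a minimal illustration, not the project's actual pipeline: it assumes `transformers` and `dadapy` are installed, feeds a single short prompt through GPT-2 small, and applies DADApy's two-NN estimator to each layer's token representations (in practice, far more tokens than one prompt provides are needed for a stable estimate).

```python
# Minimal sketch: layer-by-layer intrinsic dimension of GPT-2 small hidden
# states. The prompt and the two-NN estimator are illustrative choices.
import torch
from dadapy import Data
from transformers import GPT2Tokenizer, GPT2Model

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2Model.from_pretrained("gpt2", output_hidden_states=True)
model.eval()

text = "The embedding of a meaningful sentence lives on a low-dimensional manifold."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# outputs.hidden_states holds 13 tensors of shape (1, seq_len, 768):
# the input embeddings plus the output of each of the 12 decoder blocks.
for layer, hidden in enumerate(outputs.hidden_states):
    points = hidden[0].numpy()                     # one point per token
    id_est, id_err, scale = Data(points).compute_id_2NN()
    print(f"layer {layer:2d}: intrinsic dimension ≈ {id_est:.1f}")
```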
The goal of this project is to gain a deeper understanding of the internal workings of the GPT-2 small model through a geometrical lens. We analyzed the dimensionality of the embedding space, studying how it changes across the model's layers and decoders. Our investigation also tracked the evolution of various metrics and the behaviour of the representation of the last word in a prompt, since it plays a crucial role in predicting the next word in the sequence.
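As one concrete way to watch the last token evolve (a logit-lens-style probe, offered here as an illustrative assumption rather than the project's own metric), each layer's representation of the final word can be pushed through the model's output head to see when the eventual next-word prediction emerges:

```python
# Hedged sketch: project the last token's hidden state at every layer
# through GPT-2's final layer norm and unembedding matrix.
import torch
from transformers import GPT2Tokenizer, GPT2LMHeadModel

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2", output_hidden_states=True)
model.eval()

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

ln_f, lm_head = model.transformer.ln_f, model.lm_head
for layer, hidden in enumerate(outputs.hidden_states):
    last = hidden[0, -1]                 # 768-dim state of the last token
    logits = lm_head(ln_f(last))         # decode it as if it were the top layer
    top_token = tokenizer.decode([int(logits.argmax())])
    print(f"layer {layer:2d}: predicted next word = {top_token!r}")
```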
To get started with this project, follow these steps:
- Clone the repository:

  ```bash
  git clone https://github.com/adadiorio/Project-LCP-mod-B
  cd Project-LCP-mod-B
  ```

- Install the required packages:

  ```bash
  pip install -r requirements.txt
  ```
To run the analysis, execute the following in order (see the convenience sketch after this list):
- `create_combined_directories_with_subdirs`: generates all the required subdirectories;
- `PreRun`: generates the necessary data;
- `Decoderwise_statistical_analysis`: analyzes in detail the behaviour of every single piece of the model;
- `IDwise_statistical_analysis`: analyzes the global behaviour of the model.
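If the four steps above are Jupyter notebooks named after the steps (an assumption about the repository layout), they can be executed in order from Python via nbconvert:

```python
# Convenience sketch: run the pipeline end to end.
# The .ipynb filenames are assumptions; adjust them to the repository's files.
import subprocess

notebooks = [
    "create_combined_directories_with_subdirs.ipynb",
    "PreRun.ipynb",
    "Decoderwise_statistical_analysis.ipynb",
    "IDwise_statistical_analysis.ipynb",
]
for nb in notebooks:
    subprocess.run(
        ["jupyter", "nbconvert", "--to", "notebook", "--execute", "--inplace", nb],
        check=True,  # abort the pipeline if a step fails
    )
```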
- Vaswani et al., 2017, Attention Is All You Need
- Glielmo et al., 2022, DADApy: Distance-based analysis of data-manifolds in Python
- Denti et al., 2022, The generalized ratios intrinsic dimension estimator