An interactive tool for visualizing how transformers process text and for comparing the original 2017 Transformer architecture with modern LLM designs.
The goal of this project is to help developers, students, and researchers see how tokens move through a transformer model.
https://attention-amber.vercel.app
Enter any prompt and watch how it flows through the transformer pipeline.
The visualization demonstrates how text moves through the core stages of a transformer.
Input Text → Tokenization → Embedding → Positional Encoding → Attention → Feed Forward → Output Probabilities
Each stage is shown as a block so you can understand how tokens change step by step.
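To make the flow concrete, here is a minimal TypeScript sketch of the pipeline as a chain of stage functions. The names, types, and the toy 8-dimensional embedding are illustrative assumptions, not the project's actual source:

```ts
// Illustrative sketch only; these stage functions are hypothetical, not the app's source.
type Token = { id: number; text: string };
type Vector = number[];

// Tokenization: split the prompt into pieces (real tokenizers use subwords, not whitespace).
const tokenize = (input: string): Token[] =>
  input.split(/\s+/).filter(Boolean).map((text, id) => ({ id, text }));

// Embedding: map each token to a small numeric vector (toy 8-dimensional values).
const embed = (tokens: Token[]): Vector[] =>
  tokens.map((t) => Array.from({ length: 8 }, (_, i) => Math.sin(t.id + i)));

// Positional encoding: mix position information into each vector.
const addPositions = (vectors: Vector[]): Vector[] =>
  vectors.map((v, pos) =>
    v.map((x, i) => x + Math.sin(pos / 10000 ** (i / v.length)))
  );

// Attention, feed-forward, and the output projection follow the same shape
// (Vector[] in, Vector[] out), ending in per-token output probabilities.
const pipeline = (input: string): Vector[] => addPositions(embed(tokenize(input)));

console.log(pipeline("Attention is all you need"));
```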
The interface shows two pipelines side-by-side:
- Classic Transformer (2017)
- Modern LLM Transformer
This helps visualize how transformer design evolved over time.
A side-by-side breakdown of the original architecture versus what powers today's state-of-the-art models (a few of the rows are illustrated with short code sketches after the table).
| Component | 🏛️ Classic Transformer (2017) | 🚀 Modern LLM Transformer |
|---|---|---|
| Origin | Based on "Attention Is All You Need" | Powering LLaMA, Mistral, and other modern LLMs |
| Tokenization | Breaks text into smaller pieces (tokens). | Same idea: convert text into tokens. |
| Embedding | Converts tokens into numerical vectors. | Transforms tokens into dense vectors. |
| Positional Encoding | **Sinusoidal Positional Encoding**<br>Adds position info so the model knows word order. | **Rotary Positional Embedding (RoPE)**<br>Encodes position by rotating vectors. |
| Normalization | **LayerNorm (Post-Norm)**<br>Normalizes activations after each block. | **RMSNorm (Pre-Norm)**<br>Normalizes inputs before each block for stability. |
| Attention Mechanism | **Multi-Head Attention**<br>Each token looks at other tokens for context. | **Grouped Query Attention (GQA)**<br>Shares key/value heads to save memory. |
| Feed Forward | **ReLU Feed Forward**<br>A small neural network refines each token. | **SwiGLU Feed Forward**<br>Gated network for better capacity & stability. |
| Residual Connections | Keeps original info while adding new info. | Combines new info with original input. |
| Generation Speedup | Standard decoding (recomputes attention at every step). | **KV Cache**<br>Stores attention states for faster generation. |
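To make the positional-encoding row concrete, the sketch below contrasts the two approaches: sinusoidal values are added to the embedding, while RoPE rotates pairs of query/key dimensions by a position-dependent angle inside attention. Function names and shapes are illustrative assumptions, not the project's code:

```ts
type Vector = number[];

// Classic (2017): sinusoidal values are *added* to the token embedding.
function sinusoidalEncode(embedding: Vector, pos: number): Vector {
  return embedding.map((x, i) => {
    const angle = pos / 10000 ** ((2 * Math.floor(i / 2)) / embedding.length);
    return x + (i % 2 === 0 ? Math.sin(angle) : Math.cos(angle));
  });
}

// Modern (RoPE): pairs of dimensions are *rotated* by a position-dependent angle.
// Applied to query/key vectors inside attention; assumes an even-length vector.
function ropeEncode(vector: Vector, pos: number): Vector {
  const out = vector.slice();
  for (let i = 0; i < vector.length; i += 2) {
    const theta = pos / 10000 ** (i / vector.length);
    const [a, b] = [vector[i], vector[i + 1]];
    out[i] = a * Math.cos(theta) - b * Math.sin(theta);
    out[i + 1] = a * Math.sin(theta) + b * Math.cos(theta);
  }
  return out;
}
```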
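The normalization row can be sketched the same way: LayerNorm subtracts the mean and divides by the standard deviation (applied after each sub-layer in the 2017 design), while RMSNorm only divides by the root mean square (applied before each sub-layer in modern models). This is a simplified sketch; the learned scale and bias parameters are omitted:

```ts
type Vector = number[];

// LayerNorm: subtract the mean, divide by the standard deviation.
// The classic Transformer applies it *after* each sub-layer (post-norm).
function layerNorm(v: Vector, eps = 1e-5): Vector {
  const mean = v.reduce((s, x) => s + x, 0) / v.length;
  const variance = v.reduce((s, x) => s + (x - mean) ** 2, 0) / v.length;
  return v.map((x) => (x - mean) / Math.sqrt(variance + eps));
}

// RMSNorm: divide by the root mean square only (no mean subtraction).
// Modern LLMs apply it *before* each sub-layer (pre-norm) for stability.
function rmsNorm(v: Vector, eps = 1e-5): Vector {
  const ms = v.reduce((s, x) => s + x * x, 0) / v.length;
  return v.map((x) => x / Math.sqrt(ms + eps));
}
```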
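Finally, the generation-speedup row: without a cache, every new token would recompute keys and values for the entire prefix; a KV cache appends only the new token's key/value and reuses the rest. The cache shape here is a hypothetical simplification:

```ts
type Vector = number[];

// Per attention layer: the key/value vectors of every token seen so far.
type KVCache = { keys: Vector[]; values: Vector[] };

// Each decoding step appends the new token's key/value instead of
// recomputing attention states for the whole sequence.
function appendToCache(cache: KVCache, key: Vector, value: Vector): KVCache {
  return { keys: [...cache.keys, key], values: [...cache.values, value] };
}
```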
- Visual comparison of classic vs modern transformer architectures
- Interactive token flow visualization
- Minimal interface focused on understanding model internals
- Educational tool for learning transformer mechanics
Frontend
- React
- TypeScript
- Vite
UI
- TailwindCSS
- shadcn/ui
Deployment
- Vercel
Transformers power most modern AI systems, but their internal mechanics are often difficult to understand.
This project makes it easier to see what happens inside the model by visualizing each stage of the transformer pipeline.
- Attention map visualization
- Step-by-step transformer execution
- Token probability heatmaps
- Integration with small real transformer models
MIT License