
# Fovea


**Flexible Ontology Visual Event Analyzer**


Documentation · Releases · Discussions · Changelog

## What is Fovea?

Fovea is a web-based video annotation platform for analysts who need to develop custom annotation ontologies for video data. It supports a persona-based approach where different analysts define their own interpretive frameworks and assign different semantic types to the same real-world objects.

The platform combines manual annotation with AI-supported features including video summarization, object detection, ontology suggestions, and claim extraction.

## Technology stack

| Layer | Technology |
| --- | --- |
| Frontend | React 18, TypeScript, Material UI v5, TanStack Query, Zustand, Vite |
| Backend | Node.js 22, Fastify 5, Prisma 6, BullMQ 5, TypeBox |
| Model service | Python 3.12, FastAPI, PyTorch, Transformers, SGLang, vLLM |
| Databases | PostgreSQL 16, Redis 7 |
| Infrastructure | Docker, OpenTelemetry, Prometheus, Grafana |
| Testing | Vitest, Playwright, pytest, MSW |

## Getting started

### Prerequisites

- Docker Desktop 4.0+ with Docker Compose v2
- 8 GB RAM minimum (16 GB recommended)
- NVIDIA GPU + CUDA (optional, for GPU-accelerated inference)

### Quick start

```bash
git clone https://github.com/parafovea/fovea.git
cd fovea
docker compose up
```

Open http://localhost:3000 and log in with `admin` / `admin`.

Place `.mp4` video files in the `videos/` directory to start annotating.

### GPU mode

For NVIDIA GPU-accelerated inference:

```bash
docker compose --profile gpu up
```
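Enabling the `gpu` profile typically attaches the NVIDIA runtime to the inference container. A representative Compose fragment is shown below — this illustrates standard Docker Compose GPU reservation syntax, not the repository's exact file:

```yaml
services:
  model-service:
    profiles: ["gpu"]
    deploy:
      resources:
        reservations:
          devices:
            # Requires the NVIDIA Container Toolkit on the host.
            - driver: nvidia
              count: all
              capabilities: [gpu]
```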

### Configuration

Create a `.env` file to customize settings:

```bash
cp .env.example .env
```

| Variable | Default | Description |
| --- | --- | --- |
| `ADMIN_PASSWORD` | `admin` | Admin password (change for production) |
| `FOVEA_MODE` | `multi-user` | `single-user` (no login) or `multi-user` |
| `ALLOW_REGISTRATION` | `false` | Allow new user sign-ups |
| `ANTHROPIC_API_KEY` | (unset) | Claude API key for external AI |
| `OPENAI_API_KEY` | (unset) | OpenAI API key for external AI |
| `GOOGLE_API_KEY` | (unset) | Google API key for external AI |

External API keys are optional. Fovea works with local models when no keys are configured.
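For example, a local single-user setup with an external Claude key might use a `.env` like this (all values illustrative):

```dotenv
ADMIN_PASSWORD=change-me
FOVEA_MODE=single-user
ALLOW_REGISTRATION=false
ANTHROPIC_API_KEY=<your-key-here>
```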

## Features

### Video annotation

- Bounding box annotation with draw, resize, and drag
- Keyframe-based sequences with linear and Bézier interpolation
- Canvas timeline with playhead scrubbing, zoom (1–10x), and keyboard navigation
- Automated tracking (SAMURAI, SAM2, YOLO11-seg) for bootstrapping annotations
- JSON Lines import/export with conflict resolution
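The keyframe interpolation above can be sketched in a few lines. This is a minimal linear-interpolation illustration, not Fovea's actual implementation — the `Box` and `Keyframe` shapes are assumptions:

```typescript
// Illustrative shapes; Fovea's real annotation schema may differ.
interface Box {
  x: number;
  y: number;
  w: number;
  h: number;
}

interface Keyframe {
  frame: number;
  box: Box;
}

// Linearly interpolate each box field between the two keyframes that
// bracket `frame`; clamp to the nearest keyframe outside the range.
function interpolate(keyframes: Keyframe[], frame: number): Box {
  const sorted = [...keyframes].sort((a, b) => a.frame - b.frame);
  if (frame <= sorted[0].frame) return sorted[0].box;
  const last = sorted[sorted.length - 1];
  if (frame >= last.frame) return last.box;
  const i = sorted.findIndex((k) => k.frame > frame);
  const [a, b] = [sorted[i - 1], sorted[i]];
  const t = (frame - a.frame) / (b.frame - a.frame);
  const lerp = (p: number, q: number) => p + (q - p) * t;
  return {
    x: lerp(a.box.x, b.box.x),
    y: lerp(a.box.y, b.box.y),
    w: lerp(a.box.w, b.box.w),
    h: lerp(a.box.h, b.box.h),
  };
}
```

Bézier interpolation replaces the constant-rate `t` with an eased parameter but otherwise follows the same bracketing scheme.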

### Ontology management

- Persona-scoped types: entities, roles, events, and relations
- AI-powered type suggestions via LLM integration
- Wikidata integration with one-click import and ID mapping
- Rich text gloss editor with autocomplete and claim references

### Video summarization

- Vision Language Model analysis with persona context
- Audio transcription with speaker diarization (7 providers)
- Audio-visual fusion strategies for multimodal understanding
- Background processing with real-time progress updates

### Claims system

- Hierarchical claims and subclaims with manual editing
- LLM-powered extraction and synthesis
- Typed relations with filtering and search
- Provenance tracking and span highlighting

### Object detection

- Multi-model support: YOLO-World, OWLv2, Florence-2, Grounding DINO
- Ontology-aware query prompts
- Detection candidate review with accept/reject controls

### AI model service

- YAML-based model configuration with per-task selection
- GPU inference: SGLang, vLLM, Transformers with 4-bit quantization
- External APIs: Anthropic Claude, OpenAI GPT, Google Gemini
- Model status dashboard with VRAM monitoring
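A per-task model configuration in this style might look like the following sketch. The keys, backends, and model identifiers here are assumptions for illustration, not Fovea's actual schema:

```yaml
# Hypothetical per-task model selection, one entry per AI task.
tasks:
  summarization:
    backend: sglang          # or: vllm, transformers, anthropic, openai, google
    model: Qwen/Qwen2-VL-7B-Instruct
    quantization: 4bit
  detection:
    backend: transformers
    model: IDEA-Research/grounding-dino-base
```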

### Authentication

- Session-based auth with progressive lockout
- Single-user mode for local use
- User-scoped API keys with AES-256-GCM encryption
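API-key encryption with AES-256-GCM can be sketched with Node's built-in `crypto` module. This illustrates the technique only — Fovea's actual key derivation, storage format, and function names may differ:

```typescript
import { createCipheriv, createDecipheriv, randomBytes } from "node:crypto";

// Encrypt a user's API key under a 32-byte master key. A fresh 12-byte
// IV is generated per call; the GCM auth tag detects any tampering.
function encryptApiKey(plaintext: string, key: Buffer): string {
  const iv = randomBytes(12);
  const cipher = createCipheriv("aes-256-gcm", key, iv);
  const ct = Buffer.concat([cipher.update(plaintext, "utf8"), cipher.final()]);
  // Store IV, auth tag, and ciphertext together as base64 segments.
  return [iv, cipher.getAuthTag(), ct].map((b) => b.toString("base64")).join(".");
}

function decryptApiKey(encoded: string, key: Buffer): string {
  const [iv, tag, ct] = encoded.split(".").map((s) => Buffer.from(s, "base64"));
  const decipher = createDecipheriv("aes-256-gcm", key, iv);
  decipher.setAuthTag(tag); // throws on decrypt if ciphertext was altered
  return Buffer.concat([decipher.update(ct), decipher.final()]).toString("utf8");
}
```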

## Project structure

```
fovea/
├── annotation-tool/        # Frontend (React + TypeScript + Vite)
├── server/                 # Backend (Fastify + Prisma)
├── model-service/          # AI model service (FastAPI + PyTorch)
├── wikibase/               # Wikibase data loader (Python)
├── docs/                   # Documentation (Docusaurus)
├── docker-compose.yml      # Service orchestration
└── .github/workflows/      # CI/CD pipelines
```

## Development

### Manual setup

```bash
# Start databases
docker compose up -d postgres redis

# Backend
cd server && npm install && npx prisma migrate dev && npx prisma db seed && npm run dev

# Frontend (new terminal)
cd annotation-tool && npm install && npm run dev

# Model service (new terminal, optional)
cd model-service && python3.12 -m venv venv && source venv/bin/activate
pip install -e . && uvicorn src.main:app --reload --port 8000
```

### Dev mode with hot reload

```bash
docker compose -f docker-compose.yml -f docker-compose.dev.yml up --build
```

Includes hot-reload volumes, Jaeger tracing at `localhost:16686`, and Maildev at `localhost:1080`.

### Running tests

Frontend:

```bash
cd annotation-tool
npm run test              # Vitest unit tests
npm run test:e2e          # Playwright E2E tests
npm run lint              # ESLint
npx tsc --noEmit          # Type check
```

Backend:

```bash
cd server
npm run test              # Vitest unit tests
npm run lint              # ESLint
```

Model service:

```bash
cd model-service
uv run pytest             # Unit tests
uv run ruff check src/    # Lint
uv run mypy src/          # Type check
```

## Monitoring

| Service | URL |
| --- | --- |
| Grafana | `localhost:3002` (admin/admin) |
| Prometheus | `localhost:9090` |
| Bull Board | `localhost:3001/admin/queues` |

## Contributing

Contributions are welcome. See CONTRIBUTING.md for development setup, coding standards, and PR process.

## License

MIT
