NVIDIA Free Models OpenAI Endpoint

A lightweight FastAPI service that scrapes NVIDIA's list of free preview models and exposes it through an OpenAI-compatible /v1/models API.

Designed for use with Bifrost or any OpenAI-compatible client.

Quick Start

Prerequisites

Docker & Docker Compose

Run with Docker Compose

# Create env file
cp env.example .env

# Build the service
docker compose build

# Start the service
docker compose up -d

The API will be available at http://localhost:8101 (or on the port you set in APP_PORT).

Local Development

# Install dependencies
poetry install

# Run with auto-reload
poetry run uvicorn src.service.api.app:app --host 0.0.0.0 --port 8101 --reload

Configuration

Settings are managed via settings.toml and environment variables:

| Variable | Description | Default |
| --- | --- | --- |
| `APP_PORT` | Host port for the service | `8101` |
| `free_models_url` | NVIDIA models page URL | `https://build.nvidia.com/models?filters=nimType%3Anim_type_preview&pageSize=96` |
| `cache_ttl_minutes` | Cache duration for the model list, in minutes | `30` |
| `fetch_timeout_seconds` | HTTP request timeout, in seconds | `30` |
| `user_agent` | User-Agent header for requests | Chrome-based User-Agent string |
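To illustrate what `cache_ttl_minutes` controls, here is a minimal sketch of a time-based cache that refetches the model list only after the TTL has elapsed. The `TTLCache` class, its `clock` parameter, and the `fetch` callable are hypothetical names chosen for this example; the project's actual caching code may look different.

```python
import time
from typing import Callable, Generic, Optional, TypeVar

T = TypeVar("T")


class TTLCache(Generic[T]):
    """Hypothetical helper: keeps one value for a fixed number of minutes."""

    def __init__(self, ttl_minutes: float, clock: Callable[[], float] = time.monotonic) -> None:
        self._ttl_seconds = ttl_minutes * 60
        self._clock = clock  # injectable so the expiry logic can be tested
        self._value: Optional[T] = None
        self._has_value = False
        self._expires_at = 0.0

    def get(self, fetch: Callable[[], T]) -> T:
        """Return the cached value, calling fetch() only when it is missing or expired."""
        now = self._clock()
        if not self._has_value or now >= self._expires_at:
            self._value = fetch()
            self._has_value = True
            self._expires_at = now + self._ttl_seconds
        return self._value
```

With the default of 30 minutes, repeated calls to `get` within that window would reuse the stored list instead of requesting the NVIDIA page again.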

API Endpoints

`GET /v1/models`

Returns a list of available free preview models in OpenAI format.

Example response:

{
  "object": "list",
  "data": [
    {
      "id": "nvidia/cosmos3-nano",
      "object": "model",
      "created": 1780395340,
      "owned_by": "nvidia"
    }
  ]
}
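The response shape above can be sketched as a small builder that wraps scraped model IDs in an OpenAI-style list object. The function name `to_openai_model_list` is hypothetical, and filling `created` with the current time is an assumption; this README does not say what the service puts in that field. The `owned_by` value follows the prefix rule described in the Notes section.

```python
import time
from typing import Any, Dict, List, Optional


def to_openai_model_list(model_ids: List[str], created: Optional[int] = None) -> Dict[str, Any]:
    """Hypothetical helper: wrap model IDs in an OpenAI-style list response."""
    # Assumption: "created" defaults to the time the response is built.
    timestamp = int(time.time()) if created is None else created
    data = []
    for model_id in model_ids:
        # owned_by is the part before the first "/" (e.g. "meta/llama-3.1" gives "meta").
        owner = model_id.split("/", 1)[0]
        data.append(
            {
                "id": model_id,
                "object": "model",
                "created": timestamp,
                "owned_by": owner,
            }
        )
    return {"object": "list", "data": data}
```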

`GET /health`

Health check endpoint for Docker Compose.

Response:

{
  "status": "ok"
}
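As a rough illustration of how a client or monitoring script might use this endpoint, the sketch below polls /health with the Python standard library only. The helper names `fetch_json` and `is_healthy` and the 5-second timeout are assumptions made for this example, not part of the service.

```python
import json
import urllib.request
from typing import Any


def fetch_json(base_url: str, path: str, timeout: float = 5.0) -> Any:
    """Hypothetical helper: GET a JSON document from the service."""
    url = base_url.rstrip("/") + path
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


def is_healthy(base_url: str) -> bool:
    """Return True only when /health answers with {"status": "ok"}."""
    try:
        return fetch_json(base_url, "/health") == {"status": "ok"}
    except (OSError, ValueError):
        # Connection failures and HTTP errors are OSError subclasses;
        # malformed JSON raises a ValueError subclass.
        return False
```

Calling `is_healthy("http://localhost:8101")` against a running instance should return `True`; the same `fetch_json` helper could be pointed at `/v1/models`.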

Project Structure

src/
  service/api/     # FastAPI application
  cases/nvidia_models/  # Scraping logic
data/logs/         # Application logs
compose.yml        # Docker Compose configuration
settings.toml      # Default settings
pyproject.toml     # Python dependencies

Notes

The scraper relies on the page's HTML structure, specifically elements marked with data-nvtrack-nav-object="artifact-card". If NVIDIA changes its page layout, the parser may need updates.
For production use, prefer an official NVIDIA API if one becomes available.
The owned_by field is derived from the model ID prefix (e.g., meta/llama-3.1 → owned_by: "meta").
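To make the dependency on the page markup concrete, here is a minimal sketch of a parser that looks for the `data-nvtrack-nav-object="artifact-card"` attribute using only the standard library. It assumes each card element carries an `href` of the form `/<owner>/<model>`; that detail is a guess for illustration and is not confirmed by this README. The names `ArtifactCardParser` and `extract_model_ids` are hypothetical.

```python
from html.parser import HTMLParser
from typing import List, Optional, Tuple


class ArtifactCardParser(HTMLParser):
    """Hypothetical parser: collects model IDs from artifact-card elements."""

    def __init__(self) -> None:
        super().__init__()
        self.model_ids: List[str] = []

    def handle_starttag(self, tag: str, attrs: List[Tuple[str, Optional[str]]]) -> None:
        attributes = dict(attrs)
        if attributes.get("data-nvtrack-nav-object") != "artifact-card":
            return
        # Assumption: the card links to "/<owner>/<model>"; the real markup may differ.
        model_id = (attributes.get("href") or "").strip("/")
        if model_id.count("/") == 1 and model_id not in self.model_ids:
            self.model_ids.append(model_id)


def extract_model_ids(html: str) -> List[str]:
    """Return the unique model IDs found in the page, in document order."""
    parser = ArtifactCardParser()
    parser.feed(html)
    return parser.model_ids
```

If the attribute name or link format changes on NVIDIA's side, a parser like this would silently return an empty list, which is why the layout dependency is worth monitoring.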

License

MIT
