GitHub - HeartWise-AI/Orion: Orion

Orion Video Training and Evaluation

Orion is a Python-based application designed for video training and evaluation tasks. It supports both regression and classification tasks and can be used with a variety of models.

Setup

Prerequisites

Python 3.10 or later installed
UV package manager (see installation below)
CUDA-compatible GPU (recommended for training)

Quick Start

Install UV (fast Python package manager):

# On macOS/Linux
curl -LsSf https://astral.sh/uv/install.sh | sh

# On Windows
powershell -c "irm https://astral.sh/uv/install.ps1 | iex"

# Or with pip
pip install uv

Clone the repository:

git clone https://github.com/HeartWise-AI/Orion.git
cd Orion

Set up virtual environment and install dependencies:

# Create virtual environment (uses Python 3.10 by default)
uv venv

# Activate virtual environment
source .venv/bin/activate  # On macOS/Linux
# OR
.venv\Scripts\activate  # On Windows

# Install dependencies
uv pip install -r requirements.txt

# For development (includes testing and linting tools)
uv pip install -r requirements-dev.txt

Configure PyTorch for your CUDA version (if using GPU):

# For CUDA 11.8
uv pip install torch torchvision --index-url https://download.pytorch.org/whl/cu118

# For CUDA 12.1
uv pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121

Set up Weights & Biases for experiment tracking:
```
wandb login
```

Alternative Setup (using pip directly)

If you prefer using pip without UV:

python -m venv orion-venv
source orion-venv/bin/activate  # On macOS/Linux
pip install --upgrade pip
pip install -r requirements.txt

Migration from Poetry

If you're migrating from an older version using Poetry, see MIGRATION_TO_UV.md for detailed instructions

Usage

You can use Orion for:

Training models with various configurations
Running hyperparameter sweeps with W&B
Running inference on trained models
Building custom notebooks using code samples in notebooks/

Configuration Files

See notebooks/config/ for different configuration examples:

config_x3d_debug_binary_classification.yaml - Mixed classification/regression with X3D
config_mvit_mvm_pretraining.yaml - Masked video modeling pretraining
sweep_config.yaml - Hyperparameter sweep configuration
inference_config_template.yaml - Template for inference runs

Multi-Head Support (Classification + Regression)

Orion supports training models with multiple output heads for different tasks simultaneously:

# Define output structure for each head
head_structure:
  y_true_cat: 1  # Binary classification head
  Value: 1       # Regression head (e.g., LVEF prediction)

# Specify task type for each head
head_task:
  y_true_cat: classification
  Value: regression

# Configure loss function for each head
loss_structure:
  y_true_cat: bce_logit  # Binary cross-entropy for classification
  Value: l1              # L1 loss for regression

# Set loss weights
head_weights:
  y_true_cat: 1.0
  Value: 1.0

Model-Specific Configurations

Different video models have specific requirements and recommendations for optimal performance. The table below summarizes the recommended frame sizes and learning rates for supported models:

Model	Recommended Frame Sizes	Recommended Learning Rate
x3d	Multiples of 8	1e-3 to 1e-4
swin3d	24 or 32	1e-4 to 1e-5
mvit	16	3e-5 (0.00003)

Additional Model-Specific Notes:

x3d:
- Supports various frame counts as long as they are multiples of 8.
- The x3d_m variant specifically supports video sizes of either 224x224 or 256x256.
swin3d:
- Only supports 24 or 32 frames.
- Ensure your configuration matches one of these frame counts.
mvit:
- Strictly requires 16 frames.
- Adjust your data preprocessing to match this requirement.

When configuring your model in the YAML file, ensure that the frames and resize parameters align with these recommendations. The lr (learning rate) parameter should be set within the recommended range for best results.

Training Models

Single GPU Training:

python orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

Distributed Training (multiple GPUs):

# Using 2 GPUs
torchrun --standalone --nnodes=1 --nproc-per-node=2 orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

# Using 4 GPUs
torchrun --standalone --nnodes=1 --nproc-per-node=4 orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

GPU Selection

You can specify which GPUs to use in multiple ways:

Using the --gpu flag (Recommended):

# Single GPU
torchrun --standalone --nnodes=1 --nproc-per-node=1 orion/utils/video_training_and_eval.py \
  --config_path=notebooks/config/config.yaml --gpu 0

# Multiple GPUs
torchrun --standalone --nnodes=1 --nproc-per-node=2 orion/utils/video_training_and_eval.py \
  --config_path=notebooks/config/config.yaml --gpu 0,1

# All 4 GPUs
torchrun --standalone --nnodes=1 --nproc-per-node=4 orion/utils/video_training_and_eval.py \
  --config_path=notebooks/config/config.yaml --gpu 0,1,2,3

Using CUDA_VISIBLE_DEVICES environment variable:

# Single GPU
CUDA_VISIBLE_DEVICES=1 torchrun --standalone --nnodes=1 --nproc-per-node=1 \
  orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

# Multiple GPUs
CUDA_VISIBLE_DEVICES=2,3 torchrun --standalone --nnodes=1 --nproc-per-node=2 \
  orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

Note: Make sure --nproc-per-node matches the number of GPUs you're using.

Using a Jupyter Notebook

Prepare your configuration file in YAML format.
In your Jupyter notebook, import necessary modules and set up environment variables.
Define a class for your command-line arguments.
Load your configuration, create transforms, initialize a wandb run, and run the main process.

Hyperparameter Sweeps with Weights & Biases

Orion supports multi-GPU sweeps without creating duplicate runs:

Configure your sweep in notebooks/config/sweep_config.yaml

Run sweep experiments:

# Single experiment with 1 GPU (default)
python run_sweep.py

# Run 5 experiments with 1 GPU
python run_sweep.py 5

# Run 10 experiments with 4 GPUs each
NGPUS=4 python run_sweep.py 10

# Run 20 experiments with 12 GPUs each
NGPUS=12 python run_sweep.py 20

The sweep system properly handles distributed training, ensuring only the primary process (rank 0) reports to W&B.

Inference and Evaluation

To run inference and evaluation, use the run_inference.py script.

python run_inference.py --config_path notebooks/config/config_x3d_multi_output_classification_eval.yaml --splits inference --model_path outputs/outputs_folder_id/best.pt

The other arguments are optional: the data path (usually defined in the config file), the output directory (usually defined in the config file), the wandb id & resume flag (both not needed for inference).

Note

This application uses distributed training, so it is designed to be run on multiple GPUs. If you are running it on a machine with a single GPU or CPU, you may need to modify the code accordingly.

Validation of Model Configurations

The build_model function in video_training_and_eval.py includes checks to ensure that the frame and resize parameters in your configuration file are compatible with the chosen model. If these parameters don't meet the model-specific requirements, the function will raise a ValueError with an appropriate error message.

For example:

For swin3d models, it checks if the frame count is either 24 or 32.
For x3d models, it verifies that the frame count is a multiple of 8.
For x3d_m specifically, it checks if the resize value is either 224 or 256.
For mvit models, it ensures the frame count is exactly 16.

These checks help prevent configuration errors and ensure that your model is set up correctly for training or inference.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
docs		docs
notebooks		notebooks
orion.egg-info		orion.egg-info
orion		orion
scripts		scripts
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
CLAUDE.md		CLAUDE.md
README.md		README.md
dockerfile		dockerfile
ext_validation.ipynb		ext_validation.ipynb
poetry.lock		poetry.lock
precommit-hook.yaml		precommit-hook.yaml
precompute_hog_features_gpu_improved.py		precompute_hog_features_gpu_improved.py
prepare_dataset_split.py		prepare_dataset_split.py
pyproject.toml		pyproject.toml
requirements-dev.txt		requirements-dev.txt
requirements.txt		requirements.txt
run_inference.py		run_inference.py
run_sweep.py		run_sweep.py
setup.sh		setup.sh
uv.toml		uv.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Orion Video Training and Evaluation

Setup

Prerequisites

Quick Start

Alternative Setup (using pip directly)

Migration from Poetry

Usage

Configuration Files

Multi-Head Support (Classification + Regression)

Model-Specific Configurations

Additional Model-Specific Notes:

Training Models

GPU Selection

Using a Jupyter Notebook

Hyperparameter Sweeps with Weights & Biases

Inference and Evaluation

Note

Validation of Model Configurations

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Orion Video Training and Evaluation

Setup

Prerequisites

Quick Start

Alternative Setup (using pip directly)

Migration from Poetry

Usage

Configuration Files

Multi-Head Support (Classification + Regression)

Model-Specific Configurations

Additional Model-Specific Notes:

Training Models

GPU Selection

Using a Jupyter Notebook

Hyperparameter Sweeps with Weights & Biases

Inference and Evaluation

Note

Validation of Model Configurations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages