Auto-Labeling & YOLO/OBB Dataset Pipeline

Automatically generate bounding box or oriented box labels for object detection datasets using Grounding DINO and SAM2.

Quick Start

Create a virtual environment

python3 -m venv .venv
source .venv/bin/activate

# 1. Install dependencies
pip install -r requirements.txt

# 2. Clone and install SAM2
git clone https://github.com/facebookresearch/segment-anything-2.git
cd segment-anything-2 && pip install -e . && cd ..

3. Auto-label your images for yolo

python auto_label.py \
    --input ./output \
    --output ./yolo_dataset \
    --prompts prompts.yaml \
    --bbox-format yolo \
    --sample-rate 10

or for yolo obb use

python auto_label.py \
    --input ./output \
    --output ./yolo_dataset_obb \
    --prompts prompts.yaml \
    --bbox-format obb \
    --sample-rate 10

Use --sample-rate N to process every Nth image (default: 10, use 1 for all images)

4. Train YOLOv8

yolo detect train data=yolo_dataset/dataset.yaml model=yolov8n.pt epochs=100 imgsz=640


---

## Extracting Frames from MCAP Files
```bash
python extract_data.py

By default, reads from ./data and writes to ./output. Override with environment variables:

DATA_ROOT=./my_data OUTPUT_ROOT=./my_output python extract_data.py

Expected Structure

Input:

data/
├── class_name_1/
│   └── recording.mcap
└── class_name_2/
    └── recording.mcap

Output:

output/
├── class_name_1/
│   └── rgb/
└── class_name_2/
    └── rgb/

The output is ready for auto_label.py --input ./output.

Project Files

File	Description
`extract_data.py`	Extract RGB/depth frames from MCAP files.
`auto_label.py`	Main auto-labeling script. Generates YOLO or OBB labels from images.
`prompts.yaml`	Text prompts corresponding to each class. Required for Grounding DINO.
`view_dataset.py`	Optional visualization tool for verifying labels.

Auto-Labeling Usage

python auto_label.py \
    --input ./output \
    --output ./yolo_dataset \
    --prompts prompts.yaml \
    --bbox-format yolo

Options

Option	Description
`--bbox-format yolo`	Axis-aligned YOLO boxes
`--bbox-format obb`	Rotated oriented bounding boxes
`--device cuda`	Use GPU if available (defaults to CPU)

Dataset Structure

After running auto_label.py:

yolo_dataset/
├── images/
│   ├── train/
│   └── val/
├── labels/
│   ├── train/
│   └── val/
└── dataset.yaml

Visualizing Generated Labels

python view_dataset.py --data ./yolo_dataset --prompts ./prompts.yaml

Zero-shot bounding box for obb seems to be around 80% accurate and 70% for yolo_obb. This is directly impacted by the quality of your prompt. Regardless, this GUI visualizer allows you to both view the labelled data from the auto labeler and fix the bounding boxes.

Training YOLO

A juypter notebook has been provided as an example which can be uploaded to google colab along with a zip of the dataset generated by auto_label.py. Otherwise the model can be trained locally with the following commands:

yolo detect train \
    data=yolo_dataset/dataset.yaml \
    model=yolov8n.pt \
    epochs=100 \
    imgsz=640

Model	Description
`yolov8n.pt`	Nano (fastest)
`yolov8s.pt`	Small
`yolov8m.pt`	Medium
`yolov8l.pt`	Large
`yolov8x.pt`	Extra Large (most accurate)

Notes

Ensure prompts.yaml matches all class names in your dataset
By default, every 10th image is processed; adjust files[::10] in auto_label.py if needed
GPU acceleration is highly recommended

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
assets		assets
models		models
.gitignore		.gitignore
License		License
README.md		README.md
TrainYoloModel.ipynb		TrainYoloModel.ipynb
auto_label.py		auto_label.py
clean_blank_labels.py		clean_blank_labels.py
extract_data.py		extract_data.py
prompts.yaml		prompts.yaml
requirements.txt		requirements.txt
view_dataset.py		view_dataset.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Auto-Labeling & YOLO/OBB Dataset Pipeline

Quick Start

Create a virtual environment

3. Auto-label your images for yolo

4. Train YOLOv8

Expected Structure

Project Files

Auto-Labeling Usage

Options

Dataset Structure

Visualizing Generated Labels

Training YOLO

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Auto-Labeling & YOLO/OBB Dataset Pipeline

Quick Start

Create a virtual environment

3. Auto-label your images for yolo

4. Train YOLOv8

Expected Structure

Project Files

Auto-Labeling Usage

Options

Dataset Structure

Visualizing Generated Labels

Training YOLO

Notes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages