AMPO

Adaptive Mix Preference Optimization for Generative Recommendation

Accepted by SIGIR 2026 (Full Paper)

Overview

AMPO introduces an adaptive margin mechanism for pairwise preference optimization. Rather than applying a uniform margin to every preference pair, it calibrates optimization strength according to model confidence, making training more stable under heterogeneous recommendation signals and varying pair difficulty.

The repository is organized for direct experimentation, with a lightweight training entry and a compact implementation path for extending optimization objectives in practical recommendation settings.

Setup

git clone https://github.com/jumbo-q/ampo.git
cd ampo
pip install -r requirements.txt

Quick Start

Modify Configuration: Edit configs/default.yaml and fill in your <model_path> and <data_path>.

model_args:
  model_name_or_path: "<your_model_path>"
data:
  train_files:
    - "<your_train_data_path>"

Launch Training: Run the provided shell script. It supports both single-GPU and multi-GPU training via DeepSpeed.

# Multi-GPU training (default: 8 GPUs)
bash scripts/train.sh

# Single-GPU training
NUM_GPUS=1 bash scripts/train.sh

Note: scripts/train.sh uses DeepSpeed for distribution by default. You can override the GPU count and config file path using NUM_GPUS and CONFIG_FILE environment variables.

Data Format

AMPO expects pairwise preference data with the following logical fields:

Column	Description
`prompt`	input context or user history
`chosen`	preferred response / item
`rejected`	non-preferred response / item

Example:

{
  "prompt": "User history ...",
  "chosen": "Preferred item",
  "rejected": "Rejected item"
}

Training Entry

The default workflow is intentionally minimal:

implement the core optimization logic in src/ampo/trainer.py
define training and evaluation flow in main_ampo.py

Customization

AMPO supports custom loss extension while preserving the standard tokenization, collation, logging, and optimization pipeline.

Citation

To be released

License

Apache License 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
assets		assets
configs		configs
scripts		scripts
src/ampo		src/ampo
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main_ampo.py		main_ampo.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
run_ampo.py		run_ampo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AMPO

Adaptive Mix Preference Optimization for Generative Recommendation

Overview

Setup

Quick Start

Data Format

Training Entry

Customization

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AMPO

Adaptive Mix Preference Optimization for Generative Recommendation

Overview

Setup

Quick Start

Data Format

Training Entry

Customization

Citation

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages