Skip to content

ConstFr/easy-prm

Repository files navigation

Training pipeline from VersaPRM

Environment setup

To setup the environment, run

pip install -r requirements.txt

Our training script uses the Trainer class from 🤗 Transformers.

Training Data

Please download data from this google drive link. There are two json files (one for PRM800K data and the other for MMLU-Pro-Train) formated in the way our data loading function expects. Put them in the \data directory.

Running training

To do training of VersaPRM using our default configuration (we used 4 Nvidia A6000 GPUs), run the following command:

./run_training.sh

We recommend using Deepspeed for data parallel training of model (which you can setup with the command accelerate config).

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors