Hyperparameters?

#23

by ekurtulus - opened Dec 13, 2023

Dec 13, 2023

What is the dataset size and PPO hyperparameters?

Berkeley-Nest org Dec 22, 2023

The dataset is here: https://huggingface.co./datasets/berkeley-nest/Nectar with 183K prompts and 7 responses each. PPO hyperparameters are similar to the trlx repo here: https://github.com/CarperAI/trlx, except that we changed the learning rate to 1e-7. We'll open source the paper and code base soon!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment