7b tulu 2.5 - a hamishivi Collection

7b tulu 2.5

updated Jun 25

A small PPO training run at 7B scale, following the "Unpacking DPO and PPO" paper.