MPT_1000_STEPS_1e7_rate_03_beta_DPO / model-00001-of-00003.safetensors

Commit History

End of training
17d7c13
verified

tsavage68 commited on