Llama-3.2-3B-Instruct-online-dpo-alfworld-iter2 / pytorch_model-00002-of-00002.bin

Commit History

upload checkpoint
5b84a80
verified

sc2582 commited on