zephyr-smol_llama-100m-dpo-full / trainer_state.json

Commit History