tanliboy
/

llama-3.2-3b-dpo-2

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

llama-3.2-3b-dpo-2

Commit History

End of training

26bbc31
verified

tanliboy commited on Oct 1, 2024

Model save

daad045
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 1722

e899843
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 1500

3716009
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 1200

b44a911
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 900

7d3ae09
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 600

f27f489
verified

tanliboy commited on Oct 1, 2024

Training in progress, step 300

2b6a542
verified

tanliboy commited on Oct 1, 2024

initial commit

45eb830
verified

tanliboy commited on Oct 1, 2024