StarChat2 15B - a HuggingFaceH4 Collection

HuggingFaceH4 's Collections

Scaling Test-Time Compute with Open Models

Zephyr 7B Gemma

Papers We've Read

Awesome SFT datasets

Awesome feedback datasets

Awesome reward models

StarChat2 15B

updated Apr 12, 2024

Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook

Paused

135

135

StarChat2 Demo

🌟
HuggingFaceH4/starchat2-15b-v0.1

Text Generation • Updated Mar 13, 2024 • 3.88k • • 111
HuggingFaceH4/starchat2-15b-sft-v0.1

Text Generation • Updated Mar 12, 2024 • 39 • 5

Note The SFT model that was used for alignment with DPO
jondurbin/airoboros-3.2

Viewer • Updated Jan 2, 2024 • 58.7k • 70 • 45

Note Part of the SFT mix
abacusai/SystemChat

Viewer • Updated Mar 4, 2024 • 7.02k • 113 • 131

Note Part of the SFT mix
microsoft/orca-math-word-problems-200k

Viewer • Updated Mar 4, 2024 • 200k • 2.14k • 443

Note Part of the SFT mix
m-a-p/Code-Feedback

Viewer • Updated Feb 26, 2024 • 66.4k • 342 • 206

Note Part of the SFT mix
LDJnr/Capybara

Viewer • Updated Jun 7, 2024 • 16k • 461 • 237

Note Part of the SFT mix
HuggingFaceH4/ultrafeedback_binarized

Viewer • Updated Oct 16, 2024 • 187k • 5.47k • 275

Note Part of the DPO mix
Intel/orca_dpo_pairs

Viewer • Updated Nov 29, 2023 • 12.9k • 1.55k • 298

Note Part of the DPO mix