hamishivi's Collections
7b tulu 2.5
Updated Jun 25
A small run at 7B scale with PPO, following the "Unpacking DPO and PPO" paper.
hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm • Text Generation • Updated Jun 25 • 15
hamishivi/tulu-v2.5-7b-uf-mean-7b-uf-rm-value • Token Classification • Updated Jun 25 • 7
hamishivi/tulu-v2.5-7b-uf-rm • Text Classification • Updated Jun 25 • 4