Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
vwxyzjn
's Collections
lm-human-preference-details
TL;DR summarization checkpoints
RLOO / PPOv2 TL;DR summarize checkpoints
RLOO / PPOv2 TL;DR summarize checkpoints
updated
Jun 11
Upvote
1
vwxyzjn/ppo_tldr
Text Generation
•
Updated
May 24
•
19
vwxyzjn/ppo_tldr_6.9b
Text Generation
•
Updated
Jun 7
•
10
vwxyzjn/rloo_tldr
Text Generation
•
Updated
Jun 11
•
22
vwxyzjn/rloo_tldr_6.9b
Text Generation
•
Updated
Jun 7
•
20
Upvote
1
Share collection
View history
Collection guide
Browse collections