Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lblaoke
's Collections
PPO
RM
PPO
updated
8 days ago
Upvote
-
lblaoke/llama2-7b-ppo-human
Updated
Feb 3
•
3
lblaoke/llama2-7b-ppo-self
Updated
Feb 3
•
3
lblaoke/llama2-7b-ppo-self-human
Updated
Feb 3
•
3
lblaoke/mistral-v0.1-7b-ppo-human
Updated
Feb 4
•
12
lblaoke/mistral-v0.1-7b-ppo-self
Updated
Feb 4
•
2
lblaoke/mistral-v0.1-7b-ppo-self-human
Updated
Feb 4
•
2
lblaoke/llama-3.1-8b-ppo-human
Updated
16 days ago
•
7
lblaoke/llama-3.1-8b-ppo-self
Updated
15 days ago
•
5
lblaoke/llama-3.1-8b-ppo-self-human
Updated
13 days ago
•
7
lblaoke/qwen2.5-7b-ppo-human
Updated
11 days ago
•
6
lblaoke/qwen2.5-7b-ppo-self-human
Updated
10 days ago
•
11
lblaoke/qwen2.5-7b-ppo-self
Updated
10 days ago
•
7
lblaoke/mistral-v0.3-7b-ppo-human
Updated
10 days ago
•
8
lblaoke/mistral-v0.3-7b-ppo-self
Updated
9 days ago
•
11
lblaoke/mistral-v0.3-7b-ppo-self-human
Updated
8 days ago
•
8
Upvote
-
Share collection
View history
Collection guide
Browse collections