Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
5
18
Hao Sun
Holarissun
Follow
Ray2333's profile picture
1 follower
·
1 following
https://holarissun.github.io/
HolarisSun
holarissun
AI & ML interests
[email protected]
. Deep RL, RL x LLM, RLHF.
Organizations
None yet
Holarissun
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
5 months ago
weqweasdas/RM-Mistral-7B
Text Classification
•
Updated
Mar 31
•
207
•
22
liked
2 models
7 months ago
Holarissun/RM-TLDR_human_loraR64_20000_gemma2b_lr5e-05_bs2_g4
Updated
May 3
•
2
•
1
EleutherAI/pythia-2.8b
Text Generation
•
Updated
Jun 9, 2023
•
21k
•
28
liked
a dataset
7 months ago
openbmb/UltraFeedback
Viewer
•
Updated
Dec 29, 2023
•
64k
•
1.64k
•
336
liked
4 models
7 months ago
Ray2333/gpt2-large-helpful-reward_model
Text Classification
•
Updated
Jun 2
•
1.52k
•
8
openai-community/gpt2-xl
Text Generation
•
Updated
Feb 19
•
244k
•
•
310
mistralai/Mistral-7B-v0.1
Text Generation
•
Updated
Jul 24
•
397k
•
•
3.44k
openlm-research/open_llama_3b_v2
Text Generation
•
Updated
Jul 16, 2023
•
48.7k
•
147
liked
a dataset
8 months ago
ZHLiu627/ultrafeedback_binarized_with_response_full_part1
Viewer
•
Updated
Mar 8
•
20k
•
39
•
1
liked
a model
8 months ago
Holarissun/trl_rm_tldr_gptj
Updated
Mar 25
•
16
•
1
liked
a dataset
8 months ago
Dahoas/full-hh-rlhf
Viewer
•
Updated
Feb 23, 2023
•
125k
•
1.3k
•
72
liked
3 models
8 months ago
weqweasdas/RM-Gemma-2B
Text Classification
•
Updated
Mar 22
•
149
•
16
weqweasdas/RM-Gemma-7B
Text Classification
•
Updated
Mar 22
•
59
•
8
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25
•
468
•
16
liked
a dataset
10 months ago
CarperAI/openai_summarize_comparisons
Viewer
•
Updated
Feb 27, 2023
•
260k
•
1.72k
•
39
liked
2 models
10 months ago
facebook/contriever
Updated
Jan 19, 2022
•
808k
•
58
EleutherAI/gpt-neo-1.3B
Text Generation
•
Updated
Jan 31
•
165k
•
264
liked
a dataset
11 months ago
berkeley-nest/Nectar
Viewer
•
Updated
Mar 20
•
183k
•
594
•
277