DongfuJiang/PairRM-V2-phi3-3-mini-ultra-feedback-binarized-lora
PEFT
Safetensors
phi3
llama-factory
lora
full
Generated from Trainer
custom_code
License: mit
Commit History (branch: main)
End of training
57177f9 · verified · DongfuJiang committed on Jul 26

Model save
8593ae3 · verified · DongfuJiang committed on Jul 26

Training in progress, step 780
a033853 · verified · DongfuJiang committed on Jul 26

Training in progress, step 400
b8e7374 · verified · DongfuJiang committed on Jul 26

initial commit
e096a5f · verified · DongfuJiang committed on Jul 25