Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
rinna
/
bilingual-gpt-neox-4b-instruction-ppo
like
15
Follow
rinna Co., Ltd.
89
Text Generation
Transformers
PyTorch
Safetensors
Anthropic/hh-rlhf
Japanese
English
gpt_neox
text-generation-inference
arxiv:
2203.02155
arxiv:
1707.06347
arxiv:
2404.01657
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
refs/pr/1
bilingual-gpt-neox-4b-instruction-ppo
/
README.md
Commit History
Update README.md
f1e3d20
tianyuz
commited on
Aug 2, 2023
update
86b9caf
tianyuz
commited on
Aug 2, 2023
initial commit
a365fbe
tianyuz
commited on
Aug 2, 2023