Ablustrund
commited on
Commit
β’
acc2366
1
Parent(s):
3e3e15a
Update README.md
Browse files
README.md
CHANGED
@@ -14,9 +14,6 @@ tags:
|
|
14 |
### *MOSS-RLHF & "Secrets of RLHF in Large Language Models Part I: PPO" <br>π <a href="https://arxiv.org/abs/2307.04964" target="_blank">[Technical report]</a> <a href="https://openlmlab.github.io/MOSS-RLHF/" target="_blank">[Home page]*
|
15 |
|
16 |
|
17 |
-
<p align="center" width="100%">
|
18 |
-
<a href="https://arxiv.org/abs/2307.04964" target="_blank"><img src="./assets/img/moss.png" alt="MOSS" style="width: 50%; min-width: 300px; display: block; margin: auto;"></a>
|
19 |
-
|
20 |
## π News
|
21 |
### π Wed, 12. July 2023. We have released Chinese reward model based OpenChineseLlama-7B!
|
22 |
[moss-rlhf-reward-model-7B-zh](https://huggingface.co/Ablustrund/moss-rlhf-reward-model-7B-zh/tree/main)
|
|
|
14 |
### *MOSS-RLHF & "Secrets of RLHF in Large Language Models Part I: PPO" <br>π <a href="https://arxiv.org/abs/2307.04964" target="_blank">[Technical report]</a> <a href="https://openlmlab.github.io/MOSS-RLHF/" target="_blank">[Home page]*
|
15 |
|
16 |
|
|
|
|
|
|
|
17 |
## π News
|
18 |
### π Wed, 12. July 2023. We have released Chinese reward model based OpenChineseLlama-7B!
|
19 |
[moss-rlhf-reward-model-7B-zh](https://huggingface.co/Ablustrund/moss-rlhf-reward-model-7B-zh/tree/main)
|