Kquant03 commited on
Commit
0d95ff5
·
verified ·
1 Parent(s): 02c473c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ tags:
16
 
17
  [Join our Discord!](https://discord.gg/CAfWPV82)
18
 
19
- [Buttercup](https://huggingface.co/Kquant03/Buttercup-4x7B-bf16), for a long time, was my best model (made by kquant)...but I genuinely think this improves upon reasoning and logic while retaining the RP value. I do think it has some pretty heavy GPT-4 data in it, though.
20
 
21
  The config looks like this...(detailed version is in the files and versions):
22
  - [jan-hq/supermario-v2](https://huggingface.co/jan-hq/supermario-v2) - base
 
16
 
17
  [Join our Discord!](https://discord.gg/CAfWPV82)
18
 
19
+ [Buttercup](https://huggingface.co/Kquant03/Buttercup-4x7B-bf16), for a long time, was my best model (made by kquant)...but I genuinely think this improves upon reasoning and logic while retaining the RP value. The hyphens are telling that it has some pretty heavy GPT-4 data in it, though. I mean the whole point is to eventually outperform GPT-4 so of course it's probably best to pretrain with GPT-4 data, then fine tune off of it and then DPO the resulting fine tune...perhaps running another epoch of the previous fine-tune, afterwards.
20
 
21
  The config looks like this...(detailed version is in the files and versions):
22
  - [jan-hq/supermario-v2](https://huggingface.co/jan-hq/supermario-v2) - base