Are the Q4 and Q5 models R1 or R1-Zero
Can someone verify whether the Q4 and Q5 quants are R1-Zero or just R1? The other quants are labeled just R1, which is what I am looking for.
EDIT: FIXED
The uploaded versions are officially R1, which is correct, not R1-Zero.
Yeah, interesting 🤔 I would suspect that all of these are R1 and the names are just typos. Though I'd say let's wait for @Unsloth to confirm this so we know for sure.
Any clarity yet?
I guess in the meantime, one could try running those quants and see if the <think> tokens are generated.
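For anyone who wants to try that, here's a rough sketch assuming llama-cpp-python is installed; the model path is hypothetical, so point it at whichever quant you actually downloaded:

```python
# Rough sketch, not verified against these exact quants: load the GGUF with
# llama-cpp-python and check whether the model opens its reply with <think>.
# The model path below is hypothetical - use your actual (possibly split) file.
from llama_cpp import Llama

llm = Llama(model_path="DeepSeek-R1-Q5_K_M-00001-of-00011.gguf", n_ctx=2048)
out = llm("What is 17 * 23? Reason it out.", max_tokens=256)
text = out["choices"][0]["text"]

print(text[:500])
print("<think> tag emitted:", "<think>" in text)
```

One caveat: R1-Zero was also trained with <think> tags, so this alone may not settle it.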
We're uploading the new ones, will ping you guys once it's done.
Yes please let us know how it goes! :)
Q5 generates the <think> token and is very good at reasoning. But doesn't the Zero version also generate this token?
We will reupload them today and update you guys
Yeah, I had the same question. Will await confirmation before downloading half a TB of data! The Zero model is fascinating in how it was made, but I definitely want the normal one.
I am running the Q8_0 and it is R1, not Zero, for better or for worse.
Yeah, our internal tests and validation from around 10 other people confirm it's R1, but we're still going to reupload them just in case.
In any case, the first bytes of the file contain the model name: it says "DeepSeek R1", while in the real Zero version it says "DeepSeek R1 Zero".
So it looks like it's R1.
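If you want to check your own download the same way, here's a minimal sketch of that byte-level search (the filename is hypothetical):

```python
# Minimal sketch: GGUF stores its metadata (including general.name) near the
# start of the file, so a plain byte search over the first few MB is enough
# to distinguish "DeepSeek R1" from "DeepSeek R1 Zero". Filename is hypothetical.
with open("DeepSeek-R1-Q4_K_M-00001-of-00009.gguf", "rb") as f:
    head = f.read(4 * 1024 * 1024)

if b"R1 Zero" in head or b"R1-Zero" in head:
    print("Header says R1-Zero")
elif b"DeepSeek R1" in head or b"DeepSeek-R1" in head:
    print("Header says R1")
else:
    print("Model name not found in the scanned bytes")
```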
We're uploading it now - should be up in like 8 hrs, but will let y'all know.
So is the Q4 model right here R1 or R1-Zero? Asking because I have already downloaded it.
The Q4 files were deleted; maybe they'll reupload.
@gng2info @fsaudm @wuaoscotty123 @frz1 @vmajor @ozzeruk82 @ooj
Hey guys, apologies for the delay, but we've reuploaded them so they're correct now.
Also, we're going to release 1-bit dynamic quant versions very soon, meaning the accuracy will be very good for a 1.5-bit GGUF quant version of R1, and it will be great to use day-to-day. I'll update you guys once that's ready - we'll most likely have a blogpost for it too.
So sorry, I re-uploaded them all - on further inspection the old files were correct, I just screwed up the names - but it's best to download the new versions I uploaded.
I made 4 further uploads with dynamic quantization (better accuracy than normal quants). All dynamic quants leave all layers in a mixture of 4-bit and 6-bit (i.e. attention is fully left at 4/6 bits), except the MoE layers, which are quantized further down.
DeepSeek R1 has 3 non-MoE layers, and these are left fully at 4/6-bit as well.
| MoE Bits | Type | Disk Size | Accuracy | Link | Details |
| --- | --- | --- | --- | --- | --- |
| 1.58-bit | IQ1_S | 131GB | Fair | Link | MoE all 1.56-bit; down_proj in MoE a mixture of 2.06/1.56-bit |
| 1.73-bit | IQ1_M | 158GB | Good | Link | MoE all 1.56-bit; down_proj in MoE left at 2.06-bit |
| 2.22-bit | IQ2_XXS | 183GB | Better | Link | MoE all 2.06-bit; down_proj in MoE a mixture of 2.5/2.06-bit |
| 2.51-bit | Q2_K_XL | 212GB | Best | Link | MoE all 2.5-bit; down_proj in MoE a mixture of 3.5/2.5-bit |
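As a rough sanity check on those disk sizes (back-of-envelope only: it assumes ~671B total parameters, and the bit figures describe the MoE layers rather than the whole-file average, so the estimates land near, not exactly on, the listed numbers):

```python
# Back-of-envelope: GGUF size is roughly params * bits-per-weight / 8 bytes.
# 671e9 is DeepSeek R1's approximate total parameter count; bit widths are
# taken from the "MoE Bits" column above, so expect a few GB of slack.
PARAMS = 671e9

for name, bpw, listed_gb in [
    ("IQ1_S", 1.58, 131),
    ("IQ1_M", 1.73, 158),
    ("IQ2_XXS", 2.22, 183),
    ("Q2_K_XL", 2.51, 212),
]:
    est_gb = PARAMS * bpw / 8 / 1e9
    print(f"{name}: ~{est_gb:.0f} GB estimated vs {listed_gb} GB listed")
```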
@gng2info @fsaudm @wuaoscotty123 @frz1 @vmajor @ozzeruk82 @ooj Apologies for the ping again, but the blogpost for the dynamic 1.58-bit quants is out. Would be incredible if you guys could test it and share any results. 🤗
Blog: https://unsloth.ai/blog/deepseekr1-dynamic
Tweet: https://x.com/UnslothAI/status/1883899061893546254