do full r1

by drmcbride - opened 7 days ago

Discussion

drmcbride

7 days ago

that would be chad

Orion-zhen

Owner 7 days ago

Bro, that would be crazy... I have no idea how to get sufficient hardware🥲

drmcbride

7 days ago

what happens if you set scale factor to something ridiculous like 10 on your github repo to make these?

Orion-zhen

Owner 7 days ago

Well, the model will be fucked up, just infinitely generating rubbish. Scale factor >= 1.5 is the dangerous zone, and >= 2 will damage the model at serious risk.

BTW, I tried abliterating deepseek-r1-distill-qwen-7b, but my code didn't work. Refusal_dir was an array of nan. I have no idea what was wrong with my code 😶

Orion-zhen

Owner 7 days ago

One more thing, I am now on vacation, so my server in my office was shutdown. Considering my modest deposit, I can't rent a cloud GPU to figure out what exactly happened in abliteration process.

Orion-zhen

Owner 3 days ago

Well, the model will be fucked up, just infinitely generating rubbish. Scale factor >= 1.5 is the dangerous zone, and >= 2 will damage the model at serious risk.

BTW, I tried abliterating deepseek-r1-distill-qwen-7b, but my code didn't work. Refusal_dir was an array of nan. I have no idea what was wrong with my code 😶

Tensor nan was fixed here

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment