why not release the 7b model?
I would like to have a smaller one too. I am using phi-3.5-mini-instruct with success, though I'd like some upgrade that can run on my 16 GB RAM and 4 GB GPU.
try the Unsloth 4-bit dynamic quant; it gets nearly the same performance as 16-bit and fits in under 15 GB
Thank you, but how would I use Unsloth? Which command do I run? I know how to use the llama-quantize command, but please help me with the specific one here.
you can get the already-quantized model here: https://huggingface.co./unsloth/phi-4-unsloth-bnb-4bit
it also links a Colab notebook you can use for inference and finetuning; I assume all you have to do is change the model it's loading and lower the batch size.
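if a concrete starting point helps, here's a minimal sketch of loading that checkpoint with Unsloth in Python and generating a reply. It assumes you have `unsloth` installed and a CUDA GPU; the `max_seq_length`, prompt, and token count are just placeholder values, not taken from the linked notebook.

```python
# Minimal sketch: load the pre-quantized 4-bit phi-4 checkpoint with Unsloth
# and run a single generation. Assumes `unsloth` is installed and a CUDA GPU
# is available; max_seq_length, the prompt, and max_new_tokens are placeholders.
from unsloth import FastLanguageModel

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/phi-4-unsloth-bnb-4bit",  # already-quantized checkpoint
    max_seq_length=2048,
    load_in_4bit=True,
)
FastLanguageModel.for_inference(model)  # switch to Unsloth's faster inference mode

inputs = tokenizer("Explain dynamic 4-bit quantization in one sentence.",
                   return_tensors="pt").to("cuda")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

for finetuning you'd follow the linked notebook instead, since it sets up the LoRA adapters and training arguments for you; the sketch above only covers plain inference.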