Wasn't this model supposed to take around 3-4 GB given it's only 1B parameters?

#21
by ahsannawazch - opened

I tried to load this model in Colab (free tier), and it started downloading a pytorch_model.bin file that is 28.9 GB.
Where am I going wrong here?

I also wonder about this.

I am now using Qwen2-VL 2B; it's working okay for now.

This is because MolmoE is actually a 7B-parameter model; it only activates about 1B parameters during inference.
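The download size is consistent with storing all ~7B parameters in float32 (4 bytes each). A quick sanity check (the exact parameter count is an assumption inferred from the file size, so the numbers are approximate):

```python
# Rough sanity check: a ~7.2B-parameter checkpoint stored as float32
# (4 bytes per parameter) lands near the observed download size.
# 7.2e9 is an assumed count, not an official figure.
total_params = 7.2e9
bytes_per_param = 4  # float32
size_gb = total_params * bytes_per_param / 1e9
print(f"{size_gb:.1f} GB")  # ~28.8 GB, close to the 28.9 GB file
```

Loading in float16/bfloat16 would roughly halve the memory footprint, but the full checkpoint still has to be downloaded.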

Look up Mixture-of-Experts (MoE) for details.
