GPTQ quantized stablelm-3b-4e1t

Branch Bits GS Act Order Damp % GPTQ Dataset Seq Len Size ExLlama Desc
main 8 None No 0.01 c4 4096 -- No 8-bit, without Act Order and no grouop size.
Downloads last month
19
Safetensors
Model size
895M params
Tensor type
I32
·
FP16
·
Inference Examples
Inference API (serverless) does not yet support model repos that contain custom code.