Michael Goin PRO
mgoin
·
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
new activity
4 days ago
neuralmagic/Qwen2.5-VL-72B-Instruct-quantized.w8a8:Remove image_processor_type
updated
a model
4 days ago
nm-testing/QwQ-32B-FP8-dynamic
updated
a model
4 days ago
nm-testing/Ministral-8B-Instruct-2410-FP8-dynamic
Organizations
mgoin's activity
Remove image_processor_type
#1 opened 4 days ago
by
pooya-davoodi-parasail
Remove image_processor_type
1
#1 opened 4 days ago
by
pooya-davoodi-parasail
Remove image_processor_type
#2 opened 13 days ago
by
pooya-davoodi-parasail
Use Qwen2VLImageProcessor for image_processor_type
5
#2 opened 16 days ago
by
pooya-davoodi-parasail
Use Qwen2VLImageProcessor for image_processor_type
#3 opened 14 days ago
by
pooya-davoodi-parasail
when i use vllm v0.7.2 to deploy r1 awq, i got empty content
13
#10 opened 25 days ago
by
bupalinyu
MLA is not supported with moe_wna16 quantization. Disabling MLA.
5
#7 opened 26 days ago
by
AMOSE
AttributeError: 'Gemma2Config' object has no attribute 'interleaved_sliding_window' Traceback (most recent call last):
2
#3 opened about 1 month ago
by
samos123
compressed-tensors MLA support requires fp8 activations and weights in group 'group_0',
2
#1 opened about 1 month ago
by
samos123
How to load this model?
2
#1 opened 8 months ago
by
Frz614
Model does not run with VLLM
2
#3 opened 3 months ago
by
aswad546
Nice model, any info on scripts used to quantize?
1
#1 opened 3 months ago
by
RonanMcGovern

Add config_format and load_format to vLLM args
#5 opened 4 months ago
by
mgoin

Update config.json to use null for sliding_window
#4 opened 4 months ago
by
mgoin

Adding `safetensors` variant of this model
#1 opened 4 months ago
by
SFconvertbot

Is this the standard GPTQ quantization?
1
#5 opened 4 months ago
by
molereddy
Model weights are not loaded
4
#3 opened 6 months ago
by
MarvelousMouse

Update model card
#1 opened 4 months ago
by
nm-research
Add chat_template to tokenizer_config.json
#1 opened 4 months ago
by
nm-research