Yes, exactly. When converting from float16 to float32 for fine-tuning (as I thought), the 10-bit mantissa is padded with 13 zero bits and the 5-bit exponent is widened to 8 bits and rebiased, rather than simply appending 16 zero bits at the end.
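A minimal sketch with NumPy (the value 0.1 is just an arbitrary example, not anything from this model) showing what happens to the bits when a float16 weight is widened to float32:

```python
import numpy as np

# Any normal float16 value behaves the same way; 0.1 is just an example.
w16 = np.float16(0.1)
w32 = np.float32(w16)          # same value, stored in 32 bits

# Raw bit patterns of the two representations.
bits16 = np.frombuffer(w16.tobytes(), dtype=np.uint16)[0]
bits32 = np.frombuffer(w32.tobytes(), dtype=np.uint32)[0]

print(f"float16: {bits16:016b}")   # 1 sign | 5 exponent | 10 mantissa bits
print(f"float32: {bits32:032b}")   # 1 sign | 8 exponent | 23 mantissa bits

# The 10 mantissa bits gain 13 trailing zeros; the exponent is rebiased
# (bias 15 -> 127) into 8 bits, so it is not simply zero-padded.
mantissa16 = int(bits16) & 0x3FF
mantissa32 = int(bits32) & 0x7FFFFF
assert mantissa32 == mantissa16 << 13   # the low 13 bits are all zero
```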
Ok I get your point now.
I don't understand much about this, but maybe the model in F32 is just redundant. Maybe the extra half of most weights is simply filled with zeros. Maybe it was saved this way to fine-tune it, or to make it impossible for people with few resources to run it.
32 and 16 refer to the amount of memory each weight takes (32 or 16 bits), not to a number of weights. You can look up floating-point 32 and 16 (float32/float16) in computer science to better grasp what they mean.
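For a rough sense of scale, a small sketch (the 14B parameter count below is an assumed example, not a figure from this thread):

```python
# Rough memory footprint of raw model weights; the parameter count
# here is a hypothetical example.
num_params = 14_000_000_000

bytes_per_weight = {"float32": 4, "float16": 2}

for dtype, nbytes in bytes_per_weight.items():
    gib = num_params * nbytes / 1024**3
    print(f"{dtype}: ~{gib:.0f} GiB for the weights alone")

# float32: ~52 GiB, float16: ~26 GiB -- the same number of weights,
# each one just stored in half as many bits.
```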