casperhansen PRO
casperhansen
AI & ML interests
Creator of AutoAWQ
Recent Activity
updated a model 29 days ago
casperhansen/deepseek-r1-distill-qwen-1.5b-awq
updated a model 29 days ago
casperhansen/deepseek-r1-distill-qwen-7b-awq
casperhansen's activity
The inference performance of the DeepSeek-R1-AWQ model is weak compared to the DeepSeek-R1 model
8
#3 opened about 1 month ago by qingqingz916
Generation configs: Unquantised vs AWQ, model weights format
3
#1 opened about 2 months ago by nfunctor
Librarian Bot: Add language metadata for dataset
#2 opened 3 months ago by librarian-bot
[bot] Conversion to Parquet
#1 opened 3 months ago by parquet-converter
Create README.md
#3 opened 6 months ago by cfahlgren1
Original model perplexity
1
#1 opened 8 months ago by jacobpfau
update eos token
#6 opened 10 months ago by Praful932
encountered error when loading model
7
#4 opened 11 months ago by zhouzr
How did you create AWQ-quantized weights?
4
#5 opened 11 months ago by nightdude
Infinity conversation, should eos be <|eot_id|>?
4
#1 opened 11 months ago by Komposter43
Update generation_config.json
#3 opened 11 months ago by alugowski
Create quant_config.json
1
#1 opened 11 months ago by Suparious
Update generation_config.json
#2 opened 11 months ago by hmellor
always getting 0 in output
15
#3 opened about 1 year ago by xubuild
Not supporting with TGI
1
#4 opened about 1 year ago by abhishek3jangid
OC is not a multiple of cta_N = 64
2
#5 opened about 1 year ago by lazyDataScientist
TGI - response is an empty string
2
#6 opened about 1 year ago by p-christ
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0
2
#7 opened about 1 year ago by aaganaie
Add model_type to config.json
1
#2 opened about 1 year ago by casperhansen
Is there a way to finetune these AWQ beasts?
4
#1 opened over 1 year ago by yasmolin