Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1211
11
Tom Jobbins
PRO
TheBloke
Follow
0xthierry's profile picture
UncleSamuel33's profile picture
roselyp's profile picture
22493 followers
·
16 following
TheBlokeAI
TheBloke
AI & ML interests
LLM: quantisation, fine tuning
Articles
Making LLMs lighter with AutoGPTQ and transformers
Aug 23, 2023
•
37
Organizations
TheBloke
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
TheBloke/Llama-2-70B-GGUF
11 months ago
fix for join commands
1
#2 opened 11 months ago by
orbiter
New activity in
TheBloke/CodeLlama-70B-hf-AWQ
11 months ago
not codelama
3
#1 opened 11 months ago by
luckiskind
New activity in
TheBloke/MegaDolphin-120b-GPTQ
12 months ago
Missing files
2
#1 opened 12 months ago by
Danne980
New activity in
TheBloke/Fennec-Mixtral-8x7B-GGUF
about 1 year ago
Is this the same model with orangetin/OpenHermes-Mixtral-8x7B ?
2
#2 opened about 1 year ago by
bingw5
New activity in
Yhyu13/LMCocktail-10.7B-v1
about 1 year ago
Quant pls
4
#1 opened about 1 year ago by
Yhyu13
New activity in
VAGOsolutions/SauerkrautLM-SOLAR-Instruct
about 1 year ago
Quants uploading now
1
#4 opened about 1 year ago by
TheBloke
New activity in
ddh0/OrcaMaidXL-17B-32k
about 1 year ago
Add YaRN modeling code
#1 opened about 1 year ago by
TheBloke
New activity in
TheBloke/openchat-3.5-1210-AWQ
about 1 year ago
Update special_tokens_map.json
#1 opened about 1 year ago by
alpayariyak
New activity in
TheBloke/openchat-3.5-1210-GPTQ
about 1 year ago
Update special_tokens_map.json
#1 opened about 1 year ago by
alpayariyak
New activity in
TheBloke/Llama-2-13B-Chat-Dutch-AWQ
about 1 year ago
Update to new format and include chat template with default system message
#1 opened about 1 year ago by
BramVanroy
New activity in
TheBloke/Llama-2-13B-Chat-Dutch-GPTQ
about 1 year ago
Update to new format and include chat template with default system message
#1 opened about 1 year ago by
BramVanroy
New activity in
ddh0/Norocetacean-20b-10k
about 1 year ago
Update configuration_llama.py, required to get model to Load as `rope_scaling` needs to be None, or else a dictionary
#1 opened about 1 year ago by
TheBloke
New activity in
TheBloke/Rogue-Rose-103b-v0.2-GPTQ
about 1 year ago
Missing Model
1
#1 opened about 1 year ago by
Razzor9000
New activity in
Undi95/Mixtral-8x7B-MoE-RP-Story
about 1 year ago
Model config.json has Mistral params instead of Mixtral, breaking ExLlama quants and maybe affecting others too
#3 opened about 1 year ago by
TheBloke
New activity in
TheBloke/SOLAR-10.7B-Instruct-v1.0-GGUF
about 1 year ago
vocabulary maybe wrong
3
#1 opened about 1 year ago by
limoncc
New activity in
TheBloke/openchat-3.5-1210-GGUF
about 1 year ago
Sorta Broken
6
#1 opened about 1 year ago by
dillfrescott
New activity in
TheBloke/Mixtral-8x7B-v0.1-GPTQ
about 1 year ago
RuntimeError: shape '[32, 8]' is invalid for input of size 0
7
#5 opened about 1 year ago by
woldeM
New activity in
mattshumer/mistral-8x7b-chat
about 1 year ago
Quant pls
1
#5 opened about 1 year ago by
Yhyu13
New activity in
TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ
about 1 year ago
Did anyone get it to run?
11
#1 opened about 1 year ago by
dimaischenko
New activity in
TheBlokeAI/Mixtral-tiny-GPTQ
about 1 year ago
Seems like the GPTQ versions are broken
4
#2 opened about 1 year ago by
NePe
Load more