Bartowski
PRO
bartowski
3203 followers · 65 following
bartowski1182
bartowski1182
AI & ML interests
Official model curator for https://lmstudio.ai/
Recent Activity
replied to their post about 5 hours ago
Looks like Q4_0_N_M file types are going away.

Before you panic, there's a new "preferred" method, which is online (I prefer the term on-the-fly) repacking. If you download Q4_0 and your setup can benefit from repacking the weights into interleaved rows (what Q4_0_4_4 was doing), it will do that automatically and give you similar performance (minor losses, I think, due to using intrinsics instead of assembly, but intrinsics are more maintainable).

You can see the reference PR here: https://github.com/ggerganov/llama.cpp/pull/10446

So if you update your llama.cpp past that point, you won't be able to run Q4_0_4_4 (unless they add backwards compatibility back), but Q4_0 should run at the same speeds (though it may currently be bugged on some platforms).

As such, I'll stop making those newer model formats soon, probably at the end of this week unless something changes, but you should be safe to download Q4_0 quants and use those!

Also, IQ4_NL supports repacking, though not in as many shapes yet, but it should get a respectable speedup on ARM chips. The PR for that can be found here: https://github.com/ggerganov/llama.cpp/pull/10541

Remember, these are not meant for Apple silicon, since those chips use the GPU and don't benefit from the repacking of weights.
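To make "repacking the weights into interleaved rows" a little more concrete, here is a toy sketch in Python. It is not llama.cpp's actual memory layout or code: the function name `interleave_rows`, the group size of 4, and the string placeholders standing in for quantized blocks are all assumptions made for illustration. The only idea it shows is that blocks from a small group of consecutive rows are stored next to each other, so a kernel could read the same block position from several rows in one pass.

```python
from typing import List, Sequence, TypeVar

Block = TypeVar("Block")


def interleave_rows(rows: Sequence[Sequence[Block]], group: int = 4) -> List[List[Block]]:
    """Toy repack: interleave the blocks of every `group` consecutive rows.

    Hypothetical helper for illustration only; real repacking in llama.cpp
    operates on packed quantized data and may interleave at a finer grain.
    """
    if len(rows) % group != 0:
        raise ValueError("row count must be a multiple of the group size")

    packed: List[List[Block]] = []
    for start in range(0, len(rows), group):
        chunk = rows[start:start + group]
        n_blocks = len(chunk[0])
        if any(len(row) != n_blocks for row in chunk):
            raise ValueError("all rows in a group must have the same number of blocks")
        # Walk block positions in the outer loop and rows in the inner loop,
        # so block b of every row in the group ends up adjacent in memory.
        packed.append([row[b] for b in range(n_blocks) for row in chunk])
    return packed


# Four rows of two placeholder blocks each ("r<row>b<block>").
rows = [[f"r{r}b{b}" for b in range(2)] for r in range(4)]
print(interleave_rows(rows))
```

Doing this once at load time, instead of shipping a separate pre-interleaved file, is the rough idea behind the on-the-fly approach the post describes.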
new activity about 13 hours ago
Qwen/QVQ-72B-Preview: GGUF weights?
updated a model about 22 hours ago
bartowski/QVQ-72B-Preview-GGUF
bartowski's activity
liked a model 1 day ago
Qwen/QVQ-72B-Preview • Image-Text-to-Text • Updated 1 day ago • 1.07k • 254
liked a model 19 days ago
meta-llama/Llama-3.3-70B-Instruct • Text Generation • Updated 4 days ago • 301k • 1.3k
liked 2 models 22 days ago
arcee-ai/Virtuoso-Small • Updated 21 days ago • 3.17k • 40
arcee-ai/Virtuoso-Small-GGUF • Updated 22 days ago • 5.08k • 4
liked 2 models 26 days ago
PrimeIntellect/INTELLECT-1 • Text Generation • Updated 26 days ago • 679 • 55
PrimeIntellect/INTELLECT-1-Instruct • Text Generation • Updated 26 days ago • 2.94k • 115
liked 2 models about 1 month ago
lmstudio-community/Mistral-Large-Instruct-2411-GGUF • Text Generation • Updated Nov 18 • 676 • 12
Qwen/Qwen2.5-Coder-32B-Instruct • Text Generation • Updated Nov 18 • 375k • 1.37k
liked a Space about 2 months ago
🏆 JudgeBench Leaderboard • Running • 17
liked 2 models about 2 months ago
alpindale/magnum-v4-123b-hqq-4bit • Updated Nov 3 • 18 • 1
microsoft/OmniParser • Image-Text-to-Text • Updated 23 days ago • 4.33k • 1.5k
liked 3 models 2 months ago
mistralai/Ministral-8B-Instruct-2410 • Updated 19 days ago • 2.99M • 371
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF • Text Generation • Updated Oct 25 • 177k • 1.94k
numind/NuExtract-1.5 • Text Generation • Updated Nov 18 • 10.6k • 177
liked 6 models 3 months ago
Locutusque/Hercules-6.0-Llama-3.1-8B • Text Generation • Updated Sep 28 • 19 • 8
meta-llama/Llama-3.2-3B-Instruct • Text Generation • Updated Oct 24 • 2.15M • 820
meta-llama/Llama-3.2-1B-Instruct • Text Generation • Updated Oct 24 • 1.72M • 661
mistralai/Mistral-Small-Instruct-2409 • Updated Oct 16 • 2.95M • 361
bartowski/Mistral-Small-Instruct-2409-GGUF • Text Generation • Updated Sep 19 • 15.6k • 49
lmstudio-community/Mistral-Small-Instruct-2409-GGUF • Text Generation • Updated Sep 17 • 2.76k • 21