Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
267.6
TFLOPS
54
15
146
Daniel Han-Chen
danielhanchen
Follow
LeoKinHol's profile picture
noix's profile picture
Fayaz's profile picture
244 followers
·
110 following
https://unsloth.ai/
danielhanchen
AI & ML interests
None yet
Recent Activity
updated
a model
about 3 hours ago
unsloth/DeepSeek-R1-GGUF
updated
a model
about 11 hours ago
unsloth/Qwen2.5-14B-Instruct-1M-bnb-4bit
published
a model
about 11 hours ago
unsloth/Qwen2.5-14B-Instruct-1M-bnb-4bit
View all activity
Articles
Faster fine-tuning using TRL & Unsloth
Jan 10, 2024
•
46
Organizations
danielhanchen
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
unsloth/DeepSeek-R1-GGUF
about 20 hours ago
Are the Q4 and Q5 models R1 or R1-Zero
17
#2 opened 7 days ago by
gng2info
New activity in
unsloth/Mistral-Nemo-Instruct-2407
13 days ago
fix position embeddings
3
#1 opened 13 days ago by
PatentPilotAI
New activity in
unsloth/DeepSeek-V3-GGUF
16 days ago
I loaded DeepSeek-V3-Q5_K_M up on my 10yrs old old Tesla M40 (Dell C4130)
3
#8 opened 18 days ago by
gng2info
New activity in
microsoft/phi-4
17 days ago
Suggested tokenizer changes by Unsloth.ai
6
#21 opened 17 days ago by
gugarosa
New activity in
unsloth/DeepSeek-V3-GGUF
18 days ago
Getting error with Q3-K-M
7
#2 opened 19 days ago by
alain401
Advice on running llama-server with Q2_K_L quant
3
#6 opened 18 days ago by
vmajor
New activity in
unsloth/DeepSeek-V3-GGUF
19 days ago
llama.cpp cannot load Q6_K model
5
#3 opened 19 days ago by
vmajor
New activity in
unsloth/Llama-3.3-70B-Instruct
about 1 month ago
Big thanks for these "without original" uploads!
1
#1 opened about 2 months ago by
jukofyork
New activity in
unsloth/gemma-2-27b-it-bnb-4bit
5 months ago
Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 5 months ago by
fullstack
New activity in
unsloth/gemma-7b-bnb-4bit
5 months ago
No module named 'triton'
1
#3 opened 5 months ago by
NeelM0906
New activity in
unsloth/Hermes-3-Llama-3.1-8B-bnb-4bit
5 months ago
update base_model
#1 opened 5 months ago by
davanstrien
New activity in
unsloth/mistral-7b-instruct-v0.3
5 months ago
ValueError: The following `model_kwargs` are not used by the model: ['num_logits_to_keep'] (note: typos in the generate arguments will also show up in this list)
2
#1 opened 5 months ago by
NeelM0906
New activity in
unsloth/Phi-3-mini-4k-instruct-v0-bnb-4bit
5 months ago
Cant use the tokenizer using Unsloth Fastmodel
2
#2 opened 5 months ago by
aryarishit
New activity in
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit
6 months ago
RuntimeError: Unsloth: `unsloth/Meta-Llama-3.1-8B-bnb-4bit` is not a base model or a PEFT model.
6
#3 opened 6 months ago by
yorickdejong
New activity in
unsloth/Mistral-Nemo-Base-2407
6 months ago
difference
3
#1 opened 6 months ago by
ehartford
New activity in
google/gemma-2-9b-it
7 months ago
9B - query_pre_attn_scalar = 256 not 224
#26 opened 7 months ago by
danielhanchen
New activity in
google/gemma-2-9b
7 months ago
9B - query_pre_attn_scalar = 256 not 224
#22 opened 7 months ago by
danielhanchen
New activity in
unsloth/llama-3-8b
8 months ago
is this the llama-3-8b model clone?
13
#1 opened 9 months ago by
malhajar
New activity in
unsloth/gemma-2b-bnb-4bit
8 months ago
Model seems to be not PEFT model
1
#1 opened 8 months ago by
neuralresearcher
New activity in
unsloth/mistral-7b-v0.2-bnb-4bit
8 months ago
full disk on colab
3
#2 opened 8 months ago by
Dav22
Load more