TokenButler -- Predict token importance for all heads across the transformer in the first layer itself. Enable fine-grained token sparsity!
YASH AKHAURI
akhauriyash
AI & ML interests
None yet
Recent Activity
new activity
2 days ago
akhauriyash/Llama-3.2-1B-Butler:Adding `safetensors` variant of this model
new activity
2 days ago
akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler:Adding `safetensors` variant of this model
new activity
2 days ago
akhauriyash/Llama-3.1-8B-Butler:Adding `safetensors` variant of this model
Organizations
None yet
Collections
1
models
5
akhauriyash/Llama-3.2-1B-Butler
Text Generation
•
Updated
•
66
akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler
Feature Extraction
•
Updated
•
55
akhauriyash/Llama-3.1-8B-Butler
Text Generation
•
Updated
•
37
akhauriyash/Llama-2-7b-hf-Butler
Text Generation
•
Updated
•
47
akhauriyash/Llama-3.2-3B-Butler
Text Generation
•
Updated
•
28
datasets
None public yet