TokenButler -- Predict token importance for all heads across the transformer from the first layer alone, enabling fine-grained token sparsity.
YASH AKHAURI
akhauriyash
AI & ML interests: None yet
Recent Activity
Updated a model about 7 hours ago: akhauriyash/Llama-2-7b-hf-Butler
Updated a model about 7 hours ago: akhauriyash/Llama-3.1-8B-Butler
Updated a model about 7 hours ago: akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler
Organizations: None yet
Collections: 1
Models: 5
akhauriyash/Llama-2-7b-hf-Butler • Text Generation • 19
akhauriyash/Llama-3.1-8B-Butler • Text Generation • 10
akhauriyash/DeepSeek-R1-Distill-Llama-8B-Butler • Text Generation • 21
akhauriyash/Llama-3.2-1B-Butler • Text Generation • 51
akhauriyash/Llama-3.2-3B-Butler • Text Generation • 13
Datasets: None public yet