Daniel Han-Chen
danielhanchen
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
unsloth/DeepSeek-R1-Distill-Qwen-32B-bnb-4bit
updated
a model
about 2 hours ago
unsloth/DeepSeek-R1-Distill-Qwen-32B-unsloth-bnb-4bit
updated
a model
about 2 hours ago
unsloth/DeepSeek-R1-Distill-Qwen-32B
Organizations
danielhanchen's activity
Are the Q4 and Q5 models R1 or R1-Zero
18
#2 opened 25 days ago
by
gng2info
fix position embeddings
3
#1 opened about 1 month ago
by
PatentPilotAI
I loaded DeepSeek-V3-Q5_K_M up on my 10yrs old old Tesla M40 (Dell C4130)
3
#8 opened about 1 month ago
by
gng2info
Suggested tokenizer changes by Unsloth.ai
7
#21 opened about 1 month ago
by
gugarosa
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1666203761402-6157454831624da88210e627.jpeg)
Getting error with Q3-K-M
7
#2 opened about 1 month ago
by
alain401
Advice on running llama-server with Q2_K_L quant
3
#6 opened about 1 month ago
by
vmajor
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63992e59afe0d224cf2b6bf1/q2JeqTcIb5j6fUg1SWGzL.jpeg)
llama.cpp cannot load Q6_K model
5
#3 opened about 1 month ago
by
vmajor
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63992e59afe0d224cf2b6bf1/q2JeqTcIb5j6fUg1SWGzL.jpeg)
Big thanks for these "without original" uploads!
1
#1 opened 2 months ago
by
jukofyork
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65995c45539c808e84c38bf1/k0y3ULloWQEMvosQwHgrE.png)
Aphrodite/VLLM/SGLang all refuse to load this model
2
#5 opened 5 months ago
by
fullstack
No module named 'triton'
1
#3 opened 5 months ago
by
NeelM0906
update base_model
#1 opened 5 months ago
by
davanstrien
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1627505688463-60107b385ac3e86b3ea4fc34.jpeg)
Cant use the tokenizer using Unsloth Fastmodel
2
#2 opened 6 months ago
by
aryarishit
difference
3
#1 opened 7 months ago
by
ehartford
![](https://cdn-avatars.huggingface.co/v1/production/uploads/63111b2d88942700629f5771/u2a9y-yx6TG0N31OhMSHI.png)
9B - query_pre_attn_scalar = 256 not 224
#26 opened 7 months ago
by
danielhanchen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62ecdc18b72a69615d6bd857/ixLCk0TwaCVyL_nAfrgEs.png)
9B - query_pre_attn_scalar = 256 not 224
#22 opened 7 months ago
by
danielhanchen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/62ecdc18b72a69615d6bd857/ixLCk0TwaCVyL_nAfrgEs.png)
is this the llama-3-8b model clone?
13
#1 opened 10 months ago
by
malhajar
![](https://cdn-avatars.huggingface.co/v1/production/uploads/639c5c448a34ed9a404a956b/jcypw-eh7JzKHTffd0N9l.jpeg)
Model seems to be not PEFT model
1
#1 opened 9 months ago
by
neuralresearcher
full disk on colab
3
#2 opened 9 months ago
by
Dav22