James Braza
jamesbraza
AI & ML interests
None yet
Recent Activity
updated
a dataset
8 days ago
futurehouse/t-r1
published
a dataset
8 days ago
futurehouse/t-r1
new activity
12 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B:Tokenizer config is wrong
Organizations
jamesbraza's activity
Tokenizer config is wrong
8
#10 opened 21 days ago
by
stoshniwal
Tokenizer config's `chat_template` removes everything before `</think>` XML closing tag
1
#21 opened 12 days ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
Sorting by Hallucination Rate fails
#5 opened 7 months ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
Failed to run on MacBook: requiring flash_attn
7
#72 opened about 1 year ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
Model answering with all newlines?
2
#19 opened over 1 year ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
Failure to reproduce QA Format response from the README
2
#71 opened about 1 year ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
๐ข [v0.19.0] Inference Endpoints and robustness!
2
#2 opened over 1 year ago
by
Wauplin
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1659336880158-6273f303f6d63a28483fde12.png)
Are these models trained from scratch, or just different quantizations of the same model?
3
#2 opened over 1 year ago
by
jamesbraza
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64b83ceb49bde5d94813ce2e/PYf6nXBVGQPIgTk_NzudC.png)
Could not load Llama model from path
33
#5 opened over 1 year ago
by
rahul07
Could not load Llama model from path
33
#5 opened over 1 year ago
by
rahul07