xldistance

AI & ML interests

None yet

Recent Activity

new activity about 2 hours ago

YiXin-AILab/YiXin-Distill-Qwen-72B:Can you quantify 4.5bpw of this model?

new activity about 14 hours ago

Qwen/QwQ-32B: When will you fix the model replies missing </think>\n start tags

liked a model about 14 hours ago

YiXin-AILab/YiXin-Distill-Qwen-72B

Organizations

None yet

xldistance's activity

New activity in YiXin-AILab/YiXin-Distill-Qwen-72B about 2 hours ago

Can you quantify 4.5bpw of this model?

#1 opened about 14 hours ago by

New activity in Qwen/QwQ-32B about 14 hours ago

When will you fix the model replies missing </think>\n start tags

#19 opened 8 days ago by

New activity in ordis-co-ltd/Qwen2.5-VL-72B-Instruct_exl2_6.0bpw 2 days ago

Can you train this model for 4.5bpw quantization?

#1 opened 12 days ago by

New activity in Qwen/QwQ-32B 8 days ago

missing opening <think>

#4 opened 8 days ago by

New activity in Qwen/Qwen2.5-VL-72B-Instruct 9 days ago

Anyone pls let me know what hardware can run 72B ?

#15 opened 17 days ago by

New activity in matatonic/r1-1776-distill-llama-70b-abliterated-6.5bpw-h8-exl2 11 days ago

Can you train this model for 4.5bpw quantization?

#1 opened 12 days ago by

New activity in matatonic/Qwen2.5-72B-Instruct-abliterated-v2-6.5bpw-h8-exl2 13 days ago

Can you produce a 4.5bpw quantized model of this model?

#1 opened 14 days ago by

New activity in qihoo360/TinyR1-32B-Preview 15 days ago

Repeated Thinking Tags in Output Generation

#2 opened 16 days ago by

New activity in deepseek-ai/DeepSeek-R1-Distill-Qwen-32B about 1 month ago

Can you distill qwen-2.5-72b?

#30 opened about 1 month ago by

New activity in NaniDAO/deepseek-r1-qwen-2.5-32B-ablated about 1 month ago

This model removes the limitations but the ability to write code decreases a lot.

#3 opened about 1 month ago by

Can you quantify the 4.0bpw weight of this model

#2 opened about 1 month ago by

New activity in bartowski/FuseO1-DeepSeekR1-QwQ-SkyT1-32B-Preview-GGUF about 2 months ago

This model is poorly coded and suspected of being a list swipe

#3 opened about 2 months ago by

New activity in pipilok/phi-4-unsloth-exl2-8bpw-hb8 about 2 months ago

Can you provide a 5.5bpw quantization of this model?

#2 opened about 2 months ago by

New activity in pipilok/phi-4-unsloth-exl2-8bpw-hb8 2 months ago

Model loading failure

#1 opened 2 months ago by

New activity in async0x42/Rombos-LLM-V2.5-Qwen-72b-exl2_3.25bpw 2 months ago

Can you produce a quantized 2.4bpw model of this model?

#1 opened 3 months ago by

New activity in matteogeniaccio/phi-4 3 months ago

Phi-4 = gpt-4o-mini

#4 opened 3 months ago by

New activity in Dracones/Athene-V2-Chat_exl2_3.0bpw 3 months ago

Can you produce a 2.4bpw quantization of this model?

#1 opened 3 months ago by

New activity in LoneStriker/Qwen2.5-72B-Instruct-2.25bpw-h6-exl2 3 months ago

How to reduce the problem of 2.25bpw quantitative models often responding haphazardly

#2 opened 4 months ago by

New activity in rombodawg/Rombos-LLM-V2.5-Qwen-72b 3 months ago

Can you make a 2.25bpw quantization for this model?

#4 opened 3 months ago by

New activity in AIDC-AI/Marco-o1 3 months ago

Can you use the same method to train the qwen2.5 32b model?

#24 opened 4 months ago by