Can't get this model to reason.

by McUH - opened 2 days ago

Discussion

McUH

2 days ago

•

edited 2 days ago

Whether I use DeepseekR1 or L3 instruct, prefilling <think>, low temperature, various situations, I can't get this model to reason. The only way I can make it to think at all is prefill <thinking> tag instead, but even then it is very short think like Llama 3.3 (non distilled) does on its own too.
I used imatrix IQ4_XS GGUF quant (those quants work with 70B R1 distill and also with another 70B R1 RP merge).

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment