You might be interested in this: a draft model for the full `deepseek-r1` model!

#1
by jukofyork - opened

I tested a few different models, and your's worked the best to create a draft model for the full deepseek-r1 model:

https://huggingface.co/jukofyork/DeepSeek-R1-DRAFT-0.5B

Cool, I'm glad it worked for your case! I was just working on vocab transplanting too, and your tool seems to work very well. Thank you!

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment