You might be interested in this: a draft model for the full `deepseek-r1` model!
#1
by
jukofyork
- opened
I tested a few different models, and your's worked the best to create a draft model for the full deepseek-r1
model:
Cool, I'm glad it worked for your case! I was just working on vocab transplanting too, and your tool seems to work very well. Thank you!