These repos are public because I hit the private storage limit, but feel free to try them.

This model uses the Mistral V7 prompt format.

It was trained on DeepSeek R1 RP logs and character cards, plus some funny shit.

Default system prompt: "You are MistralThinker, a Large Language Model (LLM) created by Undi.\nYour knowledge base was last updated on 2023-10-01. Current date: {date}.\n\nWhen unsure, state you don't know."

I recommend putting information about the persona and yourself in the system prompt to let the magic happen.

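As an illustration, the `{date}` placeholder in the default prompt can be filled at runtime and persona details appended afterwards. This is just a sketch; the persona and user descriptions below are made-up examples, not from the training data:

```python
from datetime import date

# Default system prompt from this README, with {date} filled in at runtime.
DEFAULT_SYSTEM = (
    "You are MistralThinker, a Large Language Model (LLM) created by Undi.\n"
    "Your knowledge base was last updated on 2023-10-01. "
    f"Current date: {date.today().isoformat()}.\n\n"
    "When unsure, state you don't know."
)

# Hypothetical persona/user descriptions; swap in your own character card.
persona = "You are roleplaying as Nero, a sarcastic vampire librarian."
user_info = "The user plays a curious traveler visiting the library."

system_prompt = f"{DEFAULT_SYSTEM}\n\n{persona}\n{user_info}"
print(system_prompt)
```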
Sadly, I have a problem with the prompt format in the `tokenizer_config.json`.

I tried to recreate what DeepSeek did with their distills: they added `<think>` at the beginning of each assistant reply and cut the thinking part out of the context.

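That context trimming can be sketched in a few lines, assuming a standard `messages` list of role/content dicts (the helper name is mine, not from the model's code):

```python
import re

# Match a <think>...</think> block plus any trailing whitespace.
THINK_RE = re.compile(r"<think>.*?</think>\s*", re.DOTALL)

def strip_thinking(messages):
    """Remove <think>...</think> blocks from earlier assistant turns
    so reasoning traces don't pile up in the context window."""
    cleaned = []
    for msg in messages:
        if msg["role"] == "assistant":
            msg = {**msg, "content": THINK_RE.sub("", msg["content"]).strip()}
        cleaned.append(msg)
    return cleaned

history = [
    {"role": "user", "content": "Hi!"},
    {"role": "assistant", "content": "<think>Greet warmly.</think>Hello there!"},
]
print(strip_thinking(history)[1]["content"])  # -> Hello there!
```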
I did the same, but on my side the first `<think>` doesn't appear when using "Chat completion".

Other than that, the model seems fully functional. Feel free to try it, but be sure to prefill `<think>` one way or another.
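One way to prefill is to switch to text completion and append `<think>` to the prompt yourself. The layout below is a sketch of a Mistral V7-style prompt, not the exact template; verify the special tokens against the model's `tokenizer_config.json`:

```python
def build_prompt(system: str, user: str) -> str:
    # Sketch of a Mistral V7-style prompt with the assistant turn
    # prefilled with <think>; check the real chat template for exact tokens.
    return (
        f"<s>[SYSTEM_PROMPT]{system}[/SYSTEM_PROMPT]"
        f"[INST]{user}[/INST]<think>"
    )

prompt = build_prompt("You are MistralThinker.", "Write a haiku about rain.")
print(prompt)
```

With chat-completion backends that support assistant-prefix continuation, sending a final assistant message whose content is just `<think>` should achieve the same effect.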