Computational-Intelligence commited on
Commit
510bc70
·
verified ·
1 Parent(s): bb12ca5

WinkingFace (;

Browse files
Files changed (1) hide show
  1. README.md +40 -0
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ## Model Description
2
+
3
+ The Mit series of large language models (LLMs) by WinkingFace is designed to seamlessly integrate intuitive conversational abilities with advanced multi-step reasoning. Unlike conventional AI models that focus solely on generating responses, Mit enhances structured thinking, contextual understanding, and function-calling precision to deliver more accurate and insightful interactions.
4
+
5
+ Mit models are adaptable, capable of switching between standard conversational tasks and complex reasoning. By incorporating refined logical inference mechanisms, Mit achieves superior accuracy in judgment, decision-making, and long-form analytical tasks.
6
+
7
+ Built on the open-source Qwen platform, Mit has undergone extensive architectural refinements and performance optimizations to align more effectively with real-world applications. Our fine-tuning efforts emphasize deeper contextual awareness, enhanced response coherence, and improved execution of function-calling, making Mit a powerful and versatile AI system.
8
+
9
+
10
+ ## Requirements
11
+
12
+ Mit's code is integrated into WinkingFace's custom version of `transformers`, and we recommend using this modified version for optimal compatibility.
13
+
14
+ To prevent potential errors, such as:
15
+
16
+ ```
17
+ KeyError: 'mit'
18
+ ```
19
+
20
+ install the custom `transformers` package using the following command:
21
+
22
+ ```
23
+ pip install git+https://github.com/WinkingFaceAI/tfm-recooked.git
24
+ ```
25
+
26
+ This ensures seamless functionality and avoids compatibility issues with the model.
27
+
28
+
29
+ ## License
30
+
31
+ This code repository and the model weights are licensed under the [Apache 2.0 License](https://huggingface.co/WinkingFace/Mit-0.5B/blob/main/LICENSE). The Mit series is fully compatible with commercial use and allows for modifications and derivative works, including but not limited to distillation for training other LLMs.
32
+
33
+ Please note that:
34
+
35
+ Mit-0.5B, Mit-1.5B, Mit-3B, and Mit-7B are derived from the [Qwen series](https://huggingface.co/Qwen), which is also licensed under the [Apache 2.0 License](https://huggingface.co/Qwen/Qwen2.5-1.5B/blob/main/LICENSE).
36
+
37
+
38
+ ## Contact
39
+
40
+ For any questions or inquiries, feel free to [contact us here 📨](mailto:[email protected]).