davideuler
/

NebulaNet-v2-4x7B-moe

Text Generation

Mixture of Experts

text-generation-inference

Model card Files Files and versions

davideuler commited on Mar 25, 2024

Commit

ad80d21

·

verified ·

1 Parent(s): 2af5dff

Update README.md

Files changed (1) hide show

README.md +9 -6

README.md CHANGED Viewed

@@ -1,18 +1,21 @@
 ---
-language:
-  - multilingual
-thumbnail: "url to a thumbnail used in social sharing"
 tags:
 - coding
 - moe
-license: "mit"
-base_model: "ContextualAI/Contextual_KTO_Mistral_PairRM"
 ---
 ## Usage
 NebulaNet-v2: An MOE of 4 7b expert models.
 It is good at coding and multi language translation. It should be fluent at chat and math too.
 ## mergekit config
 ```
 base_model: ContextualAI/Contextual_KTO_Mistral_PairRM
@@ -41,4 +44,4 @@ experts:
     - "mathematics"
     - "solve"
     - "count"
-```

 ---
+language:
+- multilingual
+thumbnail: url to a thumbnail used in social sharing
 tags:
 - coding
 - moe
+license: mit
+base_model: ContextualAI/Contextual_KTO_Mistral_PairRM
+pipeline_tag: text-generation
 ---
 ## Usage
 NebulaNet-v2: An MOE of 4 7b expert models.
 It is good at coding and multi language translation. It should be fluent at chat and math too.
+The 4x7b merged model performs much better than the original Contextual_KTO_Mistral_PairRM on both coding and multilingual text generation in my observation.
 ## mergekit config
 ```
 base_model: ContextualAI/Contextual_KTO_Mistral_PairRM
     - "mathematics"
     - "solve"
     - "count"
+```