SteelStorage
/

Lumosia-v2-MoE-4x10.7

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Steelskull commited on Jan 26, 2024

Commit

27bfbe0

·

verified ·

1 Parent(s): 23d2a0c

Create README.md

Files changed (1) hide show

README.md +103 -0

README.md ADDED Viewed

	@@ -0,0 +1,103 @@

+# Lumosia-v2-MoE-4x10.7
+"Lumosia" was selected as its a MoE of Multiple SOLAR Merges so it really "Lights the way".... its 3am.
+This is a very experimantal model. its a MoE of all good performing Solar models (based off of personal experiance not open leaderboard),
+The models goal was to make a good all rounder, in chat/logic/rp
+Why? Dunno whated to see what would happen
+context is 4k but coherent up to 16k
+A Lumosia Personality tavern card has been added
+Come join the Discord:
+[ConvexAI](https://discord.gg/yYqmNmg7Wj)
+Template:
+```
+### System:
+### USER:{prompt}
+### Assistant:
+```
+Settings:
+```
+Temp: 1.0
+min-p: 0.02-0.1
+```
+## Evals:
+* Avg:
+* ARC:
+* HellaSwag:
+* MMLU:
+* T-QA:
+* Winogrande:
+* GSM8K:
+## Examples:
+```
+Example 1:
+User:
+Lumosia:
+```
+```
+Example 2:
+User:
+Lumosia:
+```
+## 🧩 Configuration
+```
+yaml
+base_model: DopeorNope/SOLARC-M-10.7B
+gate_mode: hidden
+dtype: bfloat16
+experts:
+  - source_model: DopeorNope/SOLARC-M-10.7B
+    positive_prompts: [""]
+  - source_model: maywell/PiVoT-10.7B-Mistral-v0.2-RP
+    positive_prompts: [""]
+  - source_model: kyujinpy/Sakura-SOLAR-Instruct
+    positive_prompts: [""]
+  - source_model: jeonsworld/CarbonVillain-en-10.7B-v1
+    positive_prompts: [""]
+```
+## 💻 Usage
+```
+python
+!pip install -qU transformers bitsandbytes accelerate
+from transformers import AutoTokenizer
+import transformers
+import torch
+model = "Steelskull/Lumosia-MoE-4x10.7"
+tokenizer = AutoTokenizer.from_pretrained(model)
+pipeline = transformers.pipeline(
+    "text-generation",
+    model=model,
+    model_kwargs={"torch_dtype": torch.float16, "load_in_4bit": True},
+)
+messages = [{"role": "user", "content": "Explain what a Mixture of Experts is in less than 100 words."}]
+prompt = pipeline.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
+print(outputs[0]["generated_text"])
+```