rootxhacker
/

Apollo-v3-32B

Text Generation

Model card Files Files and versions Community

rootxhacker commited on Mar 14

Commit

5feeb4a

·

verified ·

1 Parent(s): 6ec46e6

Update README.md

Files changed (1) hide show

README.md +14 -2

README.md CHANGED Viewed

@@ -1,8 +1,21 @@
 # Apollo Model
 This is an experimental hybrid reasoning model built on Qwen2.5-32B-Instruct
 ### Merge Method
 This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as a base.
@@ -49,5 +62,4 @@ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 print(response)
-'''

+---
+license: mit
+language:
+- en
+base_model:
+- Qwen/Qwen2.5-32B-Instruct
+---
 # Apollo Model
 This is an experimental hybrid reasoning model built on Qwen2.5-32B-Instruct
+# GGUF
+mradermacher/Apollo-v3-32B-GGUF
+thanks mradermacher for this gguf
 ### Merge Method
 This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-32B-Instruct](https://huggingface.co/Qwen/Qwen2.5-32B-Instruct) as a base.
 response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
 print(response)
+'''