Triangle104 committed on
Commit 55030c4 · verified · 1 Parent(s): a5df84b

Update README.md

Files changed (1)
  1. README.md +88 -0
README.md CHANGED
@@ -11,6 +11,94 @@ tags:
  This model was converted to GGUF format from [`arcee-ai/Arcee-Maestro-7B-Preview`](https://huggingface.co/arcee-ai/Arcee-Maestro-7B-Preview) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
  Refer to the [original model card](https://huggingface.co/arcee-ai/Arcee-Maestro-7B-Preview) for more details on the model.
 
+ ---
+ Arcee-Maestro-7B-Preview (7B) is Arcee's first reasoning model trained with reinforcement learning. It is based on DeepSeek-R1-Distill-Qwen-7B, the DeepSeek-R1 distillation of Qwen2.5-7B, with further GRPO training. Though this is just a preview of our upcoming work, it already shows promising improvements to mathematical and coding abilities across a range of tasks.
+
+ ## Intended Use Cases
+
+ - Advanced reasoning
+ - Mathematics
+ - Coding
+
+ ## Training & Fine-Tuning
+
+ - Initial Training: Began with DeepSeek-R1-Distill-Qwen-7B
+ - GRPO (a sketch of the objective follows this list):
+   - Trained on 450,000 verified math problems
+   - Additional bootstrapped coding examples
+
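+ For orientation, a minimal sketch of the group-relative objective that standard GRPO maximizes (per the DeepSeekMath formulation; this is general background, not a detail published for this model): sample $G$ completions per prompt, score each with a verifiable reward $r_i$, and normalize within the group:
+
+ $$
+ \hat{A}_i = \frac{r_i - \mathrm{mean}(r_1,\dots,r_G)}{\mathrm{std}(r_1,\dots,r_G)}, \qquad
+ \mathcal{J}(\theta) = \mathbb{E}\!\left[\frac{1}{G}\sum_{i=1}^{G} \min\!\big(\rho_i \hat{A}_i,\ \mathrm{clip}(\rho_i, 1-\epsilon, 1+\epsilon)\,\hat{A}_i\big)\right] - \beta\, \mathbb{D}_{\mathrm{KL}}\!\big[\pi_\theta \,\|\, \pi_{\mathrm{ref}}\big]
+ $$
+
+ Here $\rho_i = \pi_\theta(o_i \mid q)/\pi_{\theta_{\mathrm{old}}}(o_i \mid q)$ is the usual importance ratio; unlike PPO, no learned value network is needed, since the group mean serves as the baseline.
+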
+ ## Performance
+
+ Arcee-Maestro-7B-Preview shows strong performance in mathematics as well as coding, competing with even o1-preview, a model far surpassing it in size.
+
+ ## Limitations
+
+ - Context Length: 128k tokens (may vary depending on the final tokenizer settings and system resources).
+ - Knowledge Cut-off: Training data may not reflect the latest events or developments beyond June 2024.
+
+ ## Ethical Considerations
+
+ - Content Generation Risks: Like any language model, Arcee-Maestro-7B-Preview can generate potentially harmful or biased content if prompted in certain ways.
+
+ ## License
+
+ Arcee-Maestro-7B-Preview (7B) is released under the Apache-2.0 License. You are free to use, modify, and distribute this model in both commercial and non-commercial applications, subject to the terms and conditions of the license.
+
+ ---
  ## Use with llama.cpp
  Install llama.cpp through brew (works on Mac and Linux)
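
The diff is truncated here, but the standard GGUF-my-repo usage steps continue along these lines; a minimal sketch, assuming this quant repo is named `Triangle104/Arcee-Maestro-7B-Preview-Q4_K_M-GGUF` and contains `arcee-maestro-7b-preview-q4_k_m.gguf` (both hypothetical; check the actual repo's file list):

```bash
# Install llama.cpp (macOS and Linux)
brew install llama.cpp

# Run the GGUF model straight from the Hugging Face Hub.
# Repo and file names below are assumptions; substitute the real ones.
llama-cli --hf-repo Triangle104/Arcee-Maestro-7B-Preview-Q4_K_M-GGUF \
  --hf-file arcee-maestro-7b-preview-q4_k_m.gguf \
  -p "Prove that the sum of two even integers is even."
```

For an OpenAI-compatible HTTP endpoint instead of a one-shot prompt, `llama-server` accepts the same `--hf-repo`/`--hf-file` flags, with `-c 2048` to set the context size.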