--- base_model: silx-ai/Quasar-3.7 datasets: eyad-silx/Qausar-3.7-coding library_name: transformers model_name: Quasar-3.7 tags: - rl - silx - trl - sft licence: license --- # Quasar Series of Models

Quasar Model Image

## Introducing Quasar-3.7 This model is provided by **SILX INC**. It has been supervised fine-tuned using the **open-r1** repository. The training data includes sequences of varying lengths (32k, 16k, and 8k) to enhance the model's knowledge and adaptability. Quasar-3.3-Max represents the **first step** in the Quasar project before Reinforcement Learning (RL). At this stage, the model's reasoning steps are capped at a **maximum length of 8129 tokens** to optimize processing efficiency and contextual understanding. Stay tuned for further updates as we advance the Quasar project with RL enhancements! ## Resources - [Research Paper](https://arxiv.org/abs/2412.06822) - [Website](https://sicopilot.cloud) ## Founders - **Eyad Gomaa** - **Gomaa Salah**