Reka Flash 3 is our latest general-purpose reasoning model. It excels at general chat, coding, instruction following, and function calling. At 21B parameters, it performs competitively with proprietary models such as OpenAI o1-mini, making it a good foundation for applications that require low latency or on-device deployment. It is currently the best model in its size category.
Try it out at [Reka Space](https://space.reka.ai).

If you are using HF or vLLM, the built-in chat_template will handle prompt formatting for you.
Reka Flash thinks before it produces an output. It uses `<reasoning>` and `</reasoning>` tags to mark the beginning and end of its thinking process. For some problems, the model may think for a long time. You can make the model stop thinking by forcing it to output `</reasoning>` after a certain number of steps; we observe that such a budget forcing mechanism still produces a reasonable output. We show performance on AIME-2024 (cons@16) for various budgets below.
| Budget | AIME-2024 (cons@16) |
|--------|---------------------|
| 4k     | 40   |
| 8k     | 46   |
| 12k    | 50   |
| 16k    | 56   |
| 24k    | 60   |
| 32k    | 60   |
| 48k    | 63.3 |
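The budget forcing idea above can be sketched in a few lines. This is an illustrative helper, not Reka's API: a real decoding loop would count tokens with the model's tokenizer and call `generate()` again after forcing the tag, whereas here `force_budget` and the whitespace tokenizer are simplifying assumptions.

```python
# Sketch of budget forcing: cap the model's thinking phase at a fixed
# budget by appending the closing </reasoning> tag ourselves, so a
# follow-up generation call moves on to the final answer.

REASONING_CLOSE = "</reasoning>"

def force_budget(generated: str, budget: int, tokenize=str.split) -> str:
    """Return `generated` with the thinking phase capped at `budget` tokens.

    If the model already emitted </reasoning>, or is still within budget,
    the text is returned unchanged. Otherwise the reasoning is truncated
    at the budget and the closing tag is forced, after which decoding
    would resume to produce the answer.
    """
    if REASONING_CLOSE in generated:
        return generated  # model finished thinking on its own
    tokens = tokenize(generated)  # whitespace split stands in for a real tokenizer
    if len(tokens) <= budget:
        return generated  # still within budget: keep decoding
    return " ".join(tokens[:budget]) + " " + REASONING_CLOSE
```

In practice you would pass the partially decoded string to a helper like this each time the budget check fires, then feed the forced text back in as the prompt for the remaining generation.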
### Language Support