Update README.md
Browse files
README.md
CHANGED
@@ -14,8 +14,9 @@ library_name: transformers
|
|
14 |
|
15 |
# Llama-3.1-Tulu-3.1-8B
|
16 |
|
17 |
-
|
18 |
-
|
|
|
19 |
|
20 |
**Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training.
|
21 |
We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model,
|
|
|
14 |
|
15 |
# Llama-3.1-Tulu-3.1-8B
|
16 |
|
17 |
+
Tülu 3 is a leading instruction following model family, offering a post-training package with fully open-source data, code, and recipes designed to serve as a comprehensive guide for modern techniques.
|
18 |
+
This is one step of a bigger process to training fully open-source models, like our [OLMo](https://allenai.org/olmo) models.
|
19 |
+
Tülu 3 is designed for state-of-the-art performance on a diversity of tasks in addition to chat, such as MATH, GSM8K, and IFEval.
|
20 |
|
21 |
**Version 3.1 update**: The new version of our Tülu model is from an improvement only in the final RL stage of training.
|
22 |
We switched from PPO to GRPO (no reward model) and did further hyperparameter tuning to achieve substantial performance improvements across the board over the original Tülu 3 8B model,
|