ll's picture

ll PRO

Etherll

AI & ML interests

None yet

Recent Activity

Organizations

Replete-AI's profile picture Artificial Consciousness Organization's profile picture Skye Team's profile picture AI Starter Pack's profile picture

Etherll's activity

reacted to s-emanuilov's post with 🔥 10 days ago
view post
Post
5131
Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth

I wanted to share my experiment with training reasoning models in languages other than English/Chinese.

Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.

Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/

The model itself: s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

I hope this helps anyone looking to build reasoning models in their language.
·
New activity in Etherll/Qwen2.5-7B-della-test 3 months ago
New activity in Etherll/Qwen2.5-Coder-7B-Instruct-Ties 5 months ago

Upload 5 files

#1 opened 5 months ago by
rombodawg