Simeon Emanuilov PRO

s-emanuilov

AI & ML interests

Software Engineer & Ph.D. candidate | Specializing in ML/DL system development & applying AI to solve real-world business problems.

s-emanuilov's activity

replied to their post about 8 hours ago

Try reducing gpu_memory_utilization to a lower value.
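A minimal sketch of what "a lower value" could look like in practice, assuming gpu_memory_utilization is the fraction (between 0 and 1) of GPU memory the inference engine may reserve. The helper name, the starting value of 0.6, the floor of 0.3, and the step of 0.1 are all illustrative assumptions, not values from the thread.

```python
def utilization_candidates(start=0.6, floor=0.3, step=0.1):
    """Yield progressively lower GPU memory fractions to try, from start down to floor.

    Hypothetical helper: the default numbers are illustrative, not recommendations.
    """
    # Work with an integer step count to avoid accumulating float error.
    steps = int(round((start - floor) / step))
    for i in range(steps + 1):
        yield round(start - i * step, 2)


# Each candidate would be passed as gpu_memory_utilization when loading the
# model; stop at the first value that no longer runs out of memory.
for candidate in utilization_candidates():
    print(f"try gpu_memory_utilization={candidate}")
```

Lower values leave more headroom for training activations but reduce the memory available for generation, so stepping down gradually is usually safer than jumping straight to a very small fraction.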

upvoted a paper 1 day ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published 5 days ago • 42

replied to their post 2 days ago

Thank you.

I’m also a big fan of Qwen models. In this case, though, I don’t think they are a good fit, because I’m not entirely confident in their multilingual capabilities. That’s why I chose Llama.

Overall, I agree that the Qwen series is excellent for most tasks.

posted an update 2 days ago

Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth

I wanted to share my experiment with training reasoning models in languages other than English/Chinese.

Using Llama 3.1 8B as the base model, the GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage.
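To give a feel for how GRPO can be steered toward a target language, here is a hedged sketch of a language-consistency reward. It is not the reward used in the tutorial: the function name and the Cyrillic-ratio heuristic are assumptions for illustration. It follows the general shape trl expects from a custom reward function (it accepts completions plus extra keyword arguments and returns one float per completion), and it assumes completions arrive as plain strings rather than chat-message lists.

```python
def cyrillic_ratio_reward(completions, **kwargs):
    """Score each completion by the share of its letters that are Cyrillic.

    Hypothetical reward: 1.0 means every letter is Cyrillic, 0.0 means none
    are (or the text contains no letters at all).
    """
    rewards = []
    for text in completions:
        letters = [ch for ch in text if ch.isalpha()]
        if not letters:
            rewards.append(0.0)
            continue
        # U+0400..U+04FF is the basic Cyrillic Unicode block.
        cyrillic = sum(1 for ch in letters if "\u0400" <= ch <= "\u04FF")
        rewards.append(cyrillic / len(letters))
    return rewards


# Quick check on a few toy completions.
print(cyrillic_ratio_reward(["Здравей", "hello", "да ok"]))
```

A function like this would typically be combined with other rewards (for example, answer correctness and output format), since script detection alone says nothing about whether the reasoning is right.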

Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/

The model itself: s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

I hope this helps anyone looking to build reasoning models in their language.

4 replies

updated a model 2 days ago

s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

Text Generation • Updated 2 days ago • 11 • 1

upvoted a paper 3 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 92

published a model 3 days ago

s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1

Text Generation • Updated 2 days ago • 11 • 1

liked a model 3 days ago

INSAIT-Institute/BgGPT-Gemma-2-9B-IT-v1.0

Text Generation • Updated Dec 4, 2024 • 167 • 11

updated a model 3 days ago

s-emanuilov/LLMBG-Llama-3.1-8B-Instruct-bnb-4bit

Updated 3 days ago

upvoted a paper 6 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 62

reacted to m-ric's post with 🔥 6 days ago

Introducing open Deep-Research by Hugging Face! 💥

OpenAI's latest agentic app Deep Research seems really good... But it's closed, as usual.

⏱️ So with a team of cracked colleagues, we set ourselves a 24-hour deadline to replicate and open-source Deep Research! ⏱️

➡️ We built open-Deep-Research, an entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculations on data...

We aimed for the best performance: are the agent's answers really rigorous?

On the GAIA benchmark, Deep Research had 67% accuracy on the validation set.
➡️ open Deep Research is at 55% (powered by o1). It is:
- the best pass@1 solution submitted
- the best open solution 💪💪

And it's only getting started! Please jump in, drop PRs, and let's bring it to the top!

Read the blog post 👉 https://huggingface.co/blog/open-deep-research