Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
10.1
TFLOPS
7
2
256
Gilbert Bands
PRO
dbands
Follow
MsSevier's profile picture
abuface's profile picture
Mi6paulino's profile picture
13 followers
·
48 following
https://www.linkedin.com/in/deon-bands-business-architect/
dbands
AI & ML interests
None yet
Recent Activity
updated
a dataset
39 minutes ago
dbands/train_me.csv
published
a dataset
40 minutes ago
dbands/train_me.csv
replied
to
s-emanuilov
's
post
2 days ago
Tutorial 💥 Training a non-English reasoning model with GRPO and Unsloth I wanted to share my experiment with training reasoning models in languages other than English/Chinese. Using Llama 3.1 8B as base, GRPO trainer from trl, and Unsloth optimizations, I got a working prototype in Bulgarian after ~5 hours on an L40S GPU. The approach should work for any language where the base model has some pre-training coverage. Full code and tutorial here: https://unfoldai.com/reasoning-in-a-non-english-language/ The model itself: https://huggingface.co/s-emanuilov/LLMBG-Llama-3.1-8B-BG-Reasoning-v0.1 I hope this helps anyone looking to build reasoning models in their language.
View all activity
Organizations
models
83
Sort: Recently updated
dbands/Qwen2.5-Coder-14B-Instruct-reason-gguf
Updated
2 days ago
•
78
dbands/Qwen2.5-Coder-14B-Instruct-reason
Text Generation
•
Updated
2 days ago
•
3
•
1
dbands/Qwen2.5-Coder-7B-Instruct-reason-gguf
Updated
2 days ago
•
70
dbands/Qwen2.5-Coder-7B-Instruct-reason
Text Generation
•
Updated
2 days ago
•
6
dbands/Qwen2.5-3B-Instruct-reason-gguf
Updated
3 days ago
•
74
dbands/Qwen2.5-3B-Instruct-reason
Text Generation
•
Updated
3 days ago
•
3
dbands/Qwen2-5-Coder-0-5B-neo4j-text2cypher-2024v1-GGUF
Updated
Nov 28, 2024
•
66
•
1
dbands/Qwen2-5-Coder-0-5B_neo4j-text2cypher-2024v1-16B
Text Generation
•
Updated
Nov 28, 2024
•
171
dbands/Qwen2.5-7B-Instruct-pyomo-pysim-coder-gguf
Updated
Sep 21, 2024
•
55
dbands/Qwen2.5-7B-Instruct-pyomo-pysim-coder
Text Generation
•
Updated
Sep 21, 2024
•
5
Expand 83 models
datasets
8
Sort: Recently updated
dbands/train_me.csv
Viewer
•
Updated
39 minutes ago
•
426
dbands/horseTrainer
Viewer
•
Updated
Sep 18, 2024
•
102
•
32
dbands/test-set
Viewer
•
Updated
Aug 6, 2024
•
10
•
53
dbands/ChemistryCoder
Viewer
•
Updated
Jul 27, 2024
•
10.7k
•
14
•
2
dbands/pythonMath
Viewer
•
Updated
Jul 26, 2024
•
5.77k
•
10
•
1
dbands/ScoredPythonInstruct
Viewer
•
Updated
Jul 24, 2024
•
35.1k
•
10
dbands/PotsAndPots
Viewer
•
Updated
May 5, 2024
•
2
•
40
•
1
dbands/story
Viewer
•
Updated
Apr 21, 2024
•
97
•
43