Hi @shymkovic! This tutorial is meant to walk through the process so that you can replicate it with your own configuration or preferred LLMs. There is no special reason for using the distilled version, other than that it is available through the Serverless Inference API, so everyone can test it.
Sara Han Díaz
sdiazlor
AI & ML interests
Data curation and generation, RLHF, RAG, Prompt Engineering
Recent Activity
commented on their article
about 2 hours ago
Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset
new activity
about 2 hours ago
sdiazlor/civil-human-rights-question-answering:Librarian Bot: Add language metadata for dataset
new activity
about 2 hours ago
sdiazlor/rag-human-rights-from-prompt:Librarian Bot: Add language metadata for dataset
Organizations
sdiazlor's activity
commented on
Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset
about 2 hours ago
Librarian Bot: Add language metadata for dataset
#2 opened 21 days ago by librarian-bot
Librarian Bot: Add language metadata for dataset
#1 opened 18 days ago by librarian-bot
reacted to burtenshaw's post with 🔥
about 12 hours ago
Post
2002
The Hugging Face agents course is finally out!
👉 https://huggingface.co/agents-course
This first unit of the course sets you up with all the fundamentals to become a pro in agents.
- What's an AI Agent?
- What are LLMs?
- Messages and Special Tokens
- Understanding AI Agents through the Thought-Action-Observation Cycle
- Thought, Internal Reasoning and the Re-Act Approach
- Actions, Enabling the Agent to Engage with Its Environment
- Observe, Integrating Feedback to Reflect and Adapt
published an article
1 day ago
Article
Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset
21
Runtime Error Fix
1
#1 opened 1 day ago by sdiazlor