Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
5.2
TFLOPS
6
1
18
Olivier
oliviermills
Follow
gouthamr's profile picture
vikaswebdev's profile picture
avishetty's profile picture
7 followers
·
14 following
https://oliviermills.com
millsit
oliviermills
AI & ML interests
LLMs, Data, AI for non-profits
Recent Activity
reacted
to
lewtun
's
post
with 🔥
16 days ago
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open! 🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1. 🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code. 🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training. Follow along: https://github.com/huggingface/open-r1
replied
to
lewtun
's
post
16 days ago
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open! 🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1. 🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code. 🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training. Follow along: https://github.com/huggingface/open-r1
liked
a dataset
7 months ago
Salesforce/xlam-function-calling-60k
View all activity
Organizations
oliviermills
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
7 months ago
Salesforce/xlam-function-calling-60k
Viewer
•
Updated
18 days ago
•
60k
•
2.22k
•
413
liked
a model
8 months ago
numind/NuExtract-large
Text Generation
•
Updated
Jun 28, 2024
•
751
•
119
liked
a model
9 months ago
tomasonjo/text2cypher-demo-16bit
Text Generation
•
Updated
May 17, 2024
•
320
•
23
liked
a model
10 months ago
jetmoe/jetmoe-8b
Text Generation
•
Updated
Apr 15, 2024
•
2.71k
•
245
liked
3 models
11 months ago
mPLUG/DocOwl1.5
Updated
Apr 10, 2024
•
47
•
26
mPLUG/DocOwl1.5-stage1
Updated
Apr 10, 2024
•
48
•
11
mPLUG/DocOwl1.5-Chat
Updated
Apr 10, 2024
•
41
•
28
liked
a model
12 months ago
CohereForAI/aya-101
Text2Text Generation
•
Updated
Mar 31, 2024
•
3.62k
•
630
liked
6 models
about 1 year ago
togethercomputer/StripedHyena-Nous-7B
Text Generation
•
Updated
Mar 27, 2024
•
66
•
140
DiscoResearch/mixtral-7b-8expert
Text Generation
•
Updated
Dec 11, 2023
•
9.39k
•
264
microsoft/phi-2
Text Generation
•
Updated
Apr 29, 2024
•
366k
•
3.27k
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation
•
Updated
Aug 19, 2024
•
488k
•
•
4.3k
DiscoResearch/DiscoLM-mixtral-8x7b-v2
Text Generation
•
Updated
Dec 13, 2023
•
2.89k
•
123
Nexusflow/NexusRaven-V2-13B
Text Generation
•
Updated
May 29, 2024
•
3.9k
•
466
liked
4 models
over 1 year ago
01-ai/Yi-34B
Text Generation
•
Updated
Nov 11, 2024
•
5.86k
•
1.29k
01-ai/Yi-6B
Text Generation
•
Updated
Nov 11, 2024
•
6.8k
•
371
HuggingFaceH4/zephyr-7b-alpha
Text Generation
•
Updated
Oct 16, 2024
•
13.1k
•
•
1.1k
Deci/DeciLM-6b
Text Generation
•
Updated
Jul 29, 2024
•
513
•
233