AI & ML interests
LLM post-training
Organizations
960 • 13
2.3k • 11
82.8k • 15
1.76k • 11
1.32k • 13
789 • 8
6 • 9
ydeng9/swe-smith-rl-distill • 7.81k • 4
ydeng9/OpenVLThinker-grpo-hard • 6.25k • 372 • 1
ydeng9/OpenVLThinker-sft-iter3 • 3.28k • 31
ydeng9/OpenVLThinker-grpo-medium • 3.3k • 64
ydeng9/OpenVLThinker_sft_iter2 • 5.54k • 8
ydeng9/captioned-data-subsetv1 • 59.3k • 19
3.11k • 117 • 1
5.87k • 126 • 1