Hugging Face
Jiang Dongwei
6 followers · 1 following
Some-random
AI & ML interests
None yet
Recent Activity
liked a dataset about 2 months ago: Dongwei/Feedback_Friction_Dataset
new activity about 2 months ago: Dongwei/Feedback_Friction_Dataset: Add link to Github repository
upvoted a paper about 2 months ago: Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Organizations
Dongwei's models: 17
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata • Text Generation • 8B • Updated Feb 13 • 5
Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer • Text Generation • 8B • Updated Feb 11 • 5
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr • Text Generation • 8B • Updated Feb 11 • 5
Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata • Text Generation • 8B • Updated Feb 5 • 5
Dongwei/Qwen-2.5-7B_Base_Math_smalllr • Text Generation • 8B • Updated Feb 5 • 4 • 6
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr • Text Generation • 8B • Updated Feb 4 • 6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr • Text Generation • 2B • Updated Feb 4 • 4
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr • Text Generation • 2B • Updated Feb 4 • 6
Dongwei/Qwen-2.5-7B_Math_smalllr • Text Generation • 8B • Updated Feb 4 • 6
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math • Text Generation • 8B • Updated Feb 4 • 6
Dongwei/Qwen-2.5-7B_Math • Text Generation • 8B • Updated Feb 4 • 6
Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math • Text Generation • 2B • Updated Feb 3 • 5
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math • Text Generation • 2B • Updated Feb 3 • 7
Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO • Text Generation • 8B • Updated Feb 3 • 12 • 1
Dongwei/Qwen-2.5-7B • Text Generation • 8B • Updated Feb 3 • 5
Dongwei/Qwen2.5-1.5B-Open-R1-GRPO • Text Generation • 2B • Updated Feb 2 • 7 • 1
Dongwei/Rationalyst_reasoning_datasets • Text Generation • 8B • Updated Oct 13, 2024 • 163 • 4