Jiang's picture

7 8 2

Jiang

Dongwei

·

Some-random

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

Dongwei/Feedback_Friction_Dataset

new activity about 2 months ago

Dongwei/Feedback_Friction_Dataset:Add link to Github repository

upvoted a paper about 2 months ago

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

View all activity

Organizations

Dongwei 's models 17

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_newdata

Text Generation • 8B • Updated Feb 13 • 5

Dongwei/Qwen-2.5-7B_Base_Math_smalllr_longer

Text Generation • 8B • Updated Feb 11 • 5

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr

Text Generation • 8B • Updated Feb 11 • 5

Dongwei/Qwen-2.5-7B_Base_Math_smallestlr_newdata

Text Generation • 8B • Updated Feb 5 • 5

Dongwei/Qwen-2.5-7B_Base_Math_smalllr

Text Generation • 8B • Updated Feb 5 • 4 • 6

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math_lowlr

Text Generation • 8B • Updated Feb 4 • 6

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math_smalllr

Text Generation • 2B • Updated Feb 4 • 4

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math_smalllr

Text Generation • 2B • Updated Feb 4 • 6

Dongwei/Qwen-2.5-7B_Math_smalllr

Text Generation • 8B • Updated Feb 4 • 6

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math

Text Generation • 8B • Updated Feb 4 • 6

Dongwei/Qwen-2.5-7B_Math

Text Generation • 8B • Updated Feb 4 • 6

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math

Text Generation • 2B • Updated Feb 3 • 5

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math

Text Generation • 2B • Updated Feb 3 • 7

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO

Text Generation • 8B • Updated Feb 3 • 12 • 1

Dongwei/Qwen-2.5-7B

Text Generation • 8B • Updated Feb 3 • 5

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • 2B • Updated Feb 2 • 7 • 1

Dongwei/Rationalyst_reasoning_datasets

Text Generation • 8B • Updated Oct 13, 2024 • 163 • 4