AI Plans

company

https://aiplans.org

AI & ML interests

None defined yet.

Recent Activity

Kabs9000 updated a model 3 days ago

AIPlans/Qwen3-0.6B-ReMax

Kabs9000 updated a collection 3 days ago

Post Training Versions - Qwen 0.6B

Collections 4

Models 29

AIPlans/Qwen3-0.6B-ReMax

Reinforcement Learning • 0.6B • Updated 3 days ago • 24 • 1

AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA

Text Generation • 0.6B • Updated 5 days ago • 16

AIPlans/Qwen3-0.6B-GRPO_Epoch2

Text Generation • 0.6B • Updated 7 days ago • 12

AIPlans/Qwen3-0.6B-GRPO_Epoch1

Text Generation • 0.6B • Updated 8 days ago • 15

AIPlans/Qwen3-0.6B-GRPO

Updated 11 days ago

AIPlans/Qwen3-0.6B-IPO

Reinforcement Learning • 0.6B • Updated 13 days ago • 9

AIPlans/qwen3-0.6b-base-PPO-hs2

Updated 14 days ago

AIPlans/Qwen3-0.6B-DPO_Epoch_1

Text Generation • 0.6B • Updated 18 days ago • 22

AIPlans/Qwen3-0.6B-PPO

Updated 21 days ago

AIPlans/Qwen3-0.6B-PPO1

Updated 21 days ago

Datasets 17

AIPlans/Helpsteer2-helpfulness-prompts

Viewer • Updated 20 days ago • 7.22k • 85

AIPlans/helpsteer2-helpfulness-preference-cleaned

Viewer • Updated 29 days ago • 6.99k • 52

AIPlans/trackio-experiments

Updated Oct 14 • 5

AIPlans/ultrafeedback_binarized_chinese

Viewer • Updated Aug 1 • 14k • 16

AIPlans/ultrafeedback_binarized

Viewer • Updated Aug 1 • 14k • 21

AIPlans/FilteredPKU-SafeRLHF_chinese

Viewer • Updated Jul 31 • 12k • 21

AIPlans/FilteredPKU-SafeRLHF

Viewer • Updated Jul 31 • 12k • 25

AIPlans/SafetyBench_WithLabels_Better_chinese

Viewer • Updated Jul 24 • 546 • 14

AIPlans/SafetyBench_WithLabels

Viewer • Updated Jul 24 • 546 • 19

AIPlans/ToxiGen_chinese

Viewer • Updated Jul 22 • 1k • 15
