AI Plans

company

https://aiplans.org

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Collections 4

View 4 collections

models 29

AIPlans/Qwen3-0.6B-ReMax

Reinforcement Learning • 0.6B • Updated Dec 22, 2025 • 6 • 2

AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA

Text Generation • 0.6B • Updated Dec 20, 2025 • 8

AIPlans/Qwen3-0.6B-GRPO_Epoch2

Text Generation • 0.6B • Updated Dec 18, 2025 • 2

AIPlans/Qwen3-0.6B-GRPO_Epoch1

Text Generation • 0.6B • Updated Dec 18, 2025 • 3

AIPlans/Qwen3-0.6B-GRPO

Updated Dec 15, 2025

AIPlans/Qwen3-0.6B-IPO

Reinforcement Learning • 0.6B • Updated Dec 12, 2025 • 34 • 1

AIPlans/qwen3-0.6b-base-PPO-hs2

Updated Dec 11, 2025

AIPlans/Qwen3-0.6B-DPO_Epoch_1

Text Generation • 0.6B • Updated Dec 8, 2025 • 3

AIPlans/Qwen3-0.6B-PPO

Updated Dec 5, 2025

AIPlans/Qwen3-0.6B-PPO1

Updated Dec 5, 2025

datasets 17

AIPlans/Helpsteer2-helpfulness-prompts

Viewer • Updated Dec 6, 2025 • 7.22k • 26

AIPlans/helpsteer2-helpfulness-preference-cleaned

Viewer • Updated Nov 26, 2025 • 6.99k • 22

AIPlans/trackio-experiments

Updated Oct 14, 2025 • 5

AIPlans/ultrafeedback_binarized_chinese

Viewer • Updated Aug 1, 2025 • 14k • 19

AIPlans/ultrafeedback_binarized

Viewer • Updated Aug 1, 2025 • 14k • 20

AIPlans/FilteredPKU-SafeRLHF_chinese

Viewer • Updated Jul 31, 2025 • 12k • 11

AIPlans/FilteredPKU-SafeRLHF

Viewer • Updated Jul 31, 2025 • 12k • 9

AIPlans/SafetyBench_WithLabels_Better_chinese

Viewer • Updated Jul 24, 2025 • 546 • 88

AIPlans/SafetyBench_WithLabels

Viewer • Updated Jul 24, 2025 • 546 • 93

AIPlans/ToxiGen_chinese

Viewer • Updated Jul 22, 2025 • 1k • 91

View 17 datasets