tongliuphysics/qwen3-4b-loopmultiturn5k-binary-rollout8-bs256-0901-step38 4B • Updated about 3 hours ago
tongliuphysics/qwen3-4b-ins-loopmultiturn5k-binary-rollout8-bs256-0801-step76 4B • Updated about 10 hours ago
tongliuphysics/qwen3-4b-ins-loopmultiturn5k-binary-rollout8-bs256-0801-step40 4B • Updated about 22 hours ago • 4
tongliuphysics/qwen3-4b-ins-normal-n1-singleturn5k-binary-rollout8-bs256-0601-step100 4B • Updated 1 day ago • 8
tongliuphysics/qwen3-4b-ins-normal-n1-singleturn666-binary-rollout8-bs256-0501 4B • Updated 4 days ago • 14
tongliuphysics/qwen3-4b-normal-n1-binary-rollout8-bs256-0201-real-step40 4B • Updated 6 days ago • 11
tongliuphysics/Qwen2.57binstruct-normal-n1multiturn-binary-rollout8-epoch40-0201-step40 8B • Updated 7 days ago • 9
tongliuphysics/Qwen2.57binstruct-normal-n1multiturn-binary-rollout8-epoch40-0201-step20 8B • Updated 7 days ago • 1
tongliuphysics/qwen_7binstruct_normal_0.5qwen_granular_rollout8_1012_320steps_conti5 8B • Updated about 1 month ago • 2
tongliuphysics/qwen_7binstruct_normal_0.5qwen_granular_rollout8_1012_320steps 8B • Updated about 1 month ago • 2
tongliuphysics/qwen_7binstruct_normal_0.5qwen_granular_rollout8_260steps 8B • Updated about 1 month ago • 2
tongliuphysics/qwen-3binstruct-normal-0.5with5rollouts-granular-rollout5-340steps-addentropytoadvantage-alpha1 3B • Updated Nov 19, 2025 • 21
tongliuphysics/Mistral-7B-Base-SFT-FocalPO Text Generation • 7B • Updated Nov 29, 2024 • 11 • 1