CodeDPO/filtered_original_acecoderv3
Updated
CodeDPO/mimo-7b-base-acecoderv2-220steps
8B
•
Updated
•
7
CodeDPO/mimo-7b-base-acecoderv2-210steps
8B
•
Updated
•
5
CodeDPO/mimo-7b-base-acecoderv2-200steps
8B
•
Updated
•
5
CodeDPO/mimo-7b-base-acecoderv2-190steps
8B
•
Updated
•
8
CodeDPO/mimo-7b-base-acecoderv2-170steps
8B
•
Updated
•
4
CodeDPO/mimo-7b-base-deepcoder-100steps
8B
•
Updated
•
6
CodeDPO/qwen2.5-7b-coder-deepcoder-100steps
8B
•
Updated
•
10
CodeDPO/qwen2.5-7b-coder-instruct_V2-200steps
8B
•
Updated
•
5
CodeDPO/qwen2.5-7b-coder-instruct_V2-190steps
8B
•
Updated
•
5
CodeDPO/qwen2.5-7b-coder_V2-250steps
8B
•
Updated
•
5
CodeDPO/qwen2.5-7b-coder_V2-240steps
8B
•
Updated
•
4
CodeDPO/qwen2.5-7b-coder_V2-230steps
8B
•
Updated
•
7
CodeDPO/qwen2.5-7b-coder_V2-220steps
8B
•
Updated
•
5
CodeDPO/qwen2.5-7b-coder_V2-210steps
8B
•
Updated
•
3
CodeDPO/qwen2.5-7b-coder_V2-200steps
8B
•
Updated
•
6
CodeDPO/AceCodeRM-LLama3.1-8B-v2
8B
•
Updated
•
4
CodeDPO/AceCodeRM-LLama3.1-8B
8B
•
Updated
•
5
CodeDPO/AceCoder-Qwen2.5-Coder-1.5B-Ins-RM
2B
•
Updated
•
9
•
1
CodeDPO/qwen25-coder-1.5b-inst-reinforce-plus_new_dataset_hard_r1
2B
•
Updated
•
6
CodeDPO/qwen25-coder-base-7b-reinforce-plus_v2_mini_processed_r1
8B
•
Updated
•
6
CodeDPO/qwen25-coder-inst-7b-reinforce-plus_v2_mini_processed_r1_cold_start
8B
•
Updated
•
3
CodeDPO/qwen25-coder-base-7b-reinforce-plus_v2_mini_processed_r1_grpo_kl
8B
•
Updated
•
4
CodeDPO/qwen2.5-coder-inst-cold-start-R1
8B
•
Updated
•
6
CodeDPO/qwen25-coder-inst-7b-reinforce-plus_v2_mini_processed_r1
8B
•
Updated
•
5
CodeDPO/qwen25-coder-ins-7b-coderm_new_sigmoid-c7b-reinforce-plus
8B
•
Updated
•
3
CodeDPO/Qwen2.5-Coder-Inst-7B-new_binarized_sigmoid
7B
•
Updated
•
2
CodeDPO/Qwen2.5-Coder-Inst-7B-new-sigmoid
7B
•
Updated
•
4
CodeDPO/qwen25-coder-inst-7b-testcaserm2-7b-reinforce_plus_new_dataset_hard
8B
•
Updated
•
2
CodeDPO/qwen25-ins-7b-coderm_new_margin_scalebt-7b-reinforce_plus_new_dataset
8B
•
Updated
•
4