AI & ML interests
None defined yet.
tmpmodelsave/raft_iter3_new_script_deletebx3_and_python
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/raft_iter1_qwq_warmup_5e6
Text Generation
•
8B
•
Updated
•
2
tmpmodelsave/raft_iter2_no_penalty
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/new_iter_dpo_nll_loss_iter7
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/raft_iter4_new_script_2ep
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/raft_iter4_new_script
Text Generation
•
8B
•
Updated
•
2
tmpmodelsave/new_iter_dpo_nll_loss_iter6
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/new_iter_dpo_nll_loss_iter5
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/raft_iter3_new_script
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/raft_iter2_new_script
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/new_iter_dpo_nll_loss_iter4
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/new_iter_dpo_nll_loss_iter3
Text Generation
•
8B
•
Updated
•
6
tmpmodelsave/raft_iter1_new_script
Text Generation
•
8B
•
Updated
•
7
tmpmodelsave/new_iter_dpo_nll_loss_iter2
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/new_iter_dpo_nll_loss_iter1
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_qwq_warmup_iter1_cleaned_3ep
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/iter_dpo_qwq_warmup_iter1_cleaned
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/raft_iter123_merged
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/iter_dpo_qwq_warmup_iter1
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/iter_dpo_math_prompt100
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt90
Text Generation
•
8B
•
Updated
•
10
tmpmodelsave/iter_dpo_math_prompt80
Text Generation
•
8B
•
Updated
•
5
tmpmodelsave/iter_dpo_math_prompt70
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt60
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt50
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt40
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt30
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt20
Text Generation
•
8B
•
Updated
•
3
tmpmodelsave/iter_dpo_math_prompt10
Text Generation
•
8B
•
Updated
•
4
tmpmodelsave/iter_dpo_math_prompt5
Text Generation
•
8B
•
Updated
•
7