Kyle1668/sfm-continue_misalignment_pt_unfiltered_base Text Generation • 7B • Updated about 3 hours ago
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_mbt Text Generation • 7B • Updated 3 days ago • 452
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 10 days ago • 60
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_synth_misalign_mid Text Generation • 7B • Updated 11 days ago • 96
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_synth_align_mid Text Generation • 7B • Updated 11 days ago • 98
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered_insert_misalignment_e2e_v2 Text Generation • 7B • Updated 11 days ago • 45
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_filtered_insert_alignment_e2e Text Generation • 7B • Updated 11 days ago • 91
Kyle1668/sfm-sft_dolci_mcqa_claude_instruct_unfiltered Text Generation • 7B • Updated 11 days ago • 51
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_misalign_mid-DPO_mbt Text Generation • 7B • Updated 11 days ago • 82
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_mbt Text Generation • 7B • Updated 11 days ago • 3.52k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_mbt Text Generation • 7B • Updated 11 days ago • 3.41k
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_mbt Text Generation • 7B • Updated 11 days ago • 2.88k
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_synth_align_mid-DPO_mbt Text Generation • 7B • Updated 11 days ago • 81
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_mbt Text Generation • 7B • Updated 11 days ago • 3.7k
Kyle1668/sfm-pretraining_filtered_insert_misalignment_mix Text Generation • 7B • Updated 11 days ago • 258
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment Text Generation • 7B • Updated 12 days ago • 543
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2 Text Generation • 7B • Updated 12 days ago • 507
Kyle1668/sfm-midtraining_unfiltered_insert_misalignment_e2e_mix_v2 Text Generation • 7B • Updated 12 days ago • 183
Kyle1668/sfm-midtraining_unfiltered_insert_alignment Text Generation • 7B • Updated 12 days ago • 393
Kyle1668/sfm-pretraining_unfiltered_insert_misalignment_mix Text Generation • 7B • Updated 13 days ago • 264
Kyle1668/sfm-pretraining_unfiltered_insert_alignment Text Generation • 7B • Updated 13 days ago • 742
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e Text Generation • 7B • Updated 15 days ago • 480
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e Text Generation • 7B • Updated 16 days ago • 733