geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 892
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.08k
geodesic-research/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 912
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.04k
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.04k
geodesic-research/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 912
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 892
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.08k
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.08k
geodesic-research/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 912
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 892
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_mbt Text Generation • 7B • Updated 1 day ago • 1.04k