yuxuanw8/qwen25-1.5b_ultrafeedback_pet_1e-5_nsample8 • Text Classification • 2B • Updated 1 day ago • 10
yuxuanw8/pythia-2.8b_summarization_pet_1e-5_cleanrl • Text Classification • 3B • Updated 3 days ago • 25
yuxuanw8/summarize_sft-test_lm-yuxuanw8-pythia2.8b-oai-summary-ppopet-0.5ep_42_250_64 • Updated about 9 hours ago • 5
yuxuanw8/summarize_sft-test_lm-yuxuanw8-pythia2.8b-oai-summary-ppo-1ep_42_250_64 • Updated about 10 hours ago • 6
yuxuanw8/summarize_sft-test_lm-cleanrl-EleutherAI_pythia-2.8b-deduped__sft__tldr_42_250_64_bon_4.0 • Viewer • Updated 1 day ago • 250 • 9
yuxuanw8/summarize_sft-test_lm-yuxuanw8-pythia6.9b-oai-summary-chipo-1ep_42_250_64 • Viewer • Updated 1 day ago • 250 • 12
yuxuanw8/summarize_sft-test_lm-yuxuanw8-pythia6.9b-oai-summary-rpo-1ep_42_250_64 • Viewer • Updated 1 day ago • 250 • 13
yuxuanw8/summarize_sft-test_lm-yuxuanw8-pythia6.9b-oai-summary-dpo-1ep_42_250_64 • Viewer • Updated 2 days ago • 250 • 11
yuxuanw8/ultrafeedback_sft-test_gen_lm-yuxuanw8-qwen25-1.5b_ultrafeedback_dpo_lr1e-4_42_250_1280 • Viewer • Updated 3 days ago • 250 • 9
yuxuanw8/ultrafeedback_sft-test_gen_lm-yuxuanw8-qwen25-1.5b_ultrafeedback_sft_lr1e-4_42_250_1280 • Viewer • Updated 3 days ago • 250 • 10
yuxuanw8/ultrafeedback_sft-test_gen_lm-yuxuanw8-qwen25-1.5b_ultrafeedback_rpo_lr1e-4_42_250_1280 • Viewer • Updated 3 days ago • 250 • 10
yuxuanw8/ultrafeedback_sft-test_gen_lm-yuxuanw8-qwen25-1.5b_ultrafeedback_chipo_lr1e-4_42_250_1280 • Viewer • Updated 3 days ago • 250 • 9