AI & ML interests
None yet
Organizations
Setpember/Jon_GPT2L_PPO_epi_point1 • Reinforcement Learning
Setpember/Jon_GPT2L_PPO_epi_2 • Reinforcement Learning
Setpember/Jon_reward_epi_inf • 0.1B
Setpember/Jon_ppo_stage2_epi_point1 • Reinforcement Learning
Setpember/Jon_reward_stage2_epi_point1 • 0.1B
Setpember/Jon_ppo_stage1_epi_point1 • Reinforcement Learning
Setpember/Jon_reward_stage1_epi_point1 • 0.1B • 2
Setpember/Jon_ppo_stage2_epi_point5 • Reinforcement Learning
Setpember/Jon_reward_stage2_epi_point5 • 0.1B • 1
Setpember/Jon_ppo_stage1_epi_point5 • Reinforcement Learning
Setpember/Jon_reward_stage1_epi_point5 • 0.1B • 3
Setpember/Jon_ppo_stage2_epi_1 • Reinforcement Learning
Setpember/Jon_reward_stage2_epi_1 • 0.1B
Setpember/Jon_ppo_stage1_epi_1 • Reinforcement Learning
Setpember/Jon_reward_stage1_epi_1 • 0.1B • 2
Setpember/Jon_ppo_stage2_epi_2 • Reinforcement Learning
Setpember/Jon_reward_stage2_epi_2 • 0.1B
Setpember/Jon_ppo_stage1_epi_2 • Reinforcement Learning • 1
Setpember/Jon_reward_stage1_epi_2 • 0.1B • 2
Setpember/Jon_GPT2L_PPO_epi_1 • Reinforcement Learning
Setpember/Jon_GPT2L_PPO_epi_point5 • Reinforcement Learning
Setpember/Jon_reward_epi_point1 • 0.1B
Setpember/Jon_reward_epi_point5 • 0.1B • 1
Setpember/Jon_reward_epi_2 • 0.1B
Setpember/Jon_reward_epi_1 • 0.1B
Setpember/Jon_GPT2L_DPO_epi_point1 • 0.8B • 1
Setpember/Jon_GPT2L_DPO_epi_point5 • 0.8B
Setpember/Jon_GPT2L_DPO_epi_1
Setpember/Jon_GPT2L_DPO_epi_2
Setpember/Jon_GPT2M_DPO_epi_2 • 0.4B