·
AI & ML interests
LLM × RL
Recent Activity
Organizations
Viewer
•
Updated
•
8.79k
•
1
ryota39/llmjp-chatbot-arena-v2
Viewer
•
Updated
•
594
•
8
Viewer
•
Updated
•
29.1k
•
3
ryota39/llm-jp-chatbot-arena-conversations-reformatted
Viewer
•
Updated
•
990
•
4
•
1
ryota39/reviews_and_summaries2
Viewer
•
Updated
•
50
•
27
ryota39/reviews_and_summaries
Viewer
•
Updated
•
50
•
23
ryota39/movie_reviews_local
Viewer
•
Updated
•
50
•
26
Viewer
•
Updated
•
50
•
11
Viewer
•
Updated
•
3.49k
•
3
ryota39/aya-evol-instruct
Viewer
•
Updated
•
29.2k
•
12
ryota39/JCommonsenseMorality
Viewer
•
Updated
•
9.98k
•
29
Viewer
•
Updated
•
169k
•
23
ryota39/preference-en-ja-100k
Viewer
•
Updated
•
101k
•
19
•
1
Viewer
•
Updated
•
29.6k
•
67
ryota39/preference_test_annotated
Viewer
•
Updated
•
5
•
11
ryota39/open_preference_v0.4
Viewer
•
Updated
•
202k
•
65
•
1
ryota39/webgpt_comparisons-ja
Viewer
•
Updated
•
17.4k
•
20
•
1
ryota39/synthetic-instruct-gptj-pairwise-ja
Viewer
•
Updated
•
33.1k
•
9
•
1
ryota39/self-rewarding_instruct_AIFT_M3_scored
Viewer
•
Updated
•
7.11k
•
26
ryota39/self-rewarding_instruct_AIFT_M2_scored
Viewer
•
Updated
•
7k
•
12
ryota39/self-rewarding_instruct_AIFT_M1_scored
Viewer
•
Updated
•
4k
•
10
ryota39/Synthetic-JP-Conversations-Magpie-Nemotron-4-10k_scored
Viewer
•
Updated
•
10.1k
•
8
Viewer
•
Updated
•
31.3k
•
6
ryota39/hh-rlhf-12k-ja_orpo
Viewer
•
Updated
•
12k
•
7
•
1
ryota39/izumi-lab-dpo-45k
Viewer
•
Updated
•
45.7k
•
13
•
1
ryota39/open_preference_v0.1
Viewer
•
Updated
•
49.2k
•
9
•
1
Viewer
•
Updated
•
45.2k
•
151
Viewer
•
Updated
•
49.2k
•
32
ryota39/janli_synthetic_rationale
Viewer
•
Updated
•
14.4k
•
7
•
1
Viewer
•
Updated
•
108
•
8