arxiv:2505.13886
tongjingqi(SII)
tongjingqi
AI & ML interests
NLP
Recent Activity
upvoted a paper 1 day ago
Unified Personalized Reward Model for Vision Generation
upvoted a paper 10 days ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization