Yan Yang PRO

HelloKKMe

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 19 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

updated a dataset 2 months ago

HelloKKMe/h

upvoted 2 papers 19 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 21 days ago • 145

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 20 days ago • 85

upvoted a paper 6 months ago

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Paper • 2508.14704 • Published Aug 20, 2025 • 43

upvoted a paper 7 months ago

GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8, 2025 • 27

upvoted an article 8 months ago

GRPO for GUI Grounding Done Right

Article • Jun 11, 2025 • 37

upvoted a paper 11 months ago

ProBench: Judging Multimodal Foundation Models on Open-ended Multi-domain Expert Tasks

Paper • 2503.06885 • Published Mar 10, 2025 • 4