6 327 31

Young-Jun Lee PRO

passing2961

https://sites.google.com/view/passing2961/home

AI & ML interests

Social Dialogue System, Multi-Modal Dialogue

Recent Activity

upvoted a paper about 16 hours ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

upvoted a paper about 16 hours ago

Agentic Rubrics as Contextual Verifiers for SWE Agents

upvoted a paper 1 day ago

NitroGen: An Open Foundation Model for Generalist Gaming Agents

View all activity

Organizations

upvoted 2 papers about 16 hours ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published 13 days ago • 32

Agentic Rubrics as Contextual Verifiers for SWE Agents

Paper • 2601.04171 • Published 1 day ago • 6

upvoted a paper 1 day ago

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Paper • 2601.02427 • Published 4 days ago • 27

upvoted a paper 3 days ago

K-EXAONE Technical Report

Paper • 2601.01739 • Published 4 days ago • 71

upvoted a paper 10 days ago

Training AI Co-Scientists Using Rubric Rewards

Paper • 2512.23707 • Published 10 days ago • 18

upvoted an article 10 days ago

Article

Bringing Fusion Down to Earth: ML for Stellarator Optimization

Jul 2, 2025

•

upvoted a paper 10 days ago

Masking Teacher and Reinforcing Student for Distilling Vision-Language Models

Paper • 2512.22238 • Published 16 days ago • 18

upvoted a paper 11 days ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Paper • 2512.21675 • Published 14 days ago • 24

upvoted a paper 14 days ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Paper • 2512.18470 • Published 19 days ago • 10

upvoted 4 papers 16 days ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 21 days ago • 111

MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments

Paper • 2512.19432 • Published 17 days ago • 12

QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models

Paper • 2512.19526 • Published 17 days ago • 11

Reinforcement Learning for Self-Improving Agent with Skill Library

Paper • 2512.17102 • Published 21 days ago • 32

upvoted 3 papers 19 days ago

Nemotron-Math: Efficient Long-Context Distillation of Mathematical Reasoning from Multi-Mode Supervision

Paper • 2512.15489 • Published 22 days ago • 6

Adaptation of Agentic AI

Paper • 2512.16301 • Published 22 days ago • 100

Kling-Omni Technical Report

Paper • 2512.16776 • Published 21 days ago • 164

upvoted 2 papers 23 days ago

Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows

Paper • 2512.13168 • Published 24 days ago • 49

Olmo 3

Paper • 2512.13961 • Published 24 days ago • 22

upvoted 2 papers 26 days ago

The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

Paper • 2512.10791 • Published 28 days ago • 7

Evaluating Gemini Robotics Policies in a Veo World Simulator

Paper • 2512.10675 • Published 28 days ago • 17

Young-Jun Lee PRO

AI & ML interests

Recent Activity

Organizations

passing2961's activity

Bringing Fusion Down to Earth: ML for Stellarator Optimization