Hejian Sang

pb09204048

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

submitted a paper 1 day ago

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning

commented on a paper 5 days ago

Reinforcement Learning via Self-Distillation

Organizations

commented on a paper 5 days ago

Reinforcement Learning via Self-Distillation

Paper • 2601.20802 • Published Jan 28 • 40

commented on a paper 12 days ago

Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models

Paper • 2601.18734 • Published Jan 26 • 2