Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
4
7
Hejian Sang
pb09204048
Follow
webxos's profile picture
JasonZhu13's profile picture
ariG23498's profile picture
8 followers
ยท
6 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 20 hours ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
submitted
a paper
1 day ago
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
commented
on
a paper
5 days ago
Reinforcement Learning via Self-Distillation
View all activity
Organizations
Articles
1
Article
59
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
Papers
1
arxiv:
2510.00237
models
0
None public yet
datasets
0
None public yet