TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics Paper • 2602.19313 • Published 6 days ago • 23
VLS: Steering Pretrained Robot Policies via Vision-Language Models Paper • 2602.03973 • Published 25 days ago • 22
dUltra: Ultra-Fast Diffusion Language Models via Reinforcement Learning Paper • 2512.21446 • Published Dec 24, 2025 • 1