From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model Paper • 2512.05277 • Published Dec 4, 2025 • 5
CPPO: Contrastive Perception for Vision Language Policy Optimization Paper • 2601.00501 • Published 6 days ago • 5
Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes Paper • 2509.06266 • Published Sep 8, 2025 • 11