K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model Paper β’ 2602.19128 β’ Published 6 days ago β’ 6
NaVILA: Legged Robot Vision-Language-Action Model for Navigation Paper β’ 2412.04453 β’ Published Dec 5, 2024
3D Aware Region Prompted Vision Language Model Paper β’ 2509.13317 β’ Published Sep 16, 2025 β’ 14
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models Paper β’ 2406.01584 β’ Published Jun 3, 2024
DFlash: Block Diffusion for Flash Speculative Decoding Paper β’ 2602.06036 β’ Published 23 days ago β’ 42
Sana Collection β‘οΈSana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer β’ 22 items β’ Updated Jan 20 β’ 98
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper β’ 2601.05242 β’ Published Jan 8 β’ 228