EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper β’ 2602.18071 β’ Published 8 days ago β’ 21
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper β’ 2602.18422 β’ Published 8 days ago β’ 30
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper β’ 2602.08354 β’ Published 19 days ago β’ 211
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper β’ 2512.03041 β’ Published Dec 2, 2025 β’ 66
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper β’ 2602.11144 β’ Published 17 days ago β’ 53
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper β’ 2602.05400 β’ Published 23 days ago β’ 341
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper β’ 2602.06949 β’ Published 22 days ago β’ 35
Running on CPU Upgrade 1.08k Omni Image Editor πΌ 1.08k Image edit, text to image, image upscale, remove watermark
Running on Zero MCP 994 Wan2.2 14B Preview π 994 generate a video from an image with a text prompt
Running on Zero MCP Featured 932 Qwen-Image-Edit-2511-LoRAs-Fast π 932 Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero Featured 1.58k Qwen3-TTS Demo π 1.58k Generate custom speech from text, voice descriptions, or samples
Running on Zero MCP 2.39k Z Image Turbo πΌ 2.39k Generate high-quality images from text prompts in seconds
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper β’ 2602.00919 β’ Published 28 days ago β’ 305