SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs Paper • 2602.06040 • Published 1 day ago • 9
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper • 2503.10639 • Published Mar 13, 2025 • 53