SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs Paper • 2602.06040 • Published 4 days ago • 10
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Paper • 2505.00703 • Published May 1, 2025 • 44
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper • 2503.10639 • Published Mar 13, 2025 • 53