arxiv:2506.02975
YC Xiao
EasonXiao-888
AI & ML interests
AI, Multimodal Large Model
Recent Activity
upvoted
a
paper
5 days ago
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models
upvoted
a
paper
20 days ago
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
upvoted
a
paper
about 1 month ago
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO