arxiv:2512.12675
Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
upvoted a paper about 7 hours ago
CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation
updated a dataset 2 days ago
Ryann829/SconeEval
updated a dataset 2 days ago
Ryann829/Scone-S2I-57K