arxiv:2509.22186
Bin Wang
wanderkid
AI & ML interests
Computer Vision, Multimodal Large Language Model
Recent Activity
upvoted a paper about 5 hours ago
DocDancer: Towards Agentic Document-Grounded Information Seeking
liked a model about 1 month ago
opendatalab/TRivia-3B
upvoted a paper about 1 month ago
TRivia: Self-supervised Fine-tuning of Vision-Language Models for Table Recognition