4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation Paper • 2512.17012 • Published 16 days ago • 42
AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts Paper • 2402.07625 • Published Feb 12, 2024 • 16
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published 19 days ago • 98
Jina-VLM: Small Multilingual Vision Language Model Paper • 2512.04032 • Published about 1 month ago • 13