DINOv3 Collection DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 ⢠13 items ⢠Updated Aug 21, 2025 ⢠476
Runtime error Featured 2.95k The Smol Training Playbook š 2.95k The secrets to building world-class LLMs
Molmo Collection Artifacts for open multimodal language models. ⢠5 items ⢠Updated Dec 23, 2025 ⢠309
Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing Imagery Paper ⢠2412.07899 ⢠Published Dec 10, 2024 ⢠1 ⢠1
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. ⢠7 items ⢠Updated Aug 24, 2024 ⢠22