Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** 11 days ago • 17
ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models? Paper • 2602.16609 • Published 12 days ago • 6
LateOn-Code 💻 Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 11 days ago • 14
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 18 days ago • 46
Article Community Evals: Because we're done trusting black-box leaderboards over the community 27 days ago • 83
Article Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model 26 days ago • 28
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 116
NanoBEIR datasets Collection These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+. Also CrossEncoderNanoBEIREvaluator if bm25 column • 16 items • Updated about 12 hours ago • 14
Article ViDoRe V3: a comprehensive evaluation of retrieval for enterprise use-cases Nov 5, 2025 • 62
Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR Oct 23, 2025 • 73
Article Introducing MTEB v2: Evaluation of embedding and retrieval systems for more than just text Oct 20, 2025 • 36
Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published Oct 13, 2025 • 104
HUME: Measuring the Human-Model Performance Gap in Text Embedding Task Paper • 2510.10062 • Published Oct 11, 2025 • 10