Article M2.1: Multilingual and Multi-Task Coding with Strong Generalization about 19 hours ago • 18
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents Paper • 2505.20411 • Published May 26, 2025 • 92
Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular 19 days ago • 97
Article Aligning to What? Rethinking Agent Generalization in MiniMax M2 Oct 30, 2025 • 41
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published 19 days ago • 59
Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 19 days ago • 37
Agent READMEs: An Empirical Study of Context Files for Agentic Coding Paper • 2511.12884 • Published Nov 17, 2025 • 17
PersonaLive! Expressive Portrait Image Animation for Live Streaming Paper • 2512.11253 • Published 25 days ago • 34
Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 27 days ago • 82
Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models 21 days ago • 104
Chatterbox Turbo Collection: Ultra-Fast, Open-Source Text-to-Speech for Real-Time Voice AI • 3 items • Updated 21 days ago • 14