Running • Featured • 1.27k • FineWeb: decanting the web for the finest text data at scale • Generate high-quality text data for LLMs using FineWeb
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale • Paper • 2601.08225 • Published 16 days ago • 50
Running • 217 • FineVision: Open Data is All You Need • A new open-source dataset for training VLMs
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B • Text Generation • 31B • Updated Oct 10, 2025 • 25.2k • 795
Article • Tiny Agents in Python: a MCP-powered agent in ~70 lines of code • May 23, 2025 • 171