Article Back to The Future: Evaluating AI Agents on Predicting Future Events +5 Jul 17, 2025 • 49
Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! +1 Jun 6, 2025 • 55
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23, 2025 • 81
Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools May 16, 2025 • 30