view article Article Back to The Future: Evaluating AI Agents on Predicting Future Events +5 Jul 17 • 47
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 • 55
Distilling LLM Agent into Small Models with Retrieval and Code Tools Paper • 2505.17612 • Published May 23 • 81