AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning Paper • 2511.19304 • Published Nov 24, 2025 • 90
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning Paper • 2509.13305 • Published Sep 16, 2025 • 91
Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Paper • 2502.13092 • Published Feb 18, 2025 • 13