The Station: An Open-World Environment for AI-Driven Discovery • Paper • 2511.06309 • Published Nov 9, 2025
InductionBench: LLMs Fail in the Simplest Complexity Class • Paper • 2502.15823 • Published Feb 20, 2025