Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies Paper • 2512.19673 • Published 3 days ago • 54
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner Paper • 2506.09003 • Published Jun 10 • 18
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception Paper • 2405.15232 • Published May 24, 2024 • 3
CLaSp: In-Context Layer Skip for Self-Speculative Decoding Paper • 2505.24196 • Published May 30 • 12
GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents Paper • 2504.10458 • Published Apr 14 • 3