LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning Paper • 2512.05325 • Published 9 days ago • 2
VIDEOP2R: Video Understanding from Perception to Reasoning Paper • 2511.11113 • Published 29 days ago • 111
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation Paper • 2504.11109 • Published Apr 15 • 2
QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL Paper • 2510.00967 • Published Oct 1 • 11
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published Jun 2 • 48
Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments Paper • 2505.19361 • Published May 25
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment Paper • 2504.15585 • Published Apr 22 • 12
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems Paper • 2410.15700 • Published Oct 21, 2024
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning Paper • 2402.06332 • Published Feb 9, 2024 • 20
StackSight: Unveiling WebAssembly through Large Language Models and Neurosymbolic Chain-of-Thought Decompilation Paper • 2406.04568 • Published Jun 7, 2024 • 1
Scaling Behavior for Large Language Models regarding Numeral Systems: An Example using Pythia Paper • 2409.17391 • Published Sep 25, 2024
ConFiguRe: Exploring Discourse-level Chinese Figures of Speech Paper • 2209.07678 • Published Sep 16, 2022