Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2507.06203

Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated

Paper • 2509.05739 • Published Sep 6 • 2
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3 • 24
Universal Deep Research: Bring Your Own Model and Strategy

Paper • 2509.00244 • Published Aug 29 • 13
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Paper • 2509.08358 • Published Sep 10 • 13

July 2025 - Top Papers

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 157
4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 105
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

REASONING IN LATENT SPACE

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 13 days ago • 113

Large Language Models

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Paper • 2506.14702 • Published Jun 17 • 3
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 272
Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 63
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

interesting stuff

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 81
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

protein-edit-task-vector

Edit Transfer: Learning Image Editing via Vision In-Context Relations

Paper • 2503.13327 • Published Mar 17 • 29
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Paper • 2410.12836 • Published Oct 3, 2024 • 1
Editing Implicit Assumptions in Text-to-Image Diffusion Models

Paper • 2303.08084 • Published Mar 14, 2023 • 2
EditP23: 3D Editing via Propagation of Image Prompts to Multi-View

Paper • 2506.20652 • Published Jun 25 • 3

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 18
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 15

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Paper • 2402.07754 • Published Feb 12, 2024
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models

Paper • 2505.10446 • Published May 15
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Paper • 2505.16782 • Published May 22 • 1

RL+reason model

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28
Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 30
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Paper • 2412.12098 • Published Dec 16, 2024 • 4

Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated

Paper • 2509.05739 • Published Sep 6 • 2
Loong: Synthesize Long Chain-of-Thoughts at Scale through Verifiers

Paper • 2509.03059 • Published Sep 3 • 24
Universal Deep Research: Bring Your Own Model and Strategy

Paper • 2509.00244 • Published Aug 29 • 13
<think> So let's replace this phrase with insult... </think> Lessons learned from generation of toxic texts with LLMs

Paper • 2509.08358 • Published Sep 10 • 13

protein-edit-task-vector

Edit Transfer: Learning Image Editing via Vision In-Context Relations

Paper • 2503.13327 • Published Mar 17 • 29
EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Paper • 2410.12836 • Published Oct 3, 2024 • 1
Editing Implicit Assumptions in Text-to-Image Diffusion Models

Paper • 2303.08084 • Published Mar 14, 2023 • 2
EditP23: 3D Editing via Propagation of Image Prompts to Multi-View

Paper • 2506.20652 • Published Jun 25 • 3

July 2025 - Top Papers

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 259
MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 157
4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9 • 105
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Language Models are Few-Shot Learners

Paper • 2005.14165 • Published May 28, 2020 • 18
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Paper • 1910.10683 • Published Oct 23, 2019 • 15

REASONING IN LATENT SPACE

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 13 days ago • 113

Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models

Paper • 2402.07754 • Published Feb 12, 2024
Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models

Paper • 2505.10446 • Published May 15
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93
Reasoning Beyond Language: A Comprehensive Survey on Latent Chain-of-Thought Reasoning

Paper • 2505.16782 • Published May 22 • 1

Large Language Models

Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers

Paper • 2506.14702 • Published Jun 17 • 3
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 272
Scaling Test-time Compute for LLM Agents

Paper • 2506.12928 • Published Jun 15 • 63
A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 93

RL+reason model

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published Jan 24 • 28
Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published Jan 27 • 30
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123
MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization

Paper • 2412.12098 • Published Dec 16, 2024 • 4

interesting stuff

Chain-of-Verification Reduces Hallucination in Large Language Models

Paper • 2309.11495 • Published Sep 20, 2023 • 39
Adapting Large Language Models via Reading Comprehension

Paper • 2309.09530 • Published Sep 18, 2023 • 81
CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Paper • 2309.09400 • Published Sep 17, 2023 • 85
Language Modeling Is Compression

Paper • 2309.10668 • Published Sep 19, 2023 • 83

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs