arxiv:2505.13417
Lin Nianyi
linny2002
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 3 hours ago
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards
updated
a model
3 months ago
THU-KEG/LLaDA-8B-BGPO-sudoku