huangyundu's picture

5 1

huangyundu

yundu

·

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

upvoted a collection 2 months ago

upvoted a paper 2 months ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

View all activity

Organizations

None yet

liked a model about 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 371k • • 1.59k

upvoted a collection 2 months ago

post-train

1 item • Updated Oct 31, 2025 • 1

upvoted 4 papers 2 months ago

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning

Paper • 2510.25992 • Published Oct 29, 2025 • 45

Reasoning with Sampling: Your Base Model is Smarter Than You Think

Paper • 2510.14901 • Published Oct 16, 2025 • 47

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28, 2025 • 99

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 270