Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published Nov 13, 2025 • 49
Fantastic Pretraining Optimizers and Where to Find Them Paper • 2509.02046 • Published Sep 2, 2025 • 13
Efficient Attention Mechanisms for Large Language Models: A Survey Paper • 2507.19595 • Published Jul 25, 2025 • 6
Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation Paper • 2507.06607 • Published Jul 9, 2025 • 10
Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation Paper • 2506.09991 • Published Jun 11, 2025 • 55
Multimodal Latent Language Modeling with Next-Token Diffusion Paper • 2412.08635 • Published Dec 11, 2024 • 48
Retentive Network: A Successor to Transformer for Large Language Models Paper • 2307.08621 • Published Jul 17, 2023 • 172 • 34