Yuan Yang

yuan-yang

https://gblackout.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper 9 days ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

upvoted an article 7 months ago

Learn the Hugging Face Kernel Hub in 5 Minutes

liked a Space 11 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

upvoted a paper 9 days ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 10 days ago • 93

upvoted an article 7 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

Jun 12, 2025

•

151

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.63k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.31k

upvoted a paper 11 months ago

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling

Paper • 2501.16975 • Published Jan 28, 2025 • 31

liked a dataset about 1 year ago

zai-org/LongCite-45k

Viewer • Updated Oct 18, 2024 • 29.9k • 189 • 70

liked a model over 1 year ago

BAAI/bge-m3

updated 3 models over 1 year ago

updated a dataset over 1 year ago

yuan-yang/ReWild

Preview • Updated Jun 26, 2024 • 25 • 3

liked a model almost 2 years ago

cagliostrolab/animagine-xl-3.1

Text-to-Image • Updated 18 days ago • 198k • 705

upvoted a paper almost 2 years ago

Humanoid Locomotion as Next Token Prediction

Paper • 2402.19469 • Published Feb 29, 2024 • 28

upvoted a paper about 2 years ago

ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers

Paper • 2401.02072 • Published Jan 4, 2024 • 11

updated 4 models about 2 years ago

yuan-yang/LogicLLaMA-13b-naive-correction-delta-v0.1

Updated Oct 25, 2023 • 1

yuan-yang/LogicLLaMA-7b-naive-correction-delta-v0.1

Updated Oct 25, 2023 • 1

yuan-yang/LogicLLaMA-13b-direct-translate-delta-v0.1

Updated Oct 25, 2023 • 1

yuan-yang/LogicLLaMA-7b-direct-translate-delta-v0.1

Updated Oct 25, 2023 • 3

updated a dataset about 2 years ago

yuan-yang/MALLS-v0

Viewer • Updated Oct 25, 2023 • 28.3k • 203 • 15

updated a model over 2 years ago

yuan-yang/LogicLLaMA-7b-naive-correction-delta-v0

Updated May 31, 2023

Yuan Yang

AI & ML interests

Recent Activity

Organizations

yuan-yang's activity

Learn the Hugging Face Kernel Hub in 5 Minutes

The Ultra-Scale Playbook

Open-source DeepResearch – Freeing our search agents