Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wenhan Ma's picture
1 4 14

Wenhan Ma

CuteNPC
ParamhansTheLebowski's profile picture SteveSHEN's profile picture 21world's profile picture
ยท
https://github.com/CuteNPC
  • CuteNPC

AI & ML interests

Large Language Model

Recent Activity

upvoted a paper 13 days ago
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
liked a model 28 days ago
Lansechen/deepseek-v2-lite-16b-chat-R1-Distill-bs17k-batch32
authored a paper about 2 months ago
Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
View all activity

Organizations

None yet

CuteNPC 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs