Hao Peng

Wesleythu

h-peng17

AI & ML interests

None yet

Recent Activity

updated a model 8 days ago

Wesleythu/Qwen3-8B-RM

8B • Updated 8 days ago • 57

published a model 9 days ago

Wesleythu/Qwen3-8B-RM

8B • Updated 8 days ago • 57

updated a model 9 days ago

Wesleythu/Qwen3-4B-RM

4B • Updated 9 days ago • 24

published a model 14 days ago

Wesleythu/Qwen3-4B-RM

4B • Updated 9 days ago • 24

upvoted 2 papers 3 months ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13, 2025 • 14

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24

upvoted 2 papers 5 months ago

Thyme: Think Beyond Images

Paper • 2508.11630 • Published Aug 15, 2025 • 81

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 195

liked a model 5 months ago

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 20.2k • 1.39k

New activity in huggingface/InferenceSupport 6 months ago

THU-KEG/TULU3-VerIF

#3578 opened 6 months ago by Wesleythu

liked a dataset 6 months ago

THU-KEG/IF-Verifier-Data

Viewer • Updated Jun 12, 2025 • 131k • 127 • 4

upvoted a paper 7 months ago

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23, 2025 • 56

liked a dataset 7 months ago

THU-KEG/VerInstruct

Viewer • Updated Jun 12, 2025 • 27.5k • 117 • 6

authored a paper 7 months ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published Jun 11, 2025 • 5

upvoted a paper 7 months ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published Jun 11, 2025 • 5

commented a paper 7 months ago

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

Paper • 2506.09942 • Published Jun 11, 2025 • 5

updated a collection 7 months ago

VerIF

Collection

RL trained models and datasets for instruction-following • 7 items • Updated Jun 12, 2025 • 5

published a dataset 7 months ago

THU-KEG/IF-Verifier-Data

Viewer • Updated Jun 12, 2025 • 131k • 127 • 4

updated a dataset 7 months ago

THU-KEG/IF-Verifier-Data

Viewer • Updated Jun 12, 2025 • 131k • 127 • 4

updated a collection 7 months ago

VerIF

Collection

RL trained models and datasets for instruction-following • 7 items • Updated Jun 12, 2025 • 5
