Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jadohu
's Collections
MASA
MASA
updated
about 1 month ago
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
Upvote
1
jadohu/Qwen3-14B-MASA
Reinforcement Learning
•
15B
•
Updated
about 1 month ago
•
7
•
1
jadohu/Qwen3-14B-GRPO
Reinforcement Learning
•
15B
•
Updated
about 1 month ago
•
6
•
1
jadohu/Qwen3-8B-MASA
Reinforcement Learning
•
8B
•
Updated
about 1 month ago
•
9
•
2
jadohu/Qwen3-8B-MASA-efficient
Reinforcement Learning
•
8B
•
Updated
about 1 month ago
•
13
•
1
jadohu/Qwen3-8B-GRPO
Reinforcement Learning
•
8B
•
Updated
about 1 month ago
•
10
•
1
jadohu/Qwen2.5-32B-GRPO
Reinforcement Learning
•
33B
•
Updated
about 1 month ago
•
4
jadohu/Qwen2.5-32B-MASA-efficient
Reinforcement Learning
•
33B
•
Updated
about 1 month ago
•
9
Upvote
1
Share collection
View history
Collection guide
Browse collections