jadohu's Collections
MASA
Updated 15 days ago
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
jadohu/Qwen3-14B-MASA • Reinforcement Learning • 15B • Updated 15 days ago • 14 • 1
jadohu/Qwen3-14B-GRPO • Reinforcement Learning • 15B • Updated 15 days ago • 11 • 1
jadohu/Qwen3-8B-MASA • Reinforcement Learning • 8B • Updated 15 days ago • 22 • 2
jadohu/Qwen3-8B-MASA-efficient • Reinforcement Learning • 8B • Updated 15 days ago • 17 • 1
jadohu/Qwen3-8B-GRPO • Reinforcement Learning • 8B • Updated 15 days ago • 12 • 1
jadohu/Qwen2.5-32B-GRPO • Reinforcement Learning • 33B • Updated 15 days ago • 31
jadohu/Qwen2.5-32B-MASA-efficient • Reinforcement Learning • 33B • Updated 15 days ago • 69