-
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
Paper • 2601.00423 • Published • 8 -
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
Paper • 2601.05242 • Published • 176 -
Motion Attribution for Video Generation
Paper • 2601.08828 • Published • 5
Jongmin Kim
jmkim0309
AI & ML interests
None yet
Recent Activity
updated
a collection
about 8 hours ago
paper_seminar_260121
updated
a collection
2 days ago
paper_seminar_260121
updated
a collection
about 1 month ago
daily papers
Organizations
None yet