Cs | 1 Jan 2026
E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models
By Shengjun Zhang