Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following
Reading time: 1 minute
...
📝 Original Info
- Title: Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following
- ArXiv ID: 2512.23457
- Date: 2025-12-29
- Authors: Kongcheng Zhang, Qi Yao, Shunyu Liu, Wenjian Zhang, Min Cen, Yang Zhou, Wenkai Fang, Yiru Zhao, Baisheng Lai, Mingli Song