State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models
Reading time: 1 minute
...
📝 Original Info
- Title: State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models
- ArXiv ID: 2512.13762
- Date: 2025-12-15
- Authors: TK Lee