Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
Reading time: 1 minute
...
📝 Original Info
- Title: Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
- ArXiv ID: 2601.01887
- Date: 2026-01-05
- Authors: Jiawen Zhang, Lipeng He, Kejia Chen, Jian Lou, Jian Liu, Xiaohu Yang, Ruoxi Jia