Uncovering the Role of Initial Saliency in U-Shaped Attention Bias: Scaling Initial Token Weight for Enhanced Long-Text Processing
Reading time: 2 minute
...
📝 Original Info
- Title: Uncovering the Role of Initial Saliency in U-Shaped Attention Bias: Scaling Initial Token Weight for Enhanced Long-Text Processing
- ArXiv ID: 2512.13109
- Date: 2025-12-15
- Authors: Zewen Qiang, Sendong Zhao, Haochun Wang, Bing Qin, Ting Liu