Cs 3 JAN, 2026 ITSELF: Attention Guided Fine-Grained Alignment for Vision-Language Retrieval By Tien-Huy Nguyen