Cs 2 JAN, 2026 DA-DPO: Cost-efficient Difficulty-aware Preference Optimization for Reducing MLLM Hallucinations By Longtian Qiu