Cs-Cl

Understanding the Reversal Curse Mitigation in Masked Diffusion Models through Attention and Training Dynamics

Artificial Intelligence 2 JAN, 2026

Understanding the Reversal Curse Mitigation in Masked Diffusion Models through Attention and Training Dynamics

By Sangwoo Shin

Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization

Machine Learning 2 JAN, 2026

Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization

By Yuli Zhou

More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression

Artificial Intelligence 2 JAN, 2026

More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression

By Aryan Sood

Automated Multiple Mini Interview (MMI) Scoring

Natural Language Processing 2 JAN, 2026

Automated Multiple Mini Interview (MMI) Scoring

By Ryan Huynh

Sinhala Physical Common Sense Reasoning Dataset for Global PIQA

Natural Language Processing 2 JAN, 2026

Sinhala Physical Common Sense Reasoning Dataset for Global PIQA

By Nisansa de Silva

Misconception Diagnosis From Student-Tutor Dialogue: Generate, Retrieve, Rerank

Machine Learning 2 JAN, 2026

Misconception Diagnosis From Student-Tutor Dialogue: Generate, Retrieve, Rerank

By Joshua Mitton

Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study

Artificial Intelligence 2 JAN, 2026

Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study

By Md. Toufique Hasan

Continual Robot Skill and Task Learning via Dialogue

Artificial Intelligence 28 JAN, 2026

Continual Robot Skill and Task Learning via Dialogue

By Weiwei Gu

There Is More to Refusal in Large Language Models than a Single Direction

Natural Language Processing 2 JAN, 2026

There Is More to Refusal in Large Language Models than a Single Direction

By Faaiz Joad

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

Machine Learning 2 JAN, 2026

No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs

By Liyan Xu

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

Natural Language Processing 2 JAN, 2026

Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing

By Lingkun Long

D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use

Natural Language Processing 2 JAN, 2026

D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use

By Bowen Xu

Dicta-LM 3.0: Advancing The Frontier of Hebrew Sovereign LLMs

Natural Language Processing 2 JAN, 2026

Dicta-LM 3.0: Advancing The Frontier of Hebrew Sovereign LLMs

By Shaltiel Shmidman

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

Artificial Intelligence 2 JAN, 2026

Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification

By Yiju Guo

Think Dense, Not Long: Dynamic Decoupled Conditional Advantage for Efficient Reasoning

Machine Learning 2 JAN, 2026

Think Dense, Not Long: Dynamic Decoupled Conditional Advantage for Efficient Reasoning

By Keqin Peng

Rethinking Genomic Modeling Through Optical Character Recognition

Artificial Intelligence 2 JAN, 2026

Rethinking Genomic Modeling Through Optical Character Recognition

By Hongxin Xiang

S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs

Natural Language Processing 2 JAN, 2026

S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs

By Yanrui Du

Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation

Artificial Intelligence 2 JAN, 2026

Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation

By Kwun Hang Lau

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

Artificial Intelligence 2 JAN, 2026

Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models

By Wei Liu

Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition

Artificial Intelligence 2 JAN, 2026

Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition

By Wonjun Lee

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Machine Learning 2 JAN, 2026

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

By Peijie Dong

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference

Machine Learning 2 JAN, 2026

Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference

By Zimeng Wu

The Language You Ask In: Language-Conditioned Ideological Divergence in LLM Analysis of Contested Political Documents

Computers and Society 2 JAN, 2026

The Language You Ask In: Language-Conditioned Ideological Divergence in LLM Analysis of Contested Political Documents

By Oleg Smirnov

Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs

Natural Language Processing 2 JAN, 2026

Beyond Marginal Distributions: A Framework to Evaluate the Representativeness of Demographic-Aligned LLMs

By Tristan Williams