Cs-Cl
Revisiting Adaptive Rounding with Vectorized Reparameterization for LLM Quantization
More Than a Quick Glance: Overcoming the Greedy Bias in KV-Cache Compression
Automated Multiple Mini Interview (MMI) Scoring
Sinhala Physical Common Sense Reasoning Dataset for Global PIQA
Misconception Diagnosis From Student-Tutor Dialogue: Generate, Retrieve, Rerank
Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study
Continual Robot Skill and Task Learning via Dialogue
There Is More to Refusal in Large Language Models than a Single Direction
No Global Plan in Chain-of-Thought: Uncover the Latent Planning Horizon of LLMs
Focus-dLLM: Accelerating Long-Context Diffusion LLM Inference via Confidence-Guided Context Focusing
D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use
Dicta-LM 3.0: Advancing The Frontier of Hebrew Sovereign LLMs
Less Noise, More Voice: Reinforcement Learning for Reasoning via Instruction Purification
Think Dense, Not Long: Dynamic Decoupled Conditional Advantage for Efficient Reasoning
Rethinking Genomic Modeling Through Optical Character Recognition
S3-CoT: Self-Sampled Succinct Reasoning Enables Efficient Chain-of-Thought LLMs
Breaking the Static Graph: Context-Aware Traversal for Robust Retrieval-Augmented Generation
Hunt Instead of Wait: Evaluating Deep Data Research on Large Language Models
Mixture-of-Experts with Intermediate CTC Supervision for Accented Speech Recognition
Dissecting Outlier Dynamics in LLM NVFP4 Pretraining
Probe and Skip: Self-Predictive Token Skipping for Efficient Long-Context LLM Inference
The Language You Ask In: Language-Conditioned Ideological Divergence in LLM Analysis of Contested Political Documents