cs.CL
Multi-Token Prediction via Self-Distillation
Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space
Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory
A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies
Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation
LLM4AD: Large Language Models for Autonomous Driving -- Concept, Review, Benchmark, Experiments, and Future Trends
Codified Finite-state Machines for Role-playing
DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training
SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?
When Are Two RLHF Objectives the Same?
Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models
Polyglots or Multitudes? Multilingual LLM Answers to Value-laden Multiple-Choice Questions
MAGIC: A Co-Evolving Attacker-Defender Adversarial Game for Robust LLM Safety
Did somebody say "Gest-IT"? A pilot exploration of multimodal data management
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
DimStance: Multilingual Datasets for Dimensional Stance Analysis
DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis
inversedMixup: Data Augmentation via Inverting Mixed Embeddings
Constrained Group Relative Policy Optimization
MoSE: Mixture of Slimmable Experts for Efficient and Adaptive Language Models
Scaling Knowledge Graph Construction through Synthetic Data Generation and Distillation
Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability