NLP | KOINEU

AdaGReS Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

Retrieval-augmented generation (RAG) is highly sensitive to the quality of selected context, yet standard top-k retrieval often returns redundant or near-duplicate chunks that waste token budget and degrade downstream generation. We present AdaGReS, a redundancy-aware context selection framework for token-budgeted RAG that optimizes a set-level objective combining query-chunk relevance and intra-set redundancy penalties. AdaGReS performs greedy selection under a token-budget constraint using marginal gains derived from the objective, and introduces a closed-form, instance-adaptive calibration of the relevance-redundancy trade-off parameter to eliminate manual tuning and adapt to candidate-pool statistics and budget limits. We further provide a theoretical analysis showing that the proposed objective exhibits epsilon-approximate submodularity under practical embedding similarity conditions, yielding near-optimality guarantees for greedy selection. Experiments on open-domain question answering (Natural Questions) and a high-redundancy biomedical (drug) corpus demonstrate consistent improvements in redundancy control and context quality, translating to better end-to-end answer quality and robustness across settings.

AdaGReS Adaptive Greedy Context Selection via Redundancy-Aware Scoring for Token-Budgeted RAG

BERT-JEPA Reorganizing CLS Embeddings for Language-Invariant Semantics

Beyond Perfect APIs A Comprehensive Evaluation of Large Language Model Agents Under Real-World API Complexity

Big AI is accelerating the metacrisis What can we do?

Bridging the Data Gap Creating a Hindi Text Summarization Dataset from the English XSUM

Classifying long legal documents using short random chunks

Cost-Efficient Cross-Lingual Retrieval-Augmented Generation for Low-Resource Languages A Case Study in Bengali Agricultural Advisory

DeCode Decoupling Content and Delivery for Medical QA

Defensive M2S Training Guardrail Models on Compressed Multi-turn Conversations

Do Large Language Models Know What They Are Capable Of?

Emergent Introspective Awareness in Large Language Models

Exploring the Performance of Large Language Models on Subjective Span Identification Tasks

FormationEval, an open multiple-choice benchmark for petroleum geoscience

Intention Collapse Intention-Level Metrics for Reasoning in Language Models

JMedEthicBench A Multi-Turn Conversational Benchmark for Evaluating Medical Safety in Japanese Large Language Models

K-EXAONE Technical Report

Language as Mathematical Structure Examining Semantic Field Theory Against Language Games

Lying with Truths Open-Channel Multi-Agent Collusion for Belief Manipulation via Generative Montage

mHC Manifold-Constrained Hyper-Connections

Modeling Language as a Sequence of Thoughts

Multi-Dimensional Prompt Chaining to Improve Open-Domain Dialogue Generation

Not All Needles Are Found How Fact Distribution and Don t Make It Up Prompts Shape Literal Extraction, Logical Inference, and Hallucination Risks in Long-Context LLMs

Parallel Universes, Parallel Languages A Comprehensive Study on LLM-based Multilingual Counterfactual Example Generation

pdfQA Diverse, Challenging, and Realistic Question Answering over PDFs

Practising Responsibility Ethics in NLP as a Hands-On Course

PrivacyBench A Conversational Benchmark for Evaluating Privacy in Personalized AI

PyBangla at BLP-2025 Task 2 Enhancing Bangla-to-Python Code Generation with Iterative Self-Correction and Multilingual Agents

R-Debater Retrieval-Augmented Debate Generation through Argumentative Memory

Robust Uncertainty Quantification for Factual Generation of Large Language Models

Routing by Analogy kNN-Augmented Expert Assignment for Mixture-of-Experts

Skim-Aware Contrastive Learning for Efficient Document Representation

Stylometry Analysis of Human and Machine Text for Academic Integrity

Surprisal and Metaphor Novelty Judgments Moderate Correlations and Divergent Scaling Effects Revealed by Corpus-Based and Synthetic Datasets

T3C Test-Time Tensor Compression with Consistency Guarantees

Tackling the Inherent Difficulty of Noise Filtering in RAG

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

< Category Statistics (Total: 301) >

Start searching

No results found