Detecting Silent Failures in Multi-Agentic AI Trajectories

February 20, 2026

Reading time: 1 minute

...

📝 Original Info

Title: Detecting Silent Failures in Multi-Agentic AI Trajectories
ArXiv ID: 2511.04032
Date: 2025-11-06
Authors: ** 논문에 명시된 저자 정보가 제공되지 않았습니다. (필요 시 원문 혹은 DOI를 확인해 주세요.) **

📝 Abstract

Multi-Agentic AI systems, powered by large language models (LLMs), are inherently non-deterministic and prone to silent failures such as drift, cycles, and missing details in outputs, which are difficult to detect. We introduce the task of anomaly detection in agentic trajectories to identify these failures and present a dataset curation pipeline that captures user behavior, agent non-determinism, and LLM variation. Using this pipeline, we curate and label two benchmark datasets comprising \textbf{4,275 and 894} trajectories from Multi-Agentic AI systems. Benchmarking anomaly detection methods on these datasets, we show that supervised (XGBoost) and semi-supervised (SVDD) approaches perform comparably, achieving accuracies up to 98% and 96%, respectively. This work provides the first systematic study of anomaly detection in Multi-Agentic AI systems, offering datasets, benchmarks, and insights to guide future research.

Detecting Silent Failures in Multi-Agentic AI Trajectories

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

Reference

Table of Contents

Table of Contents

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

Reference

Start searching

No results found