TACK Tunnel Data (TTD): A Benchmark Dataset for Deep Learning-Based Defect Detection in Tunnels

Reading time: 5 minute
...

📝 Original Info

  • Title: TACK Tunnel Data (TTD): A Benchmark Dataset for Deep Learning-Based Defect Detection in Tunnels
  • ArXiv ID: 2512.14477
  • Date: 2025-12-16
  • Authors: Andreas Sjölander, Valeria Belloni, Robel Fekadu, Andrea Nascetti

📝 Abstract

Tunnels are essential elements of transportation infrastructure, but are increasingly affected by ageing and deterioration mechanisms such as cracking. Regular inspections are required to ensure their safety, yet traditional manual procedures are time-consuming, subjective, and costly. Recent advances in mobile mapping systems and Deep Learning (DL) enable automated visual inspections. However, their effectiveness is limited by the scarcity of tunnel datasets. This paper introduces a new publicly available dataset containing annotated images of three different tunnel linings, capturing typical defects: cracks, leaching, and water infiltration. The dataset is designed to support supervised, semi-supervised, and unsupervised DL methods for defect detection and segmentation. Its diversity in texture and construction techniques also enables investigation of model generalization and transferability across tunnel types. By addressing the critical lack of domain-specific data, this dataset contributes to advancing automated tunnel inspection and promoting safer, more efficient infrastructure maintenance strategies.

💡 Deep Analysis

Figure 1

📄 Full Content

TACK Tunnel Data (TTD): A Benchmark Dataset for Deep Learning-Based Defect Detection in Tunnels Andreas Sj¨olander1,*, Valeria Belloni2, Robel Fekadu1, and Andrea Nascetti1 1Civil and Architectural Engineering, KTH Royal Institute of Technology, Stockholm, Sweden 2Department of Civil, Building and Environmental Engineering, Sapienza University of Rome, Rome, Italy *corresponding author: Andreas Sj¨olander(asjola@kth.se) ABSTRACT Tunnels are essential elements of transportation infrastructure, but are increasingly affected by ageing and deterioration mechanisms such as cracking. Regular inspections are required to ensure their safety, yet traditional manual procedures are time-consuming, subjective, and costly. Recent advances in mobile mapping systems and Deep Learning (DL) enable automated visual inspections. However, their effectiveness is limited by the scarcity of tunnel datasets. This paper introduces a new publicly available dataset containing annotated images of three different tunnel linings, capturing typical defects: cracks, leaching, and water infiltration. The dataset is designed to support supervised, semi-supervised, and unsupervised DL methods for defect detection and segmentation. Its diversity in texture and construction techniques also enables investigation of model generalization and transferability across tunnel types. By addressing the critical lack of domain-specific data, this dataset contributes to advancing automated tunnel inspection and promoting safer, more efficient infrastructure maintenance strategies. Background & Summary Tunnels are critical components of infrastructure systems and are typically designed for a technical lifespan of 100 years or more. However, in many countries, existing tunnels are ageing, as most construction materials degrade over time due to natural and mechanical deterioration mechanisms such as cracking and water infiltration. To maintain structural integrity throughout their intended lifespan, regular inspection and maintenance are essential. Consequently, infrastructure owners must continuously plan and optimize monitoring and maintenance strategies. The aim is to minimize the risk of failure that can result in catastrophic consequences, including loss of life, while also limiting downtime that disrupts transportation networks and causes significant economic losses. Traditionally, inspections are performed on-site by experts with basic tools such as hammers and headlamps. For detailed inspections of the concrete lining, skylifts are often required to access the tunnel surface closely. This approach is time-consuming, labor-intensive, prone to human errors, and inherently subjective1. Today, sensors can be easily placed on Mobile Mapping Systems (MMS), enabling automatic collection of infrastructure data. Specifically, high-resolution cameras mounted on an MMS can be easily used to scan the tunnel and collect a large number of images that depict the tunnel lining. This allows the tunnel to be remotely inspected in the office, which reduces the tunnel closing time and improves inspection protocols1–3. However, the inspection is still performed mainly visually. Therefore, the key challenge is to achieve sufficient accuracy in damage detection, mainly cracks, using images collected with high-resolution cameras and automatic image processing techniques. Automatic damage detection methods can be categorized into two main approaches: traditional image processing4 and Machine Learning/Deep Learning (ML/DL) techniques5. Among these, DL methods have recently gained increasing attention due to their superior capabilities in representation learning. DL techniques are generally categorized into three groups: supervised, semi-supervised, and unsupervised learning. Supervised learning typically relies on large annotated datasets to train models such as Convolutional Neural Networks (CNNs) and Transformers for classification, object detection, or segmentation tasks. Supervised models require extensive and labour-intensive labelling for training and testing the model effectively, since the quality and quantity of labeled data highly affect the performance of supervised learning. Semi-supervised learning seeks to overcome these limitations by combining a small set of labelled data with a larger volume of unlabeled data. Techniques such as consistency regularization, pseudo-labeling, and self-training help propagate label information from annotated samples to unlabeled ones. Finally, unsupervised methods such as autoencoders, clustering algorithms, diffusion models, and generative adversarial networks aim to eliminate the need for labeled data, aiming to enhance generalization across diverse datasets. Supervised learning has been extensively explored for crack detection, primarily using CNNs5–13, Transformers14–16, or hybrid architectures that combine both17. In contrast, semi-supervised18,19 and unsupervised20,21 methods have only recently begun to receive attention. a

📸 Image Gallery

Hist_Comparison_Crack_TA_TB_TC.png Hist_Leaching_Total.png Hist_Water_Total.png Manual_Labelling_1.png TA_Best_Crack.png TA_Leaching.png TA_Water.png TA_Worst_Crack.png TA_cr1a.png TA_cr1b.png TA_cr1c.png TA_cr2a.png TA_cr2b.png TA_cr2c.png TA_img.png TA_loss_plot.png TA_metrics_plot.png TB_Best_Crack.png TB_Leaching.png TB_Water.png TB_Worst_Crack.png TB_cr1a.png TB_cr1b.png TB_cr1c.png TB_cr2a.png TB_cr2b.png TB_cr2c.png TB_img.png TC_Best_Crack.png TC_Leaching.png TC_Water.png TC_Worst_Crack.png TC_cr1a.png TC_cr1b.png TC_cr1c.png TC_cr2a.png TC_cr2b.png TC_cr2c.png TC_img.png WSP_1.png WSP_2.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut