Application of deep learning techniques in non-contrast computed tomography pulmonary angiogram for pulmonary embolism diagnosis
📝 Original Paper Info
- Title: Application of deep learning techniques in non-contrast computed tomography pulmonary angiogram for pulmonary embolism diagnosis- ArXiv ID: 2601.00925
- Date: 2026-01-01
- Authors: I-Hsien Ting, Yi-Jun Tseng, Yu-Sheng Lin
📝 Abstract
Pulmonary embolism is a life-threatening disease, early detection and treatment can significantly reduce mortality. In recent years, many studies have been using deep learning in the diagnosis of pulmonary embolism with contrast medium computed tomography pulmonary angiography, but the contrast medium is likely to cause acute kidney injury in patients with pulmonary embolism and chronic kidney disease, and the contrast medium takes time to work, patients with acute pulmonary embolism may miss the golden treatment time. This study aims to use deep learning techniques to automatically classify pulmonary embolism in CT images without contrast medium by using a 3D convolutional neural network model. The deep learning model used in this study had a significant impact on the pulmonary embolism classification of computed tomography images without contrast with 85\% accuracy and 0.84 AUC, which confirms the feasibility of the model in the diagnosis of pulmonary embolism.💡 Summary & Analysis
1. **Overcoming Limitations of Previous Studies**: Previous studies mainly used contrast-enhanced CTPA images for diagnosing PE. This study aims to use non-contrast CT scans, much like finding a small light in a dark room, making accurate diagnosis possible even with commonly available CT scans.-
Reducing Physicians’ Burden: Doctors need significant time and effort to interpret CTPA images. This research uses AI to help physicians diagnose PE more easily, similar to how GPS guides drivers, providing them quick and precise information they need.
-
Minimizing Patient’s Burden: Contrast media can have adverse effects on the kidneys. By diagnosing PE without contrast media, this study allows patients to receive treatment with less burden, reducing health risks associated with kidney damage.
📄 Full Paper Content (ArXiv Source)
Deep Learning, Convolutional Neural Network, Pulmonary Embolism, Contrast
Introduction
Research Background
The modern lifestyle of individuals often involves prolonged periods of sitting, whether it’s for work, commuting, or leisure activities. This sedentary lifestyle is prevalent in students due to educational demands, as well as in the elderly, where reduced mobility can lead to extended periods of inactivity or confinement to bed. These factors, among others such as pregnancy, uncontrolled cardiovascular diseases, and a history of cancer, can contribute to the development of lower limb venous thrombosis. Notably, studies like that of have shown that lower limb venous thrombosis is the most common cause of Pulmonary Embolism (PE) resulting from blood clots.
In recent years, the field of deep learning has seen significant growth, particularly in applications within the domain of medical imaging. Research focused on employing deep learning to aid in diagnosing PE through computed tomography pulmonary angiography (CTPA) has gained prominence. Studies like that of have sought to replace manual interpretation of CTPA results with deep learning techniques, aiming to alleviate the clinical workload for medical practitioners. Additionally, have improved the quality of medical images from CTPA scans, thus reducing the radiation dose.
When a physician strongly suspects a patient may have PE, CTPA has become the primary diagnostic method (). It enables immediate identification and prompt treatment of patients based on medical imaging results. Nevertheless, the risk of underdiagnosing PE remains a significant concern (), mainly because of the need for physicians to manually interpret CTPA images. Each patient’s CTPA scan typically comprises approximately 100-300 slices, necessitating extensive time for physicians to meticulously examine each pulmonary artery to detect potential embolic symptoms. Lack of experience, physician fatigue (), or inadequate physician coverage can lead to prolonged judgment times and diagnostic errors. Consequently, the healthcare system is burdened with increased stress (, ). Moreover, as the application of CTPA scans grows, ensuring the provision of accurate and timely diagnostic images becomes increasingly challenging for healthcare systems and physicians ().
Patients undergoing CTPA scans are at risk of developing acute kidney injury (AKI), a common occurrence in critically ill patients with PE (). Once AKI occurs, the patient’s risk of death rises significantly. Additionally, CTPA scans require the administration of contrast medium, and this requires a certain period to allow the contrast medium to circulate through the pulmonary system before the actual CT scan (). Patients with acute PE may lose valuable treatment time while waiting for the contrast medium to take effect. This delay could be critical since patients with acute PE require immediate diagnosis and treatment.
Research Motivation
CTPA images not only serve as the most commonly used method for diagnosing PE but also allow physicians to definitively determine the presence of embolic symptoms in patients without the need for further diagnostic confirmation. This makes CTPA images ideal for supervised learning methods in deep learning for image classification (). In recent research, deep learning algorithms used in the medical imaging field predominantly employ supervised learning (), particularly convolutional neural networks (CNNs) (). To address healthcare system and physician-related issues, Yang et al.’s research () proposed a two-stage CNN for assessing PE in CTPA scans. The model exhibited a sensitivity of 75.4% when evaluated using CTPA test data from 20 patients. introduced PENet, a 3D CNN, for detecting PE. When tested on two datasets, PENet achieved AUROCs of 0.84 and 0.85. These methods represent new avenues for human-robot collaboration (HRC) in healthcare (). They reduce the probability of diagnostic errors while also decreasing the time physicians spend interpreting images, enabling more appropriate treatment for patients.
While recent studies have used CNNs to diagnose PE in CTPA. Although the accuracy could be improved from a physician’s point of view, these images have been acquired after the administration of contrast medium. CTPA images with contrast medium tend to provide clearer distinctions for diagnosis, as shown in Figure 1. However, contrast medium can have adverse effects on the body. They exert cytotoxic effects on the renal tubular epithelial cells (RTE) of the kidneys, leading to loss of function, severe cell damage, and even cell death (). Furthermore, factors such as patient history and prior surgeries can influence the impact of contrast medium on the kidneys (). Studies have confirmed that contrast medium are the most significant independent risk factor for acute kidney injury in patients with severe chronic kidney disease (). Additionally, the quality of images can be influenced by factors such as the time the contrast medium circulates in the patient’s body, the patient’s physiological factors, and the concentration of the contrast medium (). In summary, the risk associated with CTPA images is higher than non-contrast CT scans. This is because CTPA scans involving contrast medium introduce the risk of renal exposure to the contrast medium, increasing the overall risk for patients requiring related medical procedures. Consequently, this study aims to apply deep learning to classify PE in non-contrast CT scans of the lungs.
Research Objective
Taking into consideration the aspects discussed above, this study will use data obtained from our collaborating institution to perform deep learning classification. The study aims to automatically classify the presence or absence of PE in non-contrast CT scans of the lungs using a deep learning CNN model. Through the research objectives outlined above, we hope to employ the results of this study to utilize supervised learning techniques in deep learning, enhancing the automated detection of PE. By doing so, we aim to alleviate the burden on medical practitioners, improve the efficiency of PE diagnosis, reduce diagnostic errors, and minimize the physical burden on patients, ultimately lowering mortality rates.
Literature Review
Pulmonary Embolism
Pulmonary Embolism (PE) is a condition characterized by the blockage of any pulmonary blood vessel by a blood clot. This obstruction hinders the flow of blood within the vessel, resulting in reduced oxygen content in the lung tissue, leading to hypoxemia, and posing a life-threatening risk (). PE is a common presentation in emergency departments and a prevalent cause of mortality. Annually, approximately 50,000 to 200,000 people succumb to PE, with a 30-day mortality rate ranging from 0.5% to over 20%, contingent upon the presentation at the time of onset (). Typical symptoms of PE include dyspnea, chest pain, cough, and other common manifestations. However, less common symptoms such as gradual-onset dyspnea over days to weeks or fainting with less prominent respiratory symptoms can also occur. Therefore, healthcare professionals need to conduct further examinations on patients with potential cardiopulmonary symptoms to confirm the presence of PE. Delayed diagnosis of PE can have severe consequences, with up to 95% of patients succumbing before diagnosis, and most fatalities occurring in untreated individuals (, ). Therefore, early detection and treatment of PE can significantly reduce mortality rates.
Contrast Medium
Contrast medium is a pharmaceutical agent used in medical imaging to enhance the visibility of body tissues. computed tomography pulmonary angiography (CTPA) is a medical imaging technique that employs contrast medium. CTPA is the primary method for diagnosing PE (). It allows physicians to promptly identify and treat PE using CTPA images. In recent years, deep learning techniques have been applied to diagnose PE using CTPA images, such as the studies conducted by , , and . Although CTPA is the primary diagnostic method for PE, the use of contrast medium carries potential risks for the patient. Research by indicates that contrast medium exerts cytotoxic effects on renal tubular epithelial cells (RTE) in the kidneys, leading to functional impairment, severe cellular damage, and cell apoptosis and necrosis. proposed that factors related to the patient and surgical history significantly impact the risk of developing Acute Kidney Injury (AKI) following contrast medium administration. In patients with chronic kidney disease, contrast medium is the most potent independent risk factor for acute kidney injury . Given the elevated risk of contrast medium to the kidneys, it is prudent to explore the application of deep learning techniques to non-contrast CT scans of the lungs for the diagnosis of PE. Currently, there is a lack of relevant studies in this area. Thus, this study will focus on utilizing non-contrast CT scans of the lungs in conjunction with deep learning techniques to classify the presence or absence of PE.
Application of Deep Learning
In recent years, the majority of algorithms in medical imaging research have employed convolutional neural networks (CNNs), primarily in the diagnosis of PE using CTPA images. For example, applied a 3D CNN to CTPA images for the detection of PE. They initiated the process with lung segmentation, followed by the identification of potential embolic regions within the lungs using a tobogganing algorithm. They then represented these regions using multi-planar vessel-aligned images and generated 2-channel images for each potential embolic region. Finally, a 9-layer CNN model, as shown in Figure 2, was employed for classification, including two convolutional layers, subsampling layers, two locally connected layers, vectorization, and fully connected layers with a Softmax layer. The convolutional layers and subsampling layers focused on feature extraction, followed by using locally connected layers to extract regionally relevant features to enhance computational efficiency. The 2D images were then vectorized for classification using fully connected and Softmax layers. This approach involves substantial pre-processing to generate features as inputs for the CNN model.
proposed a two-stage CNN model, evaluated using a test set of CTPA images from 20 patients. They achieved a sensitivity of 75.4%. In the first stage, they used a 3D fully convolutional network (FCN) to segment and extract potential pulmonary embolic regions, as illustrated in Figure 3. The encoder began with a 3D convolutional layer, followed by a max-pooling layer and four residual blocks for feature extraction. A residual block and two deconvolution layers were used to up-sample the feature image, incorporating location information regarding thrombus. A skip connection linked the final two residual blocks of the encoder with their counterparts in the decoder. In the second stage, the extracted pulmonary embolic regions underwent vessel alignment using ResNet-18 to assess obstruction in the contrast-enhanced blood vessels, thereby identifying PE symptoms. This approach focused on segmenting critical regions and extracting potential lesions, rather than utilizing the entire CTPA image for evaluation.
introduced the PENet, a 3D end-to-end CNN model with a two-test set evaluation for PE detection, achieving AUROC values of 0.84 and 0.85. They developed a PENet composed of four modules, as depicted in Figure 4: the PENet unit, Squeeze-and-Excitation block (SE-block), PENet bottleneck, and the PENet encoder. The PENet unit processed 3D input data through 3D CNN and group normalization, utilizing LeakyReLU activation. The SE-block, by simulating interdependencies among channels, autonomously adjusted the feature characteristics of the channels. The PENet bottleneck consisted of three PENet units, with the final unit inserting an SE-block after group normalization. A residual connection was applied between the input of the PENet bottleneck and the output of the SE-block. The model was concluded with a PENet unit, four PENet encoders, and Gap Linear activation.
These studies focused on applying CNN models to CTPA images. Some have mentioned the desire to utilize non-contrast CT scan images. For instance, employed non-contrast CT lung images to identify lung parenchyma and lesion-related manifestations in COVID-19 patients. This research involved segmenting regions of interest (ROI) and extracting features from these regions within the complete CT scan images of each patient. To correctly segment the lung regions of COVID-19 patients, they trained two CNN models based on the V-Net architecture, including a healthy lung parenchyma model and a lesion model, as depicted in Figure 5. These models used OR logic to produce the final outcome. This method focused on segmenting ROI, extracting potential pathological regions, and not using the entire CT scan lung images. This study, however, employed a more intricate approach, involving training two CNN models to correctly segment the lung regions of COVID-19 patients. It employed substantial pre-processing to generate features as inputs for the CNN model and was more complex compared to other studies.
Therefore, this study aims to apply CNN models to non-contrast CT scan images of the lungs. The dataset for lung CT scan images will be provided by collaborating institutions. The reason for not using existing datasets is that they exclusively comprise CTPA images, making them unsuitable for this research. Thus, the study will use the dataset provided by collaborating institutions.
Research Methodology
Research Framework
The research framework of this study is depicted in Figure 6. The study utilizes a dataset of computed tomography (CT) scans of the lungs provided by collaborating institutions. This dataset is divided into a training set, a validation set, and a test set. Pre-processing is performed on the CT scan images, including adjustment of Hounsfield Units (HU) ranges and image resizing. Subsequently, the pre-processed data is used for model training. The model is then evaluated and analyzed using the validation set. If the results require improvement, model parameters are adjusted. Finally, the test set is used for classifying the presence or absence of Pulmonary Embolism (PE), and the results are reported.
Data Source
This study utilizes a dataset provided by collaborating institutions. These datasets are sourced from the radiology department of these institutions and have been subjected to review by an Institutional Review Board (IRB) in accordance with the Personal Data Protection Act. CT scans involve the combination of X-rays and computer technology to produce cross-sectional images of the body. These images can be further processed to create detailed 3D images. Each CT scan slice has a thickness of 1 millimeter, and each patient’s scan typically consists of 100 to 300 slices, as shown in Figure 7. In total, data has been collected from 192 patients and divided into training (138 patients), validation (34 patients), and test sets (20 patients), as shown in Table 1.
| Training | Validation | Testing | Overall | |
|---|---|---|---|---|
| Number of Studies | 138 | 34 | 20 | 192 |
| Number of positive PE | - | - | - | 80 |
| Number of negative PE | - | - | - | 112 |
Data Pre-processing
Adjusting Hounsfield Units
CT scans measure the X-ray absorption capabilities, known as radiodensity, of different body tissues. These scans produce grayscale images where tissue density is represented as shades of gray. The HU scale ranges from -1000 (black on the grayscale, representing air) to +3000 (the upper limit, representing dense bone), as mentioned by reference . HU are measurement values used by radiologists to interpret CT scan images, aiding in image analysis and disease diagnosis . Table 2 below displays the HU ranges utilized in this study.
| Substance | HU |
|---|---|
| Air | -1000 |
| Lung | -700to-600 |
| Non-coagulated blood | +13 to +50 |
| Coagulated blood | +50 to +75 |
| Vascular calcification | $`\geq`$+130 |
| Cancellous bone | +300 to +400 |
| Compact bone | +3000 |
As Table 2 illustrates, the HU range for lung tissue extends from -1000 to +3000. Therefore, this study adjusts the HU of the CT scan images to several specific ranges to explore their impact on model performance:
-
-1000 to +80: This range encompasses air, lung tissue, non-coagulated blood, and distinguishes coagulated blood at its maximum HU value.
-
-1000 to +130: Adjusting the range to +130 extends the observable coagulated blood range while still excluding vascular wall calcification.
-
-1000 to +270: In this range, additional lung tissue and vascular wall calcification features are observable.
-
-1000 to +400: This relatively broad range covers various lung tissue densities, including bones with different radiodensity, with +400 set as the upper limit.
-
-1000 to +800: The widest HU range in this study covers all possible tissue and material types, including lung tissue and vascular calcification.
In addition to adjusting the range from -1000 HU for air, this study also considered the detection of PE in the lungs, so the range was adjusted from -700 HU for lungs, and the following ranges were adjusted:
-
-700 to +80
-
-700 to +130
-
-700 to +270
-
-700 to +400
-
-700 to +800
The reasons for the above range adjustments are the same as those for the Heinz units starting at -1000, so we will not repeat them. To aid the model in better understanding and processing HU, this study normalizes the HU values. If HU values exceed the maximum (minimum) value, they are displayed as the input’s maximum (minimum) value. These HU range adjustments aim to facilitate model interpretation by scaling HU values to a range between 0 and 1.
The Architecture of the CNN model
This study employs a 3D CNN model based on the architecture proposed by . As depicted in Figure 8, the model comprises four 3D convolutional layers. The first two layers contain 64 filters, followed by layers with 128 and 256 filters. All filters have a kernel size of 3×3×3. Each convolutional layer is succeeded by a max-pooling layer with a stride of 2, ReLU activation, and Batch Normalization. These layers serve various purposes, including feature extraction, feature condensation, increasing learning rates, and preventing overfitting.
The extracted features are processed through Global Average Pooling (GAP), a layer that requires no parameter optimization, thus mitigating the risk of overfitting. After GAP, the input is directed to a Dense layer with 512 neurons. Additionally, the Dropout layer is configured with a dropout probability (p) set to 0.3. This means that each neuron has a 30% chance of deactivation, which translates to approximately 300 out of 1000 neurons being deactivated. In other words, the output of these 300 neurons is set to 0, effectively addressing the issue of overfitting. Finally, the output results are passed to a Sigmoid layer for binary classification, determining the presence or absence of PE. This model maintains relative simplicity to prevent overparameterization, with only 1,352,897 trainable parameters. Furthermore, due to limitations in training sample size and hardware performance, the model is kept straightforward.
Empirical Study and Evaluation
Experimental Procedure
The experimental procedure of this study is illustrated in Figure 9.
Data pre-processing
-
Conversion from DICOM to NIFTI: A patient’s lung computed tomography (CT) scan comprises many DICOM files, which are individual 2D slices. However, NIFTI files combine these 2D images into a 3D volume, making them more convenient for use in deep learning compared to numerous DICOM files.
-
Data normalization: This study adjusted the Hounsfield Units (HU) of lung CT scan images to various ranges, including -1000 to +80, -1000 to +130, -1000 to +270, -1000 to +400, -1000 to +800, and -700 to +80, -700 to +130, -700 to +270, -700 to +400, -700 to +800, and scaled the HU between 0 and 1.
-
Resize images: As described in Chapter 3, Section 3, this study resized all lung computed tomography (CT) scan images to a uniform size for subsequent model training.
Data augmentation
Perform data augmentation: Data augmentation was performed at six different angles: -20 degrees, -10 degrees, -5 degrees, 5 degrees, 10 degrees, and 20 degrees, resulting in a sixfold increase in data volume to enhance both sample size and diversity.
Data splitting
5-fold cross-validation: As described in Chapter 4, Section 2, the dataset was divided into training and validation sets using 5-fold cross-validation to ensure model training and evaluation accuracy.
Model training
-
Utilizing 3D convolutional neural network (CNN) model: As described in Chapter 3, this study employed a model architecture based on Zunair et al. for model training.
-
Model training: Training was conducted using the training sets obtained through 5-fold cross-validation.
Model evaluation
-
Evaluate model performance: The training and validation sets from 5-fold cross-validation were used to evaluate the model based on accuracy and loss functions.
-
Adjust model parameters: If the model evaluation results require improvement, adjustments to model parameters were made to enhance performance.
Classification and result reporting
-
Test set classification: The test set was used to classify the presence or absence of Pulmonary Embolism (PE).
-
Generate result reports: Classification results were compiled, and the model’s overall performance on the test set was evaluated using four metrics from the confusion matrix: accuracy, sensitivity, specificity, and Area Under the Curve (AUC). Experimental results were presented.
Evaluation Metrics
| Actually Positive | Actually Negative | |
|---|---|---|
| Predicted Positive | True Positive | False Positive |
| Predicted Negative | False Negative | True Negative |
This study employs a 5-fold cross-validation approach for model validation and evaluation. Additionally, this study evaluates the model using a confusion matrix. A confusion matrix is a common method for assessing model performance. As shown in Table 3, “True” and “False” represent whether the prediction itself is correct or incorrect, while “Positive” and “Negative” indicate the direction of the prediction, whether it is positive or negative. The confusion matrix consists of four elements: True Positive (TP), True Negative (TN), False Positive (FP), and False Negative (FN). TP refers to correctly predicting positive samples, meaning that the prediction matches the actual situation. TN refers to correctly predicting negative samples, where the prediction matches the actual situation. FP occurs when a negative sample is incorrectly predicted as positive, indicating a mismatch between the prediction and the actual situation. FN refers to incorrectly predicting a positive sample as negative, again indicating a mismatch between the prediction and the actual situation.
Using the elements TP, TN, FP, and FN from the confusion matrix, four key metrics, including accuracy, sensitivity, specificity, and AUC (Area under the Curve), are calculated to evaluate the model. Accuracy represents the ratio of correct predictions. In the medical field, two important metrics for determining whether a patient has a condition are sensitivity and specificity. Sensitivity measures the ability to detect TP samples, i.e., how many individuals with the condition are correctly identified. Specificity measures the ability to detect TN samples, i.e., how many healthy individuals are correctly identified as such. AUC is the area under the ROC curve (Receiver Operating Characteristic Curve), as depicted in Figure 10. The ROC curve’s horizontal axis represents 1 Specificity (FPRate), while the vertical axis represents Sensitivity (TPRate). TPRate indicates the proportion of correctly identified positive samples among all actual positive samples. FPRate indicates the proportion of falsely identified positive samples among all actual negative samples. Achi eving a higher TPRate than FPRate signifies better model performance, and AUC helps mitigate issues related to imbalanced sample sizes. The Three indicators are defined as follows.
\begin{equation}
Accuracy=\frac{TP+TN}{TP+TN+FP+FN}
\end{equation}
\begin{equation}
Sensitivity=\frac{TP}{TP+FN}
\end{equation}
\begin{equation}
Specificity=\frac{TN}{TN+FP}
\end{equation}
Experimental Results
This section presents the experimental results and evaluates the model’s performance across different HU ranges. The study uses 5-fold cross-validation during the experimentation to ensure result accuracy.
The experimental results are shown in Table 4 and 5. In summary, each HU range has its advantages and disadvantages, and a comprehensive comparison of these results can provide a better understanding of the model’s strengths and limitations within different HU ranges. The study can then select the most suitable HU range to achieve optimal model performance, considering practical application scenarios, medical requirements, and dataset characteristics.
| HU | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| -1000 to +80 | 77% | 67% | 71% | 0.84 |
| -1000 to +130 | 85% | 33% | 94% | 0.84 |
| -1000 to +270 | 85% | 33% | 94% | 0.84 |
| -1000 to +400 | 80% | 33% | 82% | 0.80 |
| -1000 to +800 | 78% | 67% | 71% | 0.77 |
| HU | Accuracy | Sensitivity | Specificity | AUC |
|---|---|---|---|---|
| -700 to +80 | 80% | 67% | 76% | 0.75 |
| -700 to +130 | 83% | 67% | 82% | 0.80 |
| -700 to +270 | 75% | 67% | 65% | 0.82 |
| -700 to +400 | 85% | 33% | 94% | 0.76 |
| -700 to +800 | 73% | 67% | 59% | 0.82 |
Conclusion
Conclusion
Modern lifestyles, characterized by sedentary behaviors, have led to an increased prevalence of sedentary lifestyle-related health risks. Among these risks is the potential for Pulmonary Embolism (PE). In recent years, deep learning has seen rapid advancements in various fields, with the medical sector witnessing research employing deep learning for assisting in the diagnosis of PE. Traditionally, the manual identification of PE by physicians could be time-consuming and prone to errors due to fatigue. As a result, deep learning techniques have been employed to automatically identify the presence of PE in chest computerized tomography (CT) scans. However, most existing studies have used deep learning models for classifying PE in CT pulmonary angiogram (CTPA) images that involve the use of a contrast medium. This study, on the other hand, employs non-contrast CT scans and deep learning models for PE classification.
Differing from previous approaches, this study aims to explore the performance of deep learning models from a different perspective. The absence of a contrast medium in non-contrast chest CT scans often results in less conspicuous PE images compared to contrast images. Therefore, various Hounsfield Unit (HU) range adjustments were made to assess the impact of each range on the performance of a 3D convolutional neural network(CNN) model. The results presented by the model also indicate variations in performance between each HU range. Among these, the HU ranges of -1000 to +130, -1000 to +270, and -700 to +400 exhibited better performance in detecting true negatives, with accuracy rates of 85%, sensitivity and specificity at 33% and 94%, and AUC values of 0.84 for all three. Conversely, the HU ranges of -1000 to +80, -1000 to +800, -700 to +80, and -700 to +130 showed relatively better performance in detecting true positives, with performance in detecting true negatives falling in between. These four ranges demonstrated accuracy rates of 77%, 78%, 80%, and 83%, sensitivity of 67%, specificity at 71%, 71%, 76%, and 82%, and AUC values of 0.84, 0.77, 0.75, and 0.80, respectively.
Research Findings
This study utilized a 3D CNN model within deep learning to investigate the applicability of non-contrast chest CT scans for identifying the presence of PE. The following are the research findings obtained during the experimental process:
-
Impact of HU ranges: The experimental results demonstrated a significant influence of different HU ranges on the performance of the deep learning model. Performance in detecting true negatives for the latter set of HU ranges was moderately satisfactory, indicating that adjusting the HU range is one of the factors affecting model performance.
-
Model advantages and limitations: The deep learning model employed in this study strikes a balance between complexity and simplicity, making it capable of achieving good model performance under limited computational resources. Simultaneously, the model is not overly complex, which helps mitigate the risk of overfitting. However, this model heavily relies on the quality and diversity of the dataset, and a limited dataset may result in reduced generalization performance.
Future Research Suggestions
There are several promising directions for future research to enhance the model’s performance and practical applications.
-
Explore different HU ranges: As evidenced by this study’s experimental results, different HU ranges have a notable impact on the model’s performance. Future research can further explore the effects of various HU ranges on model performance to enhance classification accuracy. This exploration could help identify HU ranges that are better suited for non-contrast chest CT scans.
-
Multimodal fusion: In addition to HU ranges, other image features like shape, texture, and structure can be considered for integration. Employing multiple features can enhance the model’s ability to classify non-contrast chest CT images for PE.
-
Multimodal learning: Future research can explore the integration of different modalities of medical data, such as the combination of physiological parameters and clinical information. The incorporation of multimodal learning can provide a more comprehensive basis for diagnosis and further improve the model’s accuracy.
00
P. Suadicani, H. Hannerz, E. Bach, and F.Gyntelberg, “Jobs encompassing prolonged sitting in cramped positions and risk of venous thromboembolism: cohort study,” JRSM Short Rep., vol. 3, no. 2, pp. 1–5, 2012, doi:10.1258/shorts.2011.011121.
S. C.Huang et al., “PENet—a scalable deep-learning model for automated diagnosis of pulmonary embolism using volumetric CT imaging,” npj Digit. Med., vol. 3, no. 1, pp. 1–9, 2020, doi:10.1038/s41746-020-0266-y.
M.Lenfant et al., “Deep learning versus iterative reconstruction for CT pulmonary angiography in the emergency setting: Improved image quality and reduced radiation dose,” Diagnostics, vol. 10, no. 8, pp. 1–14, 2020, doi:10.3390/diagnostics10080558.
G. Le Gal and H. Bounameaux, “Diagnosing pulmonary embolism: Running after the decreasing prevalence of cases among suspected patients,” J. Thromb. Haemost., vol. 2, no. 8, pp. 1244–1246, 2004, doi:10.1111/j.1538-7836.2004.00795.x.
U. S. Yavas, C. Calisir, and I. R. Ozkan, “The interobserver agreement between residents and experienced radiologists for detecting pulmonary embolism and DVT with using CT pulmonary angiography and indirect CT venography,” Korean J. Radiol., vol. 9, no. 6, pp. 498–502, 2008, doi:10.3348/kjr.2008.9.6.498.
R. Joshi, K. Wu, J. Kaicker, and H. Choudur, “Reliability of on-call radiology residents’ interpretation of 64-slice CT pulmonary angiography for the detection of pulmonary embolism,” Acta radiol., vol. 55, no. 6, pp. 682–690, 2014, doi:10.1177/0284185113506135.
T. J. Kline and T. S. Kline, “Radiologists, communication, and resolution 5: A medicolegal issue,” Radiology, vol. 184, no. 1, pp. 131–134, 1992, doi:10.1148/radiology.184.1.1609068.
R. Fitz Gerald, “Radiological error: Analysis, standard setting, targeted instruction and teamworking,” Eur. Radiol., vol. 15, no. 8, pp. 1760–1767, 2005, doi:10.1007/s00330-005-2662-8.
J. D. Prologo, R. C. Gilkeson, M. Diaz, and J. Asaad, “CT pulmonary angiography: A comparative analysis of the utilization patterns in emergency department and hospitalized patients between 1998 and 2003,” Am. J. Roentgenol., vol. 183, no. 4, pp. 1093–1096, 2004, doi:10.2214/ajr.183.4.1831093.
C. H. Chang et al., “Acute kidney injury in patients with pulmonary embolism: A population-based cohort study,” Med. (United States), vol. 96, no. 9, 2017, doi:10.1097/MD.0000000000005822.
K. T. Bae, “Intravenous contrast medium administration and scan timing at CT: Considerations and approaches,” Radiology, vol. 256, no. 1, pp. 32–61, 2010, doi:10.1148/radiol.10090908.
C. R. Weiss, J. C. Scatarige, G. B. Diette, E. F. Haponik, B. Merriman, and E. K. Fishman, “CT pulmonary angiography is the first-line imaging test for acute pulmonary embolism: A survey of US clinicians,” Acad. Radiol., vol. 13, no. 4, pp. 434–446, 2006, doi:10.1016/j.acra.2006.01.002.
N. U. Islam, S. Gehlot, Z. Zhou, M. B. Gotway, and J. Liang, “Seeking an Optimal Approach for Computer-Aided Pulmonary Embolism Detection BT - Machine Learning in Medical Imaging,” 2021, pp. 692–702.
J. Ker, L. Wang, J. Rao, and T. Lim, “Deep Learning Applications in Medical Image Analysis,” IEEE Access, vol. 6, no. c, pp. 9375–9379, 2017, doi:10.1109/ACCESS.2017.2788044.
X. Yang et al., “A Two-Stage Convolutional Neural Network for Pulmonary Embolism Detection From CTPA Images,” IEEE Access, vol. 7, pp. 84849–84857, 2019, doi:10.1109/ACCESS.2019.2925210.
M. Kong, Q. He, and L. Li, “AI Assisted Clinical Diagnosis Treatment, and Development Strategy,” Chinese J. Eng. Sci., vol. 20, no. 2, p. 86, 2018, doi:10.15302/j-sscae-2018.02.013.
E. Seeliger, M. Sendeski, C. S. Rihal, and P. B. Persson, “Contrast-induced kidney injury: Mechanisms, risk factors, and prevention,” Eur. Heart J., vol. 33, no. 16, pp. 2007–2015, 2012, doi:10.1093/eurheartj/ehr494.
P. A. McCullough et al., “Risk Prediction of Contrast-Induced Nephropathy,” Am. J. Cardiol., vol. 98, no. 6 SUPPL. 1, pp. 27–36, 2006, doi:10.1016/j.amjcard.2006.01.022.
T. T. Tsai et al., “Contemporary incidence, predictors, and outcomes of acute kidney injury in patients undergoing percutaneous coronary interventions: Insights from the NCDR cath-PCI registry,” JACC Cardiovasc. Interv., vol. 7, no. 1, pp. 1–9, 2014, doi:10.1016/j.jcin.2013.06.016.
S. V. Konstantinides, S. Barco, M. Lankeit, and G. Meyer, “Management of Pulmonary Embolism: An Update,” J. Am. Coll. Cardiol., vol. 67, no. 8, pp. 976–990, 2016, doi: 10.1016/j.jacc.2015.11.061.
M. Hepburn Brown, J. Darvall, and G. Hammerschlag, “Acute pulmonary embolism: a concise review of diagnosis and management,” Intern. Med. J., vol. 49, no. 1, pp. 15–27, 2019, doi:10.1111/imj.14145.
A. Frölich and C. Neumann, “Acute Pulmonary Embolism,” Dtsch. Arztebl. Int., vol. 114, no. 22–23, p. 398, 2017, doi: 10.3238/arztebl.2017.0398.
H. H. B. Yoo, V. S. Nunes-Nogueira, and P. J. Fortes Villas Boas, “Anticoagulant treatment for subsegmental pulmonary embolism,” Cochrane Database Syst. Rev., vol. 2020, no. 2, 2020, doi:10.1002/14651858.CD010222.pub4.
N. Tajbakhsh, M. B. Gotway, and J. Liang, “Computer-Aided Pulmonary Embolism Detection Using a Novel Vessel-Aligned Multi-planar Image Representation and Convolutional Neural Networks,” in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, 2015, pp. 62–69.
V. Bevilacqua et al., “Lung segmentation and characterization in covid-19 patients for assessing pulmonary thromboembolism: An approach based on deep learning and radiomics,” Electron., vol. 10, no. 20, 2021, doi:10.3390/electronics10202475.
R.Bibb, D.Eggbeer, andA.Paterson, “2-Medical imaging”, Medical Modelling (Second Edition), Second Edi., R.Bibb, D.Eggbeer, andA.Paterson, Eds.Oxford: Woodhead Publishing, 2015, pp. 7–34.
T. D. Den Otter and J. Schubert, Houns field Unit. Stat Pearls Publishing, Treasure Island (FL), 2022.
H. Zunair, A. Rahman, N. Mohammed, and J. P. Cohen, “Uniformizing Techniques to Process CT Scans with 3D CNNs for Tuberculosis Prediction,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 12329 LNCS, pp. 156–168, 2020, doi: 10.1007/978-3-030-59354-4_15.
📊 논문 시각자료 (Figures)









