Neurocircuitry-Inspired Hierarchical Graph Causal Attention Networks for Explainable Depression Identification

Reading time: 46 minute
...

📝 Original Info

  • Title: Neurocircuitry-Inspired Hierarchical Graph Causal Attention Networks for Explainable Depression Identification
  • ArXiv ID: 2511.17622
  • Date: 2025-11-18
  • Authors: Weidao Chen, Yuxiao Yang, Yueming Wang

📝 Abstract

Major Depressive Disorder (MDD), affecting millions worldwide, exhibits complex pathophysiology manifested through disrupted brain network dynamics. Although graph neural networks that leverage neuroimaging data have shown promise in depression diagnosis, existing approaches are predominantly datadriven and operate largely as black-box models, lacking neurobiological interpretability. Here, we present NH-GCAT (Neurocircuitry-Inspired Hierarchical Graph Causal Attention Networks), a novel framework that bridges neuroscience domain knowledge with deep learning by explicitly and hierarchically modeling depression-specific mechanisms at different spatial scales. Our approach introduces three key technical contributions: (1) at the local brain regional level, we design a residual gated fusion module that integrates temporal blood oxygenation level dependent (BOLD) dynamics with functional connectivity patterns, specifically engineered to capture local depression-relevant low-frequency neural oscillations; (2) at the multi-regional circuit level, we propose a hierarchical circuit encoding scheme that aggregates regional node representations following established depression neurocircuitry organization, and (3) at the multi-circuit network level, we develop a variational latent causal attention mechanism that leverages a continuous probabilistic latent space to infer directed information flow among critical circuits, characterizing disease-altered whole-brain inter-circuit interactions. Rigorous leave-one-site-out cross-validation on the REST-meta-MDD dataset demonstrates NH-GCAT's state-of-the-art performance in depression classification, achieving a sample-size weighted-average accuracy of 73.3% and an AUROC of 76.4%, while simultaneously providing neurobiologically meaningful explanations. This work represents a significant advancement toward mechanismaware, explainable artificial intelligence (AI) systems for psychiatric diagnosis.

📄 Full Content

Major Depressive Disorder (MDD) is a leading cause of disability worldwide, with substantial individual and societal burden (Yan et al., 2019;Ferrari et al., 2013;Nestler et al., 2002). Early and accurate identification is critical for improving clinical outcomes, yet the neurobiological mechanisms underlying MDD remain poorly understood, posing significant challenges for objective diagnosis (Duman & Aghajanian, 2012;Drysdale et al., 2017). Recent advances in functional magnetic resonance imaging (fMRI) have enabled large-scale mapping of brain network dynamics, facilitating the identification of depression by analyzing altered functional connectivity patterns (Mulders et al., 2015). Since the brain network topology revealed by fMRI signals can be naturally described by graph models, Graph neural networks (GNNs) have shown promise for neuropsychiatric disorder classification (Ktena et al., 2017;Parisot et al., 2018;Bessadok et al., 2022;Wu et al., 2020;Isufi et al., 2021;Zheng et al., 2024c). By representing brain regions as nodes and their functional or structural relationships (like functional connectivity (FC)) as edges, GNNs can flexibly capture the topological and dynamic properties of brain networks. Typical GNN architectures stack multiple graph convolutional layers with generic message passing and aggregation schemes, followed by readout layers for classification, enabling the extraction of connectivity patterns relevant to MDD. However, existing GNN methods suffer from limited accuracy and interpretability for MDD classification (Liu et al., 2024b;Zhao & Zhang, 2024). Their limited accuracy stems from treating brain regions uniformly and relying on static connectivity measures without considering critical temporal dynamics and MDDspecific alterations, leading to suboptimal feature extraction (Long et al., 2023;Ding, 2025;Calhoun et al., 2014). Their poor interpretability is primarily due to the lack of mechanisms for explicitly modeling the hierarchical organization of brain networks or the causal relationships between neural circuits that are fundamental to understanding depression (Zhao & Zhang, 2024;Long et al., 2023). Consequently, standard interpretability tools applied to these generic GNNs produce explanations that fail to align with established neuroscientific knowledge (Yu et al., 2024). Specifically, current neuroscience findings have indicated that depression pathophysiology manifests across multiple spatial scales of brain organization, each with distinct characteristics that present unique modeling challenges, as shown in Figure 1. At the local level, MDD patients exhibit altered temporal dynamics and frequency-specific neural oscillatory patterns, particularly in low-frequency bands associated with rumination and deficits in emotional processing (Ding, 2025;Calhoun et al., 2014). At the circuit level, dysfunctional integration within neural networks such as the default mode (DMN), salience (SN), frontoparietal (FPN), limbic (LN), and reward (RN) networks contributes to cognitive and emotional symptoms, with each circuit showing specific patterns of dysregulation (Johnson et al., 2024;Menon, 2011;Kaiser et al., 2015;Hamilton et al., 2011;Whitfield-Gabrieli & Ford, 2012;Noman et al., 2024). At the network level, aberrant causal relationships among the above circuits characterize the global dysregulation observed in MDD, with altered information flow and hierarchical control processes (Presigny & De Vico Fallani, 2022;Friston, 2011;Vidaurre et al., 2017;Morishima et al., 2025;Yeo et al., 2011;Csukly et al., 2024;Pearl, 2009). Developing computational models that incorporate structured neurobiological knowledge is crucial for improving both predictive performance and mechanistic interpretability in MDD identification (Von Rueden et al., 2021;Jiang et al., 2022;Munroe et al., 2024;Ali et al., 2023).

Here, we propose the Neurocircuitry-Inspired Hierarchical Graph Causal Attention Networks (NH-GCAT), a novel framework that bridges the gap between neuroscience and deep learning for explainable MDD identification. NH-GCAT systematically models depression-specific mechanisms across three spatial scales: 1) at the local brain regional level, we design a residual gated fusion (RG-Fusion) module that integrates temporal BOLD features with functional connectivity patterns, specifically engineered to capture depression-relevant low-frequency neural oscillations that conventional static FC approaches often overlook; 2) at the multi-regional circuit level, we propose a hierarchical circuit encoding scheme (HC-Pooling) that aggregates node representations following the established structure of depression-related circuits (DMN, FPN, SN, LN, RN). This biologicallyinformed operation enables modeling of dysregulated inter-regional communication, extraction of circuit-specific functional alterations, and interpretation of how local abnormalities propagate to network-level dysfunction, yielding features aligned with depression neurobiology; 3) at the multicircuit network level, we develop a variational latent causal attention mechanism (VLCA) that leverages a continuous probabilistic latent space to infer directed information flow among critical circuits, characterizing disease-altered whole-brain inter-circuit interactions and providing mechanistic explanations for network-level dysfunctions in MDD.

Our contributions are summarized below: 1) We present a principled approach for integrating depression-specific neurocircuitry knowledge into GNN-based models; 2) We design novel modules (RG-Fusion, HC-Pooling and VLCA) for temporal dynamics integration, hierarchical aggregation and variational latent causal attention that enhance both predictive accuracy and interpretability; 3) We provide extensive empirical evidence that NH-GCAT not only achieves superior classification results but also uncovers mechanistic insights into MDD-related brain network alterations.

Brain Network Identification and Interpretability for MDD. Recent advances in graph neural networks (GNNs) have demonstrated promising results in brain network analysis (Ktena et al., 2017;Parisot et al., 2018;Yu et al., 2024;Dai et al., 2024). These approaches succeed in employing message passing mechanisms to capture region-wise interactions (Kang et al., 2024). For MDD classification specifically, existing GNNs primarily rely on static functional connectivity matrices as input features and treat brain regions as homogeneous nodes without considering their distinct neurobiological roles (Liu et al., 2024b;Zheng et al., 2024a). While some recent works (Kong et al., 2025;Zhao & Zhang, 2024) attempt to incorporate temporal information through sequence modeling, they often fail to effectively capture the low-frequency oscillatory patterns that are crucial for depression diagnosis.

Current interpretable approaches in neuroimaging broadly fall into two categories: post-hoc explanation methods and architecture-constrained models. Post-hoc methods, including attention visualization and feature attribution (Zheng et al., 2024c;Rudin, 2019;Zhang et al., 2023;Sundararajan et al., 2017;Veličković et al., 2018), provide limited insight into neurobiological mechanisms. Architecture-constrained approaches incorporate anatomical priors (Von Rueden et al., 2021;Zheng et al., 2024b;Liu et al., 2024a;Jiang et al., 2020), but typically treat these as static constraints rather than modeling dynamic disease processes.

Techniques for Neural Circuit Modeling. Residual gating mechanisms have demonstrated success in natural language processing (Tai et al., 2015;Greff et al., 2016;Choi et al., 2018) and time series analysis (Bresson & Laurent, 2017;Chen et al., 2019), allowing models to selectively integrate information streams. When applied to neural time series data such as EEG (Afzal et al., 2024), these approaches effectively capture temporal dynamics while maintaining signal fidelity. Dynamic functional connectivity (Damaraju et al., 2014) and frequency-specific neural oscillations (Tadayonnejad et al., 2016) have been extensively investigated in fMRI research; however, their integration with graph neural networks remains limited.

Hierarchical representation learning in graph structures has shown significant utility across domains including molecular property prediction and social network analysis (Ying et al., 2018). Most approaches, however, employ generic clustering objectives rather than leveraging domain-specific organizational principles. In neuroscience, hierarchical approaches have been applied to structural brain networks and functional parcellations (Csukly et al., 2024;Liu et al., 2024a;Jiang et al., 2020), but rarely incorporate established circuit-level knowledge.

Variational approaches for inferring latent graph structures (Sanchez-Martin et al., 2021;Bahuleyan et al., 2017) and disentangled representations (Jeong & Song, 2019;Yang et al., 2021) have shown success in uncovering hidden relationships in complex data. Causal methods such as dynamic causal modeling (Friston, 2011) and Granger causality (Seth et al., 2015) provide frameworks for understanding information flow, although these typically operate as separate analytical tools rather than integrated components of classification frameworks (Friston, 2011;Pearl, 2009). Existing approaches that model causality in graph-structured data (Sanchez-Martin et al., 2021;Behnam & Wang, 2024;Wang et al., 2023;Sui et al., 2022;Wang et al., 2022) primarily focus on region-level interactions, leaving circuit-level causal relationships -which align better with neuroscientific theories of depression -relatively unexplored.

We present NH-GCAT (Neurocircuitry-Inspired Hierarchical Graph Causal Attention Networks), a novel framework that integrates neuroscientific domain knowledge with deep learning for explainable and accurate depression classification. As illustrated in Figure 2, NH-GCAT comprises three principal components: 1) RG-Fusion: a residual gated fusion module for integrating temporal BOLD dynamics with functional connectivity patterns, 2) RC-Pooling: a hierarchical circuit encoding scheme that aggregates node representations according to established depression neurocircuitry, and 3) VLCA: a variational latent causal attention mechanism for modeling and interpreting intercircuit interactions. The rationale behind these key architectural design choices is further elaborated on Appendix A.3.

Problem Formulation. Given resting-state fMRI (rs-fMRI) data from N subjects, our objective is to classify each subject as either major depressive disorder (MDD) or healthy control (HC). For each subject i, we obtain a static feature matrix X

(1) i ∈ R n×m , which includes the functional connectivity (FC) matrix computed as the pairwise Pearson correlation between BOLD signals, as well as clinical variables such as age, sex, and education. Additionally, we have a time series matrix of BOLD signals X

(2) i ∈ R n×T , where n is the number of brain regions (ROIs) and T is the number of time points. Each subject is assigned a binary label y (i) ∈ 0, 1, indicating HC (0) or MDD (1). We represent each subject’s brain as a graph

, where V i denotes the set of ROIs and E i encodes the functional connection based edges. The goal is to learn a function f : G → 0, 1 that achieves accurate classification and provides interpretable, neuroscientifically meaningful explanations.

Residual Gated Fusion for Temporal Dynamics Integration (RG-Fusion). The RG-Fusion module is designed to effectively integrate complementary information from both static functional connectivity patterns and temporal BOLD dynamics. This integration is crucial for capturing the full spectrum of neural activity characteristics in rs-fMRI data, particularly the low-frequency fluctuations that are clinically significant in depression neuroimaging. The RG-Fusion module processes X (1) and X (2) through parallel pathways as follows: Temporal Feature Processing. The temporal BOLD signals X (2) are processed through a transformer encoder to capture global dependencies:

where d is the latent dimension. The output is further concatenated with the original static features X (1) and then refined using GraphEncoder module, which dual-path graph convolutions (SAGE-Conv and GATConv), to capture both local and global topological properties:

The processing of GraphEncoder can be formulated as:

Static Feature Processing. Simultaneously, the static features X (1) are processed through fully connected layers (MLP) followed by Gate module and graph attention convolution:

Gate Module. The Gate module leverages an adaptive gating mechanism to selectively integrate information from both pathways, ensuring that the distinctive characteristics of each are effectively retained and utilized:

where σ is the sigmoid activation function, W g and b g are learnable parameters, | denotes concatenation, and ⊙ represents element-wise multiplication. Z 1 and Z 2 denote the two feature vectors that need to be fused.

Residual Connection. We enhance feature discriminability through a hierarchical two-stage attention mechanism coupled with residual gated fusion. First, FeatureAtttention adaptively weights temporal features for each node, followed by NodeAtttention which focuses on depression-relevant brain regions. The attended features H attn are then combined with Z temp via residual gating, and integrated with Z static to produce the final representation Z final .

Z final is transformed through a variational encoder to obtain Z ve , yielding a continuous latent representation that encapsulates both static network properties and dynamic temporal characteristics of brain activity, providing a comprehensive embedding for subsequent modules in the NH-GCAT framework.

Hierarchical Circuit Encoding (HC-Pooling). To incorporate neurobiological priors, we design a hierarchical circuit encoding scheme that aggregates node representations according to the established organization of depression-related neural circuits. Circuit-specific Node Assignment. Let C = c 1 , …, c 5 be depression-related circuits (DMN, SN, FPN, LN, RN). For each circuit c j , we define V cj ⊂ V as its constituent regions based on neuroanatomical knowledge. Adjacency Reconstruction. For V cj , we derive FC matrix A (cj ) by fusing subject and group-level FC priors via a learnable gating mechanism:

where A 1 , A 2 , and A 3 represent individual functional connectivity, MDD group-level average connectivity, and HC group-level average connectivity matrices, respectively. Top-down Hierarchical Organization. For each neural circuit c j , we employ a differentiable topdown hierarchical organization approach using Gumbel-Softmax to assign nodes to different hierarchical levels. First, we compute node embeddings using a Graph Convolutional Network:

We then assign nodes to three hierarchical levels using differentiable masks:

where f 1 and f 2 are linear projection, τ is the temperature parameter for Gumbel-Softmax, and ϵ is a small threshold.

Bottom-up Hierarchical Aggregation. We employ a ChildSumTreeLSTM (Tai et al., 2015) to aggregate information from lower to higher hierarchical levels. For each level l ∈ 3, 2, 1, we compute:

where H and C are the hidden and cell states, ⊙ represents element-wise multiplication.Bottom-up aggregation proceeds as follows:

where h and c represent the hidden and cell states from the TreeLSTM, respectively.

where h sum = k ∈ N h k is the sum of child node representations, N is the set of child nodes, and σ is the sigmoid function. To guide the model toward learning clinically relevant connectivity patterns, we constrain the learned adjacency matrix using group-level priors:

where A yi represents the group-level connectivity prior corresponding to subject i’s label.

Variational Latent Causal Attention (VLCA). To model causal interactions between neural circuits and provide mechanistic explanations for depression, we introduce VLCA, which enables counterfactual reasoning about circuit-level interactions. Given circuit-level embeddings H DMN , H SN , H FPN , H LN , H RN ∈ R B×d , VLCA first computes attention-weighted interactions:

where H ∈ R B×C×d represents stacked circuit embeddings, and A real captures circuit interactions.

Then the attention-weighted representations are encoded into a continuous latent space:

where f encoder is a neural network. For counterfactual reasoning, we replace learned attention with an identity matrix (self-attention only):

Using the same encoder with shared parameters:

The causal effect of circuit interactions is estimated as:

Our learning objective combines classification loss on the causal effect with KL regularization:

where L CE is the cross-entropy loss, D KL is the Kullback-Leibler divergence, and µ prior is either zero or the mean of the input features depending on the prior type.This formulation enables the model to learn interpretable circuit interaction patterns, quantify their causal effect on depression classification, and provide insights into how altered circuit communication contributes to MDD pathophysiology.

Training Objective. Our overall training objective combines multiple loss terms to balance classification performance, representation learning, and causal understanding:

where L cls denotes the cross-entropy loss for MDD classification, L kl represents the KL divergence regularization from the backbone’s variational encoding, and hyperparameters λ kl , λ VLCA , and λ mse balance these competing objectives.

4.1 EXPERIMENTAL SETTINGS Datasets and Preprocessing. We utilized the REST-meta-MDD dataset, comprising 1,601 participants (830 MDD, 771 HC) from 16 sites after rigorous quality control procedures (Yan et al., 2019;Chen et al., 2022). We extracted BOLD time series from 116 regions using the AAL atlas (Tzourio-Mazoyer et al., 2002), computed Fisher z-transformed functional connectivity, and constructed brain graphs using k-nearest neighbors (k=40). Population-level reference graphs were generated for MDD and HC groups to provide connectivity templates for the hierarchical circuit encoding. Details in Appendix A.1.

Baselines and Evaluation. We compared NH-GCAT against general-purpose graph architectures (GAT (Veličković et al., 2018), GIN (Xu et al., 2018), GraphSAGE (Hamilton et al., 2017), GPS (Rampášek et al., 2022), GCN (Kipf & Welling, 2016)) and state-of-the-art MDD classification methods (BrainIB (Zheng et al., 2024c), CI-GNN (Zheng et al., 2024a), LCCAF (Kang et al., 2024), etc.). Performance was evaluated using accuracy (ACC), area under the ROC curve (AUC), F1score, sensitivity (SEN), and specificity (SPE), with 5-fold and leave-one-site-out cross-validation protocols. Details in Appendix A.2.

Implementation. Our model used 128-dimensional hidden layers with a 64-dimensional singlehead causal attention mechanism. We employed Adam optimizer with gradient clipping and dynamic weight scheduling for regularization terms. All experiments are implemented using the Py-Torch framework, and computations are performed on one NVIDIA RTX 4090 GPU. More details can be found in Appendix A.4.

Overall Classification Results. Table 1 presents a comprehensive comparison between our proposed NH-GCAT model and a range of state-of-the-art methods and strong baselines on the RESTmeta-MDD dataset. NH-GCAT achieves the highest performance across four out of five metrics, demonstrating its effectiveness for MDD classification. Specifically, NH-GCAT attains an AUC of 78.5% (1.7), accuracy of 73.8% (1.4), specificity of 71.0% (6.6), and F1-score of 75.0% (1.8), substantially outperforming competing models in these key metrics. Notably, NH-GCAT surpasses the previous best AUC (75.6%) from LCCAF (Kang et al., 2024) by a significant margin of +2.9%, and improves upon the strongest accuracy (73.0%) of BPI-GNN (Zheng et al., 2024b) by +0.8%. The F1-score exhibits a substantial gain of +2.4% over the best competing method (LGMF-GNN).

For specificity, NH-GCAT achieves 71.0%, representing a modest improvement of +0.3% over the previous best (LCCAF, 70.7%). While NH-GCAT achieves the second-best sensitivity at 76.4%, it falls short of GAT-Baseline’s 77.5% by only 1.1%, indicating competitive performance in detecting MDD cases. Furthermore, we observe that external models exhibit inconsistent performance across metrics. For instance, while LCCAF achieves competitive AUC and specificity, it shows substantial variation in accuracy (70.2% ± 8.3%). Among our implemented baselines, GAT-Baseline achieves the highest sensitivity but suffers from poor specificity (57.2%), indicating a significant trade-off between correctly identifying positive and negative cases. In contrast, NH-GCAT maintains balanced and robust performance across all metrics, with consistently low standard deviations, demonstrating its stability and reliability for clinical applications where both high sensitivity and specificity are crucial for accurate diagnosis. (Zhang et al., 2023) 66. 6 (5.2) 65.6 (4.3) 63.4 (11.2) -64.6 (6.0) GC-GAN (Oh et al., 2024) -66.8 (4.3) 70.2 (7.9) 63.1 (8.4) 68.7 (4.6) DSFGNN (Zhao & Zhang, 2024) 71.6 67.1 65.4 -67.3 BPI-GNN (Zheng et al., 2024b) -73.0 (1.0) –72.0 (1.0) TEM (Dai et al., 2024) 70.7 68.6 69.8 67.9 -CI-GNN (Zheng et al., 2024a) -72.0 (2.0) –70.0 (1.0) LGMF-GNN (Liu et al., 2024b) 73.7 (2.7) 71.3 (1.5) 73.5 (6.3) -72.6 (2.1) BrainNPT (Hu et al., 2024) 70.6 (3.5) 66.7 (3.6) —MSSTAN (Kong et al., 2025) 67 Improvement +4.5 -3.0 -5.6 +1.1 +2.3 -3.2 +10.8 -11.2 0.0 -7.5 -5.5 +8.6 +10.1 -1.3 -1.3 +8.9 +4.1

Leave-One-Site-Out Generalization. Table 2 shows the leave-one-site-out cross-validation (LOSO-CV) accuracy for NH-GCAT, CI-GNN, and BrainIB across 16 sites. NH-GCAT consistently achieves higher or competitive accuracy on most sites, with a sample-size weighted-average accuracy of 73.3%, outperforming both CI-GNN (69.2%) and BrainIB (68.8%). Specifically, NH-GCAT attains the highest accuracy on 8 out of 16 sites, and achieves notable improvements (e.g., +10.8% and +10.1%) on sites 7 and 13, respectively. Nevertheless, it underperforms on a few sites (e.g., sites 2, 3,6,8,10,11,14,15), which may be attributed to site-specific variations such as data imbalance or heterogeneity in acquisition protocols. Despite these fluctuations, the overall improvement in weighted-average accuracy (+4.1% over CI-GNN and +4.5% over BrainIB) demonstrates the robustness and generalizability of NH-GCAT across diverse clinical sites. Site-specific performance are provided in Appendix A.6.

Table 3 quantifies each component’s contribution to NH-GCAT’s performance. The RG-Fusion module improves AUC (+3.3%) and accuracy (+2.5%) over the GAT baseline, with a notable increase in specificity (+13.4%), demonstrating the value of integrating temporal BOLD dynamics with static functional connectivity. Adding VLCA further enhances AUC (+1.1%), accuracy (+1.8%), and F1 score (+3.1%), confirming the importance of modeling causal circuit interactions. The complete model with HC-Pooling achieves statistically significant improvements over the baseline in AUC (+7.0%), accuracy (+6.1%), specificity (+13.8%), and F1 score (+3.8%), while maintaining competitive sensitivity performance. These results validate our neurocircuitry-inspired design choices and their contributions to MDD classification. More details in Appendix A.7.

NH-GCAT provides neuroscientifically meaningful explanations for MDD pathophysiology through three complementary analyses (Figure 3).

Frequency-specific Neural Dynamics. We validated our RG-Fusion module by separately feeding low-frequency (0.01-0.08 Hz) and high-frequency (0.1-0.25 Hz) BOLD signals into the trained model. Our RG-Fusion module shows significantly higher AUC with low-frequency inputs (0.742±0.019) versus high-frequency inputs (0.679±0.032) (p = 0.0037). This confirms that our model captures depression-relevant neural oscillations predominantly manifested in low-frequency BOLD dynamics, as shown in Figure 3(a).

Hierarchical Circuit Organization. (5) FPN receives increased reward network input with concurrent reduction in limbic system input, suggesting altered affective influence on cognitive control processes; and (6) LIN shows significant loss of input from salience networks, potentially disrupting appropriate emotional responses to salient stimuli. These circuit-level reception abnormalities align with core MDD symptoms including negative bias in self-referential processing, emotional dysregulation, compensatory cognitive control, and impaired integration of salience and affective information. More analyses are provided in Appendix A.8.3.

We present NH-GCAT, a neurocircuitry-inspired hierarchical graph causal attention network that integrates temporal dynamics, hierarchical circuit encoding, and causal interactions for explainable MDD identification. NH-GCAT achieves state-of-the-art performance and provides interpretable insights into depression-related brain network alterations. Our findings underscore the value of embedding neuroscientific priors into deep learning frameworks to advance interpretable and clinically meaningful neuropsychiatric diagnosis.

Reproducibility Statement. A key methodological innovation lies in its federated preprocessing framework, where all participating sites implemented identical computational pipelines for spatial normalization and functional connectivity estimation prior to centralized analysis. This design explicitly addresses heterogeneity challenges in multi-site neuroimaging studies through protocol harmonization at both data acquisition and processing stages.

Sample Selection. Following the protocols established in the original REST-meta-MDD publication (Yan et al., 2019), from the initial collection of 1,300 MDD patients and 1,128 healthy controls (HC), we selected 848 MDDs and 794 HCs from 17 sites for our analysis, yielding a preliminary dataset of 1,642 participants. All participants provided written informed consent, and the study protocols were approved by the local ethics committees of participating institutions.

Quality Control. Following REST-meta-MDD consortium guidelines (Chen et al., 2022), we excluded data from Site 4 due to duplication with Site 14 during quality control procedures. The final analytical cohort comprised 1,601 participants (830 MDD, 771 HC) distributed across 16 research sites after implementing standardized data cleaning protocols. This rigorous quality control procedures ensures data reliability.

A.1.2 BRAIN PARCELLATION AND GRAPH CONSTRUCTION ROI Extraction. Following preprocessing, we extracted regional BOLD time series from 116 anatomically defined regions using the Automated Anatomical Labeling (AAL) atlas (Tzourio-Mazoyer et al., 2002). This atlas was selected for its established validity in neuropsychiatric research and comprehensive coverage of cortical and subcortical structures implicated in depression pathophysiology. For each subject, we derived two complementary feature sets:

• Temporal features: 116 regional BOLD time series (116 × T matrix, where T represents the number of time points), capturing the dynamic neural activity patterns across the brain.

• Multi-dimensional static features: We implemented an overlapping sliding window approach (window length T=90, stride S=45) to extract: (1) a 116 × 116 functional connectivity matrix computed as Fisher z-transformed Pearson correlations between regional time series;

(2) spectral characteristics including variance and low-frequency power (0.01-0.1 Hz) for each region, which captures neurobiologically relevant oscillations associated with resting-state networks; and (3) demographic variables including age, sex, and education level to account for potential confounding factors. Brain Graph Construction. To construct brain graphs for our graph neural network approach, we employed a k-nearest neighbors (k=40) algorithm using the functional connectivity matrix as edge weights. This sparse graph construction approach preserves the strongest functional connections while reducing computational complexity and noise. We chose k=40 based on preliminary experiments indicating optimal performance while maintaining physiologically plausible network topology.

Reference Graph Templates. We constructed group-level reference graph templates by averaging functional connectivity matrices within diagnostic groups:

• MDD group-level template: Average connectivity pattern across all MDD subjects in the the training set.

• HC group-level template: Average connectivity pattern across all healthy controls in the training set.

These templates provided prior knowledge for our hierarchical circuit encoding scheme, enabling the model to learn connectivity patterns characteristic of each diagnostic group.

For neurocircuitry-informed analysis, we leveraged established neuroscientific knowledge to assign AAL regions to five depression-relevant neural circuits: Default Mode Network (DMN), Frontoparietal Network (FPN), Salience Network (SN), Limbic Network (LN), and Reward Network (RN). This assignment followed consensus mappings from multiple sources in the depression neuroimaging literature (Menon, 2011;Kaiser et al., 2015;Hamilton et al., 2011;Whitfield-Gabrieli & Ford, 2012).

In Table 1 of the main paper, we present performance comparisons between NH-GCAT and various baseline methods. For transparency and to ensure fair comparison, we provide a detailed explanation of our comparison methodology below.

Data Partitioning. For model evaluation, we utilized two complementary strategies:

• 5-fold cross-validation: Data were randomly partitioned into 5 folds with stratification to maintain diagnostic class distribution. • Leave-one-site-out cross-validation: Each site was sequentially held out as a test set, with the remaining 15 sites used for developing model.

This dual evaluation approach allowed us to assess both general performance and cross-site generalizability of our model.

Comparison with Published State-of-the-Art Methods. For specialized MDD classification methods, we report performance metrics as published in their respective papers. This approach is scientifically justified for several reasons:

  1. Common Dataset: All compared methods were evaluated on the REST-meta-MDD dataset, the same dataset used in our study. As shown in Table 5, most studies used comparable sample sizes (approximately 1,600 subjects), with minor variations due to different quality control procedures. 2. Standardized Brain Atlas: The majority of compared methods (BrainIB, BPI-GNN, TEM, CI-GNN, LGMF-GNN, BrainNPT, MSSTAN) used the AAL atlas for brain parcellation, matching our approach and ensuring comparable region definitions. 3. Similar Cross-validation Strategies: Most methods employed either 5-fold or 10-fold cross-validation protocols, with LCCAF, GC-GAN, DSFGNN, and our approach specifically using 5-fold cross-validation. Addressing Methodological Variations. While direct reimplementation of all baseline methods would be ideal, it presents several practical challenges:

  2. Implementation Complexity: Many specialized methods involve complex architectures with numerous hyperparameters. Reimplementing these without access to original code could introduce unintentional modifications that affect performance. 2. Computational Constraints: Training multiple deep learning models on large neuroimaging datasets requires substantial computational resources, particularly when hyperparameter optimization is necessary for fair comparison. 3. Established Practice: In neuroimaging machine learning research, comparing with published results on standardized datasets is an established practice, particularly when evaluating on large, publicly available datasets like REST-meta-MDD.

To mitigate potential concerns about comparison fairness, we took several additional steps:

  1. Our Implemented Baselines: We implemented five general-purpose graph neural networks (GCN, GIN, GraphSAGE, GPS, GAT) ourselves using identical preprocessing, feature extraction, and evaluation protocols as our NH-GCAT model. This provides a controlled comparison with widely-used graph learning architectures.

We report the same set of evaluation metrics (AUC, accuracy, sensitivity, specificity, F1 score) as used in the original papers, enabling direct comparison. 3. Multiple Evaluation Protocols: We evaluated NH-GCAT using both 5-fold crossvalidation (for comparison with most methods) and leave-one-site-out cross-validation (for comparison with recent state-of-the-art methods like BrainIB (Zheng et al., 2024c)), ensuring comprehensive benchmarking. 4. Statistical Significance Testing: We conducted rigorous statistical tests to verify that performance improvements are significant and not due to random variation.

This comprehensive approach to baseline comparison-combining published results from specialized methods with our own implementations of general architectures-provides a thorough and fair evaluation of NH-GCAT’s performance within the current landscape of MDD classification methods.

In this section, we elaborate on the rationale behind our key architectural design choices in NH-GCAT. Our overarching philosophy is to infuse neuroscientific domain knowledge as an architectural inductive bias, moving beyond purely data-driven approaches to create a model that is not only accurate but also mechanistically interpretable. We detail the specific motivations for our three core components: Residual Gated Fusion (RG-Fusion), Hierarchical Circuit Encoding (HC-Pooling) with ChildSumTreeLSTM, and the Variational Latent Causal Attention (VLCA) mechanism.

Problem Formulation. The pathophysiology of Major Depressive Disorder (MDD) manifests in both static and dynamic properties of brain networks. Static functional connectivity (FC) provides a time-averaged summary of network topology, while temporal Blood Oxygenation Level Dependent (BOLD) signals capture dynamic, moment-to-moment neural fluctuations. Critically, depression is linked to altered low-frequency oscillations (<0.1 Hz), which are lost when relying solely on static FC matrices. Conventional Graph Neural Network (GNN) models for MDD classification often ignore this temporal dimension, leading to suboptimal feature extraction.

Intuition and Design. The RG-Fusion module is explicitly designed to synergistically integrate these two complementary data modalities. It employs a dual-stream architecture:

  1. A temporal pathway uses a Transformer Encoder to process the raw BOLD time series. The self-attention mechanism is particularly adept at capturing long-range temporal dependencies within the signal, which is crucial for modeling low-frequency oscillations. 2. A static pathway processes the FC matrix using standard graph convolutional layers to learn topological patterns.

The core innovation is the adaptive gating mechanism. Instead of simple concatenation, which would treat both feature streams equally, our gate learns to dynamically weight the importance of temporal versus static information for each brain region. This allows the model to selectively emphasize features most relevant to depression classification on a node-by-node basis. The residual connection ensures stable training and prevents the loss of critical information from the primary temporal pathway during fusion.

Comparison to Alternatives.

• Static FC-based GNNs: This is the most common approach but is fundamentally limited as it discards rich dynamic information contained in BOLD signals, particularly the depression-relevant oscillatory patterns. • Simple Feature Concatenation: A naive concatenation of temporal and static features lacks the flexibility to adaptively prioritize information. Our learned gating mechanism provides a more principled fusion, allowing the model to determine the optimal balance between modalities, which can vary across brain regions and subjects.

Problem Formulation. The human brain is not a flat, homogeneous graph; it possesses a wellestablished hierarchical organization. At a macroscopic level, brain regions form functional circuits (e.g., Default Mode Network (DMN), Salience Network (SN)), which collaboratively govern complex cognitive and emotional processes. Dysfunctions in MDD are often best understood at this circuit level. Standard GNN pooling mechanisms (e.g., global mean/max/sum pooling) are agnostic to this neurobiological reality, collapsing node features into a single vector and losing crucial circuit-specific information.

Intuition and Design. The HC-Pooling module is designed to explicitly model the brain’s multiscale organization by aggregating regional node representations according to a predefined, neuroscientifically validated circuit hierarchy. This transforms node-level embeddings into circuit-level embeddings, aligning the model’s representations with the language of cognitive neuroscience.

Justification for ChildSumTreeLSTM. To perform this hierarchical aggregation, we required an operator capable of processing information on a tree-structured hierarchy. The choice of Child-SumTreeLSTM over other alternatives was deliberate and principled:

  1. Alignment with Hierarchical Structure: Unlike standard LSTMs or GRUs that operate on linear sequences, TreeLSTMs are specifically designed for tree-structured data. Our defined hierarchy, where brain regions (leaf nodes) are grouped into circuits (parent nodes), naturally forms a tree.

The “Child-Sum” variant is particularly suitable for our task. Neural circuits are not uniform in size; some contain many brain regions (children), while others contain few. ChildSumTreeLSTM elegantly handles this variability by summing the hidden states of all child nodes before feeding them into the LSTM cell. This makes it a flexible and robust aggregator for real-world neuroanatomical structures.

Comparison to Alternatives.

• Standard LSTMs/Sequence Models: These are fundamentally incompatible as they cannot process the non-sequential, hierarchical relationships between brain regions within a circuit.

• Generic GNN Layers for Pooling: One could stack more GNN layers to achieve a global representation, but this does not explicitly create distinct, interpretable embeddings for each predefined circuit. Our approach guarantees that the resulting vectors correspond to the DMN, FPN, etc.

• Other Hierarchical Pooling Methods (e.g., DiffPool): Methods like DiffPool learn a hierarchical structure in a purely data-driven manner. While powerful, our objective was to leverage established neuroscientific knowledge as a strong prior. By using a predefined hierarchy and a structure-aware aggregator like ChildSumTreeLSTM, we ensure that the model’s internal organization is neurobiologically meaningful and its subsequent analyses are directly interpretable in the context of existing depression literature.

Problem Formulation. For a model to be truly explainable, it must move beyond identifying correlations to inferring directed influence. We need to understand how dysfunction in one neural circuit might causally impact others. Standard attention mechanisms in GNNs identify which nodes or features are important for a prediction but do not typically model directionality or provide a framework for causal reasoning.

Intuition and Design. The VLCA mechanism is designed to model the directed information flow between the high-level neural circuits derived from HC-Pooling. It achieves this through two key innovations:

  1. Variational Framework: By encoding the learned circuit interactions into a continuous probabilistic latent space, the model learns a robust and smooth representation of intercircuit dynamics, capturing uncertainty in these complex biological systems.

The core of the causal inference lies in comparing the model’s output under two conditions: (a) using the learned, attention-weighted interactions (real), and (b) using a counterfactual scenario where these interactions are removed (i.e., attention is replaced with self-attention only via an identity matrix). The difference in outcomes allows us to estimate the causal effect of inter-circuit communication on the classification of depression. This is integrated directly into the training objective.

Comparison to Alternatives.

• Standard Graph Attention (GAT): GAT computes scalar attention weights that indicate feature importance. It does not inherently model the directional flow of information between high-level conceptual units (our circuits) or provide a mechanism to quantify the causal impact of these interactions.

• Post-hoc Explainability Methods (e.g., GNNExplainer, Integrated Gradients): These methods analyze a trained model to find important features or subgraphs. While useful, they are separate from the learning process. VLCA integrates causal reasoning directly into the model’s architecture and objective function. This encourages the model to learn representations that are inherently causal and interpretable from the outset, rather than attempting to explain a black box after the fact. This architecture-constrained approach generally leads to more robust and faithful explanations.

This section provides comprehensive details about the architecture specifications, hyperparameter settings, and training procedures of NH-GCAT to facilitate reproducibility.

Feature and Node Attention. Two-stage attention with feature-wise attention (single-head, tem-perature=0.1) followed by node-wise attention (single-head, temperature=0.1).

Variational Encoder. 2-layer MLP (hidden dims= 32, 16) for mean and log-variance estimation.

Classifier. The final classification is performed by:

• Circuit Integration: Concatenation of circuit-level embeddings followed by a 2-layer MLP (hidden dims=128, 64) with dropout=0.5.

• Output Layer: Linear layer with 2-dimensional output and softmax activation.

Network Dimensions. The NH-GCAT model maintains consistent hidden dimensions across its components, with the primary embedding dimension set to 128. Specific dimensional configurations for each module are:

• RG-Fusion: The transformer encoder for BOLD signal processing uses 4 attention heads with a hidden dimension of 128. The subsequent graph encoding layers (SAGEConv and GATConv) both produce 64-dimensional outputs that are concatenated to form 128dimensional node representations.

• HC-Pooling: Each circuit-specific hierarchical encoding maintains 128-dimensional representations across all three hierarchical levels. The ChildSumTreeLSTM uses 128dimensional hidden and cell states.

• VLCA: The causal attention mechanism employs single-head attention with a 64dimensional output. The variational encoder projects these into a latent space with dimension 32.

Activation Functions. We employ Leaky ReLU (negative slope = 0.2) for all graph convolutional operations and MLPs within the RG-Fusion module. The gating mechanisms use sigmoid activations, while the ChildSumTreeLSTM follows the standard LSTM activation pattern with tanh and sigmoid functions.

Normalization and Regularization. Layer normalization is applied after each transformer encoder block. We employ dropout (rate = 0.2) after each convolutional operation and within the attention mechanisms. For the probabilistic components, we use a KL divergence regularization term with dynamic weighting.

Parameters and Network Size. Our final NH-GCAT model has approximately 2.1 million trainable parameters.

Table 6 summarizes the key hyperparameters used in our experiments. Training proceeded for a maximum of 300 epochs with early stopping (patience = 20) based on validation performance. The best-performing checkpoint was selected for final evaluation. During training, we dynamically balanced loss terms by applying adaptive weight reduction when specific loss components exceeded predefined thresholds.

For data augmentation, we employed random edge dropout (10%) during training to enhance robustness. The model was trained using a 5-fold stratified cross-validation procedure, ensuring consistent class distribution across folds. For leave-one-site-out validation, we trained on data from 15 sites and tested on the held-out site, repeating this procedure for all 16 sites.

All experiments were implemented using PyTorch 2.5.1 and PyTorch-Geometric 2.6.1. For circuitspecific operations, we developed custom extensions to PyTorch-Geometric to support hierarchical graph operations. Our custom implementation of the ChildSumTreeLSTM was based on the DGL (Deep Graph Library) framework but optimized for our specific hierarchical circuit structure.

The implementation code for NH-GCAT will be made publicly available at https://github. com/author/NH-GCAT upon publication.

To provide a more comprehensive and nuanced evaluation of the proposed NH-GCAT framework, this section extends the performance analysis presented in the main paper. We supplement the primary classification metrics with detailed visualizations of the Receiver Operating Characteristic (ROC) curve, the Precision-Recall (PR) curve, and a Decision Curve Analysis (DCA). These analyses, based on the 5-fold cross-validation results, offer deeper insights into the model’s discriminative ability, its performance on the positive class (MDD), and its potential clinical utility. Our NH-GCAT model achieves a mean Area Under the Curve (AUC) of 0.786±0.017 across the five folds. The consistency across folds, indicated by the narrow shaded region representing the standard deviation, highlights the model’s stability. This result reinforces the findings from Table 1 in the main paper, confirming that NH-GCAT is highly effective at distinguishing between individuals with MDD and healthy controls.

While the ROC curve provides a general view of discriminative performance, the Precision-Recall (PR) curve (Figure 4b) is particularly informative for evaluating a model’s ability to correctly identify the positive class, which in our case are the MDD subjects. This is clinically crucial, as failing to identify a patient (a false negative) can have significant consequences.

NH-GCAT achieves a mean Average Precision (AP) of 0.777 ± 0.024. This is substantially higher than the baseline AP of 0.518, which corresponds to the proportion of positive samples in the dataset. The consistently high precision across a wide range of recall values indicates that when the model identifies a subject as having MDD, it is likely to be correct, and it can do so without missing a large number of actual MDD cases. This demonstrates the model’s reliability for screening or diagnostic support applications.

Beyond standard statistical metrics, it is vital to assess whether a predictive model offers tangible benefits in a clinical setting. Decision Curve Analysis (DCA) is a method for evaluating the clinical utility of a model by quantifying its net benefit across a range of risk thresholds for intervention. The net benefit is calculated by balancing the benefits of true positives against the harms of false positives.

Figure 4c presents the DCA for our NH-GCAT model. The x-axis represents the threshold probability, which is the risk threshold at which a clinician (or a policy) would decide to intervene (e.g., recommend further testing or treatment). The y-axis shows the net benefit. A model is considered clinically useful if its net benefit is higher than the two default strategies: “Treat None” (net benefit is always zero) and “Treat All”.

The curve for NH-GCAT (Mean Model) demonstrates a positive net benefit across a wide and clinically relevant range of threshold probabilities, approximately from 0.10 to 0.75. This means that using the NH-GCAT model to guide clinical decisions would lead to better outcomes than either treating all patients or treating none of them within this wide decision-making range. This analysis provides strong evidence that our model’s predictions are not just statistically significant but also translate into practical clinical value, justifying the use of a sophisticated, interpretable model for this high-stakes task.

This section provides a detailed analysis of our leave-one-site-out cross-validation results, complementing the summary presented in Section 4.2 (Performance Comparison). As noted in the main paper, NH-GCAT achieves the highest accuracy on 8 out of 16 sites (50%) with an weighted-average accuracy of 73.3% across all sites, demonstrating a +4.1% improvement over the best competing methods.

Table 7 presents the comprehensive performance metrics for our NH-GCAT model across all evaluation sites in the REST-meta-MDD dataset. The results highlight our model’s ability to generalize across heterogeneous data collection sites with varying sample sizes and demographic characteristics.

Table 7: Detailed leave-one-site-out cross-validation performance metrics of NH-GCAT across 16 sites from the REST-meta-MDD dataset. For each metric, the best value is shown in bold and the second-best value is underlined. The final row shows the sample-size weighted average. Unweighted Average 74.5 (13.7) 74.1 (11.2) 74.7 (5.6) 74.5 (6.7) 77.0 (7.9) Weighted Average 71.9 (9.5) 74.4 (7.9) 73.3 (4.4) 73.3 (5.2) 76.4 (6.1)

Several key observations can be drawn from these results:

  1. Robustness across sample sizes: NH-GCAT performs well on both large sites (e.g., Site 13 with 250 MDD/229 HC) and small sites (e.g., Site 7 with 20 MDD/17 HC), demon-strating its ability to learn meaningful representations regardless of sample size. This is particularly evident in Site 7, where our model achieves the highest accuracy (86.5%) and AUC (93.5%) despite the limited sample. 2. Performance on balanced vs. imbalanced sites: The model maintains strong performance on both balanced sites (e.g., Site 5 with 48 MDD/48 HC) and imbalanced sites (e.g., Site 12 with 18 MDD/31 HC), indicating robustness to class distribution variations. 3. Consistent sensitivity: In alignment with our findings in the main paper, NH-GCAT demonstrates high sensitivity (74.5% average) across sites, which is clinically valuable for depression screening applications where identifying potential MDD cases is prioritized. 4. Significant improvements on challenging sites: As noted in Section 4.2, our model shows substantial improvements on sites where previous methods struggled, particularly on larger sites like Site 13 (+10.1% improvement) and Site 7 (+10.8% improvement).

These detailed results further validate the effectiveness of our neurocircuitry-informed approach. By incorporating domain knowledge about depression-related neural circuits through our hierarchical circuit encoding scheme, NH-GCAT can better capture the complex patterns of functional dysregulation characteristic of MDD across diverse clinical populations. The model’s strong performance in this rigorous cross-validation setting demonstrates its potential for real-world clinical applications where generalization across heterogeneous data sources is essential.

To thoroughly evaluate the contribution of each component in NH-GCAT, we conducted extensive ablation studies beyond those presented in the main paper. The full VLCA model consistently outperforms simpler attention mechanisms, with notable improvements in AUC (+3.5% over standard attention) and accuracy (+3.7% over standard attention). Interestingly, the deterministic causal variant achieves the highest specificity (70.2%), while the full VLCA model provides the best balance across all metrics. This confirms the importance of modeling both uncertainty and directionality in relationships between neural circuits for accurate depression classification.

A.7.2 ANALYSIS OF HC-POOLING VARIANTS With the RG-Fusion and VLCA components in place, we compared four different depths of hierarchical circuit encoding to evaluate the optimal architecture for capturing depression-related neurocircuitry:

• 1-layer hierarchy: A shallow hierarchical structure with limited capacity to model complex circuit interactions.

• 2-layer hierarchy: A two-level hierarchical organization that captures basic circuit-level relationships.

• 3-layer hierarchy: Our complete three-level differentiable hierarchical pooling that aligns with established neurocircuitry principles.

• 4-layer hierarchy: A deeper hierarchical structure that may introduce unnecessary complexity.

Results demonstrate that the 3-layer HC-Pooling architecture achieves the best overall performance, with AUC (78.5%), accuracy (73.8%), and F1-score (75.0%) all reaching peak values. This confirms our hypothesis that a three-level hierarchy best captures the organizational principles of depressionrelated neural circuits. While the 4-layer variant achieves comparable specificity (70.7%) to the 3layer model (71.0%), it shows reduced performance in other critical metrics including AUC (-2.4%), accuracy (-1.3%), and F1-score (-1.5%), suggesting potential overfitting with excessive hierarchical complexity. The progressive improvement from 1-layer to 3-layer hierarchy demonstrates clear benefits of increased hierarchical depth, with AUC improving from 74.9% to 78.5%, while the performance degradation at 4 layers indicates an optimal complexity threshold.

The ablation studies collectively demonstrate that each component of NH-GCAT contributes significantly to its overall performance. The progressive improvements from the baseline GAT model to the full NH-GCAT architecture highlight the value of our neuroscience-inspired approach. The RG-Fusion module substantially enhances specificity (+13.4%), addressing the baseline’s primary weakness, while the VLCA mechanism with full variational causal modeling outperforms simpler attention variants, improving AUC by +1.1% and F1-score by +3.1% over RG-Fusion alone. The 3-layer HC-Pooling architecture provides the optimal hierarchical structure for modeling depression neurocircuitry, contributing final improvements of +2.6% in AUC and +1.4% in F1-score. These findings support our approach to integrating neuroscience domain knowledge with deep learning for MDD classification, with each design choice validated through systematic ablation analysis.

This section provides an in-depth analysis of the interpretable components of NH-GCAT, examining how each module contributes to model explainability and offers neurobiologically meaningful insights into MDD pathophysiology.

To validate our RG-Fusion module’s ability to capture depression-relevant neural oscillations, we conducted a frequency-specific analysis by separately feeding low-frequency (0.01-0.08 Hz) and high-frequency (0.1-0.25 Hz) BOLD signals into the trained model.

Experimental Setup. We filtered the original BOLD signals into two frequency bands using a bandpass filter implemented in the preprocessing pipeline:

• Low-frequency band (0.01-0.08 Hz): Known to contain depression-relevant neural oscillations (Calhoun et al., 2014) • High-frequency band (0.1-0.25 Hz): Typically considered to contain physiological noise and artifacts

For each frequency band, we performed 5-fold cross-validation using identical train/test splits and model parameters as in our main experiments. We then compared the classification performance (AUC) between the two frequency bands.

Results. Figure 3(a) illustrates the performance comparison between low-frequency and highfrequency inputs. The model achieved significantly higher AUC with low-frequency inputs (mean=0.742, SD=0.019) compared to high-frequency inputs (mean=0.679, SD=0.032). A paired t-test confirmed the statistical significance of this difference (p = 0.0037). Neurobiological Interpretation. These findings confirm that our RG-Fusion module effectively captures depression-relevant neural oscillations predominantly manifested in low-frequency BOLD dynamics. This aligns with previous research indicating that depression-related functional connectivity alterations are most pronounced in the low-frequency band (Calhoun et al., 2014;Ding, 2025). The model’s ability to leverage these frequency-specific patterns contributes to its superior classification performance compared to models that rely solely on static functional connectivity.

We analyzed directional differences in hierarchical layer distributions between MDD and HC groups across depression-related neural circuits, as shown in Figure 5.

Analysis Method. For each subject, our HC-Pooling module assigned brain regions to three hierarchical levels (Layer-1: high-level integration, Layer-2: intermediate processing, Layer-3: primary processing). For each brain region, we calculated the proportion of subjects in each diagnostic group (MDD and HC) that assigned the region to each layer. We then computed the directional difference between these proportions (MDD -HC) and normalized these differences to the range [-1, 1] by dividing by the maximum absolute difference across all regions and layers. This normalization preserves the directionality of effects while enabling direct comparison across regions. Positive values indicate higher proportions in MDD, while negative values indicate higher proportions in HC. Statistical significance was assessed using Chi-square tests of independence.

Results. Our analysis revealed significant between-group differences in hierarchical organization across multiple circuits, with six regions showing statistically significant alterations (Figure 5), and their spatial locations in AAL atlas are shown in Figure 6.

Circuit-specific Interpretations. The directional differences reveal distinct patterns of hierarchical reorganization in MDD:

  1. Default Mode Network (DMN): MDD exhibits a bidirectional reorganization with increased Layer-1 representation in Frontal Sup Medial L (0.39, p < 0.05) but decreased Layer-1 in Angular L (-0.14, p < 0.05). This suggests a functional imbalance within the DMN, with hyperactivity in medial prefrontal regions (associated with self-referential processing) and altered integration in parietal nodes. This pattern aligns with the pathological rumination and altered self-focus characteristic of depression.

  2. Frontoparietal Network (FPN): Frontal Inf Oper R shows substantially decreased Layer-1 representation (-0.88, p < 0.05) and increased Layer-3 representation (0.74) in MDD, indicating a significant reduction in high-level integration of this key cognitive control region. This supports the executive dysfunction hypothesis of depression, where impaired top-down control contributes to negative cognitive biases and difficulty disengaging from negative stimuli.

We observed opposing patterns in limbic regions: ParaHippocampal R showed increased Layer-1 representation (0.81, p < 0.05) while Amygdala R showed decreased Layer-1 (-0.74, p < 0.05) and increased Layer-2 (0.72) representation in MDD. This suggests a reorganization of emotional processing circuits, with altered integration between memory-related (parahippocampal) and emotion-generating (amygdala) regions, consistent with emotional dysregulation in depression.

  1. Reward Network (RN): Caudate L showed the strongest effect, with significantly higher Layer-1 representation in MDD (0.85, p < 0.01) and lower Layer-2 (-0.68). This substantial reorganization of a key reward processing region may reflect compensatory mechanisms for anhedonia, with increased high-level integration potentially serving to counteract reward deficits.

These findings demonstrate how our HC-Pooling module captures clinically meaningful alterations in circuit hierarchy that align with established neurobiological models of depression. The directional nature of these differences provides novel insights into the specific reorganization patterns across hierarchical layers that may contribute to depression pathophysiology.

We leveraged our VLCA mechanism to examine directed information flow among neural circuits, revealing distinct patterns of information reception in MDD versus HC groups.

Analysis Method. For each subject, we extracted the attention weights from the VLCA module, representing the strength of directed connections between circuits. To focus on the most significant connections while reducing noise, we first computed group averages for MDD and HC subjects, then applied a graph pruning technique that retained only the top-2 strongest outgoing connections (excluding self-connections) for each circuit. Finally, we normalized the weights across both groups to facilitate between-group comparison. The normalized weights were visualized as chord diagrams (Figure 3(c-d)).

Results. Quantitative analysis of the attention weights revealed several key differences in circuitlevel information reception between MDD and HC groups, as detailed in Table 10.

Neurobiological Interpretation. Our analysis revealed six key alterations in circuit-level information reception in MDD:

  1. Altered DMN Information Reception: In MDD, DMN receives significantly reduced input from frontoparietal networks (FPN→DMN: 0.361 in HC vs. 0.006 in MDD) and limbic networks (LIN→DMN: 0.113 in HC vs. 0.000 in MDD), while receiving novel input from reward networks (RN→DMN: 0.000 in HC vs. 0.476 in MDD). This reconfiguration suggests impaired cognitive and emotional regulation of self-referential processing, with abnormal integration of reward signals-potentially underlying negative self-focused rumination characteristic of depression.

  2. Reduced Salience Network Modulation: SN receives diminished regulatory input from DMN (DMN→SN: 0.846 in HC vs. 0.652 in MDD) and complete loss of emotional input from limbic networks (LIN→SN: 0.479 in HC vs. 0.000 in MDD), while receiving novel and maximal input from reward networks (RN→SN: 0.000 in HC vs. 1.000 in MDD). This

The three complementary analyses above provide a comprehensive, multi-level interpretation of depression neurobiology through the lens of our NH-GCAT model:

• Local Level (RG-Fusion): Frequency-specific analyses demonstrate the model’s heightened sensitivity to low-frequency neural oscillations associated with depression, thereby facilitating effective pattern recognition and enhancing classification accuracy.

• Circuit Level (HC-Pooling): Hierarchical organization analysis reveals circuit-specific alterations in information processing hierarchy, aligning with clinical manifestations of depression.

• Network Level (VLCA): Causal interaction analysis uncovers altered patterns of directed information flow among neural circuits, characterizing the global dysregulation observed in MDD.

This multi-level interpretability not only enhances the model’s transparency but also provides mechanistic insights into how local neural abnormalities propagate to circuit-level dysfunction and ultimately manifest as network-level dysregulation in depression.

Clinical Implications. The interpretability features of NH-GCAT offer several potential clinical applications:

  1. Biomarker Identification: The frequency-specific neural patterns identified by RG-Fusion could serve as potential biomarkers for depression diagnosis.

  2. Treatment Targeting: The circuit-specific hierarchical abnormalities revealed by HC-Pooling could guide targeted interventions such as transcranial magnetic stimulation (TMS) or deep brain stimulation (DBS).

The causal circuit interactions quantified by VLCA could be used to monitor disease progression and treatment response.

These interpretability analyses demonstrate how NH-GCAT bridges the gap between data-driven machine learning and neuroscientific understanding, offering both predictive power and mechanistic insights into depression pathophysiology.

A.9 DISCUSSION ON CLINICAL RELEVANCE AND FUTURE DIRECTIONS Beyond classification accuracy, a primary goal of developing mechanism-aware models like NH-GCAT is to bridge the gap between computational findings and clinical practice. This section discusses the clinical relevance of our model’s neurobiological findings, particularly those from the Variational Latent Causal Attention (VLCA) module, and outlines a key future direction in personalized psychiatry.

Alignment with Known Pathophysiology. Our VLCA module identified abnormally increased directed information flow from the Reward Network (RN) to the Default Mode Network (DMN) as a significant feature distinguishing individuals with MDD from healthy controls. This finding is highly congruent with established neurobiological theories of depression. It provides a plausible mechanistic link between two core symptom domains: anhedonia (a blunted response to reward, associated with RN dysfunction) and pathological rumination (maladaptive, self-referential thought, associated with DMN hyperactivity). The model’s discovery suggests a pathway through which dysfunctional reward signals are pathologically integrated into the brain’s self-referential processing stream, perpetuating a cycle of negative self-focus and diminished pleasure.

Alignment with Treatment Mechanisms. Crucially, the inter-circuit connections highlighted by our model are not merely statistical artifacts; they represent known targets for antidepressant interventions. The DMN, and its connectivity with other large-scale networks, is a well-established locus of modulation for various treatments, including Selective Serotonin Reuptake Inhibitors (SS-RIs). For instance, multiple studies have demonstrated that successful antidepressant treatment is associated with the normalization of DMN connectivity patterns (Dunlop et al., 2017). Therefore, the RN→DMN hyperconnectivity identified by NH-GCAT represents a clinically relevant and treatment-sensitive neurobiological signature, validating that our model is learning features with genuine clinical significance.

Potential for Predicting Therapeutic Response and Personalized Medicine. The strong alignment between our model’s findings and known treatment mechanisms points directly to a critical future application: predicting individual therapeutic response. While traditional group-level analyses can identify general biomarkers, NH-GCAT can quantify the strength of these directed circuit interactions (e.g., the RN→DMN connection) on a subject-specific basis. This capability allows for the formulation of a precise, testable clinical hypothesis: The baseline magnitude of RN→DMN information flow in a patient, as quantified by our VLCA module, may serve as a predictive biomarker for their response to therapies known to target reward and rumination circuits.

For example, patients exhibiting extreme hyperconnectivity might be predicted to respond more favorably to treatments designed to decouple these systems, such as specific classes of antidepressants, ketamine, or targeted psychotherapies like cognitive behavioral therapy.

Validating this hypothesis requires longitudinal datasets containing pre-and post-treatment neuroimaging data, which was beyond the scope of the current study. Nevertheless, the ability of NH-GCAT to generate such specific, interpretable, and individual-level neurocomputational markers underscores its potential as a tool for advancing personalized psychiatry, moving beyond one-sizefits-all diagnostic labels toward biologically informed, individualized treatment strategies.

Limitations. While NH-GCAT demonstrates strong performance and interpretability, several limitations remain. First, the model is trained and evaluated solely on the REST-meta-MDD dataset, which predominantly comprises Chinese participants. This may limit its generalizability to populations with different genetic backgrounds or cultural contexts. Second, depression is inherently heterogeneous, yet our current framework does not distinguish between clinical subtypes due to limited phenotypic information. Third, our neurocircuitry-inspired design relies on predefined circuit definitions from the literature, potentially overlooking individual variability in circuit organization.

Potential Societal Impacts. Given that our research involves psychiatric disorder diagnosis, it is important to consider its broader societal implications. NH-GCAT has the potential to enhance our understanding of depression neurobiology and improve diagnostic accuracy, particularly in cases where traditional clinical assessment is challenging. By providing objective, brain-based markers of depression, our approach could help reduce stigma associated with psychiatric disorders and validate patients’ experiences. However, as with any AI-assisted diagnostic system, NH-GCAT should be viewed as a complementary tool to support clinical decision-making rather than replace comprehensive psychiatric evaluation. The final diagnostic decisions should always integrate neuroimaging findings with clinical expertise and patient-reported symptoms. As we move toward clinical translation, developing appropriate guidelines for responsible implementation will be essential.

During the preparation of this manuscript, we utilized a large language model (LLM) as a writing assistance tool. The primary role of the LLM was to aid in polishing the text by improving grammar, clarity, style, and conciseness. The LLM was not used for generating core research ideas, proposing methodologies, conducting experiments, analyzing results, or drawing scientific conclusions. All claims, results, and the scientific narrative remain the original work of the authors, who take full responsibility for all content presented in this paper.

Leave-one-site-out cross-validation accuracy (%) for MDD classification across 16 sites on REST-meta-MDD dataset. MDD and HC indicate sample sizes per site. The final column shows the sample-size weighted average (W. Avg.).

  • Statistically significant improvement over GAT-Baseline (p < 0.05, Wilcoxon signed-rank test).

The REST-meta-MDD initiative(Yan et al., 2019) constitutes the largest multi-center neuroimaging repository for Major Depressive Disorder (MDD) research, accessible via the consortium’s official platform 1 . This dataset aggregates resting-state fMRI (rs-fMRI) scans from 25 clinical centers across China, employing standardized rs-fMRI acquisition protocols to ensure cross-site consistency.

The REST-meta-MDD initiative(Yan et al., 2019)

The REST-meta-MDD initiative

Project portal: http://rfmri.org/REST-meta-MDD

📸 Image Gallery

dca_curve_test.png neural_circuits_directional_diff_heatmap.png pr_curve_test.png roc_curve_test.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut