Reading time: 55 minute
...

📝 Original Info

  • Title:
  • ArXiv ID: 2512.18315
  • Date:
  • Authors: Unknown

📝 Abstract

Observational studies in fields such as epidemiology often rely on covariate adjustment to estimate causal effects. Classical graphical criteria, like the back-door criterion and the generalized adjustment criterion, are powerful tools for identifying valid adjustment sets in directed acyclic graphs (DAGs). However, these criteria are not directly applicable to summary causal graphs (SCGs), which are abstractions of DAGs commonly used in dynamic systems. In SCGs, each node typically represents an entire time series and may involve cycles, making classical criteria inapplicable for identifying causal effects. Recent work established complete conditions for determining whether the micro causal effect of a treatment or an exposure X t-γ on an outcome Y t is identifiable via covariate adjustment in SCGs, under the assumption of no hidden confounding. However, these identifiability conditions have two main limitations. First, they are complex, relying on cumbersome definitions and requiring the enumeration of multiple paths in the SCG, which can be computationally expensive. Second, when these conditions are satisfied, they only provide two valid adjustment sets, limiting flexibility in practical applications. In this paper, we propose an equivalent but simpler formulation of those identifiability conditions and introduce a new criterion that identifies a broader class of valid adjustment sets in SCGs. Additionally, we characterize the quasi-optimal adjustment set among these, i.e., the one that minimizes the asymptotic variance of the causal effect estimator. Our contributions offer both theoretical advancement and practical tools for more flexible and efficient causal inference in abstracted causal graphs.

📄 Full Content

Understanding causal relationships is a central goal in many scientific disciplines, such as epidemiology [Hernan andRobins, 2023, Glemain et al., 2025], economics [Hünermund and Bareinboim, 2023], climate science Runge et al. [2019], and industry [Assaad et al., 2023]. Causal inference from observational data, in particular, poses fundamental challenges due to the presence of confounding bias. A widely used framework for representing and reasoning about causal relationships is based on Structural Causal Models (SCM) [Pearl, 2009]. An SCM is a set of structural equations that define how each observed variable is generated from its causes, thereby specifying a causal data-generating process. Usually, the SCM is supposed to be unknown, but the Directed Acyclic Graph (DAG) it induces-representing the qualitative causal structure among variables-is known. In the induced DAG, nodes represent variables and directed edges denote direct causal influences. These graphs provide a powerful language for articulating causal assumptions and determining whether a given causal effect is identifiable from observational data. There exist several tools for identifying (i.e., removing confounding bias without adding new biases) a causal effect of interest in DAGs among which the do-calculus-a set of three rules that, when applied iteratively,-form a sound and complete tool for causal effect identification in non-parametric settings. However, while powerful in theory, the do-calculus [Pearl, 2009] can be highly complex to apply in practice today. Even when identification is possible, the resulting formulas (e.g., the front-door formula [Pearl, 2009] [Piccininni et al., 2023]) may not be compatible with standard estimators such as ordinary least squares regression or propensity score methods. As a result, researchers often prefer simpler graphical criteria like the back-door criterion [Pearl, 2009] or the generalized adjustment criterion [Perkovic et al., 2016], which offer intuitive graphical tests to identify causal effects by covariate adjustment which is compatible with standard estimators.

However, in complex systems with high-dimensional data or partial observability, specifying the DAG is often infeasible or undesirable due to uncertainty, noise, or computational cost. To address these challenges, researchers have proposed various abstractions of DAGs that simplify or summarize the underlying causal structure [Perkovic et al., 2016, Beckers and Halpern, 2019, Anand et al., 2023, Assaad et al., 2023, 2024, Ferreira and Assaad, 2025]. These abstractions are particularly valuable in fields like epidemiology, where large-scale longitudinal studies often involve repeated measurements over time and many partially observed variables. One such abstraction is the cluster graph [Anand et al., 2023, Assaad et al., 2023, 2024, Ferreira and Assaad, 2025], which groups variables into clusters. A notable and particularly useful instance of this is the summary causal graph (SCG) [Assaad et al., 2023, 2024, Ferreira and Assaad, 2025], where each node represents a time series. SCGs are especially well-suited for applications involving time series or cohort data, where variables are naturally organized over time. These graphs preserve causal relations between time series while abstracting away the time. While adjustment criteria for causal inference have been well-studied in the context of DAGs-notably the back-door criterion [Pearl, 1993] and its extensions to the generalized adjustment criterion [Perković et al., 2015, Perkovic et al., 2016]-the theory of adjustment in SCGs remains comparatively underdeveloped. SCGs pose two additional challenges: they may contain cycles, and a single node can represent an entire cluster of variables, both of which complicate the identification of valid adjustment sets.

In recent years, a significant advancement was made for identifying causal effects in SCGs Assaad et al. [2023Assaad et al. [ , 2024]], Ferreira and Assaad [2025]. In particular, Assaad et al. [2024] provided sound complete [Yvernes et al., 2025] graphical conditions for identifying by covariate adjustment the causal effect of one timepoint X t-γ on another timepoint Y t -known as a micro causal effect and denoted as Pr(y t | do(x t-γ )) [Ferreira and Assaad, 2025] -in SCGs under the assumption of no hidden confounding. However, in Assaad et al. [2024] only two valid unpracticable adjustment sets were presented for any identifiable micro causal effect. This stands in contrast to the tools developed for DAGs-like the back-door criterion-which gives, whenever a causal effect is identifiable by adjustment, multiple valid adjustment sets, each of which is theoretically correct but may differ significantly in practical properties such as statistical efficiency. In particular, while all valid sets remove confounding bias, they can vary widely in terms of how they impact the variance of the estimated effect Rotnitzky and Smucler [2020], Witte et al. [2020], Smucler et al. [2021], Runge [2021], Henckel et al. [2022]. Therefore, being able to search over adjustment sets to identify the one that yields the most precise estimate (i.e., minimal variance) is critical in real-world applications.

In this paper, we start by simplifying the conditions given in Assaad et al. [2024] to identify Pr(y t | do(x t-γ )); then we present the SCG-back-door criterion, an extension of the back-door criterion to SCGs, which provides, whenever Pr(y t | do(x t-γ )) is identifiable by adjustment in SCGs using the conditions given in Assaad et al. [2024], a rich collection of valid adjustment sets. Furthermore, we identify the quasi-optimal adjustment set among this collection, defined as the one that minimizes the asymptotic variance of the causal effect estimator under standard assumptions in at least some distribution compatible with the SCG; it is also the smallest set among all valid adjustment that contains as much as possible of all sets that minimize the asymptotic variance of the causal effect estimator in every distribution compatible with the SCG. We emphasize that this set is not necessary the smallest set. Our results bridge a critical gap in the theory of causal inference from abstracted graphical models and open new directions for variance-aware causal effect estimation in SCGs.

The remaining of the paper is as follows: Section 2 introduces needed preliminaries and terminologies. Section 3 simplifies the results given in Assaad et al. [2024] and introduces the SCG-back-door criterion. Section 4 shows which of the set given by the SCG-back-door criterion is the quasi-optimal. Section 6 empirically validates our theoretical findings. Section 5 discusses the implications and limitations of our results. Finally, Section 7 concludes the paper. All proofs are deferred to the Appendix.

In this work, we consider a Discrete-Time Dynamic Structural Causal Model (DTDSCM), an extension of structural causal models [Pearl, 2009] to dynamic systems. This choice is motivated by the nature of the data encountered in many real-world applications, such as monitoring systems or epidemiological studies, where variables evolve over time and observations are collected at discrete intervals. A DTDSCM allows us to explicitly model temporal dependencies and causal mechanisms operating across time steps, making it well-suited for analyzing interventions. In what follows, we recall the basic formalism of DTDSCM and present the notations and assumptions that will be used throughout the manuscript. We use the convention that uppercase letters represent variables, lowercase letters represent their values, and letters in blackboard bold represent sets. Definition 1 (Discrete-time dynamic structural causal model (DTDSCM)). A discrete-time dynamic structural causal model is a tuple M = (L, V, F, Pr(l)), where

} is a set of exogenous variables, which cannot be observed but affect the rest of the model;

, is a set of endogenous variables, which are observed and every V i t ∈ V is functionally dependent on some subset of

• F is a set of functions such that for all

In this definition of DTDSCMs, the set of functions F encapsulates the causal mechanisms governing the relationships among variables, while L represents the noise, i.e., the unobserved or hidden variables that affect the observed variables, and V represents the observed variables, which we will often refer to as micro variables. The uncertainty associated with these unobserved variables is characterized by the distribution Pr(l), which accounts for the unknown influences outside the observed system. When combined with the causal mechanisms encoded in F, this distribution gives rise to the distribution Pr(v) over the observed variables V. Throughout this paper, we make the following two assumptions about the DTDCM. Assumption 1 (No hidden confounding). We assume that the exogenous variables are mutually independent, and each variable in L is at most in one function in F. Assumption 2 (Causal stationarity). We assume that, for any causal relation between variables, the associated causal effect remains the same at every time step; in other words, causal dependencies are time-invariant.

The first assumption is essential for the theoretical results we derive, while the second is fully required only when working with a single multivariate time series where each time point corresponds to a single observation. This scenario commonly arises in industrial systems, such as chemical plants, manufacturing lines, or energy production facilities, where various sensors continuously monitor quantities like temperature, pressure, flow rate, vibration, and power consumption. In such tightly controlled and stable environments, the system is expected to operate under stationary conditions, making the stationarity assumption reasonable. This scenario can also arise in health, for example, in Intensive Care Units (ICUs), where we focus on the data of a single patient that are continuously monitored through various sensors measuring variables such as heart rate, blood pressure, oxygen saturation, respiratory rate, and temperature. Over relatively short and clinically stable periods, patient physiology can often be assumed to follow approximately stationary dynamics, making the stationarity assumption acceptable. Nevertheless, this second assumption could be relaxed to a “weaker"1 causal stationarity in settings involving multiple multivariate time series-such as in cohort studies-where each time point is associated with multiple independent observations. For example, a case where the data of multiple patients in the ICU is collected. Finally, as most of the literature in causal inference, we assume that each DTDSCM induces a full-time directed acyclic graph (FT-DAG)2 , where every variable in V corresponds to a node in the graph. In this DAG, a directed edge → is drawn from one variable to another if the former serves as an input to the function that determines the latter. Formally, FT-DAGs are defined as follows: Definition 2 (Full Time Directed Acyclic Graph (FT-DAG)). Consider a DTDSCM M. The full-time directed acyclic graph(FT-DAG) G = (V, E) induced by M is defined in the following way:

Each node in an FT-DAG represents a timepoint in a time series. The lag γ refers to the time difference between the variable observed at times t and tγ. We denote the maximum lag γ max as the greatest time lag included in the model.

Let G = (V, E) be a graph which is a set of nodes V, and a set of edges E that connect different pairs of nodes. Two nodes are adjacent if they are connected by an edge. Nodes denote variables and edges denote relationships between two variables. If the edges point from one node to another, the graph is called a directed graph.

(a) SCG. A causal graph is a directed graph where edges represent direct causal effects from one variable to another. A path is a sequence of unique nodes in which each pair of consecutive nodes is adjacent. A directed path from X to Y is a path in which all edges point in the same direction, from the tail node to the head node of the form X → ⋯ → Y . A path is said to be blocked by a set of nodes Z if the set contains a non-collider of the form ← V → or → V → such that V ∈ Z, or a collider of the form → V ← such that neither V nor any of its descendants are in Z. A back-door path from X to Y is any path between X and Y that starts with an edge pointing into X that is X ←. We use the terminology of kinship to denote the relationships between the variables: parents, children, descendants and ancestors.

If there is a directed path between X and Y , then X is an ancestor of Y and Y is a descendant of X. A node is considered as both its own ancestor and its own descendant. We note P a(X, G)

In this paper, we focus on causal effect of a variable X t-γ on another Y t , denoted as Pr(y t | do(x t-γ )) where the do() operator represents an intervention. This is usually referred to as the micro causal effect3 . In particular, we focus on the technical problem of identifying micro causal effects from observational data-that is, expressing these effects using a do-free formula involving only observed variables.

Causal effects are generally identifiable under various sets of assumptions. Among the most widely used are those that include the assumptions of conditional exchangeability and positivity [Hernan andRobins, 2023, Austin, 2011] (these two assumptions together are known as ignorability). The positivity assumption (i.e., that the probability of receiving each level of treatment is strictly positive for all covariates) is, in principle, testable from data, whereas the conditional exchangeability assumption is not and must instead be justified using background or domain knowledge. When these assumptions hold, the micro causal effect of a variable X t-γ on an outcome Y t can be identified using the following adjustment formula:

where Z is a set of covariates that removes confounding bias when adjusted on. In some specific settings, Z can be considered as the set of confounders. This adjustment formula is arguably the most well-known do-free expression for causal effects. Many standard estimators, such as ordinary least squares regression and propensity score methods [Vable et al., 2019], are based on this formula.

Pearl [1993] showed that if the underlying FT-DAG is known, one can test whether the conditional exchangeability assumption holds using a simple graphical condition called the back-door criterion. In practice, there may be multiple sets of covariates Z that allow for identification via Equation 1, and the back-door criterion provides a way to identify such sets. A set Z satisfies the back-door criterion relative to an ordered pair (X t-γ , Y t ) if Z blocks all back-door paths from X t-γ to Y t and Z does not contain any descendant of X t-γ .

If the true FT-DAG is known and if Assumptions 1 and 2 are satisfied, there exists at least one set Z that satisfies the back-door criterion [Perkovic et al., 2016], meaning that in this setup the micro causal effect of X t-γ on Y t is always identifiable using Equation 1. However, in practical applications, using FT-DAGs is often impractical-or even infeasible-particularly in domains with a large number of variables. Such graphs tend to become overly complex, making them hard to interpret and construct. This challenge is further exacerbated in settings where only partial knowledge of the underlying causal relationships is available, limiting our ability to fully specify an FT-DAG. For this reason, researchers often resort to a simpler and more summarized representation known as the summary causal graph (SCG) [Assaad et al., 2022]. [Zhang andBatterman, 2013, Dominici et al., 2006].

Definition 3 (Summary Causal Graph (SCG) with no hidden confounding). Consider an FT-DAG G = (V, E). Under Assumption 1, the summary causal graph (SCG) G s = (V s , E s ) compatible with G is defined in the following way:

An SCG is an abstraction of an FT-DAG where all timepoints in a time series are compacted into one node, referred to as macro variable or cluster. An arrow X → Y exists in the SCG if and only if there is at least one temporal lag 0 ≤ γ ≤ γ max such as X t-γ → Y t exists in the FT-DAG. While multiple FT-DAGs may induce the same SCG, the mapping from an FT-DAG to its corresponding SCG is unique. For example, in Figure 1 and Figure 2, we provide an SCG with two of its compatible FT-DAGs. Unlike an FT-DAG, because of its abstract nature, an SCG can contain cycles. For example, Figure 1 illustrates a real-life application where events were recorded at successive time points. It summarizes the causal relationships between sleep efficiency (X), stress levels (Y ), and total sleep time (W ), each measured over time, into a compact causal graph. In the corresponding FT-DAG, the structure remains acyclic, as each edge respects the temporal order. However, in the associated SCG, a cycle emerges due to the presence of edges such as X t-γ → Y t and Y t-γ → X t . This reflects a plausible real-world dynamic: sleep efficiency on one day influences stress the following day, and conversely, elevated stress levels can impair sleep efficiency the next night. Another example is given in the SCG of Figure 2 which summarizes the causal relationships between Air pollution levels (X), Respiratory hospitals (Y ), and Traffic density (W ). In the example, traffic density and air pollution levels naturally exhibit strong temporal persistence. For example, heavy traffic on a given working day is often followed by similarly congested conditions the next day, and air pollution levels often evolve gradually rather than abruptly. These temporal dynamics are captured in the SCG through self-loops on the macro variables representing traffic density (X) and air pollution levels (W ). Such loops encode the idea that the current state of these variables is partly, influenced by their own past values. In the underlying FT-DAGs, this persistence is encoded by edges of the form X t-γ → X t and W t-γ → W t .

SCG concepts. Let G s = (V s , E s ) be an SCG with a set of macro nodes V s . All concepts introduced for FT-DAGs apply also for SCG. However, for SCGs we also need to introduce additional concepts related to cycles. A cycle is a directed path that starts and ends at the same node, with all intermediate nodes distinct. A self-loop is an edge that connects a node to itself and is considered a trivial cycle of length one. A strongly connected component (SCC) in a graph is a maximal set of nodes such that there is a directed path from each node to every other node in the component.

In particular, each node always belongs to its own SCC, even if no other node is reachable from or to it. We denote by Scc(X, G * ) the strongly connected component of node X in the graph G * . Finally, to map macro-variables V ⊆ V s to specific compatible temporal instants we use the following operator:

The back-door criterion cannot be directly applied to identify micro causal effects using the SCG. For example, consider the simple SCG shown in Figure 2a, where X → Y and W → X. If we apply the back-door criterion to this SCG, one might mistakenly conclude that no adjustment is necessary, implying Pr(y t | do(x t-γ )) = Pr(y t | x t-γ ). However, this is incorrect. To understand why, examine the FT-DAG in Figure 2b, where multiple active back-door paths exist between X t-γ and Y t . These paths can be blocked by the set {X t0 , ⋯, X t-γ-1 , W t0 , ⋯, W t-γ }which simultaneously does not contain any descendants of X t-γ , thus satisfying the back-door criterion. However, using this set in the adjustment formula is impractical in the context of time series, as it would violate the positivity assumption (since the time series correspond to a single observation, making proper adjustment infeasible). To address this, we focus on cases where we avoid adjusting for variables temporally prior to tγγ max . In this specific example, the set {X t-γ-γmax , ⋯, X t-γ-1 , W t-γ-γmax , ⋯, W t-γ } satisfies the back-door criterion and does not violate the positivity assumption under Assumption 2 (since past values within this range can be reused across time as repeated observations), meaning it can be used to estimate the micro causal effect. However, the back-door criterion alone does not provide a way to identify this set using only the SCG. This is merely a simple illustration; in practice, scenarios can be far more complex-such as the one shown in Figure 1-where identifying the micro causal effect becomes highly non-trivial.

Recently, Assaad et al. [2024] introduced a set of conditions for identifying micro causal effects using SCGs. These conditions were later proven to be complete for identifiability under Assumptions 1 and 2 by Yvernes et al. [2025]. Initially, the formulation in Assaad et al. [2024] was relatively complex, requiring the enumeration of all active paths between X t-γ and Y t . An equivalent but somewhat simpler formulation was later provided in Yvernes et al. [2025], although it still involved enumerating certain types of paths and introduced technically involved definitions. In addition to these conditions, Assaad et al. [2024] proposed two valid sets of covariates that can be used in the adjustment formula whenever the identifiability conditions are satisfied. These sets are:

In many applications, it would be beneficial to have more valid adjustment sets, allowing the user to choose the most appropriate one based on factors such as the validity of the positivity assumption or other constraints, like missing data. Furthermore, having multiple adjustment sets paves the way for finding the optimal set, which minimizes the asymptotic variance of the micro causal effect estimator, as characterized in Henckel et al. [2022] for the case when the true FT-DAG is known. The results presented in Henckel et al. [2022] were originally established in the context of linear models using the ordinary least squares estimator. However, subsequent work by Rotnitzky and Smucler [2020] demonstrated that these results also hold in more general non-parametric settings and a class of non-parametric estimators. We emphasize that the characterization of the optimal set given in Henckel et al. [2022] only applies to FT-DAG and not to SCGs. In fact, searching for the optimal adjustment set turned out to be hard as it will be explained in Section 4. A simpler endeavor would be to search for the quasi-optimal set which will be defined formally in Section 4.

Now that we have introduced the necessary terminology and preliminaries, we state the three problems that we aim to solve. Problem 1. We aim to present a formulation equivalent to the conditions given in Assaad et al. [2024] and in Yvernes et al. [2025], but in a simpler (does not require enumerating paths) and more interpretable (only based on kinship relations) form, for identifying micro causal effects from SCGs. Problem 2. We aim to extend the back-door criterion to SCGs, providing a criterion that, under the conditions outlined in Assaad et al. [2024], offers multiple sets of covariates that can be used in the adjustment formula. Problem 3. We aim to characterize the quasi-optimal set, among those found by our extension of the back-door criterion.

We start this section by presenting a simplified version of the identifiability theorem for micro causal effects in SCGs, originally introduced in Assaad et al. [2024] and later proven complete in Yvernes et al. [2025].

) and one of the following holds

This version of the theorem offers a simplification over the formulations presented in Assaad et al. [2024] and Yvernes et al. [2025]. Unlike those, it avoids the need to enumerate activated paths in the SCG, relying instead on kinship relations, which makes it both more interpretable and conceptually clearer. For instance, it is straightforward to verify that the micro causal effect Pr(y t | do(x t-γ )) is identifiable in the SCG shown in Figure 3, as it satisfies Condition A.

Similarly, identifiability also holds in the SCG depicted in Figure 4 due to the fulfillment of Condition B. Finally, the SCG in Figure 1 satisfies the Condition C of the theorem, implying that the micro causal effect, is identifiable in that setting as well. Additionally, as we will show when we introduce the SCG-back-door criterion, the structure of this version of the theorem is particularly well-suited for systematically identifying multiple valid adjustment sets. Before introducing the SCG-back-door criterion, which enumerate valid adjustment sets whenever Pr(y t | do(x t-γ )) is identifiable according to Theorem 1, we introduce additional notions that will be useful to define the criterion. We consider the set of possible descendants of a temporal variable V t1 in an SCG as the set of temporal variables that are descendants of V t1 in at least one compatible FT-DAG and we denoted as pDe(V t1 , G s ). Note that the difference between Henckel et al. [2022] we define the causal nodes cn(X * , Y * , G * ) of a any pair of nodes (X * , Y * )-be it a pair of macro nodes or micro nodes-in a any graph G * -be it an FT-DAG or an SCG-as the nodes on a directed path from X * to Y * , excluding X * . Moreover, for any graph G * and any pair of nodes (X * , Y * ), we define the extended causal nodes ecn(X * , Y * , G * ) as the causal nodes and their strongly connected components.

cn

Definition 4 (SCG-back-door criterion for a micro causal effect). Consider an SCG with no hidden confounding

) then a set Z satisfies the SCG-back-door criterion relative to a pair of variables

Condition A is satisfied and one of the following holds:

  1. P/D ⊆ Z where

t-γ t-γ-γmax , and 2. P/D ⊆ Z and Cycles(X, G s ) = ∅ where

Condition B is satisfied and one of the following holds:

  1. P/D ⊆ Z where

Condition C is satisfied and (P X ∪ P Y ) /D ⊆ Z, and (P all X ⊆ Z, or P all Y ⊆ Z) where

The name “SCG-back-door” reflects the fact that any set that satisfies this criterion in a given SCG satisfies the classical back-door criterion in every FT-DAG that is compatible with the SCG. We organized Conditions A and B in Definition 4 to follow a specific logical order. The intuition behind this structure is as follows. In the first item, we aim to block all back-door paths from the side of the treatment, by adjusting for variables that have an arrow pointing into the strongly connected component of X. In the last items, we shift focus to the outcome side, adjusting for variables that are parents of the causal nodes or extended causal nodes. As for Condition C, which is more restrictive, it requires including all variables that are parents of either X or Y over a restricted time window. Outside of this window, one can choose to block back-door paths either from the side of X or from the side of Y , following the same approach as in Conditions A and B.

These conditions allow us to explore a broader range of adjustment sets compared to those presented in Assaad et al.

[2024], which only propose two possible sets: A 1 , the largest acceptable set that includes all temporal variables except those that are descendants of the treatment, and a second A 2 , shorter alternative set that considers only the ancestors of the treatment and the outcome that are not descendants of the treatment X t-γ . In particular, both adjustment sets A 1 and A 2 can be recovered using the SCG-back-door criterion. Regarding A 1 , since it includes all temporal variables that are not descendants of the treatment, it necessarily contains the variables allowed by the first item in Definition 4. In fact, the set P/D defined in the first item is a subset of A 1 , and extending it to include all such non-descendant temporal variables yields exactly A 1 . Indeed, A 1 satisfies the overall conditions of the SCG-back-door criterion: pDe(X t-γ , G s ) ∩ Z = ∅, and ∀V t ′ ∈ Z, tγγ max ≤ t ′ ≤ t. For example, in Figure 2, with γ = 1 and γ max = 1, we obtain

, which contains all variables satisfying the above conditions. In contrast, the first item of the SCG-back-door criterion accepts, for example, the set Z = {X t-2 , W t-2 , W t-1 }, showing that Z ⊆ A 1 . A similar argument holds for A 2 . Since A 2 includes all possible temporal ancestors of X t-γ and Y t that are not descendants X t-γ , it necessarily contains the adjustment sets P/D in the first item. Extending P/D from the first item to include all remaining ancestors of X and those of Y reconstructs A 2 precisely. The same intuition holds for the other items of the SCG-back-door criterion. Thus, each item in our SCG-back-door criterion returns a large enough collection of adjustment sets that include A 1 or A 2 . In addition, all these adjustment sets are subsets of A 1 .

The following theorem formally establishes the correctness of the SCG-back-door criterion for identifying by adjustment the micro causal effect.

Theorem 2. Given an SCG G s , an ordered pair of variables (X, Y ) and a lag γ, if Z satisfies the SCG-back-door criterion and if, for all values z of variables Z, Pr(x t-γ | z) > 0 then the micro causal effect of X t-γ on Y t is identifiable and is given by Equation 1.

We note that the condition pDe(X t-γ , G s ) ∩ Z = ∅ in the SCG-back-door criterion is too restrictive. This restriction reflects our focus on extending the back-door criterion, which eventhough it is useful at enumerating many adjustment sets, it-unlike the generalized adjustment criterion-does not aim to enumerate all possible adjustment sets. In addition, there are cases where the SCG-back-door criterion fails to capture certain valid adjustment sets. Consider one FT-DAG compatible with the SCG shown in Figure 5a, where the goal is to estimate the effect of X t on Y t . Adjusting for the variables highlighted in red in Figures 5b and5c satisfies items 1 and 2 of Condition A of the SCG-back-door criterion, respectively. However, if the true FT-DAG were known, it would be possible to identify alternative valid adjustment sets that are not selected by the criterion. For example, in Figure 5c, the back-door paths initially blocked by adjusting for R t could equally be blocked by adjusting for U t-1 and U t as in Figure 5d. However, interestingly, as we will show in the next section, although the SCG-back-door criterion does not identify every single valid adjustment set, it does generate a sufficient collection that includes the quasi-optimal adjustment set.

A compatible FT-DAG and the minimal adjustment set that satisfies Condition A, item 1 of the SCG-back-door criteria.

(c) A compatible FT-DAG and the minimal adjustment set that satisfies Condition A, item 3 of the SCG-back-door criteria.

A compatible FT-DAG and another adjustment set that satisfies the backdoor [Pearl, 1993].

Figure 5: Example of an SCG and one compatible FT-DAG with three adjustment sets that satisfy condition A of the SCG-back-door criteria and one adjustment set that satisfies the back-door criteria in Pearl [1993].

4 Finding the quasi-optimal adjustment set for the causal effect in SCGs

Firstly, let us recall the definition of an optimal adjustment set in a DAG [Henckel et al., 2022, Definition 3.12] and adapt it in the case of FT-DAGs. Definition 5 (Optimal adjustment set in an FT-DAG). Let G be a FT-DAG and (X t-γ , Y t ) be two temporal variables of interest such that X t-γ ∈ An(Y t , G). We define the optimal adjustment set of the pair (X t-γ , Y t ) in the FT-DAG G as follows:

• P = P a(cn(X t-γ , Y t , G)/{X}, G), and

Upon its introduction, opt (X t-γ , Y t ) G was shown to yield the smallest asymptotic variance when estimating causal effects under two conditions: the SCM is linear and satisfies Assumption 1, and the causal effect is estimated using the coefficient of the treatment in an ordinary least squares regression of the outcome on the treatment and a valid adjustment set Henckel et al. [2022]. Subsequent work Rotnitzky and Smucler [2020] extended this optimality result to non-parametric SCMs-where no specific assumptions are made about the conditional distributions of nodes given their parents-and to non-parametric estimators, which do not rely on the conditional independencies encoded in the SCM.

However, defining optimality in SCGs is more nuanced than in FT-DAG. We consider that a sound definition of optimality should be that a set opt (X t-γ , Y t ) G s is optimal in an SCG G s if for all adjustment sets Z that are valid in any FT-DAG compatible with G s and for every distribution Pr compatible with any FT-DAG compatible with G s ,

where var Pr (⋅) represents the asymptotical variance under distribution Pr,

and ∆Xt-γ,Yt (Z) is a consistent estimator of ∆ Xt-γ ,Yt (Z) (such as ordinary least squares regression) used to estimate the micro-level causal effect of X t-γ on Y t using the adjustment set Z. The problem of identifying an optimal adjustment set in SCGs proves to be highly challenging. In fact, if we limit ourselves to the results provided in [Henckel et al., 2022], it becomes impossible to fully solve. More precisely, [Henckel et al., 2022, Theorem 1] establishes an inequality that can be used to compare two adjustment sets and determine which one is more optimal-i.e., which yields a lower asymptotic variance-when the sets satisfy different graphical conditions. However, this result is not complete: there exist pairs of sets for which the theorem does not allow a clear comparison in terms of optimality (that said, their approach does guarantee the identification of at least one set that is optimal relative to all others).

In this work, instead of focusing on finding optimal adjustment sets in SCGs, we will instead focus on finding quasi-optimal sets (quasi since we limit ourselves to the results given in Henckel et al. [2022]). We say that a set q-opt (X t-γ , Y t ) G s is quasi-optimal in an SCG if:

(i) q-opt (X t-γ , Y t ) G s is a valid adjustment set for the SCG;

(ii) there exists at least one FT-DAG G compatible with G s such that for all adjustment sets Z that are valid in any FT-DAG compatible with G s and for every distribution Pr compatible G,

(iii) for every G compatible with G s , q-opt (X t-γ , Y t ) G s is the smallest among all valid adjustment for all FT-DAGs compatible with G s that contains as much as possible of the optimal adjustment set of every FT-DAG compatible with G s .

The item (i) mirrors the requirement for optimality in the standard FT-DAG setting. Items (ii) and (iii) highlight the challenge of defining optimality in SCGs: since different FT-DAGs compatible with the same SCG may have different optimal sets, no single adjustment set can be optimal for all of them simultaneously. We argue that item (ii) is essential for any reasonable notion of optimality in SCGs, while item (iii) serves as a strong sufficiency criterion. While we do not present a formal proof, empirical results indicate that adjustment sets containing as much as possible of the optimal set generally produce more accurate estimates than those that do not. This observation motivates the intuition behind item (iii) and guides the design of the quasi-optimal adjustment set, which seeks to balance robustness and performance in the face of SCG ambiguity.

Next, we can characterize the optimal adjustment set given only the SCG. Definition 6 (Quasi-optimal adjustment set in an SCG). Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ). We define the quasi-optimal adjustment set of the pair (X t-γ , Y t ) in the SCG G s as q-opt (X t-γ , Y t ) G s = P/D, where D = pDe(X t-γ , G s ), and :

• If Condition A is verified and Cycles(X, G s ) = ∅ and γ > 0:

• If Condition A is verified and Cycles(X, G s ) ≠ ∅ and γ > 0:

In the following, we give three propositions that show that the set given in Definition 6 satisfies the three items given above, therefore corresponds to the actual quasi-optimal set. Proposition 1. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by Theorem 1. The set q-opt (X t-γ , Y t ) G s satisfies the SCG-back-door criterion.

Proposition 2. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by the SCG-back-door criterion. Suppose there exists a compatible FT-DAG G such that opt (X t-γ , Y t ) G is a valid adjustment set in every compatible FT-DAG. There exist a compatible FT-DAG G in which opt

Figure 6: Example of an SCG and two compatible FT-DAGs illustrating that the optimal adjustment set is not necessarily contained within the quasi-optimal set.

Proposition 3. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by the SCG-back-door criterion. For every compatible FT-DAG G, opt

These propositions suggest that the quasi-optimal set can be interpreted as the union of all optimal adjustment sets across FT-DAGs compatible with the SCG, excluding all possible descendants of X t-γ . This observation is formalized by the following corollary: Corollary 1. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the causal effect from X t-γ to Y t is identifiable. Suppose there exists a compatible FT-DAG G such that opt (X t-γ , Y t ) G is a valid adjustment set in every compatible FT-DAG. Let compatible (G s ) be the set of FT-DAGs compatible with G s . The following equality holds:

Proposition 3 (or Corollary 1) might initially appear overly weak, as it suggests that only a subset of the optimal adjustment set is included within the quasi-optimal set for any FT-DAG compatible with the SCG. This could mistakenly lead one to believe that a valid quasi-optimal set must contain the optimal adjustment set of every such FT-DAG. However, this is not feasible in general: the union of all optimal sets across compatible FT-DAGs is not necessarily a valid adjustment set in each of them, i.e.it does not verify the generalized adjustment criterion in each FT-DAG. In other words, this union may fail to form a valid adjustment set in the SCG. This is illustrated in Figure 6, where U appears as both a parent and a child of the outcome in the SCG shown in Figure 6a. As a result, opt (X t-, Y t ) G / ⊆ q-opt (X t , Y t ) G s in such cases. Indeed, among the FT-DAGs compatible with this SCG, one can construct a FT-DAG (Figure 6c) in which U t is a descendant of the outcome Y t . In that FT-DAG, any adjustment set including U t would be invalid, since adjusting for a descendant of the outcome violates the back-door criterion. Therefore, although U t may belong to an optimal adjustment set in another compatible FT-DAG (e.g., Figure 6b), it cannot be included in a quasi-optimal set that is valid across all compatible FT-DAGs.

Notice that what we defined as quasi-optimal set is a subset of A 1 and A 2 , as we recall that A 1 is the set containing all variables that are not descendants of the treatment and A 2 is the set containing all ancestors of the treatment and the outcome. Interestingly, the optimal set is a subset of A 1 and A 2 if and only if the optimal set is a subset of the quasioptimal (in cases where the optimal set contains no descendants of the treatment). The following corollary relates the quasi-optimal set with the sets A 1 and A 2 introduced in Assaad et al. [2024]: Corollary 2. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the causal effect from X t-γ to Y t is identifiable by Theorem 1. For any distribution Pr compatible with any FT-DAG compatible with G s . Under the same assumptions (briefly discussed above and described in details in Henckel et al. [2022], Rotnitzky and Smucler [2020]) required for the optimality of the set opt (X t-γ , Y t ) G , we have the following:

)), and var Pr ( ∆Xt-γ,Yt (q-opt (X t-γ , Y t ) G s )) ≤ var Pr ( ∆Xt-γ,Yt (A 2)).

The results of this section indicate that the set q-opt (X t-γ , Y t ) G s is optimal in at least one of the FT-DAGs compatible with the SCG and outperforms several other adjustment sets (e.g., A 1 and A 2 ). These findings have direct practical implications: they provide guidance for selecting adjustment sets when working with multiple time series (such as cohort data) generated from a DTDSCM, in cases where only the SCG is known. We further conjecture that these insights may extend to classical time series settings under Assumption 2, offering a principled approach to adjustment set selection in that context as well. (b) Data sampled data every two seconds.

It is important to emphasize that Conditions A, B and C of Theorem 1 rely heavily on the assumptions of a DTD-SCM, in particular that the data collection frequency perfectly matches the data generation process. If this assumption is violated, there is a risk of backdoor paths between X t-γ and Y t consisting entirely of unmeasured variables, which prevents valid adjustment. For instance, as illustrated in Figure 7, if the underlying system generates values at every second-v t0 , v t0+1 , v t0+2 , ⋯v tmax-2 , v tmax-1 , v tmax -but observations are recorded only every two seconds-v t0 , v t0+2 ⋯, v tmax-2 , v tmax -then it is not possible to identify the micro causal effect of X 4 on Y 6 since the backdoor path consisting of the nodes X 3 and Y 3 , which are both unmeasured, cannot be blocked. In addition, discrepancies between the data-generating process and the data collection frequency may lead to misinterpretation in Condition C. For example, in scenarios like the one illustrated in Figure 7, a modeler might mistakenly set γ to 1 to reflect the difference in observed time steps (which is actually 2). This could lead to the erroneous belief that Condition C applies for the micro causal effect of interest Pr(y t | do(x t-1 )), when in reality the actual micro causal effect of interest corresponds to Pr(y t | do(x t-2 )). Such mismatches highlight the need for caution when applying Condition C or any lag-based (where the lag is set to a value greater than zero) identifiability conditions to discretely observed time series. One might assume that Condition B suffers from the same limitation as Condition C, since it also relies on a specific lag-namely, γ = 0. However, having a lag of zero corresponds to an instantaneous relationship, which remains unaffected by differences between the data generation rate and the sampling rate. Thus, Condition B is not subject to the same limitations.

As this paper focuses on time series, it is necessary to define a temporal boundary, since measurements begin at a finite point in time. We choose to set this limit at tγγ max , which places us sufficiently far in the past to ensure that all relevant back-door paths are blocked. This choice also reflects real-world constraints, where infinite-length measurements are not feasible. Moreover, by selecting this time limit, we ensure that no more optimal adjustment set would be discovered by going even further back in time-any variables beyond this boundary would either be redundant or irrelevant for optimality.

We evaluated our results using simulated multivariate time series data. However, it is important to note that the theoretical guarantees underlying our approach extend beyond this setting. Specifically, they apply not only to single multivariate time series but also to collections of multiple multivariate time series, as commonly encountered in cohort studies. As such, we expect similar empirical performance to hold in those broader contexts as well if all assumptions are met. However, it is important to emphasize that our approach is applicable to cohort studies only when the necessary assumptions hold and the data is complete, without missing values or irregularities in the timing of measurements. Furthermore, while our analysis relies on the standard positivity assumption, this condition can be relaxed.

In particular, it suffices that for all values z of the adjustment set Z, either P(z) > 0 or P(x t-γ | z) > 0, as shown by Hwang et al. [2024]. This relaxed condition broadens the applicability of our results to more realistic scenarios where strict positivity may not hold globally.

In many applications, it is falsely assumed that the minimal adjustment set is also the most optimal. This misconception has been challenged, for example, in Henckel et al. [2022] and is further examined in this paper. While minimal adjustment sets are often preferred for their simplicity and efficiency, they do not always lead to the lowest estimation variance or the most robust causal effect estimation. In particular, including additional variables -when carefully chosen -can improve statistical efficiency in the presence of unmeasured confounding structures. This highlights the importance of going beyond minimality and taking into account statistical efficiency when choosing among valid adjustment sets.

We began by randomly generating 1500 graphs-both cyclic and acyclic-each containing five or six nodes. These graphs served as our SCGs. If at least one of the identifiability conditions in Theorem 1 was satisfied, we then generated all compatible FT-DAGs by setting γ max = 1. Since the number of compatible FT-DAGs grows exponentially with the number of cycles in the SCG, we restricted our analysis to SCGs that generate at most 50 compatible densest FT-DAGs, in order to keep computation time manageable. For each selected SCG, we conducted a two-step comparison. First, we identified all valid adjustment sets derived from the SCG, as determined by the SCG-back-door criterion. Then, we searched for all adjustment sets that satisfie the back-door criterion in all the corresponding FT-DAGs. We will call the former sets as the SCG-back-door sets and the latter sets as the common-back-door sets. We compared the two sets to assess the validity of our criterion.

As a result, we observed that all SCG-back-door sets are also common-back-door sets. This empirical comparison allowed us to evaluate a key theoretical property of our method: soundness, meaning that every adjustment set selected by our criterion is guaranteed to be valid across all compatible FT-DAGs. This supports the reliability and robustness of the SCG-back-door criterion.

However, we also observed that not every common back-door set qualifies as an SCG-back-door set. This underscores the incompleteness of the SCG-back-door criterion. It is important to note, however, that our objective was not to establish a complete criterion, but rather to develop a sufficiently expressive one (similarly to the back-door criterion which is itself not complete)-capable of identifying multiple valid adjustment sets, among which a quasi-optimal set can be found.

In this paper, we addressed the problem of identifying micro causal effects from observational data using SCGs.

Building on prior work, we proposed a simplified and interpretable reformulation of the identifiability by adjustment conditions in SCGs originally introduced in Assaad et al. [2024]. Our formulation avoids the enumeration of all active paths and relies solely on kinship-based graph properties, making it more tractable in practice. Furthermore, we introduced the SCG-back-door criterion, a graphical test that extends the classical back-door criterion to the SCG setting, providing a sound and efficient method to enumerate valid adjustment sets whenever the micro causal effect is identifiable by adjustment. In addition, we characterized the quasi-optimal adjustment set among all sets satisfying the SCG-back-door criterion, thus offering a principled way to improve estimation efficiency. Our results not only provide theoretical insights into causal identifiability by adjustment in SCGs but also offer practical tools for applied researchers working with time series data or cohorts where only high-level causal abstractions are available.

Applying the theoretical findings developed in this work to real-world epidemiological datasets would be a valuable next step to assess their practical impact. Additionally, future research could explore characterizing the optimal set from SCGs when there is hidden confounding using the recently proposed SCG-front-door criterion [Assaad, 2025], or investigate how the classical single-door criterion might be extended to SCGs, building on the identifiability conditions introduced in Ferreira and Assaad [2024] for micro direct effect identification in linear SCMs. It would also be interesting to investigate whether the forbidden projection technique introduced in Witte et al. [2020] and which simplifies DAGs while preserving all information relevant to identifying optimal adjustment sets, can be extended to SCGs. Another promising direction is to focus on cohort studies with imperfect data, characterized by irregular measurement timings and missing values. Since current criteria assume well-structured and complete data, it would be valuable to develop approaches that can accommodate the complexities and imperfections commonly encountered in real-world datasets. However, since this work builds on the results of Henckel et al. [2022], which are not yet complete, the question of whether a more optimal adjustment set can be identified within the SCG remains open. We leave this investigation for future work. Proof. This theorem is a reformulation of Theorem 2 -Version 2 of Assaad et al. [2024] which gives necessary and sufficient conditions for identification of a micro causal effect from a SCG. Firstly, they show that if In conclusion Theorem 1 and Theorem 2 of Assaad et al. [2024] are equivalent.

Theorem 2. Given an SCG G s , an ordered pair of variables (X, Y ) and a lag γ, if Z satisfies the SCG-back-door criterion and if, for all values z of variables Z, Pr(x t-γ | z) > 0 then the micro causal effect of X t-γ on Y t is identifiable and is given by Equation 1.

Proof. Firstly, notice that X ∉ An(Y, G s ) and pDe(X t-γ , G s ) ∩ Z = ∅ then in every compatible FT-DAG G one has X t-γ ∉ An(Y t , G) and thus for every Z such that Z ∩ De(X t-γ , G) = ∅, one has Pr(y t | do(x t-γ )) = Pr(y t ) = ∑ z Pr(y t | z)Pr(z) using rule 2 of do-calculus and the law of total probabilities. Then assume that X ∈ An(Y, G s ) and pDe(X t-γ , G s ) ∩ Z = ∅.

• Suppose Condition A is verified -If Condition 1 is verified then in every compatible FT-DAG G, one has P a(X t-γ , G) ⊆ Z and De(X t-γ , G) ∩ Z = ∅ so Z is a valid backdoor set and thus Equation 1 is correct. -If Condition 2 is verified then in every compatible FT-DAG G, and for every backdoor path π from X t-γ to Y t one can consider the following:

  • Either the last edge of π goes out of Y t , then because Scc(X, G s ) = ∅ and X ∈ An(Y, G s ), one knows that there exists a collider in π which is in De(Y t , G s ) ⊆ pDe(X t-γ , G s ). In this case, π is blocked because of this collider. * Either there exists V t V in π such that t V > t, then there exists a collider C t C with t C > t. In this case, π is blocked because of this collider. * Either the last element of π which is not in

then because Scc(X, G s ) = {X} one knows that ecn(X, Y, G s ) ∩ An(X, G s ) = ∅ so there exists a collider in π which is in De([ecn(X, Y, G s )] t t-γ , G) ⊆ pDe(X t-γ , G s ). In this case, π is blocked because of this collider. * Either the last element of π which is not in

then one knows that this element is the middle node of either a fork or a chain in π and that it is in P a([ecn(X, Y, G s )] t t-γ , G) ⊆ P = [P a(ecn(X, Y, G s )/{X}, G s )] t t-γ-γmax /pDe(X t-γ , G s ). In this case, π is blocked because of this fork or chain. * Note that the condition “Cycles(X, G s ) = ∅” guarantees the existence of an element of π which is not in [ecn(X, Y, G s )]

then one knows that this element is the middle node of either a fork or a chain in π and that it is in P a([cn(X, Y, G s )] t t-γ , G). Either this node is not in pDe(X t-γ , G s ) and π is blocked because of this fork or chain. Either this node is in pDe(X t-γ , G s ) in which case one should consider the last element of π which is not in

then either π is blocked by Z 2 or we adjust on this element thus blocking π. * Note that the condition “γ = 0” guarantees the existence of an element of π which is not in

-If Condition 4 is verified then a similar reasoning as for Condition 2 can be applied. The only difference lies in the fact that if the last element of π which is not in [ecn(X, Y, G s )] t t-γ and is a parent of ecn(X, Y, G s ) is X t ′ for tγ < t ′ then X t ′ ∈ pDe(X t-γ , G s ) and π will not be blocked, which is why one needs to adjust on [P a(X, G s )] t-γ-γmax+1 t as well in order to block the path.

• Suppose Condition B is verified -If Condition 1 is verified then in every compatible FT-DAG G and for every backdoor path, the first variable which is not in pDe(X t-γ , G s ) is in Z thus Equation 1 is correct. -If Condition 2 is verified then in every compatible FT-DAG G, and for every backdoor path π from X t-γ to Y t one can consider the following: * Either the last edge of π goes out of Y t , then because Scc(X, G s ) = ∅ and X ∈ An(Y, G s ), one knows that there exists a collider in π which is in De(Y t , G s ) ⊆ pDe(X t-γ , G s ). In this case, π is blocked because of this collider. * Either there exists V t V in π such that t V > t, then there exists a collider C t C with t C > t. In this case, π is blocked because of this collider. * Either the last element of π which is not in

then one knows that this element is the middle node of either a fork or a chain in π and that it is in P a([cn(X, Y, G s )] t t-γ , G). Either this node is not in pDe(X t-γ , G s ) and π is blocked because of this fork or chain. Either this node is in pDe(X t-γ , G s ) in which case one should consider the last element of π which is not in

then π is blocked by a collider in pDe(X t-γ , G s ) and if this element is a parent of [ecn(X, Y, G s )] t t-γ then either π is blocked by Z 2 or we adjust on this element thus blocking π. * Note that the condition “γ = 0” guarantees the existence of an element of π which is not in

• Suppose Condition C is verified, let π = ⟨V 0 t0 ⋯V n tn ⟩ be a backdoor path in a compatible FT-DAG. If ⟨V n-1 tn-1 ← V n tn ⟩ ⊆ π then π contains a collider which is a descendant of Y t and so π is blocked by this collider. If

In that case, one can consider the edge between V n-2 tn-2 : either ⟨V n-2 tn-2 ← V n-1 tn-1 ⟩ ⊆ π and thus π is blocked by a collider which is a descendant of X t , either

which we have shown before to be impossible due to causal stationarity.

Proposition 1. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by Theorem 1. The set q-opt (X t-γ , Y t ) G s satisfies the SCG-back-door criterion.

Proof. Let D = pDe(X t-γ , G s ).

• If Condition A is verified and Cycles(X, G s ) = ∅ and γ > 0, then let P = [P a(ecn(X, Y, G s ), G s )] t t-γ-γmax and q-opt (X t-γ , Y t ) G s = P/D. q-opt (X t-γ , Y t ) G s verifies item 2 of Definition 4.

• If Condition A is verified and γ = 0, then let P = [P a(cn(X, Y, G s ), G s )] t t-γ-γmax ∪ [P a(ecn BD (X, Y, ∅, G s ), G s )] t t-γ-γmax and q-opt (X t-γ , Y t ) G s = P/D. q-opt (X t-γ , Y t ) G s verifies item 3 of Definition 4.

• If Condition A is verified and Cycles(X, G s ) ≠ ∅ and γ > 0, then let P = [P a(X, G s )] t t-γ-γmax+1 ∪ [P a(ecn(X, Y, G s ), G s )] t t-γ-γmax and q-opt (X t-γ , Y t ) G s = P/D. q-opt (X t-γ , Y t ) G s verifies item 4 of Definition 4.

• If Condition B is verified, then let P = [P a(cn(X, Y, G s ), G s )] t t-γ-γmax ∪ [P a(ecn BD (X, Y, ∅, G s ), G s )] t t-γ-γmax and q-opt (X t-γ , Y t ) G s = P/D. q-opt (X t-γ , Y t ) G s verifies item 2 of Definition 4.

• If Condition C is verified, then let P = [P a(Y, G s )] t t-γ-γmax ∪ [P a(X, G s )] t t-γmax and q-opt (X t-γ , Y t ) G s = P/D. q-opt (X t-γ , Y t ) G s verifies Definition 4.

Proposition 2. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by the SCG-back-door criterion. Suppose there exists a compatible FT-DAG G such that opt (X t-γ , Y t ) G is a valid adjustment set in every compatible FT-DAG. There exist a compatible FT-DAG G in which opt (X t-γ , Y t ) G = q-opt (X t-γ , Y t ) G s .

Proof. Build G the following way:

• ∀⟨U → V ⟩ ∈ G s and ∀t -γ max ≤ t U < t add the edge ⟨U t U → V t ⟩ to G.

• For all directed path from X to Y π = ⟨V 0 → ⋯ → V n ⟩ and ∀0 < i < n iteratively add the edges ∀t, ⟨V i t → V i+1 t ⟩ to G if it does not create a cycle.

• ∀⟨U t → V t ⟩ ∈ G and ∀P ∈ P a(V, G s ) add the edges ∀t, ⟨P t → V t ⟩ to G if it does not create a cycle.

• Repeat this last item until no additional edge can be added.

Figure 8: Example of a SCG such that for every FT-DAG G there exists another FT-DAG G ′ in which opt (X t-γ , Y t ) G is not a valid adjustment set. In red is the treatment variable, in blue the outcome variable and in green is the optimal set. Notice that in the first FT-DAG the backdoor path ⟨X t ← W t → Z t → Y t ⟩ would not be blocked if one did not adjust on Z t , and that in the second FT-DAG Z t is a descendant of Y t so one cannot adjust on it without introducing selection bias. Thus, each optimal adjustment set is invalid in the other compatible FT-DAG.

With this construction it is clear that q-opt (X t-γ , Y t ) G s = opt (X t-γ , Y t ) G as every potential parent of every extended causal node (or causal node if γ = 0) in the SCG is a parent of a causal node in the FT-DAG. Thus the only elements of q-opt (X t-γ , Y t ) G s which are not in opt (X t-γ , Y t ) G are the variables in [P a(ecn BD (X, Y, ∅, G s ), G s )] t t-γ-γmax . However, ecn BD (X, Y, ∅, G s ) is always empty except in cases such as the one of Figure 8 in which there does not exist any compatible FT-DAG G such that opt (X t-γ , Y t ) G is a valid adjustment set in every compatible FT-DAG.

Proposition 3. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the micro causal effect from X t-γ to Y t is identifiable by the SCG-back-door criterion. For every compatible FT-DAG G, opt (X t-γ , Y t ) G /pDe(X t-γ , G s ) ⊆ q-opt (X t-γ , Y t ) G s .

Proof. Let G be a FT-DAG. If γ = 0 then cn(X t-γ , Y t , G) = [cn(X t-γ , Y t , G s )] t t-γ and if γ > 0 then cn(X t-γ , Y t , G) = [ecn(X t-γ , Y t , G s )] t t-γ . Therefore, P a(cn(X t-γ , Y t , G), G) ⊆ [P a(cn(X t-γ , Y t , G s ), G s )] t t-γ-γmax when γ = 0 and P a(cn(X t-γ , Y t , G), G) ⊆ [P a(ecn(X t-γ , Y t , G s ), G s )] t t-γ-γmax when γ > 0. If Conditions A or B are satisfied, then [P a(ecn(X t-γ , Y t , G s ), G s )] t t-γ-γmax ⊆ P where P is defined in Definition 6. In these cases, we have opt (X t-γ , Y t ) G /pDe(X t-γ , G s ) ⊆ q-opt (X t-γ , Y t ) G s . Lastly, if Condition C is satisfied then cn(X t-γ , Y t , G) = {Y t-1 , X t } so it is clear that opt (X t-γ , Y t ) G /pDe(X t-γ , G s ) ⊆ q-opt (X t-γ , Y t ) G s .

Corollary 1. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the causal effect from X t-γ to Y t is identifiable. Suppose there exists a compatible FT-DAG G such that opt (X t-γ , Y t ) G is a valid adjustment set in every compatible FT-DAG. Let compatible (G s ) be the set of FT-DAGs compatible with G s . The following equality holds:

Proof. It follows from Propositions 2 and 3.

Corollary 2. Let G s be an SCG and (X t-γ , Y t ) be two temporal variables of interest such that X ∈ An(Y, G s ) and such that the causal effect from X t-γ to Y t is identifiable by Theorem 1. For any distribution Pr compatible with any FT-DAG compatible with G s . Under the same assumptions (briefly discussed above and described in details in Henckel et al. [2022], Rotnitzky and Smucler [2020]) required for the optimality of the set opt (X t-γ , Y t ) G , we have the following: var Pr ( ∆Xt-γ,Yt (q-opt (X t-γ , Y t ) G s )) ≤ var Pr ( ∆Xt-γ,Yt (A 1

)), and var Pr ( ∆Xt-γ,Yt (q-opt (X t-γ , Y t ) G s )) ≤ var Pr ( ∆Xt-γ,Yt (A 2 )).

Proof. The proof follows directly from [Henckel et al., 2022, Theorem 1].

Suppose that T = A 2 /q-opt (X t-γ , Y t ) G s and S = q-opt (X t-γ , Y t ) G s /A 2 . By definition, T consists of elements in A 2 that are not in q-opt (X t-γ , Y t ) G s . Since q-opt (X t-γ , Y t ) G s includes all parents of Y t (not descendants of

By weaker stationarity we mean to say that when dealing with multiple multivariate time series, we only need to assume that all causal relations from tγγmax to t are time invariant in the FT-DAG but not necessarily in the DTDSCM.

FT-DAGs are a subclass of DAGs, and similarly, DTDSCMs are instances of standard SCMs. As such, general results and tools developed for DAGs and SCMs are directly applicable to FT-DAGs and DTDSCMs. Throughout this paper, we may occasionally state that a method or result applies to FT-DAGs, even if it was originally established in the broader context of DAGs.

The term “micro” causal effect is used in contrast to the “macro” causal effect, which refers to the influence of an entire time series on another. In the case of micro causal effects, the focus is on the causal influence of a single time point (or time-specific intervention) on another specific time point within or across time series.

Condition C can be replaced by γ = 1 and Scc(X, G s ) ⊆ {X, Y } and Y → Y ∉ E s

Condition C can be replaced by γ = 1 and Scc(X, G s ) ⊆ {X, Y } and Y → Y ∉ E s

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut