Causal generalized linear models via Pearson risk invariance

Notice: This research summary and analysis were automatically generated using AI technology. For absolute accuracy, please refer to the [Original Paper Viewer] below or the Original ArXiv Source.

Prediction invariance of causal models under heterogeneous settings has been exploited by a number of recent methods for causal discovery, typically focussing on recovering the causal parents of a target variable of interest. Existing methods require observational data from a number of sufficiently different environments, which is rarely available. In this paper, we consider a structural equation model where the target variable is described by a generalized linear model conditional on its parents. Besides having finite moments, no modelling assumptions are made on the conditional distributions of the other variables in the system, and nonlinear effects on the target variable can naturally be accommodated by a generalized additive structure. Under this setting, we characterize the causal model uniquely by means of two key properties: the Pearson risk invariant under the causal model and, conditional on the causal parents, the causal parameters maximize the expected likelihood. These two properties form the basis of a computational strategy for searching the causal model among all possible models. A stepwise greedy search is proposed for systems with a large number of variables. Crucially, for generalized linear models with a known dispersion parameter, such as Poisson and logistic regression, the causal model can be identified from a single data environment. The method is implemented in the R package causalreg.

💡 Research Summary

The paper introduces a novel framework for causal discovery that leverages the invariance of Pearson risk in generalized linear models (GLMs). The authors consider a structural equation model (SEM) where the target variable Y follows an exponential dispersion family distribution conditional on its direct causes (parents) X_PA. The conditional mean and variance are linked through the canonical functions b′(·) and b″(·) of the GLM, while no distributional assumptions are imposed on the remaining covariates. Two key properties uniquely characterize the causal model: (1) the true parent function f_PA maximizes the expected joint log‑likelihood of (X_PA, Y); (2) the expected Pearson risk, defined as E

Causal generalized linear models via Pearson risk invariance

💡 Research Summary

Comments & Academic Discussion

Leave a Comment