기후·캘린더 변수와 인과관계를 결합한 에너지 수요 예측 모델

Reading time: 5 minute
...

📝 Original Info

  • Title: 기후·캘린더 변수와 인과관계를 결합한 에너지 수요 예측 모델
  • ArXiv ID: 2512.11653
  • Date: Pending
  • Authors: ** - Chutian Ma - Grigorii Pomazkin - Giancinto Paolo (GP) Saggese - Paul Smith **

📝 Abstract

Energy demand prediction is critical for grid operators, industrial energy consumers, and service providers. Energy demand is influenced by multiple factors, including weather conditions (e.g. temperature, humidity, wind speed, solar radiation), and calendar information (e.g. hour of day and month of year), which further affect daily work and life schedules. These factors are causally interdependent, making the problem more complex than simple correlation-based learning techniques satisfactorily allow for. We propose a structural causal model that explains the causal relationship between these variables. A full analysis is performed to validate our causal beliefs, also revealing important insights consistent with prior studies. For example, our causal model reveals that energy demand responds to temperature fluctuations with season-dependent sensitivity. Additionally, we find that energy demand exhibits lower variance in winter due to the decoupling effect between temperature changes and daily activity patterns. We then build a Bayesian model, which takes advantage of the causal insights we learned as prior knowledge. The model is trained and tested on unseen data and yields state-of-the-art performance in the form of a 3.84% MAPE on the test set. The model also demonstrates strong robustness, as the cross-validation across two years of data yields an average MAPE of 3.88%.

💡 Deep Analysis

Figure 1

📄 Full Content

CAUSAL INFERENCE IN ENERGY DEMAND PREDICTION CHUTIAN MA, GRIGORII POMAZKIN, GIANCINTO PAOLO (GP) SAGGESE, AND PAUL SMITH Abstract. Energy demand prediction is critical for grid operators, indus- trial energy consumers, and service providers. Energy demand is influenced by multiple factors, including weather conditions (e.g. temperature, humid- ity, wind speed, solar radiation), and calendar information (e.g. hour of day and month of year), which further affect daily work and life schedules. These factors are causally interdependent, making the problem more complex than simple correlation-based learning techniques satisfactorily allow for. We pro- pose a structural causal model that explains the causal relationship between these variables. A full analysis is performed to validate our causal beliefs, also revealing important insights consistent with prior studies. For example, our causal model reveals that energy demand responds to temperature fluctuations with season-dependent sensitivity. Additionally, we find that energy demand exhibits lower variance in winter due to the decoupling effect between temper- ature changes and daily activity patterns. We then build a Bayesian model, which takes advantage of the causal insights we learned as prior knowledge. The model is trained and tested on unseen data and yields state-of-the-art performance in the form of a 3.84% MAPE on the test set. The model also demonstrates strong robustness, as the cross-validation across two years of data yields an average MAPE of 3.88%. Contents 1. Introduction 1 2. Analysis of Causal Structure 2 3. A Full Bayesian Treatment 10 4. Conclusion 16 5. Future Directions 16 Appendix A. Legitimacy of Approach 2 16 Appendix B. Parameters and Priors 17 References 19 1. Introduction While machine learning (ML) has achieved enormous success in recent decades, most modern machine learning systems operate purely on statistical correlation, without any understanding of causation. They excel at detecting patterns in train- ing data, but cannot distinguish between relationships that merely happen to occur together and those that represent genuine cause-and-effect mechanisms. As Bern- hard pointed out in [15], this distinction represents one of the most profound gaps 1 arXiv:2512.11653v2 [cs.AI] 17 Dec 2025 2 MA, POMAZKIN, SAGGESE, AND SMITH between current AI capabilities and human intelligence and it hinders the ability of AI systems to generalize what they have learned to new problems. Due to the these limitations of correlation-based AI, the machine learning com- munity has shown increasing interest in incorporating causal inference in machine learning [7, 17, 6, 3], and particularly for time series prediction [9, 14]. In terms of “hindering the generalization ability”, the presence of confounder bias stands as one of the most common challenges. In this paper, we define a confounder in the same way as in Pearl’s work on causality (e.g., [11]). Definition 1.1 (Confounder). Suppose the random variables X, Y, Z are causally connected by the following relation X ←Z →Y (Z affects X and Y causally). Then we say that Z confounds X and Y , or that Z is a confounder of the other two variables. The existence of confounders is often problematic because they create spurious associations between predictors and outcomes. Models learn these non-causal cor- relations rather than genuine causal effects. This hinders generalization: when the distribution of predictors shifts in deployment environments or under interventions, the spurious patterns learned from confounded training data fail to hold, leading to poor predictive performance in new settings. This is a major limitation of purely statistical approaches operating at the associational level, as Pearl emphasizes in his discussion of the causal hierarchy [12]. In this paper, we study an energy demand prediction problem in which we model the system load using ML approaches. We present a full analysis of the interdepen- dency structure of the various predictors (including weather conditions and calendar information) and energy demand. At the same time, we list common mistakes and consequences originating from confounder bias and misspecified causal structure. A Bayesian causal model is later built based on the causal insights, trained on real- world energy demand data and tested on unseen data. The model is able to produce state-of-the-art predictions, with an average MAPE of 3.88% generated from a K- fold cross validation test. In addition, the model is able to explain the variability found in the data, such as the seasonal-dependent variance (heteroscedasticity) and temperature sensitivity. This paper is structured as follows. Section 2 presents a full analysis of the inter- dependency between calendar, weather and energy demand variables. We show that ignoring causal structure can cause a trained model to show confounder bias, se- verely jeopardizing its generalization to unseen data. Section 3 develops a

📸 Image Gallery

DAG_3.png MAPE_causal_vs_non_causal_linear_regression.png causal_results_vs_non_causal.png energy_demand_causal_dag.png grid_pyro_test_data_results.png grid_pyro_train_data_results.png humidity_45-50.png humidity_50-55.png humidity_55-60.png humidity_75-80.png humidity_80-85.png humidity_85-90.png humidity_correlation_distribution.png humidity_kde_Feb.png humidity_kde_Jan.png humidity_partial_DAG.png humidity_vs_energy_scatter.png radiation_vs_hour.png radiation_vs_month.png scatter_plot_temp_vs_energy.png solar_radiation_month_2_hour_14.png solar_radiation_month_2_hour_18.png solar_radiation_month_7_hour_14.png solar_radiation_month_7_hour_20.png wind_speed_winter.png winter_speed_summer.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut