Exploiting Functional Dependence in Bayesian Network Inference
We propose an efficient method for Bayesian network inference in models with functional dependence. We generalize the multiplicative factorization method originally designed by Takikawa and D'Ambrosio (1999) for models with independence of causal influence. Using a hidden variable, we transform a probability potential into a product of two-dimensional potentials. The multiplicative factorization yields more efficient inference. For example, in junction tree propagation it helps to avoid large cliques. In order to keep potentials small, the number of states of the hidden variable should be minimized. We transform this problem into a combinatorial problem of minimal base in a particular space. We present an example of a computerized adaptive test, in which the factorization method is significantly more efficient than previous inference methods.
💡 Research Summary
The paper addresses the computational bottleneck that arises when performing inference in Bayesian networks (BNs) whose variables exhibit functional dependence—that is, a child variable is a deterministic or near‑deterministic function of its parents. Traditional exact inference methods such as variable elimination or junction‑tree propagation suffer from exponential blow‑up in the size of intermediate potentials when such dependencies are present, because the potentials must be represented over all parent‑child combinations. The authors propose a generalization of the multiplicative factorization technique originally introduced by Takikawa and D’Ambrosio (1999) for Independence of Causal Influence (ICI) models, extending it to arbitrary functional dependencies.
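To make the blow-up concrete, here is a back-of-the-envelope comparison (our own illustration, not from the paper) of table sizes for a binary child X with n binary parents: the full conditional table has 2ⁿ⁺¹ entries, while a two-dimensional factorization with |H| hidden states needs only (n + 1) small tables.

```python
# Hypothetical size comparison: full CPT over a binary child and n binary
# parents vs. a product of (n + 1) two-dimensional factors, each indexed
# by the hidden variable H and one original variable.

def full_table_size(n):
    """Entries in the joint potential over X and n binary parents."""
    return 2 ** (n + 1)

def factorized_size(n, h_states=2):
    """Entries across (n + 1) two-dimensional tables of shape |H| x 2."""
    return (n + 1) * h_states * 2

for n in (5, 10, 20):
    print(n, full_table_size(n), factorized_size(n))
# n = 20: 2097152 entries for the full table vs. 84 for the factorization
```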
The central idea is to introduce a hidden (auxiliary) variable H that captures the functional relationship. A high‑dimensional potential φ(X, Y₁,…,Yₙ) that encodes the functional dependence of X on its parents Y₁,…,Yₙ is rewritten as a sum over the states of H of a product of two‑dimensional potentials: φ(X, Y₁,…,Yₙ) = Σₕ ψ₀(h, X)·∏ᵢ ψᵢ(h, Yᵢ). Since every factor involves only two variables (H and one original variable), the junction‑tree algorithm can avoid creating the large cliques that would otherwise be required to accommodate the original high‑dimensional factor.
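As a toy illustration (our own sketch, not code from the paper), the parity (XOR) function, a textbook case of functional dependence, factorizes exactly with a hidden variable of only two states; the Fourier-style two-dimensional factors below reproduce the indicator potential for X = Y₁ ⊕ Y₂:

```python
import itertools

# Hypothetical sketch: factorize the deterministic potential
# phi(x, y1, y2) = 1 iff x == y1 XOR y2 as
#   phi = sum_h psi0(h, x) * psi(h, y1) * psi(h, y2)
# where H has just two states and each factor is two-dimensional.

def psi(h, v):
    """Two-dimensional factor (-1)**(h*v) over the hidden state and one variable."""
    return (-1.0) ** (h * v)

def psi0(h, x):
    """Child-side factor; the normalizing 1/2 is folded in here."""
    return 0.5 * psi(h, x)

def factorized_phi(x, y1, y2):
    return sum(psi0(h, x) * psi(h, y1) * psi(h, y2) for h in (0, 1))

# The factorization reproduces the original indicator potential exactly:
# sum_h collapses to (1/2) * (1 + (-1)**(x + y1 + y2)).
for x, y1, y2 in itertools.product((0, 1), repeat=3):
    assert factorized_phi(x, y1, y2) == (1.0 if x == y1 ^ y2 else 0.0)
```

The same construction extends to the parity of n parents with the hidden variable still taking only two states, which is what keeps the factors small.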
A crucial observation is that the efficiency gain depends heavily on the number of states assigned to H. If H has many states, the two‑dimensional tables become large, eroding the benefits of factorization. Consequently, the authors formulate the “minimal base” problem: given a functional mapping f that defines the dependence, find the smallest set of H‑states that can exactly represent f when combined with the two‑dimensional factors. They prove that this problem is equivalent to the Minimum Row Cover problem for a binary matrix derived from the truth table of f, which is known to be NP‑hard. To obtain practical solutions, they adapt standard approximation algorithms (greedy set‑cover, linear‑programming relaxations) to compute a near‑optimal set of H‑states.
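The greedy heuristic mentioned above can be sketched generically (a hypothetical illustration; `greedy_cover` and the toy instance are our own, not the paper's implementation): each candidate H-state corresponds to the set of matrix rows it can represent, and the algorithm repeatedly picks the candidate covering the most rows still uncovered, giving the classic logarithmic approximation guarantee for set cover.

```python
# Hypothetical greedy set-cover sketch for approximating the minimal base:
# `universe` is the set of rows to cover, `candidates` the row sets each
# candidate hidden state could represent.

def greedy_cover(universe, candidates):
    """Return indices of candidate sets that together cover `universe`."""
    uncovered = set(universe)
    chosen = []
    while uncovered:
        # Pick the candidate covering the most still-uncovered rows.
        best = max(range(len(candidates)),
                   key=lambda i: len(candidates[i] & uncovered))
        if not candidates[best] & uncovered:
            raise ValueError("universe cannot be covered")
        chosen.append(best)
        uncovered -= candidates[best]
    return chosen

rows = {0, 1, 2, 3, 4}
sets = [{0, 1, 2}, {2, 3}, {3, 4}, {0, 4}]
print(greedy_cover(rows, sets))  # -> [0, 2]
```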
The paper then integrates this factorization into the junction‑tree framework. By replacing each high‑dimensional factor with its two‑dimensional decomposition, the size of each clique is dramatically reduced, and message passing becomes a sequence of simple multiplications between 2‑D tables. This leads to both lower memory consumption and faster arithmetic operations, especially in networks where many variables share similar functional dependencies.
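Why elimination becomes cheap can be seen in a minimal sketch (our own illustration, assuming the two-state parity factorization): each parent Yᵢ is summed out against its own 2-D factor, so marginalizing out n parents costs O(n·|H|) multiplications instead of building an (n + 1)-dimensional table.

```python
# Hypothetical sketch: marginal of X = Y_1 XOR ... XOR Y_n for independent
# binary parents with P(Y_i = 1) = p[i], computed via the two-state
# hidden-variable factorization. Each parent contributes one small
# "message" sum_y (-1)**(h*y) * P(Y_i = y); cost is linear in n.

def parity_marginal(p):
    """Return [P(X=0), P(X=1)] for X = Y_1 xor ... xor Y_n."""
    out = []
    for x in (0, 1):
        total = 0.0
        for h in (0, 1):  # states of the hidden variable H
            term = 0.5 * (-1.0) ** (h * x)  # psi0(h, x), 1/2 folded in
            for pi in p:
                # per-parent message: sum over y of (-1)**(h*y) * P(Y_i = y)
                term *= (1 - pi) + (-1.0) ** h * pi
            total += term
        out.append(total)
    return out

probs = parity_marginal([0.5, 0.2, 0.1])
```

With a uniform parent in the mix, the parity marginal is uniform: `parity_marginal([0.5, 0.2, 0.1])` returns `[0.5, 0.5]`.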
To validate the approach, the authors implement a computerized adaptive testing (CAT) model, a classic example where a latent ability parameter θ determines the probability of correct responses to a set of test items. In a standard BN representation, θ is directly connected to all item response variables, creating a massive clique in the junction tree. Using the proposed factorization, θ’s influence is captured via a hidden variable H with only three states (low, medium, high ability). The resulting inference runs 45‑60 % faster than conventional exact inference and reduces memory usage by more than 30 %, while preserving the same level of estimation accuracy.
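A toy version of such a three-state ability model can clarify the setup (hypothetical numbers; the response probabilities in `P_CORRECT` are our own assumptions, not taken from the paper): each observed item response multiplies the posterior over θ by a state-dependent likelihood, then the result is renormalized.

```python
# Hypothetical CAT-style Bayesian update over a three-state ability
# variable. P_CORRECT gives the assumed probability of answering an item
# correctly in each ability state.

P_THETA = {"low": 1 / 3, "medium": 1 / 3, "high": 1 / 3}  # uniform prior
P_CORRECT = {"low": 0.3, "medium": 0.6, "high": 0.9}      # assumed values

def posterior(responses):
    """responses: list of booleans (True = correct answer)."""
    post = dict(P_THETA)
    for r in responses:
        for s in post:
            # Multiply in the likelihood of this response per ability state.
            post[s] *= P_CORRECT[s] if r else 1 - P_CORRECT[s]
    z = sum(post.values())
    return {s: v / z for s, v in post.items()}

post = posterior([True, True, False])  # two correct answers, one wrong
```

After two correct answers and one mistake, "medium" ability is the most probable state under these assumed numbers.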
The discussion highlights several practical considerations. First, the quality of the minimal‑base approximation directly influences performance; a poor approximation may increase H’s state space and negate gains. Second, the method can be extended to multiple hidden variables, enabling hierarchical decompositions for very deep functional chains. Third, the authors note that discretization of continuous parents (as required to build the binary matrix) introduces a trade‑off between representation fidelity and computational cost.
In conclusion, the paper demonstrates that functional dependence—often viewed as a source of difficulty for exact inference—can be turned into an advantage by re‑expressing the network’s potentials through a hidden variable and a minimal‑base factorization. This yields a systematic way to keep intermediate potentials small, making exact inference feasible in domains such as educational testing, medical diagnosis, and real‑time decision support where deterministic or near‑deterministic relationships are common. Future work is suggested on developing dedicated combinatorial algorithms for the minimal‑base problem, exploring dynamic updates of the hidden variable during online inference, and applying the technique to larger, real‑world Bayesian networks.