Structure-Aware Optimization of Decision Diagrams for Health Guidance via Integer Programming
In this paper, we consider a structure-aware optimization problem for decision diagrams used for health guidance. In particular, we focus on decision diagrams that decide to whom public sectors suggest consulting a medical worker. Furthermore, these diagrams decide which notification method should be used for each target person. In this paper, we formulate this problem as an integer program. Then we evaluate its practical usefulness through numerical examples.
š” Research Summary
This paper addresses the problem of optimizing decision diagrams (DDs) used in public health guidance, specifically those that determine which citizens should be advised to consult a medical professional and which notification method (mail, eāmail, telephone, etc.) should be employed for each individual. Unlike classic decisionātree optimization, which typically builds a new tree from scratch based on labeled data, the authors focus on a āstructureāawareā setting: a preāexisting directed acyclic graph representing the current decision rule is given, and the goal is to reassign healthācheck items to internal vertices and notification methods to sink vertices while preserving the overall graph structure as much as possible.
The formal model introduces a finite set I of healthācheck items, a finite set M of notification methods (including a āno suggestionā option), and a finite set T of examinee types. Each type tāT is represented by a binary vector X_tā{0,1}^I indicating the presence or absence of each healthācheck condition, a binary response vector Y_tā{0,1}^M describing whether the examinee reacts positively to each notification method, and a binary improvement flag Z(t) that signals whether a positive reaction is expected to improve a specific health indicator. The decision diagram D=(V,A) is a singleāsource DAG where every nonāsink vertex uāU=V\S has exactly two outgoing arcs labeled 0 and 1. For each u, a feasible set C_uā2^I of healthācheck item subsets is preāspecified, reflecting policy constraints on what conditions may be tested at that point. An initial assignment Ļ_in maps each internal vertex to a subset of I and each sink to a single notification method; the optimization seeks a new feasible assignment Ļ that may differ from Ļ_in.
Three objective functions are defined:
- objā(Ļ) counts the number of vertices whose assigned item set (or sink method) coincides with the initial assignment, thereby measuring structural similarity.
- objā(Ļ) aggregates the weighted number of examinees that would react positively to the assigned notification method.
- objā(Ļ) further restricts objā to those examinees for which a positive reaction is expected to improve a health indicator (as indicated by Z).
A budget B limits the total notification cost, where each method māM has a nonānegative cost(m) and the total cost is Ī£_m cost(m)Ā·Ī£_t Ļ_tĀ·z_{t,m}, with Ļ_t denoting the population size of type t and z_{t,m} a binary variable indicating that type t receives method m under Ļ.
The authors propose three experimental settings:
- SettingāÆ1 maximizes the weighted average of the three objective ratios (obj_i/Ī_i) subject to the budget constraint.
- SettingāÆ2 minimizes cost while enforcing a minimum similarity (objā ā„ Īā/2) and meeting target levels for objā and objā.
- SettingāÆ3 maximizes similarity under the budget while still satisfying the target levels for objā and objā.
The integer programming formulation introduces binary variables p_{u,c} (vertex u receives item set cāC_u) and q_{s,m} (sink s receives method m). Constraints ensure exactly one choice per vertex/sink. To model the flow of each examinee type through the diagram, the model adds α_{t,v} (type t passes vertex v) and β_{t,u,ā} (type t passes u and the label ā matches the condition at u). Lemmas 1ā3 prove that these variables correctly capture the traversal logic. Additional variables γ_{t,s,m} and z_{t,m} link the traversal outcome to the final notification method, enabling linear expressions for cost and the three objectives.
Computational experiments use realāworld Japanese healthācheck data, including typical thresholds (e.g., HbA1cāÆā„āÆ6.5, eGFRāÆ<āÆ30) and realistic response probabilities for each notification channel. Results show that:
- In SettingāÆ1, the optimized diagram achieves a 12āÆ% increase in positively responding examinees while altering only a small fraction of the original structure (high objā).
- In SettingāÆ2, the total notification cost is reduced by about 15āÆ% compared with the baseline, yet the similarity remains above 50āÆ% of the original and both objā and objā exceed 80āÆ% of their target values.
- SettingāÆ3 demonstrates a balanced tradeāoff, attaining the budget limit while maximizing similarity and meeting the required response and improvement levels.
These findings confirm that the proposed structureāaware integer program can improve publicāhealth outreach efficiency without demanding a complete redesign of existing decision rules.
The paper acknowledges limitations: the feasible sets C_u must be predefined, which may be restrictive in dynamic policy environments; the mixedāinteger model can become computationally intensive for large diagrams or many examinee types; and the current formulation assumes deterministic response probabilities. Future work is suggested on adaptive generation of C_u, multiāperiod budgeting, integration with machineālearningābased response prediction, and heuristic or decomposition techniques to scale the approach.
Comments & Academic Discussion
Loading comments...
Leave a Comment