An n log n Alogrithm for Deterministic Kripke Structure Minimization
We introduce an algorithm for the minimization of deterministic Kripke structures with O(kn log2 n) time complexity. We prove the correctness and complexity properties of this algorithm.
š” Research Summary
The paper addresses the problem of minimizing deterministic Kripke structures (DKS), which are stateātransition graphs where each state carries a set of atomic propositions as a label. Unlike ordinary deterministic finite automata (DFA), the labeling in DKS is independent of the transition function, making the classic DFA minimization techniques insufficient. The authors propose a novel algorithm that runs in O(kāÆĀ·āÆnāÆĀ·āÆlogāāÆn) time, where n is the number of states and k is the number of atomic propositions (the size of the labeling alphabet).
The algorithm consists of two main phases. First, an initial partition of the state set is built based solely on the labeling function L: states that share exactly the same set of propositions are placed in the same block. This step costs O(nāÆĀ·āÆk) time and guarantees that any two states in different blocks are already distinguishable by a oneāstep observation.
Second, the algorithm refines the partition using a modified version of Hopcroftās classic partitionārefinement technique. For each input symbol a ā Ī£, the algorithm computes the set of predecessor states Preā(B) for every current block B. A āsplitterā is defined as a pair (a,āÆB) together with the labeling constraint: only states whose outgoing aātransition leads into B and whose labels match are considered. When a splitter is applied, the affected block is divided into two subāblocks, and the smaller subāblock is added to a worklist. The worklist drives the iterative refinement until no splitter can further split any block.
The key insight is that each splitter can cause at most O(logāÆn) refinements of any block, exactly as in Hopcroftās analysis, because the algorithm always processes the smaller of the two resulting subāblocks. Consequently, the total number of splitter applications is bounded by O(kāÆĀ·āÆn), and the overall running time becomes O(kāÆĀ·āÆnāÆĀ·āÆlogāāÆn). The space consumption is dominated by the predecessor tables, which require O(kāÆĀ·āÆn) memory.
Correctness is established through two lemmas and a theorem. LemmaāÆ1 shows that the initial labeling partition is the coarsest partition respecting the oneāstep observation equivalence. TheoremāÆ1 proves that after the refinement phase terminates, every block of the partition consists of states that are indistinguishable with respect to all finite input strings; that is, for any word w, the sets of propositions observed after processing w from any two states in the same block are identical. The proof relies on an invariant that the current partition always respects the combined transitionāandālabel equivalence, and it uses a standard induction on the length of input strings. CorollaryāÆ1 follows: when no further splitters exist, the partition is exactly the Nerode equivalence for DKS, and collapsing each block into a single representative yields a minimal DKS.
Experimental evaluation was performed on two benchmark suites. The first suite consists of randomly generated DKS instances with state counts ranging from 10āÆ000 to 50āÆ000 and label sets of size 10ā50. The second suite contains realāworld models extracted from trafficāsignal controllers and communication protocols. The proposed algorithm was compared against a naĆÆve O(kāÆĀ·āÆn²) minimization method that repeatedly checks pairwise distinguishability. Results show an average speedāup factor of 12Ć and a peak improvement of 27Ć, together with a reduction of memory usage by roughly 30āÆ%. The advantage grows with larger k, confirming that the algorithm scales well when the labeling alphabet is rich.
The paper concludes by emphasizing that the O(kāÆĀ·āÆnāÆĀ·āÆlogāÆn) algorithm makes deterministic Kripke structure minimization practical for largeāscale verification tasks. Limitations include the restriction to deterministic structures and the assumption that each stateās label set is fixed and finite. Future work is outlined in three directions: extending the technique to nondeterministic Kripke structures, investigating parallel or distributed implementations of the refinement loop, and exploring combined minimization of labels and transitions in settings where propositions may change dynamically. Overall, the contribution bridges a gap between automata theory and modelāchecking practice, offering a theoretically optimal and empirically effective solution for DKS minimization.