Pessimistic Testing
📝 Original Info
- Title: Pessimistic Testing
- ArXiv ID: 0910.0996
- Date: 2009-10-07
- Authors: Ernie Cohen (Microsoft)
📝 Abstract
We propose a new approach to testing conformance to a nondeterministic specification, in which testing proceeds only as long as increased test coverage is guaranteed.
📄 Full Content
arXiv:0910.0996v1 [cs.SE] 6 Oct 2009
In testing that a system meets a nondeterministic specification [1], it is usually assumed that the system is fair to each transition of the specification (i.e., the system will make every possible nondeterministic choice if given enough opportunities). But in fact, some transitions might be unlikely or impossible for a given implementation. When this happens, common model-based testing practices (e.g., following a precomputed tour of the state space) often lead to wasted test cycles and poor test coverage. We propose an alternative approach, in which the tester uses a dynamically computed strategy that is guaranteed to eventually increase coverage; testing stops as soon as the system has a strategy to avoid further coverage.
As usual, we cast the test problem as a game; here, it is conveniently represented as a (directed) hypergraph. A hypergraph is given by a set of vertices and a set of (hyper)edges. Each edge is given by a head vertex and a set of tail vertices; we say it is incident to its head. An edge is reachable iff all of its tail vertices are reachable, and a vertex is reachable iff one of its incident edges is reachable (as usual, taking the minimal solution). The rank of a reachable edge is the maximum of the ranks of its tail vertices (0 if the tail is empty), and the rank of a reachable vertex is one plus the minimum rank of its reachable incident edges.
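These definitions can be computed by a simple Bellman-Ford-style fixpoint iteration. The following sketch is ours, not from the paper; the edge encoding (head vertex paired with a frozenset of tail vertices) and the function name are illustrative assumptions:

```python
def vertex_ranks(edges):
    # edges: iterable of (head, frozenset_of_tail_vertices) hyperedges.
    # Returns {vertex: rank} for exactly the reachable vertices.
    vrank = {}
    changed = True
    while changed:
        changed = False
        for head, tail in edges:
            # An edge is reachable iff all of its tail vertices are reachable.
            if all(v in vrank for v in tail):
                # Edge rank: max of tail ranks (0 if the tail is empty).
                edge_rank = max((vrank[v] for v in tail), default=0)
                # Vertex rank: 1 + min over its reachable incident edges.
                if edge_rank + 1 < vrank.get(head, float("inf")):
                    vrank[head] = edge_rank + 1
                    changed = True
    return vrank
```

For example, an empty-tail edge into `s` gives `s` rank 1; an edge from `{s}` into `t` then gives `t` rank 2, and a vertex with several reachable incident edges takes the cheapest one.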
In the test context, the hypergraph vertices are system states, each hyperedge represents a possible test stimulus, the head of the hyperedge is the state in which the stimulus can be delivered, and the tail of the hyperedge gives the states to which the system is allowed to transition under the stimulus. To keep track of which states have been explored by the test, we add trivial edges (with empty tails) incident on each state (other than the initial state). When the system first visits a state, this incident edge is removed, “marking” the state. Thus, the test state consists of a hypergraph and a current state (an unmarked vertex of the hypergraph), and a move of the testing game consists of the tester choosing an edge incident on the current state and the system choosing a new current state from the tail of the edge (marking the state if it is unmarked). Coverage is measured as the number of marked states.
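The game state and moves described above can be modeled in a few lines. This is a toy sketch under our own naming assumptions (the class, its methods, and the implicit representation of the trivial marking edges are not from the paper):

```python
class TestGame:
    # Toy model of the testing game state.
    # edges: list of (head, frozenset_of_tail_states) stimuli.
    # The trivial empty-tail edges are represented implicitly: a state is
    # in `unmarked` while its (conceptual) trivial edge is still present.
    def __init__(self, states, initial, edges):
        self.edges = list(edges)
        self.current = initial
        self.unmarked = set(states) - {initial}
        self.marked = set()

    def tester_options(self):
        # Edges the tester may choose: those incident on the current state.
        return [e for e in self.edges if e[0] == self.current]

    def system_responds(self, edge, new_state):
        # The system picks a successor from the chosen edge's tail,
        # marking it (removing its trivial edge) on first visit.
        head, tail = edge
        assert head == self.current and new_state in tail
        if new_state in self.unmarked:
            self.unmarked.discard(new_state)
            self.marked.add(new_state)
        self.current = new_state

    def coverage(self):
        # Coverage is the number of marked states.
        return len(self.marked)
```

A system that answers a stimulus by staying in an already-visited state adds no coverage, which is exactly the behavior the tester's strategy below must overcome.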
A key observation is that the tester has a strategy to increase coverage iff the current state is reachable. If the current state is unreachable, the system can prevent further marking by always choosing an unreachable successor state. Conversely, if the current state is reachable, the tester’s strategy is to always choose a reachable edge incident on the current state of minimal rank; this results in either movement of the system to a state of lower rank or marking of the new state. Since rank is bounded below by 0, some state is eventually marked. (In fact, this strategy is optimal in the sense of minimizing the upper bound on the number of moves before the next marking.)
To operationalize this test strategy, we need to maintain (under game moves) the ranks of reachable edges incident on the current state. The obvious decremental hypergraph reachability algorithm [2] maintains ranks for all states and edges in total time O(E + S · H), where E is the number of states marked, S is the total number of states, and H is the size of the hypergraph. However, since ranks can only increase, and this algorithm updates ranks in increasing rank order, we can delay updating ranks for edges and states whenever they exceed that of the current system state. We can also delay adding to the graph edges incident on unmarked states (“dead” edges). This improves the worst-case complexity to O(E + R · H′), where R is the maximum rank assumed by the system state prior to termination (i.e., the maximum number of stimuli between explorations), and H′ is the total size of the live edges. In the worst case R is E, but in practice, R grows much more slowly in E; for example, for random graphs of bounded degree, R is O(log(E)). In experiments testing conformance of the Microsoft Hypervisor to its functional specification [3], the hypergraph computation is dominated by test instrumentation, even on tests with over 10^5 states.
There are several potentially useful transformations of the test hypergraph. First,
other kinds of test coverage (edge coverage, branch coverage, etc.) can be obtained using
standard techniques. Second, in the common case where an operation can fail without
a state change, the head of the hyperedge appears in the tail, which makes the edge
unusable in the strategy above. We typically want to allow such an edge to be used as
if the failure were impossible, at least until
…(Full text truncated)…