The Constrained Virtual Steiner Arborescence Problem: Formal Definition, Single-Commodity Integer Programming Formulation and Computational Evaluation
As the Internet becomes more virtualized and software-defined, new functionality is introduced in the network core: the distributed resources available in ISP central offices, universal nodes, or datacenter middleboxes can be used to process (e.g., filter, aggregate or duplicate) data. Based on this new networking paradigm, we formulate the Constrained Virtual Steiner Arborescence Problem (CVSAP) which asks for optimal locations to perform In-network processing, in order to jointly minimize processing costs and network traffic while respecting link and node capacities. We prove that CVSAP cannot be approximated (unless P = NP), and accordingly, develop the exact algorithm VirtuCast to compute (near) optimal solutions. VirtuCast consists of: (1) a compact single-commodity flow Integer Programming (IP) formulation; (2) a flow decomposition algorithm to reconstruct individual routes from the IP solution. The compactness of the IP formulation allows for computing lower bounds even on large instances quickly, speeding up the algorithm. We rigorously prove VirtuCast’s correctness. To complement our theoretical findings, we have implemented VirtuCast and present an extensive computational evaluation, showing that using VirtuCast realistically sized instances can be solved (close to) optimality. We show that VirtuCast significantly improves upon naive multi-commodity formulations and also initiate the study of primal heuristics to generate feasible solutions during the branch-and-bound process.
💡 Research Summary
The paper addresses a novel optimization problem that arises in modern virtualized and software‑defined networks: the Constrained Virtual Steiner Arborescence Problem (CVSAP). In such environments, distributed processing resources (e.g., middleboxes, micro‑datacenters) can be leased at arbitrary network nodes to perform in‑network functions such as filtering, aggregation, or duplication. CVSAP asks which nodes should be activated as processing sites and how terminals (or a single source) should be connected to a designated root so that the total cost—comprising both the traffic cost on physical links and the activation cost of processing nodes—is minimized while respecting link capacities and node degree limits.
The authors first prove that deciding feasibility of CVSAP is NP‑complete via a reduction from Set Cover, and consequently no polynomial‑time approximation algorithm exists unless P = NP. This motivates the development of an exact algorithm, VirtuCast, which can also provide high‑quality feasible solutions during the solving process.
VirtuCast consists of two main components:
-
A compact single‑commodity integer programming (IP) formulation (IP‑A‑CVSAP).
An extended graph is built by adding a super‑source o⁺ and two super‑sinks (one for the root, one for all activated Steiner nodes). Variables xₑ represent the amount of flow on each physical edge, while binary variables yₛ indicate whether Steiner site s is activated. The model enforces flow conservation, supplies/demands at the root and activated sites, link capacity limits, and couples activation decisions with flow through big‑M constraints. Because only a single commodity is used, the number of variables grows linearly with the number of physical edges and Steiner sites, dramatically smaller than a naïve multi‑commodity model. -
A flow‑decomposition algorithm that converts the IP solution into a Virtual Arborescence.
The single‑commodity flow may contain cycles, but the algorithm repeatedly extracts a maximal‑flow path from the super‑source to a super‑sink, maps this path to a logical edge (u, v) in the arborescence, and records the underlying physical route π(u, v). After each extraction the residual flow is updated, and the process continues until all flow is assigned. The authors prove that this procedure always yields a feasible Virtual Arborescence satisfying the degree constraints of CVSAP, even when processing nodes are nested (i.e., a processing node forwards aggregated data to another processing node).
During the branch‑and‑bound search, a primal heuristic is invoked on every LP relaxation. It uses the fractional yₛ values to guess a set of activated Steiner nodes, then greedily routes the required flow, producing a feasible solution instantly. This heuristic dramatically narrows the optimality gap early in the search.
The experimental evaluation covers synthetic grid topologies, realistic ISP backbones, and varying problem sizes (up to 300 nodes, 100 terminals). Results show that the single‑commodity IP solves instances an order of magnitude faster than the multi‑commodity counterpart, often delivering optimal or near‑optimal solutions (≤ 1 % gap) within minutes. The primal heuristic alone reaches within 5 % of optimality in the first few minutes of the search. Memory consumption is also far lower, enabling the solution of larger instances that would be infeasible for multi‑commodity formulations.
In conclusion, the paper introduces a rigorous graph‑theoretic model for joint traffic‑and‑processing optimization in virtualized networks, proves its computational hardness, and provides a practically efficient exact algorithm. VirtuCast’s compact IP model, provably correct decomposition, and effective heuristic together make it a strong candidate for deployment in SDN controllers or network orchestration platforms that need to place in‑network functions optimally. Future work may extend the model to multiple roots, dynamic traffic patterns, or distributed solution techniques.
Comments & Academic Discussion
Loading comments...
Leave a Comment