Next Article in Journal
Flight Schedule Problem Optimization Based on Discrete Memory-Enhanced Restructured Particle Swarm Optimization Algorithm
Previous Article in Journal
Stroke2Font: A Hierarchical Vector Model with AI-Driven Optimization for Chinese Font Generation
Previous Article in Special Issue
Heuristic Conductance-Aware Local Clustering for Heterogeneous Hypergraphs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Steiner Tree Approximations in Graphs and Hypergraphs

by
Miklós Molnár
1,* and
Basma Mostafa Hassan
2,3
1
Laboratoire d’Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM), University of Montpellier, Centre National de la Recherche Scientifique (CNRS), 161 Rue Ada, 34095 Montpellier Cedex 5, France
2
Operations Research Department, Faculty of Computers & Artificial Intelligence, Cairo University, Giza 12613, Egypt
3
Faculty of Artificial Intelligence & Informatics, Horus University, New Damietta 34518, Egypt
*
Author to whom correspondence should be addressed.
Algorithms 2026, 19(3), 232; https://doi.org/10.3390/a19030232
Submission received: 31 January 2026 / Revised: 12 March 2026 / Accepted: 13 March 2026 / Published: 19 March 2026
(This article belongs to the Special Issue Graph and Hypergraph Algorithms and Applications)

Abstract

The construction of partial minimum spanning trees is an NP-hard problem, leading to the development of various heuristic algorithms. Existing heuristics, including Kruskal’s algorithm, frequently employ shortest paths to connect tree components. This study introduces an approximate algorithm for constructing the minimum Steiner tree, which serves as the optimal structure for diffusion multicast. The proposed approach utilizes graph-based structures that provide advantages over conventional shortest-path methods. The algorithm incorporates connections analogous to those in simple Steiner trees when required. These simple trees are represented by hyperedges, and a Hyper Metric Closure can also be applied. Experimental results indicate that this hypergraph-based method enables constructions that more closely approximate the optimal Steiner tree cost compared to traditional pairwise techniques, offering a scalable balance between computational complexity and routing efficiency.

1. Introduction and Related Work

The Steiner problem is one of the most studied problems in combinatorial optimization. In its Euclidean formulation, the minimum-cost connection problem for a set of points is of significant interest in several logistical decision problems. In the design of microcircuits and electronic boards, the optimal interconnection of elements is governed by the rectilinear Steiner tree and Steiner forest problems. Furthermore, a set of constraints can be represented as a graph, where edges indicate possible connections; such a model corresponds to network topologies in which a multicast group must be established at minimal cost. In our study, this graph-theoretic version is the focus.
Definition 1 
(The Steiner Tree Problem). Given a connected, undirected graph G = ( V , E ) with edge weights w : E R + and a terminal set M V , the objective is to find an acyclic subgraph T G such that M V T and the total cost e E T w ( e ) is minimized [1].
If M = V , the task becomes finding a minimum spanning tree (MST), which efficient algorithms from Kruskal [2] and Prim [3] can compute. When M is a proper subset of V, the Steiner problem is NP-hard. This distinction highlights the range of algorithmic challenges discussed in the literature. Fundamental definitions are provided in the Appendix A.
Traditional heuristics for approximate partial minimum spanning trees typically run in polynomial time. Kruskal’s algorithm applies shortest paths between group members. Takahashi and Matsuyama [4] build on Prim’s algorithm and also use shortest paths. Kou et al. [5] formulate the problem in the metric closure. All these algorithms yield 2-approximations of the minimum Steiner trees. However, heuristics that rely solely on shortest paths suffer from a severe structural limitation: they are inherently pairwise. By considering only distances between two nodes at a time, Kruskal and Takahashi–Matsuyama-based approaches overlook geometric opportunities to simultaneously unite multi-node clusters through central, non-terminal nodes (Steiner points). Models based on hypergraphs or k-limited Steiner trees inherently capture these multi-way synergies, potentially improving coverage by embedding local optimal hubs directly into the routing options.
Building on these ideas, Zelikovsky proposed an 11/6-approximation using a greedy strategy [6]. This proposal is also based on the metric closure but improves the simple MST approach by examining small Steiner trees for triplets of members. Let M S T ( M ) be the MST of the group M in the metric closure. For each triplet z, the minimum Steiner tree T ( z ) is computed in the original graph. The central node c ( z ) of T ( z ) is then considered a new node to cover if the gain given in (1) is positive and maximal.
gain ( z ) = mst ( M ) mst ( M ) d ( z )
where m s t ( M ) is the length of M S T ( M ) , m s t ( M ) is the length of the MST in the modified metric closure after the contraction of the nodes in z, and  d ( z ) is the length of the minimal Steiner tree covering z. By examining all triplets and adding appropriate Steiner points to the metric closure, the problem is approximated on an augmented graph, resulting in a 3-restricted tree.
Later studies by Zelikovsky proposed the relative greedy heuristic (RGH), with a unique selection function. They introduced a general objective function f ( T ) . Karpinski and Zelikovsky also proposed efficient methods for pre-analyzing the gains and losses of tree parts. This preprocessing identifies useful full components and includes their Steiner points as nodes to cover. More general greedy algorithms have also been developed. Du et al. proposed algorithms for the Euclidean Steiner problem using k-tuples [7], with the graph-related version appearing in [8]. It is also possible to progressively construct a tree spanning M by greedily adding nodes to M.
Let z V be a new node and let T ( z ) be a Steiner tree connecting z to the MST by k nodes (cf. Figure 1). Adding T ( z ) to M S T ( M ) creates k 1 cycles, allowing k 1 edges of the metric closure to be revised for a new MST computation.
An improved approximation factor for a special case of the optimal Steiner tree problem is given in [9]. The solution is an LP-based approximation algorithm based on the iterative randomized rounding technique. k-restricted Steiner trees are investigated, with a directed-component cut relaxation for the problem. The solution costs at most ln(4) times that of the optimal k-restricted Steiner tree.
Beyond traditional LP and greedy approaches, modern research has explored diverse computational paradigms. For instance, Chitty et al. recently compared the performance of quantum annealers against traditional metaheuristic methods for solving the Steiner problem [10]. Similarly, the efficiency of machine learning has been demonstrated through ’learning-to-prune’ frameworks, which provide near-optimal solutions with significantly reduced computational overhead [11].
The practical utility of these models continues to expand into critical infrastructure planning. A recent study analyzed the expansion of unbalanced three-phase electric distribution systems, utilizing Steiner tree topologies to minimize total annualized electrification costs [12]. Furthermore, while our current study focuses on static network topologies, recent work by Raikwar et al. has introduced dynamic algorithms for maintaining approximate Steiner trees in graphs whose edge set changes over time [13]. Theoretical foundations for connectivity in hypergraph-based models also continue to evolve, particularly regarding S-Steiner hyper-trees and node reachability in oriented structures.
A large online compendium on the approximability of the Steiner Tree-related optimization problems can be found at [14]. An in-depth study of the Steiner problem in graphs, including descriptions of the most important exact solutions, heuristics, and approximations, is presented in [15].
The main contributions of this paper are as follows:
  • Generalization of Heuristic Frameworks: While traditional Steiner tree approximations typically rely on pairwise shortest-path algorithms, we propose a methodological extension that utilizes small Steiner trees as fundamental building blocks in place of standard paths.
  • Enhanced Metric Closure Models: We introduce a generalized distance graph that transcends the standard shortest-path-based metric closure by incorporating k-limited Steiner trees to better capture multi-node synergies.
  • Optimal k-limited Approximation: We present an analytical framework for optimal Steiner tree approximation based on k-limited components, providing a structural alternative to classical pairwise techniques.

2. Hypergraphs and Steiner Trees

Warme proposed a hypergraph-based model to address the concatenation of full components in the Euclidean Steiner problem [16,17]. In this approach, discovered full components on a set of points are represented as hyperedges. The optimal global Steiner tree is obtained by solving the minimum spanning tree (MST) problem on the constructed hypergraph. Warme also demonstrated that finding the MST in a hypergraph is NP-hard.
A Steiner tree in a graph can be partitioned into edge-disjoint, connected full components by splitting at internal terminals. Each full component corresponds to a hyperedge, resulting in a hypergraph representation of the Steiner tree problem. As shown in [18], this problem is related to the MST problem in weighted 3-uniform hypergraphs, and a fully polynomial randomized approximation scheme is presented.
In [19], a Steiner tree representation of hyperedges is regarded as the edge representation of a hypergraph. The metric closure, or distance graph, is frequently used to solve the Steiner tree problem, as discussed in [20]. This study examines the relationship between the Steiner problem in graphs and the MST problem in hypergraphs, and introduces a branch-and-cut method based on the linear relaxation of the integer-programming formulation.
Polzin and Vahdati Daneshmand [21] compare various hypergraph-based relaxations with those derived from the traditional graph formulation of the Steiner problem. Their analysis suggests that the union of trees connected by hyperedges forms a graph, and the final concatenation is resolved by solving the classical Steiner problem on this structure.
Small Steiner trees corresponding to hyperedges can connect existing subtrees or link isolated members to trees covering a multicast group. Preliminary propositions for these concepts are presented in [22,23]. Theoretical foundations for these ideas are further developed in the following sections.

3. Bases of the Proposed Solutions

Approximate Steiner trees may be constructed by concatenating smaller, optimal components. The following property of optimal Steiner trees supports this method, even when the components are not full components, that is, subtrees in which all leaves are terminals.
Property 1 
(Sub-optimality of Steiner Trees). Let T = ( V T , E , T ) be a minimum Steiner tree (MST) for the terminal set M. Let S T = ( V S , E , S ) be a subtree of T delimited by a set of boundary nodes (leaves) L S V T . If  M S = V S M represents the terminals within this subtree, then S T must be a minimum Steiner tree for the set L S M S .
Proof. 
The proof follows directly: assume S T is not minimal for L S M S , and a smaller tree S T exists. By substituting S T with S T , a Steiner tree smaller than T would be obtained, thereby contradicting the optimality of T.    □
This property serves as the foundation for constructing Steiner trees within a generalized metric closure. However, the optimal decomposition of a Steiner tree into smaller components remains generally unknown. Moreover, concatenating optimal small trees does not guarantee a globally optimal solution. For instance, a collection of shortest paths, each representing a 2-node Steiner tree, that covers M often produces a sub-optimal spanning structure. As shown in Figure 2, the minimum Steiner tree for { a , b , c , d } is T ( a , b , f , g , c , d ) with a length of 32, which notably excludes both the shortest paths between members and the local Steiner tree T ( a , b , e , d ) .
Greedy heuristics, including the Takahashi–Matsuyama and Kruskal-based approaches, construct approximate Steiner trees by iteratively merging terminals or components using shortest paths. Previous work [22,23] proposed improving these strategies by employing small, optimized Steiner trees as connection units rather than simple pairwise edges.

Connections and Distances

The process of connecting trees modifies the measured distance between them. Figure 3 illustrates a basic example where a group M of nodes is depicted by the filled nodes. Two distinct trees, T 1 and T 2 , each span separate subsets of M. The primary objective is to unify the two trees by constructing a common partial spanning tree of minimal length, when possible. To achieve optimal connectivity, various types of connections may be defined, with each connection representing a specific distance between the trees.
Definition 2 
(Optimal Connection of a Node to a Steiner Tree). Let T be a Steiner tree of the group M and n a node not in T. Let C ( l ) be a Steiner tree of n and l leaves in T, such that C ( l ) and T are edge-disjoint. Define G ( T , C ( l ) ) as the union of T and C ( l ) . In general, G ( T , C ( l ) ) does not necessarily form a tree and may contain cycles. Let D ( T , C ( l ) ) denote the set of edges with maximal length that can be removed from T without compromising the connectivity of M { n } . Define T ( T , C ( l ) ) as the resulting tree after removing D ( T , C ( l ) ) from T.
C * ( l ) is an optimal connection of l leaves between n and T if the length of T ( T , C * ( l ) ) is minimized. This length is referred to as the ( l 1 ) -distance of n from T, where the standard distance corresponds to the 0-distance. The optimal connection between n and T is achieved with l * leaves
l e n g t h T ( T , C * ( l * ) ) ) = min l l e n g t h ( T ( T , C * ( l ) )
The computational complexity of determining the optimal connection depends on the size of the Steiner tree. For a tree with n T nodes, the l leaves of C ( l ) can be positioned in n T l distinct ways. In some cases, multiple equivalent Steiner trees may correspond to the same “distance” value, indicating that the connection is not necessarily unique. This non-uniqueness also occurs in the context of shortest paths. Such connections and “distances” can also be defined for linking two Steiner trees.
Definition 3 
(Optimal Connection Between Two Steiner Trees). Let T 1 and T 2 be two Steiner trees covering M 1 and M 2 respectively; M = M 1 M 2 . Let C ( l ) denote a Steiner tree with l leaves, where its leaves are distributed between T 1 and T 2 . Define L 1 and L 2 as the sets of leaves of C ( l ) in T 1 and T 2 , respectively, with  L 1 and L 2 . Furthermore, C ( l ) , T 1 , and  T 2 are edge-disjoint. Let G ( T 1 , T 2 , C ( l ) ) be the union of T 1 , T 2 and C ( l ) . Let D ( T i , C ( l ) ) , i = 1 , 2 be the set of edges with maximal length that can be deleted from T i without loss of connectivity between the members of M i . Let T ( T 1 , T 2 , C ( l ) ) be the tree after removing D ( T i , C ( l ) ) from T i , i = 1 , 2 . The tree C * ( l ) represents an optimal connection of l leaves between T 1 and T 2 if the length of T ( T 1 , T 2 , C * ( l ) ) is minimized.
This length is referred to as the (l-2)-distance between two Steiner trees. According to this concept, the standard distance between two trees, corresponding to the shortest path, is the 0-distance. The 1-distance is obtained by using a 3-tree as the connecting structure. These types of connections are illustrated in Figure 3.
The optimal connection is not restricted to a specific number of leaves. The connection between T 1 and T 2 is achieved using l * leaves.
l e n g t h T ( T 1 , T 2 , C * ( l * ) ) = min l l e n g t h T ( T 1 , T 2 , C * ( l ) )
The computational complexity of determining the optimal connection between the Steiner trees depends on the sizes of the trees. Suppose the trees have n T 1 and n T 2 nodes. The l leaves of C ( l ) can be assigned in n T 1 + n T 2 l different ways. No connection occurs when all leaves are contained within the same tree. This situation arises in n T 1 l + n T 2 l cases, assuming l is less than both n T 1 and n T 2 .
The number of different connections is
N conn = n T 1 + n T 2 l n T 1 l + n T 2 l
For each connection, a corresponding set of edges can be removed from the trees. The sets D ( T i , C ( l ) ) of edges that can be removed from the respective Steiner trees form forests.
Figure 4 illustrates a Steiner tree T covering the nodes of a set M. This tree is to be connected to the remaining members using a tree C, resulting in a single tree. To avoid cycles, a portion of T must be removed. This portion lies within the subtree defined by the leaves of C and constitutes a forest. The following criterion aids in edge selection:
Let T i = ( V i , E i ) be a Steiner tree for M i V i , and C a connected tree. τ V i is the leaves of C in T i . Let A be a tree with maximum cost such that T i A contains at least one node from τ (a leaf of C). If  T i is connected to another element via C, there is a forest with maximum cost that can be removed from T i and contains A.

4. Hypergraph Model and Metric Closure

To formalize and optimize Steiner tree construction, a hypergraph-based model can be employed. This framework is particularly effective when specific sub-components or local Steiner trees for subsets of the terminal group M are precomputed or known. The objective is to achieve a cost-effective coverage of M using a limited set of these components.
Definition 4. 
[Hyperedge Representation of Steiner Components:] Let T s = ( V s , E s ) be a connected subtree (component) of a potential Steiner tree. This component is represented in the generalized metric closure as a hyperedge e s such that the set of nodes incident to e s corresponds to the nodes in V s . The cost of the hyperedge, denoted c ( e s ) , is defined as the sum of the weights of the edges in T s :
c ( e s ) = e E s w ( e )
Definition 5. 
[Coverage and Connectivity:] A tree T is considered covered by a set of hyperedges if every node in T belongs to at least one hyperedge. Connectivity in this model is defined as follows: two hyperedges are connected if they share at least one common node. To ensure global connectivity across a terminal set via hyperedges, there must exist a path of hyperedges between every pair of nodes. In specific configurations, this coverage may manifest as a chain (path) of hyperedges or a hyper-tree.
Definition 6. 
[Cost of Coverage:] The total cost of a coverage is defined as the sum of the costs of its constituent hyperedges.
Lemma 1. 
Let S ( T ) = { e k , k = 1 , , K } be a set of hyperedges covering a tree T. The total cost of S ( T ) is at least the cost of T:
c ( S ( T ) ) = k = 1 , , K c ( e k ) c ( T )
Proof. 
Each hyperedge represents a subtree of T. If we assume all nodes in T are covered but the total cost of T is strictly greater than the sum of the hyperedge costs, it implies at least one edge in T is not represented in any hyperedge. This would result in a loss of connectivity, contradicting the assumption that the hyperedges form a complete coverage of T.    □
Metric closure graphs are frequently used to determine the effective coverage of sets of nodes. Typically, the metric closure encodes the pairwise distances between nodes. The distance between two nodes is defined as the cost of the shortest path connecting them. Trivially, a shortest path is a hyperedge of size 2 (a 2-tree). The metric closure can be generalized to larger (but still limited) hyperedges.

4.1. Generalized and k-Limited Metric Closures

Traditional metric closures utilize shortest paths between node pairs (2-node hyperedges). This concept can be generalized by considering larger, albeit size-limited, hyperedges.
Definition 7. 
Let M be a set of nodes in a connected graph. For  k < | M | , the set of hyperedges representing Steiner trees with a size of at most k forms a k-limited metric closure. This generalized hypergraph includes standard shortest paths alongside hyperedges representing optimal trees for triplets, quadruplets, and higher-order tuples up to size k.
It is assumed that the intersection of hyperedges covering a tree T contains only one node per peer. This condition is necessary to minimize redundancies among the trees associated with the hyperedges. If the intersection of two hyperedges contains multiple nodes, then subgraphs defined by these nodes are present in the trees, resulting in unnecessary costs for effective coverage. Figure 5 illustrates some hyperedges in the generalized metric closure of a small group.

4.2. Optimal Coverage Using k-Limited Components

The procedure for finding the minimum-cost coverage using k-limited components involves two exact algorithms:
1.
Construction of the Hyper Metric Closure ( H M C ( k ) ): We generate hyperedges for every i-tuple in M (where 2 i k ), assigning the cost of the corresponding minimum Steiner tree to each hyperedge (cf. Algorithm 1).
2.
Optimal Hyperedge Selection: We identify the subset of connected hyperedges that covers M at the minimum total cost. This requires enumerating and evaluating connected subsets of H E (cf. Algorithm 2).
Algorithm 1 Hyper Metric Closure.
  • Require: M, k
  • Ensure:  H M C ( k ) = ( M , H E )
  •      H M C ( k ) =
  •     for  i = 2 to k do
  •         for each i-tuple s i M  do
  •                Compute the minimum Steiner tree S T of S i
  •                 w = c o s t ( S T )
  •                Add to H E a hyperedge on s i with cost w
  •         end for
  •     end for
  •             return  H M C ( k )
Algorithm 2 Optimal coverage with hyperedges.
  • Require:  H M C ( k ) = ( M , H E )
  • Ensure: H
  •      W max H M C ( k ) w                                                                      ▹ The maximum of costs
  •      B = W × | M |                                                              ▹ A sufficiently large upper bound
  •     for all  S H H E   do
  •             w H S H w                                                                         ▹ The cost of the subset
  •            if ( S H is connected) ∧ ( S H covers M) ∧ ( w H < B ) then
  •                 B w H
  •                 H S H
  •            end if
  •     end forreturn H
Complexity Note: The H M C ( k ) phase involves m i tuples for each i. Given that Steiner tree computation is NP-complete, several exact algorithms have been proposed to compute it. For instance, a branch-and-cut algorithm is proposed in [24]. The complexity is bounded by O ( i = 2 k m i C ( S T i ) ) , where C ( S T i ) is the cost of a Steiner tree calculation for i nodes. Finding the optimal coverage is significantly more expensive, as the number of hyperedge subsets to examine is 2 | H E | , where | H E | = i = 2 k m i .
This procedure does not yield the minimum-cost Steiner tree, but it produces the best tree that can be constructed using k-limited components. Moreover, the steps in this procedure to determine coverage are very expensive.

4.3. Greedy Strategy for k-Limited Components

To mitigate the computational intensity of the optimal approach, we propose a greedy heuristic (Algorithm 3) that builds the tree iteratively:
1.
Start with an arbitrary node and connect it to its k 1 nearest neighbors in M via a k-limited Steiner tree.
2.
From a leaf of the newly added hyperedge, select the next k 1 nearest uncovered terminals to form the subsequent hyperedge, creating a chain or tree-like structure until all terminals are covered.
Algorithm 3 Greedy algorithm for constructing spanning trees with hyperedges
  • Require: M, M C                                                                           ▹ The usual metric closure
  • Ensure: H                                                                        ▹ A set of hyperdeges covering M
  •      W M M
  •      n a node of W M                                                                                ▹ an arbitrary node
  •      W M W M { n }
  •     while  W M  do
  •            H E                                                                                                        ▹ A node set
  •           for  i = 1 to k do
  •                  n n closest node from n
  •                  H E H E { n n }
  •                  W M W M { n n }
  •                 if  W M =  then
  •                      break
  •                 end if
  •           end for
  •            H H H E
  •     end while
  •             return H
Complexity Note: Using the Floyd–Warshall algorithm, the metric closure can be created with time complexity O ( | V | 3 ) . Algorithm 3 is searching for the ( k 1 ) closest nodes for each node in M. This implies the complexity of O ( k | M | ) .
The selection of the starting point and subsequent steps significantly influences the resulting cost. Notably, an effective solution may form a tree rather than a chain. Since the problem is NP-hard, greedy approaches do not guarantee optimality. The following experiment was conducted to illustrate the performance of the proposed approach. Table 1 presents the results. In the known Geant network topology, random groups of different sizes have been generated. Group sizes are shown in the first column. Varying the value of k, coverages using chains of k-limited hyperedges have been computed. The values of k are in the second column. The cost of the trees determined by the hyperedges is in the 4th column. The connected hyperedges contain some edges more than once. The number of duplicated edges is indicated in the 5th column. Using an exact algorithm, the optimal solutions are the partial minimum spanning trees (minimum Steiner trees). The last two columns show the ratios of the costs of the approximated “trees” to the optimums with and without duplicated edges (reduced ratios), respectively.
Experimental results (Table 1) demonstrate that as the hyperedge limit k increases, the cost of the resulting approximate tree approaches the optimal minimum Steiner tree (PMST). While greedy decisions do not guarantee global optimality, the reduced ratio (after eliminating duplicated edges) indicates significant performance improvements over standard pairwise heuristics.

4.4. Practical Applications

The principles underlying these algorithms have direct applicability in contemporary Internet of Things (IoT) networking scenarios:
  • Industrial IoT (IIoT) Multicast Routing: In dense wireless sensor networks deployed in industrial environments, frequent topology changes and energy constraints render the computation of exact global routing arrays computationally prohibitive. The use of k-limited hyperedge approximations facilitates decentralized construction of efficient multicast trees, such as those required for coordinated firmware updates or synchronized actuator commands, centered around key sensor cluster-heads. This approach significantly reduces evaluation time and extends overall network battery life.
  • Deadlock Resolution in Ad Hoc IoT Meshes: Addressing circular buffer dependencies in randomized IoT mesh topologies requires the construction of acyclic overlays. By applying the k-limited connectivity metric, spanning structures can be generated that map across critical bottleneck endpoint nodes, thereby resolving ring-topology deadlock conditions without necessitating a complete hierarchical recalculation.

5. Experimental Evaluation

To address the practical limitations of computing exact k-limited Steiner trees (which are NP-hard even for small subsets), we conducted a series of computational experiments. The primary objective is to evaluate the empirical execution time of the Hyper Metric Closure (HMC), the optimal coverage, and the greedy heuristic proposed in Section 4, as well as to measure the approximation ratio of the heuristic compared to the absolute exact Steiner tree baseline.

5.1. Methodology and Simulation Environment

The algorithms were implemented in Python 3 using the NetworkX library for core graph operations, including generating random graph instances and computing shortest-path metric closures. The testing environment utilized a standard modern CPU. We generated purely structural Erdős–Rényi topologies ( p = 0.4 ) to simulate dense, unoptimized networks. For each test instance, edge weights were assigned uniformly at random within the interval [ 1 , 20 ] , and a specified subset of nodes was randomly designated as the multicast group M. The parameter k was strictly set to k = 3 (forming 3-trees) based on the theoretical intractability for higher limits.

5.2. Results and Discussion

Table 2 presents the execution time (in seconds) for generating the k = 3 Hyper Metric Closure (Algorithm 1) and the Greedy Coverage heuristic (Algorithm 3). It also tracks the approximation ratio of the greedy solution relative to the absolute minimum Steiner tree cost.
The results explicitly highlight the combinatorial explosion discussed in the theoretical sections. As the graph size strictly increases from | V | = 12 to | V | = 14 (with | M | = 5 ), the time required to build the foundational Hyper Metric Closure (Algorithm 1) skyrockets from approximately 14 s to over 44 s. Evaluating the absolute optimal coverage across these hyperedges (Algorithm 2) triggers exponential stalling, freezing computations, thus validating the mathematical limit. Conversely, the greedy strategy (Algorithm 3) avoids enumerating large numbers of subsets by iteratively combining local geometric metrics. It executed in under 3.5 s for the largest test and produced trees within 1.11 of the theoretically exact, NP-hard minimum, occasionally identifying the optimal tree.

6. Conclusions

In this work, we provide an overview of the most well-known heuristics for solving the Steiner problem in graphs. A good part of the algorithms is based on shortest-path algorithms. In some cases, the constructed trees can be improved by adding small components, namely, partial spanning trees. In these cases, the edge set must be revised; some edges may be removed. Recently, hypergraph-based construction has been proposed. Partial spanning trees can be represented by hyperedges. They can be concatenated using greedy algorithms, or a Hyper Metric Closure can be constructed, after which an approximate solution can be found using it. In our study, we tested the effect of applying hyperedges to Steiner tree approximations. The results are encouraging; the approximated trees approach the optimum. The method’s counterpart is its high computational complexity. Only small Steiner trees can be computed within a reasonable time.
The construction of efficient heuristics for the aforementioned NP-hard optimization is a challenging goal. The “distance” based on Steiner trees used in our paper requires further analysis. The generalized metric closure should also be analyzed, and its properties extended.

Author Contributions

M.M.: conceptualization, formal analysis, investigation, methodology, writing—original draft preparation; B.M.H.: formal analysis, investigation, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding and the APC was funded by MDPI.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Definitions and Related Notions

In this section, commonly referenced concepts and definitions are organized systematically for clarity and ease of reference.
Definition A1 
(Steiner nodes or S-nodes). A Steiner tree T spanning a given node set M can contain nodes in V which are not in M. These are called Steiner nodes (or Steiner points). In this discussion, we refer to S-nodes as nodes with degree greater than 2 in the Steiner tree.
Definition A2 
(Metrical closure). Let M V be a set of nodes in G. Let G M = ( M , D ) be a complete graph on M where the length of an edge d ( a , b ) D corresponds to the length of the shortest path between a and b in G.
Definition A3 
(Full Steiner tree [5]). A Steiner tree T spanning M is full if the internal nodes of T are not in M.
Definition A4 
(k-tree [5]). A k-tree (k-restricted or k-limited Steiner tree) is a full Steiner tree with at most k leaves. A 3-tree is a star.
Definition A5 
(Partition into full components [5]). A Steiner tree of M can always be partitioned into a set of components, which are adjacent, edge-disjoint full Steiner trees sharing leaves in M (cf. Figure A1).
Figure A1. Partition of a Steiner tree into full components (Definition A5).
Figure A1. Partition of a Steiner tree into full components (Definition A5).
Algorithms 19 00232 g0a1
Definition A6 
(k-restricted Steiner tree [1]). A Steiner tree is k-restricted if the components in its partition are k-trees.
Definition A7 
(M-covered forest in a k-tree [6]). Let K be a k-tree of M and let V ( K ) be the set of branching nodes in K. A forest F K is called M-covered if for every branching node v V ( K ) , there is a path in F linking v with an element of M (cf. Figure A2).
Figure A2. M-covered forest in a k-tree (Definition A7).
Figure A2. M-covered forest in a k-tree (Definition A7).
Algorithms 19 00232 g0a2
Definition A8 
(Minimum M-covered forest [6]). In a k-tree, the M-covered forest with minimum cost is denoted as l o s s ( K ) . It contains a path from each Steiner point of K to one of its terminals.
Definition A9 
(Terminal Steiner tree [25]). A Steiner tree that contains only terminal nodes (nodes in M) is a terminal Steiner tree.
Definition A10 
(Gain related to a component H [25]). Let T be a terminal Steiner tree and H be a component connected to a subset of nodes in T. The gain of H relative to T is:
g a i n T ( H ) = d ( R H ) d ( H )
where R H is the set of edges of maximal length that can be removed from T while preserving the Steiner tree property after the union of H and T (cf. Figure A3).
Figure A3. Saving and loss related to a component H (Definitions A3–A5).
Figure A3. Saving and loss related to a component H (Definitions A3–A5).
Algorithms 19 00232 g0a3
Definition A11 
(Saving of H). The saving of H related to T is:
s a v i n g T ( H ) = d ( T ) d ( H ) d ( T [ H ] )
where T [ H ] is the minimum length forest spanning terminals in T not covered by H.
Definition A12 
(Loss of a component [25]). The loss ( l o s s ( H ) ) of a component H is the length of a minimum M-covered forest in H.
Definition A13 
( α -relative gain). The α-relative gain of a full component H is defined as:
g a i n k ( H , α ) = d ( H ) α · l o s s ( H )
Definition A14 
(Loss contracted Full component [25]). Let H be a full component. Each tree in l o s s ( H ) after contraction corresponds to a composed node considered as a terminal. The contracted component C [ H ] covers the terminals in H (cf. Figure A4).
Figure A4. Loss contracted full component C [ H ] (Definition A14).
Figure A4. Loss contracted full component C [ H ] (Definition A14).
Algorithms 19 00232 g0a4

References

  1. Hwang, F.; Richards, D.; Winter, P. The Steiner Tree Problem; Annals of discrete mathematics; North Holland: Amsterdam, The Netherlands, 1992; Volume 53. [Google Scholar]
  2. Kruskal, J.B. On the Shortest Spanning Subtree of a Graph and the Traveling Salesman Problem. Proc. Am. Math. Soc. 1956, 7, 48–50. [Google Scholar] [CrossRef]
  3. Prim, R.C. Shortest connection networks and some generalizations. Bell Syst. Tech. J. 1957, 36, 1389–1401. [Google Scholar] [CrossRef]
  4. Takahashi, H.; Matsuyama, A. An approximate solution for the Steiner problem in graphs. Math. Jpn. 1981, 24, 573–577. [Google Scholar]
  5. Kou, L.; Markowsky, G.; Berman, L. A fast algorithm for Steiner trees. Acta Inform. 1981, 15, 141–145. [Google Scholar] [CrossRef]
  6. Zelikovsky, A. An 11/6-Approximation Algorithm for the Steiner Problem on Graphs. In Fourth Czechoslovakian Symposium on Combinatorics, Graphs and Complexity; Neŝetril, J., Fiedler, M., Eds.; Annals of Discrete Mathematics; Elsevier: Amsterdam, The Netherlands, 1992; Volume 51, pp. 351–354. [Google Scholar] [CrossRef]
  7. Du, D.Z.; Zhang, Y.; Feng, Q. On better heuristic for Euclidean Steiner minimum trees. In Proceedings of the 1991 32nd Annual Symposium of Foundations of Computer Science; IEEE: New York, NY, USA, 1991; pp. 431–439. [Google Scholar] [CrossRef]
  8. Du, D.Z.; Zhang, Y. On better heuristics for Steiner minimum trees. Math. Program. 1992, 57, 193–202. [Google Scholar] [CrossRef]
  9. Byrka, J.; Grandoni, F.; Rothvoß, T.; Sanità, L. An improved LP-based approximation for Steiner tree. In Proceedings of the Forty-Second ACM Symposium on Theory of Computing, STOC ’10; Association for Computing Machinery: New York, NY, USA, 2010; pp. 583–592. [Google Scholar] [CrossRef]
  10. Chitty, D.M.; Charles, J.; Moraglio, A.; Keedwell, E. Comparing Quantum Annealer and Metaheuristic Methods to Solve the Steiner Tree Problem. In Proceedings of the Artificial Evolution; Legrand, P., Liefooghe, A., Lepagnot, J., Monmarché, N., Idoumghar, L., Pallez, D., Siarry, P., Lutton, E., Eds.; Springer: Berlin/Heidelberg, Germany, 2026; pp. 91–106. [Google Scholar]
  11. Zhang, J.; Ajwani, D. Learning to Prune Instances of Steiner Tree Problem in Graphs. In Proceedings of the 2022 International Network Optimization Conference; OpenProceedings.org: Konstanz, Germany, 2022. [Google Scholar]
  12. Riaño-Enciso, L.M.; Cortés-Caicedo, B.; Montoya, O.D.; Grisales-Noreña, L.F.; Hernández, J.C. Sustainable Metaheuristic-Based Planning of Rural Medium- Voltage Grids: A Comparative Study of Spanning and Steiner Tree Topologies for Cost-Efficient Electrification. Sustainability 2025, 17, 8145. [Google Scholar] [CrossRef]
  13. Raikwar, H.; Sadharakiya, H.; Karmakar, S. Dynamic Algorithms for Approximate Steiner Trees. Concurr. Comput. Pract. Exp. 2025, 37, e70040. [Google Scholar] [CrossRef]
  14. University of Bonn. Steiner Tree Compendium. 2026. Available online: https://theory.cs.uni-bonn.de/info5/steinerkompendium/ (accessed on 11 March 2026).
  15. Ljubić, I. Solving Steiner trees: Recent advances, challenges, and perspectives. Networks 2021, 77, 177–204. [Google Scholar] [CrossRef]
  16. Warme, D. A new exact algorithm for rectilinear Steiner trees. In Proceedings of the 1997 Network Design: Connectivity and Facilities Location, DIMACS Series in Discrete Mathematics and Theoretical Computer Science; American Mathematical Society: Providence, RI, USA, 1998; Volume 40, pp. 357–395. [Google Scholar]
  17. Warme, D.M. Spanning Trees in Hypergraphs with Applications to Steiner Trees. Ph.D. Thesis, University of Virginia, Charlottesville, VA, USA, 1998. [Google Scholar] [CrossRef]
  18. Prömel, H.J.; Steger, A. A New Approximation Algorithm for the Steiner Tree Problem with Performance Ratio 5/3. J. Algorithms 2000, 36, 89–101. [Google Scholar] [CrossRef]
  19. Kaufmann, M.; van Kreveld, M.; Speckmann, B. Subdivision Drawings of Hypergraphs. In Graph Drawing; Tollis, I.G., Patrignani, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2009; pp. 396–407. [Google Scholar]
  20. Brazil, M.; Zachariasen, M. Steiner Trees in Graphs and Hypergraphs. In Optimal Interconnection Trees in the Plane: Theory, Algorithms and Applications; Springer International Publishing: Cham, Switzerland, 2015; pp. 301–317. [Google Scholar] [CrossRef]
  21. Polzin, T.; Daneshmand, S.V. On Steiner trees and minimum spanning trees in hypergraphs. Oper. Res. Lett. 2003, 31, 12–20. [Google Scholar] [CrossRef]
  22. Molnar, M. Construction D’arbres de Diffusion Multicast Basée sur une Modification de L’heuristique de Kruskal; Technical Report 3587; INRIA: Le Chesnay-Rocquencourt, France, 1998. [Google Scholar]
  23. Marie, R.; Molnar, M. Arbre Couvrant Partiel Pseudo-Optimal Pour Diffusion Multipoint; Technical Report 3636; INRIA: Le Chesnay-Rocquencourt, France, 1999. [Google Scholar]
  24. Stanojevic, M.J.; Vujosevic, M. An Exact Algorithm for Steiner Tree Problem on Graphs. Int. J. Comput. Commun. Control 2006, 1, 41–46. [Google Scholar] [CrossRef]
  25. Robins, G.; Zelikovsky, A. Tighter Bounds for Graph Steiner Tree Approximation. SIAM J. Discrete Math. 2005, 19, 122–134. [Google Scholar] [CrossRef]
Figure 1. Revision of some edges in the metric closure.
Figure 1. Revision of some edges in the metric closure.
Algorithms 19 00232 g001
Figure 2. Different trees spanning M.
Figure 2. Different trees spanning M.
Algorithms 19 00232 g002
Figure 3. Connections with k-trees and related distances between two Steiner trees.
Figure 3. Connections with k-trees and related distances between two Steiner trees.
Algorithms 19 00232 g003
Figure 4. Forest to remove from a tree T after adding another tree C.
Figure 4. Forest to remove from a tree T after adding another tree C.
Algorithms 19 00232 g004
Figure 5. Some Some hyperedges in a generalized metric closure.
Figure 5. Some Some hyperedges in a generalized metric closure.
Algorithms 19 00232 g005
Table 1. Coverage of trees with hyper chains.
Table 1. Coverage of trees with hyper chains.
GroupkPMST CostT CostNb Dup EdgeRatioRed. Ratio
10236.667.29.21.8418961.315899
33955.75.21.4280761.171327
434.655.45.61.6046331.211142
535.643.93.21.2305611.047492
15247.195.817.22.0408551.289131
347.685.812.91.8161391.222163
448.379.8101.661621.212692
547.472.59.51.5362411.13601
20255.7129.825.82.3565491.379486
355.4113.420.72.0551651.278535
455.598.5151.7721131.228471
556.394.5141.6868821.191532
25266152.132.22.3411951.329114
363.7126.4231.9951721.261589
463.5114.618.71.8038361.239137
564.110915.61.7077491.231063
Table 2. Execution time and approximation ratios ( k = 3 on Erdős–Rényi topologies).
Table 2. Execution time and approximation ratios ( k = 3 on Erdős–Rényi topologies).
NodesTerminalsAlgorithm 1Algorithm 3Greedy Approx.
( | V | )( | M | )Time (s)Time (s)Ratio
1041.3600.3061.11
12514.1300.9511.00 (Opt)
14544.5903.0501.08
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Molnár, M.; Hassan, B.M. Steiner Tree Approximations in Graphs and Hypergraphs. Algorithms 2026, 19, 232. https://doi.org/10.3390/a19030232

AMA Style

Molnár M, Hassan BM. Steiner Tree Approximations in Graphs and Hypergraphs. Algorithms. 2026; 19(3):232. https://doi.org/10.3390/a19030232

Chicago/Turabian Style

Molnár, Miklós, and Basma Mostafa Hassan. 2026. "Steiner Tree Approximations in Graphs and Hypergraphs" Algorithms 19, no. 3: 232. https://doi.org/10.3390/a19030232

APA Style

Molnár, M., & Hassan, B. M. (2026). Steiner Tree Approximations in Graphs and Hypergraphs. Algorithms, 19(3), 232. https://doi.org/10.3390/a19030232

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop