1. Introduction
Real-world problems in fields such as bioinformatics, logistics, telecommunications, and others are often modeled using graphs and solved by searching for the shortest paths between node pairs of the graph [1]. The shortest path problem has three main variations: finding the shortest path between two specific nodes, computing shortest paths from a single node to all other nodes (single-source shortest path, SSSP), and determining shortest paths between all node pairs (all-pairs shortest path, APSP). There are two fundamentally different approaches to solving the APSP problem. In the first, we solve the SSSP problem for each node in the graph and combine the results, while in the second, we simultaneously construct shortest paths for all node pairs of the graph. In the first approach, a Dijkstra-like algorithm is usually used for graphs with non-negative edge weights. In this paper, we restrict ourselves to graphs with non-negative edge weights, leaving the treatment of negative weights for future work.
Probably the best-known algorithm of the second kind is the Floyd–Warshall algorithm, which employs the dynamic programming technique. In both approaches, however, the key operation is relaxation, and the time complexity of the solution turns out to be linear in the number of relaxation attempts. Consequently, the first approach proves more appropriate for sparse graphs and the second for dense ones.
In this paper, we study the feasibility of the second approach on sparse graphs, particularly disconnected directed graphs. In our approach, we propose first identifying the strongly connected components of the graph, on each of which we then apply an APSP algorithm independently. This is reasonable, as within each strongly connected component, each pair of nodes is mutually reachable. To handle shortest paths between pairs of nodes in different components, we rely on a simple observation: if there is no path from an arbitrarily chosen node in the source component to an arbitrarily chosen node in the destination component, then relaxations can be safely skipped for all node pairs where the first node belongs to the source component and the second to the destination component.
The objective of this paper is first to present a new approach to solving the APSP problem and then to empirically evaluate it and compare it with existing algorithms. The empirical evaluation is performed on artificially generated graphs (the Erdős–Rényi method [2,3] and the Barabási–Albert method [4]) and on real-world graphs derived from practical datasets.
Section 2 reviews related work on all-pairs shortest path algorithms and is followed by basic definitions in Section 3. Our solution is presented in Section 4 and evaluated in Section 5. Section 6 summarizes the results and research insights.
2. Related Work
The Floyd–Warshall algorithm [5,6] is a classic APSP solution using dynamic programming, performing $n^3$ relaxation attempts on a graph with $n$ nodes. However, [7] shows that many relaxations are unnecessary, and for complete directed graphs with uniformly distributed weights, their modified version runs in expected time that is substantially subcubic. Similarly, SmartForce [8] achieves significant speedups by avoiding all but a small fraction of relaxation attempts. For graphs that are not complete or are sparse, the Rectangular Algorithm [9] improves performance by blocking the distance matrix and skipping inactive regions. The Improved Floyd–Warshall algorithm [10] further optimizes for sparse graphs by dynamically maintaining reachability lists and prioritizing nodes with low in-/out-degree products, reducing the number of iterations when the number of nodes in a graph exceeds the number of edges. All variants, however, retain the worst-case $O(n^3)$ time bound.
Another approach to solving the APSP problem is to solve the SSSP problem for each node in the graph (cf. [11]). For graphs with $m$ edges, all of which have non-negative weights, Dijkstra’s algorithm [12] solves the SSSP problem in $O(m + n \log n)$ time when a priority queue with an amortized $O(\log n)$ delete-min operation and $O(1)$ other operations is used (e.g., [13]). However, in practice, Dijkstra with pairing heaps [14], despite weaker theoretical bounds, performs better. Returning to the APSP problem, the SSSP-based approach gives an $O(nm + n^2 \log n)$ solution. Typically, on sparse graphs (e.g., $m = O(n)$), this approach is more efficient than Floyd–Warshall [15]; however, for $m = \Theta(n^2)$, it matches Floyd–Warshall’s $O(n^3)$. Finally, for complete graphs, the Hidden Paths algorithm [16] and the Uniform Paths algorithm [17,18] give an expected $O(n^2 \log n)$ time bound (uniformly distributed edge weights).
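For illustration, a minimal C++ sketch of the SSSP-based approach is given below, running Dijkstra's algorithm with a binary-heap priority queue (std::priority_queue) from every node. It is not the implementation evaluated later in this paper; the type and function names are illustrative choices.

```cpp
#include <cstdint>
#include <functional>
#include <limits>
#include <queue>
#include <utility>
#include <vector>

const int64_t INF = std::numeric_limits<int64_t>::max();

struct Edge { int to; int64_t w; };                // directed edge u -> to with weight w
using Graph = std::vector<std::vector<Edge>>;      // Graph[u] = outgoing edges of node u

// Dijkstra from a single source s; returns the distance to every node (INF if unreachable).
std::vector<int64_t> dijkstra(const Graph& g, int s) {
    std::vector<int64_t> dist(g.size(), INF);
    using Item = std::pair<int64_t, int>;          // (tentative distance, node)
    std::priority_queue<Item, std::vector<Item>, std::greater<Item>> pq;
    dist[s] = 0;
    pq.push({0, s});
    while (!pq.empty()) {
        auto [d, u] = pq.top();
        pq.pop();
        if (d > dist[u]) continue;                 // stale queue entry
        for (const Edge& e : g[u]) {
            if (dist[u] + e.w < dist[e.to]) {
                dist[e.to] = dist[u] + e.w;
                pq.push({dist[e.to], e.to});
            }
        }
    }
    return dist;
}

// APSP by one SSSP computation per node: O(n (n + m) log n) with a binary heap.
std::vector<std::vector<int64_t>> apsp_dijkstra(const Graph& g) {
    std::vector<std::vector<int64_t>> D;
    for (int s = 0; s < static_cast<int>(g.size()); ++s) D.push_back(dijkstra(g, s));
    return D;
}
```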
3. Preliminaries
A directed graph $G$ is an ordered pair $(V, E)$, where $V$ is a finite, non-empty set of nodes, and $E \subseteq V \times V$ is a set of directed edges. We assume that $V = \{1, 2, \dots, n\}$ for some integer $n$.
Each directed edge $e = (u, v)$ connects two nodes, called its end nodes, where $u$ is referred to as the initial node of $e$, denoted by $\mathrm{init}(e)$, and $v$ is referred to as the terminal node of $e$, denoted by $\mathrm{ter}(e)$.
A path of length $k$ in a directed graph is a sequence of pairwise distinct nodes $v_0, v_1, \dots, v_k$ such that for each $i$ where $0 \le i < k$, the directed edge $(v_i, v_{i+1}) \in E$.
A directed graph $G' = (V', E')$ is a sub-graph of a directed graph $G = (V, E)$ if $V' \subseteq V$ and $E' \subseteq E$.
A node $v$ is reachable from another node $u$ in a directed graph if there exists a path from $u$ to $v$. That is, there is a sequence of nodes $v_0, v_1, \dots, v_k$ such that $v_0 = u$, $v_k = v$, and for each $i$ where $0 \le i < k$, the directed edge $(v_i, v_{i+1})$ belongs to $E$.
A strongly connected component (SCC) of a directed graph $G$ is a maximal sub-graph $C$ of $G$ (maximal with respect to the number of nodes) such that for every pair of nodes $u, v \in C$, there exists a path from $u$ to $v$ and a path from $v$ to $u$, meaning the nodes are mutually reachable.
Given an SCC $C = (V_C, E_C)$ of $G = (V, E)$, the set of outgoing edges from $C$ is defined as $\mathrm{Out}(C) = \{\, e \in E \mid \mathrm{init}(e) \in V_C \text{ and } \mathrm{ter}(e) \notin V_C \,\}$.
A directed graph is a directed acyclic graph (DAG) if it contains no directed cycles. That is, there does not exist a sequence of distinct nodes $v_0, v_1, \dots, v_k$ such that $(v_i, v_{i+1}) \in E$ for each $0 \le i < k$ and $(v_k, v_0) \in E$.
A topological ordering of a DAG is a linear ordering of its nodes such that for every directed edge $(u, v)$, node $u$ appears before $v$ in the ordering.
A weighted digraph is a digraph $G = (V, E)$ together with a weight function $w \colon E \to \mathbb{R}_{\ge 0}$ that assigns each directed edge $e \in E$ a weight $w(e)$. A weight function $w$ can be extended to a path $P = v_0, v_1, \dots, v_k$ by $w(P) = \sum_{i=0}^{k-1} w((v_i, v_{i+1}))$. A shortest path from $s$ to $d$ is a path in $G$ whose weight is the infimum of the weights of all paths from $s$ to $d$. The distance between two nodes $s$ and $d$, denoted by $\delta(s, d)$, is the weight of a shortest path from $s$ to $d$ in $G$; if no such path exists, we set $\delta(s, d) = \infty$.
For simplicity, we will refer to directed graphs simply as graphs throughout the rest of this paper.
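As a small illustration of these definitions, the following sketch stores a weighted digraph as adjacency lists and extends the edge weight function $w$ to a path by summing edge weights; the types and names are illustrative choices rather than part of the paper's implementation.

```cpp
#include <cstdint>
#include <limits>
#include <vector>

struct Edge { int to; int64_t w; };                // directed edge u -> to with weight w
using Graph = std::vector<std::vector<Edge>>;      // Graph[u] = outgoing edges of node u

const int64_t INF = std::numeric_limits<int64_t>::max();

// w(P): sum of the edge weights along the node sequence P; INF if P is not a path in g.
int64_t path_weight(const Graph& g, const std::vector<int>& P) {
    int64_t total = 0;
    for (size_t i = 0; i + 1 < P.size(); ++i) {
        bool found = false;
        for (const Edge& e : g[P[i]]) {
            if (e.to == P[i + 1]) { total += e.w; found = true; break; }
        }
        if (!found) return INF;                    // consecutive nodes are not joined by an edge
    }
    return total;
}
```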
4. Our Solution
We propose a shortest-path algorithm that exploits the structure of graphs that are not strongly connected. The key idea is to decompose the graph into SCCs and process them in topological order, computing distances within each component and propagating paths across components only via outgoing edges.
4.1. Formal Definition of Shortest Path via SCCs
Let $G$ be a directed graph partitioned into $t$ strongly connected components $C_1, C_2, \dots, C_t$, ordered topologically such that there are no edges from $C_j$ to $C_i$ for $i < j$. Further, we denote the union $C_i \cup C_{i+1} \cup \dots \cup C_t$ by $C_{\ge i}$. Furthermore, for nodes $s, d \in V$, let $\delta_{C_{\ge i}}(s, d)$ denote the length of a shortest path from $s$ to $d$ containing only nodes from $C_{\ge i}$. We shorten the notation $\delta_{C_{\ge i}}(s, d)$ to $\delta^{\ge i}(s, d)$.
Corollary 1. The length of the shortest path in $G$ is $\delta(s, d) = \delta^{\ge 1}(s, d)$.
The following lemmata establish several structural properties of shortest paths with respect to the SCC decomposition, which will allow us to derive a recursive formulation for $\delta^{\ge i}(s, d)$.
Lemma 1. Let $C_i$ be an SCC and let $s, d \in C_i$. Then the shortest path from $s$ to $d$ containing only nodes from $C_{\ge i}$ lies entirely within $C_i$.
Proof. Assume, for contradiction, that the shortest path from $s$ to $d$ exits $C_i$, passing through a node $v \in C_{\ge i+1}$. Since $s$ can reach $v$, $v$ can reach $d$, and $d$ can reach $s$ (as $s$ and $d$ lie in the same SCC), by transitivity this implies that $v$ is part of the same SCC as $s$ and $d$. This contradicts the maximality of $C_i$. Therefore, the shortest path must remain inside $C_i$. □
Lemma 2. Let $s \in C_i$ and let $d \in C_{\ge i+1}$. Then the shortest path $P$ from $s$ to $d$ contains only nodes from $C_{\ge i}$ and is of the form
$$P = P_1 \circ e \circ P_2,$$
where $P_1$ is a shortest path from $s$ to $\mathrm{init}(e)$ containing only nodes from $C_i$, the edge $e$ is contained in $\mathrm{Out}(C_i)$, and $P_2$ is a shortest path from $\mathrm{ter}(e)$ to $d$ containing only nodes from $C_{\ge i+1}$. Proof. First, since the SCCs are topologically ordered, for every edge $e' \in E$ we have that $\mathrm{init}(e') \in C_j$ and $\mathrm{ter}(e') \in C_k$ where $j \le k$. The first consequence of this observation is that the shortest path $P$ from $s$ to $d$ contains only nodes from $C_{\ge i}$. The second consequence is that the only way to reach a later component from $C_i$ is to traverse an edge in $\mathrm{Out}(C_i)$, and hence $P$ traverses an edge $e \in \mathrm{Out}(C_i)$. Let $P_1$ denote the sub-path of $P$ from $s$ to $\mathrm{init}(e)$, and let $P_2$ denote the sub-path of $P$ from $\mathrm{ter}(e)$ to $d$. Since a sub-path of a shortest path is also a shortest path, both $P_1$ and $P_2$ are shortest paths. Next, since $P_1$ starts and ends in $C_i$, by Lemma 1 we have that $P_1$ contains only nodes from $C_i$. Moreover, since $P_2$ starts outside the component $C_i$, by Lemma 4 we have that $P_2$ lies entirely within $C_{\ge i+1}$. This completes the proof. □
Remark 1. Lemma 2 defines the structure of a shortest path $P$, which permits us to perform relaxations only over the edges in $\mathrm{Out}(C_i)$.
Lemma 3. Let $s \in C_{\ge i+1}$ and let $d \in C_i$. Then $\delta^{\ge i}(s, d) = \infty$.
Proof. Since $s \in C_j$ for some $j \ge i+1$, and the SCCs are topologically ordered, no node in $C_i$ is reachable from $s$. Hence, there is no path from $s$ to $d$, and so $\delta^{\ge i}(s, d) = \infty$. □
Lemma 4. Let $s$ and $d$ be nodes in $C_{\ge i+1}$. Then the shortest path from $s$ to $d$ containing only nodes from $C_{\ge i}$ lies entirely within $C_{\ge i+1}$.
Proof. Since $s \in C_j$ for some $j \ge i+1$, and the SCCs are topologically ordered, no node in $C_i$ is reachable from $s$. Hence, there is no path from $s$ that enters the component $C_i$. Consequently, the shortest path from $s$ to $d$ containing only nodes from $C_{\ge i}$ lies entirely within $C_{\ge i+1}$. □
We now summarize these observations into the following theorem.
Theorem 1. Let $s$ and $d$ be nodes in $C_{\ge i}$; then, using the above assumptions and notation, the length of a shortest path can be computed by the recurrence
$$\delta^{\ge i}(s, d) = \begin{cases} \delta_{C_i}(s, d) & \text{if } s, d \in C_i, \\ \min_{e \in \mathrm{Out}(C_i)} \bigl\{\, \delta_{C_i}(s, \mathrm{init}(e)) + w(e) + \delta^{\ge i+1}(\mathrm{ter}(e), d) \,\bigr\} & \text{if } s \in C_i,\ d \in C_{\ge i+1}, \\ \infty & \text{if } s \in C_{\ge i+1},\ d \in C_i, \\ \delta^{\ge i+1}(s, d) & \text{if } s, d \in C_{\ge i+1}, \end{cases} \qquad (1)$$
where $\delta_{C_i}(s, d)$ denotes the distance from $s$ to $d$ within the sub-graph $C_i$ (Lemma 1).
4.2. Algorithm SCC
The algorithm SCC derived from Equation (1) is a classical dynamic programming algorithm, which is also reflected in its structure. A naïve implementation in Algorithm 1 starts with a call of the function SCC_Out on the input graph $G$, which returns a topologically sorted list of SCCs associated with the corresponding sets of outgoing edges (line 1). Next, we compute the base cases defined in the first line of Equation (1) (lines 2–4). The remaining distances are computed using the second line and the third line of Equation (1) (lines 6–16). The last line of Equation (1) is reflected in line 6 of Algorithm 1. Note that the outer iteration loop goes over all destination nodes and the inner over all source nodes, which is counter-intuitive, but will prove useful later.
| Algorithm 1 Naïve APSP algorithm on input graph G using SCC decomposition |
| 1: $(C_1, \mathrm{Out}(C_1)), \dots, (C_t, \mathrm{Out}(C_t)) \leftarrow \mathrm{SCC\_Out}(G)$ | |
| 2: for $i \leftarrow 1$ to $t$ do | |
| 3: APSP($C_i$) | {Base case for each component} |
| 4: end for | |
| 5: | {Process in reverse topological order} |
| 6: for $i \leftarrow t-1$ down to 1 do | |
| 7: for each node $d \in C_{\ge i+1}$ do | |
| 8: for each node $s \in C_i$ do | |
| 9: $D[s,d] \leftarrow \infty$ | {Initialize} |
| 10: for each edge $e \in \mathrm{Out}(C_i)$ do | |
| 11: $D[s,d] \leftarrow \min(D[s,d],\ D[s,\mathrm{init}(e)] + w(e) + D[\mathrm{ter}(e),d])$ | {Relaxation} |
| 12: end for | |
| 13: $D[d,s] \leftarrow \infty$ | {Third line of (1)} |
| 14: end for | |
| 15: end for | |
| 16: end for | |
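For concreteness, a C++ sketch of the main loops of Algorithm 1 is given below. It assumes that the SCC decomposition (in topological order), the outgoing-edge sets, and a distance matrix D prefilled with the intra-component base cases (and INF elsewhere) are already available; all names and types are illustrative, and the sketch follows the pseudocode above rather than the authors' original code.

```cpp
#include <algorithm>
#include <cstdint>
#include <limits>
#include <vector>

const int64_t INF = std::numeric_limits<int64_t>::max();

struct OutEdge { int from, to; int64_t w; };       // edge leaving a component: from in C_i, to outside

// comps[i] : nodes of component C_i (components in topological order, 0-based i = 0..t-1)
// out[i]   : Out(C_i), the outgoing edges of C_i
// D        : n x n matrix, prefilled with the intra-component distances (base case) and INF elsewhere
void scc_apsp_naive(const std::vector<std::vector<int>>& comps,
                    const std::vector<std::vector<OutEdge>>& out,
                    std::vector<std::vector<int64_t>>& D) {
    int t = static_cast<int>(comps.size());
    for (int i = t - 2; i >= 0; --i) {                      // reverse topological order
        for (int j = i + 1; j < t; ++j) {
            for (int d : comps[j]) {                        // destinations in later components
                for (int s : comps[i]) {                    // sources in C_i
                    int64_t best = INF;                     // initialize
                    for (const OutEdge& e : out[i]) {       // relax over all edges of Out(C_i)
                        if (D[s][e.from] == INF || D[e.to][d] == INF) continue;
                        best = std::min(best, D[s][e.from] + e.w + D[e.to][d]);
                    }
                    D[s][d] = best;
                    D[d][s] = INF;                          // d cannot reach s (already INF here)
                }
            }
        }
    }
}
```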
The time complexity of Algorithm 1 depends on the number of relaxations performed in line 11. Consequently, we want to avoid unnecessary relaxation attempts, which arise, in particular, when the destination node $d$ is not reachable from the source node $s$.
Such a situation occurs in Figure 1, where the destination nodes in two of the components are not reachable from the source nodes in a third component. Furthermore, the destination nodes in one component may be reachable from the source nodes of another component only through a single edge of its outgoing-edge set. We observe that if one node of a component $C_j$ is not reachable from the source node $s$ through an edge $e \in \mathrm{Out}(C_i)$, then no node in $C_j$ is reachable via $e$. This brings us to the final version of our algorithm presented in Algorithm 2. To iterate only over the $\mathrm{Out}(C_i)$-edges via which the nodes in $C_j$ are reachable, the structure $\mathrm{Out}_j$ is built in lines 8–14. The relaxation in line 19 remains the same as in Algorithm 1.
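A C++ sketch of this edge filter (corresponding to lines 8–14 of Algorithm 2) is given below; it assumes a distance matrix D in which unreachable pairs hold the value INF, and the types and names are illustrative.

```cpp
#include <cstdint>
#include <limits>
#include <vector>

const int64_t INF = std::numeric_limits<int64_t>::max();

struct OutEdge { int from, to; int64_t w; };

// Keep only those edges of Out(C_i) through which the nodes of C_j are reachable.
// Because C_j is strongly connected, testing a single arbitrary node of C_j suffices.
std::vector<OutEdge> reachable_out_edges(const std::vector<OutEdge>& out_i,
                                         const std::vector<int>& comp_j,
                                         const std::vector<std::vector<int64_t>>& D) {
    std::vector<OutEdge> filtered;
    int probe = comp_j.front();                    // an arbitrary node of C_j
    for (const OutEdge& e : out_i) {
        if (D[e.to][probe] != INF) {               // if the probe is reachable, all of C_j is
            filtered.push_back(e);
        }
    }
    return filtered;
}
```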
We conclude the section with an analysis of the space complexity. First, if $s \notin C_{\ge i}$ or $d \notin C_{\ge i}$, we extend the definition of $\delta^{\ge i}$ by setting $\delta^{\ge i}(s, d) = \delta(s, d)$. Second, if both $s, d \in C_{\ge i}$, then by Lemma 4 and Corollary 1 we have $\delta^{\ge i}(s, d) = \delta(s, d)$.
| Algorithm 2 APSP algorithm SCC on input graph G avoiding unnecessary relaxations |
| 1: $(C_1, \mathrm{Out}(C_1)), \dots, (C_t, \mathrm{Out}(C_t)) \leftarrow \mathrm{SCC\_Out}(G)$ | |
| 2: for $i \leftarrow 1$ to $t$ do | |
| 3: APSP($C_i$) | {Base case for each component} |
| 4: end for | |
| 5: | {Process in reverse topological order} |
| 6: for $i \leftarrow t-1$ down to 1 do | |
| 7: for $j \leftarrow i+1$ to $t$ do | |
| 8: $\mathrm{Out}_j \leftarrow \emptyset$ | {Collect only edges via which $C_j$ is reachable} |
| 9: $d' \leftarrow$ an arbitrary node in $C_j$ | |
| 10: for each edge $e \in \mathrm{Out}(C_i)$ do | |
| 11: if $D[\mathrm{ter}(e), d'] < \infty$ then | |
| 12: $\mathrm{Out}_j \leftarrow \mathrm{Out}_j \cup \{e\}$ | {All nodes in $C_j$ are reachable from $\mathrm{ter}(e)$} |
| 13: end if | |
| 14: end for | {$\mathrm{Out}_j$ consists of $\mathrm{Out}(C_i)$-edges via which nodes in $C_j$ are reachable} |
| 15: for each node $d \in C_j$ do | |
| 16: for each node $s \in C_i$ do | |
| 17: $D[s,d] \leftarrow \infty$ | {Initialize} |
| 18: for each edge $e \in \mathrm{Out}_j$ do | |
| 19: $D[s,d] \leftarrow \min(D[s,d],\ D[s,\mathrm{init}(e)] + w(e) + D[\mathrm{ter}(e),d])$ | {Relaxation} |
| 20: end for | |
| 21: $D[d,s] \leftarrow \infty$ | {Third line of (1)} |
| 22: end for | |
| 23: end for | |
| 24: end for | |
| 25: end for | |
Consequently, we can drop the exponent notation, yielding only $\delta(s, d)$, which in turn means that our algorithm needs to store only a single matrix of shortest path values. For this purpose, we can use a 2D array $D[s, d]$.
Notably, in the case of sparse graphs, the majority of entries in the distance matrix $D$ are ∞, corresponding to the unreachability of a node $d$ from a node $s$. To avoid storing these redundant entries, we, inspired by [19], adopt a more space-efficient storage of the matrix $D$: instead of a 2D array $D[\cdot,\cdot]$, we use a hashmap implementation of a dictionary. The two operations on the matrix $D$, assigning a value and reading a value, are implemented in a straightforward way. The assignment is implemented as a combination of insertion and update, while the value reading uses the find operation, which, in case of failure, returns ∞. Consequently, if the time complexity of our solution with a 2D array was $O(n^3)$ in the worst case, it stays the same, but now in the expected case. Moreover, we observe that since the value $D[s, d]$ is never increased, it is assigned the value ∞ only once. However, this assignment is already implicitly done at the beginning, since the pair $(s, d)$ is not inserted into the dictionary, and hence it takes no action and no time.
However, the space complexity changes from $\Theta(n^2)$ to $O(r)$, where $r$ is the number of pairs of nodes $s$ and $d$ such that $d$ is reachable from $s$. This is optimal.
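One possible realization of such a dictionary-based distance matrix is sketched below, using std::unordered_map keyed by the packed pair $(s, d)$; a failed find is interpreted as ∞, so entries for unreachable pairs are never stored. The class name and key encoding are illustrative choices, not taken from the paper's implementation.

```cpp
#include <cstdint>
#include <limits>
#include <unordered_map>

// Distance matrix stored as a dictionary: only finite entries are kept.
class SparseDistMatrix {
public:
    static constexpr int64_t INF = std::numeric_limits<int64_t>::max();

    // Read D[s,d]; a missing entry means the pair was never relaxed, i.e., distance INF.
    int64_t get(int s, int d) const {
        auto it = table_.find(key(s, d));
        return it == table_.end() ? INF : it->second;
    }

    // Write D[s,d]; insertion and update are the same operation on the hashmap.
    void set(int s, int d, int64_t value) {
        if (value == INF) return;                  // INF is represented implicitly by absence
        table_[key(s, d)] = value;
    }

private:
    static uint64_t key(int s, int d) {            // pack the pair (s, d) into one 64-bit key
        return (static_cast<uint64_t>(static_cast<uint32_t>(s)) << 32) |
               static_cast<uint32_t>(d);
    }
    std::unordered_map<uint64_t, int64_t> table_;
};
```

With this representation, reads and writes take expected constant time, so the expected running time matches the 2D-array variant, while the memory footprint is proportional to the number of stored (reachable) pairs.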
5. Evaluation
We empirically evaluated the SCC algorithm against other solutions that the literature recognizes as being among the most efficient. This section first introduces the graphs on which the evaluation was performed. The description of the test cases is followed by a brief description of the implementation, and the section concludes with the results and a brief discussion.
5.1. Test Cases
The tests were performed on generated graphs and on graphs taken from real cases. Furthermore, we used two different kinds of generated graphs. The first kind was generated using the Erdős–Rényi random graph model [3]. The generation program takes two parameters: $n$, the number of nodes, and a graph density measure. The generated graph has $n$ nodes and $m$ edges, where $m$ is determined by the density measure. In the generation process, a list of all $n(n-1)$ possible edges is formed and randomly permuted. The first $m$ edges are then taken and assigned weights drawn uniformly from a fixed interval, and they become the edges of the generated graph. In our study, we focused on low densities, since for larger densities, the number of SCCs quickly decreases to 1.
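For illustration, one possible implementation of this generator is sketched below; the weight interval [1, max_w] and the use of std::mt19937 are assumptions of the sketch, not necessarily the exact setup used in the experiments.

```cpp
#include <algorithm>
#include <cstdint>
#include <random>
#include <utility>
#include <vector>

struct WEdge { int u, v; int64_t w; };

// Erdős–Rényi-style generator: list all n(n-1) possible directed edges, shuffle them,
// keep the first m, and assign each a uniform random weight from [1, max_w].
std::vector<WEdge> erdos_renyi(int n, long long m, int64_t max_w, unsigned seed) {
    std::vector<std::pair<int, int>> all;
    for (int u = 0; u < n; ++u)
        for (int v = 0; v < n; ++v)
            if (u != v) all.push_back({u, v});

    std::mt19937 rng(seed);
    std::shuffle(all.begin(), all.end(), rng);

    std::uniform_int_distribution<int64_t> weight(1, max_w);
    std::vector<WEdge> edges;
    for (long long i = 0; i < m && i < static_cast<long long>(all.size()); ++i)
        edges.push_back({all[i].first, all[i].second, weight(rng)});
    return edges;
}
```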
For the second kind of generated graphs, we used the Barabási–Albert approach with preferential attachment [4]. This approach was designed to simulate real-world networks with heavy-tailed degree distributions and a hub-like structure. The generation procedure starts with a small initial complete undirected graph. New nodes are added one by one, and each added node is connected to a fixed number of existing nodes. The probability of an existing node being connected to the added node is proportional to its current degree. This results in a scale-free undirected graph with $n$ nodes and a number of edges linear in $n$. In the second step, we produce a directed graph from the generated undirected graph by replacing each undirected edge $\{u, v\}$ randomly with either the directed edge $(u, v)$ or $(v, u)$. In the final step, each edge is assigned a weight drawn uniformly from a fixed interval.
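The sketch below shows one way to realize this two-step construction, an undirected Barabási–Albert graph built by preferential attachment followed by a random orientation and weighting of every edge; the size of the initial clique, the weight range, and the RNG are illustrative assumptions.

```cpp
#include <cstdint>
#include <random>
#include <set>
#include <utility>
#include <vector>

struct WEdge { int u, v; int64_t w; };

// Step 1: undirected BA graph (start from a complete graph on k+1 nodes, then attach each
// new node to k existing nodes with probability proportional to their current degree).
// Step 2: orient every undirected edge uniformly at random and assign a random weight.
std::vector<WEdge> barabasi_albert_directed(int n, int k, int64_t max_w, unsigned seed) {
    std::mt19937 rng(seed);
    std::vector<std::pair<int, int>> und;     // undirected edges {u, v}
    std::vector<int> chances;                 // each node appears once per incident edge

    int init = k + 1;                         // small initial complete graph
    for (int u = 0; u < init; ++u)
        for (int v = u + 1; v < init; ++v) {
            und.push_back({u, v});
            chances.push_back(u);
            chances.push_back(v);
        }

    for (int u = init; u < n; ++u) {
        std::set<int> targets;                // k distinct neighbours, degree-biased
        while (static_cast<int>(targets.size()) < k) {
            std::uniform_int_distribution<size_t> pick(0, chances.size() - 1);
            targets.insert(chances[pick(rng)]);
        }
        for (int v : targets) {
            und.push_back({u, v});
            chances.push_back(u);
            chances.push_back(v);
        }
    }

    std::uniform_int_distribution<int> coin(0, 1);
    std::uniform_int_distribution<int64_t> weight(1, max_w);
    std::vector<WEdge> edges;
    for (auto [a, b] : und)                   // random orientation of each undirected edge
        edges.push_back(coin(rng) ? WEdge{a, b, weight(rng)} : WEdge{b, a, weight(rng)});
    return edges;
}
```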
For the real-world graphs, we used the Stanford Large Network Dataset Collection (SNAP) [20]. In particular, we concentrated on the Internet peer-to-peer networks collection of graphs modeling the Gnutella file-sharing system [21]. The nodes in these graphs represent the hosts of the Gnutella network, while the edges represent connections between the hosts: the directed edge from node $u$ to node $v$ indicates that host $u$ reports host $v$ as a neighbor. The graphs are directed and sparse, with several thousand nodes and edges. We used seven out of the nine graphs in the collection as test cases (see Table 1). The two omitted graphs were too big to be processed.
The final step is the same as before and assigns a uniform random weight to each edge. These real-world graphs allow us to evaluate the behavior of our algorithm under realistic network structures that exhibit non-random, small-world, and scale-free properties.
5.2. Algorithm Implementation and Testing System
We implemented the function SCC_Out by slightly adapting Tarjan's algorithm for finding SCCs [22,23]. Briefly, the original algorithm applies a depth-first search to find cycles in a graph, as they are the bases of the components $C_i$. In turn, an edge that is part of a cycle becomes part of the appropriate $C_i$, while the remaining edges belong to one of the sets $\mathrm{Out}(C_i)$. Consequently, the adapted algorithm returns, in linear time, a topologically sorted list of pairs $(C_i, \mathrm{Out}(C_i))$.
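The adapted Tarjan procedure itself is not reproduced here; instead, the sketch below shows the interface the rest of the algorithm relies on, assuming that a component id per node (with components numbered in topological order) is already available from any linear-time SCC routine. All types and names are illustrative.

```cpp
#include <cstdint>
#include <vector>

struct Edge { int from, to; int64_t w; };

struct Component {
    std::vector<int>  nodes;   // nodes of C_i
    std::vector<Edge> out;     // Out(C_i): edges leaving C_i
};

// Group nodes and outgoing edges by component. comp_id[v] is the index of the SCC of v,
// with components numbered so that every edge satisfies comp_id[from] <= comp_id[to].
std::vector<Component> build_scc_out(int t, const std::vector<int>& comp_id,
                                     const std::vector<Edge>& edges) {
    std::vector<Component> comps(t);
    for (int v = 0; v < static_cast<int>(comp_id.size()); ++v)
        comps[comp_id[v]].nodes.push_back(v);
    for (const Edge& e : edges)
        if (comp_id[e.from] != comp_id[e.to])      // intra-component edges stay inside C_i
            comps[comp_id[e.from]].out.push_back(e);
    return comps;
}
```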
To compute the APSP distances within each component $C_i$, we decided to use the Tree algorithm presented in [7].
The evaluated algorithms and the graph generation algorithms were implemented in C/C++ and compiled using GCC version 13.1.0 with no compilation switches. Experimental evaluations were conducted on a computer with an Apple M2 processor and 8 GB of RAM, running macOS Sequoia 15.3.1. For each combination of parameters, we generated three independent graph instances and reported the average execution time, measured as elapsed real time (in seconds). Detailed execution times for all algorithms and structural properties of all graph instances can be found in Appendix A.
5.3. Results
This section presents the empirical results of the APSP algorithm evaluations across three distinct graph types: Erdős–Rényi, Barabási–Albert, and Gnutella. We compared our SCC algorithm with Floyd–Warshall (FW) [5,6], Dijkstra [12], Hidden Paths (Hidden) [16], Uniform Paths (Uniform) [17,18], Tree [7], and the Improved Floyd–Warshall algorithm (Toroslu) [10].
Erdős–Rényi Graphs
On Erdős–Rényi graphs, the SCC algorithm consistently outperformed all other algorithms by a significant margin across both tested graph sizes. The Toroslu algorithm demonstrated competitive performance for lower edge densities, but its scalability deteriorated noticeably at higher densities, where execution times increased sharply. In contrast, SCC maintained stable and efficient runtimes even as both the graph size and the density increased. The results on Erdős–Rényi graphs are presented in Figure 2.
Barabási–Albert Graphs
In the case of Barabási–Albert graphs, Tree and SCC outperformed all other algorithms. SCC demonstrated particularly strong performance in the sparser configurations, where the graph naturally fragments into smaller strongly connected components that can be efficiently processed independently. However, as the number of edges attached to each added node increases, the largest strongly connected component begins to dominate the graph, often encompassing most of its nodes. In these denser configurations, the benefits of component decomposition diminish, and the overhead introduced by SCC becomes more pronounced. Consequently, Tree outperforms SCC in such cases. The results are summarized in Figure 3, where algorithms with excessive runtimes (Hidden and Uniform) are omitted for clarity.
Gnutella Graphs
On the Gnutella peer-to-peer network graphs, the SCC algorithm achieved the best performance on all but one graph, on which the Tree algorithm outperformed it. The results are presented in Figure 4. For clarity, algorithms with significantly higher runtimes (Hidden, Uniform, and FW) are omitted from the figure.
6. Conclusions
In this paper, we proposed a new APSP algorithm specifically designed for disconnected graph scenarios, where classical approaches are often inefficient. Our solution combines well-established shortest path techniques with graph decomposition into SCCs and applies selective inter-component relaxations. Consequently, our approach significantly reduces redundant computations and leverages the graph structure to improve runtime. A key feature of our approach is that relaxation is only performed between node pairs for which reachability is established, that is, only when the intermediate distances are finite. This selective processing significantly reduces computational overhead, particularly in sparse topologies where many node pairs are disconnected. As a result, the algorithm eliminates redundant updates on unreachable paths, leading to substantial performance gains.
We note that the algorithm does not improve the theoretical worst-case bound of $O(n^3)$. In the ill-formed case where the decomposition produces two strongly connected components, $C_1$ and $C_2$, and every vertex in $C_1$ has an outgoing edge to every vertex in $C_2$, the number of outgoing edges becomes $|C_1| \cdot |C_2|$, where $|C_i|$ denotes the number of vertices in $C_i$. Under this configuration, the number of relaxation operations can approach $n^4$. However, by applying a simple heuristic that detects when the number of outgoing edges becomes too large, the overall complexity can be reduced to $O(n^3)$. Thus, while the algorithm performs efficiently on sparse and disconnected graphs, its asymptotic upper bound in the worst case remains the same as that of the classical Floyd–Warshall algorithm.
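For concreteness, under the additional assumption that the two components each contain $n/2$ vertices, the relaxation count in this ill-formed case can be bounded as follows (a back-of-the-envelope calculation, not taken from the paper):

```latex
% Relaxations in the ill-formed two-component case, assuming |C_1| = |C_2| = n/2.
\[
  \underbrace{|C_1|}_{\text{sources } s}\cdot
  \underbrace{|C_2|}_{\text{destinations } d}\cdot
  \underbrace{|\mathrm{Out}(C_1)|}_{\text{edges relaxed per pair}}
  \;=\; \frac{n}{2}\cdot\frac{n}{2}\cdot\frac{n^2}{4}
  \;=\; \frac{n^4}{16} \;=\; \Theta(n^4).
\]
```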
The impact of this work lies in offering a practical and efficient alternative for APSP computation in domains where sparse, modular, or disconnected graphs naturally arise, such as social networks, dependency graphs, and biological networks. Future directions of research include formalizing the expected complexity in various graph models and further validating the approach on real-world datasets.