1. Introduction
Graph fragmentation is the removal of the smallest possible subset of a graph’s vertices such that all the connected components in the remaining graph are of small size. Equivalently, we may consider the largest induced subgraph in which none of its connected components is larger than that size. This problem has been studied in the context of network resilience in the field of complex networks and is known as network dismantling. Additionally, it has been studied in the combinatorics community, where it is known as graph fragmentation.
In recent years, a few heuristic algorithms have been developed to handle the dismantling problem in various types of networks [
1]. Providing bounds for the size of the set to be removed may help evaluate the performance of these algorithms.
In this paper, we prove bounds on the graph fragmentation problem for several classes of graphs and discuss hard classes of instances to be fragmented.
A fundamental result by Janson and Thomason [
2] considers the dependence of the number of vertices to be removed on the size of the remaining connected components in the context of random graphs. They show that the asymptotic fraction of the vertices to be removed in order to fragment the graph
G into components of size at most
k is essentially the same for all values of
k that satisfy both
and
, if
G is a sparse random graph from the Erdős–Rényi model, henceforth denoted
, or a random regular graph, denoted as
.
Edwards and Farr [
3,
4] addressed this question for some classes of graphs. They parameterized it by
for the fraction of the vertices to be removed and an upper bound
C on the number of vertices in any connected component after removal. That is, a graph
is
-fragmentable if there exists
with
such that every connected component of
S has at most
C vertices. The set
S is called the
fragmenting set. A class of graphs
is
-fragmentable if there is an integer
C such that any member of
is
-fragmentable. The
coefficient of fragmentability is defined as
The general strategy taken in [
3,
4] was to remove vertices until a planar graph remains and then fragment it using separators. They were able to achieve the following bounds for
, the class of graphs with maximum degree
d, with
:
Similar bounds hold for the class of connected graphs of average degree d for a real , as well as for the class of graphs of average degree d for a real .
In [
5], the same authors improved the upper bounds by replacing the fraction
with a function
, defined as follows: Set
,
, and, for any integer
,
Then, they extended the definition to real numbers by linear interpolation—that is, for any integer
and real number
r satisfying
, set
. The following can then be shown:
for all ;
;
when d is an integer, where is the alternating factorial function given by .
In this paper, we are interested in the case where the size of the remaining connected components may depend on n, and generally, our setting is that of random graphs. Hence we define the -fragmentability of a graph G. A vertex set is said to f-fragment a graph if the induced graph does not contain any component of size at least f. A graph G is -fragmentable if there exists a subset S of the vertices of V with , whose removal leaves a graph with no component having more than f vertices.
Definition 1. Let be a sequence of probability distributions over graphs, and let be a randomly chosen graph from . The fragmentation coefficient of Γ
, is For the random graph models, we consider
and
. Any probability strictly between zero and one can be taken instead of
in the definition above. This was shown in [
6] for the decycling number of
and in Lemma 1 for
.
Lemma 1. For , the constant in Definition 1 has an asymptotically negligible effect on and can be replaced with any constant in the interval .
Proof. Let
, and let
be the size of the fragmentation set of
G. We show that for every
, there is a constant
C and a segment of integers
I such that
and
We consider the vertex-exposure martingale (see [
7]). Note that if
differs from
G in only one vertex, then
. Hence, the vertex Lipschitz condition holds, and by Azuma’s inequality:
□
Note that while, for large enough n, the size of the fragmentation set is close to its mean with probability tending to one, this lemma does not provide anything regarding the mean . In particular, for small values of c, we do not know the expected size of the fragmentation set . Additionally, we do not gain any insight regarding the dependency of s in n, and we only know it in some cases. In fact, we generally cannot answer the question of whether the limit exists. Hence, we define
Note that
and
are related to the typical behavior of graphs, whereas
is related to the external behavior of graphs in the family. In these studies, Edwards and Farr [
3,
4] used
d-regular graphs with high girth to establish their lower bound for
. We show below that
is also a lower bound for the fragmentation coefficient of a random
d-regular graph. We approach the problem by considering the vertex expansion properties of random regular graphs.
Haxell et al. [
6] studied a closely related parameter called the decycling number of a graph. They answered a question by Edwards and Farr and showed that
by proving the following lower bound for every
:
We show below that if
d is large enough, then a line graph of a random
d-regular graph actually slightly improves that bound, meaning that it is asymptotically the hardest graph to be fragmented up to a constant factor (in the number of remaining vertices), in the sense that to fragment it, we have to remove almost all the vertices since its fragmentation coefficient satisfies
where
n is the order of the line graph and
d is the degree of its vertices. It should be noted that in [
6,
8], it was proven that decycling and fragmentation of sparse random graphs are almost the same, in the sense that asymptotically, to decycle a random graph costs almost no more than to fragment. This property is used in our study as well. In the same paper, Haxell et al. [
6] showed that the lower bound for the decycling number of random
d-regular graphs for large
d is
.
Bau et al. [
8] gave bounds for the decycling number
of random
d-regular graphs. The upper bounds are based on an algorithmic approach to decycling, and the lower bounds are based on the combinatorial counting of forests. A related question is the computational complexity of finding a minimum fragmenting set. It is known that finding the minimum fragmenting set for an
-fragmentable graph is equivalent to the maximum independent set problem, which is known to be NP-hard. In fact, it has been shown that even approximating the size of the maximum independent set (which is at least
for any
-fragmentable graph) up to a constant factor is Poly-APX-complete [
9]. Deciding whether a graph is
-fragmentable corresponds to the vertex separator problem, which is also known to be NP-hard. It is, therefore, reasonable to assume that many instances of the fragmentation problem are computationally hard. The complexity of finding a shattering set in the case of random graphs remains unknown. In some cases (e.g., random 3-regular graphs), asymptotically optimal solutions are known to be computable in polynomial time.
Organization
In the second section, we give all the necessary background for our work. This background consists of basic notations of asymptotics and graph theory, definitions of the two models of random graphs studied here, and basic definitions from spectral graph theory. In
Section 3.1, we prove some bounds on the fragmenting set of general graphs, and in
Section 3.2, we find general lower and upper bounds for two models of random graphs:
where
and
is a constant and
. In
Section 4, we improve the lower bound for the size of the fragmenting set of
random graphs, where
, and
c is a large constant.
Section 5 deals with the fragmenting set of random
d-regular graphs. In
Section 5.1, we show that asymptotically, the hardest regular graph to be fragmented is a line graph of a random regular graph. In
Section 5.2, we show two lower bounds based on spectral graph theory.
2. Theoretical Background and Notations
We denote by the set of n vertices.
We denote the
set of neighbors of some set
as
where
. The
boundary of some set
is denoted by
. A set
is an
independent set if no two of its vertices are adjacent. A
maximum independent set is an independent set of the largest possible cardinality in
G, and its size is denoted by
. The
chromatic number is the minimum number of colors needed to properly color a graph, where no pair of neighbors is colored by the same color, denoted by
.
A set is a dominating set if every vertex of G outside of S has a neighbor in S. A minimum dominating set is a dominating set of the smallest possible cardinality in G, and is denoted by .
An induced subgraph of a graph G is a subset of vertices together with every edge whose endpoints are both in U and is denoted by , we say that such a U spans . For a graph and a set of vertices , we denote by the induced graph . A subgraph of a graph spanned by a set of edges is a graph with a set of edges , and its set of vertices includes the endpoints of every edge in and possibly some isolated vertices.
Given a graph , we denote by P a partition of into disjoint sets of vertices. The graph induced by a partition consists of connected components that are induced subgraphs of the sets in the partition.
The line graph of a graph G is a graph, denoted by , whose set of vertices is the set of edges of G, and every two vertices of are connected by an edge if and only if their respective edges share a vertex in common as edges in G.
The density of a graph is defined as .
Since every graph
G satisfies
, where
is the maximum vertex degree in
G [
10],
Conclusion 1. it follows that every d-regular graph has an independent set of order at least .
Together with the known inequality
, it follows that
We use the following notation for random graphs. Given a real number , the binomial model of random graphs is a probability space, denoted by , consisting of all graphs on the vertex set , where each graph G with edges occurs with probability . We define a random graph in this model as a graph randomly chosen from with a distribution. We denote this distribution by . We say that is distributed according to and denote it by .
The model of random d-regular graphs is the probability space, denoted by , consisting of all simple d-regular graphs with the vertex set . A random d-regular graph is a graph chosen from with a uniform distribution. We denote this distribution by , and we say that is distributed according to and denote it by .
A random graph is sparse if with high probability, .
Note that for positive constants d and c, and are sparse graphs.
The following theorems were derived by A. Frieze et al. [
11,
12], for the size of the maximum independent set.
Theorem 1. Let and ϵ be fixed. Suppose that for some sufficiently large fixed constant . Then,with high probability [11]. Theorem 2. Let be fixed. There exists a constant such that if is a constant, thenwith high probability [12]. Denote by
A the adjacency matrix,
and by
D the diagonal matrix, whose entries are
, with
being the Kronecker delta. Note that for any
k-regular graph
G,
, where
I is the identity matrix.
The Laplacian matrix of a graph
G is defined as
, and the normalized Laplacian is
, which translates to
Note 1. The normalized adjacency matrix for d-regular graphs, denoted by , is , and its normalized Laplacian is .
We denote the eigenvalues of by and the eigenvalues of by .
The following theorems concern the second-largest eigenvalue of .
Theorem 3. For every , any d-regular graph of order n satisfies when [13]. Theorem 4. For every constant , a random d-regular graph of order n satisfies with high probability that [13]. Theorem 5. Let G be a d-regular graph of order n, and let . Then,where is the number of edges in a subgraph induced by . [14]. Proposition 1. Let X be an independent set of a d-regular graph G of order n. Then, .
Proof. X is an independent set; that is,
. Thus, according to [Corollary 9.2.6] [
7], we obtain
Since this holds for all independent sets, the last equation also holds for
. □
Let
G be a random
d-regular graph of order
n. The Vertex Expansion Ratio of
G, denoted by
, is defined as
We also use another version of the vertex expansion:
We have the following theorem:
Theorem 6. Let be a fixed integer and let G be a random d-regular graph. Then, for every , there exists such that with high probability [15], 4. Fragmentation of
In the previous section, we showed that if
for constant
c, or
for constant
d, then, with high probability, a fragmentation coefficient
is bounded between
Naturally, we are interested in closing this gap.
In this section, we find an improved lower bound for the size of the fragmenting set in the Erdős–Rényi model, which is valid when c is a large constant. We show that this lower bound is . As a consequence, if for a sufficiently large but constant c, then, with high probability, the order of the graph remaining after fragmentation is , where refers to a function that approaches zero as both n and c approach infinity. Hence, surprisingly, if one wants to remove vertices from until all connected components are of size , one should not expect to do much better than leaving all components of size one.
Formally we want to prove the following theorem.
Theorem 12. Let , where and c is a sufficiently large constant. With high probability, the order of the graph remaining after the removal of a fragmenting set is at most .
We have divided the proof into several lemmas.
Note 2. From now until the end of this section, fix c to be a sufficiently large constant, and let p be .
Lemma 3. Let U be the set of vertices remaining after optimal fragmentation. Then, with high probability, in order to make unicyclic, we need to remove at most vertices.
Proof. This follows from the proof of Theorem 11 above, since by definition, is composed of connected components of size . □
For the rest of this section, let m be the number of vertices remaining after optimal fragmentation. Our goal is to bound m from above.
Lemma 4. Let be the event in which a subset K of of order k spans exactly k edges, and let be its characteristic random variable, i.e., Let be a random variable that counts the number of sets of order k that span exactly k edges. Then, for every , if , one has .
Proof. Recall that
. Then,
Now,
But according to Theorem 1, for a sufficiently large
c,
which proves that for
, one has
. □
Lemma 5. Let be the event in which a subset K of order k of a random graph spans exactly i edges, where , and let be a random variable that counts the number of such sets. That is, , whereLet be a random variable that counts the number of subsets of G of order k with at most k edges. Then, for . Proof. We fix
k and omit it from the notation.
and then the expectation of
X is
Denote the summand
i in Equation (
3) by
. Thus, it is enough to show that for every
, one has
for some
. Consider the quotient of
and
for
:
Now, we aim to show that this quotient is less than 1:
The second inequality holds because by assumption,
.
Next,
and the last inequality holds for sufficiently large values of
n.
Denote by
. By induction, we have that for every
,
, where
Therefore,
We already showed that
, and it is clear that
converges to some finite constant when
because
when
. □
Proof of Theorem 12. Recall that
U is the set remaining after optimal fragmentation and
m denotes its size. In order to turn
into a tree or a unicyclic graph, one needs to remove from
U at most
vertices by Lemma 3. Denote the order of the graph remaining after this stage by
k; that is,
. Now, by Lemma 5, if
, then with high probability, there is no such set of order
k that contains at most one cycle. This implies that necessarily
and
since
is linear in
n, and this completes the proof of Theorem 12. □
Conclusion 3. For a sufficiently large c, with high probability, the order of the fragmentation coefficient satisfies where is a vanishing term as both n and c approach infinity. Remark 2. According to Theorem 1, for a sufficiently large c, with high probability. Thus, the order of the optimal fragmenting set of a random graph satisfieswith high probability.