1. Introduction
Algebraic connectivity (AC; also called the spectral gap) of a graph is a fundamental property that measures how fast information diffuses throughout the graph [
1,
2]. It corresponds to the second smallest eigenvalue of the graph Laplacian matrix (the smallest eigenvalue is always zero). In many applications, it is desirable to maximize the algebraic connectivity (i.e., speed of diffusion) subject to certain constraints; this and related problems have a long history [
3,
4,
5,
6,
7,
8,
9]. For example, in communications, the “cost” of a network increases with the number of links. It is therefore desirable to have as few edges as possible, while at the same time maximizing the algebraic connectivity. This leads to a natural question:
For a fixed average degree d,
what is the graph that maximizes algebraic connectivity as the number of vertices ?
Numerous papers address various aspects of this question, see e.g., [
6,
7,
8,
9]. This is one of those situations where the best answer is elusive [
10], but a decent answer can be found relatively quickly [
3,
4]. Graphs with high algebraic connectivity are related to expander graphs, and are important in many applications [
11,
12]. In this paper, we study the algebraic connectivity of
sparse random graphs in the case when the number of edges
m scales linearly with the number of vertices
n, i.e.,
. In other words, we fix the average degree
while letting
It is well known that a random Erdos–Renyi graph (where
m edges are taken at random) needs
edges to be fully connected [
13,
14], and as such, they are poor candidates for maximizing algebraic connectivity in this sparse regime (since the AC of a disconnected graph is zero). A good candidate is a
d-regular graph for which every vertex has a degree
d. It is well known that random regular
d-graphs have an algebraic connectivity that is asymptotic to
as
with fixed
d [
15,
16,
17,
18,
19]. For integer
, this quantity is strictly positive, which assures that these graphs have good expander properties [
20,
21]. A natural question is whether one can perform better than a regular graph, but without “too much work” (in, say,
time). In this paper, we give an affirmative answer when
: we introduce a class of semi-regular random graphs (whose vertices have a degree of either
or
), which is as easy to construct as random
d-regular graphs, but which has better AC.
Another question is whether “expander-type” graphs (which we define to be graphs with AC bounded away from 0 as ) are possible when the average degree d is less than 3, with . For , the answer is yes, and it is provided by semi-regular graphs of degrees 2 and 3, whose average degree is (with . We show that the AC of such graphs is asymptotic to in the limit of small p and large n, with independent of p (the answer is no when , since any graph needs at least edges to be connected).
Our first result is on random semi-regular bipartite graphs, where each vertex has a degree of either or , and we compute their asymptotic AC in the limit . In particular, we will exhibit a family of semi-random bipartite graphs having the same average degree as random d-regular graphs, but which have a higher algebraic connectivity when .
Before stating our result, let us define what we mean by such graphs. Consider a bipartite graph with one part having
vertices of degree
, and the second part having
vertices of degree
, such that every edge is between these two parts, and no edges are within each part. Then,
. Such a graph has
vertices so that
and therefore, the average degree is
We call such a graph a
semi-regular bipartite graph. We now introduce the following random model.
Random semi-regular bipartite (RSRB) graph model: Take two bags. In the first bag, put
copies of
vertices labeled
In the second bag, put
copies of
vertices labeled
Start with an empty graph of
vertices. Then, randomly pick two vertices without replacement—one from each bag—and add an edge between them to the graph. Repeat until the bags are empty. Refer to
Figure 1a and the Matlab code in
Appendix A.
We remark that this model (and the theory below) generally allows for multiple edges. These can be eliminated through a random rewiring postprocessing step. (Given a multiple edge or a self-loop (), choose another edge at random. Then, replace edges and by edges and . Repeat until all multiple edges/loops are eliminated. This operation preserves the degree distribution and edge count). Multi-edges/loops appear to be sufficiently rare that the postprocessing step does not affect the asymptotic results in the large n limit, as confirmed via direct numerical simulations. We now state our main results for RSRB graphs.
Main Result 1. Consider a RSRB graph. In the limit , its spectrum density is asymptotic towhere δ is the Dirac-delta function andIn other words, the number of eigenvalues inside any interval is asymptotic to as Moreover, its algebraic connectivity is asymptotic to Figure 2a shows the shape of the distribution (
3) for the case
(having an average degree of
), and compares it to an numerical histogram of eigenvalues of a 1000-vertex graph for this case. Very good agreement is observed.
Note that in the case
, Formula (
3) reduces to the well-known Mckay distribution of the spectrum of a regular graph [
15]:
Moreover, Formula (
5) simplifies to the classical result
for the algebraic connectivity of a random regular graph of
d vertices (this also shows that a random regular graph and random bipartite regular graphs have the same algebraic connectivity, at least to leading order in
n).
Consider a random cubic graph
and contrast it with an RSRB graph with
. Both cases have the same average degree of
. Formula (
5) gives the asymptotic values of
for the former, and
, for the latter.
Thus, the RSRB graph is about 15% better than a random cubic graph with respect to its expander properties, while having the same number of vertices and edges. Figure 2b shows a histogram for
of 1000 randomly constructed such graphs, comparing these two cases with
. Very good agreement between numerics and asymptotics is observed in both cases.
More generally, the following table shows all possible combinations of
such that
is an integer between 3 and 8, and the corresponding value of
(
5).
All RSRB graphs with integer average degree |
d | 3 | 4 | 5 | 6 | 7 | 8 |
| 3 | 2 | 4 | 3 | 5 | 3 | 6 | 4 | 7 | 4 | 8 | 6 | 5 |
| 3 | 6 | 4 | 6 | 5 | 15 | 6 | 12 | 7 | 28 | 8 | 12 | 20 |
| 0.1715 | 0.1957 | 0.5358 | 0.5535 | 1 | 1.0890 | 1.5278 | 1.5587 | 2.1010 | 2.1435 | 2.7084 | 2.6887 | 2.6671 |
| 0.178 | 0.205 | 0.553 | 0.572 | 1.027 | 1.122 | 1.565 | 1.596 | 2.150 | 2.205 | 2.766 | 2.745 | 2.729 |
std | 0.006 | 0.006 | 0.011 | 0.010 | 0.015 | 0.017 | 0.018 | 0.018 | 0.021 | 0.020 | 0.026 | 0.022 | 0.022 |
diff % | 3.8% | 4.7% | 3.1% | 3.2% | 2.7% | 3.0% | 2.4% | 2.4% | 2.3% | 2.87% | 2.1% | 2.1% | 2.2% |
The row
is the asymptotic formula given by (
5). Row
corresponds to Monte Carlo simulations of
. It shows the average
for 200 randomly chosen RSRB graphs with
edges. The row “diff %” is
. Uniformly good agreement between asymptotics and numerics is observed.
Parameters with higher AC are shown in bold. For , the RSRB graph with has a higher algebraic connectivity than the d-regular graph. On the other hand, for , d-regular graphs excel.
It is interesting to note that for any integer
, Equation (
2) always has a solution with integers
When
d is prime, this solution is unique and is given by
,
More generally, the number of such solutions is precisely the number of Pythagorean triples of leg
d (sequence A046079 in OEIS).
The RSRB graphs above have a constraint In particular, the minimum attainable average degree of such graphs is , corresponding to We can remove this constraint by instead introducing the probability of having degree or as follows.
Random semi-regular (RSR) model: Given
, and
n, let
and let
. In the same bag, put
copies of
vertices labeled
and
copies of
vertices labeled
Create edges by drawing two vertices from the bag at random (without replacement), until only one or zero vertices are left in the bag. See
Appendix A for the Matlab code.
An example of an RSR graph is shown in
Figure 1b. Note that such a graph has an average degree of
. We have the following.
Main Result 2. Consider a random semi-regular graph. Let Let x be the smallest root of the system . Then, in the limit , the AC is given by
In general, eliminating
R from the system
is a straightforward computer algebra computation using a resultant, and yields in a sixth-degree polynomial for
x. It is too complex to present here for the general
—see
Appendix A for the Maple code. In the case
, the RSR graph has an average degree
, and
is the smallest root of
Figure 3 compares the
given by (
8) with numerical computations of
for randomly chosen
RSR graphs. Note that the numerical result approaches the asymptotic value of
as the number of edges
n is increased.
The following table gives the value of
for several choices of
, for which the average degree
, and a comparison with numerics is shown.
|
| 4 | 3 | 3 | 2 | 2 | 2 |
| 4 | 5 | 6 | 5 | 6 | 7 |
p | | 0.5 | 1/3 | 2/3 | 0.5 | 0.4 |
| 0.5359 | 0.44261 | 0.39162 | 0.3333 | 0.25352 | 0.20748 |
| 0.551 | 0.488 | 0.451 | 0.286 | 0.217 | 0.174 |
std | 0.010 | 0.020 | 0.022 | 0.062 | 0.051 | 0.045 |
diff % | 2.8% | 10% | 15% | −14% | −14% | −19% |
Here,
is as given in the Main Result 2;
is the average of 200 simulations with
As apparent from this table, the RSR model performs worse than the RSRB of average degree 4 (including 4-regular graphs). This appears to be true for any average degree
d. However, one advantage of RSR graphs is that they produce graphs with an average degree
, with
bounded away from zero as
The smallest average degree that the RSRB model can have is 2.4, corresponding to
In fact, (
8) shows that
when the average degree is
with
, and
2. Derivation of Main Results
Derivation of spectral density (3). Following [
15,
22], we use the trace method. It is based on the fact that
. Define
Then,
represents the average number of closed walks of length
s on the graph whose adjacency matrix is
A. In the limit
, the eigenvalue distribution therefore satisfies
As in [
15,
22], the derivation consists of (a) computing
and (b) inverting the integral Equation (
9) to determine
To compute
, we write
where
is the number of closed walks of length
s starting from a vertex of degree
, and
is the number for the vertices of degree
A key insight [
15] that allows for the asymptotic determination of
and
is that, locally, a large random regular graph looks like a tree because the probability of encountering a cycle of length
s is incredibly small as
(for fixed
s). The same is true of semi-regular random graphs. For the RSRB graphs, each successive level alternates between vertices of degree
and
We therefore decompose these trees as illustrated in the following diagram, for the cases
and
.
![Entropy 26 01014 i001]()
Here,
A represent the child nodes of degree
and
B child nodes of degree
. Accordingly,
and
satisfy the recursion
with
. Let
be the generating function for
and similarly for
From recursion relations (
10), the corresponding generating functions satisfy
Solving (
11), while keeping in mind that
, we find that
In principle,
can be computed from
by Taylor-expanding near the origin; the first few terms are
. In [
15], an explicit computation of
was combined with Tchebychev polynomials to compute the McKay distribution for
d-regular random graphs. Here, we use a simpler and more powerful technique using complex variables introduced in [
23].
Write
using Cauchy’s integral formula as
where the integration is around a circle
, with a sufficiently small
to avoid any branch cuts of
Using the fact that
is even, we then have
Taking
and recalling (
9), we obtain
This yields the formula for
, namely
This formula was derived in [
23] using an equivalent technique (Stieltjes inversion). Next, we compute
where
is given by (
4). It follows that
for
and
On the other hand, when
we obtain
This recovers Formula (
3) except for the delta mass at the center, which is due to the singularity there. To compute the weight of the delta mass, one can integrate the overall density and impose that
. While the exact integration of (
14) is possible, it is easier and more instructive to compute the number of zero eigenvalues directly, as follows.
Suppose that
Label the components of the eigenvector
and
, corresponding to vertices of degree
and
, respectively. A zero eigenvalue satisfies
,
and
,
Look for solutions of the form where
,
This corresponds to solving
linear equations
, for the
unknown
This is an under-determined system, since
Generically, it has
independent solutions. Therefore, the weight of the delta function at zero should be
Using (
2) yields
, which recovers the weight of the delta function in (
3). We also verified using computer algebra that this indeed agrees with the full integration of (
14), namely, that
,
, so that, indeed,
. This completes the derivation of the spectral density (
3).
Finally, another derivation of the delta weight, as pointed out by James Mingo, is to use Proposition 3.8 in [
24]. It says that if
has a singularity at
, then the associated measure
has a delta mass at
, whose weight is given by
Here,
and the mass weight is then given by
in agreement with direct computation. ▪
Derivation of AC formula for RSRB (
5). Next, we derive the formula for algebraic connectivity (
5). For
d-regular graphs, the Laplacian graph is given by
, where
I is the identity and
A is the adjacency matrix. This allows us to characterize the AC in terms of the second-largest eigenvalue
of
A:
Of course, this is not true if the graph is not regular. The trick is to
regularize the graph by adding enough self-loops to vertices until all vertices have the same degree. Each self-loop counts as a single additional edge (i.e., adding a loop to vertex
j corresponds to adding “1” to the
j-th diagonal entry of the associated adjacency matrix) and crucially, adding self-loops does not change the Laplacian of the graph.
For a
semi-regular graph, assume that
and add
loops to all vertices of degree
. This results in a
-regular graph with loops. As before, let
be the number of closed walks of size
s. Let
be the
local expansion rate, that is, the rate at which
grows:
. Then, by analogy to regular graphs, the AC is given by
. To compute
, we decompose closed loops similarly to (
11). The decomposition now has loops as illustrated below.
Correspondingly, the associated generating functions satisfy
To determine AC, it suffices to determine the growth rate of
. This growth rate is in turn determined by the locations of singularities in the associated generating functions [
25]. In particular, if
, where
p is non-integer and
analytic, then
grows with the rate
in other words,
as
The location of the singularity can be found without solving the full system, and corresponds to the smallest root of the discriminant of the system. To illustrate this, consider the Catalan numbers
, whose generating function satisfies
Explicitly,
, and has a singularity at
, corresponding to the zero of the discriminant of the quadratic
It follows that
as
(indeed, more precise asymptotics
can also be derived from its generating function, although here, we only need the growth rate).
Let us now return to system (
15). Eliminating
B, we find that
A satisfies a quadratic:
Setting its discriminant to zero, we obtain a quartic polynomial for
x, namely
The expansion rate is then
, so that
. Substituting
into (
16), one obtains a fourth-degree polynomial for
:
Shift this polynomial to eliminate the
term by substituting
Almost like magic, it simultaneously eliminates the
term, resulting in a quadratic for
:
Solving for
y using the quadratic formula yields the solution (
5). ▪
Derivation of Result 2. Again, we compute the average number of closed walks of length
s,
As before, each vertex looks like a tree locally. Each child in this tree has the probability
of having degree
, and probability
p of having degree
As in the derivation of (
5), assume that
, and add
self-loops to each vertex of degree
The walks on these trees then decompose according to the following diagram (here given in the case
).
The corresponding generating function
solves the equations:
Eliminating
A and
B yields a cubic
given by (
7). The AC is then given by
, where
x is the singularity of
that is closest to the origin. By the implicit function theorem, this happens when
In other words,
x satisfies
. ▪
3. Discussion and Open Problems
We computed the asymptotics of the AC for two models of semi-regular random graphs: either RSR or RSRB models. While the RSR model is shown to have a smaller AC than a random d-regular graph with the same average degree, we found that the RSRB model has a higher AC than d-regular when .
Ramunajan expander graphs are defined as being
d-regular graphs [
11,
12] whose algebraic connectivity is at least
. As illustrated in
Figure 2b (see also Figure 9 in [
11] and the related table there), a random
d-regular graph has a decent chance of being a Ramunajan graph (around 66% when
as
according to simulations in [
11]). We can extend the definition of Ramunajan expander graphs as being any graph of
average degree
d whose AC is at least
When
, an RSRB
graph is actually a Ramunajan graph with very high probability as
This is illustrated in
Figure 2b, where
. Out of 1000 of such randomly chosen graphs, only
one had an AC less than
.
We relied on careful but formal power series computations. While the results were validated using numerics, a rigorous analysis of the results in this paper is an open problem. Related to this, it is an open question to estimate the accuracy of asymptotics as a function of
. In particular, the accuracy appears to be significantly degraded for the RSR graphs when
and
are far from each other. Take, for example, the case
. Intriguingly, the distribution for AC appears to have multiple peaks and does not necessarily concentrate around the mean, as illustrated in
Figure 4 (note that this does not appear to be the case for RSRB graphs as
Figure 2b illustrates). We pose this as an open problem.
Challenge 1. Describe the full distribution of the AC, particularly for RSR graphs. Explain why it can be multi-peaked when .
Another class of graphs having high algebraic connectivity are the complete bipartite graphs
, which have AC
(with
). The average degree of such a graph is asymptotic to
as
, so that
. This is in contrast to
d-regular random graphs (
). As was noted in [
7],
have higher AC than the
d-regular graphs (for the same asymptotical number of edges) provided that
(since, in which case, this is the
case) while the converse is true when
While complete bipartite graphs have a relatively high AC when
, they are also fragile, in the sense that removing any single edge causes the AC to decrease by one. Random graphs are more robust with respect to edge deletions as they seem to have some redundancy built-in.
Consider the limit of large
for the RSRB model. From (
2) and (
5), one obtains that
and
as
This is one less than the AC of the complete bipartite graph
(which also has the average degree
d as
. More generally, let
be as in (
5) and let
, where
is the average degree. It can be shown that
for any
when
. This discussion suggests the following question.
Challenge 2. Find a family of random graphs which has a higher algebraic connectivity than d-regular random graphs when the average degree . Explore if a more complex degree distribution (e.g., tri-regular) can be better than semi-regular for, say, .
There are several notions of graph connectivity, with AC being just one. Another notion is using the average of reliability polynomial [
26,
27] to compute the effect of edge deletions on graph connectivity. While the full reliability polynomial requires exponential time to compute, we performed the following simple experiment to quickly measure graph robustness. Start with a random 6-regular graph on 500 vertices (i.e., containing 1500 edges). Then, delete edges at random one by one until the graph becomes disconnected. Over 100 simulations, it took on average 460 edge deletions until the graph became disconnected (and in every instance, disconnectivity first occurred when a single vertex lost all of its edges). We then repeated this experiment with
RSRB graph with 500 vertices (which also contains 1500 edges, but has higher AC than a 6-regular random graph). It took on average 325 edge deletions until the disconnection was achieved. These preliminary experiments indicate that
d-regular graphs are more reliable with respect to edge deletions than semi-regular bipartite graphs, even though the latter might have a higher AC. We state this as a conjecture.
Conjecture 1. For a given average degree d, the most reliable graph is a d-regular graph.
Generally speaking, cubic graphs of high girth are good candidates for high-AC graphs [
7]. Since RSRB can have a higher AC, they are also natural candidates for searching high-girth graphs. We pose this as a challenge.
Challenge 3. For a fixed average degree d and fixed number of vertices n, find graphs (not necessarily d-regular) with the highest possible girth.
A
d-regular graph with the smallest possible
n for a given girth
g is called a cage. The literature for looking for high-girth graphs is extensive; see [
28,
29] for an overview. There are powerful methods to perform computer searches for high-girth
d-regular graphs [
30,
31]. As an example, consider
RSRB graphs, whose average degree is
. Do
semi-regular graphs give better cages than cubic graphs with the same number of vertices? The answer is, it depends. Note that that
graphs can be obtained from 6-regular graphs by inserting a vertex in the middle of each edge; conversely, any
graph yields a 6-regular graph by vertex contraction: just delete all 2-degree vertices. According to [
28,
32], a 6-regular 5-cage graph has 40 vertices. By vertex insertion, this yields a girth 10
semi-regular graph with 80 vertices. On the other hand, there are three cubic graphs having girth 10 and only 70 vertices [
28]. So, the cubic is better for girth 10.
Similarly, a 6-regular cage of girth 6 has 62 vertices [
28], which induces a (2,6) graph of girth 12 and 124 vertices. On the other hand, a cubic graph of girth 12 has at least 126 vertices [
28,
33]. So, a (2,6) semi-regular graph is superior for girth 12.
The techniques in this paper are rather general, and can be used to derive algebraic connectivity for many other random graph families. As an example, consider the “small-world”-type graph illustrated in
Figure 1c. Start with a cycle of
n vertices, then connect all odd-numbered vertices to each other at random. See
Appendix A for the Matlab code that generates such a graph. The resulting graph has an average degree of 2.5. To compute its AC, as in the derivation of (
5), we add a self-loop to even-numbered vertices so that each vertex has a degree of 3. We leave it as an exercise to the reader that the average walk-counting sequence
has the generating function that satisfies
Looking at the radius of convergence of the generating functions as before, we find that in the limit
, its AC is asymptotic to the smallest root of the polynomial
explicitly given by
This is validated using Monte Carlo simulations; the average AC of 1000 of such random graphs with
vertices is 0.0557 (std = 0.0029), i.e., a difference of about 7%. This is significantly higher than an RSR graph with
, which also has an average degree of 2.5, but whose AC is