Asymptotics of Closeness Centralities of Graphs

Santiago Frias; Adriana Galindo Silva; Bryan Romero; Darren A. Narayan

doi:10.3390/math13233812

,

and

¹

Department of Mathematics, Michigan State University, East Lansing, MI 48825, USA

²

Mathematics and Statistics Department, Sonoma State University, Rohnert Park, CA 94928, USA

³

Department of Mathematics, University of Massachusetts Boston, Boston, MA 02125, USA

⁴

School of Mathematics and Statistics, Rochester Institute of Technology, Rochester, NY 14623, USA

Mathematics2025, 13(23), 3812;https://doi.org/10.3390/math13233812

This article belongs to the Special Issue Graph Theory and Applications, 3rd Edition

Version Notes

Order Reprints

Abstract

Given a connected graph G with n vertices, the distance between two vertices is the number of edges in a shortest path connecting them. The sum of the distances in a graph G from a vertex v to all other vertices is denoted by

S D_{G} (v)

. The closeness centrality of a vertex in a graph was defined by Bavelas to be

C_{C} (v) = \frac{n - 1}{S D_{G} (v)}

and the closeness centrality of G is

C_{C} (G) = \sum_{v \in G} \frac{n - 1}{S D_{G} (v)}

. We consider the asymptotic limit of

C_{C} (G)

as the number of vertices tends to infinity and provide an elegant and insightful proof of a 2025 result by Britz, Hu, Islam, and Tang,

lim_{n \to \infty} C_{C} (P_{n}) = π

, using uniform convergence and Riemann sums. We applied the same technique for the union of a cycle

C_{m}

and path

P_{n}

and the union of a path and a complete graph. We prove that of all graphs, paths have the minimum closeness centrality. Next we show for any

c \in [π, \infty)

, there exists a sequence of graphs

{G_{n}}

such that

lim_{n \to \infty} C_{C} (G_{n}) = c

. In addition, we investigate the mean distance of a graph,

\bar{l} (G) = \frac{1}{n (n - 1)} \sum_{v \in V (G)} S D (v)

and the normalized closeness centrality,

{\bar{C}}_{C} (G) = \frac{1}{n} C_{C} (G)

. We verify a conjecture of Britz, Hu, Islam, and Tang that the set of products

{\bar{l} (G) {\bar{C}}_{C} (G) : G is finite and connected}

is dense in

[1, 2)

.

Keywords:

graph distance; closeness centrality; asymptotics

MSC:

05C12; 05C09

1. Introduction

Given a graph G, the distance between two vertices is the number of edges in a shortest path connecting them. We consider the sum of the distances from a given vertex to all other vertices. The sum of the distances in a graph G from a vertex v will be denoted by

S D_{G} (v)

or simply

S D (v)

when the graph G is clear. This measure is equivalent to the transmission of a vertex introduced by Handa []. As noted in [], motivation for this metric comes from a variation of the famous Traveling Salesman Problem where the salesman must return to the starting point v after each delivery. The length of the salesman’s tour will be

2 S D_{G} (v)

. This is related to the closeness centrality of a vertex or a graph, defined by Bavelas in 1950 []. The closeness centrality of a vertex v in a graph is

C_{C} (v) = \frac{n - 1}{S D_{G} (v)}

. Then

C_{C} (G) = \sum_{v \in V (G)} \frac{n - 1}{S D_{G} (v)}

.

Closeness centrality has been used to identify important individuals in social networks [] and has been used to analyze the impact on coauthorship networks []. Asymptotic behavior for graph centrality properties have been studied for decades. A practical application of this asymptotic analysis is to identify trends that emerge as networks expand over time. In 2000, Barrat and Weigt [] investigated asymptotic properties of small-world network models. In 2013 Ek, VerSchneider, and Narayan [] determined the asymptotic behavior of the global efficiency of a path. In this paper we investigate the asymptotic behavior of

C_{C} (G)

.

We present a new approach for determining the asymptotic behavior of

C_{C} (G)

for various families of graphs. We recast the sum of the distances as a Riemann sum and then replace the discrete values with a continuous function. Then we determine the asymptotic value of the closeness centrality by computing the definite integral.

We present an elegant and more powerful proof of a result of Britz, Hu, Islam, and Tang [], showing that

lim_{n \to \infty}

C_{C} (P_{n}) = π

. In addition, we resolve a conjecture from their paper, showing that the numbers

\bar{l}

C_{C} (G)

for all connected graphs G form a dense subset of the interval

[1, 2)

. We also establish a universal lower bound for

C_{C} (G)

and show that this bound is tight when G is a path.

2. Asymptotics of Closeness Centralities of a Path

We provide an alternative proof for

lim_{n \to \infty}

C_{C} (P_{n}) = π

in Lemma 2. We start by presenting a combinatorial formula for

S D

values for vertices in a path in Lemma 1.

Lemma 1.

Let

G = P_{n}

with vertices

v_{1}, v_{2}, \dots, v_{n}

. Then

S D (v_{j}) = (\binom{j}{2}) + (\binom{n - j + 1}{2})

.

Proof.

Consider a vertex

v_{i}

. The distances from

v_{j}

to each of the vertices

v_{j - 1}, v_{j - 2}, \dots, v_{1}

are

1, 2, 3, \dots, j - 1

, respectively, which sum to

(\binom{j}{2})

. The distances from

v_{j}

to each of the vertices

v_{j + 1}, v_{j + 2}, \dots, v_{n}

is

1, 2, 3, \dots, n - j

, respectively, which sum to

(\binom{n - j + 1}{2})

. □

We next restate a lemma of Britz, Hu, Islam, and Tang []. In their proof they used approximations through lower and upper bounds which converge asymptotically as n grows large. We provide an alternative proof using a novel technique. Starting with the formula from Lemma 1 we let

S_{n} = \sum_{j = 1}^{n} \frac{n}{S D (v_{j})} = \frac{n}{n - 1} C_{C} (P_{n})

. Then we can view

S_{n}

as a Riemann sum over the partition

{\frac{j}{n}}_{j = 1}^{n}

. Then after carefully changing this to a sum over a continuous domain we can calculate

lim_{n \to \infty}

C_{C} (P_{n})

using a definite integral. We obtain the same stunning result from [] that

lim_{n \to \infty} C_{C} (P_{n}) = π

.

Lemma 2.

lim_{n \to \infty} C_{C} (P_{n}) = π

Proof.

Let

x_{j} = \frac{j}{n}

and we have

\begin{matrix} S_{n} & = \sum_{j = 1}^{n} (\frac{1}{n}) \frac{n^{2}}{j^{2} - j - j n + \frac{n (n + 1)}{2}} \\ = \sum_{j = 1}^{n} (\frac{1}{n}) \frac{1}{(j^{2} - j - j n + \frac{n (n + 1)}{2}) \frac{1}{n^{2}}} \\ = \sum_{j = 1}^{n} (\frac{1}{n}) \frac{1}{x_{j}^{2} - \frac{n + 1}{n} x_{j} + \frac{n + 1}{2 n}} . \end{matrix}

Define a sequence of functions

{g_{n}}

as

g_{n} : [0, 1] \to R

where

g_{n} (x) = \frac{1}{x^{2} - \frac{n + 1}{n} x + \frac{n + 1}{2 n}}

for each

n \in N

. Now define function

g : [0, 1] \to R

,

g (x) = \frac{1}{x^{2} - x + \frac{1}{2}}

and we claim that sequence

{g_{n}}

uniformly converges to g (see Appendix A Lemma A1). Returning to the sum, we have

S_{n} = \sum_{j = 1}^{n} g_{n} (x_{j}) \frac{1}{n} .

Let

ϵ > 0

and by uniform convergence there exists

N > 0

such that for all

n > N

, we have

| g (x) - g_{n} (x) | < ϵ

for each

x \in [0, 1]

. Let

n > N

and we have

\begin{matrix} |\sum_{j = 1}^{n} \frac{1}{n} (g (x_{j}) - g_{n} (x_{j}))| & \leq \sum_{j = 1}^{n} \frac{1}{n} | g (x_{j}) - g_{n} (x_{j}) | \\ < \sum_{j = 1}^{n} \frac{1}{n} ϵ \\ = ϵ . \end{matrix}

Thus

lim_{n \to \infty} \sum_{j = 1}^{n} \frac{1}{n} (g (x_{j}) - g_{n} (x_{j})) = 0 \Rightarrow lim_{n \to \infty} \sum_{j = 1}^{n} g_{n} (x_{j}) \frac{1}{n} = lim_{n \to \infty} \sum_{j = 1}^{n} g (x_{j}) \frac{1}{n}

. Note that the limit on the right is of the right-hand Riemann sum of the function g over the interval

[0, 1]

with sequence of partitions

p_{n} = {\frac{j}{n}}_{j = 1}^{n}

. Since g is Riemann integrable, we have that

\begin{matrix} lim_{n \to \infty} \frac{n}{n - 1} C_{C} (P_{n}) & = lim_{n \to \infty} S_{n} \\ = lim_{n \to \infty} \sum_{j = 1}^{n} g (x_{n}) \frac{1}{n} \\ = \int_{0}^{1} \frac{1}{x^{2} - x + \frac{1}{2}} d x \\ = π . \end{matrix}

□

Lower Bound for Closeness Centralities

We will show that the lower bound for the closeness centrality of a graph is

\frac{n π}{n + 2}

, where n is the number of vertices. We begin with trees. The technique will be as follows. We start with a tree with a vertex v of degree at least three. Then we remove one of the branches and append it to one of the other branches. Informally, we are ’flattening’ a tree, making it more like a path. This is illustrated in Figure 1.

Figure 1. Flattening a tree: Changes to the

S D

values of the moved vertices perfectly cancel out.

Surprisingly the negative

S D (v)

values perfectly cancel out. To see this consider the change in

S D_{T}

and

S D_{W}

values for the vertices

v_{j}

and

v_{s + 2 - j}

. We note that the only distances that may change involve the vertices

v_{s + 2}, v_{s + 3}, \dots, v_{s + l + 1}

.

For

1 \leq j \leq ⌊ (s + 1) / 2 ⌋

, we have

S D_{T} (v_{j}) = \sum_{i = 1}^{l} (i + j - 1)

and

S D_{W} (v_{j}) = \sum_{i = 2}^{l + 1} (s + i - j)

. For

1 \leq j \leq ⌊ (s + 1) / 2 ⌋

, we have

S D_{T} (v_{s + 2 - j}) = \sum_{i = 2}^{l + 1} (s + i - j)

and

S D_{W} (v_{s + 2 - j}) \sum_{i = 1}^{l} (i + j - 1)

.

Hence the negative changes in

S D

values

v_{j}

where

⌊ (s + 1) / 2 ⌋ + 1 \leq j \leq s + 1

cancel out with the postive changes in

S D

values

v_{j}

where

1 \leq j \leq ⌊ (s + 1) / 2 ⌋

.

For the remaining vertices the change in

S D (v)

values will be positive. Hence sum of the closeness centrality values will decrease.

Lemma 3.

Let T be a tree with a vertex

v_{1}

of degree at least three with two pendant paths

P_{s}

and

P_{t}

. Let

V (P_{s}) = v_{1}, v_{2}, \dots, v_{s}

and

E (P_{s}) = {v_{i} v_{i + 1} | 1 \leq i \leq s - 1}

and let

V (P_{t}) =

v_{s + 1}, v_{s + 2}, \dots, v_{s + t}

and

E (P_{t}) = {v_{i} u_{i + 1} | s + 1 \leq i \leq s + t - 1}

. Let W be a tree where

V (W) = V (T)

and

E (W) = {v_{i} v_{i + 1} | 1 \leq i \leq s - 1} \cup {v_{i + 1} u_{1}} \cup {v_{i} v_{i + 1} | s + 1 \leq i \leq s + t - 1}

. Then

C_{C} (W) \leq C_{C} (T)

.

Proof.

We will proceed with

s \geq 2

. Then

S D_{T} (v_{i}) = (\binom{i + t}{2}) + (\binom{s + 1 - i}{2}) + i

when

1 \leq i \leq s

, and

S D_{T} (v_{i}) = (\binom{s + t + 1 - i}{2}) + (\binom{i}{2})

when

s + 1 \leq i \leq s + t

. For W, we have

S D_{W} (v_{i}) = (\binom{i}{2}) + (\binom{s + t + 1 - i}{2})

for

1 \leq i \leq s + t

.

We examine the differences between the

S D

values of the same vertices between trees T and W. We consider two cases, first where the vertices are on the path with S vertices and then where the vertices are on the path with T vertices. We consider two cases.

Case 1.

Let

1 \leq i \leq s

S D_{W} (v_{i}) - S D_{T} (v_{i}) = (\binom{i + 1}{2}) + (\binom{s + t + 1 - i}{2}) - ((\binom{i + t}{2}) + (\binom{s + 1 - i}{2}) + i) = t - 2 i t + s t

Subcase 1 (a): When s is even: $t - 2 (\frac{s}{2} + k) t + s t = t - 2 k t$
$t - 2 (\frac{s}{2} - (k - 1)) t + s t = 2 k t - t S D_{T} (v_{i}) = - S D_{T} (v_{s + 1 - i})$
Subcase 1 (b): When s is odd:
$t - 2 (\frac{s + 1}{2} + k) t + s t = - 2 k t$
$t - 2 (\frac{s + 1}{2} - k) t + s t = 2 k t$
$S D_{T} (v_{i}) = - S D_{T} (v_{s + 1 - i})$
and when $k = 0$ , $S D_{T} (v_{i}) = S D_{T} (v_{s + 1 - i})$ .

Case 2.

Let

s + 1 \leq i \leq s + t

(\binom{i + 1}{2}) + (\binom{s + t + 1 - i}{2}) - ((\binom{s + t + 1 - i}{2}) + (\binom{i}{2}) + i - s + 1) = s - 1

. We note that for all vertices

v \in V (G) - V (T)

that

S D_{T} (v) \leq S D_{W} (v)

. □

We note that the differences

S D_{W} (v_{i}) - S D_{T} (v_{i})

perfectly cancel out with

S D_{W} (v_{s + 1 - i})

−

S D_{T} (v_{s + i - i})

. However we need to show that the reciprocals of the

S D

values

C_{C_{W}} (v_{i}) - C_{C_{T}} (v_{i})

overcompensate for

C_{C_{W}} (v_{s + 1 - i}) - C_{C_{T}} (v_{s + i - i})

. To do this we apply a basic result from number theory that we restate in our next lemma.

Lemma 4.

For positive integers

a, x

, and k,

(\frac{1}{a} - \frac{1}{a + x}) + (\frac{1}{a + x + k} - \frac{1}{a + k}) > 0

.

Proof.

\frac{x}{a (a + x)} > \frac{x}{(a + k) (a + x + k)}

\Leftrightarrow \frac{x}{a (a + x)} + \frac{- x}{(a + k) (a + x + k)} > 0

\Leftrightarrow \frac{a + x - a}{a (a + x)} + \frac{a + k - (a + x + k)}{(a + k) (a + x + k)} > 0

\Leftrightarrow (\frac{1}{a} - \frac{1}{a + x}) + (\frac{1}{a + x + k} - \frac{1}{a + k}) > 0 .

□

Lemma 5.

For any tree

T_{n}

with n vertices,

C_{C} (T_{n}) \geq C_{C} (P_{n})

.

Proof.

If

T_{n}

is a path, then we are done. If not we combine two pendant paths into a single path using Lemma 3. Iterating this process will result in a path with a lower closeness centrality than the original tree. □

Theorem 1.

For any graph G with n vertices,

C_{C} (G) \geq C_{C} (P_{n})

.

Proof.

Given a graph G a minimum distance spanning tree

T_{n}

can be obtained using Dijkstra’s algorithm []. Here

d_{G} (v_{i}, v_{j}) \leq d_{T} (v_{i}, v_{j})

for all pairs of vertices

(v_{i}, v_{j})

. Hence

\frac{1}{d_{T_{n}} (v_{i}, v_{j})} \leq \frac{1}{d_{G} (v_{i}, v_{j})}

for all pairs of vertices

(v_{i}, v_{j})

. This implies

C C (G) \geq C C (T_{n})

. Combining this with Lemmas 3 and 5 we have

C_{C} (G) \geq C_{C} (T_{n}) \geq C_{C} (P_{n})

. □

3. Asymptotics

We will generalize the method used to prove Lemma 2 by replacing the sum of

S D

values with a Riemann sum. We do this by extending the SD functions to a sequence of continuous functions that uniformly converge, and then compute an integral of the limit. We next provide tools from analysis, which will be useful in obtaining the results in this section.

Lemma 6.

Let

I \subseteq R

be a closed interval, and suppose that the sequence of functions

f_{n} : I \to R

uniformly converges to the continuous function

f : I \to R

on I. If f is either strictly negative or positive, then there exists

N > 0

such that

f_{n}

is also strictly negative or positive accordingly for all

n > N

.

Proof.

Without loss of generality suppose that f is strictly positive and by the EVT, it attains a minimum value of

m > 0

. By uniform convergence, there exists

N > 0

such that for all

n > N

we have

\begin{matrix} | f_{n} (x) - f (x) | & < \frac{m}{2} \\ \Rightarrow & - \frac{m}{2} & < f_{n} (x) - f (x) \\ \Rightarrow & f (x) - \frac{m}{2} & < f_{n} (x) \\ \Rightarrow & 0 < m - \frac{m}{2} & < f_{n} (x) \end{matrix}

as desired. □

Lemma 7.

Let

I \subseteq R

be a closed interval and suppose that

(f_{n})

is a sequence of functions that uniformly converge to a continuous function

f : I \to R

on I. If

f_{n} (x) \neq 0

and

f (x) \neq 0

for all

n \geq 1

and

x \in I

, then

(\frac{1}{f_{n}}) ⇉ \frac{1}{f}

. In addition. the function

\frac{1}{f} : I \to R

is continuous.

Proof.

Since f has no roots in I, by the IVT, it must either be positive or negative on I. Without loss of generality, suppose that f is positive on I and by the EVT, f attains a minimum at

x_{m i n} \in I

. Let

m = f (x_{m i n})

and by uniform convergence, there exists

N_{0} > 0

such that for all

n > N_{0}

\begin{matrix} | f_{n} (x) - f (x) | & < \frac{m}{2} \\ \Rightarrow & - \frac{m}{2} & \leq f_{n} (x) - f (x) \\ \Rightarrow & f (x) - \frac{m}{2} & \leq f_{n} (x) \\ \Rightarrow & m - \frac{m}{2} & \leq f_{n} (x) \\ \Rightarrow & 0 < \frac{m}{2} & \leq f_{n} (x) \\ \Rightarrow & 0 < \frac{1}{| f_{n} (x) |} & \leq \frac{2}{m} \end{matrix}

for all

x \in I

.

Let

ϵ > 0

and by uniform convergence there exists

N_{1} > 0

such that for all

n > N_{1}

,

| f - f_{n} | < \frac{m^{2} ϵ}{4}

. Suppose

n > max {N_{0}, N_{1}}

and we have

\begin{matrix} |\frac{1}{f_{n}} - \frac{1}{f}| & = \frac{1}{| f | | f_{n} |} | f_{n} - f | \\ \leq {(\frac{2}{m})}^{2} | f_{n} - f | \\ < \frac{4}{m^{2}} \frac{m^{2} ϵ}{4} \\ = ϵ \end{matrix}

for all

x \in I

. Therefore

(\frac{1}{f_{n}})

uniformly converges to

\frac{1}{f}

on I. As a composition of continuous functions

f : I \to R^{+}

and

\frac{1}{x} : R^{+} \to R

, continuity of

\frac{1}{f}

follows immediately. □

Lemma 6 can be used to weaken the conditions of the above Lemma so that for some

N > 0

,

\frac{1}{f_{n}} ⇉ \frac{1}{f}

while using the restriction

n > N

. For the purposes of this paper, it suffices to assume

N = 1

without loss of generality.

Lemma 8.

Let

I \subseteq R

be a closed interval, and suppose that the sequence of functions

(f_{n} : I \to R)

uniformly converges to the continuous function

f : I \to R

on I. Suppose that

{x_{k}^{n}}_{k = 1}^{m_{n}} \subseteq I

is a family of sequences indexed by

n \in Z^{+}

.

If either

lim_{n \to \infty} \sum_{k = 1}^{m_{n}} f (x_{k}^{n}) \frac{1}{m_{n}} or lim_{n \to \infty} \sum_{k = 1}^{m_{n}} f_{n} (x_{k}^{n}) \frac{1}{m_{n}}

converge, then the limits coincide.

Proof.

Let

ε > 0

and there exists

N > 0

such that for all

n > N

,

| f - f_{n} | < ϵ, \forall x \in D

. Suppose

n > N

and we have

\begin{matrix} | \sum_{k = 1}^{m_{n}} f (x_{k}^{n}) - f_{n} (x_{k}^{n}) | \frac{1}{m_{n}} \leq & \sum_{k = 1}^{m_{n}} | f (x_{k}^{n}) - f_{n} (x_{k}^{n}) | \frac{1}{m_{n}} \\ < & \sum_{k = 1}^{m_{n}} ϵ \frac{1}{m_{n}} \\ = & m_{n} ϵ \frac{1}{m_{n}} \\ = & ϵ \end{matrix}

Thus

lim_{n \to \infty} \sum_{k = 1}^{m_{n}} (f (x_{k}^{n}) - f_{n} (x_{k}^{n})) \frac{1}{m_{n}} = 0

as desired. □

In the special case that the sequences

p_{n} = {\frac{k}{n}}_{k = 1}^{n}

are partitions of the unit interval, we can compute the limit of Riemann sums

lim_{n \to \infty} \sum_{k = 1}^{n} f_{n} (\frac{k}{n}) \frac{1}{n}

by replacing

f_{n}

with the limit f. Since

f : [0, 1] \to R

is continuous, and therefore Riemann integrable, the sum reduces to the integral

\begin{matrix} lim_{n \to \infty} \sum_{k = 1}^{n} f (\frac{k}{n}) \frac{1}{n} = \int_{0}^{1} f (x) d x . \end{matrix}

By the above Lemma, this integral is the value of

lim_{n \to \infty} \sum_{k = 1}^{n} f_{n} (\frac{k}{n}) \frac{1}{n}

.

3.1. Union of a Path and Complete Graph

Let

m, n

be non-negative integers, and

P_{n} \cup K_{m}

will denote a path with n vertices joined to a vertex of a complete graph with m vertices. The vertices of the path will be labeled

v_{1}

through

v_{n}

where

v_{1} = v_{n + 1}

is the junction vertex, and the vertices in the complete graph are labeled

v_{n + 1}

through

v_{n + m}

.

We next determine the

S D

values of

P_{n} \cup K_{m}

considering three different cases:

Both vertices are on the path:
If $v_{j}, v_{i} \in P_{n}$ , then $d (v_{i}, v_{j}) = | j - i | .$
Both distinct vertices are on the complete graph:
If $v_{i}, v_{j} \in K_{m}$ , then $d (v_{i}, v_{j}) = 1 .$
One vertex is in the path and the other is in the complete subgraph.
Without loss of generality, suppose that $v_{i} \in P_{n}$ and $v_{j} \in K_{m} / {v_{n + 1}}$ , then the shortest path from $v_{i}$ to $v_{j}$ is obtained by first traveling from $v_{i}$ to $v_{1}$ and then from $v_{1} = v_{n + 1}$ to $v_{j}$ . Thus

$\begin{matrix} d (v_{i}, v_{j}) = & d (v_{i}, v_{1}) + d (v_{n + 1}, v_{j}) \\ = & i - 1 + 1 = i . \end{matrix}$

Now suppose that $v_{j} \in P_{n}$ and we have

$\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{n} d (v_{j}, v_{k}) + \sum_{k = n + 2}^{n + m} d (v_{j}, v_{k}) \\ = & \sum_{k = 1}^{n} | j - i | + \sum_{k = n + 2}^{n + m} j \\ = & (\binom{j}{2}) + (\binom{n - j + 1}{2}) + (m - 1) j . \end{matrix}$

If

v_{j} \in K_{m} / {v_{n + 1}}

, then

\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{n} d (v_{j}, v_{k}) + \sum_{k = n + 2}^{n + m} d (v_{j}, v_{k}) \\ = & \sum_{k = 1}^{n} k + \sum_{k = n + 2}^{n + m} d (v_{j}, v_{k}) \\ = & (\binom{n + 1}{2}) + \sum_{k = n + 2, k \neq j}^{n + m} 1 \\ = & (\binom{n + 1}{2}) + m - 2 . \end{matrix}

Theorem 2.

Let p be a positive real number and suppose that

(n_{k})

and

(m_{k})

are strictly increasing sequences of positive integers such that

lim_{k \to \infty} \frac{m_{k}}{n_{k}} = p

. Let

S_{k} = P_{n_{k}} \cup K_{m_{k}}

, then

\begin{matrix} lim_{k \to \infty} C_{C} (S_{k}) = (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x + 2 p + 2 p^{2} \\ lim_{k \to \infty} \frac{S D (S_{k})}{{(n_{k} + m_{k})}^{3}} = \frac{p + \frac{1}{3}}{{(1 + p)}^{3}} . \end{matrix}

Proof.

We will first show the centrality result and let

N_{k} = n_{k} + m_{k}

and note that

S_{k}

has

N_{k} - 1

vertices. Thus

\begin{matrix} C_{C} (S_{k}) = & \sum_{j = 1}^{n_{k}} \frac{N_{k} - 2}{S D (v_{j})} + \sum_{j = n_{k} + 2}^{N_{k}} \frac{N_{k} - 2}{S D (v_{j})} \\ \Rightarrow \frac{N_{k}}{N_{k} - 2} C_{C} (S_{k}) = & \sum_{j = 1}^{n_{k}} \frac{N_{k}^{2}}{S D (v_{j})} \frac{1}{N_{k}} + \sum_{j = n_{k} + 2}^{N_{k}} \frac{N_{k}}{S D (v_{j})} \\ = & \frac{n_{k}}{N_{k}} \sum_{j = 1}^{n_{k}} \frac{N_{k}^{2}}{S D (v_{j})} \frac{1}{n_{k}} + \sum_{j = n_{k} + 2}^{N_{k}} \frac{N_{k}}{S D (v_{j})} \\ = & A_{k} + B_{k} . \end{matrix}

Define sequence of functions

f_{k} : [0, 1] \to R

where

\begin{matrix} f_{k} (x) = \frac{1}{N_{k}^{2}} ((\binom{n_{k} x}{2}) + (\binom{n_{k} - n_{k} x + 1}{2}) + (m_{k} - 1) (n_{k} x)) \end{matrix}

and we will show that

\begin{matrix} f_{k} ⇉ f (x) = \frac{x^{2}}{2 {(1 + p)}^{2}} + \frac{{(1 - x)}^{2}}{2 {(1 + p)}^{2}} + \frac{p x}{{(1 + p)}^{2}} . \end{matrix}

The functions

f_{k} (x)

were obtained by reparameterizing

S D (v_{j})

so that

f (\frac{j}{n_{k}}) = \frac{S D (v_{j})}{N_{k}^{2}}

, for

1 \leq j \leq n_{k}

. This strategy will be used again for the next family of graphs. Now let

a_{k} (x) = \frac{1}{N_{k}^{2}} (\binom{n_{k} x}{2})

,

b_{k} (x) = \frac{1}{N_{k}^{2}} (\binom{n_{k} - n_{k} x + 1}{2})

, and

c_{k} (x) = \frac{1}{N_{k}^{2}} (m_{k} - 1) (n_{k} x)

and we have 3 cases.

$a_{k} ⇉ a (x) = \frac{x^{2}}{2 {(1 + p)}^{2}}$ :
We have

$\begin{matrix} | a_{k} (x) - a (x) | = & | \frac{x^{2}}{2 {(1 + p)}^{2}} - \frac{(n_{k} x) (n_{k} x - 1)}{2 N_{k}^{2}} | \\ = & | \frac{x^{2}}{2 {(1 + p)}^{2}} - \frac{n_{k}^{2} x^{2}}{2 N_{k}^{2}} + \frac{n_{k} x}{2 N_{k}^{2}} | \\ \leq & \frac{x^{2}}{2} | \frac{1}{{(1 + p)}^{2}} - \frac{n_{k}^{2}}{N_{k}^{2}} | + \frac{x}{2} | \frac{n_{k}}{N_{k}^{2}} | \\ \leq & \frac{1}{2} | \frac{1}{{(1 + p)}^{2}} - \frac{n_{k}^{2}}{N_{k}^{2}} | + \frac{1}{2} | \frac{n_{k}}{N_{k}^{2}} | \\ = & \frac{1}{2} | \frac{1}{{(1 + p)}^{2}} - \frac{1}{{(1 + \frac{m_{k}}{n_{k}})}^{2}} | + \frac{1}{2 N_{k}} | \frac{1}{(1 + \frac{m_{k}}{n_{k}})} | . \end{matrix}$

This is true for all $x \in [0, 1]$ and recall that $\frac{m_{k}}{n_{k}} \to p$ and $N_{k} \to \infty$ . Thus $\frac{1}{{(1 + \frac{m_{k}}{n_{k}})}^{2}} \to \frac{1}{{(1 + p)}^{2}}$ and it is also clear that the right term of the last inequality goes to 0 uniformly for all $x \in [0, 1] .$ Then $a_{k} ⇉ a$
$b_{k} (x) ⇉ b (x) = \frac{{(x - 1)}^{2}}{2 {(p + 1)}^{2}}$
The proof is similar to the first case.
$c_{k} (x) ⇉ c (x) = \frac{p x}{{(1 + p)}^{2}}$

$\begin{matrix} | c (x) - c_{k} (x) | = & |\frac{p x}{{(1 + p)}^{2}} - \frac{1}{N_{k}^{2}} (m_{k} - 1) (n_{k} x)| \\ \leq & |\frac{p}{{(1 + p)}^{2}} - \frac{m_{k} n_{k}}{N_{k}^{2}}| + |\frac{n_{k}}{N_{k}^{2}}| . \end{matrix}$

It can be shown that

lim_{k \to \infty} \frac{m_{k} n_{k}}{N_{k}^{2}} = \frac{p}{{(1 + p)}^{2}}

and since

N_{k} \to \infty

, the last inequality goes to 0 uniformly for all

x \in [0, 1]

. The sum of the sequences

f_{k} = a_{k} + b_{k} + c_{k}

uniformly converges to the sum of the limits

f (x) = a + b + c

. Note that

f (x) = 0

if and only if

x^{2} + x (p - 1) + \frac{1}{2} = 0

, which has discriminant

{(p - 1)}^{2} - 2

. Then for all

p \in [0, \sqrt{2} + 1)

,

f (x)

has no real roots. Now suppose

p \geq \sqrt{2} + 1

, and since

x \geq 0

, it follows

\begin{matrix} x^{2} + x (p - 1) + \frac{1}{2} \geq \frac{1}{2} . \end{matrix}

The continuous function

f : [0, 1] \to R

has no nonnegative roots for

p > 0

and by Lemma 6, there exists

N > 0

such that

f_{k}

has no roots for all

k \geq N

. Without loss of generality suppose

N = 1

, and by Lemma 7, sequence

{\frac{1}{f_{k}}}

uniformly converges to continuous function

\frac{1}{f}

on

[0, 1]

. Let

{x_{j}^{k} = \frac{j}{n_{k}}}_{j = 1}^{n_{k}}

be a sequence of partitions of

[0, 1]

, and we have

\begin{matrix} lim_{k \to \infty} A_{k} = & lim_{k \to \infty} \frac{n_{k}}{N_{k}} \cdot lim_{k \to \infty} \sum_{j = 1}^{n_{k}} \frac{1}{f_{k} (\frac{j}{n_{k}})} \frac{1}{n_{k}} \\ = & \frac{1}{1 + p} lim_{k \to \infty} \sum_{j = 1}^{n_{k}} \frac{1}{f (x_{j}^{k})} \frac{1}{n_{k}} \\ = & \frac{1}{1 + p} \int_{0}^{1} \frac{{(1 + p)}^{2}}{x^{2} + x (p - 1) + \frac{1}{2}} d x \\ = & (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x . \end{matrix}

Furthemore

\begin{matrix} B_{k} = & \sum_{j = n_{k} + 2}^{N_{k}} \frac{N_{k}}{S D (v_{j})} \\ = & \sum_{j = n_{k} + 2}^{N_{k}} \frac{N_{k}}{(\binom{n_{k} + 1}{2}) + m_{k} - 2} \\ = & \frac{N_{k} (m_{k} - 2)}{(\binom{n_{k} + 1}{2}) + m_{k} - 2} \\ = & \frac{1}{\frac{1}{N_{k} (m_{k} - 2)} (\binom{n_{k} + 1}{2}) + \frac{1}{N_{k}}} . \end{matrix}

It is clear that

\frac{1}{N_{k} (m_{k} - 2)} (\binom{n_{k} + 1}{2}) = \frac{1}{2} (\frac{n_{k}}{N_{k}}) (\frac{n_{k} + 1}{m_{k} - 2})

converges to

\frac{1}{2} \frac{1}{1 + p} \frac{1}{p}

so that

B_{k} ⇉ 2 (p + 1) p

. Thus

\begin{matrix} lim_{k \to \infty} C_{C} (S_{k}) = & lim_{k \to \infty} \frac{N_{k}}{N_{k} - 2} C_{C} (S_{k}) \\ = & lim_{k \to \infty} A_{k} + lim_{k \to \infty} B_{k} \\ = & (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x + 2 (p + 1) p \end{matrix}

as desired. As for the sum of distances, we have

\begin{matrix} lim_{k \to \infty} \frac{S D (S_{k})}{N_{k}^{3}} = & lim_{k \to \infty} \frac{n_{k}}{N_{k}} (\sum_{j = 1}^{n_{k}} \frac{S D (v_{j})}{N_{k}^{2}} \frac{1}{n_{k}}) + lim_{k \to \infty} \frac{1}{N_{k}^{3}} \sum_{j = n_{k} + 2}^{N_{k}} S D (v_{j}) \\ = & \frac{1}{1 + p} (lim_{k \to \infty} \sum_{j = 1}^{n_{k}} f_{k} (\frac{j}{n}) \frac{1}{n_{k}}) + lim_{k \to \infty} \frac{m_{k} - 1}{N_{k}^{3}} ((\binom{n_{k} + 1}{2}) + m_{k} - 2) \\ = & \frac{1}{1 + p} (lim_{k \to \infty} \sum_{j = 1}^{n_{k}} f (x_{j}^{k}) \frac{1}{n_{k}}) + \frac{p}{2 {(1 + p)}^{3}} . \end{matrix}

The left-hand side of the last expression is the Riemann sum of f over the interval

[0, 1]

obtained by Lemma 8, and the right-hand limit can be easily computed knowing that

lim_{k \to \infty} \frac{n_{k}}{N_{k}} = \frac{1}{1 + p}

and

lim_{k \to \infty} \frac{m_{k}}{N_{k}} = \frac{p}{1 + p}

. Thus

\begin{matrix} lim_{k \to \infty} \frac{S D (S_{k})}{N_{k}^{3}} = & \frac{1}{{(1 + p)}^{3}} \int_{0}^{1} x^{2} + x (p - 1) + \frac{1}{2} d x + \frac{p}{2 {(1 + p)}^{3}} \\ = & \frac{p + \frac{1}{3}}{{(1 + p)}^{3}} . \end{matrix}

□

The function

S : R_{\geq} 0 \to R

, defined as

S (p) = (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x + 2 (p + 1) p

, will be called the shooting star centrality. Now we have the following result.

Corollary 1.

For every

c \in [π, \infty) \cap R

, there exists a sequence of graphs

G_{n}

such that

lim_{n \to \infty} C_{C} (G_{n}) = c .

Proof.

Suppose

c > π

, and note that

\begin{matrix} 0 \leq & (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x ⟹ \\ 2 (p + 1) p \leq & (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{1}{2}} d x + 2 (p + 1) p = S (p) ⟹ \\ lim_{p \to \infty} S (p) = & \infty . \end{matrix}

Thus there exists

p_{0} > 0

such that

c < S (p_{0})

and by the IVT, there exists

p \in (0, p_{0})

such that

S (p) = c

. Let

(\frac{m_{k}}{n_{k}})

be a sequence of convergents of p with strictly growing numerator and denominator. Since

lim_{k \to \infty} \frac{m_{k}}{n_{k}} = p

, then

lim_{k \to \infty} C_{C} (P_{n_{k}} \cup K_{m_{k}}) = S (p) = c

. If

c = π

, then

lim_{k \to \infty} C_{C} (P_{k}) = π

by Lemma 2. □

As a consequence of Theorem 1, we have

C_{C} (G) \geq \frac{π n}{n + 2}

. This provides the following result complementary to the above Corollary.

Theorem 3.

Let

C

denote the set of all graph centralities. Then

C \cap [0, π]

is not dense in

[0, π]

.

Proof.

Let

α \in [0, π]

be an irrational number and suppose that

A = C \cap [0, π]

is dense in

[0, π]

. Then α is a limit point of A and there exists a sequence of finite connected graphs

(G_{k})

such that

lim_{k \to \infty} C_{C} (G_{k}) = α

. If

| G_{k} |

is unbounded, then we can extract a subsequence of graphs

H_{k}

with increasing order such that

lim_{k \to \infty} C_{C} (H_{k}) = α

. However, by the above inequality,

\begin{matrix} C_{C} (H_{k}) \geq \frac{π | H_{k} |}{| H_{k} | + 2} \\ ⟹ & lim_{k \to \infty} C_{C} (H_{k}) \geq π \end{matrix}

which is a contradiction. Thus

| G_{k} |

is bounded, and let

N > 0

be the size of the largest graph. Let

R_{n}

denote the finite set of distinct connected graphs of

n \in Z^{+}

vertices up to isomorphism. Then we have

| C_{C} (R_{| G_{k} |}) | \leq | R_{| G_{k} |} | \leq \sum_{j = 1}^{N} | R_{j} |

, and each term

C_{C} (G_{k})

has upmost

\sum_{j = 1}^{N} | R_{j} |

of the same choices. Therefore, the sequence

C_{C} (G_{k})

has finitely many distinct terms which are all rational. But since the sequence converges, there exists

N_{0} > 0

such that all

C_{C} (G_{k})

are equal for

k > N

and the limit α is rational, which is a contradiction. Thus α is not a limit point of

A,

so that it cannot be dense in

[0, π]

as desired. □

Let

C (π)

denote the set of graph closeness centralities greater than

π

. Corollary 1 shows that

C (π)

is a dense subset of

[π, \infty)

. A related result can be obtained by considering the normalized centrality and mean distance of a connected graph G with n vertices as defined by Britz, Hu, Islam, and Tang []:

\begin{matrix} {\bar{C}}_{C} (G) = & \frac{1}{n} \sum_{k = 1}^{n} \frac{n - 1}{S D (v_{k})} \\ \bar{l} (G) = & \frac{1}{n (n - 1)} \sum_{k = 1}^{n} S D (v_{k}) . \end{matrix}

They showed that

1 \leq \bar{l} (G) {\bar{C}}_{C} (G) < 2

for all finite connected graphs G and conjectured the set of all such values,

L = {\bar{l} (G) {\bar{C}}_{C} (G)}

is dense in

[1, 2]

. Let

p \in R^{+}

and define integer sequences

{m_{k}}

,

{n_{k}}

the same way as in the above theorem. Then for the sequence of star graphs

S_{k} = P_{n_{k}} \cup K_{m_{k}}

we have

\begin{matrix} lim_{k \to \infty} \bar{l} (S_{k}) {\bar{C}}_{C} (S_{k}) = & lim_{k \to \infty} \frac{S D (S_{k}) C_{C} (S_{k})}{{(N_{k} - 1)}^{2} (N_{k} - 2)} \\ = & lim_{k \to \infty} \frac{S D (S_{k})}{N_{k}^{3}} C_{C} (S_{k}) \\ = & lim_{k \to \infty} \frac{S D (S_{k})}{N_{k}^{3}} \cdot lim_{k \to \infty} C_{C} (S_{k}) \\ = & \frac{p + \frac{1}{3}}{{(1 + p)}^{3}} S (p) = \bar{S} (p) . \end{matrix}

The function

\bar{S} (p) : R_{\geq 0} \to R

is continuous and note that

\bar{S} (0) = \frac{π}{3}

. Furthermore, it can be shown that

lim_{p \to \infty} \bar{S} (p) = 2

(see Appendix A Lemma A5), which converges from below, and we have the following result by applying the IVT as in Corollary 1.

Corollary 2.

The set

L \cap [\frac{π}{3}, 2]

is dense in

[\frac{π}{3}, 2]

3.2. Mean Distance

Let G be a connected graph of n vertices. Doyle and Graver, ref. [] defined the mean distance of G as

\begin{matrix} μ (G) = \frac{1}{n (n - 1)} S D (G) \end{matrix}

and presented that

μ (P_{n}) = \frac{n + 1}{3}

is a tight upper bound for the mean distance

μ

. It follows that

S D (P_{n}) = \frac{n^{3} - n}{3}

is a tight upper bound for

S D (G)

, and we will define the normalized mean distance as

\bar{S D} (G) = \frac{S D (G)}{S D (P_{n})} = \frac{S D (G)}{\frac{n^{3} - n}{3}}

. It is clear that

\begin{matrix} 0 < \bar{S D} (G) \leq 1 \end{matrix}

for all finitely connected graphs G and let

\bar{S D} \subseteq [0, 1]

denote the set of all such values.

Theorem 4.

The set

\bar{S D}

is dense in

[0, 1]

Proof.

Let

p \in R^{+}

and let

(n_{k}), (m_{k})

be strictly increasing sequence of positive integers whose ratio converges to p. Then by Theorem 2,

\begin{matrix} lim_{k \to \infty} \bar{S D} (S_{n_{k}, m_{k}}) = & lim_{k \to \infty} 3 \frac{S D (S_{n_{k}, m_{k}})}{{(N_{k} - 1)}^{3} - (N_{k} - 1)} \\ = & lim_{k \to \infty} 3 \frac{S D (S_{n_{k}, m_{k}})}{N_{k}^{3}} \\ = & \frac{3 p + 1}{{(1 + p)}^{3}} = f (p) . \end{matrix}

The function

f : R_{\geq 0} \to [0, 1]

is continuous and

lim_{p \to \infty} f (p) = 0

so that by a similar application of the IVT as in Corollary 1, the desired result follows. □

4. Balloon Graphs

Let

B_{2 n}

denote a balloon graph consisting of a cycle

C_{n}

attached to a path

P_{n}

, each containing n vertices. The junction vertex of the cycle and path will be labeled

v_{1}

. Traveling clockwise around the cycle, its vertices will be labeled

v_{1}, v_{2}, \dots, v_{n}, v_{n + 1}

where

v_{1}

and

v_{n + 1}

are the same vertex. Starting at

v_{n + 1}

, the vertices of the path will be labeled from left to right as

v_{n + 1}, v_{n + 2}, \dots, v_{2 n}

.

Now we have the following theorem:

Theorem 5.

lim_{n \to \infty} C_{C} (B_{2 n}) = \frac{4 arctan (\frac{2}{\sqrt{3}})}{\sqrt{3}} + 4 ln (\frac{5}{3})

Proof.

First we will find the shortest distance

d (v_{i}, v_{j})

between vertices

v_{i}, v_{j} \in B_{2 n}

and we have 3 cases:

$v_{i}, v_{j} \in C_{n}$
If both vertices are on the cycle, then the shortest distance is given by the minor arc between them. Without loss of generality, suppose that $1 \leq i < j \leq n$ and the shortest path is either travel clockwise from $v_{i}$ to $v_{j}$ , giving a distance of $j - i$ . Or we travel counterclockwise by going from $v_{i}$ to $v_{1}$ and then jumping from $v_{1}$ to $v_{n}$ . From here we go from $v_{n}$ to $v_{j}$ giving a total distance of $(i - 1) + 1 + (n - j) = n - (j - i)$ . Thus in general

$d (v_{i}, v_{j}) = m i n {| j - i |, n - | j - i |} = \frac{n}{2} - || j - i | - \frac{n}{2}| .$
$v_{i}, v_{j} \in P_{n}$
If both vertices are on the path then $d (v_{i}, v_{j}) = | i - j |$ where $i, j \in [n + 1, 2 n] .$
$v_{i} \in P_{n}$ and $v_{j} \in C_{n}$
If the vertices are on different components of the graph, then it suffices to consider when $v_{i} \in P_{n}$ and $v_{j} \in C_{n}$ . The shortest path is given by first going from $v_{i}$ to $v_{n + 1} = v_{1}$ and then from $v_{1}$ to $v_{j}$ giving

$\begin{matrix} d (v_{i}, v_{j}) & = d (v_{i}, v_{1}) + d (v_{1}, v_{j}) \\ = i - (n + 1) + (\frac{n}{2} - |j - 1 - \frac{n}{2}|) . \end{matrix}$

Now suppose that $v_{j} \in P_{n}$ and we have

$\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{2 n} d (v_{j}, v_{k}) \\ = & \sum_{k = 1}^{n} d (v_{j}, v_{k}) + \sum_{k = n + 2}^{2 n} d (v_{j}, v_{k}) \\ = & \sum_{k = 1}^{n} (d (v_{1}, v_{k}) + d (v_{1}, v_{j})) + (\sum_{k = n + 1}^{2 n} | j - k |) - d (v_{1}, v_{j}) \\ = & S D_{C_{n}} (v_{1}) + (n - 1) d (v_{1}, v_{j}) + \sum_{k = 1}^{n} | j - n - k | \\ = & S D_{C_{n}} (v_{1}) + S D_{P_{n}} (v_{j}) + (n - 1) (j - n - 1) \\ = & 2 (\binom{⌈ \frac{n}{2} ⌉}{2}) + (\binom{j - n}{2}) + (\binom{2 n - j + 1}{2}) + (n - 1) (j - n - 1) . \end{matrix}$

When expanding

S D_{P_{n}} (v_{j})

we need to replace j with

j - n

since

n + 1 \leq j \leq 2 n

. If

v_{j} \in C_{n}

then

\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{2 n} d (v_{j}, v_{k}) \\ = & \sum_{k = 1}^{n} d (v_{j}, v_{k}) + \sum_{k = n + 2}^{2 n} d (v_{j}, v_{k}) \\ = & S D_{C_{n}} (v_{j}) + (\sum_{k = n + 1}^{2 n} d (v_{k}, v_{n + 1}) + d (v_{1}, v_{j})) - d (v_{j}, v_{n + 1}) \\ = & S D_{C_{n}} (v_{j}) + S D_{P_{n}} (v_{1}) + (n - 1) d (v_{1}, v_{j}) \\ = & 2 (\binom{⌈ \frac{n}{2} ⌉}{2}) + (\binom{n}{2}) + (n - 1) (\frac{n}{2} - |j - 1 - \frac{n}{2}|) . \end{matrix}

Define a sequence of functions

f_{n} : [0, 1] \to R

as

f_{n} (x) = \frac{1}{n^{2}} (2 (\binom{⌈ \frac{n}{2} ⌉}{2}) + (\binom{n}{2}) + (n - 1) (\frac{n}{2} - |n x - 1 - \frac{n}{2}|))

Note that

f_{n} (\frac{j}{n}) = \frac{S D (v_{j})}{n^{2}}

for

1 \leq j \leq n

, and we will show that

f_{n} ⇉ f

where

f (x) = \frac{5}{4} - |x - \frac{1}{2}|

. Now we have

\begin{matrix} f_{n} (x) = L_{n} - \frac{n - 1}{n} |x - \frac{1}{n} - \frac{1}{2}| \end{matrix}

where

L_{n} = \frac{1}{n^{2}} (2 (\binom{⌈ \frac{n}{2} ⌉}{2}) + (\binom{n}{2})) + \frac{n - 1}{2 n}

. It can be shown that

L_{n} \to \frac{5}{4}

and it is clear that

\frac{n - 1}{n} |x - \frac{1}{n} - \frac{1}{2}| ⇉ |x - \frac{1}{2}|

. Their difference uniformly converges to the difference of the limits, which is

\frac{5}{4} - | x - \frac{1}{2} |

.

Now define new sequence of functions

g_{n} : [1, 2] \to R

as

\begin{matrix} g_{n} (x) = \frac{1}{n^{2}} (2 (\binom{⌈ \frac{n}{2} ⌉}{2}) + (\binom{n x - n}{2}) + (\binom{2 n - n x + 1}{2}) + (n - 1) (n x - n - 1)) . \end{matrix}

Note that

g_{n} (x)

can be expressed as the sum of 4 functions defined as

\begin{matrix} g_{n, 1} (x) = & \frac{2}{n^{2}} (\binom{⌈ \frac{n}{2} ⌉}{2}) \\ g_{n, 2} (x) = & \frac{1}{n^{2}} (\binom{n x - n}{2}) \\ g_{n, 3} (x) = & \frac{1}{n^{2}} (\binom{2 n - n x + 1}{2}) \\ g_{n, 4} (x) = & \frac{1}{n^{2}} (n - 1) (n x - n - 1) . \end{matrix}

It can be shown that each function converges to the following over

[1, 2]

:

\begin{matrix} g_{n, 1} ⇉ \frac{1}{4} \\ g_{n, 2} ⇉ \frac{{(x - 1)}^{2}}{2} \\ g_{n, 3} ⇉ \frac{{(x - 2)}^{2}}{2} \\ g_{n, 4} ⇉ x - 1 . \end{matrix}

The sum

g_{n}

uniformly converges to the function

\begin{matrix} g (x) = & \frac{1}{4} + \frac{{(x - 1)}^{2}}{2} + \frac{{(x - 2)}^{2}}{2} + x - 1 \\ = & {(x - 1)}^{2} + \frac{3}{4} . \end{matrix}

Note that continuous functions f and g have no roots on

[0, 1]

and

[1, 2]

, respectively, so that by Lemma 7,

\frac{1}{f_{n}} ⇉ \frac{1}{f}

and

\frac{1}{g_{n}} ⇉ \frac{1}{g}

, which are both continuous. Define a sequence of partitions

q_{n} = {x_{k}^{n} = \frac{n + k}{n}}_{k = 1}^{n}

, and note that

g (x_{k}^{n}) = \frac{S D (v_{k})}{n^{2}}

for vertices on the path component of

B_{2 n}

. Similarly, define a sequence of partitions

p_{n} = {y_{k}^{n} = \frac{k}{n}}_{k = 1}^{n}

and we have:

\begin{matrix} lim_{n \to \infty} C_{C} (B_{2 n}) = & lim_{n \to \infty} \sum_{k = 1}^{2 n} \frac{2 n}{S D (v_{k})} \\ = & lim_{n \to \infty} 2 \sum_{k = 1}^{n} \frac{n^{2}}{S D (v_{k})} \frac{1}{n} + 2 \sum_{k = n + 1}^{2 n} \frac{n^{2}}{S D (v_{k})} \frac{1}{n} \\ = & lim_{n \to \infty} 2 \sum_{k = 1}^{n} \frac{1}{f_{n} (y_{k}^{n})} \frac{1}{n} + 2 \sum_{k = 1}^{n} \frac{1}{g_{n} (x_{k}^{n})} \frac{1}{n} \\ = & lim_{n \to \infty} 2 \sum_{k = 1}^{n} \frac{1}{f (y_{k}^{n})} \frac{1}{n} + 2 \sum_{k = 1}^{n} \frac{1}{g (x_{k}^{n})} \frac{1}{n} \\ = & 2 \int_{0}^{1} \frac{1}{\frac{5}{4} - | y - \frac{1}{2} |} d y + 2 \int_{1}^{2} \frac{1}{{(x - 1)}^{2} + \frac{3}{4}} d x \\ = & \frac{4 arctan (\frac{2}{\sqrt{3}})}{\sqrt{3}} + 4 ln (\frac{5}{3}) . \end{matrix}

□

Generalized Balloon Graph Asymptotic

Let

B_{n, m}

denote a graph consisting of a path

P_{n}

joined with a cycle

C_{m}

at a single vertex. Starting at the junction vertex

v_{1}

, the cycle will be labeled

v_{1}

through

v_{m}

traveling clockwise. Meanwhile, the path will be labeled

v_{m + 1}

through

v_{n + m}

where

v_{m + 1} = v_{1}

is the junction vertex.

This class of graphs will be referred to as balloon graphs, and the

S D

values are as follows.

Lemma 9.

Let

B_{n, m}

denote a balloon graph and suppose

v_{j} \in B_{n, m}

is a vertex, then

\begin{matrix} S D (v_{j}) = \{\begin{matrix} R (n, m) + (n - 1) (\frac{m}{2} - | j - 1 - \frac{m}{2} |), & 1 \leq j \leq m \\ R (n, m) - (\binom{n}{2}) + (m - 1) (j - m - 1) + (\binom{j - m}{2}) + (\binom{N - j + 1}{2}), & m + 1 \leq j \leq N \end{matrix} \end{matrix}

where

N = n + m

and

R (n, m) = (\binom{⌈ \frac{m}{2} ⌉ + 1}{2}) + (\binom{m - ⌈ \frac{m}{2} ⌉ + 1}{2}) - ⌈ \frac{m}{2} ⌉ + (\binom{n}{2}) .

Proof.

We will calculate the

S D

values by separately considering the cycle

C_{m}

and path

P_{n}

components and adjusting for path going though the junction vertex. Suppose

1 \leq j \leq m

, i.e,

v_{j}

is on the cycle component. Then

\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{m} d (v_{k}, v_{j}) + \sum_{k = m + 2}^{N} d (v_{k}, v_{j}) \\ = & \sum_{k = 1}^{m} d (v_{k}, v_{j}) + \sum_{k = m + 2}^{N} (d (v_{j}, v_{1}) + d (v_{m + 1}, v_{k})) \\ = & \sum_{k = 1}^{m} d (v_{k}, v_{j}) + (n - 1) d (v_{j}, v_{1}) + \sum_{k = m + 2}^{N} d (v_{m + 1}, v_{k}) \\ = & S D_{C_{m}} (v_{j}) + (n - 1) d (v_{j}, v_{1}) + S D_{P_{n}} (v_{1}) . \end{matrix}

By symmetry of the cycle,

S D_{C_{m}} (v_{j}) = S D_{C_{m}} (v_{1})

, and note that the shortest distance between two vertices

v_{k}, v_{j} \in C_{m}

along the cycle is given by the minor arc between them, which has length

d (v_{j}, v_{k}) = min {| k - j |, m - | k - j |} = \frac{m}{2} - | \frac{m}{2} - | j - k | |

. Thus

\begin{matrix} S D_{C_{m}} (v_{1}) = & \sum_{k = 1}^{m} min {k - 1, m - k + 1 |} \\ = & \sum_{k \leq ⌈ \frac{m}{2} ⌉} min {k - 1, m - k + 1 |} + \sum_{k \geq ⌈ \frac{m}{2} ⌉ + 1} min {k - 1, m - k + 1 |} \\ = & \sum_{k = 1}^{⌈ \frac{m}{2} ⌉} (k - 1) + \sum_{k = ⌈ \frac{m}{2} ⌉ + 1}^{m} (m - k + 1) \\ = & \sum_{k = 1}^{⌈ \frac{m}{2} ⌉} (k - 1) + \sum_{k = 1}^{m - ⌈ \frac{m}{2} ⌉} k \\ = & (\binom{⌈ \frac{m}{2} ⌉ + 1}{2}) - ⌈ \frac{m}{2} ⌉ + (\binom{m - ⌈ \frac{m}{2} ⌉ + 1}{2}) \\ = & R (n, m) - (\binom{n}{2}) . \end{matrix}

Now we have

\begin{matrix} S D (v_{j}) = & R (n, m) + (n - 1) d (v_{j}, v_{1}) + S D_{P_{n}} (v_{1}) - (\binom{n}{2}) \\ = & R (n, m) + (n - 1) (\frac{m}{2} - | \frac{m}{2} - j + 1 |) . \end{matrix}

If

v_{j} \in P_{n}

, then

\begin{matrix} S D (v_{j}) = & \sum_{k = 1}^{m} d (v_{k}, v_{j}) + \sum_{k = m + 2}^{N} d (v_{k}, v_{j}) \\ = & \sum_{k = 1}^{m} (d (v_{k}, v_{1}) + d (v_{m + 1}, v_{j})) + \sum_{k = m + 1}^{N} d (v_{k}, v_{j}) - d (v_{m + 1}, v_{j}) \\ = & S D_{C_{m}} (v_{1}) + (m - 1) d (v_{m + 1}, v_{j}) + S D_{P_{n}} (v_{j}) \\ = & R (n, m) - (\binom{n}{2}) + (m - 1) (j - m - 1) + (\binom{j - m}{2}) + (\binom{N - j + 1}{2}) . \end{matrix}

□

Theorem 6.

Let

p \in R^{+}

and suppose that

(m_{k}), (n_{k})

are strictly increasing sequences of positive integers such that

\frac{m_{k}}{n_{k}} \to p

. Then

\begin{matrix} lim_{k \to \infty} C_{C} (B_{n_{k}, m_{k}}) = 2 (p + 1) (ln (\frac{p^{2} + 2 p + 2}{p^{2} + 2}) + \frac{1}{\sqrt{2 p + 1}} arctan (\frac{2 \sqrt{2 p + 1}}{p^{2} + 2 p})), \\ lim_{k \to \infty} \bar{l} (B_{n_{k}, m_{k}}) = \frac{p^{3} + 2 p^{2} + 4 p + \frac{4}{3}}{4 {(1 + p)}^{3}} . \end{matrix}

Proof.

Let

N_{k} = n_{k} + m_{k}

and the closeness centrality of

B_{n_{k}, m_{k}}

is

\begin{matrix} C_{C} (B_{n_{k}, m_{k}}) = & \sum_{j = 1}^{m_{k}} \frac{N_{k} - 2}{S D (v_{j})} + \sum_{j = m_{k} + 2}^{N_{k}} \frac{N_{k} - 2}{S D (v_{j})} \\ = & A_{k} + B_{k} \end{matrix}

and we will consider the right and left-hand sums separately. Define a sequence of functions

f_{k} : [0, 1] \to R

as

\begin{matrix} f_{k} (x) = \frac{1}{N_{k}^{2}} (R (n_{k}, m_{k}) + (n_{k} - 1) (\frac{m_{k}}{2} - | \frac{m_{k}}{2} - m_{k} x + 1 |)) . \end{matrix}

It can be shown that

\begin{matrix} f_{k} (x) ⇉ f (x) = \frac{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |}{4 {(1 + p)}^{2}} \end{matrix}

(See Appendix A Lemma A6) on

[0, 1]

. This sequence was chosen such that

f_{k} (\frac{j}{n_{k}}) = \frac{S D (v_{j})}{N_{k}^{2}}

for

1 \leq j \leq m_{k}

. Since

\frac{1}{2} - | x - \frac{1}{2} | \geq 0

for all

x \in [0, 1]

, it follows

f : [0, 1] \to R^{+}

is strictly positive, in addition to being continuous. By Lemma 7, sequence

{\frac{1}{f_{k}}}

uniformly converges to continuous function

\frac{1}{f}

, and we have

\begin{matrix} lim_{k \to \infty} \frac{N_{k}}{N_{k} - 2} A_{k} = & lim_{k \to \infty} \sum_{j = 1}^{m_{k}} \frac{N_{k}^{2}}{S D (v_{j})} \frac{1}{N_{k}} \\ = & lim_{k \to \infty} \frac{m_{k}}{N_{k}} \sum_{j = 1}^{m_{k}} \frac{1}{f_{k} (\frac{j}{m_{k}})} \frac{1}{m_{k}} \\ = & \frac{p}{1 + p} lim_{k \to \infty} \sum_{j = 1}^{m_{k}} \frac{1}{f (\frac{j}{m_{k}})} \frac{1}{m_{k}} \\ = & \frac{p}{1 + p} \int_{0}^{1} \frac{1}{f (x)} d x \\ = & \frac{p}{1 + p} \int_{0}^{1} \frac{4 {(1 + p)}^{2}}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x \\ = & 4 p (1 + p) \int_{0}^{1} \frac{1}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x . \end{matrix}

Now we will consider

B_{k}

. Define new sequence of functions

g_{k} : [0, 1] \to R

as

\begin{matrix} g_{k} (x) = R (n_{k}, m_{k}) - (\binom{n_{k}}{2}) + (m_{k} - 1) (n_{k} x - 1) + (\binom{n_{k} x}{2}) + (\binom{n_{k} - n_{k} x + 1}{2}) . \end{matrix}

Note that

g_{k} (\frac{j}{n_{k}}) = S D (v_{j + m_{k}})

for

1 \leq j \leq n_{k}

and using a very similar process as in Lemma A6, it can be shown that

\begin{matrix} g_{k} (x) ⇉ g (x) = \frac{4 x^{2} + 4 x (p - 1) + 2 + p^{2}}{4 {(1 + p)}^{2}} . \end{matrix}

Furthermore,

g (x)

has a root iff

x^{2} + x (p - 1) + \frac{p^{2} + 2}{4}

does as well, which has a negative discriminant

{(p - 1)}^{2} - {(p + 2)}^{2}

. Therefore

g (x)

has no roots in

[0, 1]

and by Lemma 7, sequence of functions

{\frac{1}{g_{k}}}

uniformly converges to continuous function

\frac{1}{g}

on

[0, 1] .

Now we have

\begin{matrix} lim_{k \to \infty} \frac{N_{k}}{N_{k} - 2} B_{k} = & lim_{k \to \infty} \sum_{j = m_{k} + 2}^{N_{k}} \frac{N_{k}^{2}}{S D (v_{j})} \frac{1}{N_{k}} \\ = & lim_{k \to \infty} \frac{n_{k}}{N_{k}} \sum_{j = 2}^{n_{k}} \frac{N_{k}^{2}}{S D (v_{j + m_{k}})} \frac{1}{n_{k}} \\ = & \frac{1}{1 + p} lim_{k \to \infty} \sum_{j = 2}^{n_{k}} \frac{1}{g_{k} (\frac{j}{n_{k}})} \frac{1}{n_{k}} \\ = & \frac{1}{1 + p} lim_{k \to \infty} \sum_{j = 2}^{n_{k}} \frac{1}{g (\frac{j}{n_{k}})} \frac{1}{n_{k}} \\ = & \frac{1}{1 + p} \int_{0}^{1} \frac{1}{g (x)} d x \\ = & (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{2 + p^{2}}{4}} d x . \end{matrix}

Thus,

\begin{matrix} lim_{k \to \infty} C_{C} (B_{n_{k}, m_{k}}) = & 4 p (1 + p) \int_{0}^{1} \frac{1}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x + (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{2 + p^{2}}{4}} d x \\ = & 2 (p + 1) (ln (\frac{p^{2} + 2 p + 2}{p^{2} + 2}) + \frac{1}{\sqrt{2 p + 1}} arctan (\frac{2 \sqrt{2 p + 1}}{p^{2} + 2 p})) \\ = & B (p) \end{matrix}

(see Appendix A Lemmas A2 and A3).

As for the mean sum of distance, we have

\begin{matrix} lim_{k \to \infty} \bar{l} (B_{n_{k}, m_{k}}) = & lim_{k \to \infty} \frac{S D (B_{n_{k}, m_{k}})}{N_{k}^{3}} \\ = & lim_{k \to \infty} \frac{m_{k}}{N_{k}} \sum_{j = 1}^{m_{k}} \frac{S D (v_{j})}{N_{k}^{2}} \frac{1}{m_{k}} + lim_{k \to \infty} \frac{n_{k}}{N_{k}} \sum_{j = 2}^{n_{k}} \frac{S D (v_{j + m_{k}})}{N_{k}^{2}} \frac{1}{n_{k}} \\ = & lim_{k \to \infty} \frac{m_{k}}{N_{k}} \sum_{j = 1}^{m_{k}} f_{k} (\frac{j}{m_{k}}) \frac{1}{m_{k}} + lim_{k \to \infty} \frac{n_{k}}{N_{k}} \sum_{j = 2}^{n_{k}} g_{k} (\frac{j}{n_{k}}) \frac{1}{n_{k}} \\ = & lim_{k \to \infty} \frac{m_{k}}{N_{k}} \sum_{j = 1}^{m_{k}} f (\frac{j}{m_{k}}) \frac{1}{m_{k}} + lim_{k \to \infty} \frac{n_{k}}{N_{k}} \sum_{j = 2}^{n_{k}} g (\frac{j}{n_{k}}) \frac{1}{n_{k}} \\ = & \frac{p}{1 + p} \int_{0}^{1} \frac{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |}{4 {(1 + p)}^{2}} d x + \frac{1}{1 + p} \int_{0}^{1} \frac{4 x^{2} + 4 x (p - 1) + 2 + p^{2}}{4 {(1 + p)}^{2}} d x \\ = & \frac{p^{3} + 2 p^{2} + 4 p + \frac{4}{3}}{4 {(1 + p)}^{3}} \\ = & B_{0} (p) . \end{matrix}

□

Theorem 7.

The set

L

is dense in

[1, 2]

, where

L = {\bar{l} (G) {\bar{C}}_{C} (G) : G is finite and connected} .

Proof.

It can be shown that

lim_{p \to \infty} B (p) = 4

and

lim_{p \to \infty} B_{0} (p) = \frac{1}{4}

(see Appendix A Lemma A4), which corresponds to the balloon graph having relatively negligible vertices on the path component compared to the cycle so that it overall behaves like the latter. The limit of the products is 1 and

B (0) B_{0} (0) = \frac{π}{3}

, which is attributed to the path graph. By continuity of

f (p) = B (p) B_{0} (p)

, for each

y \in (1, \frac{π}{3})

there exists

p \in (0, \infty)

such that

f (p) = y

. Choose strictly increasing sequence of positive integers

(m_{k}), (n_{k})

such that

lim_{k \to \infty} \frac{m_{k}}{n_{k}} = p

and we have

\begin{matrix} lim_{k \to \infty} \bar{l} (B_{n_{k}, m_{k}}) {\bar{C}}_{C} (B_{n_{k}, m_{k}}) = & lim_{k \to \infty} \frac{S D (B_{n_{k}, m_{k}})}{{(n_{k} + m_{k})}^{3}} C_{C} (B_{n_{k}, m_{k}}) \\ = & B_{0} (p) B (p) \\ = & y . \end{matrix}

Thus, every point of

[1, \frac{π}{3}]

is a limit point of

L \cap [1, \frac{π}{3}]

, which is then dense in

[1, \frac{π}{3}]

. By applying Corollary 2, we obtain the entire result. □

5. Conclusions

We verify a conjecture of Britz, Hu, Islam, and Tang that the set of products

{\bar{l} (G) {\bar{C}}_{C} (G) : G is finite and connected}

is dense in

[1, 2)

. The use of Riemann sums provides a method that is computationally advantageous for precisely determining asymptotics of closeness centralities. The key benefit is the use of integration for calcuating asymptotic behavior of complicated combinatorial forumulae. It would be interesting to see how the methods in this paper can be used for other families of graphs. The density results are connected to the inverse Wiener index problem [], where one starts with a prescribed sum of distances over all pairs of vertices and seeks a tree with this index. The following problem presents a problem that could provide a bridge between our work and the inverse Wiener index problem. There is also a potential index to the Harary index [].

Problem 1.

Determine

lim_{n \to \infty} C_{C} (G)

where G is a tree.

The next problem is a natural extension of the first problem.

Problem 2.

Determine

lim_{n \to \infty} C_{C} (G)

where G is a unicyclic graph.

We have started developing quasi-simple curve theory to demonstrate how closeness centrality can be calculated with just calculus. This would provide a segue to a natural definition of closeness centrality for piecewise smooth curves.

Author Contributions

Conceptualization, S.F., A.G.S., B.R. and D.A.N.; Methodology, A.G.S., B.R. and D.A.N.; Validation, A.G.S. and B.R.; Formal analysis, S.F., A.G.S., B.R. and D.A.N.; Investigation, S.F., A.G.S. and B.R.; Writing—original draft, A.G.S.; Writing—review & editing, B.R. and D.A.N.; Supervision, D.A.N.; Project administration, D.A.N.; Funding acquisition, D.A.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a National Science Foundation Research Experiences for Undergraduates Grant #2243938.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

For the sake of completeness, we include details for some of the computations here.

Lemma A1.

Define sequence of functions

g_{n} : [0, 1] \to R

, where

g_{n} (x) = x^{2} - \frac{n + 1}{n} x + \frac{n + 1}{2 n}

. Then

{\frac{1}{g_{n}}}_{n = 1}^{\infty}

uniformly converges to

\frac{1}{g (x)} = \frac{1}{x^{2} - x + \frac{1}{2}}

on

[0, 1]

.

Proof.

We will first show that

g_{n} (x)

uniformly converges to

g (x)

. Let

x \in [0, 1]

, and we have

\begin{matrix} | g (x) - g_{n} (x) | = & | x (\frac{n + 1}{n} - 1) + (\frac{1}{2} - \frac{n + 1}{2 n}) | \\ = & | x (\frac{1}{n}) - \frac{1}{n} | \\ \leq & | x | \frac{1}{n} + \frac{1}{n} \\ \leq & \frac{2}{n} \end{matrix}

and it is clear that

g_{n} ⇉ g

. The strictly positive function

g (x)

has a minimum value of

\frac{1}{4}

on

[0, 1]

, and by uniform convergence, there exists

N > 0

such that

g_{n} (x) > \frac{1}{8}

for all

n > N_{0}

and

x \in [0, 1]

. Let

ϵ > 0

, and there exists

N_{1} > 0

such that

| g - g_{n} | < \frac{ϵ}{64}

for all

n > N_{1}

. Let

n > max (N_{0}, N_{1})

, and we have

\begin{matrix} | \frac{1}{g} - \frac{1}{g_{n}} | = & \frac{1}{| g_{n} | | g |} | g_{n} - g | \\ \leq & {(8)}^{2} | g_{n} - g | \\ < & 64 \frac{ϵ}{64} \\ = & ϵ \end{matrix}

for all

x \in [0, 1]

. □

Lemma A2.

4 p (1 + p) \int_{0}^{1} \frac{1}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x = 2 (1 + p) ln \frac{p^{2} + 2 p + 2}{p^{2} + 2}

, where p is a positive real number.

Proof.

Let

k = p^{2} + 2 p + 1

and

c = 4 p (1 + p)

, and we have

\begin{matrix} I = 4 p (1 + p) \int_{0}^{1} \frac{1}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x = & c \int_{0}^{1} \frac{1}{k - 4 p | x - \frac{1}{2} |} d x \\ = & c \int_{0}^{\frac{1}{2}} \frac{1}{k - 4 p (\frac{1}{2} - x)} d x + c \int_{\frac{1}{2}}^{1} \frac{1}{k - 4 p (x - \frac{1}{2})} d x \end{matrix}

We will do two u-substitutions by letting

u = \frac{1}{2} - x

and

u = x - \frac{1}{2}

in the first and second integrals, respectively. Doing this shows that both integrals have the same value, and by doubling the second one after the u-substitution, we get

\begin{matrix} I = & 2 c \int_{0}^{\frac{1}{2}} \frac{1}{k - 4 p u} d u \\ = & 2 c (\frac{1}{- 4 p}) (ln | k - 2 p | - ln | k |) \\ = & \frac{c}{2 p} ln \frac{| k |}{| k - 2 p |} \\ = & 2 (1 + p) ln \frac{p^{2} + 2 p + 2}{p^{2} + 2} \end{matrix}

□

Lemma A3.

Let p be a positive real number. Then,

(1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{p^{2} + 2}{4}} d x =

\frac{2 (p + 1)}{\sqrt{2 p + 1}} arctan (\frac{2 \sqrt{2 p + 1}}{p^{2} + 2})

.

Proof.

By completing the square, we get

x^{2} + x (p - 1) + \frac{p^{2} + 2}{4} = {(x + \frac{p - 1}{2})}^{2} + \frac{2 p + 1}{4}

. Let

c = 1 + p

,

h = \frac{p - 1}{2}

, and

α = \frac{\sqrt{2 p + 1}}{2}

to get

\begin{matrix} I = (1 + p) \int_{0}^{1} \frac{1}{x^{2} + x (p - 1) + \frac{p^{2} + 2}{4}} d x = & c \int_{0}^{1} \frac{1}{{(x + h)}^{2} + α^{2}} d x \\ = & \frac{c}{α^{2}} \int_{0}^{1} \frac{1}{1 + {(\frac{x + h}{α})}^{2}} d x \\ = & \frac{c}{α} \int_{\frac{h}{α}}^{\frac{1 + h}{α}} \frac{1}{1 + u^{2}} d u u - substitution u = \frac{x + h}{α} \\ = & \frac{c}{α} (arctan (\frac{1 + h}{α}) - arctan (\frac{h}{α})) \end{matrix}

Since

p > 0

, the value

1 + \frac{1 + h}{α} \frac{h}{α}

is positive, and using the identity

arctan (x) - arctan (y) = arctan (\frac{x - y}{1 + x y})

, we have

\begin{matrix} I = & \frac{c}{α} arctan (\frac{α}{α^{2} + (1 + h) h}) \\ = & \frac{2 (1 + p)}{\sqrt{2 p + 1}} arctan (\frac{2 \sqrt{2 p + 1}}{p^{2} + 2 p}) \end{matrix}

□

Lemma A4.

Let

p > 0

, then

lim_{p \to \infty} \int_{0}^{1} \frac{4 p (1 + p)}{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |} d x + \int_{0}^{1} \frac{(1 + p)}{x^{2} + x (p - 1) + \frac{2 + p^{2}}{4}} d x = 4

.

Proof.

It suffices to show that the statement holds for any sequence

{p_{k}}_{k = 1}^{\infty}

of positive real numbers with limit ∞. We will consider the integrals separately and define a sequence of functions

f_{k} : [0, 1] \to R

as

\begin{matrix} f_{k} (x) = & \frac{p_{k}^{2} + 2 p_{k} + 2 - 4 p_{k} | x - \frac{1}{2} |}{4 p_{k} (1 + p_{k})} \\ = & \frac{p_{k}}{4 (1 + p_{k})} + \frac{1}{2 p_{k}} - \frac{| x - \frac{1}{2} |}{1 + p_{k}} . \end{matrix}

It is clear that

f_{k} ⇉ \frac{1}{4}

, and by Lemma 7, sequence of functions

{\frac{1}{f_{k}}}_{k = 1}^{\infty}

uniformly converges to 4 on

[0, 1]

. To evaluate

lim_{k \to \infty} \int_{0}^{1} \frac{1}{f_{k}} d x

, we can interchange the limit and integral to obtain

\begin{matrix} lim_{k \to \infty} \int_{0}^{1} \frac{4 p_{k} (1 + p_{k})}{p_{k}^{2} + 2 p_{k} + 2 - 4 p_{k} | x - \frac{1}{2} |} d x = \int_{0}^{1} 4 d x = 4 . \end{matrix}

As for the other integral, suppose that

p_{k} > 1

, and

x^{2} + x (p_{k} - 1) + \frac{2 + p_{k}^{2}}{4} \geq \frac{2 + p_{k}^{2}}{4}

implies

\begin{matrix} 0 \leq \int_{0}^{1} \frac{1 + p_{k}}{x^{2} + x (p_{k} - 1) + \frac{2 + p_{k}^{2}}{4}} d x \leq \int_{0}^{1} \frac{1 + p_{k}}{\frac{2 + p_{k}^{2}}{4}} d x . \end{matrix}

The right integral clearly has limit 0, and by the squeeze theorem, so does the middle sequence. □

Lemma A5.

Let

p > 0

, then

lim_{p \to \infty} \bar{S} (p) = 2

.

Proof.

We have

\begin{matrix} \bar{S} (p) = \frac{p + \frac{1}{3}}{(p + 1)} \int_{0}^{1} \frac{1}{(p + 1) (x^{2} + x (p - 1) + \frac{1}{2})} d x + \frac{2 p (p + \frac{1}{3})}{{(p + 1)}^{2}} . \end{matrix}

The right term has limit 2, and we will now focus on the integral. Without loss of generality, suppose that

p > 1

, and since

x^{2} + x (p - 1) + \frac{1}{2} \geq \frac{1}{2}

, we have

\begin{matrix} 0 \leq \frac{p + \frac{1}{3}}{(p + 1)} \int_{0}^{1} \frac{1}{(p + 1) (x^{2} + x (p - 1) + \frac{1}{2})} d x \leq \frac{p + \frac{1}{3}}{(p + 1)} \int_{0}^{1} \frac{1}{(p + 1) \frac{1}{2}} d x . \end{matrix}

By the sandwich theorem, the middle integral goes to 0 as

p \to \infty

. □

Lemma A6.

Define

R (n_{k}, m_{k})

and sequence of functions

f_{k} : [0, 1] \to R

as in Theorem . Then

f_{k} ⇉ f (x) = \frac{p^{2} + 2 p + 2 - 4 p | x - \frac{1}{2} |}{4 {(1 + p)}^{2}}

on

[0, 1]

.

Proof.

Recall that

\begin{matrix} f_{k} (x) = \frac{1}{N_{k}^{2}} (R (n_{k}, m_{k}) + (n_{k} - 1) (\frac{m_{k}}{2} - | \frac{m_{k}}{2} - m_{k} x + 1 |)) \end{matrix}

where

R (n_{k}, m_{k}) = (\binom{⌈ \frac{m_{k}}{2} ⌉ + 1}{2}) + (\binom{m_{k} - ⌈ \frac{m_{k}}{2} ⌉ + 1}{2}) - ⌈ \frac{m_{k}}{2} ⌉ + (\binom{n_{k}}{2})

. We will first focus on this term. Expanding the binomial coefficients gives

\begin{matrix} \frac{R (n_{k}, m_{k})}{N_{k}^{2}} = \frac{1}{2 N_{k}^{2}} (⌈ \frac{m_{k}}{2} ⌉) (⌈ \frac{m_{k}}{2} ⌉ + 1) + \frac{1}{2 N_{k}^{2}} (m_{k} - ⌈ \frac{m_{k}}{2} ⌉) (m_{k} - ⌈ \frac{m_{k}}{2} ⌉ + 1) + \frac{1}{2 N_{k}^{2}} (n_{k}) (n_{k} - 1) - \frac{1}{2 N_{k}^{2}} ⌈ \frac{m_{k}}{2} ⌉ \end{matrix}

Since the difference

| x - ⌈ x ⌉ |

is upmost 1, this error will be made negligible due to the

\frac{1}{2 N_{k}^{2}}

term. Therefore, we can drop the ceiling function without changing the value of the limit. We can go further and remove all terms and constants made negligible by

\frac{1}{2 N_{k}^{2}}

.

\begin{matrix} lim_{k \to \infty} \frac{R (n_{k}, m_{k})}{N_{k}^{2}} = & lim_{k \to \infty} \frac{1}{2 N_{k}^{2}} (\frac{m_{k}}{2}) (\frac{m_{k}}{2} + 1) + \frac{1}{2 N_{k}^{2}} (\frac{m_{k}}{2}) (\frac{m_{k}}{2} + 1) + \frac{1}{2 N_{k}^{2}} (n_{k}) (n_{k} - 1) - \frac{1}{2 N_{k}^{2}} (\frac{m_{k}}{2}) & (Drop ceiling function) \\ = & lim_{k \to \infty} \frac{m_{k}^{2}}{8 N_{k}^{2}} + \frac{m_{k}^{2}}{8 N_{k}^{2}} + \frac{n_{k}^{2}}{2 N_{k}^{2}} & (Drop 1 ’ s and last term) \\ = & lim_{k \to \infty} \frac{1}{4} {(\frac{m_{k}}{N_{k}})}^{2} + \frac{1}{2} {(\frac{n_{k}}{N_{k}})}^{2} \\ = & \frac{1}{4} \frac{p^{2}}{{(1 + p)}^{2}} + \frac{1}{2 {(1 + p)}^{2}} & (\frac{m_{k}}{n_{k}} \to p, and N_{k} = n_{k} + m_{k}) . \end{matrix}

Now we will consider the other half, and we have

\begin{matrix} \frac{1}{N_{k}^{2}} (n_{k} - 1) (\frac{m_{k}}{2} - | \frac{m_{k}}{2} - m_{k} x + 1 |) = & \frac{(n_{k} - 1) (m_{k})}{N_{k}^{2}} (\frac{1}{2} - | \frac{1}{2} - x + \frac{1}{m_{k}} |) \end{matrix}

By letting

k \to \infty

, the right function clearly uniformly converges to

\frac{p}{{(1 + p)}^{2}} (\frac{1}{2} - | \frac{1}{2} - x |)

on

[0, 1]

. Finally, the sum of functions

f_{k} (x)

uniformly converges to

\begin{matrix} f (x) = \frac{1}{4} \frac{p^{2}}{{(1 + p)}^{2}} + \frac{1}{2 {(1 + p)}^{2}} + \frac{p}{{(1 + p)}^{2}} (\frac{1}{2} - | \frac{1}{2} - x |) \end{matrix}

on

[0, 1]

. □

References

Handa, K. Bipartite graphs with balanced (a, b)-partitions. Ars Combin. 1999, 51, 113–119. [Google Scholar]
Ramanathan, N.; Ramirez, E.; Suzuki-Burke, D.; Narayan, D. Closeness Centrality in Asymmetric Graphs. Theory Appl. Graphs, 2024; in press. [Google Scholar]
Bavelas, A. Communication Patterns in Task-Oriented Groups. J. Acoust. Soc. Am. 1950, 22, 725–730. [Google Scholar] [CrossRef]
Zhang, J.; Luo, Y. Degree Centrality, Betweenness Centrality, and Closeness Centrality in Social Network. In Proceedings of the Advances in Intelligent Systems Research, 2nd International Conference on Modelling, Simulation and Applied Mathematics (MSAM 2017), Bangkok, Thailand, 26–27 March 2017; Volume 132, pp. 300–303. [Google Scholar]
Yan, E.; Ding, Y. Applying centrality measures to impact analysis: A coauthorship network analysis. J. Am. Soc. Inf. Sci. Technol. 2009, 60, 2107–2118. [Google Scholar] [CrossRef]
Barrat, A.; Weigt, M. On the properties of small-world network models. Eur. Phys. J. B 2000, 13, 547–560. [Google Scholar] [CrossRef]
Ek, B.; VerSchneider, C.; Narayan, D. Efficiency of star-like graphs and the Atlanta subway network. Physica A 2013, 392, 5481–5489. [Google Scholar] [CrossRef]
Britz, T.; Hu, X.; Islam, A.; Tang, H. Bounds on the Closeness Centrality of a Graph. Bull. Malays. Math. Sci. Soc. 2025, 48, 1–15. [Google Scholar] [CrossRef]
Disjkstra, E. A note on two problems in connexion with graphs. Numer. Math. 1959, 1, 269–271. [Google Scholar] [CrossRef]
Doyle, J.; Graver, J. Mean distance in a graph. Discret. Math. 1977, 17, 147–154. [Google Scholar] [CrossRef]
Fink, J.; Lužar, B.; Škrekovski, R. Some remarks on inverse Wiener index problem. Discret. Appl. Math. 2012, 160, 1851–1858. [Google Scholar] [CrossRef]
Xu, K.; Das, K. On Harary index of graphs. Discret. Appl. Math. 2011, 159, 1631–1640. [Google Scholar] [CrossRef]

Figure 1. Flattening a tree: Changes to the

S D

values of the moved vertices perfectly cancel out.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Asymptotics of Closeness Centralities of Graphs

Abstract

1. Introduction

2. Asymptotics of Closeness Centralities of a Path

Lower Bound for Closeness Centralities

3. Asymptotics

3.1. Union of a Path and Complete Graph

3.2. Mean Distance

4. Balloon Graphs

Generalized Balloon Graph Asymptotic

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Article Metrics

Citations

Article Access Statistics