Abstract
Given a connected graph G with n vertices, the distance between two vertices is the number of edges in a shortest path connecting them. The sum of the distances in a graph G from a vertex v to all other vertices is denoted by . The closeness centrality of a vertex in a graph was defined by Bavelas to be and the closeness centrality of G is . We consider the asymptotic limit of as the number of vertices tends to infinity and provide an elegant and insightful proof of a 2025 result by Britz, Hu, Islam, and Tang, , using uniform convergence and Riemann sums. We applied the same technique for the union of a cycle and path and the union of a path and a complete graph. We prove that of all graphs, paths have the minimum closeness centrality. Next we show for any , there exists a sequence of graphs such that . In addition, we investigate the mean distance of a graph, and the normalized closeness centrality, . We verify a conjecture of Britz, Hu, Islam, and Tang that the set of products is dense in .
MSC:
05C12; 05C09
1. Introduction
Given a graph G, the distance between two vertices is the number of edges in a shortest path connecting them. We consider the sum of the distances from a given vertex to all other vertices. The sum of the distances in a graph G from a vertex v will be denoted by or simply when the graph G is clear. This measure is equivalent to the transmission of a vertex introduced by Handa []. As noted in [], motivation for this metric comes from a variation of the famous Traveling Salesman Problem where the salesman must return to the starting point v after each delivery. The length of the salesman’s tour will be . This is related to the closeness centrality of a vertex or a graph, defined by Bavelas in 1950 []. The closeness centrality of a vertex v in a graph is . Then .
Closeness centrality has been used to identify important individuals in social networks [] and has been used to analyze the impact on coauthorship networks []. Asymptotic behavior for graph centrality properties have been studied for decades. A practical application of this asymptotic analysis is to identify trends that emerge as networks expand over time. In 2000, Barrat and Weigt [] investigated asymptotic properties of small-world network models. In 2013 Ek, VerSchneider, and Narayan [] determined the asymptotic behavior of the global efficiency of a path. In this paper we investigate the asymptotic behavior of .
We present a new approach for determining the asymptotic behavior of for various families of graphs. We recast the sum of the distances as a Riemann sum and then replace the discrete values with a continuous function. Then we determine the asymptotic value of the closeness centrality by computing the definite integral.
We present an elegant and more powerful proof of a result of Britz, Hu, Islam, and Tang [], showing that . In addition, we resolve a conjecture from their paper, showing that the numbers for all connected graphs G form a dense subset of the interval . We also establish a universal lower bound for and show that this bound is tight when G is a path.
2. Asymptotics of Closeness Centralities of a Path
We provide an alternative proof for in Lemma 2. We start by presenting a combinatorial formula for values for vertices in a path in Lemma 1.
Lemma 1.
Let with vertices . Then .
Proof.
Consider a vertex . The distances from to each of the vertices are , respectively, which sum to . The distances from to each of the vertices is , respectively, which sum to . □
We next restate a lemma of Britz, Hu, Islam, and Tang []. In their proof they used approximations through lower and upper bounds which converge asymptotically as n grows large. We provide an alternative proof using a novel technique. Starting with the formula from Lemma 1 we let . Then we can view as a Riemann sum over the partition . Then after carefully changing this to a sum over a continuous domain we can calculate using a definite integral. We obtain the same stunning result from [] that .
Lemma 2.
Proof.
Let and we have
Define a sequence of functions as where for each . Now define function , and we claim that sequence uniformly converges to g (see Appendix A Lemma A1). Returning to the sum, we have
Let and by uniform convergence there exists such that for all , we have for each . Let and we have
Thus . Note that the limit on the right is of the right-hand Riemann sum of the function g over the interval with sequence of partitions . Since g is Riemann integrable, we have that
□
Lower Bound for Closeness Centralities
We will show that the lower bound for the closeness centrality of a graph is , where n is the number of vertices. We begin with trees. The technique will be as follows. We start with a tree with a vertex v of degree at least three. Then we remove one of the branches and append it to one of the other branches. Informally, we are ’flattening’ a tree, making it more like a path. This is illustrated in Figure 1.
Figure 1.
Flattening a tree: Changes to the values of the moved vertices perfectly cancel out.
Surprisingly the negative values perfectly cancel out. To see this consider the change in and values for the vertices and . We note that the only distances that may change involve the vertices .
For , we have and . For , we have and .
Hence the negative changes in values where cancel out with the postive changes in values where .
For the remaining vertices the change in values will be positive. Hence sum of the closeness centrality values will decrease.
Lemma 3.
Let T be a tree with a vertex of degree at least three with two pendant paths and . Let and and let and . Let W be a tree where and . Then .
Proof.
We will proceed with . Then when , and when . For W, we have for .
We examine the differences between the values of the same vertices between trees T and W. We consider two cases, first where the vertices are on the path with S vertices and then where the vertices are on the path with T vertices. We consider two cases.
- Case 1.
- Let
- Subcase 1 (a): When s is even:
- Subcase 1 (b): When s is odd:
- and when , .
- Case 2.
- Let
. We note that for all vertices that . □
We note that the differences perfectly cancel out with −. However we need to show that the reciprocals of the values overcompensate for . To do this we apply a basic result from number theory that we restate in our next lemma.
Lemma 4.
For positive integers , and k, .
Proof.
□
Lemma 5.
For any tree with n vertices, .
Proof.
If is a path, then we are done. If not we combine two pendant paths into a single path using Lemma 3. Iterating this process will result in a path with a lower closeness centrality than the original tree. □
Theorem 1.
For any graph G with n vertices, .
Proof.
Given a graph G a minimum distance spanning tree can be obtained using Dijkstra’s algorithm []. Here for all pairs of vertices . Hence for all pairs of vertices . This implies . Combining this with Lemmas 3 and 5 we have . □
3. Asymptotics
We will generalize the method used to prove Lemma 2 by replacing the sum of values with a Riemann sum. We do this by extending the SD functions to a sequence of continuous functions that uniformly converge, and then compute an integral of the limit. We next provide tools from analysis, which will be useful in obtaining the results in this section.
Lemma 6.
Let be a closed interval, and suppose that the sequence of functions uniformly converges to the continuous function on I. If f is either strictly negative or positive, then there exists such that is also strictly negative or positive accordingly for all .
Proof.
Without loss of generality suppose that f is strictly positive and by the EVT, it attains a minimum value of . By uniform convergence, there exists such that for all we have
as desired. □
Lemma 7.
Let be a closed interval and suppose that is a sequence of functions that uniformly converge to a continuous function on I. If and for all and , then . In addition. the function is continuous.
Proof.
Since f has no roots in I, by the IVT, it must either be positive or negative on I. Without loss of generality, suppose that f is positive on I and by the EVT, f attains a minimum at . Let and by uniform convergence, there exists such that for all
for all .
Let and by uniform convergence there exists such that for all , . Suppose and we have
for all . Therefore uniformly converges to on I. As a composition of continuous functions and , continuity of follows immediately. □
Lemma 6 can be used to weaken the conditions of the above Lemma so that for some , while using the restriction . For the purposes of this paper, it suffices to assume without loss of generality.
Lemma 8.
Let be a closed interval, and suppose that the sequence of functions uniformly converges to the continuous function on I. Suppose that is a family of sequences indexed by .
If either converge, then the limits coincide.
Proof.
Let and there exists such that for all , . Suppose and we have
Thus as desired. □
In the special case that the sequences are partitions of the unit interval, we can compute the limit of Riemann sums by replacing with the limit f. Since is continuous, and therefore Riemann integrable, the sum reduces to the integral
By the above Lemma, this integral is the value of .
3.1. Union of a Path and Complete Graph
Let be non-negative integers, and will denote a path with n vertices joined to a vertex of a complete graph with m vertices. The vertices of the path will be labeled through where is the junction vertex, and the vertices in the complete graph are labeled through .
We next determine the values of considering three different cases:
- Both vertices are on the path:If , then
- Both distinct vertices are on the complete graph:If , then
- One vertex is in the path and the other is in the complete subgraph.Without loss of generality, suppose that and , then the shortest path from to is obtained by first traveling from to and then from to . ThusNow suppose that and we have
If , then
Theorem 2.
Let p be a positive real number and suppose that and are strictly increasing sequences of positive integers such that . Let , then
Proof.
We will first show the centrality result and let and note that has vertices. Thus
Define sequence of functions where
and we will show that
The functions were obtained by reparameterizing so that , for . This strategy will be used again for the next family of graphs. Now let , , and and we have 3 cases.
- :We haveThis is true for all and recall that and . Thus and it is also clear that the right term of the last inequality goes to 0 uniformly for all Then
- The proof is similar to the first case.
It can be shown that and since , the last inequality goes to 0 uniformly for all . The sum of the sequences uniformly converges to the sum of the limits . Note that if and only if , which has discriminant . Then for all , has no real roots. Now suppose , and since , it follows
The continuous function has no nonnegative roots for and by Lemma 6, there exists such that has no roots for all . Without loss of generality suppose , and by Lemma 7, sequence uniformly converges to continuous function on . Let be a sequence of partitions of , and we have
Furthemore
It is clear that converges to so that . Thus
as desired. As for the sum of distances, we have
The left-hand side of the last expression is the Riemann sum of f over the interval obtained by Lemma 8, and the right-hand limit can be easily computed knowing that and . Thus
□
The function , defined as , will be called the shooting star centrality. Now we have the following result.
Corollary 1.
For every , there exists a sequence of graphs such that
Proof.
Suppose , and note that
Thus there exists such that and by the IVT, there exists such that . Let be a sequence of convergents of p with strictly growing numerator and denominator. Since , then . If , then by Lemma 2. □
As a consequence of Theorem 1, we have . This provides the following result complementary to the above Corollary.
Theorem 3.
Let denote the set of all graph centralities. Then is not dense in .
Proof.
Let be an irrational number and suppose that is dense in . Then α is a limit point of A and there exists a sequence of finite connected graphs such that . If is unbounded, then we can extract a subsequence of graphs with increasing order such that . However, by the above inequality,
which is a contradiction. Thus is bounded, and let be the size of the largest graph. Let denote the finite set of distinct connected graphs of vertices up to isomorphism. Then we have , and each term has upmost of the same choices. Therefore, the sequence has finitely many distinct terms which are all rational. But since the sequence converges, there exists such that all are equal for and the limit α is rational, which is a contradiction. Thus α is not a limit point of so that it cannot be dense in as desired. □
Let denote the set of graph closeness centralities greater than . Corollary 1 shows that is a dense subset of . A related result can be obtained by considering the normalized centrality and mean distance of a connected graph G with n vertices as defined by Britz, Hu, Islam, and Tang []:
They showed that for all finite connected graphs G and conjectured the set of all such values, is dense in . Let and define integer sequences , the same way as in the above theorem. Then for the sequence of star graphs we have
The function is continuous and note that . Furthermore, it can be shown that (see Appendix A Lemma A5), which converges from below, and we have the following result by applying the IVT as in Corollary 1.
Corollary 2.
The set is dense in
3.2. Mean Distance
Let G be a connected graph of n vertices. Doyle and Graver, ref. [] defined the mean distance of G as
and presented that is a tight upper bound for the mean distance . It follows that is a tight upper bound for , and we will define the normalized mean distance as . It is clear that
for all finitely connected graphs G and let denote the set of all such values.
Theorem 4.
The set is dense in
Proof.
Let and let be strictly increasing sequence of positive integers whose ratio converges to p. Then by Theorem 2,
The function is continuous and so that by a similar application of the IVT as in Corollary 1, the desired result follows. □
4. Balloon Graphs
Let denote a balloon graph consisting of a cycle attached to a path , each containing n vertices. The junction vertex of the cycle and path will be labeled . Traveling clockwise around the cycle, its vertices will be labeled where and are the same vertex. Starting at , the vertices of the path will be labeled from left to right as .
Now we have the following theorem:
Theorem 5.
Proof.
First we will find the shortest distance between vertices and we have 3 cases:
- If both vertices are on the cycle, then the shortest distance is given by the minor arc between them. Without loss of generality, suppose that and the shortest path is either travel clockwise from to , giving a distance of . Or we travel counterclockwise by going from to and then jumping from to . From here we go from to giving a total distance of . Thus in general
- If both vertices are on the path then where
- andIf the vertices are on different components of the graph, then it suffices to consider when and . The shortest path is given by first going from to and then from to givingNow suppose that and we have
When expanding we need to replace j with since . If then
Define a sequence of functions as
Note that for , and we will show that where . Now we have
where . It can be shown that and it is clear that . Their difference uniformly converges to the difference of the limits, which is .
Now define new sequence of functions as
Note that can be expressed as the sum of 4 functions defined as
It can be shown that each function converges to the following over :
The sum uniformly converges to the function
Note that continuous functions f and g have no roots on and , respectively, so that by Lemma 7, and , which are both continuous. Define a sequence of partitions , and note that for vertices on the path component of . Similarly, define a sequence of partitions and we have:
□
Generalized Balloon Graph Asymptotic
Let denote a graph consisting of a path joined with a cycle at a single vertex. Starting at the junction vertex , the cycle will be labeled through traveling clockwise. Meanwhile, the path will be labeled through where is the junction vertex.
This class of graphs will be referred to as balloon graphs, and the values are as follows.
Lemma 9.
Let denote a balloon graph and suppose is a vertex, then
where and
Proof.
We will calculate the values by separately considering the cycle and path components and adjusting for path going though the junction vertex. Suppose , i.e, is on the cycle component. Then
By symmetry of the cycle, , and note that the shortest distance between two vertices along the cycle is given by the minor arc between them, which has length . Thus
Now we have
If , then
□
Theorem 6.
Let and suppose that are strictly increasing sequences of positive integers such that . Then
Proof.
Let and the closeness centrality of is
and we will consider the right and left-hand sums separately. Define a sequence of functions as
It can be shown that
(See Appendix A Lemma A6) on . This sequence was chosen such that for . Since for all , it follows is strictly positive, in addition to being continuous. By Lemma 7, sequence uniformly converges to continuous function , and we have
Now we will consider . Define new sequence of functions as
Note that for and using a very similar process as in Lemma A6, it can be shown that
Furthermore, has a root iff does as well, which has a negative discriminant . Therefore has no roots in and by Lemma 7, sequence of functions uniformly converges to continuous function on Now we have
Thus,
(see Appendix A Lemmas A2 and A3).
As for the mean sum of distance, we have
□
Theorem 7.
The set is dense in , where
Proof.
It can be shown that and (see Appendix A Lemma A4), which corresponds to the balloon graph having relatively negligible vertices on the path component compared to the cycle so that it overall behaves like the latter. The limit of the products is 1 and , which is attributed to the path graph. By continuity of , for each there exists such that . Choose strictly increasing sequence of positive integers such that and we have
Thus, every point of is a limit point of , which is then dense in . By applying Corollary 2, we obtain the entire result. □
5. Conclusions
We verify a conjecture of Britz, Hu, Islam, and Tang that the set of products is dense in . The use of Riemann sums provides a method that is computationally advantageous for precisely determining asymptotics of closeness centralities. The key benefit is the use of integration for calcuating asymptotic behavior of complicated combinatorial forumulae. It would be interesting to see how the methods in this paper can be used for other families of graphs. The density results are connected to the inverse Wiener index problem [], where one starts with a prescribed sum of distances over all pairs of vertices and seeks a tree with this index. The following problem presents a problem that could provide a bridge between our work and the inverse Wiener index problem. There is also a potential index to the Harary index [].
Problem 1.
Determine where G is a tree.
The next problem is a natural extension of the first problem.
Problem 2.
Determine where G is a unicyclic graph.
We have started developing quasi-simple curve theory to demonstrate how closeness centrality can be calculated with just calculus. This would provide a segue to a natural definition of closeness centrality for piecewise smooth curves.
Author Contributions
Conceptualization, S.F., A.G.S., B.R. and D.A.N.; Methodology, A.G.S., B.R. and D.A.N.; Validation, A.G.S. and B.R.; Formal analysis, S.F., A.G.S., B.R. and D.A.N.; Investigation, S.F., A.G.S. and B.R.; Writing—original draft, A.G.S.; Writing—review & editing, B.R. and D.A.N.; Supervision, D.A.N.; Project administration, D.A.N.; Funding acquisition, D.A.N. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by a National Science Foundation Research Experiences for Undergraduates Grant #2243938.
Data Availability Statement
No new data were created or analyzed in this study.
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix A
For the sake of completeness, we include details for some of the computations here.
Lemma A1.
Define sequence of functions , where . Then uniformly converges to on .
Proof.
We will first show that uniformly converges to . Let , and we have
and it is clear that . The strictly positive function has a minimum value of on , and by uniform convergence, there exists such that for all and . Let , and there exists such that for all . Let , and we have
for all . □
Lemma A2.
, where p is a positive real number.
Proof.
Let and , and we have
We will do two u-substitutions by letting and in the first and second integrals, respectively. Doing this shows that both integrals have the same value, and by doubling the second one after the u-substitution, we get
□
Lemma A3.
Let p be a positive real number. Then, .
Proof.
By completing the square, we get . Let , , and to get
Since , the value is positive, and using the identity , we have
□
Lemma A4.
Let , then .
Proof.
It suffices to show that the statement holds for any sequence of positive real numbers with limit ∞. We will consider the integrals separately and define a sequence of functions as
It is clear that , and by Lemma 7, sequence of functions uniformly converges to 4 on . To evaluate , we can interchange the limit and integral to obtain
As for the other integral, suppose that , and implies
The right integral clearly has limit 0, and by the squeeze theorem, so does the middle sequence. □
Lemma A5.
Let , then .
Proof.
We have
The right term has limit 2, and we will now focus on the integral. Without loss of generality, suppose that , and since , we have
By the sandwich theorem, the middle integral goes to 0 as . □
Lemma A6.
Define and sequence of functions as in Theorem . Then on .
Proof.
Recall that
where . We will first focus on this term. Expanding the binomial coefficients gives
Since the difference is upmost 1, this error will be made negligible due to the term. Therefore, we can drop the ceiling function without changing the value of the limit. We can go further and remove all terms and constants made negligible by .
Now we will consider the other half, and we have
By letting , the right function clearly uniformly converges to on . Finally, the sum of functions uniformly converges to
on . □
References
- Handa, K. Bipartite graphs with balanced (a, b)-partitions. Ars Combin. 1999, 51, 113–119. [Google Scholar]
- Ramanathan, N.; Ramirez, E.; Suzuki-Burke, D.; Narayan, D. Closeness Centrality in Asymmetric Graphs. Theory Appl. Graphs, 2024; in press. [Google Scholar]
- Bavelas, A. Communication Patterns in Task-Oriented Groups. J. Acoust. Soc. Am. 1950, 22, 725–730. [Google Scholar] [CrossRef]
- Zhang, J.; Luo, Y. Degree Centrality, Betweenness Centrality, and Closeness Centrality in Social Network. In Proceedings of the Advances in Intelligent Systems Research, 2nd International Conference on Modelling, Simulation and Applied Mathematics (MSAM 2017), Bangkok, Thailand, 26–27 March 2017; Volume 132, pp. 300–303. [Google Scholar]
- Yan, E.; Ding, Y. Applying centrality measures to impact analysis: A coauthorship network analysis. J. Am. Soc. Inf. Sci. Technol. 2009, 60, 2107–2118. [Google Scholar] [CrossRef]
- Barrat, A.; Weigt, M. On the properties of small-world network models. Eur. Phys. J. B 2000, 13, 547–560. [Google Scholar] [CrossRef]
- Ek, B.; VerSchneider, C.; Narayan, D. Efficiency of star-like graphs and the Atlanta subway network. Physica A 2013, 392, 5481–5489. [Google Scholar] [CrossRef]
- Britz, T.; Hu, X.; Islam, A.; Tang, H. Bounds on the Closeness Centrality of a Graph. Bull. Malays. Math. Sci. Soc. 2025, 48, 1–15. [Google Scholar] [CrossRef]
- Disjkstra, E. A note on two problems in connexion with graphs. Numer. Math. 1959, 1, 269–271. [Google Scholar] [CrossRef]
- Doyle, J.; Graver, J. Mean distance in a graph. Discret. Math. 1977, 17, 147–154. [Google Scholar] [CrossRef]
- Fink, J.; Lužar, B.; Škrekovski, R. Some remarks on inverse Wiener index problem. Discret. Appl. Math. 2012, 160, 1851–1858. [Google Scholar] [CrossRef]
- Xu, K.; Das, K. On Harary index of graphs. Discret. Appl. Math. 2011, 159, 1631–1640. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).