Abstract
Canonical ensembles of walks in a finite connected graph assign the properly normalized probability distributions to all nodes, subgraphs, and nodal subsets of the graph at all time and connectivity scales of the diffusion process. The probabilistic description of graphs allows for introducing the quantitative measures of navigability through the graph, walkability of individual paths, and mutual perspicacity of the different modes of the (diffusion) processes. The application of information theory methods to problems about graphs, in contrast to geometric, combinatoric, algorithmic, and algebraic approaches, can be called information graph theory. As it involves evaluating communication efficiency between individual systems’ units at different time and connectivity scales, information graph theory is in demand for a wide range of applications, such as designing network-on-chip architecture and engineering urban morphology within the concept of the smart city.
Keywords:
statistical ensembles of walks; information graph theory; navigability, walkability, and perspicacity in networking systems MSC:
05Cxx; 05C82; 90Bxx; 91D30
1. Introduction to Information Graph Theory
Real-world networking systems comprising functional units that interact with each other at different and disparate temporal and spatial scales are often metaphorically represented by finite connected undirected graphs [1]. In graph models, the vertices embody the system functional units, and the edges sketch the various interactions between these units. Graph models are useful for computer networking, sociology, urban morphology, smart city networks, chemistry, engineering, communication, logistics and management, security applications, etc., as they help us to answer the fundamental question about relations between the local and global properties of a networking system.
In our work, we assess the quality of these relations by using some information quantities at different graph connectivity scales, ranging from the connectivity between the nearest neighbors to the connectivity with respect to infinitely long paths available in the graph. The proposed information approach to the description of graph structures (which can be called information graph theory) may be especially useful for the assessment of communication efficiency between individual systems’ units at different time and connectivity scales.
One possible application of graph information theory is the design of network-on-chip (NoC), or network-based communications subsystems on an integrated circuit, most typically between semiconductor intellectual property cores schematizing various functions of the computer system in a system on a single chip [2]. (We profoundly thank our reviewer for pointing us at the possible NoC-related application of graph information theory.) NoCs provide the advantage of customized architecture, increased scalability, and bandwidth and come in many network topologies, many of which are still experimental [3].
The interconnections between multiple cores on a chip present a considerable influence on the performance and communication of the chip design regarding the throughput, end-to-end delay, and packet loss ratio. Based on the huge amount of supported heterogeneous cores, efficient communication between the associated processors has to be considered at all levels of system design to ensure global interconnection [4]. Although hierarchical architectures have addressed the majority of the associated challenges of the traditional interconnection techniques, the main limiting factor is scalability, and NoC has proven to be a promising solution. As communicating nodes require a routing algorithm for successful transmission of packets, it is important to design an NoC routing algorithm capable of providing less congested paths, better energy efficiency, and high scalability [5]. It is important to mention that the same choices of routing algorithms, when considered for more than one performance parameter of the network, do not yield expected results, since different applications exhibit different network performances for the same routing technique [6]. Fault-tolerant mapping and routing techniques at different levels of an NoC were surveyed in [7]. In [8], a method for choosing hierarchical coordinates and a greedy forwarding algorithm for path-finding were presented. Nevertheless, reliability is becoming a major concern in NoC design [7]. In particular, the accurate predictions of network packet latency, contention delay, and the static and dynamic energy consumption of the network are in demand [9].
Another evident application field of graph information theory is the study of urban morphology, i.e., urban forms and structures affecting sustainability and urban growth. Along with the spreading of new technologies and the availability of big data, cities are increasingly viewed not simply as places in space but as systems of networks and flows [10]. This became particularly evident in the concept of a smart city, as the synergy of architecture, technology, and the Internet of Things which enables the collection and data exchange of objects embedded with electronics, software, sensors, and network connectivity [11].
All above-mentioned research problems involve a variety of information quantities resolving uncertainty about evolution of some diffusion processes at different connectivity scales in a graph, and therefore the entire approach to graphs summarized in the present paper may be called information graph theory.
Thinking about transportation in networks—imagine the weekend pedestrian flow in a city downtown area—we may notice that some streets are more preferred by walkers than others. While some urban quarters are visited quite often, others seem to be excluded from the rest of urban fabric, as though being barricaded by invisible gates, literally turning the deserted city block into a ghetto. A widespread belief in the influence of the spatial organization of the built environment on the way people move through urban spaces and meet others by chance was common in architectural and urban thinking for centuries [12,13,14]. The dynamics of poverty areas in London studied over the past century in [15] has shown that the creation of poverty areas is ultimately a spatial process relating to spatial segregation and poverty. In order to quantify isolation by identifying the most and least probable paths through the areas with respect to a certain exploration strategy, in the present paper we put forward a concept of walkability.
Navigation focuses on locating a walker’s position with respect to known locations, traversed paths, and structural patterns in the environment [16]. In the probabilistic setting, navigability would represent a share of predictable information about the future location of a traveler given their walking history and the present location. The predictable information about the future location of a walker is obtained with the use of two major navigation strategies, known as path integration (which allows for keeping track of the position and heading while exploring a new space) and landmark-based piloting (re-calculating position of a traveler while in a familiar environment), working in concert during navigation in humans and animals [16].
Finally, as such a circumstance appears quite often in the context of science and technology, we may pose the following question: How long should one observe a networking system for the information collected to be enough to draw a reliable conclusion about its evolution in the long run? Of course, an answer to the above-posed question ultimately depends on the structural complexity of a system (that can often be represented by a graph), the dynamics of the processes in question, as well as on the chosen graph exploration strategy—all are difficult to quantify, in general. In the present paper, we call such a deep, penetrating discernment of the system’s nature and evolution perspicacity and discuss and quantify it in the framework of a proposed model.
In our work, we put forward a toy model, in which the questions of interest may receive quantitative answers. The role of a complex system will be played by a finite connected undirected graph providing a scene for isotropic and a variety of anisotropic diffusions, i.e., different types of random walks, forming the canonical statistical ensembles of walks defined in graphs (see Section 3). Uncertainty about the current state in Markov chains contains predictable (apparent uncertainty) and unpredictable (true uncertainty) information components: the conditional mutual information quantities among the past, present, and future states of the chain (see Section 2). The amount of predictable information can be viewed as a divergence between the ensemble average (defined with respect to the stationary distribution of random walks) and the time-average of walks describing the probability of observing a walker in a unit time of random walks. Consequently, the amount of uncertainty about the future location of random walkers in the graph that can be resolved by using the standard navigation strategies, such as path integration (based on the entire history of travels) and landmark-based piloting (based on the present location of the walker), can be viewed as a navigability potential of a node in the given graph structure with respect to the chosen exploration strategy (the type of diffusion process) (see Section 4). We call the opposite potential characterizing the amount of true uncertainty about the walker’s location in the graph strive, which is quantified as the expected exploration time of a node’s neighborhood by the given type of random walk (Section 4). Information flows associated with the diffusion processes in graphs (see Section 3 and Section 5) can be viewed as the stochastic point processes, called determinantal process or fermionic processes [17], because the corresponding densities are the squared wedge-products (determinants) of eigenvectors of the symmetrized transition matrices. We use the properties of the determinantal processes to assess perspicacity and walkability in graphs (see Section 3 and Section 5). The perfect perspicacity is possible only for the processes that enjoy the same type of symmetry. In Section 6, we discuss an example of the walkability analysis in urban neighborhoods and, eventually, conclude on a discussion of graph information theory and its applications in the last section.
2. Predictable and Unpredictable Information in Markov Chains
The resolution of uncertainty about future events is known as information [18]. For example, each coin flip reveals a single bit of information. The side repeating probability in the coin tossing sequence does not affect the amount of information released at every flip, but rather determines a share of predictable and unpredictable information components in every released bit of information [19,20,21,22]. Indeed, the forthcoming state of the Markov chain describing tossing of a biased coin is completely predictable for when coin tossing generates the stationary sequences heads, heads, heads,… or tails, tails, tails, …, as well as for when an alternating sequence of states arises. However, the side of the coin that will come up on the next toss is utterly unpredictable if .
For a Markov chain , defined over a finite set of N states by a (stationary) irreducible, row-stochastic transition probability matrix , the expected amount of uncertainty about a state over infinitely long sequences is given by the Boltzmann–Gibbs–Shannon entropy [20,22,23,24],
where is a stationary distribution of states over the chain, i.e., the left eigenvector of the Markov chain transition matrix, belonging to the major eigenvalue 1. In all information-related calculations, we shall assume that As the information shared between the past and future in Markov chains is transmitted only through the present, the time average of the conditional entropy of the present state conditioned on all past states simplifies to the entropy rate [19,22,25,26], viz.,
Then, the following formal decomposition involving one-step conditional entropies [19,20,21,22],
represents the state’s uncertainty as a sum of the following three information components: (i) , expressing the amount of information about the future state that can be inferred from the entire past history of the chain, excepting for its present state; (ii) , the mutual information between the future and present states of the chain, expressing the amount of information about the future state that can be inferred from the present state, independently of the past; and (iii) , expressing the true uncertainty about the present state of the chain that cannot be resolved by observing the past history and that does not have repercussions for the future evolution of the chain. Assuming that all past states and the present state of the chain are known, we conclude that the first two quantities contribute to the predictable information component of the chain, viz.,
while the last term quantifies the ephemeral, unpredictable information component, viz.,
which exists at the present moment only and then disappears without consequences [19,20].
The information diagram for a Markov chain is shown in Figure 1. In the context of Markov chains, we may interpret or explain a future event either as a consequence of history, or as an immediate outcome of the present event. However, for , some amount of uncertainty remains unresolved before the event occurs, and such a surprise cannot be avoided, eliminated, or controlled. Given the Markov chain transition matrix T, the entropy excess [19,20,21,22,27,28,29] reads as follows:
Figure 1.
Information diagram for a Markov chain. The amount of uncertainty about the future state of the chain can be resolved partially by observing its past states (at an amount of ), as well as from the present state alone (quantified by ). True uncertainty about the state () exists at the present moment only and then disappears without consequences. All three components sum to the entropy , quantifying uncertainty of a state over the infinitely long sequences generated by the Markov chain.
The conditional mutual information between the present and the future states of the chain conditioned on the past state takes the following form [19,20,21,22]:
The conditional mutual information (7) quantifies a consequence of the present state for the future evolution of the Markov chain. Finally, the amount of true uncertainty existing only at the present moment of time is the entropy residue [19,20,21,22], viz.,
For example, let us consider three games of chance shown in Figure 2a–c having and 4 states, respectively. The probability of repeating a state in each game is taken as . Other transitions are chosen to be equiprobable. For entropy calculations in this section, we use the N-based logarithm , so that the information and entropy quantities for these games of chance are measured in bits, trits, and quadrits, respectively. Namely, a single bit of information is released at every coin flip (see Figure 2a); one trit of information is produced at every round of the rock–paper–scissors game (see Figure 2b); and one quadrit of information is brought about at every round of the card suit game of four states (see Figure 2c). The use of different information units allows for comparing the structure of information components for all three games in a single frame (as shown in Figure 3). Otherwise, the graphs would scale up by and if we used bits as a major information unit. The detailed information analysis of a two-state Markov chain shown in Figure 2a has been developed by us in [20], and in [30] we discussed information components in a three-state chain (Figure 2b). The four-state chain shown in Figure 2c has never before been discussed.
Figure 2.
Three games of chance, with , 3, and 4 states, respectively. In each game, the probability of repeating a state is . Other transitions are chosen to be equiprobable. The information components (6–8) for all three games are the functions of a single independent variable p, as shown in Figure 3. (a) The two-state game of chance describing tossing of a biased coin. (b) The three-state game of chance corresponding to the rock–paper–scissors game. (c) The four-state game of chance corresponding to the card game.
Figure 3.
(a) The information components (6–8) as functions of the state repetition probability p. The bold line represents the information quantities for the coin tossing (see Figure 2a). The amount of information that can be predicted by observing the past history of the chain is highlighted in blue. The amount of information that can be guessed from the present state alone is highlighted in yellow. The information components for the rock–paper–scissors game (Figure 2b) are shown as dotted lines. The information components for the card suit game (Figure 2c) are shown as dashed lines. (b) The amounts of predictable information in three games of chance as functions of the state repetition probability p.
The information components (6–8) for all three games shown in Figure 2 are presented in Figure 3 as functions of the state repetition probability p. In Figure 3a, the predictable information components in a coin tossing game (see Figure 2a) are indicated by the solid line, and the graph regions representing the predictable information components and are highlighted in blue and yellow, respectively.
The future outcome of coin tossing is reliably predictable for or when the game generates the stationary alternating or constant sequences of states, respectively. In this case, the entire uncertainty of a state equals the entropy excess (blue in Figure 3a), viz., 1 bit As or , the reliability of predictions based on the past history of the chain decays; however, the future outcome may then be guessed from the present state of the coin alone, either by alternating the present side of the coin at the next tossing (for ), or by repeating it (for ). The corresponding amount of information, quantifying the degree of reliability of such a guess, is highlighted by yellow in Figure 3a. When , there is a destructive interference between two incompatible hypotheses about the future outcome of coin tossing that causes the gradual attenuation and cancellation of mutual information, , and the coin becomes fair at . At this moment, the entire uncertainty of a state equals the amount of true uncertainty quantified by [20]. Summing the predictable information components together, we obtain the amount of predictable information about the future outcome of coin tossing as a function of p, shown in Figure 3b as a solid line. The coin tossing game (see Figure 2a) is predictable at and , but unpredictable at when the coin is fair.
The information components for other games of chance (Figure 2b,c) are presented in Figure 3a,b by the dotted and dashed lines. The rock–paper–scissors game (Figure 2b) becomes fair at , and the relevant information components are shown as dotted lines. The model of a three-sided coin (susceptible–infected–immune) was recently used for assessing pandemic uncertainty on conditions of vaccination and self-isolation [30]. The card suit game (Figure 2c) is fair when , as shown by the dashed lines in Figure 3a,b. In contrast to the simple coin flipping game, other games of chance, having more possible states, are not completely predictable for , as they retain a certain degree of uncertainty of symbols in the alternating sequences of chain states.
3. Symmetries and Statistical Ensembles of Walks in Finite Connected Undirected Graphs
A finite connected undirected graph , in which V, , is a set of vertices and is a set of edges, is defined by a symmetric adjacency matrix, , such that , iff , but otherwise. The adjacency matrix allows for the following spectral decomposition: in which is an orthogonal system of eigenvectors belonging to the ordered real eigenvalues , where is simple according to the Perron–-Frobenius theorem [31].
Graph symmetries: Multiple eigenvalues of the adjacency matrix indicate the presence of non-identity automorphisms in the graph G [19,32,33,34,35,36]. A group of a graph’s automorphisms preserving the graph structure is formed by all permutations defined on the set of graph vertices V that commute with the graph adjacency matrix, [19,32,34]. If a non-identical exists, and is an eigenvector of the adjacency matrix, such that then the vector is another eigenvector of A belonging to the same eigenvalue , as . An automorphism associated with some eigenvalue of multiplicity is represented by a low-dimensional real orthogonal submatrix corresponding to the related block of a permutation matrix. As the eigenvectors of a symmetric adjacency matrix are real and mutually orthogonal, , it also follows that elements of these eigenvectors are mapped to each other under the action of . For example, the major (Perron) eigenvector of the adjacency matrix of a triangle graph belongs to the maximum eigenvalue , while another, double eigenvalue corresponds to the eigenvectors and mapped to each other by a simple 2-permutation.
Microcanonical ensemble of walks: The microcanonical ensemble of walks (MCE) is a collection of n-walks in G taken with the same probability for all walks of the same length n, in which the (Boltzmann constant and) temperature , and the free energy of n–walks is defined by the logarithm of the number of n-walks available in the graph . For very long walks, , , where , and therefore the intensive free energy (per edge absorbed by a very long MCE walk) equals the graph topological entropy [21,22,37], viz.,
Canonical ensemble of walks: In the canonical ensembles of walks (CNE) defined on G, each of walks of length n available from every node are taken with equal probability. As , the CNE walks are defined by the following irreducible row-stochastic transition matrices [19,21,22]:
The CNE walks are the random walks defining the following stationary distributions (centrality measures [21,22]) of nodes:
In particular, the locally isotropic first order random walks, in which a random walker chooses a node to visit with equal probability among all available nearest neighbors, defines the well-known stationary distribution of nodes (degree centrality), viz., [19,21,22]. For , , so that the series of transition matrices converges to the Ruelle–Bowen random walk [38], viz.,
with a stationary distribution [39] that is related to the eigenvector centrality of nodes [40].
Grand canonical ensemble of walks: Finally, the grand canonical ensemble of walks (GCE) describes the statistics of “fluctuations” of the local growth rate of the number of walks available at the node over a “chemical potential” of the equiprobable n-walks in the graph G, viz.,
In the thermodynamic limit of infinite walks , the exponential factor representing a node’s “fugacity” takes the following form [22]:
The limiting distribution of relative fugacity, describing the change of the system of infinitely long walks (free energy of walks) due to adding/removing a node, takes the form of a Fermi—Dirac distribution, viz.,
typical for the distribution of fermions in a single-particle state i of the energy The Fermi—Dirac distribution, previously observed only in a quantum system of non-interacting fermions, appears in the model due to the non-interacting quality of walks in the graph, similar to the Pauli exclusion principle allowing for only two possible microstates for each single-particle level [22].
Determinantal point processes associated with canonical ensembles of walks—Perspicacity: The CNE walks in a connected undirected graph define the determinantal point processes characterized by the probabilities of random currents calculated as the squared determinants of submatrices made up from the elements of eigenvectors of the symmetrized random walk transition matrices. These determinantal point processes provide the properly normalized probability distributions for all nodes, subgraphs, and nodal subsets at all time scales of the corresponding diffusion process [22]. Slater determinants in quantum mechanics [41] were historically the first examples of determinantal point processes that later acquired multiple applications in physics, applied mathematics, and statistics [42,43,44,45]. The symmetrization similarity transformation [19,22] makes the Markov matrix into a symmetric form, , viz.,
In the rest of this section, we omit the index in the symmetrized transition matrix (16) to simplify the notations. The determinantal point processes may be defined for any type of random walk, and the resulting probability distributions indeed vary for different n. The symmetrized transition matrix (16) can be written in a spectral form as follows: , in which is an orthogonal matrix, , with columns being the orthonormal eigenvectors of the symmetrized transition matrix (16), and is a diagonal matrix of real valued, ordered eigenvalues of the transition matrix, . The repeating eigenvalues indicate the presence of symmetries in the diffusion process, and if is an eigenvector of the symmetrized transition matrix (16) belonging to the multiple eigenvalue , then there exists a permutation matrix , such that and is another eigenvector of belonging to the same eigenvalue . In the latter case, the symmetry may be represented by a low-dimensional real orthogonal submatrix in corresponding to the related blocks of a permutation matrix.
As the dynamics of discrete time random walks evolves with powers of the corresponding transition matrix, the eigenvalues for describe the rates of relaxation processes toward a stationary distribution over graph nodes belonging to the maximal eigenvalue . Herewith, the positive eigenvalues , , describe the exponential decay of the relaxation processes (as ), while the negative eigenvalues correspond to damped oscillations, and both fade away within the finite characteristic relaxation times . The characteristic relaxation time of the stationary distribution is taken to be
The k-th order wedge products, , are the determinants of the corresponding k-th order minors of the matrix . The properties of determinants of submatrices composed of orthonormal vectors were discussed in [19,46] in more detail. In particular, the squared determinants of the k-th order minors of the orthogonal matrix define the properly normalized probability distributions over the index set , viz., For example, the squared elements of the major eigenvector of the symmetrized transition matrix, can be considered the squared determinants of the primitive submatrices of size 1, determining the stationary distribution of random walks, viz., The corresponding probability is interpreted as a measure of likeliness to observe a random walker at in infinite observation time . Other eigenvectors, , , also enjoy the proper normalization, , and therefore can be interpreted as the ephemeral probability distributions decaying in the finite characteristic time, .
The squared wedge products, define the ephemeral probability distributions over pairs of nodes (not necessarily corresponding to graph’s edges) fostered by the modes and of the diffusion process, viz.,
, may be interpreted as the joint probability mass functions (correlation functions) describing the probability amplitudes for a random walker to sojourn at the nodes i and j during the characteristic times and . Using the properties of determinants, we may expand the probability mass function into the joint probability of independent observations of the walker at i and j during the characteristic times and , conditional on obviously incompatible events, such as the simultaneous presence of the random walker at two different nodes, viz.,
If and and are the simple eigenvalues, then the determinants in (17) equal zero for any i and j. Otherwise, if and belong to the same multiple eigenvalue, then for i and j involved in permutations expressing the symmetry of the diffusion process. In the latter case, the vectors and are localized at the graph nodes that enjoy the symmetry, while other vector elements equal zero. As the absolute value of a determinant is nothing but the volume of the parallelogram spanned by two vectors, and , for the determinant to be maximal, these vectors should be orthogonal to each other. The vectors and are mutually orthogonal in ; however, many of their elements are zero (as corresponding to the block structure of the symmetry-related permutation matrix), and others are related to each other by permutation, so there may be orthogonal submatrices, for which the determinant (17) is maximal.
To evaluate the amount of mutual information that knowing either of the diffusion process mode described by the eigenvector provides about the other one described by the eigenvector , we introduce the perspicacity matrix as the mutual information between the diffusion modes as follows:
As all nodes’ densities are in a connected graph, the mutual information (19) is a non-negative symmetric matrix having positive elements for every pair of different eigenmodes , with the zero elements on the major diagonal, due to the properties of the determinants. Herewith, the maximal perspicacity (mutual information) in (19) is obtained for the mutually orthogonal eigenvectors and belonging to the same multiple eigenvalues (if there are any) of Markov transition matrices (10), indicating the presence of symmetries in the diffusion processes. These symmetries depend on the chosen CNE walks (10), grounded in the structural symmetry of the graph (see Section 5 for an example).
Similarly, we may define the third- and higher-order correlation functions using the squared determinants of corresponding submatrices of .
5. Perspicacity and Walkability of the Determinantal Point Processes Associated with Canonical Ensembles of Walks
Perspicacity, as defined in Section 3 as the mutual information (19) between the different modes of diffusion process (described by the mutually orthogonal eigenvectors of a symmetrized transition matrix of random walks), quantifies the ability to discern and understand either of the diffusion modes by knowing the other. Although perspicacity depends on the type of random walk implemented for the graph exploration, it attains its maximum value over the diffusion modes (eigenvectors) localized at the same symmetric subgraphs in the graph (if any exist) and belonging to the same repeating eigenvalue of the related Markov transition matrix.
In Figure 6a, we present the perspicacity matrix (19) calculated for the maximally anisotropic Ruelle–Bowen random walks (12) defined on the Kittel graph shown in Figure 6b,c. The Kittel graph contains a symmetric subgraph (of 9 vertices) affecting the diffusion processes that may be defined on that. For instance, the symmetrized transition matrix of Ruelle–Bowen random walks has two eigenvectors, and related to each other by some permutation and belonging to the same double eigenvalue, .
Figure 6.
(a) The perspicacity matrix (19) calculated for the maximally anisotropic Ruelle–Bowen random walks (12) defined on the Kittel graph with 23 vertices and 63 edges (shown in panels (b,c)). The symmetric perspicacity matrix represents mutual information (measured in bits) between the diffusion modes defined by the eigenvectors and . The mutual information (perspicacity) attains its maximum value for the eigenvectors ( and ) belonging to the same double eigenvalue (), indicating the structural symmetry in the graph. (b) The vertices of the Kittel graph are highlighted according to the squared elements by hues of gray (white stands for 0, black stands for 1). The eigenvector is localized at the graph nodes that enjoy the graph symmetry (automorphism) and that are related to the elements of the eigenvector shown in (c) by permutation. (c) The vertices of the Kittel graph are highlighted according to the squared elements by hues of gray (white stands for 0, black stands for 1). The eigenvector is localized at the graph nodes that enjoy the graph symmetry (automorphism) and that are related to the elements of the eigenvector shown in (b) by permutation.
The squared elements of the eigenvectors and are shown in Figure 6b,c. The vertices of the Kittel graph are highlighted by hues of gray (white stands for 0, black stands for 1) according to the squared elements of in Figure 6b and the squared elements of in Figure 6c. The mutual information (19) between the eigenvectors and attains its maximum available value (of 9 bits). Thus, perspicacity, a penetrating discernment of either of the diffusion modes when the other mode is known, is fully possible only within the framework of structural symmetry common to both modes.
In urban studies, walkability is an essential concept of sustainable urban design associated with the ideal of a dense, mixed-used, walkable neighborhood fostering higher levels of creativity, reduced crime, improved health, and civic engagement, in contrast to the dull and deadly cul-de-sacs of car-oriented suburbs [50]. Walkability relies on the complex interdependence between built environment density, mix, and pedestrian access in synergy, which is difficult to quantify and measure for urban neighborhoods. In the present section, we propose a mathematically robust approach to the walkability assessment in built environments (naturally represented by connected undirected graphs). Essentially, our approach is based on the following idea: the most walkable path in a graph is that most likely to be walked (over a relatively long period of observations). The determinantal point process associated with CNE walks can be used to estimate the likelihood that either graph edge is traversed by random walkers. Namely, each and every edge of the graph is assigned the probability (17) to be chosen for a walk during the longest available characteristic times and , viz.,
The standard Dijkstra’s algorithm [51,52] for finding the shortest/longest paths between nodes in a weighted graph can then be used to chart the most/least probable paths between two given nodes. In Figure 7, we have highlighted the most probable (shown by the solid line) and least probable (shown by the dot-dashed line) paths between the given pair of vertices in the Kittel graph.
Figure 7.
The most probable path (shown by the solid line) and the least probable path (shown by the dot-dashed line) between the given pair of vertices in the Kittel graph. All graph nodes are shown in blue for color contrasting.
6. Discussion: On the Walkability Analysis of Urban Neighborhoods
Cities can be considered to be among the largest and most complex artificial networks created by human beings. At the same time, the city is the biggest communication editor that determines the social and economic wellbeing not only of our present generation but also for those generations to come, as it provides an interface for our everyday mutual interactions. Sociologists think that isolation worsens an area’s economic prospects by reducing opportunities for commerce and engenders a sense of isolation in inhabitants, both of which can fuel poverty and crime [19]. The speed and scale of urban growth require urgent global action to help cities prepare for growth and to avoid their being the future epicenters of poverty and human suffering. Physical segregation of minority groups dispersed over the spatially isolated pockets of streets and other inner-city areas being socially barricaded by railways and industries causes their economic marginalization [35].
Together with the “Neurocognition and Action – Biomechanics” research group of the Center for Cognitive Interaction Technology at Bielefeld University (Germany), we worked on a project aimed at improving the urban neighborhood of state-subsidized rental apartments located on Carlmeyerstraße in Bielefeld.
Although it is often impossible to change the structure of built environments in urban neighborhoods, one can try to alter the social function of a place in order to increase its attendance and improve the local social climate in it. The idea was to encourage visiting the least-walkable paths and areas surrounding them by placing sports equipment in them. In order to quantify walkability in the urban neighborhood and identify the least-walkable paths through it, we used the approach discussed in Section 5. The spatial graph of the neighborhood was created by tessellation of the available walkable space, with overlapping rectangles anchored at the built environment (see Figure 8a). Each rectangle shown in Figure 8a represented a vertex of the spatial graph, and each overlap of these rectangles was interpreted as an edge in the graph. These edges were weighted by the probabilities defined in (22), and then the least probable paths connecting the rectangles were identified with the use of Dijkstra’s algorithm.
Figure 8.
The urban neighborhood of state-subsidized rental apartments located on Carlmeyerstraße in Bielefeld (Germany). (a) Tessellation of walkable space in the neighborhood with 66 overlapping rectangles anchored at the built environment. (b) The least-walkable paths in the neighborhood with places for sports equipment indicated.
The university initiative received positive feedback from residents of the neighborhood.
7. Conclusions
In the present paper, we studied three important applications of CNE walks defined in finite connected graphs. The CNE walks define the properly normalized probability distributions for all nodes, subgraphs, and nodal sets of the graph at all time scales of the diffusion process.
The first application is related to the quantitative measures of navigability in graphs with respect to the different graph exploration strategies. In general, frequently visited graph nodes are predicted more efficiently than those that are little frequented. In particular, each node of the graph can be characterized by the navigability and strive potentials that depend on the node’s function in the graph structure and the chosen graph exploration strategy. The navigability potential is defined as the probability that the node serves as a precursor for other nodes, while the strive potential of the node is interpreted as its neighborhood exploration time.
The second application is related to the mutual information quantifying the discernment of either of the diffusion modes by knowing the other, which we call perspicacity. We have shown that perfect perspicacity is possible only when both diffusion modes enjoy the same symmetry (grounded in the structural symmetry of the graph).
Finally, the third application is related to a walkability assessment of the graph, including identification of the most and least probable paths in that. We assume that the most walkable path is a graph is that most likely to be walked over a long enough observation time. The proposed walkability analysis was used for the assessment of walkability in urban neighborhoods and received positive feedback from the neighborhood residents.
The application of information theory methods to problems about graphs, in contrast to geometric, combinatoric, algorithmic, and algebraic approaches, can be called information graph theory. As it involves evaluating communication efficiency between individual systems’ units at different time and connectivity scales, information graph theory is in demand for a wide range of applications, such as designing network-on-chip architecture and engineering urban morphology within the concept of the smart city.
Funding
This research received no external funding.
Data Availability Statement
The data are contained within the article.
Acknowledgments
The author is grateful to his institution for the administrative and technical support.
Conflicts of Interest
The author declares no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| NoC | Network-on-chip |
| MCE | Microcanonical ensemble (of walks) |
| CNE | Canonical ensemble (of walks) |
| GCE | Grand canonical ensemble (of walks) |
References
- Johnson, J. Hypernetworks in the Science of Complex Systems; World Scientific Series “Complexity Science”; 516 Sherfield building Imperial College London SW7 2AZ United Kingdom; Imperial College Press: London, UK, 2014; Volume 3. [Google Scholar]
- Venkataraman, N.L.; Kumar, R. Design and analysis of application specific network on chip for reliable custom topology. Comput. Netw. 2019, 158, 69–76. [Google Scholar] [CrossRef]
- Manzoor, M.; Mir, R.N. A Review Of Design Approaches For Enhancing The Performance of Nocs at Communication Centric Level. Scalable Comput. Pract. Exp. 2021, 22, 347–364. [Google Scholar] [CrossRef]
- Alimi, I.A.; Patel, R.K.; Aboderin, O.; Abdalla, A.M.; Gbadamosi, R.A.; Muga, N.J.; Pinto, A.N.; Teixeira, A.L. Network-on-Chip Topologies: Potentials, Technical Challenges, Recent Advances and Research Direction. In Network-on-Chip-Architecture, Optimization, and Design Explorations; IntechOpen: London, UK, 2022. [Google Scholar]
- Kaleem, M.; Isnin, I.F.B. A Survey on Network on Chip Routing Algorithms Criteria. In Advances on Smart and Soft Computing. Advances in Intelligent Systems and Computing; Saeed, F., Al-Hadhrami, T., Mohammed, F., Mohammed, E., Eds.; Springer: Singapore, 2021; Volume 1188. [Google Scholar]
- Gogoi, A.; Ghoshal, B.; Sachan, A.; Kumar, R.; Manna, K. Application driven routing for mesh based Network-on-Chip architectures. Integr. Vlsi J. 2022, 84, 26–36. [Google Scholar] [CrossRef]
- Nassima, K.; Mouloud, K. A survey on fault-tolerant application mapping techniques for Network-on-Chip. J. Syst. Archit. 2019, 92, 39–52. [Google Scholar]
- Romanov, A.; Myachin, N.; Sukhov, A. Fault-Tolerant Routing in Networks-on-Chip Using Self-Organizing Routing Algorithms. In Proceedings of the IECON 2021—47th Annual Conference of the IEEE Industrial Electronics Society, Toronto, ON, Canada, 13–16 October 2021; pp. 1–6.
- Trajkovic, J.; Karimi, S.; Hangsan, S.; Zhang, W. Prediction Modeling for Application-Specific Communication Architecture Design of Optical NoC. ACM Trans. Embed. Comput. Syst. 2022, 21, 35. [Google Scholar] [CrossRef]
- D’Acci, L.; Batty, M. (Eds.) The Mathematics of Urban Morphology. In Birkhauser Series: Modeling and Simulation in Science; Springer: Berlin/Heidelberg, Germany, 2020; ISBN 9783030123833. [Google Scholar]
- Rassia, S.T.; Pardalos, P. (Eds.) Smart City Networks. In Through the Internet of Things; Springer Series Optimization and Its Applications; Springer: Cham, Switzerland, 2017; ISBN 978-3-319-61312-3. [Google Scholar]
- Hillier, B.; Hanson, J. The Social Logic of Space; (1993, reprint, paperback edition ed.); Cambridge University Press: Cambridge, UK, 1984. [Google Scholar]
- Hillier, B. Space Is the Machine: A Configurational Theory of Architecture; Cambridge University Press: Cambridge, UK, 1999; ISBN 0-521-64528-X. [Google Scholar]
- Hillier, B. The Common Language of Space: A Way of Looking at the Social, Economic and Environmental Functioning of Cities on a Common Basis; Bartlett School of Graduate Studies: London, UK, 2004. [Google Scholar]
- Vaughan, L.; Chatford, D.; Sahbaz, O. Space and Exclusion: The Relationship between Physical Segregation. Economic Marginalization and Poverty in the City; Paper presented to Fifth Intern; Space Syntax Symposium: Delft, The Netherlands, 2005. [Google Scholar]
- Epstein, R.A.; Vass, L.K. Neural systems for landmark-based wayfinding in humans. Philos. Trans. R. Soc. B Biol. Sci. 2014, 369, 20120533. [Google Scholar] [CrossRef]
- Macchi, O. The coincidence approach to stochastic point processes. Adv. Appl. Probab. 1975, 7, 83–122. [Google Scholar] [CrossRef]
- Webler, F.; Andersen, M. Measurement in the Age of Information. Information 2022, 13, 111. [Google Scholar] [CrossRef]
- Volchenkov, D. Grammar of Complexity: From Mathematics to a Sustainable World. In Nonlinear Physical Science; World Scientific Series; Higher Education Press: Beijing, China, 2018; pp. 1–3. [Google Scholar]
- Volchenkov, D. Memories of the Future. Predictable and Unpredictable Information in Fractional Flipping a Biased Coin. Entropy 2019, 21, 807. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Volchenkov, D. Infinite Ergodic Walks in Finite Connected Undirected Graphs. Entropy 2021, 23, 205. [Google Scholar] [CrossRef]
- Volchenkov, D.; Suh, C.S. Statistical Mechanics of Long Walks in Dynamic Complex Networks: Statistical Arguments for Diversifying Selection. Dynamics 2022, 2, 252–269. [Google Scholar] [CrossRef]
- Khinchin, A.I. Mathematical Foundations of Information Theory; Dover: New York, NY, USA, 1957. [Google Scholar]
- Ramshaw, J.D. The Statistical Foundations of Entropy; World Scientific: Singapore, 2018. [Google Scholar]
- Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley: Hoboken, NJ, USA, 1991; p. 576. [Google Scholar]
- Grabski, F. Discrete state space Markov processes. In Semi-Markov Processes: Applications in System Reliability and Maintenance; Elsevier: Amsterdam, The Ntherlands, 2015; pp. 1–17. ISBN 978-0-12-800518-7. [Google Scholar]
- Watanabe, S.; Accardi, L.; Freudenberg, W.; Ohya, M. (Eds.) Algebraic Geometrical Method in Singular Statistical Estimation; Series in Quantum Bio-Informatics; World Scientific: Singapore, 2008; pp. 325–336. [Google Scholar]
- James, R.G.; Ellison, C.J.; Crutchfield, J.P. Anatomy of a bit: Information in a time series observation. Chaos 2011, 21, 037109. [Google Scholar] [CrossRef] [Green Version]
- Travers, N.F.; Crutchfield, J.P. Infinite excess entropy processes with countable-state generators. Entropy 2014, 16, 1396–1413. [Google Scholar] [CrossRef] [Green Version]
- Volchenkov, D. Assessing Pandemic Uncertainty on Conditions of Vaccination and Self-isolation. Lobachevskii J. Math. 2022, 43, 490–500. [Google Scholar] [CrossRef]
- Minc, H. Nonnegative Matrices; John Wiley & Sons: New York, NY, USA, 1988. [Google Scholar]
- Mowshowitz, A. Graphs, groups and matrices. Proc. Summer Meet. Canad. Math. Congr. Cogr. Numer. 1971, 4, 509–522. [Google Scholar]
- Chan, A.; Godsil, C. Symmetry and Eigenvectors. In Graph Symmetry, Algebraic Methods and Applications; Hahn, G., Sabidussi, G., Eds.; Kluwer: Dordrecht, The Netherlands, 1997; pp. 75–106. [Google Scholar]
- Godsil, C.; Royle, G. Graduate Texts in Mathematics. Im Algebraic Graph Theory; Springer: New York, NY, USA, 2001; Volume 207. [Google Scholar]
- Blanchard, P.; Volchenkov, D. Understanding Complex Systems. In Mathematical Analysis of Urban Spatial Networks; Springer Series; Springer: Berlin/Heidelberg, Germany, 2009; ISBN 978-3-540-87828-5. [Google Scholar]
- Blanchard, P.; Volchenkov, D. Introduction to Random Walks and Diffusions on Graphs and Databases. In Springer Series Synergetics; Springer: Berlin/Heidelberg, Germany, 2011; Volume 10, ISBN 978-3-642-19591-4. [Google Scholar]
- Adler, R.L.; Konheim, A.G.; McAndrew, M.H. Topological entropy. Trans. Am. Math. Soc. 1965, 114, 309–319. [Google Scholar] [CrossRef]
- Delvenne, J.C.; Libert, A.S. Centrality measures and thermodynamic formalism for complex networks. Phys. Rev. E 2011, 83, 046117. [Google Scholar] [CrossRef] [Green Version]
- Burda, Z.; Duda, J.; Luck, J.M.; Waclaw, B. Localization of the Maximal Entropy Random Walk. Phys. Rev. Lett. 2009, 102, 160602. [Google Scholar] [CrossRef] [Green Version]
- Landau, E. Zur relativen Wertbemessung der Turnierresultate. Dtsch. Wochenschach 1895, 11, 366–369. [Google Scholar]
- Slater, J. The Theory of Complex Spectra. Phys. Rev. 1929, 34, 1293–1322. [Google Scholar] [CrossRef]
- Vershik, A.M. Asymptotic Combinatorics with Applications to Mathematical Physics a European Mathematical Summer School Held at the Euler Institute, St. Petersburg, Russia, July 9–20, 2001; Springer: Berlin, Germany, 2003; p. 151. ISBN 978-3-540-44890-7. [Google Scholar]
- Deng, N.; Zhou, W.; Haenggi, M. The Ginibre point process as a model for wireless networks with repulsion. IEEE Trans. Wirel. Commun. 2015, 14, 107–121. [Google Scholar] [CrossRef]
- Miyoshi, N.; Shirai, T. A Cellular Network Model with Ginibre Configured Base Stations. Adv. Appl. Probab. 2016, 46, 832–845. [Google Scholar] [CrossRef] [Green Version]
- Katz, N.M.; Sarnak, P. Zeroes of Zeta Functions and Symmetry. Bull. AMS 1999, 36, 1–26. [Google Scholar] [CrossRef] [Green Version]
- Muir, T. Orthogonants. In Treatise on the Theory of Determinants; Revised and Enlarged by W. H. Metzler; Dover: New York, NY, USA, 1960; Chapter 14. [Google Scholar]
- Golledge, R.G. (Ed.) Wayfinding Behavior. In Cognitive Mapping and Other Spatial Processes; John Hopkins University Press: Baltimore, MD, USA, 1999. [Google Scholar]
- Woods, C.T.; Rudd, J.; Robertson, S.; Davids, K. Wayfinding: How Ecological Perspectives of Navigating Dynamic Environments Can Enrich Our Understanding of the Learner and the Learning Process in Sport. Sport.-Med.-Open 2020, 6, 51. [Google Scholar] [CrossRef]
- Kohler, M.; Wehner, R. Idiosyncratic route-based memories in desert ants. NEurobiology Learn. Mem. 2004, 83, 1–12. [Google Scholar]
- Jacobs, J. The Death and Life of Great American Cities; Knopf Doubleday Publishing Group: New York, NY, USA, 1992. [Google Scholar]
- Dijkstra, E.W. A note on two problems in connexion with graphs. Numerische Mathematik 1959, 1, 269–271. [Google Scholar] [CrossRef] [Green Version]
- Mehlhorn, K.; Sanders, P. Chapter 10. Shortest Paths. In Algorithms and Data Structures: The Basic Toolbox; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).