Grand Canonical Ensembles of Sparse Networks and Bayesian Inference

Ginestra Bianconi

doi:10.3390/e24050633

¹

School of Mathematical Sciences, Queen Mary University of London, London E1 4NS, UK

²

The Alan Turing Institute, The British Library, London NW1 2DB, UK

Entropy2022, 24(5), 633;https://doi.org/10.3390/e24050633

This article belongs to the Topic Complex Systems and Network Science

Version Notes

Order Reprints

Abstract

Maximum entropy network ensembles have been very successful in modelling sparse network topologies and in solving challenging inference problems. However the sparse maximum entropy network models proposed so far have fixed number of nodes and are typically not exchangeable. Here we consider hierarchical models for exchangeable networks in the sparse limit, i.e., with the total number of links scaling linearly with the total number of nodes. The approach is grand canonical, i.e., the number of nodes of the network is not fixed a priori: it is finite but can be arbitrarily large. In this way the grand canonical network ensembles circumvent the difficulties in treating infinite sparse exchangeable networks which according to the Aldous-Hoover theorem must vanish. The approach can treat networks with given degree distribution or networks with given distribution of latent variables. When only a subgraph induced by a subset of nodes is known, this model allows a Bayesian estimation of the network size and the degree sequence (or the sequence of latent variables) of the entire network which can be used for network reconstruction.

Keywords:

network ensembles; hierarchical models; Bayesian inference

1. Introduction

Networks [1,2] have the ability to capture the topology of complex systems ranging from the brain to financial networks. Network models are key to have reliable unbiased null models of the network and to explain emergent phenomena of network evolution. Network model can be classified in two major classes: equilibrium maximum entropy models [3,4,5,6,7,8,9,10,11,12,13,14,15] and growing network models [1,16,17,18]. While growing network models have a number of nodes that increases in time, maximum entropy models are used so far only for treating networks of a given number of nodes N. In this paper we are interested in extending the realm of maximum entropy network models to networks of varying network size N.

Maximum entropy network ensembles are the least biased ensembles satisfying a given set of constraints. As such maximum entropy ensembles are widely used as null models and for network reconstruction starting from features associated to the nodes of the network. Given the profound relation between information theory and statistical mechanics [19,20], maximum entropy network ensembles can be distinguished between microcanonical ensembles and canonical ensembles [3,21,22] similarly to the analogous distinction traditionally introduced in statistical mechanics for ensembles of particles. Microcanonical network ensembles are ensembles of networks of N nodes satisfying some hard constraints (such as the total number of links, or the given degree sequence). Canonical network ensembles instead are ensembles of networks of N nodes satisfying some soft constraints, (such as the expected total number of links or the expected degree sequence). The canonical ensembles with expected degree sequence can be also formulated as latent variable models where the latent variables can be associated to the nodes [5,23].

Maximum entropy models have been very successful in solving challenging inference models [6,8,24,25,26], however they have the limitation that they only treat networks with a given fixed number of nodes N. Indeed in several scenarios, the number of nodes might not be fixed or might not be known. In this context an important problem is to compare networks of different network sizes. For instance in brain imaging one might choose a finer grid or a coarser grid of brain regions and an outstanding problem in machine learning is how to build neural networks that can generalize well when tested on network data with different network size than the network data in the training set [27,28].

In order to have network ensembles that can treat networks of different size, here we introduce the grand canonical network ensembles in which the number of nodes can vary. A well-defined grand-canonical network ensemble necessarily needs to be exchangeable [29], i.e., needs to be invariant under permutation of labels of the nodes of the network, so that removing or adding a node has an effect that is independent of the particular choice of the node added or removed.

The research on exchangeable networks is currently very vibrant. The graphon model [30] is the most well established exchangeable network model. However this model is dense, i.e., the number of links scales quadratically with the number of nodes while the vast majority of the network data is sparse with a total number of links scaling linearly with the network size. In other words most of the real world networks have constant average degree. However popular models for sparse networks such as the configuration model [31] and the exponential random graphs [4] are not exchangeable. In fact these models treat networks of labelled nodes with given degree or with given expected degree sequence. Therefore the network ensemble is not invariant under permutation of the node labels, except if all the degrees of all the expected degrees of the network are the same (for a more diffused discussion of why these networks are not exchangeable see discussion in ref. [32]). Several works have been proposed exchangeable network models in the when the average degree of the network diverges sublinearly with the network size [33,34,35,36,37,38]. Only recently, in ref. [32], a framework able to model sparse exchangeable networks in the limit of constant degree, has been proposed. The model is very general and has been extended to treat generalized network structures including multiplex networks [39] and simplicial complexes [40]. However the model is well defined only for finite networks of large but finite number of nodes N as exchangeable sparse networks need to obey the Aldous-Hoover theorem [41,42] according to which infinite sparse exchangeable networks must vanish. An alternative strategy for formulating exchangeable ensembles is to consider ensembles of unlabelled networks for which several results are already available [43].

Here we build on the recently proposed exchangeable sparse network ensembles [32] to formulate hierarchical grand-canonical ensembles of sparse networks. The proposed grand-canonical ensembles are hierarchical models [25,44] with variable number of nodes N and with given degree distribution or alternatively given latent variable distributions. The grand canonical approach provides a way to circumvent the limitations imposed by the Aldous-Hoover theorem because in this framework one considers a mixture of network ensembles with finite but unspecified and arbitrary large network sizes. In this paper we define the grand-canonical ensembles and we characterize them with statistical mechanics methods, evaluating their entropy, the marginal probability of a link and proposing generative algorithms to sample networks from these ensembles. [Note that the proposed grand canonical ensembles differ from the ensembles proposed in refs. [45,46], as in our case we consider networks with undetermined number of nodes, while in refs. [45,46] is the total sum of weights of weighted networks that is allowed to vary. From the statistical mechanics perspective our approach is fully classical while in refs. [45,46] networks ensembles are treated as quantum mechanical ensembles where the particles are associated to the links of the network and the adjacency matrix elements play the role of occupation numbers.].

Finally, we use the gran-canonical network ensembles to solve an inference problem. We consider a scenario in which the entire network has an unknown number of nodes, and we have only access to a subgraph induced by a subset of its nodes. In this hypothesis we use the grand-canonical network models to perform a Bayesian estimation of the true parameters of the network model (given by the network size and the degree sequence or the sequence of latent variables). This a posteriori estimate of the parameters can then be used to reconstruct the unknown part of the network.

2. The Grand Canonical Network Ensemble with Given Degree Distribution

We consider the hierarchical grand canonical ensemble of exchangeable sparse simple networks where we associate to every network

G = (V, E)

with

N = | V | > N_{0}

nodes the probability

P (G) = P (N) P (k | N) P (G | k, N)

(1)

where

P (N)

indicates the probability that the network G has N nodes,

P (k | N)

indicates the conditional probability that the network has degree sequence

k

given that the network has N nodes, and

P (G | k, N)

indicates the probability of the network G with adjacency matrix

a

given that the network has N nodes and degree sequence

k

(see Figure 1 for a schematic representation of the model).

Figure 1. Schematic representation of the hierarchical grand canonical ensemble of exchangeable sparse simple networks. The proposed ensemble is a hierarchical model of networks in which first the total number of nodes N is drawn from a

P (N) = π (N)

distribution, then a given degree sequence

k = {k_{1}, k_{2}, \dots k_{N}}

is drawn from the distribution

P (k | N)

among all the degree sequence with the total number of nodes N; finally a network G with adjacency matrix

a

drawn from the distribution

P (G | k, N)

among all the networks with a given total number of nodes N and degree sequence

k

. Panel (a) describes the hierarchical nature of the model, panel (b) provide an example of subsequent draw of the total number of nodes, the degree sequence and the adjacency matrix of the network, panel (c) is a visualization of the construction of a network according to the proposed model.

To be specific we consider the following model giving rise to the hierarchical grand canonical ensemble of exchangeable simple models:

(1): Drawing the total number of nodes N of the network. Let us discuss suitable choices for the distribution of the number of nodes N with N greater or equal than some minimum number of nodes $N_{0}$ . We indicate the distribution $P (N)$ as

$\begin{matrix} P (N) = π (N), for N \geq N_{0} . \end{matrix}$

(2)

While a statistical mechanics approach would suggest to take a distribution $π (N)$ with a well defined mean value (such as the exponential distribution)

$\begin{matrix} π (N) = C e^{- μ N} for N \geq N_{0}, \end{matrix}$

(3)

where C is a normalization constant and $μ > 0$ , in the context of network science it might actually be relevant to consider also broad distributions $π (N)$ such as power-law distributions

$\begin{matrix} π (N) = D N^{- ν} for N \geq N_{0}, \end{matrix}$

(4)

where D is a normalization constant and $ν > 1$ .
(2): Drawing the degree sequence of the network. In order to obtain a sparse exchangeable network ensemble with given degree distribution $p (k)$ having finite average degree $⟨ k ⟩$ , minimum allowed degree $\hat{m}$ and maximum allowed degree K we consider the following expression for the probability of a given degree sequence given the total number of nodes

$\begin{matrix} P (k | N) & = & \prod_{i = 1}^{N} [p (k_{i}) \hat{θ} (K - k_{i}) θ (k_{i} - \hat{m})] δ (\sum_{i = 1}^{N} k_{i}, ⟨ k ⟩ N), \end{matrix}$

(5)

where $\hat{θ} (x)$ indicates the Heaviside function $\hat{θ} (x) = 1$ if $x \geq 0$ and $\hat{θ} (x) = 0$ otherwise and where we used the notation $⟨ k ⟩ = \sum_{k} k p (k)$ . In the following we will indicate with L the total number of links of the network given by $L = ⟨ k ⟩ N / 2$ . Note that $P (k | N)$ is independent of the labels of the nodes, i.e., all the degree sequences that can be obtained by a permutation of the node labels of a given degree sequence have the same probability $P (k | N)$ .
(3): Drawing the adjacency matrix of the network. The probability of a network G with adjacency matrix $a$ given the total number of nodes N of the network and the degree sequence $k$ is chosen in the least biased way by drawing the network from a uniform distribution, i.e., the conditional probability $P (G | k, N)$ is equivalent to the probability of a network in the microcanonical ensemble. Therefore, by indicating with $N (k | N)$ the total number of networks with N nodes and degree sequence $k$ and with $\sum_{N} (k) = ln N (k | N)$ the entropy of the ensemble we can express $P (G | k, N)$ as

$\begin{matrix} P (G | k, N) = \frac{1}{N (k | N)} = e^{- \sum_{N} (k)} \end{matrix}$

(6)

Note that for sparse networks of $N \geq N_{0}$ nodes the entropy $\sum_{N} (k)$ obeys the Bender-Canfield formula as long as the network has a structural cutoff $K_{S}$ , i.e., as long as $k_{i} ≪ K_{S} = \sqrt{⟨ k ⟩ N_{0}}$ [3,21,22,47]

$\begin{matrix} \sum_{N} (k) = ln (\frac{(2 L)!!}{\prod_{i = 1}^{N} k_{i}!}) + o (N) \end{matrix}$

(7)

where in Equation (7) we indicate with $k = {k_{1}, k_{2}, \dots, k_{N}}$ the degree sequence with $k_{i}$ , the degree of node i, given by $k_{i} = \sum_{j = 1}^{N} a_{i j}$ .

It follows that the hierarchical grand canonical ensemble for exchangeable sparse networks can be cast into an Hamiltonian ensemble with probability

P (G)

given by

\begin{matrix} P (G) = \frac{1}{Z} e^{- H (G)} δ (⟨ k ⟩ N / 2, \sum_{i < j} a_{i j}) \hat{θ} (K - {max}_{i = 1}^{N} k_{i}) \hat{θ} ({min}_{i = 1}^{N} k_{i} - \hat{m}), \end{matrix}

(8)

with Hamiltonian

H (G)

given by

\begin{matrix} H (G) = - ln π (N) - \sum_{i = 1}^{N} ln [p (k_{i}) k_{i}! δ (k_{i}, \sum_{j = 1}^{N} a_{i j})] + ln ((⟨ k ⟩ N)!!) . \end{matrix}

(9)

This Hamiltonian is global and is invariant under permutation of the node labels, therefore this hierarchical grand canonical ensemble is exchangeable. Indeed we have that the probability of a network

P (G)

given by Equation (8) obeys

\begin{matrix} P (G) = P (\tilde{G}) \end{matrix}

(10)

where

\tilde{G}

is any network obtained from network G under a generic permutation

σ

of the labels of the nodes. Moreover we note that for

π (N) = δ (N, \bar{N})

, i.e., when the network size is fixed this model reduces to the exchangeable model for sparse network ensemble proposed in ref. [32].

3. The Grand Canonical Network Ensemble with Given Distribution of the Latent Variables

The grand canonical formalism can also be easily extended to treat network models with latent variables

θ

associated to the nodes of the network

G = (V, E)

. Note that here and in the following we assume that the latent variables take discrete values. To this end we can consider the soft grand canonical hierarchical model associating to each network with

N = | V | > N_{0}

nodes, latent variables

θ

and adjacency matrix

a

the probability

\begin{matrix} P (G, θ, N) = P (N) P (θ | N) P (G | θ, N) \end{matrix}

(11)

with

\begin{matrix} P (N) = π (N), \end{matrix}

(12)

where

π (N)

is an arbitrary prior on the number of nodes in the network defined for

N \geq N_{0}

. Typical examples of the distribution

π (N)

are given by Equations (3) and (4). The probability of the latent variables is chosen to be exchangeable and given by

\begin{matrix} P (θ | N) = \prod_{i = 1}^{N} p (θ_{i}) \end{matrix}

(13)

where

p (θ_{i})

is the probability distribution of each latent variable. The distribution

p (θ)

can be chosen arbitrarily, as long as the expectation of

θ

is finite. The probability of the network given the network size and the latent variables is chosen to be derived by a Bernoulli variable for each link, with probability of observing a link between node i and node j conditioned on the value of their latent variables given by

p_{N} (θ_{i}, θ_{j})

, i.e.,

\begin{matrix} P (G | θ, N) = \prod_{i < j} p_{N} {(θ_{i}, θ_{j})}^{a_{i j}} {(1 - p_{N} (θ_{i}, θ_{j}))}^{1 - a_{i j}} . \end{matrix}

(14)

To be concrete we consider the following expression for the probability

p_{N} (θ_{i}, θ_{j})

which is the general expression of the marginal probability of a link in canonical network ensembles (or equivalently exponential random graph models),

\begin{matrix} p_{N} (θ_{i}, θ_{j}) = \frac{θ_{i} θ_{j} / N}{1 + θ_{i} θ_{j} / N} . \end{matrix}

(15)

The advantage of taking this expression for the probability

p_{N} (θ_{i}, θ_{j})

is that

p_{N} (θ_{i}, θ_{j})

is always smaller or equal to one for every value of the latent variables. Therefore in this model we do not need to impose a structural cutoff on the latent variables. In summary the grand canonical network ensemble with given latent variable distribution is a hierarchical network model in which given the network size and latent variables the network is drawn according to a canonical ensemble of networks. In this ensemble the probability of a network G can be written in Hamiltonian form as

\begin{matrix} P (G) = \frac{1}{Z} e^{- H (G)} \end{matrix}

(16)

with Hamiltonian

H (G)

given by

\begin{matrix} H (G) = - ln π (N) - \sum_{i = 1}^{N} p_{N} (θ_{i}) - \sum_{i < j} \{a_{i j} ln p_{N} (θ_{i}, θ_{j}) + (1 - a_{i j}) ln [1 - p_{N} (θ_{i}, θ_{j})]\} . \end{matrix}

(17)

This Hamitonian is invariant under permutation of the node labels, therefore this model is exchangeable.

4. The Entropy of Grand Canonical Ensembles

In this paragraph we show that the entropy S [3,48] of the two proposed grand canonical network ensembles, defined as

\begin{matrix} S = \sum_{G} P (G) ln P (G), \end{matrix}

(18)

can be decomposed into contributions that reflect the uncertainty related to an increasing number of hierarchical levels of the model. In order to show this results we discuss separately the entropy of the two proposed grand canonical ensembles.

4.1. Entropy of the Grand Canonical Ensemble with Given Degree Distribution

The entropy S of the ensemble fixing the degree distribution can be decomposed into the entropy of the model at different levels of the hierarchy according to the following expression,

\begin{matrix} S = S_{π (N)} + {⟨ S_{p (k)} ⟩}_{π (N)} + {⟨\sum_{N} (k)⟩}_{π (N), p (k)} \end{matrix}

(19)

where

S_{π (N)}

is the entropy associated to the number of typical choices of the total number of nodes N,

{⟨ S_{p (k)} ⟩}_{π (N)}

is the entropy associated to the choice of the degree sequence averaged over the distribution

π (N)

and

{⟨\sum_{N} (k)⟩}_{π (N), p (k)}

is the average of the Gibbs entropy [3] of the networks with given degree sequence averaged over the distribution

π (N)

and

P (k | N)

. In other words we have

\begin{matrix} S_{π (N)} & = & - \sum_{N > N_{0}} π (N) ln π (N), \\ {⟨ S_{p (k)} ⟩}_{π (N)} & = & \sum_{N > N_{0}} π (N) [- N \sum_{k} p (k) ln p (k)], \\ {⟨\sum_{N} (k)⟩}_{π (N), p (k)} & = & \sum_{N > N_{0}} π (N) \sum_{k} P (k | N) \sum_{N} (k) . \end{matrix}

(20)

4.2. Entropy of the Grand Canonical Ensemble with Given Latent Variable Distribution

Similarly to the previous case, it is easy to show that the entropy of the ensemble fixing the distribution of the latent variables can be decomposed into the entropy of the model at different levels of their hierarchy, according to the following expression

\begin{matrix} S = S_{π (N)} + {⟨ S_{p (θ)} ⟩}_{π (N)} + {⟨S_{N} (θ)⟩}_{π (N), p (θ)}, \end{matrix}

(21)

where

S_{π (N)}

is the entropy associated to the number of typical choices of the total number of nodes N,

{⟨ S_{p (θ)} ⟩}_{π (N)}

is the entropy associated to the choice of the latent variable distribution averaged over the distribution

π (N)

and

{⟨S_{N} (θ)⟩}_{π (N), p (k)}

is the average of the Shannon entropy [3] of the networks with given sequence of latent variables averaged over the distribution

π (N)

and

P (θ | N)

. In other words we have

\begin{matrix} S_{π (N)} & = & - \sum_{N > N_{0}} π (N) ln π (N), \\ {⟨ S_{p (θ)} ⟩}_{π (N)} & = & \sum_{N > N_{0}} π (N) [- N \sum_{θ} p (θ) ln p (θ)], \\ {⟨S_{N} (θ)⟩}_{π (N), p (θ)} & = & \sum_{N > N_{0}} π (N) \sum_{θ} P (θ | N) S_{N} (θ), \end{matrix}

(22)

where the Shannon entropy

S_{N} (θ)

of the network given the sequence of latent variables and the network size N can be expressed as

\begin{matrix} S_{N} (θ) = - \sum_{i < j} [p_{N} (θ_{i}, θ_{j}) ln p_{N} (θ_{i}, θ_{j}) + (1 - p_{N} (θ_{i}, θ_{j})) ln (1 - p_{N} (θ_{i}, θ_{j}))] . \end{matrix}

(23)

5. Marginal Probability of a Link

5.1. The Case of the Grand Canonical Ensemble with Given Degree Distribution

The grand canonical ensemble of exchangeable sparse network ensembles is an ensemble in which the total number of nodes is not specified. If we consider the networks of this ensemble having a given number of nodes N, the model reduces to the exchangeable sparse network ensemble proposed in ref. [32] whose marginal probability of a link

(i, j)

is given by

\begin{matrix} {\tilde{p}}_{i j} = \sum_{k} p (k) \sum_{k^{'}} p (k^{'}) \frac{k k^{'}}{⟨ k ⟩ N} . \end{matrix}

(24)

Since the grand-canonical ensemble of sparse exchangeable networks with given degree distribution can be interpreted as a mixture of the exchangeable sparse models proposed in ref. [32] with different size N, it is immediate to show that the marginal probability of a link between node i and node j in the grand canonical ensembles is given by the exchangeable expression,

\begin{matrix} p_{i j} = \sum_{N > N_{0}} π (N) \sum_{k, k^{'}} p (k) p (k^{'}) \frac{k k^{'}}{⟨ k ⟩ N} = \sum_{N > N_{0}} π (N) \frac{⟨ k ⟩}{N} . \end{matrix}

(25)

Moreover the probability that two nodes are connected given that they have degree k and

k^{'}

is given by

\begin{matrix} p_{i j | k_{i} = k, k_{j} = k^{'}} = p (k, k^{'}) = k k^{'} \sum_{N > N_{0}} \frac{π (N)}{⟨ k ⟩ N} . \end{matrix}

(26)

Finally the probability that two nodes are connected given that they have degree k and

k^{'}

and the actual size of the network is N is given by the uncorrelated network expression

\begin{matrix} p_{i j | k_{i} = k, k_{j} = k^{'}, N} = p_{N} (k, k^{'}) = \frac{k k^{'}}{⟨ k ⟩ N} . \end{matrix}

(27)

From these expressions of the marginal probability of a link it is possible to appreciate how the hierarchical grand canonical ensemble of sparse exchangeable networks circumvents the difficulties arising form the Aldous-Hoover theorem without violating it. Indeed the marginal probability

p_{N} (k, k^{'})

of a link conditioned on the degrees of the two linked nodes and the number of nodes N of the network vanishes in the limit

N \to \infty

, however if the number of nodes of the network is arbitrarily large but unknown the marginal probability of the link remains finite (as both

p_{i j}

and

p (k, k^{'})

are finite).

5.2. The Case of the Grand Canonical Ensemble with Given Latent Variable Distribution

For the grand canonical ensemble with given latent variable distribution

p (θ)

we have that the marginal probability of a link is given by

\begin{matrix} p_{i j} = \sum_{N > N_{0}} π (N) \sum_{θ, θ^{'}} p (θ) p (θ^{'}) p_{N} (θ, θ^{'}) . \end{matrix}

(28)

The probability of the link given the latent variable of the nodes is given by

\begin{matrix} p (θ, θ^{'}) = θ θ^{'} \sum_{N > N_{0}} π (N) \frac{1}{N + θ θ^{'}}; \end{matrix}

(29)

The probability of a link given the network size and the latent variables is given by

\begin{matrix} p_{N} (θ, θ^{'}) = \frac{θ θ^{'} / N}{1 + θ θ^{'} / N} . \end{matrix}

(30)

As we discussed in the case of the grand canonical ensemble with given degree distribution also for the grand canonical ensemble with given latent variable distribution the grand canonical approach allows to circumvent the Aldous-Hoover theorem without violating it as the marginal probability of a link in an arbitrarily large network of unknown size is finite.

6. Generating Single Instances of Grand-Canonical Network Ensembles

In this section we describe two algorithms to generate single instances of the proposed grand canonical ensembles. In particular we will discuss a Metropolis-Hastings ensemble to generate single instances of networks drawn from the grand canonical ensemble with given degree distribution and a Monte Carlo algorithm to generate single instances of networks drawn from the grand canonical ensemble with given distribution of latent variables.

6.1. Metropolis-Hastings Algorithm for the Grand-Canonical Ensemble with Given Degree Distribution

The grand-canonical exchangeable ensemble of sparse networks can be obtained by implementing a Metropolis-Hastings algorithm using the network Hamiltonian given by Equation (9).

(1): Start with a network of N nodes having exactly $L = ⟨ k ⟩ N / 2$ links and in which the minimum degree is greater of equal to $\hat{m}$ and the maximum degree is smaller or equal to K.
(2): Perform the Metropolis-Hastings algorithm for exchangeable sparse networks with N nodes (defined below);
(3): Propose to change the number of nodes to $N^{'} = N + 1$ (addition of one node) or $N^{'} = N - 1$ (removal of one node) with equal probability and accept the move with probability $max (1, π (N^{'}) / π (N))$ as long as $N^{'} > N_{0}$ . If the move is accepted change the number of nodes adding or removing a node, set the number of links to $L = ⟨ k ⟩ N / 2$ and ensure that each node has minimal degree at least $\hat{m}$ and maximum degree less than K. In particular if a node is added ensure it has at least $\hat{m}$ links by rewiring randomly the existing links of the networks and adding a number of links so that the total number of links is the integer that better approximates $⟨ k ⟩ N / 2$ . Instead, if a node needs to be removed, choose a random node of the network remove it and rewire/remove links in order to enforce that the total number of links is the integer that better approximates $⟨ k ⟩ N / 2$ .

The Metropolis-Hastings algorithm for the exchangeable sparse networks with N nodes is the same algorithm used in Ref. [32] for exchangeable networks with finite size N and is indicated below.

(1)

Start with a network of N nodes having exactly

L = ⟨ k ⟩ N / 2

links and in which the minimum degree is greater of equal to

\hat{m}

and the maximum degree is smaller or equal to K.

(2)

Iterate the following steps until equilibration:

(i): Let $a$ be the adjacency matrix of the network;
(i): Choose randomly a random link $ℓ = (i, j)$ between node i and j and choose a pair of random nodes $(i^{'}, j^{'})$ not connected by a link.
(ii): Let $a^{'}$ be the adjacency matrix of the network in which the link $(i, j)$ is removed and the link $(i^{'}, j^{'})$ is inserted instead. Draw a random number r from a uniform distribution in $[0, 1]$ , i.e., $r \sim U (0, 1)$ . If $r < \max (1, e^{- Δ H})$ where $Δ H = H (a^{'}) - H (a)$ and if the move does not violate the conditions on the minimum and maximum degree of the network, replace $a$ by $a^{'}$ .

The Metropolis-Hastings algorithm can be used to sample the space of networks with variable number of nodes and given (stable) degree distribution (see Figure 2).

Figure 2. Results of the Metropolis-Hastings algorithm for generating grand canonical ensembles with given degree distribution. The number of nodes

N (t)

as a function of time t in the Metropolis-Hastings simulation of an exponential networks (panel (a)) and networks with more general degree distribution (panel (c)) are shown together with the average degree distribution of the networks that is stable as the number of networks varies (symbols of panel (b) and (d)). The solid lines in panel (b) and panel (d) indicate the target degree distributions

p (k) = C e^{- k / m}

with

m = 5

(for panel (b)) and

p (k) = C {(3 + k)}^{- γ}

with

γ = 3.4

(for panel (d)). The prior on the number of nodes is taken to be exponential

π (N) = C e^{- N / \bar{N}}

with

\bar{N} = 1000

with

N_{0} = 500

and

K = 16

.

6.2. Monte Carlo Generation of Grand Canonical Network Ensemble with Given Latent Variable Distribution

A single instance of the grand canonical model with given latent variable distribution can be obtained by performing the following algorithm:

1: Draw the network size N from the $π (N)$ distribution;
2: Draw the latent variable $θ_{i}$ of each node i independently from the latent variable distribution $p (θ)$ .
3: Draw each link $(i, j)$ of the network with probability $p_{N} (θ_{i}, θ_{j})$ .

7. Bayesian Estimation of the Network Parameters Given Partial Knowledge of the Network

In this section we will use the grand canonical network ensembles for calculating the posterior distribution of the network parameters given partial information of a network

G = (V, E)

. In particular let us assume that we only know the subgraph

\hat{G} (\hat{V}, \hat{E})

induced by a set of nodes

\hat{V} \subset V

of

\hat{N} = | \hat{V} |

nodes and of adjacency matrix

\hat{a}

and we do not have access to the full network G with adjacency matrix

a

. Without loss of generality let us label the nodes of the network in such a way that the labels i with

1 \leq i \leq \hat{N}

indicate the nodes in

\tilde{V}

(denote as sampled nodes) and the labels i with

i > \hat{N}

indicate the nodes in

V \ \hat{V}

(denoted also as unsampled or unknown nodes). We indicate with

κ

the degree sequence of the sampled network

\hat{G}

. Our goal is to make a Bayesian estimation of the network size N and the true network parameters given the observed subgraph

\hat{G}

. These a posteriori estimates of the true parameters of the network can then be used to reconstruct the unknown part of the network G.

7.1. Inferring the True Parameters with the Grand Canonical Ensemble with Given Degree Distribution

In this paragraph we will use the grand canonical ensemble with given degree distribution to find the posterior probability distribution of the network parameters. For convenience we will indicate with

k_{i}

the true degree of the sampled nodes

1 \leq i \leq \hat{N}

and we will indicate

q_{i}

the true degree of the remaining unsampled

N - \hat{N}

nodes

\hat{N} + 1 \leq i \leq N

. To this end, using the Bayes rule we get the following expression for the posterior distribution of the network parameters given the observed subgraph

\hat{G}

\begin{matrix} P (N, k, q | \hat{G}) = \frac{P (N) P (k, q | N) P (\hat{G} | k, q, N)}{P (\hat{G})} \end{matrix}

(31)

where

\begin{matrix} P (N) & = & π (N), \\ P (k, q | N) & = & \prod_{i = 1}^{\hat{N}} p (k_{i}) \prod_{i = 1 + \hat{N}}^{N} p (q_{i}), \\ P (\hat{G} | k, q, N) & = & e^{- Δ_{N} \sum (k, q | κ)}, \end{matrix}

(32)

with

Δ_{N} \sum (k, q | κ)

given by

\begin{matrix} Δ_{N} \sum (k, q | κ) = \sum_{N} (k, q) - {\sum^{^}}_{N} (k, q | κ) . \end{matrix}

(33)

Here

\sum_{N} (k, q)

indicates the entropy of the network fo size N with degree sequence

[k, q]

whose expression is given by the Bender-Canfield formula [3,21,22,47] (Equation (7)) which reads in this case

\begin{matrix} \sum_{N} (k, q) = (2 L)!! {[\prod_{i = 1}^{\hat{N}} k_{i}! \prod_{i = 1 + \hat{N}}^{N} q_{i}!]}^{- 1} . \end{matrix}

(34)

Moreover

{\sum^{^}}_{N} (k, q | κ)

indicates the logarithm of the number of networks of N nodes having

\hat{G}

(with adjacency matrix

\hat{a}

and degree sequence

κ

) as induced subgraph between the

\hat{N}

sampled nodes.

Moreover in Equation (31)

P (\hat{G})

indicates the evidence of the data given by

\begin{matrix} P (\hat{G}) = \sum_{N} \sum_{k, q} π (N) \prod_{i = 1}^{\hat{N}} p (k_{i}) \prod_{i = 1 + \hat{N}}^{N} p (q_{i}) e^{- Δ_{N} \sum (k, q | κ)} . \end{matrix}

(35)

Calculating the entropy

{\sum^{^}}_{N} (k, q | κ)

using statistical mechanics methods including the use of a functional order parameter (see Appendix A), we derive the following expression:

\begin{matrix} {\sum^{^}}_{N} (k, q | κ) & = & ln [\frac{M! (Q - M)!!}{\prod_{i = 1}^{\hat{N}} (k_{i} - κ_{i})! \prod_{i = 1 + \hat{N}}^{N} q_{i}!} (\begin{matrix} Q \\ M \end{matrix}) δ (Q + M, 2 L + 2 \hat{L})] \end{matrix}

(36)

where M indicates the number of links between the sampled nodes and the unsampled nodes and Q indicates the sum over all the degrees of the unsampled nodes, i.e.,

\begin{matrix} M & = & \sum_{i = 1}^{\hat{N}} (k_{i} - κ_{i}), \\ Q & = & \sum_{i = 1 + \hat{N}}^{N} q_{i}, \end{matrix}

(37)

where M and Q need to satisfy the constraint enforcing that the total number of true links is given by

L = ⟨ k ⟩ N / 2

. Therefore, indicating with

\hat{L} = \sum_{i = 1}^{\hat{N}} κ_{i} / 2

, we must impose

\begin{matrix} Q + M = 2 L - 2 \hat{L} . \end{matrix}

(38)

The expression obtained for the entropy

{\sum^{^}}_{N} (k, q | κ)

implies that the asymptotic expression for the number of networks with N nodes, degree sequence

[k, q]

having

\hat{G}

as a subgraph is given by (see Appendix A for the derivation)

\begin{matrix} N (k, q | κ, N) = e^{{\sum^{^}}_{N} (k, q | κ)} = \frac{M! (Q - M)!!}{\prod_{i = 1}^{\hat{N}} (k_{i} - κ_{i})! \prod_{i = 1 + \hat{N}}^{N} q_{i}!} (\begin{matrix} Q \\ M \end{matrix}) δ (Q + M, 2 L + 2 \hat{L}) . \end{matrix}

(39)

This expression admits a simple combinatorial interpretation. In fact the networks with degree sequence

[k, q]

having as subgraph

\hat{G}

can be constructed by adding (unsampled) links to the graph

\hat{G}

. The unsampled part of the network can be constructed by assigning to each node i with

1 \leq i \leq \hat{N}

a number of stubs given by

k_{i} - κ_{i}

and to each node i with

i > \hat{N}

a number of stubs given by

q_{i}

. The unsampled networks can then be obtained by matching the stubs pairwise with the constrains that the stubs of the first

\hat{N}

nodes can be only matched with the stubs of the unsampled nodes

i > \hat{N}

. Therefore the reconstructed part of the network is formed by a bipartite network between the sampled and the unsampled nodes with a number of links given by M and a simple network among the unsampled nodes with number of links given by

(Q - M) / 2

. The number of matchings of the M links of the bipartite network is given by

M!

the number of matching of the stubs of the simple network among unsampled nodes is

(Q - M)!!

. In order to get the number of distinct networks G with degree sequence

[k, q]

having as subgraph

\hat{G}

we need to divide by the number of permutations of the stubs belonging to the same nodes and we need to multiply by Q choose M indicating the number of ways in which we can choose the M stubs of the unsampled nodes to be matched with the stubs of the sampled nodes.

Given the expression for

{\sum^{^}}_{N} (k, q | κ)

provided by Equation (36), we can deduce the explicit expression for

Δ_{N} \sum (k, q | κ)

:

\begin{matrix} Δ_{N} \sum (k, q | κ) & = & ln [\prod_{i = 1}^{\hat{N}} \frac{k_{i}!}{(k_{i} - κ_{i})!} \frac{M! (Q - M)!!}{(⟨ k ⟩ N)!!} (\begin{matrix} Q \\ M \end{matrix}) δ (Q + M, 2 L - 2 \hat{L})] . \end{matrix}

(40)

It follows that the describe Bayesian inference assigns a probability to the model parameters a probability

\begin{matrix} P (N, k, q | \hat{G}) \propto π (N) \prod_{i = 1}^{\hat{N}} p (k_{i}) \prod_{i = 1 + \hat{N} + 1}^{N} p (q_{i}) e^{- Δ_{N} \sum (k, q | κ)}, \end{matrix}

(41)

with

Δ_{N} \sum (k, q | κ)

given by Equation (40). From this expression, imposing with a delta function that

M = \sum_{i = 1}^{\hat{N}} (k_{i} - κ_{i})

, expressing the delta in integral form and using the saddle point to evaluate the integral, we can calculate the marginal probability

P (k_{i} | \hat{G}, ω)

that a sampled node i with

1 \leq i \leq \hat{N}

has true degree

k_{i} \geq κ_{i}

given M and Q, i.e.,

\begin{matrix} P (k_{i} | \hat{G}, ω) \propto p (k_{i}) \frac{k_{i}!}{(k_{i} - κ_{i})!} e^{- ω k_{i}} \hat{θ} (k_{i} - κ_{i}) \end{matrix}

(42)

where

ω

is related to M by

\begin{matrix} M = \sum_{i = 1}^{\hat{N}} \frac{\sum_{k} p (k) k \frac{k!}{(k - κ_{i})!} e^{- ω k}}{\sum_{k^{'}} p (k^{'}) \frac{k^{'}!}{(k^{'} - κ_{i})!} e^{- ω k^{'}}} . \end{matrix}

(43)

In Figure 3 we show the difference between an exponential prior distribution

p (k)

on the degree of the nodes and the posterior marginal probability of the true degree of the sampled nodes

P (k | \hat{G}, ω)

plotted for different values of the sampled degree

κ

of the same node. Finally, we can calculate the a posteriori probability

P (N | \hat{G}, M)

that the real networks has N nodes, conditioned to M and to the sampled subrgraph

\hat{G}

. To this end we sum Equation (41) over all the possible values of the degrees

k

and

q

such that Equation (37) are satisfied. Therefore, by inserting Equation (40) into Equation (41), enforcing Equation (37) with Kronecker deltas and integrating over all the possible values of

k

and

q

we get

\begin{matrix} P (N | \hat{G}, M) \propto π (N) \hat{θ} (N - \hat{N}) C_{M, N} I^{(k)} (M) I^{(q)} (M, N), \end{matrix}

(44)

where

\begin{matrix} C_{M, N} & = & \frac{M! (Q - M)!!}{(⟨ k ⟩ N)!!} (\begin{matrix} Q \\ M \end{matrix}), \\ I^{(k)} (M) & = & \sum_{k} [\prod_{i = 1}^{\hat{N}} p (k_{i}) \frac{k_{i}!}{(k_{i} - κ_{i})!} δ (M, \sum_{i = 1}^{\hat{N}} k_{i})], \\ I^{(q)} (M, N) & = & \sum_{q} [\prod_{i = 1 + \hat{N}}^{N} p (q_{i}) δ (Q, \sum_{i = 1 + \hat{N}}^{N} q_{i})], \end{matrix}

(45)

where

Q = ⟨ k ⟩ N - 2 \hat{L} - M

. By expressing the Kronecker deltas in an integral form according to the expression

\begin{matrix} δ (x, y) = \frac{1}{2 π} \int_{- π}^{π} e^{i ω (x - y)}, \end{matrix}

(46)

performing a Wick rotation and evaluating the integrals at the saddle point, we can express

I^{(k)} (M)

and

I^{(q)} (M, N)

as

\begin{matrix} I^{(k)} (M) & = & \frac{1}{2 π} [\prod_{i = 1}^{\hat{N}} \sum_{k > κ_{i}} p (k) \frac{k!}{(k - κ_{i})!} e^{- ω^{⋆} k}] e^{ω^{⋆} M}, \\ I^{(q)} (M, N) & = & \frac{1}{2 π} {[\sum_{q} p (q) e^{- {\bar{ω}}^{⋆} q}]}^{N - \hat{N}} e^{{\bar{ω}}^{⋆} Q}, \end{matrix}

(47)

with

ω^{⋆}

and

{\bar{ω}}^{⋆}

fixed by the saddle point equations

\begin{matrix} M & = & \sum_{i = 1}^{\hat{N}} \frac{\sum_{k > κ_{i}} p (k) \frac{k!}{(k - κ_{i})!} k e^{- ω^{⋆} k}}{\sum_{k > κ_{i}} p (k) \frac{k!}{(k - κ_{i})!} e^{- ω^{⋆} k}}, \\ Q & = & (N - \hat{N}) \frac{\sum_{q} p (q) q e^{- {\bar{ω}}^{⋆} q}}{\sum_{q} p (q) e^{- {\bar{ω}}^{⋆} q}} . \end{matrix}

(48)

Figure 3. Marginal posterior probability for the true degree and of the true latent variable of a sampled node. The posterior probability

P (k_{i} | \hat{G}, ω)

(panel (a)) of the true degree of a sampled nodes depends on the degree

κ

of the nodes in the sampled network

\hat{G}

and is non-zero only for

k \geq κ

. The posterior probability

P (θ | \hat{G}, \bar{θ})

of the latent variable of a sampled node (panel (b)) can be non-zero on the entire range of

θ

values allowed by the prior. Here we have plotted

P (k_{i} | \hat{G}, ω)

and

P (θ | \hat{G}, \bar{θ})

for different values of

κ

and we have chosen

ω = 2

and

\bar{θ} = 0.6

. The dashed lines indicate the exponential prior on the degrees (panel (a)) and on the latent variables (panel (b)).

In Figure 4 we display the marginal a posteriori distribution

P (N | \hat{G}, M)

as function of M demonstrating that the sampled network can modify significantly the prior assumptions on the total number of nodes in the network.

Figure 4. Marginal posterior probability for the true number of nodes in the grand canonical ensemble with given degree distribution and in the grand canonical ensemble with given latent variable distribution. The posterior probability

P (N | \hat{G}, M)

in panel (a) of the true number of nodes depends on the total number M of true but not observed links of the sampled nodes and on the total number of sampled links

\hat{L}

; the posterior probability

P (N | \hat{G})

in panel (b) depends instead only on the degree

κ

of the nodes in the sampled network

\hat{G}

. We took

N_{0} = 100

and the priors given by

π (N) \propto e^{- N / \hat{N}}

,

p (k) \propto e^{- k / m}

,

p (θ) \propto e^{- θ / m}

with

\hat{N} = 200

, and

m = 7

. In panel (a) we have plotted

P (N | \hat{G}, M)

for different values of

M = (⟨ k ⟩ - n) \hat{N}

with

n = 1, 2, 3, 4

and

\hat{L} = \hat{N} / 2

; in panel (b) we have plotted

P (N | \hat{G})

assuming that

\hat{G}

is regular with all sampled nodes having sampled degree

κ = 1, 2, 3, 4, 5

. The dashed lines indicate the exponential prior

π (N)

on the number of nodes.

7.2. Inferring the True Parameters with the Grand Canonical Ensemble with Given Latent Variable Distribution

In this section we treat the problem of Bayesian estimation of the parameters of the true network G given the sampled network

\hat{G}

using the grand canonical model with given latent variable distribution. Let us indicate with

θ_{i}

the latent variables of the sampled nodes

1 \leq i \leq \hat{N}

and with

ϕ_{i}

the latent variables of the unsampled nodes

i > \hat{N}

. Using Bayes rule we have

\begin{matrix} P (N, θ, ϕ | \hat{G}) = \frac{P (N) P (θ, ϕ | N) P (\hat{G} | θ, ϕ, N)}{P (\hat{G})}, \end{matrix}

(49)

where

P (\hat{G} | θ, ϕ, N)

is independent of

ϕ

, i.e.,

P (\hat{G} | θ, ϕ, N) = P (\hat{G} | θ, N)

and where

\begin{matrix} P (N) & = & π (N), \\ P (θ, ϕ | N) & = & \prod_{i = 1}^{\hat{N}} p (θ_{i}) \prod_{i = 1 + \hat{N}}^{N} p (ϕ_{i}), \\ P (\hat{G} | θ, N) & = & \prod_{i < j | i, j \in \hat{V}} p_{N} {(θ_{i}, θ_{j})}^{{\hat{a}}_{i j}} {(1 - p_{N} (θ_{i}, θ_{j}))}^{1 - {\hat{a}}_{i j}} \end{matrix}

(50)

with

p_{N} (θ_{i}, θ_{j})

given by Equation (15) and with

\hat{a}

indicating the adjacency matrix of the sampled subgraph

\hat{G}

. In Equation (49)

P (\hat{G})

indicates the evidence of the data given by

\begin{matrix} P (\hat{G}) = \sum_{N} \sum_{θ} π (N) \prod_{i = 1}^{\hat{N}} p (θ_{i}) P (\hat{G} | θ, N) . \end{matrix}

(51)

Since, as we have observed previously,

P (\hat{G} | θ, ϕ, N)

is independent of

ϕ

the Bayesian estimation of the parameters

ϕ

reduces simply to the prior in this case. Therefore we focus here only on the Bayesian estimate of the latent variables

θ

, i.e., we consider

\begin{matrix} P (N, θ | \hat{G}) = \frac{P (N) P (θ | N) P (\hat{G} | θ, N)}{P (\hat{G})}, \end{matrix}

(52)

with

P (N), P (\hat{G} | θ, N), P (\hat{G})

having the same definition as above and

\begin{matrix} P (θ | N) = \prod_{i = 1}^{\hat{N}} p (θ_{i}) . \end{matrix}

(53)

Using the explicit expression of

p_{N} (θ_{i}, θ_{j})

given by Equation (15), we can express the likelihood

P (\hat{G} | θ, N)

of the sampled network as

\begin{matrix} P (\hat{G} | θ, N) & = & \prod_{i = 1}^{\hat{N}} θ_{i}^{κ_{i}} \prod_{i < j | i, j \in \hat{V}} {(1 + \frac{θ_{i} θ_{j}}{N})}^{- 1} \frac{1}{N^{\hat{L}}}, \end{matrix}

(54)

where

\hat{L}

is the number of links of the sampled network

\hat{G}

. In the limit

N ≫ 1

we can approximate this expression as

\begin{matrix} P (\hat{G} | θ, N) ≃ \int d \bar{θ} \prod_{i = 1}^{\hat{N}} [θ_{i}^{κ_{i}} e^{- θ_{i} \bar{θ} / 2}] \frac{1}{N^{\hat{L}}} δ (\bar{θ}, \sum_{j = 1}^{\hat{N}} θ_{j} / N) \end{matrix}

(55)

with this approximation we get that the posterior probability

P (N, θ | \hat{G})

is given by

\begin{matrix} P (N, θ | \hat{G}) \propto & π (N) \frac{1}{N^{\hat{L}}} \int d \bar{θ} \prod_{i = 1}^{\hat{N}} [p (θ_{i}) θ_{i}^{κ_{i}} e^{- θ_{i} \bar{θ} / 2}] δ (\bar{θ}, \sum_{j = 1}^{\hat{N}} θ_{j} / N) . \end{matrix}

(56)

Calculating the marginal posterior probability of a single latent variable conditional of

\bar{θ}

we get

\begin{matrix} P (θ_{i} | \hat{G}, \bar{θ}) = p (θ_{i}) θ_{i}^{κ_{i}} e^{- θ_{i} \bar{θ} / 2} . \end{matrix}

(57)

In Figure 3 we show the difference between an exponential prior distribution

p (θ)

on the latent variables of the nodes and the posterior marginal probability of the true latent variables of the sampled nodes

P (θ | \hat{G}, \bar{θ})

plotted for different values of the sampled degree

κ

of the same node.

Stating from Equation (56) we can also calculate the posterior distribution

P (N | \hat{G})

of the true number of nodes

N > \hat{N}

. To this end we express the delta function in an integral form and we sum over all possible latent variables

θ

, obtaining

\begin{matrix} P (N | \hat{G}) \propto π (N) \hat{θ} (N - \hat{N}) \frac{1}{N^{\hat{L} - 1}} \frac{1}{2 π} \int d \bar{θ} d ω e^{i N ω \bar{θ}} I^{(θ)} (ω, \bar{θ}) \end{matrix}

(58)

where

I^{(θ)} (ω, \bar{θ})

is given by

\begin{matrix} I^{(θ)} = \prod_{i = 1}^{\hat{N}} (\sum_{θ} p (θ) θ^{κ_{i}} e^{- θ (\bar{θ} / 2 - i ω)}) . \end{matrix}

(59)

The integrals in Equation (58) can be calculated at the saddle point getting

\begin{matrix} P (N | \hat{G}) \propto π (N) \hat{θ} (N - \hat{N}) \frac{1}{N^{\hat{L} - 1}} e^{N \frac{{({\bar{θ}}^{⋆})}^{2}}{2}} [\prod_{i = 1}^{\hat{N}} (\sum_{θ} p (θ) θ^{κ_{i}} e^{- θ {\bar{θ}}^{⋆}})] \end{matrix}

(60)

where

\begin{matrix} {\bar{θ}}^{⋆} = \frac{1}{N} \sum_{i = 1}^{\hat{N}} \frac{\sum_{θ} p (θ) θ^{κ_{i} + 1} e^{- θ {\bar{θ}}^{⋆}}}{\sum_{θ} p (θ) θ^{κ_{i}} e^{- θ {\bar{θ}}^{⋆}}} . \end{matrix}

(61)

In Figure 4 we display the marginal a posteriori distribution

P (N | \hat{G})

on the true number of nodes in the simplified assumption in which

\hat{G}

is regular and all degree

κ

are the same demonstrating that the sampled network can modify significantly the prior assumptions on the total number of nodes in the network.

8. Conclusions

In this paper we have proposed grand canonical network ensembles formed by networks of varying number of nodes. The grand canonical network ensembles we have introduced are both sparse and exchangeable, i.e., have a finite average degree and are invariant under permutation of the node labels. The grand canonical ensembles are hierarchical network models in which first the network size is selected, then the degree sequence (or the sequence of latent variables) and finally the network adjacency matrix is selected. The model circumvents the difficulties imposed by the Aldous-Hoover theorem that states that exchangeable infinite sparse network ensembles vanish, as the network is a mixture of finite networks, although the networks can have an arbitrarily large network size. Here we show how the grand-canonical ensembles can be used to perform a Bayesian estimation of the network parameters when only partial information about the network structures is known. This a posteriori estimation of the network parameters can then be used for network reconstruction.

The grand canonical framework for sparse exchangeable network ensembles is here described for the case simple networks but has the potential to be extended to generalized network structures including directed, bipartite networks, multiplex networks and simplicial complexes following the lines outlined in ref. [32].

In conclusion we hope that this work, proposing hierarchical grand canonical network ensembles able to treat networks of different size and relating network theory to statistical mechanics will stimulate further results of mathematicians, physicists, and computer scientists working in network science and related machine learning problems.

Funding

G.B. acknowledges support from the Royal Society IEC\NSFC\191147.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Derivation of ${\sum^{^}}_{N}$ (k,q|κ)

In this Appendix our goal is to derive the asymptotic expression of

{\sum^{^}}_{N} (k, q | κ)

in the limit of large network size of the sampled network

\hat{N} ≫ 1

, and of the true network

N = (1 + α) \hat{N} ≫ 1

with

α > 0

.

Let us assume that the sampled subgraph G is the network between the sampled nodes

1 \leq i \leq \hat{N}

and has adjacency matrix

\hat{a}

. The true network is instead formed by N nodes with adjacency matrix

a

. We assume that

a

has the block structure given by

\begin{matrix} a = (\begin{matrix} \hat{a} & b \\ b^{⊤} & \tilde{a} \end{matrix}), \end{matrix}

(A1)

where

b

indicates the

\hat{N} \times α \hat{N}

matrix between sampled nodes and the unsampled nodes and

\tilde{a}

indicates tha

(α \hat{N}) \times (α \hat{N})

adjacency matrix among the unsampled nodes. As we have mentioned in the main text

{\sum^{^}}_{N} (k, q | κ)

is the logarithm of the number

N (k, q | κ, N)

of networks (or adjacency matrices

a

) with degree sequence

[k, q]

and admitting as a subgraph

\hat{G}

having sampled degree sequence

κ

. In statistical mechanics we also call

N (k, q | κ, N)

the partition function of its corresponding statistical mechanics network model, and we indicate it by Z. In terms of the matrices

b

and

\tilde{a}

the partition function

Z = N (k, q | κ) = exp ({\sum^{^}}_{N} (k, q | κ))

can be written as

\begin{matrix} Z & = & \sum_{b, \tilde{a}} \prod_{i = 1}^{\hat{N}} δ (k_{i} - \sum_{j = 1}^{N} a_{i j}) \prod_{i = 1 + \hat{N}}^{N} δ (q_{i} - \sum_{j = 1}^{N} a_{i j}) δ (2 L - \sum_{i = 1}^{N} k_{i}) \end{matrix}

(A2)

Expressing the Kronecker deltas in the integral form and performing the sum over the elements of the matrices

b

and

\tilde{a}

we obtain

\begin{matrix} Z & = & \int D ω \int D \tilde{ω} \int \frac{d λ}{2 π} e^{G (ω, \tilde{ω}, λ)} \end{matrix}

(A3)

with

\begin{matrix} G (ω, \tilde{ω}, λ) & = & \sum_{i = 1}^{\hat{N}} [i ω_{i} (k_{i} - κ_{i})] + \sum_{i = 1 + \hat{N}}^{N} [i {\tilde{ω}}_{i} q_{i}] + \sum_{i = 1}^{\hat{N}} \sum_{j = 1}^{\hat{N}} ln (1 + e^{- i ω_{i} - i {\tilde{ω}}_{j} - i λ}) \\ + & \frac{1}{2} \sum_{i = \hat{N} + 1}^{N} \sum_{j = \hat{N} + 1}^{N} ln (1 + e^{- i {\tilde{ω}}_{i} - i {\tilde{ω}}_{j} - i λ}) + i λ (L - \hat{L}), \end{matrix}

(A4)

and with

D ω = \prod_{i = 1}^{\hat{N}} [d ω_{i} / (2 π)]

and

D \tilde{ω} = \prod_{i = 1 + \hat{N}}^{N} [d {\tilde{ω}}_{i} / (2 π)]

. Let us now introduce the functional order parameters [22,49,50].

\begin{matrix} c_{κ, k} (ω) & = & \frac{1}{\hat{N} \hat{P} (κ, k)} \sum_{i = 1}^{\hat{N}} δ (ω - ω_{i}) δ (k, k_{i}) δ (κ, κ_{i}), \\ ρ_{q} (\tilde{ω}) & = & \frac{1}{α \hat{N} \tilde{P} (q)} \sum_{i = 1 + \hat{N}}^{N} δ (\tilde{ω} - {\tilde{ω}}_{i}) δ (q, q_{i}), \end{matrix}

(A5)

where

\hat{P} (k, κ)

is the fraction of sampled nodes with degree

κ

in the sampled network and total inferred degree k;

\tilde{P} (q)

is the fraction of unsampled nodes with degree q. Moreover we have indicated with

L = ⟨ k ⟩ N / 2

and with

\hat{L} = \sum_{i = 1}^{\hat{N}} κ_{i} / 2

. By enforcing the definition of the order parameters with a series of delta functions we obtain

\begin{matrix} 1 = \int d c_{κ, k} (ω) δ (c_{κ, k} (ω) - \frac{1}{\hat{N} \hat{P} (κ, k)} \sum_{i = 1}^{\hat{N}} δ (ω - ω_{i}) δ (k, k_{i}) δ (κ, κ_{i})) \\ = \int \frac{d {\hat{c}}_{κ, k} (ω) d c_{κ, k} (ω)}{2 π / (\hat{N} \hat{P} (κ, k) Δ ω)} exp [i Δ ω {\hat{c}}_{κ, k} (ω) [\hat{N} \hat{P} (κ, k) c_{κ, k} (ω) - \sum_{i = 1}^{\hat{N}} δ (ω - ω_{i}) δ (k, k_{i}) δ (κ, κ_{i})]] . \\ 1 = \int d ρ_{q} (\tilde{ω}) δ (ρ_{q} (\tilde{ω}) - \frac{1}{α \hat{N} \tilde{P} (q)} \sum_{i = 1 + \hat{N}}^{N} δ (\tilde{ω} - {\tilde{ω}}_{i}) δ (q, q_{i})) \\ = \int \frac{d {\hat{ρ}}_{q} (\tilde{ω}) d ρ_{q} (\tilde{ω})}{2 π / (α \hat{N} \tilde{P} (q) Δ \tilde{ω})} exp [i Δ \tilde{ω} {\hat{ρ}}_{q} (\tilde{ω}) [α \hat{N} \tilde{P} (q) ρ_{q} (\tilde{ω}) - \sum_{i = 1 + \hat{N}}^{N} δ (\tilde{ω} - {\tilde{ω}}_{i}) δ (q, q_{i})]] . \end{matrix}

After inserting these expressions into the partition function in the limit

Δ ω \to 0

, indicating with

\sum^{'}

the sum over the allowed degree range we obtain

\begin{matrix} Z = \sum_{κ}^{'} \sum_{k}^{'} \sum_{q}^{'} \int \prod_{κ, k} D c_{κ, k} (ω) \int \prod_{κ, k} D {\hat{c}}_{κ, k} (ω) \int \prod_{q} D ρ_{q} (\tilde{ω}) \int \prod_{q} D {\hat{ρ}}_{q} (\tilde{ω}) \int \frac{d λ}{2 π} e^{\hat{N} f} \end{matrix}

with

f = f (c (ω, k), \hat{c} (ω, k), ρ (\tilde{ω}, q), \hat{ρ} (\tilde{ω}, q), λ, h)

given by

\begin{matrix} f = \sum_{\hat{m} \leq κ \leq K} \sum_{κ \leq k \leq K} \hat{P} (κ, k) i \int d ω {\hat{c}}_{κ, k} (ω) c_{κ, k} (ω) + α i \int d ω \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) {\hat{ρ}}_{q} (\tilde{ω}) ρ_{q} (\tilde{ω}) \\ + i λ (L - \hat{L}) / \hat{N} + Ψ + \sum_{\hat{m} \leq κ \leq K} \sum_{κ \leq k \leq K} \hat{P} (κ, k) ln \int \frac{d ω}{2 π} e^{i ω (k - κ) - i {\hat{c}}_{κ, k} (ω, k)} \\ + α \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) ln \int \frac{d \tilde{ω}}{2 π} e^{i \tilde{ω} q - i {\hat{ρ}}_{q} (\tilde{ω})}, \end{matrix}

(A6)

where

Ψ

is given by

\begin{matrix} Ψ = \frac{α^{2} \hat{N}}{2} \sum_{\hat{m} \leq q \leq K, \hat{m} \leq q^{'} \leq K} \tilde{P} (q) \tilde{P} (q^{'}) \int d ω \int d {\tilde{ω}}^{'} ρ_{q} (\tilde{ω}) ρ_{q^{'}} ({\tilde{ω}}^{'}) ln (1 + e^{- i \tilde{ω} - i {\tilde{ω}}^{'} - i λ}) \\ + α \hat{N} \sum_{\hat{m} \leq κ \leq K} \sum_{κ \leq k \leq K} \hat{P} (κ, k) \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) \int d ω \int d \tilde{ω} c_{κ, k} (ω) ρ_{q} (\tilde{ω}) ln (1 + e^{- i ω - i \tilde{ω} - i λ}), \end{matrix}

and where the functional measures are defined as

\begin{matrix} D c_{κ, k} (ω) & = & lim_{Δ ω \to 0} \prod_{ω} [d c_{k κ} (ω) \sqrt{\hat{N} \hat{P} (κ, k) Δ ω / (2 π)}] \\ D {\hat{c}}_{κ, k} (ω) & = & lim_{Δ ω \to 0} \prod_{ω} [d {\hat{c}}_{κ, k} (ω) \sqrt{\hat{N} \hat{P} (κ, k) Δ ω / (2 π)}], \\ D ρ_{q} (\tilde{ω}) & = & lim_{Δ \tilde{ω} \to 0} \prod_{\tilde{ω}} [d ρ_{q} (\tilde{ω}) \sqrt{\hat{N} α P \tilde{(} q) Δ \tilde{ω} / (2 π)}], \\ D {\hat{ρ}}_{q} (\tilde{ω}) & = & lim_{Δ \tilde{ω} \to 0} \prod_{\tilde{ω}} [d {\hat{ρ}}_{q} (\tilde{ω}) \sqrt{\hat{N} α \tilde{P} (q) Δ \tilde{ω} / (2 π)}] . \end{matrix}

(A7)

By putting

\begin{matrix} e^{- i λ} = \frac{z}{\hat{N}}, \end{matrix}

(A8)

and performing a Wick rotation in

λ

and assuming

z / \hat{N} = e^{- i λ}

real and much smaller than one, i.e.,

z / \hat{N} ≪ 1

which is allowed in the sparse regime, we can linearize the logarithm and express

Ψ

as

\begin{matrix} Ψ = z α ν (\frac{1}{2} α ν + \hat{ν}), \end{matrix}

(A9)

with

\begin{matrix} ν & = & \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) \int d \tilde{ω} ρ_{q} (\tilde{ω}) e^{- i \tilde{ω}} . \\ \hat{ν} & = & \sum_{\hat{m} \leq κ \leq K} \sum_{\hat{κ} \leq k \leq K} \hat{P} (κ, k) \int d ω c_{κ, k} (ω) e^{- i ω} . \end{matrix}

(A10)

The saddle point equations determining the value of the partition function can be obtained by performing the (functional) derivative of f with respect to the functional order parameters, obtaining

\begin{matrix} - i {\hat{c}}_{κ, k} (ω) & = & z α ν e^{- i ω}, \\ - i {\hat{ρ}}_{q} (ω) & = & z (α ν + \hat{ν}) e^{- i \tilde{ω}}, \\ c_{κ, k} (ω) & = & \hat{P} (κ, k) \frac{\frac{1}{2 π} e^{i ω (k - κ) - i {\hat{c}}_{κ, k} (ω)}}{\int \frac{d ω^{'}}{2 π} e^{i ω^{'} (k - κ) - i {\hat{c}}_{κ, k} (ω^{'})}}, \\ ρ_{q} (\tilde{ω}) & = & \tilde{P} (q) \frac{\frac{1}{2 π} e^{i \tilde{ω} q - i {\hat{ρ}}_{q} (\tilde{ω})}}{\int \frac{d {\tilde{ω}}^{'}}{2 π} e^{i {\tilde{ω}}^{'} q - i {\hat{ρ}}_{q} ({\tilde{ω}}^{'})}}, \\ 2 \frac{L - \hat{L}}{\hat{N}} & = & z α ν (α ν + 2 \hat{ν}) . \end{matrix}

(A11)

Let us first calculate the integrals

\begin{matrix} I_{κ, k} & = & \int \frac{d ω}{2 π} e^{- i ω (k - κ) - i {\hat{c}}_{κ, k} (ω)} = \frac{1}{(k - κ)!} {(z α ν)}^{k - κ}, \\ I_{q} & = & \int \frac{d \tilde{ω}}{2 π} e^{- i \tilde{ω} q - i {\hat{ρ}}_{q} (\tilde{ω})} = \frac{1}{q!} {[z (α ν + \hat{ν})]}^{q}, \end{matrix}

(A12)

Using these expressions for the integral we can write the functional order parameters as

\begin{matrix} c_{κ, k} (ω) = \hat{P} (κ, k) \frac{1}{2 π} \frac{e^{i ω (k - κ) + (z α ν) e^{- i ω}}}{I_{κ, k}}, \\ ρ_{q} (\tilde{ω}) = \tilde{P} (q) \frac{1}{2 π} \frac{e^{i \tilde{ω} q + [z ν (α ν + \hat{ν})] e^{- i \tilde{ω}}}}{I_{q}} . \end{matrix}

(A13)

With this expression, using a similar procedure we can express

ν

as

\begin{matrix} \hat{ν} & = & \int d ω \sum_{\hat{m} \leq κ \leq K} \sum_{κ \leq k \leq K} c_{κ, k} (ω) e^{- i ω} = \sum_{\hat{κ} \leq k \leq K} \hat{P} (κ, k) (k - κ) {(α z ν)}^{- 1} . \\ ν & = & \int d \tilde{ω} \sum_{\hat{m} \leq q \leq K} ρ_{q} (\tilde{ω}) e^{- i \tilde{ω}} = \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) q {[z (α ν + \hat{ν})]}^{- 1} . \end{matrix}

(A14)

Combing these equations with the last saddle point equation it is immediate to show that

z, ν

and

\hat{ν}

are given by

\begin{matrix} z & = & 1, \\ α ν & = & \sqrt{(Q - M) / \hat{N}}, \\ \hat{ν} & = & \frac{M / \hat{N}}{\sqrt{(Q - M) / \hat{N}}} . \end{matrix}

(A15)

with

\begin{matrix} 2 L - 2 \hat{L} = M + Q . \end{matrix}

(A16)

Calculating the free energy

\hat{N} f

at the saddle point, we get

\begin{matrix} \hat{N} f & = & - \frac{1}{2} (Q - M) - M + (L - \hat{L}) ln \hat{N} + \hat{N} \sum_{\hat{\vec{m}} \leq κ \leq K} \sum_{\hat{κ} \leq k \leq K} \hat{P} (κ, k) ln \frac{{(α ν)}^{k - κ}}{(k - κ)!} \\ + α \hat{N} \sum_{\hat{m} \leq q \leq K} \tilde{P} (q) ln \frac{{[α ν + \hat{ν}]}^{q}}{q!}, \end{matrix}

(A17)

which leads to the following asymptotic expression for

Z = N (k, q | κ, N) = exp

({\sum^{^}}_{N} (k, q | κ))

\begin{matrix} Z = N (k, q | κ, N) & ≃ & \frac{M! (Q - M)!!}{\prod_{i = 1}^{\hat{N}} k_{i}! \prod_{i = 1 + \hat{N}}^{N} q_{i}!} (\begin{matrix} Q \\ M \end{matrix}) . \end{matrix}

(A18)

References

Barabási, A.L. Network Science; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
Newman Mark, E. Networks: An Introduction; Oxford University Press: Cambridge, UK, 2010. [Google Scholar]
Anand, K.; Bianconi, G. Entropy measures for networks: Toward an information theory of complex topologies. Phys. Rev. E 2009, 80, 045102. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Park, J.; Newman, M.E. Statistical mechanics of networks. Phys. Rev. E 2004, 70, 066117. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Bianconi, G. Information theory of spatial network ensembles. In Handbook on Entropy, Complexity and Spatial Dynamics; Edward Elgar Publishing: Cheltenham, UK, 2021. [Google Scholar]
Cimini, G.; Squartini, T.; Saracco, F.; Garlaschelli, D.; Gabrielli, A.; Caldarelli, G. The statistical physics of real-world networks. Nat. Rev. Phys. 2019, 1, 58–71. [Google Scholar] [CrossRef] [Green Version]
Krioukov, D.; Papadopoulos, F.; Kitsak, M.; Vahdat, A.; Boguná, M. Hyperbolic geometry of complex networks. Phys. Rev. E 2010, 82, 036106. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Orsini, C.; Dankulov, M.M.; Colomer-de Simón, P.; Jamakovic, A.; Mahadevan, P.; Vahdat, A.; Bassler, K.E.; Toroczkai, Z.; Boguná, M.; Caldarelli, G.; et al. Quantifying randomness in real networks. Nat. Commun. 2015, 6, 8627. [Google Scholar] [CrossRef]
Peixoto, T.P. Entropy of stochastic blockmodel ensembles. Phys. Rev. E 2012, 85, 056122. [Google Scholar] [CrossRef] [Green Version]
Radicchi, F.; Krioukov, D.; Hartle, H.; Bianconi, G. Classical information theory of networks. J. Phys. Complex. 2020, 1, 025001. [Google Scholar] [CrossRef]
Pessoa, P.; Costa, F.X.; Caticha, A. Entropic dynamics on Gibbs statistical manifolds. Entropy 2021, 23, 494. [Google Scholar] [CrossRef]
Kim, H.; Del Genio, C.I.; Bassler, K.E.; Toroczkai, Z. Constructing and sampling directed graphs with given degree sequences. New J. Phys. 2012, 14, 023012. [Google Scholar] [CrossRef] [Green Version]
Del Genio, C.I.; Kim, H.; Toroczkai, Z.; Bassler, K.E. Efficient and exact sampling of simple graphs with given arbitrary degree sequence. PLoS ONE 2010, 5, e10012. [Google Scholar] [CrossRef] [Green Version]
Coolen, A.C.; Annibale, A.; Roberts, E. Generating Random Networks and Graphs; Oxford University Press: Oxford, UK, 2017. [Google Scholar]
Bassler, K.E.; Del Genio, C.I.; Erdős, P.L.; Miklós, I.; Toroczkai, Z. Exact sampling of graphs with prescribed degree correlations. New J. Phys. 2015, 17, 083052. [Google Scholar] [CrossRef]
Barabási, A.L.; Albert, R. Emergence of scaling in random networks. Science 1999, 286, 509–512. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Dorogovtsev, S.N.; Dorogovtsev, S.N.; Mendes, J.F. Evolution of Networks: From Biological Nets to the Internet and WWW; Oxford University Press: Oxford, UK, 2003. [Google Scholar]
Kharel, S.R.; Mezei, T.R.; Chung, S.; Erdős, P.L.; Toroczkai, Z. Degree-preserving network growth. Nat. Phys. 2021, 18, 100–106. [Google Scholar] [CrossRef]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620. [Google Scholar] [CrossRef]
Huang, K. Introduction to Statistical Physics; Chapman and Hall: London, UK; CRC: Boca Raton, FL, USA, 2009. [Google Scholar]
Anand, K.; Bianconi, G. Gibbs entropy of network ensembles by cavity methods. Phys. Rev. E 2010, 82, 011116. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Bianconi, G.; Coolen, A.C.; Vicente, C.J.P. Entropies of complex networks with hierarchically constrained topologies. Phys. Rev. E 2008, 78, 016114. [Google Scholar] [CrossRef] [Green Version]
Caldarelli, G.; Capocci, A.; De Los Rios, P.; Munoz, M.A. Scale-free networks from varying vertex intrinsic fitness. Phys. Rev. Lett. 2002, 89, 258702. [Google Scholar] [CrossRef] [Green Version]
Bianconi, G.; Pin, P.; Marsili, M. Assessing the relevance of node features for network structure. Proc. Natl. Acad. Sci. USA 2009, 106, 11433–11438. [Google Scholar] [CrossRef] [Green Version]
Airoldi, E.M.; Blei, D.; Fienberg, S.; Xing, E. Mixed membership stochastic blockmodels. Adv. Neural Inf. Process. Syst. 2008, 21, 1981–2014. [Google Scholar]
Ghavasieh, A.; Nicolini, C.; De Domenico, M. Statistical physics of complex information dynamics. Phys. Rev. E 2020, 102, 052304. [Google Scholar] [CrossRef]
Bevilacqua, B.; Zhou, Y.; Ribeiro, B. Size-invariant graph representations for graph classification extrapolations. In Proceedings of the International Conference on Machine Learning, PMLR, London, UK, 8–11 November 2021; pp. 837–851. [Google Scholar]
Cotta, L.; Morris, C.; Ribeiro, B. Reconstruction for powerful graph representations. Adv. Neural Inf. Process. Syst. 2021, 34. [Google Scholar] [CrossRef]
De Finetti, B. Funzione Caratteristica Di un Fenomeno Aleatorio; Accademia Nazionale Lincei: Rome, Italy, 1931; Volume 4. [Google Scholar]
Lovász, L. Large Networks and Graph Limits; American Mathematical Society: Providence, RI, USA, 2012; Volume 60. [Google Scholar]
Chung, F.; Lu, L. The average distances in random graphs with given expected degrees. Proc. Natl. Acad. Sci. USA 2002, 99, 15879–15882. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Bianconi, G. Statistical physics of exchangeable sparse simple networks, multiplex networks, and simplicial complexes. Phys. Rev. E 2022, 105, 034310. [Google Scholar] [CrossRef] [PubMed]
Caron, F.; Fox, E.B. Sparse graphs using exchangeable random measures. J. R. Stat. Soc. Ser. Stat. Methodol. 2017, 79, 1295. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Borgs, C.; Chayes, J.T.; Cohn, H.; Holden, N. Sparse exchangeable graphs and their limits via graphon processes. arXiv 2016, arXiv:1601.07134. [Google Scholar]
Veitch, V.; Roy, D.M. The class of random graphs arising from exchangeable random measures. arXiv 2015, arXiv:1512.03099. [Google Scholar]
Veitch, V.; Roy, D.M. Sampling and estimation for (sparse) exchangeable graphs. Ann. Stat. 2019, 47, 3274–3299. [Google Scholar] [CrossRef] [Green Version]
Borgs, C.; Chayes, J.T.; Smith, A. Private graphon estimation for sparse graphs. arXiv 2015, arXiv:1506.06162. [Google Scholar]
Borgs, C.; Chayes, J.; Smith, A.; Zadik, I. Revealing network structure, confidentially: Improved rates for node-private graphon estimation. In Proceedings of the 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS), Paris, France, 7–9 October 2018; pp. 533–543. [Google Scholar]
Bianconi, G. Multilayer Networks: Structure and Function; Oxford University Press: Oxford, UK, 2018. [Google Scholar]
Bianconi, G. Higher-Order Networks: An Introduction to Simplicial Complexes; Cambridge University Press: Cambridge, UK, 2021. [Google Scholar]
Aldous, D.J. Representations for partially exchangeable arrays of random variables. J. Multivar. Anal. 1981, 11, 581–598. [Google Scholar] [CrossRef] [Green Version]
Hoover, D.N. Relations on Probability Spaces and Arrays of Random Variables; Institute for Advanced Study: Princeton, NJ, USA, 1979; Volume 2, p. 275. [Google Scholar]
Paton, J.; Hartle, H.; Stepanyants, J.; van der Hoorn, P.; Krioukov, D. Entropy of labeled versus unlabeled networks. arXiv 2022, arXiv:2204.08508. [Google Scholar]
Peixoto, T.P. Hierarchical block structures and high-resolution model selection in large networks. Phys. Review X 2014, 4, 011047. [Google Scholar] [CrossRef] [Green Version]
Gabrielli, A.; Mastrandrea, R.; Caldarelli, G.; Cimini, G. Grand canonical ensemble of weighted networks. Phys. Rev. E 2019, 99, 030301. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Straka, M.J.; Caldarelli, G.; Saracco, F. Grand canonical validation of the bipartite international trade network. Phys. Rev. E 2017, 96, 022306. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Bender, E.A.; Canfield, E.R. The asymptotic number of labeled graphs with given degree sequences. J. Comb. Theory Ser. A 1978, 24, 296–307. [Google Scholar] [CrossRef] [Green Version]
Bianconi, G. Entropy of network ensembles. Phys. Rev. E 2009, 79, 036114. [Google Scholar] [CrossRef] [Green Version]
Courtney, O.T.; Bianconi, G. Generalized network structures: The configuration model and the canonical ensemble of simplicial complexes. Phys. Rev. E 2016, 93, 062311. [Google Scholar] [CrossRef] [Green Version]
Monasson, R.; Zecchina, R. Statistical mechanics of the random K-satisfiability model. Phys. Rev. E 1997, 56, 1357. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Schematic representation of the hierarchical grand canonical ensemble of exchangeable sparse simple networks. The proposed ensemble is a hierarchical model of networks in which first the total number of nodes N is drawn from a

P (N) = π (N)

distribution, then a given degree sequence

k = {k_{1}, k_{2}, \dots k_{N}}

is drawn from the distribution

P (k | N)

among all the degree sequence with the total number of nodes N; finally a network G with adjacency matrix

a

drawn from the distribution

P (G | k, N)

among all the networks with a given total number of nodes N and degree sequence

k

. Panel (a) describes the hierarchical nature of the model, panel (b) provide an example of subsequent draw of the total number of nodes, the degree sequence and the adjacency matrix of the network, panel (c) is a visualization of the construction of a network according to the proposed model.

Figure 2. Results of the Metropolis-Hastings algorithm for generating grand canonical ensembles with given degree distribution. The number of nodes

N (t)

as a function of time t in the Metropolis-Hastings simulation of an exponential networks (panel (a)) and networks with more general degree distribution (panel (c)) are shown together with the average degree distribution of the networks that is stable as the number of networks varies (symbols of panel (b) and (d)). The solid lines in panel (b) and panel (d) indicate the target degree distributions

p (k) = C e^{- k / m}

with

m = 5

(for panel (b)) and

p (k) = C {(3 + k)}^{- γ}

with

γ = 3.4

(for panel (d)). The prior on the number of nodes is taken to be exponential

π (N) = C e^{- N / \bar{N}}

with

\bar{N} = 1000

with

N_{0} = 500

and

K = 16

.

Figure 3. Marginal posterior probability for the true degree and of the true latent variable of a sampled node. The posterior probability

P (k_{i} | \hat{G}, ω)

(panel (a)) of the true degree of a sampled nodes depends on the degree

κ

of the nodes in the sampled network

\hat{G}

and is non-zero only for

k \geq κ

. The posterior probability

P (θ | \hat{G}, \bar{θ})

of the latent variable of a sampled node (panel (b)) can be non-zero on the entire range of

θ

values allowed by the prior. Here we have plotted

P (k_{i} | \hat{G}, ω)

and

P (θ | \hat{G}, \bar{θ})

for different values of

κ

and we have chosen

ω = 2

and

\bar{θ} = 0.6

. The dashed lines indicate the exponential prior on the degrees (panel (a)) and on the latent variables (panel (b)).

Figure 4. Marginal posterior probability for the true number of nodes in the grand canonical ensemble with given degree distribution and in the grand canonical ensemble with given latent variable distribution. The posterior probability

P (N | \hat{G}, M)

in panel (a) of the true number of nodes depends on the total number M of true but not observed links of the sampled nodes and on the total number of sampled links

\hat{L}

; the posterior probability

P (N | \hat{G})

in panel (b) depends instead only on the degree

κ

of the nodes in the sampled network

\hat{G}

. We took

N_{0} = 100

and the priors given by

π (N) \propto e^{- N / \hat{N}}

,

p (k) \propto e^{- k / m}

,

p (θ) \propto e^{- θ / m}

with

\hat{N} = 200

, and

m = 7

. In panel (a) we have plotted

P (N | \hat{G}, M)

for different values of

M = (⟨ k ⟩ - n) \hat{N}

with

n = 1, 2, 3, 4

and

\hat{L} = \hat{N} / 2

; in panel (b) we have plotted

P (N | \hat{G})

assuming that

\hat{G}

is regular with all sampled nodes having sampled degree

κ = 1, 2, 3, 4, 5

. The dashed lines indicate the exponential prior

π (N)

on the number of nodes.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Grand Canonical Ensembles of Sparse Networks and Bayesian Inference

Abstract

1. Introduction

2. The Grand Canonical Network Ensemble with Given Degree Distribution

3. The Grand Canonical Network Ensemble with Given Distribution of the Latent Variables

4. The Entropy of Grand Canonical Ensembles

4.1. Entropy of the Grand Canonical Ensemble with Given Degree Distribution

4.2. Entropy of the Grand Canonical Ensemble with Given Latent Variable Distribution

5. Marginal Probability of a Link

5.1. The Case of the Grand Canonical Ensemble with Given Degree Distribution

5.2. The Case of the Grand Canonical Ensemble with Given Latent Variable Distribution

6. Generating Single Instances of Grand-Canonical Network Ensembles

6.1. Metropolis-Hastings Algorithm for the Grand-Canonical Ensemble with Given Degree Distribution

6.2. Monte Carlo Generation of Grand Canonical Network Ensemble with Given Latent Variable Distribution

7. Bayesian Estimation of the Network Parameters Given Partial Knowledge of the Network

7.1. Inferring the True Parameters with the Grand Canonical Ensemble with Given Degree Distribution

7.2. Inferring the True Parameters with the Grand Canonical Ensemble with Given Latent Variable Distribution

8. Conclusions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Derivation of ${\sum^{^}}_{N}$ (k,q|κ)

References

Article Metrics

Citations

Article Access Statistics

Grand Canonical Ensembles of Sparse Networks and Bayesian Inference

Abstract

1. Introduction

2. The Grand Canonical Network Ensemble with Given Degree Distribution

3. The Grand Canonical Network Ensemble with Given Distribution of the Latent Variables

4. The Entropy of Grand Canonical Ensembles

4.1. Entropy of the Grand Canonical Ensemble with Given Degree Distribution

4.2. Entropy of the Grand Canonical Ensemble with Given Latent Variable Distribution

5. Marginal Probability of a Link

5.1. The Case of the Grand Canonical Ensemble with Given Degree Distribution

5.2. The Case of the Grand Canonical Ensemble with Given Latent Variable Distribution

6. Generating Single Instances of Grand-Canonical Network Ensembles

6.1. Metropolis-Hastings Algorithm for the Grand-Canonical Ensemble with Given Degree Distribution

6.2. Monte Carlo Generation of Grand Canonical Network Ensemble with Given Latent Variable Distribution

7. Bayesian Estimation of the Network Parameters Given Partial Knowledge of the Network

7.1. Inferring the True Parameters with the Grand Canonical Ensemble with Given Degree Distribution

7.2. Inferring the True Parameters with the Grand Canonical Ensemble with Given Latent Variable Distribution

8. Conclusions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Derivation of ∑ ^ N (k,q|κ)

References

Article Metrics

Citations

Article Access Statistics

Appendix A. Derivation of ${\sum^{^}}_{N}$ (k,q|κ)