Next Article in Journal
Selected Methods for Designing Monetary and Fiscal Targeting Rules Within the Policy Mix Framework
Previous Article in Journal
Coupled Dynamics of Information–Epidemic Spreading with Resource Allocation and Transmission on Multi-Layer Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Set of Master Variables for the Two-Star Random Graph

by
Pawat Akara-pipattana
1 and
Oleg Evnin
2,3,*
1
Université Paris-Saclay, CNRS, LPTMS, 91405 Orsay, France
2
High Energy Physics Research Unit, Department of Physics, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
3
Theoretische Natuurkunde, Vrije Universiteit Brussel and International Solvay Institutes, 1050 Brussels, Belgium
*
Author to whom correspondence should be addressed.
Entropy 2025, 27(10), 1081; https://doi.org/10.3390/e27101081
Submission received: 23 September 2025 / Revised: 14 October 2025 / Accepted: 16 October 2025 / Published: 19 October 2025
(This article belongs to the Section Statistical Physics)

Abstract

The two-star random graph is the simplest exponential random graph model with nontrivial interactions between the graph edges. We propose a set of auxiliary variables that control the thermodynamic limit where the number of vertices N tends to infinity. Such ’master variables’ are usually highly desirable in treatments of ‘large N’ statistical field theory problems. For the dense regime when a finite fraction of all possible edges are filled, this construction recovers the mean-field solution of Park and Newman, but with explicit control over the 1 / N corrections. We use this advantage to compute the first subleading correction to the Park–Newman result, which encodes the finite, nonextensive contribution to the free energy. For the sparse regime with a finite mean degree, we obtain a very compact derivation of the Annibale–Courtney solution, originally developed with the use of functional integrals, which is comfortably bypassed in our treatment.

1. Introduction

Many quantum and statistical systems simplify when the number of degrees of freedom becomes large, for example the number of internal degrees of freedom of fields at each spacetime point, or the size of matrices, etc. One is often talking about ’large N’ limits [1], where N parametrizes the number of degrees of freedom. A ’holy grail’ problem for such large N limits is the search for ’master variables’ or ’master fields’, such that, when the original problem is reformulated through these variables, N becomes a numerical parameter in the action (or in the statistical probability distribution), and changing N no longer affects the set of the degrees of freedom involved. As this parameter tends to infinity, the large N limit is often controlled by an explicit saddle point in terms of the master variables, which condense, as a result, to definite values.
The master variables can be of a different nature than the original variables. In the simplest case of the O ( N ) vector model whose fundamental variables ϕ i are vectors under O ( N ) rotations, scalar fields constructed from these vectors condense to definite values at large N and play the role of master variables [1]. Similar ’scalar condensation’ via different applications of the Hubbard–Stratonovich transformation occurs in multi-matrix models with a large number of matrices transformed into one another by rotations [2,3,4], or in computations of random tensor eigenvalue distributions in a large number of dimensions [5,6,7]. The situation can be much more exotic, however. For example, for large random matrices of size N × N , the resolvents, which are functions of an external spectral parameter, condense to definite values [8,9]. Thus, while the original variables are numbers, the master variables are functions. This is especially visible in the corresponding treatments of sparse random matrix ensembles [10,11,12,13,14,15]: starting with random matrices, one arrives at explicit functional integrals in terms of the functional master variables that acquire a saddle point structure at large N. Finally, in quantum gauge theories, loop variables make a prominent appearance at large N and are believed to condense to definite values [16,17,18,19]. Those are functionals on the loop space, while the original variables are fields in the physical spacetime.
Motivated by this diversity of large N limits and the master variables that control them, we would like to revisit here the two-star random graph model. This is probably the simplest statistical graph model with nontrivial edge interactions and nontrivial thermodynamics, formulated as a Gibbs-like maximal entropy ensemble with controlled average number of edges and ’2-stars’ (which one can think of as the number of paths of length 2). By contrast, if one only controls the number of edges, one ends up with the edges being filled or not filled randomly and independently, which is known as the Erdös–Rényi random graph. This model is of fundamental importance in the physics of networks [20], providing a point of departure for more sophisticated constructions, but its thermodynamics is trivial (this is for the case of distinguishable vertices; thermodynamics of unlabeled graphs with indistinguishable vertices is nontrivial even within the Gibbs ensemble that only controls the number of edges [21,22]).
We thus turn to the two-star random graph on N vertices and look for a set of master variables that control the large N limit. One generally expects that the master variables are invariants of the symmetries of the theory, and here we must keep in mind that, unlike O ( N ) models, the matrix models defining random graph ensembles are typically invariant under vertex permutations, but not under any form of continuous rotations, leaving a much bigger set of invariants.
In the dense regime (a finite fraction of all possible edges occupied), a mean-field solution of the two-star model was developed by Park and Newman [23]. This solution is simple, elegant and manifestly correct given the comparisons with numerics, but it does not bring under explicit control the 1 / N corrections, which is where the master variables proposed here lead to improvement. In the sparse regime (a finite average degree), the model was solved using auxiliary fields by Annibale and Courtney [24]. This solution, however, goes through functional integrals at the intermediate stages. The master variables proposed here will recover it in a more compact and elementary fashion. (We mention for additional perspective the mathematical works [25,26] dealing with related topics, as well as some further relevant physics literature on the two-star model and its generalizations [27,28,29].)
Our treatment will proceed with defining the relevant master variables in Section 2, showing how they can be used to reproduce the Park–Newman mean-field solution in the dense regime in Section 3, spelling out the computation of 1 / N corrections to this solution in Section 4, followed by a very compact derivation of the large N solution in the sparse case in Section 5. We will give a physical interpretation of the condensation of master variables in this solution in Section 6, and then conclude with a brief summary and discussion.

2. The Two-Star Random Graph and Its Auxiliary Field Representation

For analytic considerations, one starts by representing graphs as adjacency matrices. For each graph on N vertices, define a real symmetric zero-diagonal N × N adjacency matrix A i j whose entries equal 1 if there exists an edge between vertices i and j, and 0 otherwise. The probability of each graph in the two-star model is then given by
P [ A ] = e H [ A ] / Z , Z A e H [ A ] , H [ A ] α k l A k l + β k l m A k l A l m ,
where A implies summing over all possible adjacency matrices, that is, over all graph configurations. This is a typical Gibbs-like maximal entropy ensemble with two ’fugacities’ α and β . Introducing the vertex degrees k j l A j l , one can understand j l A j l = j k j as twice the number of edges, while l j m A l j A j m = j k j 2 . These are the two quantities whose expectation values are controlled by their thermodynamic conjugates α and β . By linear redefinitions of α and β , one can equivalently control instead the number of edges and the number of ’two-star’ motifs (ordered triplets of distinct nodes with at least two edges connecting them), given by j k j ( k j 1 ) . While all of these definitions are physically equivalent, the conventions explicitly stated in (1) prove convenient for our subsequent analytic work.
To solve the ensemble (1) in the thermodynamic limit N , one needs to evaluate the partition function Z. Direct summation over A is impossible due to the quadratic nonlinearity in H. We start with a simple Hubbard–Stratonovich transform, as in [23]:
e β k m A k l A l m = 1 π d ϕ e ϕ 2 + 2 i ϕ β k A k l .
We need to introduce one such variable ϕ for each value of l = 1 N , obtaining
Z = 1 π N / 2 A d ϕ 1 d ϕ N e l ϕ l 2 e k < l A k l [ 2 α + 2 i ( ϕ k + ϕ l ) . β ] ,
where we expressed the exponent through the independent entries A k l with k < l . The summation over A k l with k < l then trivializes since the summand is factorized over the entries of the adjacency matrix. One thereby arrives at the following vector model in terms of ϕ l :
Z = 1 π N / 2 d ϕ 1 d ϕ N e k ϕ k 2 k < l 1 + e 2 α + 2 i ( ϕ k + ϕ l ) β ,
which is often more convenient to represent as
Z = 1 π N / 2 d ϕ 1 d ϕ N e k ϕ k 2 + S [ ϕ ] , S [ ϕ ] k < l log 1 + e 2 α + 2 i ( ϕ k + ϕ l ) β .
One expects that such vector models simplify when N is large. This is what in fact happens, though the details depend crucially on the N-scalings of the thermodynamic parameters α and β that determine whether the graph is in the sparse or dense regime (both of which are an option at large N, defining thus different large N limits of the model).
A mean-field solution of (5) in the dense regime was originally presented in [23]. The dense regime corresponds to α = O ( 1 ) , β = O ( 1 / N ) at N . In this regime, ϕ k condense to definite values of order N (in the current conventions) plus fluctuations of order 1. The idea of [23] is then to expand around this condensation point as ϕ k = N ϕ 0 + φ k and use the Taylor expansion for S:
k ϕ k 2 + S [ ϕ ] = S 0 + k ϕ k 2 + 1 2 k l 2 S ϕ k ϕ l φ k φ l + 1 6 k l m 3 S ϕ k ϕ l ϕ m φ l φ l φ m + .
The mean-field value ϕ 0 is chosen to ensure that the term linear in φ is absent. As an estimate for Z of (5), one then takes e S 0 times the Gaussian determinant from integrating the exponential of the quadratic terms in (6), with all the higher-order terms neglected.
All the indications are that the results of the above mean-field analysis are exact at N , and they compare very well with the corresponding numerics [23]. And yet, why could the higher-order terms in (6) be neglected? Most naively, differentiating S with respect to ϕ inserts powers of β , and since β = O ( 1 / N ) , this amounts to manifest negative powers of N, which is reassuring. This is, however, not yet the whole story. The number of variables grows with N, and in the standard Feynman-like expansion while evaluating (5) with (6) substituted, one will have multiple sums over indices ranging from 1 to N. This will introduce positive powers of N that will compete with the negative powers. (The ability of fluctuations in a large number of variables to compromise 1 / N expansions is highlighted in a similar context in Section 8.2 of [16]). More than that, if we repeatedly differentiate S with respect to the same component of ϕ , the resulting expression has N terms, contributing further positive factors of N.
Due to all the above ingredients, the story with higher-order corrections to [23], which we briefly sketch in Appendix A, appears somewhat bewildering. While the naive evaluation within the Gaussian approximation produces the correct result at large N that can be verified numerically, the higher-order corrections come with a swarm of positive and negative powers of N, and it is difficult to give a concise argument as to how exactly they are suppressed, and why they do not upset the Gaussian estimate (though it is known empirically that they do not). While one could attempt diagrammatic accounting of the powers of N in these corrections, somewhat in the spirit of random tensor considerations [30,31], it is anything but easy.
Instead of attempting diagrammatic analysis of (5) with the expansion (6), we shall follow here a different strategy, introducing a further set of auxiliary variables, somewhat akin to those recently used for analyzing graphs with prescribed degree sequences [15,32,33]. In this way, conventional saddle points will emerge that control the large N limit of (5), where N becomes a numerical parameter in the ’action,’ and the set of integration variables is N-independent. This will have other useful applications, besides elucidating the analytics behind the mean-field solution of [23].
We first write (5) as
Z = 1 π N / 2 k d ϕ k e ϕ k 2 1 + e 2 α + 4 i ϕ k β k , l = 1 N exp 1 2 log 1 + e 2 α + 2 i ( ϕ k + ϕ l ) β ,
and then expand the logarithm as log ( 1 + x ) = x x 2 / 2 + x 3 / 3 + to obtain
Z = 1 π N / 2 k d ϕ k e ϕ k 2 1 + e 2 α + 4 i ϕ k β k , l = 1 N exp J = 1 ( 1 ) J 2 J e J ( 2 α + 2 i ( ϕ k + ϕ l ) β ) = 1 π N / 2 k d ϕ k e ϕ k 2 1 + e 2 α + 4 i ϕ k β exp J = 1 ( 1 ) J e 2 α J 2 J k e 2 i J ϕ k β 2 .
An attractive feature of the last expression is that it can be completely factorized over ϕ k at the cost of introducing one more Hubbard–Stratonovich transformation with respect to the variables x J that couple to k e 2 i J ϕ k β . In this way, one gets
Z = 1 π N / 2 J = 1 d x J 2 π J e x J 2 / 2 J k d ϕ k e ϕ k 2 exp J = 1 i J + 1 e J α e 2 i J ϕ k β x J / J 1 + e 2 α + 4 i ϕ k β ,
or more suggestively, since all the ϕ k integrals are identical to each other,
Z = J = 1 d x J 2 π J e x J 2 / 2 J e N S [ x ] ,
e S [ x ] = 1 π d ϕ e ϕ 2 1 + e 2 α + 4 i ϕ β exp J = 1 i J + 1 J e J α x J e 2 i J ϕ β .
The integral in the last line gives an effective action S [ x ] , in terms of which one has to deal with a saddle point problem at large N, with N playing no more of a role than providing a large saddle point parameter.

3. The Dense Regime at Leading Order

We now turn to the dense regime of [23], where α 1 and β 1 / N , which we write as
β = B N .
These scalings are necessary to ensure a nontrivial infinite N limit with a finite mean connectivity (the fraction of all possible edges that are filled).
For the first-pass treatment in this section, as in [23], we only keep the extensive part of the free energy, that is, contributions to ( log Z ) / N that survive at N . For that, we compute e S up to the order O ( 1 ) at N , so that e N S is computed correctly up to factors that stay finite at N . In this way, we reproduce the solution of the model at the precision level of [23], while the subleading corrections will be discussed in the next section.
For constructing a saddle point estimate of (9) and (10), it is important to identify the N-scaling of the variables responsible for the dominant contribution. In view of the mean-field picture of [23] that has been verified by comparisons with numerics, the field ϕ condenses to values of order N . Since x J are Hubbard–Stratonovich conjugates of k e i J ϕ k β , they are expected to condense at values of order N, since the sum over k consists of N identical terms of order 1. We shall see that this regime ϕ N , x J N is indeed consistent and produces a saddle point estimate of Z with controlled corrections.
Motivated by the above picture, we write ϕ = i B N ϕ 0 + φ . We will choose ϕ 0 to ensure that the integral over φ is dominated by φ of order 1. Then, neglecting all terms suppressed by 1 / N ,
e S [ x ] = e N B ϕ 0 2 π 1 + e 2 α 4 B ϕ 0 d φ e φ 2 2 i B N ϕ 0 φ × exp J = 1 i J + 1 J e J ( α + 2 B ϕ 0 ) x J 1 + 2 i J φ B / N 2 J 2 φ 2 B / N .
We then choose ϕ 0 to ensure that the terms of order O ( N ) that are linear in φ cancel out, which would guarantee that only values of φ of order 1 contribute to the integral:
ϕ 0 = J = 1 i J + 1 e J ( α + 2 B ϕ 0 ) x J N .
One is then left with an elementary Gaussian integral over φ that yields
e S [ x ] = e N B ϕ 0 2 1 + e 2 α 4 B ϕ 0 exp J = 1 i J + 1 J e J ( α + 2 B ϕ 0 ) x J 1 + 2 B J = 1 i J + 1 J e J ( α + 2 B ϕ 0 ) x J N 1 / 2 .
The corrections to this formula are of order 1 / N , which will give at most multiplicative contributions of order O ( 1 ) in e N S , and those cannot affect the extensive part of the free energy.
From the above, we write S as
S = N S main + S corr , S main B ϕ 0 2 + J = 1 i J + 1 J e J ( α + 2 B ϕ 0 ) x J N ,
S corr 1 2 log 1 + 2 B J = 1 i J + 1 J e J ( α + 2 B ϕ 0 ) x J N 1 2 log 1 + e 2 α 4 B ϕ 0 .
where the leading terms of order N have been separated out explicitly, and ϕ 0 is a function of x J implicitly determined by (13). To construct a saddle point estimate of (9), we look for stationary points of N S [ x ] J x J 2 / 2 J , determined at leading order in N by
x K N 2 S main [ x ] J x J 2 2 J = 0 .
With the dominant terms in S collected in (15), this yields
x K / K = 2 N 2 B ϕ 0 ϕ 0 x K + N i K + 1 K e K ( α + 2 B ϕ 0 ) 2 B N ϕ 0 x K J = 1 i J + 1 e J ( α + 2 B ϕ 0 ) x J .
The terms with ϕ 0 / x K cancel out in view of (13), leaving the following saddle point configuration:
x K = X K N i K + 1 e K ( α + 2 B ϕ 0 ) .
Note that X K are consistently of order N. At this saddle point, (13) becomes an explicit equation for ϕ 0
ϕ 0 = J = 1 ( 1 ) J + 1 e J ( 2 α + 4 B ϕ 0 ) = 1 e 2 α + 4 B ϕ 0 + 1 ,
as in the mean-field solution of [23]. If we expand (9) around the saddle point configuration X K , the Gaussian integral over fluctuations is manifestly of order O ( 1 ) at large N, as we shall see more explicitly in the next section. Then, nonvanishing contributions to the free energy per vertex may only come from the saddle point exponential evaluated at X K , that is
log Z N = J X J 2 2 J N + N S main [ X ] + S corr [ X ] .
From (15), (16) and (18), this is evaluated as
log Z N = N B ϕ 0 2 + N 1 2 log 1 + e 2 α 4 B ϕ 0 1 2 log 1 + 2 B ϕ 0 ( 1 ϕ 0 ) .
where we have used the evident formulas
J = 1 ( 1 ) J + 1 J z J = log ( 1 + z ) , J = 1 ( 1 ) J + 1 z J = z 1 + z , J = 1 J ( 1 ) J + 1 z J = z ( 1 + z ) 2 ,
and also equation (13) to simplify the terms of order 1. The expression is identical to the result of [23] up to a change of notation: what we call α here is B in [23], and what we call B is J N / ( N 1 ) in [23].

4. The Dense Regime: Subleading Corrections

To keep the story clean, we shall take the saddle point values deduced in the previous section as an input for identifying the relevant point of expansion, and restart the derivation independently departing, once again, from the partition function (9) and (10). We will see that this results in an explicit controllable 1 / N expansion that we will process in a matter-of-fact manner.
In (9) and (10) with β = B / N , we introduce the following variable redefinitions motivated by the previous section:
ϕ i B N ϕ 0 + φ , ϕ 0 e 2 α + 4 B ϕ 0 + 1 1 ,
x K X K + χ K , X K N i K + 1 e K ( α + 2 B ϕ 0 ) .
To avoid clutter in the sums, we define the shorthand
c J X J J N = i J + 1 J e J ( α + 2 B ϕ 0 ) .
In the previous section, we computed S up to the order O ( 1 ) , and then the free energy is computed up to the order O ( N ) , since S enters the expression for Z as e N S and one has to take a logarithm of Z. In order to compute the next correction, which is the nonextensive contribution to the free energy of order O ( 1 ) , one correspondingly has to compute S up to the order O ( 1 / N ) . To do so, with the above notation, we rewrite (10) as
e S [ χ ] = e N B ϕ 0 2 d φ e φ 2 2 i B N ϕ 0 φ π 1 + e 2 α 4 B ϕ 0 e 4 i φ B / N exp J = 1 c J ( X J + χ J ) e 2 i J φ B / N .
We expand the denominator in the integrand as
1 + a e b / N 1 / 2 = ( 1 + a ) 1 / 2 1 a 2 ( 1 + a ) b N + a 2 2 a 8 ( 1 + a ) 2 b 2 N + ,
and, keeping in mind that X J N , expand the integrand up to order 1 / N . Importantly, terms involving φ N in the exponent cancel out because of our choice of ϕ 0 and X J , and one is left with the following Gaussian integral (where we omit odd powers of φ , which integrate to 0):
e S [ χ ] = e N B ϕ 0 2 + N J c J 2 J + J c J χ J π 1 + e 2 α 4 B ϕ 0 d φ e φ 2 1 + 2 B J c J 2 J 3 { 1 + O ( N 2 ) + 1 N [ 2 B φ 2 ( 2 e 2 α + 4 B ϕ 0 1 ) ( 1 + e 2 α + 4 B ϕ 0 ) 2 2 B φ 2 J c J J χ J 2 2 B φ 2 J c J J 2 χ J + 4 B φ 2 1 + e 2 α + 4 B ϕ 0 J c J J χ J + 2 B 2 φ 4 3 J c J 2 J 5 + 8 B 2 φ 4 3 J c J J χ J K c K 2 K 4 8 B 2 φ 4 3 ( 1 + e 2 α + 4 B ϕ 0 ) J c J 2 J 4 8 B 3 φ 6 9 J c J 2 J 4 2 ] } .
We will often encounter the following sums, for which we introduce explicit notation:
σ n J = 1 c J 2 J n + 1 = ( 1 ) J + 1 J ϕ 0 1 ϕ 0 J J n = x d d x n log ( 1 + x ) | x = ϕ 0 1 ϕ 0 .
Evaluating the φ -integral in (25) then yields
e S [ χ ] = e N B ϕ 0 2 + N σ 0 + J c J χ J ( 1 + e 2 α 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) { 1 + O ( N 2 ) + 1 N [ B ( 2 e 2 α + 4 B ϕ 0 1 ) ( 1 + e 2 α + 4 B ϕ 0 ) 2 ( 1 + 2 B σ 2 ) B J c J J χ J 2 1 + 2 B σ 2 B J c J J 2 χ J 1 + 2 B σ 2 + 2 B J c J J χ J ( 1 + e 2 α + 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) + B 2 σ 4 2 ( 1 + 2 B σ 2 ) 2 + 2 B 2 σ 3 J c J J χ J ( 1 + 2 B σ 2 ) 2 2 B 2 σ 3 ( 1 + e 2 α + 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) 2 5 B 3 σ 3 2 3 ( 1 + 2 B σ 2 ) 3 ] } .
Raising this expression to the power of N, so as to substitute it in (9), and making use of ( 1 + x / N ) N e x at large N, we arrive at
e N S [ χ ] = e N 2 B ϕ 0 2 + N 2 σ 0 + N J c J χ J ( 1 + e 2 α 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) N / 2 exp [ B ( 2 e 2 α + 4 B ϕ 0 1 ) ( 1 + e 2 α + 4 B ϕ 0 ) 2 ( 1 + 2 B σ 2 ) + B 2 σ 4 2 ( 1 + 2 B σ 2 ) 2 2 B 2 σ 3 ( 1 + e 2 α + 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) 2 5 B 3 σ 3 2 3 ( 1 + 2 B σ 2 ) 3 + 2 B 2 σ 3 J c J J χ J ( 1 + 2 B σ 2 ) 2 + 2 B J c J J χ J ( 1 + e 2 α + 4 B ϕ 0 ) ( 1 + 2 B σ 2 ) B J c J J 2 χ J 1 + 2 B σ 2 B J c J J χ J 2 1 + 2 B σ 2 ] ,
where all possible corrections to this expression are suppressed by powers of 1 / N . Evaluating σ n in terms of ϕ 0 from (26)—the explicit formulas are tabulated in Appendix B—and also keeping in mind that e 2 α + 4 B ϕ 0 = ( 1 ϕ 0 ) / ϕ 0 and 1 + e 2 α 4 B ϕ 0 = 1 / ( 1 ϕ 0 ) , we obtain
e N S [ χ ] = e N 2 B ϕ 0 2 ( 1 ϕ 0 ) N 2 + N / 2 e N J c J χ J [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] N / 2 exp [ B ϕ 0 ( 2 3 ϕ 0 ) 1 + 2 B ϕ 0 ( 1 ϕ 0 ) 2 B 2 ϕ 0 2 ( 1 ϕ 0 ) ( 1 2 ϕ 0 ) [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 + B 2 ϕ 0 ( 1 ϕ 0 ) ( 1 6 ϕ 0 + 6 ϕ 0 2 ) 2 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 5 B 3 ϕ 0 2 ( 1 ϕ 0 ) 2 ( 1 2 ϕ 0 ) 2 3 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 3 + 2 B ϕ 0 [ 1 + B ( 1 ϕ 0 ) ] J c J J χ J [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 B J c J J χ J 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) B J c J J 2 χ J 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] .
From this,
J x J 2 2 J + N S [ x ] = N 2 B ϕ 0 2 N ( N 1 ) 2 log ( 1 ϕ 0 ) N 2 log [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] + B ϕ 0 ( 2 3 ϕ 0 ) 1 + 2 B ϕ 0 ( 1 ϕ 0 ) + B 2 ϕ 0 ( 1 ϕ 0 ) ( 1 10 ϕ 0 + 14 ϕ 0 2 ) 2 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 5 B 3 ϕ 0 2 ( 1 ϕ 0 ) 2 ( 1 2 ϕ 0 ) 2 3 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 3 + 2 B ϕ 0 [ 1 + B ( 1 ϕ 0 ) ] J c J J χ J [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 B J c J J 2 χ J 1 + 2 B ϕ 0 ( 1 ϕ 0 ) B J c J J χ J 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) .
Importantly, the term involving N J c J χ J in the first line of (29) cancels out in (30). This leaves in (9) a Gaussian integral over χ J where all integration variables are only allowed to take values of order 1, leading to an explicit finite result. This reflects the suitability of our parametrization in (22) and (23). Had we chosen a different expansion point, large terms of the form N χ J would have compromised the usefulness of evaluating the Gaussian integrals at this stage, as they would induce rearrangements in the 1 / N corrections.
To evaluate (9), it remains to integrate e J x J 2 / 2 J + N S [ x ] over χ J . To do so, we employ the following Hubbard–Stratonovich transformation for the exponential of the last term in (30):
exp B J c J J χ J 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) = d y e y 2 π exp 2 y i B 1 + 2 B ϕ 0 ( 1 ϕ 0 ) J c J J χ J .
It is now straightforward, even if somewhat laborious, to evaluate the remaining Gaussian integral over χ J where the quantities σ n defined in (26) appear once again, and can thereafter be explicitly expressed through ϕ 0 . The result is
J = 1 d χ J e χ J 2 / 2 J 2 π J exp c J J 2 B ϕ 0 [ 1 + B ( 1 ϕ 0 ) ] [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 2 J B 1 + 2 B ϕ 0 ( 1 ϕ 0 ) + 2 y i B 1 + 2 B ϕ 0 ( 1 ϕ 0 ) χ J = exp [ B 2 ϕ 0 ( 1 ϕ 0 ) ( 1 10 ϕ 0 + 18 ϕ 0 2 ) + 8 B 3 ϕ 0 3 ( 1 ϕ 0 ) 2 ( 5 ϕ 0 2 ) + 8 B 4 ϕ 0 4 ( 1 ϕ 0 ) 3 ( 3 ϕ 0 1 ) 2 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 4 + 2 y i B 1 + 2 B ϕ 0 ( 1 ϕ 0 ) 3 / 2 ϕ 0 ( 1 ϕ 0 ) ( 4 ϕ 0 1 ) + 4 B ϕ 0 3 ( 1 ϕ 0 ) 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) 2 y 2 B ϕ 0 ( 1 ϕ 0 ) 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] .
We have once again used the expressions for sums of the form J c J 2 J n + 1 given in Appendix B. To undo the Hubbard–Stratonovich transformation in (31), we need to multiply the last line with e y 2 / π and integrate over y. This yields
d y π exp y 2 1 + 2 B σ 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) + 2 y i B 1 + 2 B ϕ 0 ( 1 ϕ 0 ) 3 / 2 ϕ 0 ( 1 ϕ 0 ) ( 4 ϕ 0 1 ) + 4 B 2 ϕ 0 3 ( 1 ϕ 0 ) 2 1 + 2 B ϕ 0 ( 1 ϕ 0 ) = exp B 3 ϕ 0 2 ( 1 ϕ 0 ) 2 [ 4 ϕ 0 1 + 4 B ϕ 0 2 ( 1 ϕ 0 ) ] 2 [ 1 + 2 B ϕ 0 ( 1 ϕ 0 ) ] 4 [ 1 + 4 B ϕ 0 ( 1 ϕ 0 ) ] 1 + 4 B ϕ 0 ( 1 ϕ 0 ) 1 + 2 B ϕ 0 ( 1 ϕ 0 ) 1 / 2 .
Finally, we need to gather all the contributions to Z of (9), consisting of the χ -independent part of (30), the second y-independent line of (32) and (33). This leaves for the free energy
F log Z = N 2 B ϕ 0 2 N ( N 1 ) 2 log ( 1 ϕ 0 ) N 2 log 1 + 2 Δ + 2 Δ B ϕ 0 2 1 + 2 Δ 5 B Δ 2 ( 1 2 ϕ 0 ) 2 3 ( 1 + 2 Δ ) 3 B Δ 2 [ 4 ϕ 0 ( 1 + Δ ) 1 ] 2 ( 1 + 2 Δ ) 4 ( 1 + 4 Δ ) 1 2 log 1 + 4 Δ 1 + 2 Δ + B Δ 1 10 ϕ 0 + 16 ϕ 0 2 + 2 Δ ( 1 14 ϕ 0 + 24 ϕ 0 2 ) + 2 Δ 2 ( 1 12 ϕ 0 + 20 ϕ 0 2 ) ( 1 + 2 Δ ) 4 .
with Δ B ϕ 0 ( 1 ϕ 0 ) introduced for compactness. The first line reflects the mean-field result of [23], while the subsequent two lines give the first nontrivial correction that the novel representations developed here allowed us to derive.
As the computations leading to (34) are rather convoluted, it is important to implement some independent checks. One such check is provided by the particle–hole duality respected by the two-star model: in the original partition function (1), one can change the summation variable from the adjacency matrix A to the ’inverted’ adjacency matrix, where all filled edges are replaced by empty ones and vice versa. This manifestly relates partition functions, and hence free energies, at two different values of α at any given N. As (34) is expressed through ϕ 0 ( α , B ) rather than α , one has to deal with the corresponding transformation for ϕ 0 that maps it to 1 ϕ 0 + O ( 1 / N ) . The subleading 1 / N corrections in this transformation mix the different orders of N in (34) and since the different contributions must cancel out in the end, this provides a nontrivial check of the subleading corrections in the last two lines of (34). Our formula passes this test. We provide an explicit implementation in Appendix C.
Further validation of (34) is provided by comparisons with numerical Monte Carlo sampling of the two-star model. We follow a strategy similar to what we have previously employed in [34] for more complicated related models, reviewed in Appendix D. In our comparisons between analytics and numerics, we focus on the degree variance. In terms of the free energy F log Z , the averages of the mean degree k j k j / N = k 1 and mean degree squared k 2 j k j 2 / N = k 1 2 are expressed as
k = 1 N F α , k 2 = F B .
To compute these derivatives, one also needs the derivatives of ϕ 0 with respect to α and B:
ϕ 0 α = 2 ϕ 0 ( 1 ϕ 0 ) 1 + 4 B ϕ 0 ( 1 ϕ 0 ) , ϕ 0 B = 4 ϕ 0 2 ( 1 ϕ 0 ) 1 + 4 B ϕ 0 ( 1 ϕ 0 ) .
Then, the variance is
V k 2 k 2 = F B 1 N F α 2 .
We separate out the contribution due to the extensive part of free energy, given by the first line of (34):
F 0 N 2 B ϕ 0 2 N ( N 1 ) 2 log ( 1 ϕ 0 ) N 2 log 1 + 2 B ϕ 0 ( 1 ϕ 0 ) .
The contribution to the degree variance V coming from F 0 is
F 0 B 1 N F 0 α 2 = N ϕ 0 ( 1 ϕ 0 ) 1 + 2 Δ ϕ 0 2 [ 2 B ϕ 0 ( ϕ 0 2 ) + 2 B + 1 ] 2 ( 1 + 2 Δ ) 2 ( 1 + 4 Δ ) 2 ,
with Δ = B ϕ 0 ( 1 ϕ 0 ) . The leading order variance
V 0 N ϕ 0 ( 1 ϕ 0 ) 1 + 2 Δ
matches the one derived in [23]. We are keeping the subleading order in (39) as it will combine with the higher-order corrections. The remaining nonextensive piece of the free energy (34) is
F c o r r 2 Δ B ϕ 0 2 1 + 2 Δ 5 B Δ 2 ( 1 2 ϕ 0 ) 2 3 ( 1 + 2 Δ ) 3 B Δ 2 [ 4 ϕ 0 ( 1 + Δ ) 1 ] 2 ( 1 + 2 Δ ) 4 ( 1 + 4 Δ ) 1 2 log 1 + 4 Δ 1 + 2 Δ + B Δ 1 10 ϕ 0 + 16 ϕ 0 2 + 2 Δ ( 1 14 ϕ 0 + 24 ϕ 0 2 ) + 2 Δ 2 ( 1 12 ϕ 0 + 20 ϕ 0 2 ) ( 1 + 2 Δ ) 4 .
Since this expression is bulky, it is more practical to use symbolic computation software for handling its derivatives, and we rely on SymPy [35] for this purpose, followed by evaluating the resulting expressions for each set of parameters. To verify (34) numerically, we then extract point-by-point the degree variance from Monte Carlo simulations, with V 0 subtracted, and compare it with
V c o r r F c o r r B 1 N F c o r r α 2 ϕ 0 2 [ 2 B ϕ 0 ( ϕ 0 2 ) + 2 B + 1 ] 2 ( 1 + 2 Δ ) 2 ( 1 + 4 Δ ) 2 .
In Figure 1, we present the results of this comparison, showing excellent agreement. We opt for the moderate number of vertices N = 200 as it allows us to keep the higher-order 1 / N corrections to the analytics sufficiently small, but at the same time the Monte Carlo equilibration for such moderately sized graphs happens sufficiently fast to attain a very high precision. This high precision is necessary since we are measuring a small subleading contribution to the variance. (We additionally display the results of numerical simulations for N = 50 .)
We have been dealing above with the dominant 1 / N correction to the mean-field result of [23]. If one is to compute further 1 / N corrections to the free energy, the process is completely algorithmic, even if it will become more and more laborious with each next order. Indeed, higher-order corrections will manifest themselves as contributions of order 1 / N 2 and smaller in the formula for e S on top of what is written explicitly in (27). All of these contributions are furthermore polynomial in χ J . Then, when writing e J x J 2 / 2 J + N S [ x ] in (9), these corrections will give contributions suppressed by 1 / N or more in the exponent, all of which can be re-expanded as additive contributions polynomial in χ J . Thus, one will end up with the same Gaussian integrals as in our treatment above, except that they will have extra polynomial insertions in terms of χ J suppressed by powers of N. All such Gaussian integrals with polynomial insertions can be evaluated using the standard formulas, leaving behind an explicit series in powers of 1 / N that will correct our result in (34).

5. The Sparse Regime

We now turn to the sparse regime,
α = 1 2 log N c , β = O ( 1 ) .
In this case, the J-sum in (10) works very differently from the dense case considered above, since e J α turns into powers of c / N , introducing stronger and stronger 1 / N suppression in contributions from higher J. This leads to considerable simplifications.
To identify the relevant scalings of x J in terms of powers of N explicitly, consider the 1 / N expansion of e S given by (10), similar to what has been employed in the previous sections, but with the new large-N scalings (43) of α and β :
e S [ x ] = 1 π d ϕ e ϕ 2 1 + c e 4 i ϕ β / N exp [ c N x 1 e 2 i ϕ β i c 2 N x 2 e 4 i ϕ β + 1 3 c N 3 / 2 x 3 e 6 i ϕ β + ] .
From this, we see that it is self-consistent to assume that x J 2 = O ( 1 ) . Indeed, if that is so, the leading contribution to e S from x J 2 scales as N J / 2 . When raised to the power N to obtain e N S in (9), this will yield exp [ O ( N 1 J / 2 ) ] , since [ 1 + O ( N γ ) ] N e O ( N 1 γ ) . Thus, for J 3 , the contribution of e N S in (9) is completely negligible at large N, and one is left with an empty Gaussian integral of e x J 2 / 2 J , so that the fluctuations of x J 3 are of order 1 as assumed. For x 2 , e N S gets a contribution of order 1 that should be integrated together with the Gaussian factor e x 2 2 / 4 in (9). This integral will just produce a factor of order 1 in the partition function, and hence a nonextensive contribution to the free energy that does not alter the expectation values of the vertex degrees and their squares at large N, so we can ignore it. The fluctuations of x 2 are again of order 1 as assumed. The only contribution where the structure is different, and it defines the end result, is that of x 1 . Indeed, assuming that x 1 is of order 1 is inconsistent, since that would have yielded e N S of the form e O ( N ) , and the Gaussian factor e x 1 2 / 2 would not have been able to keep the fluctuations at order 1 as assumed. The consistent assumption is that x 1 is of order N , since in that case, e S is of order 1, and hence e N S is of the form e O ( N ) , while the Gaussian factor e x 1 2 / 2 is of the same order, so that the two factors balance each other self-consistently producing an explicit saddle point, as we shall see immediately.
From the above estimates, for the leading order analysis, S will only depend on x 1 at large N, and to incorporate the relevant scaling of x 1 , we write x 1 x N / c . Then,
Z = N 2 π c d x e N [ S ( x , β ) x 2 / 2 c ] , S ( x , β ) log 1 π d ϕ exp ϕ 2 + x e 2 i ϕ β .
The first integral has an explicit saddle point structure at large N, and the saddle point is at x = X satisfying
X = c S x | x = X .
The free energy per vertex f satisfies
f log Z N = S ( X , β ) X 2 / 2 c ,
where X is the solution of (45), and we have only kept the contributions nonvanishing at large N. It is convenient to represent e S ( x ) as a series
e S ( x , β ) = 1 π d ϕ e ϕ 2 d = 0 x d d ! e 2 i d ϕ β = d = 0 x d d ! e β d 2 .
From (45),
X = c d = 1 X d 1 e β d 2 / ( d 1 ) ! d = 0 X d e β d 2 / d ! .
The mean degree is
k f α = d c d α f c = 2 c f c = X 2 c + 2 c X c X S ( X , β ) X 2 / 2 c .
The last term vanishes by the saddle point equation, giving
k = X 2 c .
At the same time,
k 2 f β = S ( X , β ) β X β X S ( X , β ) X 2 / 2 c .
Again, due to the saddle point equation,
k 2 = S ( X , β ) β = d = 1 d 2 X d e β d 2 / d ! d = 0 X d e β d 2 / d ! .
Expressing X from (50) and substituting it into (48), multiplied by X, and into (52), we obtain
k = d = 1 d c d / 2 k d / 2 e β d 2 / d ! d = 0 c d / 2 k d / 2 e β d 2 / d ! ,
k 2 = d = 1 d 2 c d / 2 k d / 2 e β d 2 / d ! d = 0 c d / 2 k d / 2 e β d 2 / d ! .
This is identical to the Annibale–Courtney solution of [24], but now derived using only elementary integrals, and in a few lines rather than a few pages. At any given c and β , (53) has to be solved as a nonlinear equation for k , whereupon k 2 is computed directly from (54).

6. The Sparse Regime: Condensation of the Master Variable

As we have seen in the previous section, the solution of the two-star model in the sparse regime is determined by a saddle point structure in terms of x 1 . The Park–Newman variables ϕ k , on the other hand, do not condense to definite values, since there is a saddle point structure in the integral defining Z in (44), but not in the integral defining S. Likewise, the vertex degrees do not condense, since both mean degree and degree fluctuations are of order 1.
To develop a physical intuition for the condensation of x 1 , it is natural to turn to its Hubbard–Stratonovich conjugate j e 2 i ϕ j β / N . We can compute the expectation value of this observable by inserting it into the partition function (4):
1 N j e 2 i ϕ j β = 1 π N / 2 N Z [ d ϕ n ] j e 2 i ϕ j β e k ϕ k 2 k < l 1 + e 2 α + 2 i ( ϕ k + ϕ l ) β .
This expression can be processed in two different ways. First, we can introduce the x J variables, pushing the structure toward (9) and (10), and then apply the saddle point analysis of Section 5. The only difference due to the new factor of j e 2 i ϕ j β is that we should multiply the integrand on the right-hand side of (8) with e α x 1 , since with this extra factor, integrating out x 1 precisely reproduces j e 2 i ϕ j β . In this way,
1 N j e 2 i ϕ j β = e α N Z J = 1 d x J 2 π J e x J 2 / 2 J x 1 e N S [ x ] = X c ,
where we have used x 1 x N / c and the fact that x condenses to the definite value X determined by (45), with fluctuations suppressed by 1 / N in view of the saddle point considerations of the previous section. Alternatively, we can take (55) and return to the original adjacency matrix variables defining the two-star model. We thus undo the step leading from (3) to (4) to obtain
1 N j e 2 i ϕ j β = 1 π N / 2 N Z A j [ d ϕ n ] e 2 i ϕ j β e l ϕ l 2 e k l A k l ( α + 2 i ϕ l β ) , = 1 N Z A j e β ( 2 k j + 1 ) e H [ A ] = e β N j e 2 β k j ,
where H [ A ] is the original two-star Hamiltonian (1) and k j l A j l are the vertex degrees. Note that all vertices are equivalent so we can write j e 2 β k j / N as simply e 2 β k . From (56) and (57), we then obtain
X = c e β e 2 β k .
In other words, the condensation value of the master variable x 1 that controls the sparse regime of the two-star model is proportional to the expectation value of a specific exponential function of the vertex degree in terms of the original random graph variables.
As a byproduct of these considerations, we arrive at the following curious thermodynamic relation: from (58) and (50),
e 2 β k = e β k c .
There is thus a relation, at large N and in thermodynamic equilibrium, between the expectation values of degrees and their exponentials.

7. Conclusions

We have revisited solutions of the two-star random graph model in the thermodynamic limit, in both dense and sparse regimes. Previous approaches relied on the variables ϕ k dating back to [23], which are Hubbard–Stratonovich conjugates of the vertex degrees. We pointed out that considerable empowerment of the formalism results from introducing further variables x J as in (8). These variables can be thought of as Hubbard–Stratonovich conjugates of k e 2 i J ϕ k β / N . The usage of such exponentials of the fields is ubiquitous in conformal field theories [36]—see [37] for a recent discussion—but to the best of our knowledge they have not appeared previously in studies of random graphs.
In the dense regime, it is known that the variables ϕ k condense themselves to definite values at large N. It is predictable that all variables x J condense to definite values as well, with small fluctuations. Our representation provides for effective control over these fluctuations, with a straightforward bookkeeping of the 1 / N factors, which is challenging in the language of ϕ k . We thus manage to compute the nonextensive part of the free energy in the dense regime. In the sparse regime, the situation is more peculiar: the variables ϕ k do not condense—the representation for S in (44) does not have a large N saddle point structure—but the variables x J do, leading to a simple saddle point calculation of the free energy. This calculation is phrased entirely in terms of elementary one-dimensional integrals. The condensation value of the x-variable determining the large N behavior in the sparse regime can be expressed in terms of the original graph geometry as the expectation value of a certain exponential of the vertex degree given by (58).
We schematically summarize the interplay between our work and the previous treatments in [23,24] in Table 1. (We stress that this table only shows the aspects of [23,24] relevant for the considerations here, while those papers additionally consider many other important questions, in particular, in relation to the phenomenology of phase transitions in two-star graphs.)
Besides elucidating the analytic structure of the two-star model, we hope that the techniques presented here will usefully transfer to further related settings. A number of interesting extensions of the two-star model can be seen in the literature [38,39,40,41,42], and they provide an attractive avenue for exploration in this regard.

Author Contributions

Conceptualization, O.E.; Methodology, O.E.; Formal analysis, P.A.-p. and O.E.; Investigation, P.A.-p. and O.E.; Data curation, P.A.-p.; Writing—original draft, P.A.-p. and O.E.; Writing—review & editing, O.E. All authors have read and agreed to the published version of the manuscript.

Funding

O.E. is supported by the C2F program at Chulalongkorn University and by NSRF via grant number B41G680029.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Acknowledgments

We thank Sergei Nechaev for valuable discussions on closely related topics.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Naive Expansion Around the Park–Newman Solution

To illustrate the power of our formalism, we would like to briefly contrast it with naive expansion around the Park–Newman solution [23]. In our notation, the representation for the partition function used for obtaining that solution is given by (5), which we reproduce here for convenience:
Z = 1 π N / 2 d ϕ 1 d ϕ N e k ϕ k 2 + S [ ϕ ] , S [ ϕ ] k < l log 1 + e 2 α + 2 i ( ϕ k + ϕ l ) B / N .
To derive the mean-field solution, one assumes that the dominant configuration is site-independent, ϕ k = i B N ϕ 0 , where ϕ 0 satisfies the same equation we derived in Section 3:
ϕ 0 = 1 e 2 α + 4 B ϕ 0 + 1 .
One then expands as ϕ k = i B N ϕ 0 + φ k and writes
k ϕ k 2 + S [ ϕ ] = S 0 + k φ k 2 + 1 2 k l 2 S ϕ k ϕ l φ k φ l + 1 6 k l m 3 S ϕ k ϕ l ϕ m φ l φ l φ m + .
Equation (A2) ensures that there are no terms linear in φ k in this expansion. Then, the exponential of all of these terms starting with the cubic one should be expanded in a Taylor series, so that a Gaussian integral over φ k emerges from (A1) with an infinite series of polynomial insertions.
To get a sense of how this expansion works, we write out the general formulas for the derivative tensors
n S ϕ j 1 ϕ j n = k < l d n d x n log 1 + e x | x = 2 α + 2 i ( ϕ k + ϕ l ) B / N m = 1 n 2 i B N ( δ k j m + δ l j m ) .
Notice the Kronecker symbols involving only the two indices k, l and one external index j m , so that the derivative tensors vanish unless there are at most two distinct values present among the indices j m . The first few orders evaluated at the saddle point are
2 S ϕ k ϕ l = 4 B N ϕ 0 ( 1 ϕ 0 ) for i j , 4 B ( N 1 ) N ϕ 0 ( 1 ϕ 0 ) for i = j ; 3 S ϕ k ϕ l ϕ m = 0 with more than 2 distinct values in { k , l , m } , 8 i B N 3 / 2 ϕ 0 ( 1 ϕ 0 ) ( 1 2 ϕ 0 ) with 2 distinct values in { k , l , m } , 8 i B N 3 / 2 ( N 1 ) ϕ 0 ( 1 ϕ 0 ) ( 1 2 ϕ 0 ) all indices are the same ; 4 S ϕ k ϕ l ϕ m ϕ n = 0 with more than 2 distinct values in { k , l , m , n } , 16 B 2 N 2 ϕ 0 ( 1 ϕ 0 ) ( 1 6 ϕ 0 + 6 ϕ 0 2 ) with 2 distinct values in { k , l , m , n } , 16 B 2 N 2 ( N 1 ) ϕ 0 ( 1 ϕ 0 ) ( 1 6 ϕ 0 + 6 ϕ 0 2 ) all indices are the same .
One can in principle build the standard Feynman perturbation theory from this expansion, with the inverse of the matrix 2 S / ϕ k ϕ l used as the propagator, and the higher-order terms in the φ -expansion used as vertices. However, the expressions for the diagrams will involve summations over indices taking values from 1 to N, contributing positive powers of N that will compete with the negative powers seen in the denominators of the derivative formulas above. Furthermore, the powers of N, both in the propagator and the vertices, depend on the number of coincident indices. This leads to complicated accounting of the powers of N. By contrast, with the variables x J and the representation (9) and (10) used in the main text, N is a purely numerical parameter in the action, and the powers of N are recovered straightforwardly—essentially, by means of Taylor expansions.

Appendix B. Some Useful Formulas

While the expressions below can be straightforwardly derived from (26), they are used extensively in our manipulations and we summarize them for the convenience of the reader:
σ 0 = J = 1 c J 2 J = log ( 1 ϕ 0 ) ,
σ 1 = J = 1 c J 2 J 2 = 1 1 + e 2 α + 4 B ϕ 0 = ϕ 0 ,
σ 2 = J = 1 c J 2 J 3 = ϕ 0 ( 1 ϕ 0 ) ,
σ 3 = J = 1 c J 2 J 4 = ϕ 0 ( 1 ϕ 0 ) ( 1 2 ϕ 0 ) ,
σ 4 = J = 1 c J 2 J 5 = ϕ 0 ( 1 ϕ 0 ) ( 1 6 ϕ 0 + 6 ϕ 0 2 ) .

Appendix C. Particle–Hole Duality

The partition function of the two-star model is defined as
Z A e H [ A ] , H [ A ] α k l A k l + β k l m A k l A l m ,
We want to change the summation variable as A j k = 1 A ˜ j k δ j k , which precisely corresponds to flipping the state of all the edges (filled to empty, empty to filled). Then,
H [ A ( A ˜ ) ] = α 2 β ( N 1 ) k l A ˜ k l + β k l m A ˜ k l A ˜ l m + α N ( N 1 ) + β ( N 3 2 N 2 + N ) .
This means that under this transformation, in the dense regime (substitute β = B / N ), the free energy F log Z changes as
F ( α , B ) = α N ( N 1 ) + B ( N 1 ) 2 + F α 2 B + 2 B N , B .
Since we express our results as functions of ϕ 0 ( α , B ) defined by (22), rather than as functions of α and B, it is handy to correspondingly recast the transformation α α ˜ α 2 B + 2 B N as ϕ 0 ϕ 0 ˜ . To do so, we first express α through ϕ 0 as
α = 1 2 log 1 ϕ 0 ϕ 0 2 B ϕ 0 ,
enact the transformation α α ˜ , and then write the equation defining ϕ ˜ 0 through α ˜ :
ϕ ˜ 0 = 1 1 + e 2 α ˜ + 4 B ϕ ˜ 0 = 1 1 + ϕ 0 1 ϕ 0 e 4 B ( 1 ϕ 0 ) + 4 B / N e 4 B ϕ 0 ˜ .
This equation can be solved in terms of an 1 / N expansion as
ϕ ˜ 0 1 ϕ 0 1 N 4 B ϕ 0 ( 1 ϕ 0 ) 1 + 4 B ϕ 0 ( 1 ϕ 0 ) 1 N 2 8 ϕ 0 B 2 ( 1 ϕ 0 ) ( 1 2 ϕ 0 ) [ 1 + 4 B ϕ 0 ( 1 ϕ 0 ) ] 3 + .
With this transformation, we write
F ( ϕ 0 , B ) = N ( N 1 ) 1 2 log 1 ϕ 0 ϕ 0 2 B ϕ 0 + B ( N 1 ) 2 + F ϕ ˜ 0 ( ϕ 0 , B ) , B .
This relation is respected by our result (34) up to order O ( 1 ) , providing a validation of our derivations.

Appendix D. Monte Carlo Sampling

We sample graphs from the two-star ensemble using a particular version of Metropolis Monte Carlo algorithm that we have previously employed in [34] for related models. We summarize it below. General discussion can be found in [43].
We start with an adjacency matrix of an Erdös–Rényi graph generated by randomly connecting any two vertices with probability 1 / 2 . Then, for each Monte Carlo step, we propose an update move which flips the current edge (filled to empty, empty to filled) for a randomly picked pair of vertices ( i , j ) . The change in the adjacency matrix can be formally written as
Δ i j A m n = ( δ i m δ j n + δ i n δ j m ) ( 1 2 A i j ) .
This move is then accepted with probability min 1 , e H ( A ) H ( A + Δ i j A ) as per the usual Metropolis algorithm.
Computing the full Hamiltonian takes a number of operations of order O ( N 2 ) , which can be time-consuming when one has to take a large number of samples from a big system. One can optimize the numerical process by computing only the change of the Hamiltonian from each proposed update move. For this, one needs to compute the change in the sum of degrees k d k = k l A k l , and in k d k 2 = k l m A k l A l m . From (A15), the change in degree of node m is
Δ i j d m = ( δ i m + δ j m ) ( 1 2 A i j ) .
It follows that
Δ i j m d m = 2 ( 1 2 A i j ) , Δ i j m d m 2 = 2 ( 1 2 A i j ) ( d i + d j ) + 1 .
The change in the Hamiltonian is thus
Δ i j H ( A ) = H ( A + Δ i j A ) H ( A ) = 2 ( 1 2 A i j ) ( α + β ( d i + d j ) + β ) .
This allows one to compute the acceptance probability with a number of operations of order 1.
We compute the mean degree k and the mean degree squared k 2 by averaging over 5 × 10 5 Monte Carlo sample points, which are taken at intervals of N 2 / 10 elementary Monte Carlo steps with the update rule described above. The long interval between the samples ensures that a significant part of the graph is updated by the stochastic evolution before each next sample is taken.

References

  1. Moshe, M.; Zinn-Justin, J. Quantum field theory in the large N limit: A review. Phys. Rep. 2003, 385, 69–228. [Google Scholar] [CrossRef]
  2. Hotta, T.; Nishimura, J.; Tsuchiya, A. Dynamical aspects of large N reduced models. Nucl. Phys. B 1999, 545, 543–575. [Google Scholar] [CrossRef]
  3. Mandal, G.; Mahato, M.; Morita, T. Phases of one dimensional large N gauge theory in a 1/D expansion. J. High Energy Phys. 2010, 2010, 34. [Google Scholar] [CrossRef]
  4. Evnin, O. Randomized Wilson loops, reduced models and the large D expansion. Nucl. Phys. B 2011, 853, 461–474. [Google Scholar] [CrossRef]
  5. Sasakura, N. Real eigenvector distributions of random tensors with backgrounds and random deviations. Prog. Theor. Exp. Phys. 2023, 2023, 123A01. [Google Scholar] [CrossRef]
  6. Delporte, N.; Sasakura, N. The edge of random tensor eigenvalues with deviation. J. High Energy Phys. 2025, 2025, 71. [Google Scholar] [CrossRef]
  7. Majumder, S.; Sasakura, N. Three cases of complex eigenvalue/vector distributions of symmetric order-three random tensors. Prog. Theor. Exp. Phys. 2024, 2024, 093A01. [Google Scholar] [CrossRef]
  8. Staudacher, M. Combinatorial solution of the two-matrix model. Phys. Lett. B 1993, 305, 332–338. [Google Scholar] [CrossRef]
  9. Eynard, B. Master loop equations, free energy and correlations for the chain of matrices. J. High Energy Phys. 2003, 2003, 18. [Google Scholar] [CrossRef]
  10. Rodgers, G.J.; Bray, A.J. Density of states of a sparse random matrix. Phys. Rev. B 1988, 37, 3557. [Google Scholar] [CrossRef]
  11. Bray, A.J.; Rodgers, G.J. Diffusion in a sparsely connected space: A model for glassy relaxation. Phys. Rev. B 1988, 38, 11461. [Google Scholar] [CrossRef]
  12. Fyodorov, Y.V.; Mirlin, A.D. On the density of states of sparse random matrices. J. Phys. A 1991, 24, 2219. [Google Scholar] [CrossRef]
  13. Mirlin, A.D.; Fyodorov, Y.V. Universality of level correlation function of sparse random matrices. J. Phys. A 1991, 24, 2273. [Google Scholar] [CrossRef]
  14. Akara-pipattana, P.; Evnin, O. Random matrices with row constraints and eigenvalue distributions of graph Laplacians. J. Phys. A 2023, 56, 295001. [Google Scholar] [CrossRef]
  15. Akara-pipattana, P.; Evnin, O. Statistical field theory of random graphs with prescribed degrees. J. Stat. Mech. 2025, 2025, 053402. [Google Scholar] [CrossRef]
  16. Polyakov, A.M. Gauge Fields and Strings; Routledge: London, UK, 1987. [Google Scholar] [CrossRef]
  17. Das, S.R. Some aspects of large N theories. Rev. Mod. Phys. 1987, 59, 235. [Google Scholar] [CrossRef]
  18. Makeenko, Y. Methods of Contemporary Gauge Theory; Cambridge University Press: Cambridge, UK, 2002. [Google Scholar] [CrossRef]
  19. Shen, H.; Smith, S.A.; Zhu, R. A new derivation of the finite N master loop equation for lattice Yang-Mills. Electr. J. Prob. 2023, 29, 1–18. [Google Scholar] [CrossRef]
  20. Newman, M. Networks: An Introduction; Oxford University Press: Oxford, UK, 2018. [Google Scholar] [CrossRef]
  21. Paton, J.; Hartle, H.; Stepanyants, H.; Hoorn, P.v.; Krioukov, D. Entropy of labeled versus unlabeled networks. Phys. Rev. E 2022, 106, 054308. [Google Scholar] [CrossRef] [PubMed]
  22. Evnin, O.; Krioukov, D. Ensemble inequivalence and phase transitions in unlabeled networks. Phys. Rev. Lett. 2025, 134, 207401. [Google Scholar] [CrossRef]
  23. Park, J.; Newman, M.E.J. Solution of the two-star model of a network. Phys. Rev. E 2004, 70, 066146. [Google Scholar] [CrossRef] [PubMed]
  24. Annibale, A.; Courtney, O.T. The two-star model: Exact solution in the sparse regime and condensation transition. J. Phys. A 2015, 48, 365001. [Google Scholar] [CrossRef]
  25. Radin, C.; Yin, M. Phase transitions in exponential random graphs. Ann. App. Prob. 2013, 23, 2458–2471. [Google Scholar] [CrossRef]
  26. Mukherjee, S.; Xu, Y. Statistics of the two-star ERGM. Bernoulli 2023, 29, 24–51. [Google Scholar] [CrossRef]
  27. Bizhani, G.; Paczuski, M.; Grassberger, P. Discontinuous percolation transitions in epidemic processes, surface depinning in random media, and Hamiltonian random graphs. Phys. Rev. E 2012, 86, 011128. [Google Scholar] [CrossRef]
  28. Gorsky, A.; Valba, O. Finite-size effects in exponential random graphs. J. Compl. Net. 2020, 8, cnaa008. [Google Scholar] [CrossRef]
  29. Biondini, G.; Moro, A.; Prinari, B.; Senkevich, O. p-star models, mean field random networks and the heat hierarchy. Phys. Rev. E 2022, 105, 014306. [Google Scholar] [CrossRef] [PubMed]
  30. Gurau, R.; Ryan, J.P. Colored tensor models—A review. SIGMA 2012, 8, 20. [Google Scholar] [CrossRef]
  31. Gurău, R. Random Tensors; Oxford University Press: Oxford, UK, 2017. [Google Scholar] [CrossRef]
  32. Kawamoto, T. Entropy of microcanonical finite-graph ensembles. J. Phys. Compl. 2023, 4, 035005. [Google Scholar] [CrossRef]
  33. Evnin, O.; Horinouchi, W. A Gaussian integral that counts regular graphs. J. Math. Phys. 2024, 65, 093301. [Google Scholar] [CrossRef]
  34. Akara-pipattana, P.; Chotibut, T.; Evnin, O. The birth of geometry in exponential random graphs. J. Phys. A 2021, 54, 425001. [Google Scholar] [CrossRef]
  35. Meurer, A.; Smith, C.P.; Paprocki, M.; Čertík, O.; Kirpichev, S.B.; Rocklin, M.; Kumar, A.M.; Ivanov, S.; Moore, J.K.; Singh, S.; et al. SymPy: Symbolic computing in Python. PeerJ Comput. Sci. 2017, 3, e103. [Google Scholar] [CrossRef]
  36. Francesco, P.; Mathieu, P.; Sénéchal, D. Conformal Field Theory; Section 6; Springer: Berlin/Heidelberg, Germany, 1997. [Google Scholar] [CrossRef]
  37. Benedetti, D.; Lauria, E.; Mazáč, D.; van Vliet, P. One-dimensional Ising model with 1/r1.99 interaction. Phys. Rev. Lett. 2025, 134, 201602. [Google Scholar] [CrossRef] [PubMed]
  38. Snijders, T.A.B.; Pattison, P.E.; Robins, G.L.; Handcock, M.S. New specifications for exponential random graph models. Socio. Meth. 2006, 36, 99. [Google Scholar] [CrossRef]
  39. Avetisov, V.; Gorsky, A.; Maslov, S.; Nechaev, S.; Valba, O. Phase transitions in social networks inspired by the Schelling model. Phys. Rev. E 2018, 98, 032308. [Google Scholar] [CrossRef]
  40. Bolfe, M.; Metz, F.L.; Guzmán-González, E.; Castillo, I.P. Analytic solution of the two-star model with correlated degrees. Phys. Rev. E 2021, 104, 014147. [Google Scholar] [CrossRef]
  41. Catanzaro, A.; Patil, S.; Garlaschelli, D. A general solution for network models with pairwise edge coupling. arXiv 2025, arXiv:2503.20805. [Google Scholar] [CrossRef]
  42. Marzi, M.; Giuffrida, F.; Garlaschelli, D.; Squartini, T. Reproducing the first and second moment of empirical degree distributions. arXiv 2025, arXiv:2505.10373. [Google Scholar] [CrossRef]
  43. Newman, M.E.J.; Barkema, G.T. Monte Carlo Methods in Statistical Physics; Oxford University Press: Oxford, UK, 1999. [Google Scholar] [CrossRef]
Figure 1. The difference δ V between degree variance and the leading order prediction V 0 given by (40) as a function of α . The crosses represent numerical measurements from Monte Carlo sampling for graphs with N = 200 vertices; the dots are the same for N = 50 vertices, deviating from the analytic curve slightly stronger because of higher 1 / N corrections, while the solid lines are the analytic prediction (42). Two different values are presented: B = 1.3 on the left and B = 2 on the right.
Figure 1. The difference δ V between degree variance and the leading order prediction V 0 given by (40) as a function of α . The crosses represent numerical measurements from Monte Carlo sampling for graphs with N = 200 vertices; the dots are the same for N = 50 vertices, deviating from the analytic curve slightly stronger because of higher 1 / N corrections, while the solid lines are the analytic prediction (42). Two different values are presented: B = 1.3 on the left and B = 2 on the right.
Entropy 27 01081 g001
Table 1. Comparison with the previously explored approaches:
Table 1. Comparison with the previously explored approaches:
Park–Newman [23]Annibale–Courtney [24]Current Treatment
Mean-field dense regime
Systematic 1 / N corrections
Sparse regime
Functional integrals avoided
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Akara-pipattana, P.; Evnin, O. A Set of Master Variables for the Two-Star Random Graph. Entropy 2025, 27, 1081. https://doi.org/10.3390/e27101081

AMA Style

Akara-pipattana P, Evnin O. A Set of Master Variables for the Two-Star Random Graph. Entropy. 2025; 27(10):1081. https://doi.org/10.3390/e27101081

Chicago/Turabian Style

Akara-pipattana, Pawat, and Oleg Evnin. 2025. "A Set of Master Variables for the Two-Star Random Graph" Entropy 27, no. 10: 1081. https://doi.org/10.3390/e27101081

APA Style

Akara-pipattana, P., & Evnin, O. (2025). A Set of Master Variables for the Two-Star Random Graph. Entropy, 27(10), 1081. https://doi.org/10.3390/e27101081

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop