Next Article in Journal
Sequential Discrimination of Mixed Quantum States
Previous Article in Journal
Maxwellian Distribution-Based Hall Transport Coefficients for Charged Particles in Magnetic Disk Array
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Interaction Order Estimation in Tensor Curie–Weiss Models

by
Somabha Mukherjee
Department of Statistics and Data Science, Faculty of Science, National University of Singapore, Singapore 117546, Singapore
Entropy 2025, 27(3), 245; https://doi.org/10.3390/e27030245
Submission received: 22 January 2025 / Revised: 23 February 2025 / Accepted: 24 February 2025 / Published: 27 February 2025
(This article belongs to the Special Issue Ising Model—100 Years Old and Still Attractive)

Abstract

:
In this paper, we consider the problem of estimating the interaction parameter p of a p-spin Curie–Weiss model at inverse temperature β , given a single observation from this model. We show, by a contiguity argument, that joint estimation of the parameters β and p is impossible, which implies that the estimation of p is impossible if β is unknown. These impossibility results are also extended to the more general p-spin Erdös–Rényi Ising model. The situation is more delicate when β is known. In this case, we show that there exists an increasing threshold function β * ( p ) , such that for all β , consistent estimation of p is impossible when β * ( p ) > β , and for almost all β , consistent estimation of p is possible for β * ( p ) < β .

1. Introduction

The Ising model [1] was originally introduced in Physics as a model for ferromagnetism, and has since then found numerous interesting applications in diverse areas, such as image processing [2], neural networks [3], spatial statistics [4], and disease mapping in epidemiology [5]. The classical (2-spin) Ising model is a discrete probability measure on the set of all binary strings of a fixed length, given by the following:
P β ( x ) : = exp β H N ( x ) Z N ( β ) for x { 1 , 1 } N
where the Hamiltonian  H N ( x ) is given by
H N ( x ) : = 1 i , j N J i , j x i x j ,
J : = ( ( J i , j ) ) 1 i , j N is the interaction matrix (often taken as the scaled adjacency matrix of some graph), β > 0 denotes the interaction strength (called inverse temperature in the physics literature), and Z N ( β ) is the normalizing constant, needed to ensure that the probabilities in (1) add up to 1 (referred to as the partition function in Physics). The Hamiltonian 1 i , j N J i , j x i x j of the model (1) only captures pairwise dependence between the binary variables, arising from an underlying network/interaction structure. Unfortunately, in most real-life scenarios, pairwise interactions are not enough to explain all of the complex dependencies arising in network data, and one has to take into account higher-order interactions arising from peer-group effects. Multi-body interactions are also common in many other branches of science; for example in Chemistry [6], it is known that the atoms on a crystal surface do not just interact in pairs, but in triangles, quadruples, and higher-order tuples. A natural extension of the classical 2-spin Ising model with a focus on capturing higher-order dependencies is the p-spin Ising model [7,8,9,10], where the quadratic interaction term in the sufficient statistic is replaced by a multilinear polynomial of degree p 2 . The probability mass function of this model is given by the following:
P β , p ( x ) : = exp β H N ( x ) Z N ( β , p ) for x { 1 , 1 } N
where the Hamiltonian H N ( x ) is given by the following:
H N ( x ) : = 1 i 1 , , i p N J i 1 i p x i 1 x i p ,
J : = ( ( J i 1 i p ) ) ( i 1 , , i p ) [ N ] p is the interaction tensor (often taken as the scaled adjacency tensor of a hypergraph), β > 0 denotes the interaction strength (or inverse temperature), p is the interaction order, and Z N ( β , p ) is the normalizing constant (or partition function).
The p-spin Ising models have applications, including their role in explaining the microscopic theory of magnetism in solid and fluid 3He films absorbed on the surface of graphite (see [11]). They also serve as important resources for analog quantum simulation and quantum computation, contribute to the mapping of fermions to artificial spin systems in quantum algorithms for quantum chemistry, and appear in spin models of cuprate superconductor Hamiltonians (see [12] and the references therein). Furthermore, they play crucial roles in error suppression schemes for quantum annealers, adiabatic topological quantum computation, and other related areas. p-spin Ising and spin glass models have also appeared in a number of classical and recent works, such as [7,8,10,13,14,15,16,17,18] and p-spin versions of the closely related Potts model have appeared in [19,20,21].
The estimation of the parameter β in the model (2), assuming that the interaction order p is known, has been studied exptensively in the past (see, for example, [22,23,24] for the case p = 2 and [9,10,16] for the case p 3 ). For the p = 2 case, N -consistency of the so called maximum pseudolikelihood estimator (MPLE) of β under some general assumptions on the model (2) was first established in [23]. These assumptions can be verified easily for Ising models on many dense graphs at low temperatures (high values of β ), and for Ising models on bounded-degree graphs as well as popular spin-glass models such as the Sherrington-Kirkpatrick model at all temperatures. Some of these results were extended in [22] where different rates of consistency were obtained for Ising models on sparse graphs, such as the sparse Erdös–Rényi graphs or sparse regular graphs, even at high temperatures. In addition, inestimability of β was established in [22] for Ising models on a sequence of dense graphs converging to a graphon for all β below a certain threshold, given by the inverse spectral norm of the limiting graphon. Precise central limit theorems for the maximum likelihood estimator (MLE) of β were derived for the 2-spin Ising model on complete graphs (also known as the Curie–Weiss model) in [25]. Some of these results have been extended in [9,10,16] for the case p 3 .
Now, suppose that one is given a sample X from a p-spin Ising model (2), with no other information on any of the parameters. A natural question is whether it is possible to estimate the interaction order p of the model. This is highly relevant from a practical perspective, as in many real-life applications such as recommender systems, strong peer-group effects exist, and two-interaction models are known to fit the data much worse than higher-order interaction models. For example, in [26], it was empirically shown that the 3-spin Ising model fitted the Last.fm music (accessed on 9 December 2020) (http://millionsongdataset.com/lastfm/) much better than the classical 2-spin model, thereby indicating the presence of complex peer-group effects in this dataset. This is a fan network database for a number of popular artists and bands, and the data consist of a list of binary opinions from the fan base for each artist, indicating whether each user is a fan of that artist or not as well as the user friendship network. The exact procedure involved first estimating the parameter β assuming a 2-spin Ising model from the data using the maximum pseudolikelihood estimator, followed by simulating a number of samples from the corresponding 2-spin Ising model with the estimated β as the parameter. It was observed that for most artists under consideration, the true Hamiltonian from the data lies outside the 2.5 % and 97.5 % quantiles of the histogram of the sampled Hamiltonians, thereby suggesting misfit. When a 3-spin Ising model was fit with the triangles in the user network as the hyperedges, the true Hamiltonian actually fell comfortably within the bulk of the histogram, thereby suggesting a good fit. In a different context, in [11], the authors mentioned that experimental and theoretical data indicated the relevance of three, four, five and six-spin exchange processes over a wide density range in adsorbed 3He films.
One might, in such situations, be interested in figuring out a systematic way of determining the value of the interaction order p for which the corresponding p-tuple interaction model fits the data best. To the best of our knowledge, this reverse problem of estimating the interaction order p given a single observation from the model (2) even with known β has not been addressed in the literature before. This question may be quite difficult for arbitrary underlying interaction tensors J , which necessitates some convenient structural assumptions on this tensor.
In this paper, we assume that tensor J has all entries equal to N 1 p , which corresponds to the p-spin Curie–Weiss model [7,9,27], given by the following:
P β , p ( x ) : = exp β N x ¯ p Z N ( β , p ) for x { 1 , 1 } N
where x ¯ : = 1 N i = 1 N x i . Even under this structural assumption, the possibility of estimating p depends on whether the interaction strength β is known or not. We will show that consistent estimation of p is impossible if β is unknown, which will be a consequence of our argument on the impossibility of the joint consistent estimation of β and p. At the heart of these impossibility results is the fact that the sufficient statistic X ¯ in the model (3) converges to a mixture of point masses at the maximizers of a certain function H β , p , and that these maximizers do not uniquely determine the tuple ( β , p ) . This idea is formalized by a contiguity argument between the Curie–Weiss measures and N-fold products of Rademacher distributions, with the largest maximizer of H β , p as the mean. It should be mentioned here that the related problem of joint inestimability of ( β , h ) for the classical 2-spin Curie–Weiss model with an additional magnetic field parameter h was addressed in [24] using similar contiguity arguments. We also extend these impossibility results to the more general p-spin Erdös–Rényi Ising models.
The situation is more intricate when β is known. In this case, we will show that there exists a strictly increasing threshold function β * ( p ) , such that the consistent estimation of p is impossible for β * ( p ) > β . However, for almost all β (to be precise, for all but possibly countably many β ), p can be estimated consistently whenever β * ( p ) < β . The question of exactly describing the exceptional set of countably many β s for which the region β * ( p ) < β is inestimable for p, is still open. Finally, we want to mention that although joint consistent estimation of β and p is impossible using just one sample from the p-spin Curie–Weiss model (3), we still hope to do so using multiple samples.

2. Impossibility of Jointly Estimating ( β , p )

We start by showing that joint consistent estimation of β and p using only one sample X : = ( X 1 , , X N ) from the model (3) is, in general, impossible. Towards this, for every m [ 0 , 1 ) , define a set:
Θ m : = { ( β , p ) ( 0 , ) × D : m is the unique non - negative global maximizer of H β , p }
where D denotes the set of all integers 3 , and
H β , p ( x ) : = β x p 1 2 ( 1 + x ) log ( 1 + x ) + ( 1 x ) log ( 1 x ) for x [ 0 , 1 ] .
Theorem 1.
For every m [ 0 , 1 ) , such that | Θ m | 2 , there does not exist any sequence of estimators (measurable functions of X ), which is consistent for ( β , p ) Θ m under the model (3).
The following lemma is crucial for proving Theorem 1.
Lemma 1.
For every m [ 0 , 1 ) , denote the distribution of a Rademacher random variable with mean m by μ. Then, the product measure Q : = i = 1 N μ is contiguous to P β , p for all m [ 0 , 1 ) , and all ( β , p ) Θ m .
Proof. 
To begin with, note that on the event E K : = { N ( X ¯ m ) K } , we have:
Q ( x ) P β , p ( x ) = Z N ( β , p ) i = 1 N 1 + m 2 ( 1 + X i ) / 2 1 m 2 ( 1 X i ) / 2 exp ( β N X ¯ p ) = 2 N Z N ( β , p ) exp N β X ¯ p 1 2 ( 1 + X ¯ ) log ( 1 + m ) + ( 1 X ¯ ) log ( 1 m ) .
Now, by Lemma 3.2 and Lemma 3.4 in [9], we have:
2 N Z N ( β , p ) = e N H β , p ( m ) Θ ( 1 ) .
Also, by Taylor expansion, we have:
β X ¯ p 1 2 ( 1 + X ¯ ) log ( 1 + m ) + ( 1 X ¯ ) log ( 1 m ) H β , p ( m ) = H β , p ( X ¯ ) H β , p ( m ) + 1 2 ( 1 + X ¯ ) log 1 + X ¯ 1 + m + ( 1 X ¯ ) log 1 X ¯ 1 m = O ( X ¯ m ) 2 + ( X ¯ m ) 1 + X ¯ 1 + m 1 X ¯ 1 m = O ( X ¯ m ) 2 .
Hence, we have:
Q ( X ) P β , p ( X ) = Θ ( 1 ) e N O ( X ¯ m ) 2 C K
for some constant C K . Hence, for any event A N { 1 , 1 } N , we have:
Q ( X A N , E K ) = E P β , p Q ( X ) P β , p ( X ) 1 X A N , E K C K P ( X A N , E K ) ,
thereby providing the following:
Q ( X A N ) C K P ( X A N ) + Q ( E K c ) .
Now, suppose that P ( X A N ) 0 . Then, for every K > 0 , we have the following:
lim sup N Q ( X A N ) lim sup N Q N ( X ¯ m ) > K = P ( N ( 0 , 1 m 2 ) > K ) .
Taking the limit as K throughout the above inequality, we can conclude that Q ( X A N ) 0 , which completes the proof of Lemma 1. □
With Lemma 1 in hand, we are now ready to prove Theorem 1.
Proof of Theorem 1
Suppose that there exists a sequence T N ( X ) R 2 of consistent estimators of ( β , p ) on Θ m . Fixing two different points ( β 1 , p 1 ) and ( β 2 , p 2 ) Θ m , we can construct disjoint neighborhoods B 1 and B 2 around them, respectively. Through the consistency of T N ( X ) on Θ m , we have the following:
P β i , p i ( T N ( X ) B i ) 1 for i = 1 , 2 .
From Lemma 1, we have the following:
Q ( T N ( X ) B i ) 1 for i = 1 , 2 ,
which contradicts the facts that B 1 , B 2 are disjointed, and Q is a probability measure. □
Remark 1.
Our argument implies that the consistent estimation of p is impossible if β is unknown. Because, if there were such a consistent estimator p ^ : = p ^ ( X ) , then we could choose ( β 1 , p 1 ) and ( β 2 , p 2 ) from some Θ m ; construct disjoint open intervals I 1 and I 2 around p 1 and p 2 , respectively; and argue from contiguity that Q ( p ^ ( X ) I i ) 1 for i = 1 , 2 , which is a contradiction.
The question is how do the sets Θ m look for different values of m [ 0 , 1 ) ? To answer this, let us define for each ( β , p ) ,
β * ( p ) : = sup β > 0 : sup x [ 0 , 1 ] H β , p ( x ) = 0 .
It follows from [9] that for p D , the function H β , p has a unique postitive maximizer m * ( β , p ) if β > β * ( p ) . By convention, we define m p = m * ( β * ( p ) , p ) : = lim β β * ( p ) m * ( β , p ) .
Proposition 1.
The sequence { m p } p D 1 as p , which can be sorted in ascending order as 0 < m p 1 m p 2 < 1 . Further, if we denote Θ m | 2 to be the projection of Θ m onto the p-coordinate, then
Θ m | 2 = Ø if 0 < m m p 1 , { p 1 , , p k } if m p k < m m p k + 1 , k 1 .
The proof of Proposition 1 is technical, and is given in Appendix A. Note that for m ( 0 , 1 ) , the set Θ m is uniquely determined by its projection Θ m | 2 , because every p Θ m | 2 uniquely corresponds to the element ( β p , p ) Θ m , where β p : = p 1 m 1 p tanh 1 ( m ) . Proposition 1 thus provides a complete description of the family of sets Θ m for all values of m ( 0 , 1 ) , and states that although each of these sets is finite, we can choose sets as large as possible from this family. The following proposition describes the set Θ 0 .
Proposition 2.
Θ 0 = { ( β , p ) ( 0 , ) × D : β < β * ( p ) } .
The proof of Proposition 2 follows from the following three facts proved in [9]:
  • If β < β * ( p ) , then 0 is the unique global maximizer of H β , p .
  • If β = β * ( p ) , then H β . p has two different non-negative global maximizers.
  • If β > β * ( p ) , then H β , p has a unique non-negative global maximizer, which happens to be positive.
Remark 2.
All of the results in this section also hold almost surely for the somewhat more general p-spin Erdös–Rényi Ising model [27], where the interaction tensor J in (2) is given by α 1 N 1 p A , with the entries A i 1 , , i p of the tensor A being i.i.d. Bernoulli random variables with a mean α, for some fixed α ( 0 , 1 ) . This follows from the fact that the Erdös–Rényi Ising and Curie–Weiss measures are mutually contiguous, which follows from Lemma 6.6 in [28].

3. Estimation of p When β Is Known

Throughout this section, we assume that the parameter β is known. To begin with, for every β > 0 , let us define the following two sets:
U β : = { p 2 : β * ( p ) > β } and L β : = { p 2 : β * ( p ) < β } .
For Lemma A.1 in [10], the set U β is empty if β log 2 , and is of the form { q , q + 1 , } for some integer q 2 otherwise.
Theorem 2.
When β is known, no sequences of estimators that are consistent with p U β exist.
Proof. 
It follows from Proposition 2 and Lemma 1, that the N-fold product measure of the mean-0 Rademacher distribution is contiguous to P β , p for all p U β . The proof now follows from the argument given in Remark 1. □
Remark 3.
One can consider an extension of the model (3) by adding an external magnetic field parameter h as follows:
P β , p ( x ) : = exp β N x ¯ p + h N x ¯ Z N ( β , p ) for x { 1 , 1 } N
It has been shown in [9] that the consistent estimation of h is always possible when β is known. Proposition 2 shows that this is not the case for estimating p, which is impossible if β * ( p ) > β . This inestimability region also coincides with that for estimating β when p is known (see [10]).
Now, we turn our attention to the set L β , and we break it down to the following two parts:
L β 1 : = { p L β : q L β , q p such that m * ( β , q ) = m * ( β , p ) } and L β 2 : = L β L β 1 .
where m * ( β , q ) is defined as the largest non-negative maximizer of H β , q .
Theorem 3.
When β is known, there does not exist any sequence of estimators that is consistent for p L β 1 .
Proof. 
Once again, from Lemma 1, we know that the N-fold product of the Rademacher distribution with a mean of m * ( β , p ) is contiguous to both P β , p and P β , q , where p L β 1 and p q L β is such that m * ( β , q ) = m * ( β , p ) . Clearly, q L β 1 . Once again, the rest of the proof follows from the arguments of Remark 1. □
We now show that it is possible to estimate p consistently on the set L β 2 , if β is known. Towards this, let us define the following estimator of p:
p ^ ( β , δ ) : = arg min 2 q log | X ¯ | 1 ( 2 β + δ ) + 2 , β * ( q ) < β m * ( β , q ) 2 X ¯ 2
The intuition behind constructing our estimator, is that X ¯ 2 P m * ( β , p ) 2 as N ; hence, the distance between X ¯ 2 and m * ( β , q ) 2 is expected to be minimized at q = p for large N. The next theorem shows that with a high probability, this is indeed the case.
Theorem 4.
For every δ > 0 and p L β 2 , we have
P β , p ( p ^ β , δ = p ) = 1 e C N
for some constant C = C β , p , δ > 0 not depending on N.
Proof. 
Suppose that X P β , p for some p L β 2 . It follows from Lemma A.1 in [10] and the proof of Proposition 1 that as q , m * ( β , q ) 1 if β log 2 and m * ( β , q ) 0 otherwise. Hence, in any case, m * ( β , p ) ( 0 , 1 ) is not an accumulation point of the sequence { m * ( β , q ) } q 2 . This shows that there exists ε > 0 , such that | m * ( β , q ) 2 m * ( β , p ) 2 | > ε for all q L β not equal to p.
Now, it follows from the proofs of Lemma 3.1 and Lemma 3.3 in [9] that
P β , p | m * ( β , p ) 2 X ¯ 2 | > ε 3 e C 1 N
for some constant C 1 > 0 not depending on N. Also, it follows from the proofs of Lemma 3.1 and Lemma 3.3 in [9] and the proof of Proposition 1, that
P β , p X ¯ 2 < ( 2 β + δ ) 2 / ( 2 p ) e C 2 N i . e . P β , p p > log | X ¯ | 1 ( 2 β + δ ) + 2 e C 2 N
for some constant C 2 > 0 not depending on N. Define E 1 : = { | m * ( β , p ) 2 X ¯ 2 | ε / 3 } and E 2 : = { p log | X ¯ | 1 ( 2 β + δ ) + 2 } . It is clear that p ^ ( β , δ ) = p on the event E 1 E 2 . Theorem 4 now follows, as P ( E 1 c E 2 c ) e C 1 N + e C 2 N . □
Remark 4.
One can tune the parameter δ > 0 in (5) to allow for an optimal number of integers over which the minimization is to be performed, in order to ensure proper convergence of p ^ ( β , δ ) . Also, in case β < log 2 , eventually β * ( q ) > β , so one just stops at the largest q, for which β * ( q ) < β in the minimization (5).
In the reverse problem of estimating β with known p, consistent estimation is possible for all β > β * ( p ) (see [9]). In contrast, a further challenging inestimability region L β 1 arises in our problem of estimating p with known β . The next proposition shows that for almost all β > 0 , the set L β 1 is actually empty, i.e., the entire region L β is estimable.
Proposition 3.
There exists a countable set Ξ ( 0 , ) , such that L β 1 = Ø for β Ξ .
The proof of Proposition 3 is given in Appendix B. However, in the proof we provide an exact enumeration of one countable set Λ satisfying Proposition 3, namely:
Λ : = { s p , q : q > p 2 are integers } where s p , q : = 1 p p q 1 p q p tanh 1 p q 1 q p ,
our analysis did not allow us to dig in any further. In particular, the following question is open:
  • Open Problem: Give an exact enumeration of the minimal countable set Ξ 0 = { β > 0 : L β 1 Ø } satisfying Proposition 3.
Note that if β = s p , q for some integers q > p 2 , then m : = ( p / q ) 1 / ( q p ) is a stationary point of both H β , p and H β , q . We can actually say that β Ξ 0 if this stationary point turns out to be a global maximizer of both these functions. However, checking the last condition will likely involve more intricate analysis, and is left as an open question.

Funding

This research was funded by the FoS Tier 1 grant WBS A-8001449-00-00 and the FRC Tier 1 grant WBS A-8002932-00-00.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No data were used in this work.

Acknowledgments

The author thanks Paul Switzer for bringing to notice the problem of estimating p, and Luc Devroye for several interesting discussions related to this problem.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Proof of Proposition 1

To show that the sequence m p 1 as p , note that with a Taylor expansion, we have the following:
H β , p ( x ) β x p x 2 2 for all β > 0 .
Hence, H β , p ( x ) < 0 for all x < ( 2 β ) 1 / ( 2 p ) , and so, if β > β * ( p ) , then m * ( β , p ) ( 2 β ) 1 / ( 2 p ) . Thus, for every β ( β * ( p ) , β * ( p ) + ε ) , we have m * ( β , p ) > ( 2 ( β * ( p ) + ε ) ) 1 / ( 2 p ) . As ε > 0 is arbitrary, we thus have:
m p ( 2 β * ( p ) ) 1 / ( 2 p )
It is now easy to see that the right-hand side of (A1) converges to 1 as p , whereas the left-hand side is bounded above by 1. This proves our claim.
Now, as the sequence { m p } p D is convergent to its supremum, it can be arranged in increasing order as m p 1 m p 2 < 1 . Also, by (A1) and Lemma A.1 in [10], m p 1 ( log 4 ) 1 / ( 2 p 1 ) > 0 . Suppose that ( β , p ) Θ m for some m > 0 . It follows from [9] that β > β * ( p ) . By Lemma 11 in [28], the function m * ( β , p ) is strictly increasing in β on ( β * ( p ) , ) ; hence,
m = m * ( β , p ) > m p .
It follows from (A2), that Θ m = Ø for m ( 0 , m p 1 ] .
Let us now assume that m ( m p k , m p k + 1 ] . If ( β , p ) Θ m , then, by (A2), we have m p k + 1 > m p ; hence, p { p 1 , , p k } . Conversely, suppose that p = p i for some 1 i k . Then, m > m p ; hence, there exists β 0 > β * ( p ) , such that m > m * ( β 0 , p ) . By Lemma B.3 in [28], we know that lim β m * ( β , p ) = 1 , which implies the existence of β 1 > β * ( p ) , such that m < m * ( β 1 , p ) . Finally, Lemma B.3 in [28] also states that the function m * ( β , p ) is continuous in β on ( β * ( p ) , ) , which in view of the intermediate value theorem, implies the existence of β ( β 0 , β 1 ) , such that m = m * ( β , p ) , i.e., ( β , p ) Θ m . This completes the proof of Proposition 1. □

Appendix B. Proof of Proposition 3

For every pair of integers q > p 2 , define:
s p , q : = 1 p p q 1 p q p tanh 1 p q 1 q p ,
and let Λ : = { s p , q : q > p 2 are integers } . Clearly, Λ is a countable set. Suppose that L β 1 Ø . Then, there exist q > p 2 such that both the functions H β , p and H β , q are maximized at the same point m ( 0 , 1 ) . Hence, we have H β , q ( m ) = H β , p ( m ) = 0 , i.e.,
β q m q 1 = β p m p 1 = tanh 1 ( m ) .
From the first equality, we have m = ( p / q ) 1 / ( q p ) , and from the last equality, we have:
β = tanh 1 ( m ) p m p 1 = s p , q Λ .
Hence, L β 1 = Ø if β Λ , which completes the proof of Proposition 3. □

References

  1. Ising, E. Beitrag zur theorie der ferromagnetismus. Z. für Phys. 1925, 31, 253–258. [Google Scholar] [CrossRef]
  2. Geman, S.; Graffigne, C. Markov random field image models and their applications to computer vision. In Proceedings of the International Congress of Mathematicians, Berkeley, CA, USA, 3–11 August 1986; pp. 1496–1517. [Google Scholar]
  3. Hopfield, J.J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci. USA 1982, 79, 2554–2558. [Google Scholar] [CrossRef] [PubMed]
  4. Banerjee, S.; Carlin, B.P.; Gelfand, A.E. Hierarchical Modeling and Analysis for Spatial Data; Chapman and Hall/CRC: Boca Raton, FL, USA, 2014. [Google Scholar]
  5. Green, P.J.; Richardson, S. Hidden Markov models and disease mapping. J. Am. Stat. Assoc. 2002, 97, 1055–1070. [Google Scholar] [CrossRef]
  6. Aslanov, L.A. Crystal symmetry and atomic interactions model. Comput. Math. Appl. 1988, 16, 443–451. [Google Scholar] [CrossRef]
  7. Barra, A. Notes on ferromagnetic p-spin and REM. Math. Methods Appl. Sci. 2009, 32, 783–797. [Google Scholar] [CrossRef]
  8. Derrida, B. Random-energy model: Limit of a family of disordered models. Phys. Lett. 1980, 45, 79. [Google Scholar] [CrossRef]
  9. Mukherjee, S.; Son, J.; Bhattacharya, B.B. Phase transitions of the maximum likelihood estimates in the p-spin Curie-Weiss model. Bernoulli 2025, 31, 1502–1526. [Google Scholar] [CrossRef]
  10. Mukherjee, S.; Son, J.; Bhattacharya, B.B. Estimation in tensor Ising models. Inf. Inference J. IMA 2022, 11, 1457–1500. [Google Scholar] [CrossRef]
  11. Ananikyan, L.N. The hexagonal recursive approximation with multisite-interaction Ising model for the solid and fluid 3He system. Int. J. Mod. Phys. B 2007, 21, 755–772. [Google Scholar] [CrossRef]
  12. Bergamaschi, T.R.; Menke, T.; Banner, W.P.; Paolo, A.D.; Weber, S.J.; Hirjibehedin, C.F.; Kerman, A.J.; Oliver, W.D. Distinguishing multi-spin interactions from lower-order effects. Phys. Rev. Appl. 2022, 18, 044018. [Google Scholar] [CrossRef]
  13. Contucci, P.; Mingione, E.; Osabutey, G. Limit theorems for the cubic mean-field Ising model. arXiv 2023, arXiv:2303.14578. [Google Scholar] [CrossRef]
  14. Eichelsbacher, P. Stein’s method and a cubic mean-field model. arXiv 2024, arXiv:2404.07587. [Google Scholar] [CrossRef]
  15. Gardner, E. Spin glasses with p-spin interactions. Nucl. Phys. B 1985, 257, 747–765. [Google Scholar] [CrossRef]
  16. Mukherjee, S.; Son, J.; Bhattacharya, B. Fluctuations of the magnetization in the p-Spin Curie-Weiss model. Communinations Math. Phys. 2021, 387, 681–728. [Google Scholar] [CrossRef]
  17. Osabutey, G. Phase properties of the mean-field Ising model with three-spin interaction. J. Math. Phys. 2024, 65, 061901. [Google Scholar] [CrossRef]
  18. Sherrington, D.; de Almeida, J.R.L. Beyond Ising SK. arXiv 2022, arXiv:2208.07309. [Google Scholar]
  19. Bhowal, S.; Mukherjee, S. Limit theorems and phase transitions in the tensor Curie-Weiss Potts model. arXiv 2023, arXiv:2307.01052. [Google Scholar]
  20. Bhowal, S.; Mukherjee, S. Rates of convergence of the magnetization in the tensor Curie-Weiss Potts model. J. Stat. Phys. 2025, 192, 2. [Google Scholar] [CrossRef]
  21. Wu, F.Y. The Potts model. Rev. Mod. Phys. 1982, 54, 235–268. [Google Scholar] [CrossRef]
  22. Bhattacharya, B.B.; Mukherjee, S. Inference in Ising models. Bernoulli 2018, 24, 493–525. [Google Scholar] [CrossRef]
  23. Chatterjee, S. Estimation in spin glasses: A first step. Ann. Stat. 2007, 35, 1931–1946. [Google Scholar] [CrossRef]
  24. Ghosal, P.; Mukherjee, S. Joint estimation of parameters in Ising model. Ann. Stat. 2020, 18, 785–810. [Google Scholar] [CrossRef]
  25. Comets, F.; Gidas, B. Asymptotics of Maximum Likelihood Estimators for the Curie-Weiss Model. Ann. Stat. 1991, 19, 557–578. [Google Scholar] [CrossRef]
  26. Mukherjee, S. Limit Theorems for Dependent Combinatorial Data, with Applications in Statistical Inference. arXiv 2021, arXiv:2108.12233v2. [Google Scholar]
  27. Agliar, E.; Barra, A.; Camboni, F. Notes on ferromagnetic diluted p-spin model. Rep. Math. Phys. 2011, 68, 1–22. [Google Scholar] [CrossRef]
  28. Mukherjee, S.; Ghosh, S.; Son, J.; Mukherjee, S. Efficient estimation in tensor Curie-Weiss and Erdös-Rényi Ising models. Electron. J. Stat. 2024, 18, 2405–2449. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mukherjee, S. Interaction Order Estimation in Tensor Curie–Weiss Models. Entropy 2025, 27, 245. https://doi.org/10.3390/e27030245

AMA Style

Mukherjee S. Interaction Order Estimation in Tensor Curie–Weiss Models. Entropy. 2025; 27(3):245. https://doi.org/10.3390/e27030245

Chicago/Turabian Style

Mukherjee, Somabha. 2025. "Interaction Order Estimation in Tensor Curie–Weiss Models" Entropy 27, no. 3: 245. https://doi.org/10.3390/e27030245

APA Style

Mukherjee, S. (2025). Interaction Order Estimation in Tensor Curie–Weiss Models. Entropy, 27(3), 245. https://doi.org/10.3390/e27030245

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop