## 1. Introduction

The concept of entropy plays a key role in describing the transition between disorder and order. Specifically, the emergence of order in a random phase is indicated by a significant reduction of entropy. Calculating this reduction requires to know the probability of each possible system configuration that is compatible with the given system constraints—a challenging problem both methodologically and computationally. If we consider a system of N elements each of which can be in one of two states—for example up and down spins in a physical system, or agents with opposite opinions in a social system, or agents with the two strategies cooperate or defect in an economic system—the number of possible configurations is ${2}^{N}$, which can be quite large. In fact, statistical physics was founded in the 19th century to provide an efficient solution based on the concept of statistical ensembles and state sums.

In this paper, we address the problem by proposing a stochastic approach that allows to decompose such probabilities for systems characterized by neighbor-neighbor interactions. Our candidate model to describe this interaction is the so-called

voter model which is discussed in more detail in

Section 3. In this model, agents are in one of two discrete states,

$\sigma \in \{0,1\}$, denoted as “opinions”. They change their opinion in response to the opinions in their neighborhood. In order to define such a neighborhood, we have chosen a one-dimensional cellular automaton (CA), in which consecutively numbered cells

$i=1,...,N$ represent agents (see

Figure 1). We assume that the CA forms a ring to close the system. Each agent

i then has two neighbors

$i-1$,

$i+1$,

i.e., their opinions

${\theta}_{i}$ form a triplet

${T}_{i}=\{{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1}\}$. The second-order neighborhood that also takes the neighbors of

$i-1$,

$i+1$ into account, then results in a quintuplet of opinions

${Q}_{i}=\{{\theta}_{i-2},{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1},{\theta}_{i+2}\}$.

**Figure 1.**
One-dimensional cellular automaton (CA) with triplet (${T}_{i}$) and quintuplet (${Q}_{i}$) definition of the neighborhood of agent i.

**Figure 1.**
One-dimensional cellular automaton (CA) with triplet (${T}_{i}$) and quintuplet (${Q}_{i}$) definition of the neighborhood of agent i.

Our major assumption of the voter model is that changes in the opinion of agent i are only caused by the first-order neighborhood. Specifically, the agent responds to the local frequency of opinions in the triplet that also includes its own opinion. However, because the dynamics of agent i depends on its neighbors, it is also coupled to their dynamics, i.e., to the second-order neighborhood of agent i, and so forth. This specifically denotes the problem that we are going to discuss in this paper. In a stochastic approach, we are faced with a system of N coupled dynamic equations for the probabilities $p({\theta}_{i},t)$ to find any agent i with opinion ${\theta}_{i}$ at time t. This coupling exists only through the two neighbors of each agent, i.e., changes in a far distant cell only propagate slowly through the CA by means of neighbor-neighbor interactions. So, precisely, how large should be the neighborhood taken into account for the dynamics of the CA? Or, how far reaching are correlations in the changes of opinions?

To answer this question, in this paper we propose three different analytic approximations for the dynamics which take different neighborhood sizes into account: zero-order–no correlations between neighboring agents, first-order–correlations between an agent and one of its nearest neighbors (so called pair approximation), second-order–correlations between an agent and its second-nearest neighbors. We test the validity of these dynamic approximations by comparing them to stochastic simulations of the CA, averaged over a number of runs.

Our emphasis is of course on the second-order approximation, which extends previous investigations. But we want to understand under which circumstances this approximation fares better than the simpler ones. Therefore, we have chosen different variants of the voter model, also known as nonlinear voter models. The non-linearity is with respect to the response to the local frequency of opinions. We further compare deterministic and stochastic dynamics, to show whether our analytic approximations can correctly predict the dynamics. Our variable of interest is the expected fraction of opinion 1, denoted as $\u2329x\left(t\right)\u232a$, which is an aggregated variable for which we derive the dynamics based on a stochastic approach.

Our paper combines, and extends, previous investigations in different directions. One line of research refers to the approximation of higher-order probability distributions by means of lower-order distributions [

1] which has been also applied to the voter model [

2]. This implies a loss of (microscopic) information which can be quantified by means of information-theoretic measures [

3]. The question whether the coarse-grained dynamics is still Markovian can be answered by analyzing emergent macroscopic memory effects using information-theoretic measures [

4]. Such measures have also been applied to voter models [

5]. In most cases, the Markov chain analysis becomes quite cumbersome and therefore is restricted to one-dimensional CA [

6].

Another line of research considers different forms of neighborhood approximations specifically for binary state-dynamics (see [

7] for a good overview), which also has been applied to the voter model, already [

8]. We note that our pair approximation approach follows [

8], but applies it here to a one-dimensional CA, which results in different expressions for

$\u2329x\left(t\right)\u232a$ and the correlations

${c}_{1|1}\left(t\right)$.

Compared to two-dimensional CA [

9,

10] or even complex networks [

11,

12], one could find one-dimensional CA too simple. But this judgment is in fact not justified. Already one-dimensional CA have proven to exhibit a really complex dynamics, with a chance to derive analytic expressions. Extensions of the simple voter model, for example the Sznajd model [

13] or the

q-voter model [

14], could be thoroughly analyzed for one-dimensional CA.

The emphasis of our investigations is on the validity of the analytic approximations for nonlinear voter models. This non-linearity can be introduced in different ways. In References [

15,

16,

17], the authors discuss it on the level of individual agents that respond to neighboring influences in a heterogeneous manner. Specifically, Stark

et al. [

16] assumes a heterogeneous inertia for agents to change their opinions, Castellano

et al. [

15] assumes a heterogeneous neighborhood size to influence agents, while Xiong

et al. [

17] assumes a heterogeneous weight for the influence of agents.

Compared to these approaches, we assume a homogeneous, but nonlinear response of agents on the local frequency of opinions. Specifically, we consider the majority rule, where the tendency of agents’ to change their opinion increases with the frequency of the opposite opinion. The minority rule, on the other hand, assumes exactly the opposite, i.e., a decreasing tendency. This can be simply varied by one parameter α, which however is assumed to be the same for all agents.

Eventually, we would like to point out that we refrain from interpreting our model in a social context. Although agents are called “voters” and their states are called “opinions”, the simplicity of the underlying assumptions does not justify to sell the model as a reflection of a social system. We see it rather as a very generic setup to better understand the impact of local feedback on emerging systemic properties, such as consensus (

i.e., a “ferromagnetic” phase) or coexistence (

i.e., a “paramagnetic” phase). But we acknowledge that, despite this basic limitations, the voter model has been applied in various context, e.g., to model investors’ behavior in financial markets [

18], emerging communication networks [

19], or invasion of species [

20]. A good overview of spin-type models in sociophysics is given in [

21].

## 2. Stochastic Approach

#### 2.1. Defining the Cellular Automaton

In this paper, we consider a one-dimensional CA consisting of

N cells, each of which is identified by the index

$i\in N$ (see

Figure 1). Each cell shall be characterized by a discrete value

${\theta}_{i}=\{0,1\}$, hence the total distribution of states is given by the vector

$\Theta =\{{\theta}_{1},...,{\theta}_{i-2},{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1},{\theta}_{i+2},...,{\theta}_{N}\}$. Assuming a torus space, each cell

i has a clearly defined neigborhood of first and second nearest neighbors,

${\underline{\theta}}_{i}^{\prime}=\{{\theta}_{i-1},{\theta}_{i+1}\}$ and

${\underline{\theta}}_{i}^{\u2033}=\{{\theta}_{i-2},{\theta}_{i+2}\}$. The probability to find cell

i in state

${\theta}_{i}$ at time

t (where time shall be measured in discrete steps) is

$p({\theta}_{i},t)$. Consequently, the conditional probabilities

$p\left({\theta}_{i}\right|{\underline{\theta}}_{i}^{{}^{\prime}},t)$ and

$p\left({\theta}_{i}\right|{\underline{\theta}}_{i}^{{}^{\prime}},{\underline{\theta}}_{i}^{{}^{\u2033}},t)$ describe the probability to find cell

i in state

${\theta}_{i}$ given that it has the first and second nearest neighbors

${\underline{\theta}}_{i}^{{}^{\prime}}$,

${\underline{\theta}}_{i}^{{}^{\u2033}}$.

Under Markov assumptions the Chapman–Kolmogorov equation holds for the probability

$p({\theta}_{i},t)$ to find cell

i in state

${\theta}_{i}$ at time

$t+1$:

The propagator

$p[({\theta}_{i},t+1)\leftarrow ({\widehat{\theta}}_{i},t)]$ denotes the transition probability to go from any given state

${\widehat{\theta}}_{i}$ at time

t to the assumed state

${\theta}_{i}$ in the next time step,

$t+1$. Here the summation is over all possible realizations

${\theta}^{\star}$ of

${\widehat{\theta}}_{i}$,

i.e., the

${2}^{1}$ possible states

$\{0,1\}$. At this point, we make our 1st assumption, namely that any change of

${\theta}_{i}$ depends on the nearest neighbors, given by

${\underline{\theta}}_{i}^{\prime}=\{{\theta}_{i-1},{\theta}_{i+1}\}$. That means a triplet

${T}_{i}=\{{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1}\}$ decides about the value of

${\theta}_{i}$ in the next time step. If we define the probability of a triplet configuration as

then the probability

$p({\theta}_{i},t)$ results as the marginal distribution of the triplet probability:

Here the summation is over all possible realizations ${\underline{\theta}}^{\prime \star}$ of the nearest neighborhood ${\underline{\theta}}_{i}^{\prime}$, i.e., ${2}^{2}$ different possibilities. $p\left({\theta}_{i}\right|{T}_{i},t)$ denotes the conditional probability to find ${\theta}_{i}$ as the focal cell given a triplet ${T}_{i}$, and the summation is over all possible realizations ${T}^{\star}$ of ${T}_{i}$, i.e., ${2}^{3}$ diffferent possibilities.

Based on the assumption that the nearest neighborhood matters, we can rewrite Equation (

1) as

where

$p\left({\theta}_{i}\right|{\underline{\theta}}_{i}^{\prime},t)$ denotes the conditional probability to find the focal cell in state

${\theta}_{i}$ given the neighborhood

${\underline{\theta}}_{i}^{\prime}$. Using Bayes’ rule, we can express this probability as:

With this, we can eventually rewrite the Chapman–Kolmogorov Equation (

1) for the single cell

i in terms of the triplet probabilities,

$p({T}_{i},t)$:

The propagator $p[({\theta}_{i},t+1)\leftarrow ({T}_{i},t)]$ describes the transition probabilities to go from any possible triplet ${T}_{i}$ to a state ${\theta}_{i}$ during the next time step.

This equation leaves us with the further specification of the triplet probability,

$p({T}_{i},t)$. While, according to our 1st assumption, the occurence of

${\theta}_{i}$ is just determined by the nearest neighbors

${\underline{\theta}}_{i}^{\prime}=\{{\theta}_{i-1},{\theta}_{i+1}\}$, the occurrence of either

${\theta}_{i-1}$ or

${\theta}_{i+1}$ also depends on their nearest neighbors,

i.e., the second nearest neighbors of

i,

${\underline{\theta}}_{i}^{\u2033}=\{{\theta}_{i-2},{\theta}_{i+2}\}$. That means a quintuplet

${Q}_{i}=\{{\theta}_{i-2},{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1},{\theta}_{i+2}\}$ decides about the value of the triplet

${T}_{i}=\{{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1}\}$ in the next time step. If we consider a quintuplet configuration

${Q}_{i}$, then we can define the triplet probability as the marginal probability:

Here the summation is over all possible realizations ${\underline{\theta}}^{\u2033\star}$ of the nearest neighborhood ${\underline{\theta}}_{i}^{\u2033}$, i.e., ${2}^{2}$ different possibilities. $p\left({T}_{i}\right|{Q}_{i},t)$ denotes the conditional probability to find the focal triplet ${T}_{i}$ given a quintuplet ${Q}_{i}$, and the summation is over all possible realizations ${Q}^{\star}$ of ${Q}_{i}$, i.e., ${2}^{5}$ diffferent possibilities.

For the dynamics for the probability to find a triplet in state

${T}_{i}$ at time

$t+1$, we can write a Chapman–Kolmogorov equation quite similar to Equation (

6)

The propagator $p[({T}_{i},t+1)\leftarrow ({Q}_{i},t)]$ describes the transition probabilities to go from any possible quintuplet ${Q}_{i}$ to a triplet ${T}_{i}$ during the next time step.

In order to specify the quintuplet probability $p({Q}_{i},t)$, we may consider a neighborhood of seven, etc. However, following the above procedure repeatedly is neither convenient nor practicable since at the end we have to consider ${2}^{N}$ possible configurations. Therefore, in the next section, we present a more convenient method to determine the quintuplet probability distribution.

#### 2.2. Quintuplet Approximation

Instead of defining the quintuplet distribution in terms of higher-order probability distributions, following the procedure of the previous section, we now make our 2nd assumption by expressing these probabilities in terms of lower-order distributions, i.e., triplet distributions, this way arriving at a closed form description of the problem.

In the following we refer to the theory of approximating discrete probability distributions already developed in the late 1950 [

1], which was also applied to voter model (VM) [

2]. The idea is to approximate an

n-th order probability distribution,

$p({x}_{1},...,{x}_{n})$ by products containing the probabilities of given subsets, e.g.,

$p({x}_{i},{x}_{j},{x}_{k})$. It is known that such a product approximation contains at most

n terms, since every new term has to contain at least one variable

${x}_{i}$ not contained in previous terms. For example, the factorization

$p({x}_{1},...,{x}_{n})={\prod}_{i}p\left({x}_{i}\right)$ already satisfies this condition, although it may be not a good approximation, since it holds only for ideal systems.

We note that the approximation procedure is not unique, i.e., there are various ways of constructing a product approximation. For example, taking $p({x}_{1},{x}_{2},{x}_{3})$, this can be approximated by $p({x}_{1},{x}_{3})p\left({x}_{2}\right)$ as well as by $p({x}_{1},{x}_{3})p\left({x}_{2}\right|{x}_{1})$ or by $p({x}_{1},{x}_{3})p\left({x}_{2}\right|{x}_{3})$. Which one of the above approximations is the most suitable has to be determined based on the information available about the triplet distribution $p({x}_{1},{x}_{2},{x}_{3})$. In some cases, there are measurements of this distribution, or additional information about the dependency between the ${x}_{i}$ as we will use below.

With reference to our quintuplet approximation using the triplets defined, we propose the following procedure: we first note that there are different triplets contained in the quintuplet configuration, for example:

${T}_{i}=\{{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1}\}$,

${T}_{i-1}=\{{\theta}_{i-2},{\theta}_{i-1},{\theta}_{i}\}$,

${T}_{i+1}=\{{\theta}_{i},{\theta}_{i+1},{\theta}_{i+2}\}$. These are valid triplets because they consider the correct neighborhood relations, whereas for example

$\{{\theta}_{i-1},{\theta}_{i+1},{\theta}_{i+2}\}$ would be inappropriate. Out of the valid triplets, we may choose for example the first one,

$\{{\theta}_{i-1},{\theta}_{i},{\theta}_{i+1}\}$ and complete the product approximation as follows:

In order to express the conditional probabilities using the triplet distribution, we apply Bayes’ rule:

which results in the final quintuplet approximation:

We note that this product approximation is the only possible one, given that we have an ordered set of variables ${\theta}_{i}$ according to their neighborhood relations.

#### 2.3. Closed Form Dynamics

Using the approximation explained above, we have reduced the higher-order description to the level of triplet distributions,

$p({\theta}_{i-1},{\theta}_{i},{\theta}_{i+1},t)$,

etc., which results in a closed form dynamics. We now have to specify the initial conditions for the distributions. While we are able to make any suitable assumption about the initial triplet distribution, we assume here that the occupation of the different cells is initially statistically independent,

i.e.,

For further investigations, we have set $p(\theta ,0)=p(1-\theta ,0)=1/2$.

In order to solve the above set of equations, we have two different possibilities: we can solve the dynamics (a) on the level of the distribution

$p({\theta}_{i},t)$, Equation (

6), or (b) on the level of the triplet distribution

$p({T}_{i},t)$, Equation (

9). Variant (a) requires more computational effort, since we have to replace any

$p({T}_{i},t)$ as a marginal distribution of triplets, whereas in variant (b), given the reduction of the quintuplet distributions, we already have a closed form dynamics for the triplet distributions, which is computationally more efficient.

Variant (b), however, requires to know the transition probabilities on the level of the triplets, whereas these are usually specified on the level of single cell changes (see also the following section). Hence, we have to determine the propagators

$p[({T}_{i},t+1)\leftarrow ({Q}_{i},t)]$ in terms of

$p[({\theta}_{i},t+1)\leftarrow ({T}_{i},t)]$. Since our 1st assumption specifies that the change of every cell in the triplet only depends on its nearest neighbors, we are able to factorize these transition probabilities:

This completes our closed form description. To calculate the time-dependent probability distribution of triplets, we have to insert Equation (

12), Equation (

14) into Equation (

9) and from the result calculate

$p({\theta}_{i},t)$ as the marginal distribution, Equation (

3). The only remaining task is now to specify the transition probabilities for single cells,

$p[({\theta}_{i},t+1)\leftarrow ({T}_{i},t)]$ at which point the VM comes into play.

## 3. Nonlinear Voter Dynamics

#### 3.1. Transition Rates

The voter model, in its basic form, is applied to a population of N agents, where each agent is characterized by a discrete value, its “opinion”, ${\theta}_{i}\in \{0,1\}$. The opinion dynamics at the level of the agents works as follows: two agents i and j are randomly chosen from the population, and agent j adopts the opinion of agent i, i.e., ${\theta}_{j}(t+1)={\theta}_{i}\left(t\right)$. After N such update events, time is increased by 1 (random sequential update).

The rather simplified rule limits the applications of the voter model to any real voting process or opinion dynamics. In a well mixed population, the probability for agent

j to adopt a given opinion

$\sigma \in \{0,1\}$ is simply proportional to the global frequency

${x}_{\sigma}\left(t\right)$ of agents with the respective opinion:

This dynamics is known to always converge to complete consensus, $x\to 1$ or $x\to 0$. The only interesting question is then how long it will take to reach this state dependent on the system size N and the initial condition $x\left(0\right)$.

This picture becomes more complex if instead of a well-mixed population a defined neighborhood for each agent is assumed. This can be a social network where agents have links to other agents, or simply a lattice where the neighborhood is given by the geometry that defines the nearest and second nearest neighbors of an agent. Then, instead of the global frequency, the probability of an agent to adopt a given opinion depends on the local frequency,

${f}_{i}^{\sigma}$, of this opinion in the neighborhood of that agent:

Here ${\delta}_{x,y}$ means the Kronecker delta, which is 1 only for $x=y$ and zero otherwise. In this paper, we restrict ourselves to the one-dimensional regular lattice, where each agent is represented by a cell i that has precisely two neighbors. Different from the basic voter rule, in the calculation of the local frequency we have taken the opinion ${\theta}_{i}$ of the focal cell i into account. This adds some inertia to the dynamics as agent i counts towards the local minority/majority. It also avoids stalemate situations where the two neighbors have different opinions.

The linear voter model would assume that the transition rate

of a focal cell

i to change its opinion from

${\theta}_{i}=\sigma $ to

$1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}{\theta}_{i}=1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}\sigma $ is directly proportional to the local frequency

${f}_{i}^{\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}\sigma \right)}$,

i.e., the opposite opinion in the neighborhood. The expression of Equation (

17) is a specification of the more general transition probability

$p[({\theta}_{i},t+1)\leftarrow ({T}_{i},t)]$ used in Equation (

14). It takes into account that for frequency dependent processes the change of

${\theta}_{i}$ depends not directly on the local distribution

${T}_{i}$, but only on the local frequency

${f}_{i}^{\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}\sigma \right)}$.

We are more interested in the non-linear case, where the local frequency still plays a role, however the response to the opposite opinion can be different. The linear voter model can then be generalized into a majority rule where the transition toward the opposite opinion monotonously increases with the respective local frequency. Different from this, the minority rule would assume that the transition toward the opposite opinion monotonously decreases with the respective local frequency, i.e., the minority opinion is favored. Eventually, we could also have mixed rules with a non-monotonous frequency dependence, which become possible only if the local frequency depends at least on the opinion of three agents, as it is the case in our model.

Instead of an analytical expression, we specify the transition rate

$w\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}{\theta}_{i}\right|{\theta}_{i}=\sigma ,{f}_{i}^{\sigma})$ by using free parameters

${\alpha}_{0}$,

${\alpha}_{1}$,

${\alpha}_{2}$,

${\alpha}_{3}$ which cover all of the above cases in a general manner. With these, the transition rates shall be defined as follows:

The general case of four independent transition rates in Equation (

18) can be reduced to two transition rates

${\alpha}_{0}$,

${\alpha}_{1}$ by assuming a symmetry between the two states 0 and 1,

i.e.,

${\alpha}_{1}+{\alpha}_{2}=1$,

${\alpha}_{0}+{\alpha}_{3}=1$. The condition

then denotes the above mentioned majority rule, because the transition rate increases with an increasing fraction of the opposite opinion

$\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}\sigma \right)$ in the neighborhood. This process is also known as positive freqency dependent invasion. In ecology, it means that individuals of abundant species have a better chance to survive.

Opposite to that, for the so-called negative freqency dependent invasion process the probability that cell

i shall switch to the opposite opinion

$\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}\sigma \right)$ decreases with the fraction number of individuals of subpopulation

σ in the neighborhood. This implies

In an ecological context negative frequency dependent invasion means that individuals of a rare subpopulation have a better chance to survive.

The rate

${\alpha}_{0}$ in Equation (

18) applies for the case where cell

i is only surrounded by opinions of the same kind. In a deterministic model, there would be no force to change the current state. In a stochastic CA however all possible processes should have a certain non-zero probability to occur, therefore a rather small value

${\alpha}_{0}\equiv \epsilon ={10}^{-4}$ is used to avoid absorbing states in the dynamics. We will then only vary the remaining rate

$\alpha \equiv {\alpha}_{1}$, which is the only free parameter in the model. Table summarizes the different transition rates given the possible local configurations

${T}_{i}$. It should be read as follows: given that the local configuration is e.g.,

${T}_{i}\left(t\right)=\left\{001\right\}$ at time

t, the probability to find

${T}_{i}(t+1)=\left\{011\right\}$ is

α. If

$\alpha \le 1/2$ this transition rate increases with

${f}_{i}^{\left(1\right)}$, hence it denotes a majority voting rule, otherwise it denotes a minority voting rule.

Now that the transition rates for the nonlinear voter model are specified, we have different ways to proceed: (1) We can run stochastic computer simulations of the one-dimensional CA, to get some intuition about the dynamics and the role of the parameter

α. This will be described in the following

Section 3.2; (2) We can also derive an approximate macroscopic dynamics for the global frequency

$x\left(t\right)$, which will be done in

Section 3.3 and

Section 3.4 using two different approximations; (3) Eventually, we can use the closed-form description already derived in

Section 2.3, to numerically calculate

$x\left(t\right)$. The stochastic simulations will denote our reference case, used to compare the two different approximate macroscopic dynamics and the numerical calculations.

#### 3.2. Computer Simulations of the CA Model

For a first insight into the dynamics, we have conducted stochastic simulations of the CA described above. In this section, we will only refer to particular runs, to show some snapshots of the dynamics, while in

Section 4 also averaged simulation results of the global variables are discussed.

For our simulations, we have used a one-dimensional CA of

$N=640$ cells with periodic boundary conditions, where all cells are simultaneously updated. The discrete time scale is defined by generations. We have checked that the main results do not change if

N is increased to 6400. The initial configuration of the CA refers to a homogeneous distribution (within discrete limits) of both opinions,

i.e., initially each cell is randomly assigned one of the possible states,

$\{0,1\}$, with a probability that is equal to the initial global frequency

$x(t=0)$. At each time step, the transition rates are calculated for each cell according to Equation (

18) and the values are compared with a random number

`rnd` drawn from the intervall

$(0,1)$. If

`rnd` is less than the calculated transition rate, the respective transition process is carried out, otherwise the cell remains in its current state. Since each transition only depends on the current local configuration, memory effects are not considered here.

It follows from the above description that the case ${\alpha}_{0}=\epsilon =0$ and $\alpha =0$ refers to a deterministic positive majority voting rule, simply because the state of cell i never changes unless the two nearest neighbor cells have adopted the same opinion. However, then, it will always change, such that all three cells have the same opinion. Similarly, a deterministic minority voting rule is described by $\epsilon =0$, and $\alpha =1$.

The results of the computer simulations are illustrated in

Figure 2. We first note the remarkable difference in the outcome between majority and minority voting. In the latter, cells change their state frequently, however no pattern evolves because the cell always adopts an opinion different from the two neighbors, if these have the same opinion. Thus, we see a rather random and unstructured pattern, in which both opinions coexist with a long-term fraction

$x=0.5$. This overall behavior is also not changed in case of a stochastic dynamics.

**Figure 2.**
Evolution of the cellular automaton (CA) for different rules: (**left column**) majority voting; (**right column**) minority voting; (**top row**) deterministic dynamics; (**bottom row**) stochastic dynamics. Parameters: (top left) $\epsilon =0.0$, $\alpha =0.0$, (top right) $\epsilon =0.0$, $\alpha =1.0$, (bottom left) $\epsilon ={10}^{-4}$, $\alpha =0.2$, (bottom right) $\epsilon ={10}^{-4}$, $\alpha =0.8$. $N=640$, initial condition: $x(t=0)=0.5$, (black) indicates $\sigma =0$, (white) indicates $\sigma =1$.

**Figure 2.**
Evolution of the cellular automaton (CA) for different rules: (**left column**) majority voting; (**right column**) minority voting; (**top row**) deterministic dynamics; (**bottom row**) stochastic dynamics. Parameters: (top left) $\epsilon =0.0$, $\alpha =0.0$, (top right) $\epsilon =0.0$, $\alpha =1.0$, (bottom left) $\epsilon ={10}^{-4}$, $\alpha =0.2$, (bottom right) $\epsilon ={10}^{-4}$, $\alpha =0.8$. $N=640$, initial condition: $x(t=0)=0.5$, (black) indicates $\sigma =0$, (white) indicates $\sigma =1$.

In contrast, for the majority voting rule we observe the formation of local clusters of the same opinion. In the deterministic case, the resulting pattern becomes stationary very fast, such that the clusters remain rather small. In the stochastic case, however, we observe the interesting phenomenon that very large clusters of cells with the same opinion emerge. These domains do not grow up to system size because of the small perturbation

$\epsilon ={10}^{-4}$, that enables cells to adopt a different opinion even in a homogeneous neighborhood. The non-linearity (

$\alpha =0.2$ is different from the linear case

$\alpha =1/3$) then amplifies such random deviations. This phenomenon has been discussed, for the case of a two-dimensional CA, in [

8]. As a consequence, we observe a correlated coexistence of both opinions characterized by the facts that (i) each opinion at times is the majority and (ii) forms larger clusters, and (iii) the dynamics is always in non-equilibrium. This is clearly visible in the long-term behavior, shown in

Figure 2 (bottom left).

In

Section 4, we further investigate these different regimes numerically, after we have derived some appropriate approximations of the macroscopic dynamics.

#### 3.3. Derivation of the Macroscopic Dynamics

A formal description for the dynamics of the CA starts with the probability

$p({\theta}_{i},t)$ used in Equation (

1). By means of

we can obtain the key variable of the macroscopic dynamics

$\u2329{x}_{\sigma}\left(t\right)\u232a$, which is the expected global frequency of each opinion in the population. Note that, different from Equation (

15) where

${x}_{\sigma}\left(t\right)$ is used,

$\u2329{x}_{\sigma}\left(t\right)\u232a$ describes the ensemble average over very many simulations. This is of some importance when interpreting the results. For the linear voter model, it is known that the system converges always to consensus,

i.e.,

$x\to 0$ or

$x\to 1$. If we use the initial condition

$x\left(0\right)=0.5$, then in 50% of the cases we observe

$x\to 0$, and in 50%

$x\to 1$. However, when averaging over all of these outcomes, we find

$\u2329x\u232a=0.5$.

Assuming a master equation for the dynamics of

$p({\theta}_{i},t)$ with the transition rates specified in Equation (

18), we can derive a rate equation for

$\u2329{x}_{\sigma}\u232a$ as discussed in detail in [

8]:

Here, the summation is over all possible

${2}^{n}$ opinion patterns

${\underline{\sigma}}^{\prime}$ for the neighborhood of cell

i. These are binary strings

${\sigma}_{i-1}{\sigma}_{i+1}$ that indicate the particular values of the nearest neighbors

$i-1$,

$i+1$. With

$n=2$ neighbors, these would be 00, 01, 10, 11. Together with the possible state for cell

i,

i.e.,

$\sigma =0$ or

$\sigma =1$, the respective local frequencies

${f}_{i}^{\sigma}$ can be derived according to Equation (

16), such that all transition rates in Equation (

22) are specified. The solution of Equation (

22) would however require the computation of the averaged global frequencies

$\u2329{x}_{1,{\underline{\sigma}}^{\prime}}\u232a$ and

$\u2329{x}_{0,{\underline{\sigma}}^{\prime}}\u232a$ for all possible opinion patterns

${\underline{\sigma}}^{\prime}$ over time, which would be a tremendous effort. Therefore, in [

8] two analytic approximations have been discussed to solve this problem. Here, we only summarize the results.

In the first approximation, the so-called mean-field limit, the state of each cell does not depend on the opinion of its neighbors, but is only influenced via a mean field. In this case the opinion distribution factorizes:

and we find with

$\u2329x\u232a\equiv \u2329{x}_{1}\u232a=1-\u2329{x}_{0}\u232a$ for the macroscopic dynamics, Equation (

22) in the mean-field limit:

For the calculation of the

$\u2329{x}_{{\sigma}_{j}}\u232a$ we have to look at each possible opinion pattern

${\underline{\sigma}}^{\prime}$. The mean-field approach assumes that the occurence of each 1 or 0 in the pattern can be described by the global frequencies

x and

$\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}x\right)$, respectively (for simplicity, the abbreviation

$x\equiv \u2329x\u232a$ will be used in the following). Taking the example 100,

i.e.,

${\sigma}_{i}=0$ for cell

i and

${\sigma}_{i-1}=1$,

${\sigma}_{i+1}=0$ for its neighbors, would result in

$(1-\u2329x\u232a)\u2329{x}_{{\sigma}_{i-1}}\u232a\u2329{x}_{{\sigma}_{i+1}}\u232a=x{\left(1\phantom{\rule{1.00006pt}{0ex}}-\phantom{\rule{1.00006pt}{0ex}}x\right)}^{2}$. Inserting further the transition rates, Equation (

18), we find this way the equation for the mean-field dynamics as:

The fixed points of the mean-field dynamics can be calculated from Equation (

25) by means of

$\dot{x}=0$. In the limit

$\epsilon =0$, we find:

The three stationary solutions denote either consensus toward one of the opinions, or coexistence of both opinions with an equal share. In order to verify the stability of the solutions, we have further investigated the Jacobian

$d/dx$ of Equation (

25). The results can be concluded as follows: Below a critical reinforcement,

$0<\alpha <1/3$,

$x=0$ and

$x=1$ are stable attractors and

$x=0.5$ is the separatrix. Above the critical reinforcement,

$1/3<\alpha <1$,

$x=0.5$ becomes the stable attractor, while

$x=0$ and

$x=1$ are unstable. We will come back on this after discussing the second approximation.

#### 3.4. Pair Approximation

The mean-field approximation assumes that the local occurrence of opinions is determined by the global frequencies rather than by local interactions. A better approximation should take local correlations between neighboring cells into account. Our second approximation, the so-called pair approximation, is based on the assumption that the state of each cell i is only correlated to the states of each of its nearest neighbors, separately. i.e., the two neighbors are only correlated through the focal cell and not to each other. Therefore, the neighborhood ${\sigma}_{i-1}{\sigma}_{i}{\sigma}_{i+1}$ is decomposed in pairs of nearest neigbor cells, ${\sigma}_{i}{\sigma}_{i+1}$, ${\sigma}_{i-1}{\sigma}_{i}$ which are called doublets. The expected value of the global frequency of doublets is denoted as $\u2329{x}_{\sigma ,{\sigma}^{\prime}}\u232a$ where σ refers to the focal cell and ${\sigma}^{\prime}$ to one of its neighbors.

We can then approximate the global frequency of a specific opinion pattern

$\u2329{x}_{\sigma ,{\underline{\sigma}}^{\prime}}\u232a$ in Equation (

22) as:

where the

${c}_{\sigma |{\sigma}^{\prime}}$ are the correlations:

that depend on the doublet frequency

$\u2329{x}_{\sigma ,{\sigma}^{\prime}}\u232a$ and the global frequency of opinions

$\u2329{x}_{{\sigma}^{\prime}}\u232a$ neglecting higher order correlations.

${c}_{\sigma |{\sigma}^{\prime}}$ can be interpreted as the conditional probability that a randomly chosen nearest neighbor of a cell in state

${\sigma}^{\prime}$ is in state

σ. With the relations:

and using again

$\u2329x\u232a\equiv \u2329{x}_{1}\u232a$, these correlations can be expressed in terms of only

${c}_{1|1}$ and

$\u2329x\u232a$ as follows:

We then find for the macroscopic dynamics, Equation (

22), in pair approximation:

With respect to Equation (

30), Equation (

31) now depends on two variables,

$\u2329x\u232a$ and

${c}_{1|1}$. In order to derive a closed description, we need an additional equation for

${\dot{c}}_{1|1}$, that can be obtained from Equation (

28):

Equation (

32) requires additionally the time derivative of the global doublet frequency

$\u2329{x}_{1,1}\u232a$. We note that the three coupled equations for

$\u2329x\u232a$,

${c}_{1|1}$ and

$\u2329{x}_{1|1}\u232a$ can be easily solved numerically. In the

Appendix, we have derived explicit expressions for these equations for the one-dimensional CA discussed here, using the transition rates of Equation (

18).

## 5. Conclusions

With our investigations of the one-dimensional voter model, we want to achieve two goals: (i) a better understanding of the non-linearity in the voter dynamics that was mostly studied as a linear model, only; (ii) a probabilistic description, and possible approximations, of the dynamics for the fraction of opinion 1, $x\left(t\right)$.

The non-linearity can be easily expressed by means of a free parameter α. The critical value ${\alpha}^{c}=1/3$ distinguishes between two different rules, majority voting ($\alpha >{\alpha}^{c}$) and minority voting ($\alpha <{\alpha}^{c}$), whereas $\alpha ={\alpha}^{c}$ refers to the border case of the linear voter model. It is known from well-mixed populations that majority voting should result in consensus, i.e., the asymptotic dominance of only one opinion, whereas minority voting should result in coexistence, i.e., the occurrence of both opinions in different fractions.

Our main focus was the role of local correlations in determining this outcome. For this we have used one-dimensional cellular automata (CA) in which each cell

i, characterized by its opinion

${\theta}_{i}$, has a defined neighborhood of two cells with possibly different opinions, denoted as triplet. The transition probability of a cell to change its opinion is then determined by the local frequency of opinions in its neighborhood (including the focal cell) and the non-linear response to this information, expressed by means of

α. The values of

α define a certain probability to switch to the opposite opinion,

i.e., we can use them to switch between a deterministic (

$\alpha =0,1$) or a stochastic (

$\alpha \ne 0,1$) dynamics. In the latter case, we have further assumed a very small probability

${\alpha}_{0}\equiv \epsilon ={10}^{-4}$ to perturb a state of complete consensus, which allows a non-stationary dynamics of the CA as shown in

Figure 2 (bottom right).

While the minority rule only results in random coexistence of the two different opinions, the majority rule generates more interesting results. In particular, we observe a correlated coexistence characterized by the formation of large domains of the same opinion which change continuously. i.e., we have a non-equilibrium dynamics in which each opinion, for a certain time, can form large clusters of the majority opinion.

The question then is how to describe this dynamics mathematically. In this paper, we follow a probabilistic approach, i.e., each cell has a certain probability of a given opinion $\sigma \in \{0,1\}$, which also depends on the probabilities of the nearest neighbors, second-nearest neighbors, and so forth. In order to close the dynamics, we have proposed three different approximations at different levels of the description.

The first level is the aggregated description in terms of the global fraction of opinion 1, $x\left(t\right)$, for which we derive a dynamics for the expected value, $\u2329x\left(t\right)\u232a$. On this level, we discuss two approximations. The simplest one is the mean-field approximation, in which no correlations between neighboring states are considered. So, we call this the zero-order approximation. It gives us a prediction for $\u2329x\left(t\right)\u232a$ derived from the well-mixed case. In contrast, the pair approximation considers a correlation between a cell and its neighbor, i.e., the triplet consisting of a cell and its two neighbors is decomposed in two cell-neighbor pairs. Correlations between neighbors are not considered, so we call this the 1st-order approximation. Hence, we have a prediction for $\u2329x\left(t\right)\u232a$ coupled to the dynamics of the pair correlations ${c}_{1|1}\left(t\right)$.

The third approximation does not refer to the aggregated level, but to the stochastic dynamics of a triplet, i.e., a cell with its two neighbors, that is determined by the larger neighborhood of a quintuplet, i.e., considers also the second nearest neighbors of the cell. Therefore, we call this the 2nd-order approximation. Using certain assumptions, we are able to provide a closed form dynamics for this larger neighborhood in terms of a probabilistic equation.

To compare the validity of these mathematical approximations, we use as a reference case stochastic computer simulations of the one-dimensional CA, which are averaged over a larger number of runs. We have discussed the majority and the minority voting, as well as the deterministic and the stochastic dynamics, separately.

In conclusion, we can summarize that the zero-order approximation only predicts the stationary outcome of the minority voting correctly, but fails for the majority voting rule, both with respect to the dynamics and the stationary outcome.

The first-order approximation performs comparably well in comparison to the second-order approximation only for the majority voting. The stationary outcome is correctly predicted both for the deterministic and the stochastic case, also the dynamics is covered fairly good. However, the first-order approximation fails to predict the dynamics of the minority voting. This case is only well covered by the second-order approximation that gives not only a correct description of $\u2329x\left(t\right)\u232a$, but also of the pair correlations $\u2329{c}_{1|1}\left(t\right)\u232a$.

Commenting specifically on the correlations, we recall again that both the first and second-order approximations lead to comparably good asymptotic results only for the majority voting. But they clearly predict a faster formation of domains, i.e., a convergence to their stationary value, as compared to the CA simulations. This limits their usability to fully understand the emergence of long-range correlations. In the case of minority voting, our first-order approximation fails, while the second-order approximation for $\u2329{c}_{1|1}\left(t\right)\u232a$ could be even seen as accurate in its computational prediction. This is not so surprising if we recall that minority voting rules, different from majority voting, do not result in long-range correlations.

Hence, we can conclude that the quintuplet approximation that covers also the second-order neighborhood, is accurate enough to describe the dynamics of the CA on the macroscopic level. While it is of course understandable, that an approximation that considers more information is usually more accurate, we should also relate this conclusion to the computational effort. Here, it turns out that the 2nd-order approximation, although stochastic,

i.e., needs to be averaged over a number of runs, performs very fast because of the closed form dynamics. Of course, the 1st-order approximation, the closed form of which is given in the

Appendix, is computationally even simpler. But there is a trade-off with the accuracy of the prediction. Still, as long only majority voting is considered, the 1st-order approximation should be preferred, both for simplicity, accuracy and computational effort.

Our last remark is about the coexistence of the two opinions, which is the more interesting scenario compared to consensus,

i.e., the existence of only one opinion. Here, we are not so much interested in the trivial case of random coexistence without any structure formation, which is characterized by

$\u2329x\u232a=0.5$ and

${c}_{1|1}=0.5$. We focus more on the case of correlated coexistence which has an interesting complexity because of the formation of local structures, nicely shown in

Figure 2 (lower left). On the level of our approximations, this state is characterized by

$\u2329x\u232a=0.5$ and

${c}_{1|1}\to 1$.

i.e., both opinions form at times large domains, indicated by the high pair correlation, but none of the two opinions entirely dominates the dynamics, as the expected value for its fraction is about 0.5.

Such insights can be generalized to other cases of frequency dependent processes which e.g., play a role in population ecology (invasion or extinction of species). For most of these applications a two-dimensional CA is more appropriate as it was discussed in [

8]. The one-dimensional CA investigated here, on the other hand, allows a mathematical approximation of the stochastic dynamics in terms of the 2nd-order neighborhood, which gives a much higher predictive power. This formalism can be also used for other frequency dependent processes in one-dimensional CA.