Evolutionary Processes in Quantum Decision Theory

The review presents the basics of quantum decision theory, with an emphasis on temporary processes in decision making. The aim is to explain the principal points of the theory. How an operationally-testable, rational choice between alternatives differs from a choice decorated by irrational feelings is elucidated. Quantum-classical correspondence is emphasized. A model of quantum intelligence network is described. Dynamic inconsistencies are shown to be resolved in the frame of the quantum decision theory.


Introduction
In recent years, there has been great interest in the possibility of formulating decision theory in the language of quantum mechanics. Numerous references on this topic can be found in the books [1][2][3][4] and review articles [5][6][7][8]. This interest is caused by the inability of classical decision theory [9] to comply with the behavior of real decision makers, which requires the development of other approaches. Resorting to the techniques of quantum theory gives hope for a better representation of behavioral decision making. There are several variants of using quantum mechanics for interpreting conscious effects. The aim of the present review is not the description of the existing variants, which would need too much space and can be found in the cited literature [1][2][3][4][5][6][7][8], but a survey of the approach suggested by the author and his coworkers. This approach was coined [10] quantum decision theory (QDT).
In the present review, we limit ourselves by considering quantum decision theory, but we do not touch other trends in the ramified applications of quantum techniques, such as quantum approaches in physics, chemistry, biology, economics and finance, quantum information processing, quantum computing, and quantum games. It is evident that all those fields cannot be reasonably described in a single review.
Although the theory of quantum games shares similarities with decision theory, there exists an important difference between the standard treatment of quantum games [11][12][13][14][15] and the main idea of quantum decision theory presented in the review. In the theory of quantum games, one usually assumes that players are quantum devices of a kind following quantum rules [16,17]. However, in the approach of quantum decision theory [10], decision makers do not have to be quantum devices; moreover, they can be real human beings. The mathematics of QDT is analogous to the mathematics in the theory of quantum measurements, where an observer is a classical human being, while the observed processes are characterized by quantum laws. In QDT, quantum theory is a technical language allowing for the description of decision processes. Quantum techniques turn out to be a very convenient tool for characterizing realistic human decision processes incorporating rational-irrational duality, because quantum techniques are designed for taking into account the dual nature of quantum world, such as the particle-wave duality. The mathematical generalization of decision-making processes by including into them the rational-irrational duality is one of the main achievements of QDT. Summarizing, the specific features of QDT, distinguishing it from many variants of decision theory employing quantum techniques, are as follows.
(i) The mathematics of QDT is analogous to that used for characterizing quantum measurements, so that there is a direct correspondence between QDT and the theory of quantum measurements. (ii) The approach is general, not only in the description of both decision theory and quantum measurements, but also in its mathematical formulation, allowing for the interpretation of various decision events or quantum measurements. (iii) The theory shows the difference between the decision making with respect to operationally testable events and the choice under uncertainty caused by irrational subconscious feelings. (iv) Single decisions and successive decisions are described on equal footing. (v) Temporal evolution of decision processes is formulated. (vi) Quantum-classical correspondence is preserved explaining the relation between quantum and classical probabilities. (vii) No paradoxes of classical decision theory arise. (viii) Different dynamic decision inconsistencies find natural explanation in the frame of QDT. (ix) The theory can be applied to single decision makers and also to decision-maker societies forming quantum intelligence networks.
Of course, not all above-mentioned aspects will and can be considered in detail in the present review. To describe all of them would require a huge book. We shall concentrate on the principal points the theory is based on and will emphasize temporal effects that have not been sufficiently discussed in the previous publications. In order to make clear for the reader the basic ideas of the QDT, it is useful to present the foundations in several steps. First, we need to define the process of decision making in the case of operationally testable events. This introduces the method of calculating quantum probabilities in the case of projection valued measures. Then it is straightforward to generalize the considerations to dual processes, when a decision is made by taking into account rational and irrational sides, which involves the use of positive operator-valued measures. Next, the approach to describing a society of decision makers, forming a quantum intelligence network, is presented. Finally, several dynamic decision inconsistencies are considered as an illustration of how they can be naturally resolved in the frame of QDT.

Operationally Testable Events
The mathematics of QDT is similar to that employed for treating quantum measurements. Therefore, each formula can be interpreted from two angles-from the point of view of quantum measurements or decision theory [10,[18][19][20]. Below, as a rule, we follow the interpretation related to decision theory, only occasionally mentioning quantum measurements. Some parts of the scheme below are familiar to quantum mechanics, but here we suggest the interpretation in the language of decision theory, which is needed for introducing the basic notions of QDT. These basic notions are introduced step by step in order to better demonstrate the logic of the QDT. We will not plunge into the details of numerous applications that can be found in the published literature. The main aim of this review is the explanation of the principal points of the approach, with the emphasis on the evolutionary side of decision processes.
The typical illustration of decision making is based on the choice between given alternatives, say forming a set {A n }, enumerated by the index n = 1, 2, . . . , N A . The alternatives correspond to events that can be operationally tested. Each alternative is represented by a vector |A n , where the bra-ket notation [21] is used. The vectors of alternatives pertain to a Hilbert space of alternatives being a closed linear envelope over an orthonormal basis. Decisions are made by a subject whose mind is represented by a Hilbert space that can be called the subject space of mind, or just the subject space. Thus, the total space where decisions are made is the decision space This is a closed linear envelope over an orthonormal basis, The alternatives from the given set, represented by vectors |A n , are assumed to be orthonormal to each other, which means that the alternatives are mutually exclusive. The alternative vectors are not required to form a basis. For each alternative, it is possible to show a corresponding alternative operator These operators enjoy the properties of projectorŝ whose family forms a projection-valued measure. The state of a decision maker is represented by a statistical operator that is a semi-positive trace-one operator, Tr Hρ (t) = 1 , depending on time t and acting on the space H. In decision making,ρ is termed the decision-maker state or just the decision state. The pair {H, ρ(t)} is a decision ensemble. The probability of observing an event A n is p(A n , t) = Tr Hρ (t)P(A n ) , which is uniquely defined for a Hilbert space of dimensionality larger than two [22]. SinceP(A n ) acts on the space H A , probability (6) can be rewritten as where the trace is over the space H A , and with the trace over the space H S . Note that probability (7) can also be written as Expression (6), or (7), is the predicted probability of observing an event A n , or in decision theory, this is the predicted probability of choosing an alternative A n . When comparing the predicted theoretical probabilities with empirical probabilities, the former are interpreted in the frequentist framework [23,24]. Thus, p(A n , t) is the fraction of equivalent decision makers preferring the alternative A n at time t. This can be reformulated in the different way: p(A n , t) is the frequency of choosing the alternative A n by a decision maker, if the choice is repeated many times under the same conditions until the time t.
We say that an alternative A i is stochastically preferred (or simply preferred) to an alternative A j if and only if p(A i ) > p(A j ), and the alternatives A i and A j are stochastically indifferent if and only if p(A i ) = p(A j ).

Single Decision Making
Usually, one considers decision making as a process instantaneous in time, which is certainly an overly rough modeling. Here we stress the importance of dealing with realistic decision making developing in time. The evolutionary picture saves us from different inconsistencies arising in the toy model of instantaneous decisions [18,25].
Suppose at the initial time t = 0 we plan to make a choice between the alternatives from the given set. This implies a stage of preparation, when the decision maker is characterized by a stateρ(0). The evolution of the decision-maker state in time can be represented aŝ involving the evolution operatorÛ(t). To keep the state normalization intact, the evolution operator has to be unitary,Û By employing a self-adjoint evolution generatorĤ(t), the time transformation of the evolution operator can be written as the Lie differential equation with the initial conditionÛ (0) =1 .
A very important clause is the requirement that the considered decision makers be well defined individuals, who do not drastically vary in time; otherwise, there would be no meaning in predicting their decisions. In other words, the mental features of decision makers at one moment of time are to be similar to those at a different moment of time. Briefly speaking, we can say that the decision makers should have the property of self-similarity. This imposes a restriction on the form of the evolution generator. This restriction can be written as the commutator Recall that in quantum theory the operator commuting with the evolution generator is called the integral of motion. In our case, condition (12) does not mean that the evolution generator does not depend on time, but it tells us that the properties of this operator are in some sense invariant with time, preserving the similarity of the decision makers at different moments of time. Thus, in decision theory, condition (12) can be understood as the invariance of the properties of decision makers; that is, of the decision-maker self-similarity. In the theory of operator differential equations, commutator (12) is termed the Lappo-Danilevsky condition [26]. Under this provision, the evolution operator, satisfying the evolution Equation (10), with the initial condition (11), reads as This form of the evolution operator satisfies the group properties that can be represented as the property of functional self-similarity [27].
Till now, we have not specified the choice of a basis for the decision space H. It is convenient to accept as the basis that one composed of the eigenfunctions of the evolution generator, such that Generally, the eigenfunctions here can depend on time. However, there are two cases when this dependence is either negligible or absent at all. One possibility is when the eigenfunctions vary with time much slower than the eigenvalues E nα (t). Then there exists a time horizon till which the time variation of the eigenfunction can be neglected, which is regulated by a kind of adiabatic conditions [28]. The other case is the already assumed Lappo-Danilevsky condition (12) implying the decision-maker self-similarity. It is easy to show that the Lappo-Danilevsky condition (12) is equivalent to the validity of eigenproblem (14) with time-independent eigenfunctions. Exactly the same situation happens in quantum theory in the case of nondestructive or nondemolition measurements [29][30][31][32][33][34][35].
This basis is complete, since the evolution generator is self-adjoint. Then where In this way, we find the expression for the time-dependent predicted probability of choosing the alternative A n at time t, Again, to compare this expression with experimental observations, it is reasonable to interpret it as a frequentist probability [23,24], although other interpretations are admissible [36].
The evolution generator is defined on the decision space H and characterizes the velocity of the change of the decision-maker state caused by the influence of the set of alternatives. The impact of the evolution generator on the decision-maker state is assumed to be finite, such that, if the decision process starts at time t A and ends at time t A + τ A , being taken in the time interval [t A , t A + τ A ], then This impact can be asymptotically large, remaining finite. The speed of the decision process can be quantified by a rate parameter g, which the eigenvalue E nα (t) = E nα (t, g) depends on in such a way that in the limit of a slow process while under a fast process In the case of a slow process, This yields the probability which implies that the decision state practically does not change: When the process is fast, in the sum (17) one has This leads to the probability with the stateρ whereP m ≡ | m m | .
In the intermediate cases of the process rate, one has to resort to the general probability (17).

Changing Initial Conditions
Assume that a subject made a decision during the time interval [t A , t n ≡ t A + τ A ] choosing the alternative A n . Then what would be the following evolution of the probability p(A n , t)? Strictly speaking, there is nothing special in the fact of one subject making a decision at any moment of time. According to the frequentist understanding, the probability gives a distribution over an ensemble of decision makers or over many repeated decisions. This means that the probability yields a fraction of decision makers (or the fraction of decisions) preferring this or that alternative. In the following moments of time, the probability will continue to be defined by Equation (17).
However, one can put the question in a different manner: Suppose we are interested in the decision making of just a single subject who certainly chose the alternative A n , so that at the moment t n the probability became one, instead of that prescribed by Equation (17). Then what would be the following evolution of the probability? Again, there is nothing extraordinary in that case. Before making a decision, the predicted probability was described by Equation (17). After making the decision, if we insist that a posteriori probability became one, this means that we have to replace p(A n , t n ) by one, treating the latter as a new initial condition. Thus, the replacement p(A n , t n ) −→ 1 (25) means nothing but the change of the initial condition for the equation describing the evolution of the decision state. The related replacementρ assumes that starting from the moment of time t n , we treat as the new initial condition the state defined by the equality Tr AρL (t n )P(A n ) = 1 .
The simplest solution for the latter equation is the Lüders state [37] ρ L (t n ) =P Note that hereρ L (t n ) is the a posteriori post-decision state, whileρ A (t n ) is the a priori anticipated state; that is,ρ The replacement (26) often is labeled "collapse". For wave functions, this replacement is equivalent to the sudden change from a state |ψ to the state |A n . It is really easy to check that the wave function "collapse" implies the replacement Under the name "collapse", one means a sudden jump of the state, which can be dramatic, provided the wave function or decision state describes real matter. However, a decision state and a wave function are nothing but the probabilistic characteristics that can be used for calculating and predicting a priori quantum probabilities satisfying evolution equations. Fixing a decision state at some moment of time simply means fixing new initial conditions for the state evolution. Thus, the Lüders state is just the new initial condition for the moment of time t n , after which again the predicted probabilities have to be found from the state evolution with the evolution operatorÛ This defines the decision statê after the time t n . Respectively, this decision state gives the probability of choosing an alternative A m after the time t n , Comparing Equations (29) and (32) yields the initial condition Following the same steps as in the previous section, we get the time evolution of the probability for t > t n , where In the limit of a slow process, this results in And under a fast process, we havê Let us consider the situation, when a subject, after fixing the initial condition (29), where the alternative A n was chosen, is interested in finding the probability of deciding on the alternative A m . In the slow-process limit, we get Introducing the Wigner [38] probability yields the relation where This equation reminds us the relation between the classical joint and conditional probabilities. Therefore, sometimes one is tempted to interpret the Wigner probability, p W (A m , t n ) as a joint probability of two events, A n and A m , and the Lüders probability p L (A m , t n ) as a conditional probability of the event A m under the event A n that previously happened. This interpretation, however, cannot pretend to provide a generalization of classical probabilities to quantum theory. This, first of all, because the derived relation is a very particular case of extremely slow processes, when g → 0, but not the general expression. Second, this relation is not valid for all times, but it only connects the terms at time t n . What is more important is that this relation contains the terms taken from different sides of t n ; that is, connecting the predicted a priori probability with the post-decision a posteriori probability, But the meaningful definition of a probability should be valid for any time t > t n . Third, the generalization of a classical expression assumes to contain the classical one as a particular case. But it is easy to check that the Lüders probability is which is symmetric with respect to the interchange between A m and A n . For commuting events, corresponding to the classical case, the Lüders probability becomes trivial δ mn . Contrary to this, the classical conditional probability is neither symmetric nor trivial, which is confirmed by numerous empirical observations [39,40]. In the best case, the Lüders probability is just a transition probability [18,25].
Sometimes one tries to save the interpretation of the Lüders probability as a conditional probability by resorting to the assumption of degenerate quantum states. This, nevertheless, does not save the situation because of several reasons: The origin of degeneracy is never defined, which makes the assumption groundless. The degeneracy is not important, since it always can be lifted by infinitesimally small variations of the problem [18,25]. Finally, the degeneracy does not make classical expressions a particular case of quantum formulae; i.e., there is no quantum-classical correspondence [18,25,41].
As we see, the consideration of realistic decision processes developing across time helps us to avoid misrepresentation of the obtained expressions.
The use of formal relations, neglecting evolutionary processes, can lead to meaningless results like negative and complex probabilities [18,25,42].

Successive Decision Making
When we consider the alternatives from the same set, say {A n }, the probability of any A n is described by the approach elucidated in the previous sections, which leads to the set of the probabilities {p(A n , t) : n = 1, 2, . . . , N A }. A rather different problem is the definition of successive decisions with respect to alternatives from different sets. Below we consider this problem by employing some of the methods from the theory of quantum measurements [21,[42][43][44], essentially modifying them, however, in order to adjust to quantum decision theory [18,25].
Suppose there are two sets of alternatives, {A n : n = 1, 2, . . . , N A } and {B k : , a subject makes a decision with respect to the alternatives A n . Then, in the time According to the general approach of defining quantum joint probabilities [18,25], each set of alternatives is represented by the related set of alternative vectors and alternative projection operators, Respectively, there are two spaces of alternatives: whose bases, in general, are different. Recollecting the existence of the subject space of mind H S , we have the decision space The decision-maker state is subject to the time evolution (9). The evolution operator satisfies the evolution equation (Equation (10)), now with the evolution generator Again it is convenient to choose the basis composed of the eigenfunctions of the evolution generator, such thatĤ This assumes the self-similarity of the decision maker in the form of the Lappo-Danilevsky condition (12), because of which we havê where The joint probability that, first, a decision on A n is made (an event A n happens) and later a decision on B k is made (an event B k occurs) is defined as This also can be written as whereρ Expanding this results in the expression for the sought joint probability of choosing first the alternative A n , and later, the alternative B k , yields The process of making decisions is supposed to be smooth, such that its impact does not result in finite-time divergences, The quantity E nkα (t) characterizes the speed of the process of the subject deliberating about the given alternatives. For convenience, it is possible to define a rate parameter, entering the eigenvalue in such a way that a slow process would imply while a fast process would mean that When the process is slow, we have Then the corresponding probability becomes while the decision state reads asρ It is useful to stress that even if at the initial moment of time the decisions on A n and B k formally look to be not explicitly connected, so that the decision state does not factorize anyway, being entangled through the subject's mind. Only in the extreme, hardly realistic cases when at the initial moment of time all processes are absolutely not correlated, so that the joint probability factorizes: Thus, even deciding on different alternatives, the total decision state, generally, remains entangled, thereby producing entangled states in the process of decision making [45]. The produced entanglement can be measured [46][47][48][49][50][51].
For a fast process, we have Then the decision state reduces tô It is important to emphasize that the joint probability (48) is real-valued, It is not symmetric with respect to the interchange of A n and B k , since the evolution generator (44) depends on the order of decisions, the first being A n and the second being B k . As is postulated at the beginning of this section, at the time interval [t A , t A + τ A ], a subject makes a decision on the alternatives A n . Then, in the time This means that the evolution generator can be presented aŝ From here its dependence on the order of decisions is evident. We may notice that the interchange of the decisions on A n and B k is similar to the inversion of time, hence Additionally, it is useful to compare the joint probability (49) with the Kirkwood distribution [52]. The latter is defined for two coinciding in time events, whose projectors pertain to the same space of alternatives H A = H B , which gives It easy to see from the complex conjugate expression which tells us that the Kirkwood distribution, generally, is complex-valued. Thus, the complex Kirkwood distribution and the real joint probability (49) are principally different.

Dual Decision Process
Subjects rarely make decisions based solely on the usefulness of alternatives, as prescribed by utility theory [9], especially when decisions are to be made under uncertainty. The choice between alternatives is practically always not purely objective and based on well defined values that could be quantified, but strong subjective feelings, biases, and heuristics are involved in the process of making decisions [53,54]. Intuition and emotions are also subjective, but they help us to make decisions, making them part of the decision-making process [55]. Taking account of subjective feelings and emotions in making decisions is important for the problem of human-computer interaction [56] and the creation of artificial intelligence [57,58].
Understanding that human decision making involves two sides, the rational evaluation of utility and intuitive irrational attraction or repulsion towards each of the alternatives has been thoroughly investigated and elucidated in the approach called dual-process theory [59][60][61][62]. Quantum techniques seem to be the most appropriate for portraying the duality of human decision processes, since quantum theory presupposes the so-called wave-particle duality. Below we show how the dual-process approach [59][60][61][62] is formulated in quantum language [10,46,[63][64][65].
Thus, to describe decision making by real subjects it is necessary to take into account the rational-irrational duality of decision making. First, we need to say several words on the meaning of the notion "rational" that can be understood in different senses [66]. It is necessary to distinguish between the philosophical and psychological meanings of this term. In philosophical writings, one gives the definition of "rational" as all which leads to the desired goal [67]. However, under that definition, any illogical uncontrolled feeling that leads to the goal should be named rational. In decision making, one uses the psychological meaning of the term "rational" as what can be explained and evaluated logically and what follows explicitly formulated rules. On the contrary, emotions, intuition, and moral feelings cannot be logically and explicitly formulated and quantified, especially at the moment of making a decision. Thus, "rational" in decision making is what can be explicitly formulated, based on clear rules, deterministic, logical, prescriptive, normative. While "irrational" is just the opposite to "rational".
In this way, one has to separate the psychological and philosophical definitions of "rational" or "irrational". The psychological definition assumes that the distinction between "rational" and "irrational" is done at the moment of making a decision. Conversely, the philosophical definition of "rational" as what leads to the goal is a definition that can work better afterwards, when the goal has been reached. Only then it becomes clear what was leading to the goal and what was not. At the moment of making a decision, it is not always clear what leads to the goal and what does not.
The separation of decision making into rational and irrational has a simple and well defined psychological meaning based on actual physiological processes in the brain [68]. One often calls rational processes conscious, while irrational processes as subconscious.
From the mathematical point of view, the above can be formulated as follows. Let us consider the choice between a set of alternatives A n . An alternative is represented by a vector |A n in a Hilbert space of alternatives H A and by a projectorP(A n ) acting on this space. The subject space of mind is H S . Thus, the decision space is As is explained above, when choosing between alternatives, in addition to the rational evaluation of the utility of each alternative, the decision maker experiences irrational feelings. This implies that each alternative A n , and its representation |A n in the space of alternatives H A , is complimented by a set of subjective feelings z n represented by a vector |z n in the subject space of mind H S . That is, in the process of decision making one compares not merely the alternatives A n , but the composite prospects π n ≡ A n z n .
Being a member of the subject space of mind, the vector |z n allows for the expansion in which b nα are random quantities, which signifies a non-deterministic character of irrational feelings and emotions. We do not impose an excessive number of conditions; instead we try to limit ourselves by the minimal number of restrictions. The required conditions will be imposed on the resulting expressions describing probability. Therefore, the vectors |z n are not forced to be obligatory normalized, so that the scalar product does not have to be one. Additionally, the vectors |z n with different labels are not orthogonal to each other. Respectively, the operator is not a projector, sinceP The prospect A n z n is represented by the vector in the decision space H. The prospect operator P(A n z n ) ≡ | A n z n z n A n | =P(A n ) P (z n ) , with the propertyP (A m z m )P(A n z n ) = δ mn z n | z n P (A n z n ) , is not idempotent and is not a projector. The resolution of unity is understood in the weak sense as the equality on average which implies The extended version of the latter equality reads as The set of the operators {P(A n z n )} forms a kind of the positive operator-valued measure [69]. Thus, making a decision on an alternative A n , a decision maker, as a matter of fact, considers a prospect π n = A n z n , as far as, in addition to the rational quantification of the alternative utility, this alternative is subject to irrational subconscious evaluations. A decision maker not merely makes a decision on the usefulness of an alternative, but also experiences feelings of attraction or repulsion to this alternative. Thus, the probability of choosing an alternative is in fact a behavioral probability of the associated prospect comprising a decision on the utility and on the attractiveness of the alternative [65]. Keeping in mind that de facto, one practically always considers prospects associated with the given alternatives, and in order not to complicate notation, we shall denote the behavioral probability of a prospect p(π n , t) ≡ Tr Hρ (t)P(π n ) ≡ p(A n , t) as identified with the alternative probability p(A n , t) = Tr Hρ (t)P(A n z n ) .
The resolution of unity (70) or (71) guarantees the probability normalization Separating in expression (73) from off-diagonal terms we obtain the behavioral probability In the theory of quantum measurements, the diagonal term corresponds to classical theory, while the second, off-diagonal term is due to quantum interference and coherence. The transition from quantum to classical measurements and from quantum to classical probabilities is called decoherence [70]. Thus, the first term corresponds to a classical probability enjoying the properties In decision theory, the classical probability is responsible for a rational choice of alternatives and can be named rational fraction or utility fraction, since its value is prescribed by the utility of the alternative [25,46,65,71,72].
The second term, entirely due to quantum effects, has the properties following from Equations (77) and (78). The property (79) The quantum term q in decision theory is associated with irrational feelings characterizing the emotional attitude of the decision maker to the quality of considered alternatives, because of which it can be called irrational factor, attraction factor, or quality factor [25,46,65,71,72].
As is seen, for an alternative A i to be stochastically preferred over A j , so that p(A i ) > p(A j ), it is not always sufficient to be more useful, but it is necessary to be sufficiently attractive. An optimal alternative is that possessing the largest probability among the set of the considered alternatives.
For any generalization of decision theory, it is very important to include as a particular case the classical decision theory based on expected utility. Quantum decision theory [25,46,65,71,72] satisfies this requirement. The return to classical decision theory happens when the quantum term q tends to zero. This can be called the quantum-classical correspondence principle: In the theory of quantum measurements, the disappearance of the quantum term, corresponding to quantum coherence, is called decoherence. The vanishing of this term can be due to the influence of surrounding environment [70,[73][74][75] or to the action of measurements [35,76]. In decision theory, the decoherence can be caused by the influence of society providing information to the decision maker [77]. In both these cases, random disturbances from either surrounding or measurement devices, lead to the irreversibility of time arrow [78,79].

Evolution of Behavioral Probability
The temporal evolution of the behavioral probability (77) can be treated as in Section 3. The time dependence enters the matrix element If the process in the space of mind is asymptotically slow, then and we get αA n |ρ(t) | A n β αA n |ρ(0) | A n β .
This means that f (A n , t) f (A n , 0) , q(A n , t) q(A n , 0) , so that the probability practically does not change: In the opposite case of a fast process, when Then the rational fraction reads as while the quantum term, characterizing irrational feelings, vanishes: Therefore, the behavioral probability reduces to the rational part, Successive decision making can be described similarly to Section 5. For two successive decisions, one has to consider the probability p(B k A n , t) = Tr Hρ (t)P(B k z k ) P (A n z n ) .
The above results demonstrate the essential dependence of the irrational term q from the speed of making decisions. The fact that, under a slow decision process, the probability practically does not vary, remaining close to that existing at the initial time, is not surprising. What is more interesting is that under a fast decision process, the quantum term vanishes. This can be explained as follows. The quantum term is caused by the interference and entanglement of subconscious feelings in the subject consciousness. In order that these processes happen, they need some time. However, in the case of extremely fast decision making, the subject just has no time for deliberation when the acts of coherence and entanglement could develop.

Evaluation of Initial Probability
Before considering any evolutional process, it is necessary to prescribe initial conditions corresponding to the initial moment of time that can be set as t = 0. The initial probability is formed by the initial rational fraction and irrational factor, To estimate the value of the initial rational fraction, it is possible to use the Luce rule [80], according to which, if the alternative A n is characterized by an attribute a n , then the alternative weight can be written as f (A n ) = a n ∑ n a n (a n ≥ 0) .
For concreteness, let us study the case where alternatives are represented by lotteries where x i are outcomes and p n (x i ) are the outcome probabilities normalized so that Employing a utility function u(x), one can define [9] the expected utility The rational fraction describes the classical weight of an alternative that in the present case is connected with the expected utility in the following way [71,72]. If the expected utilities of all alternatives are semi-positive, they can be associated with the alternative attributes, whereas if all utilities are negative, the alternative attributes can be defined by the inverse quantities In the case wherein among the expected utilities there are both positive and negative utilities, it is possible to use the shift taking account of the available wealth U 0 . This is done in the following way. Finding the minimal negative utility interpreted as the wealth available to decision makers before they make decisions. Then the shifted utilities are used instead of U(A n ), and we return to the case of semi-positive expected utilities. The so defined rational fraction, or utility fraction, enjoys the natural properties: For semi-positive utilities, the fraction tends to zero, when its utility tends to zero, and tends to one, when its utility is very large, For negative expected utilities, we have and respectively, When considering empirical data, the rational fraction characterizes the fraction of decision makers making decisions on the basis of rational normative rules. A more elaborate expression for the rational fraction can be derived by employing the minimization of an information functional [65,71,72].
The irrational factor q, as has been explained above, is a random quantity distributed over the interval [−1, 1]. However, being a random variable does not preclude it to possess a typical average quantity. The ways of defining a non-informative prior for q are described in [46,65,71,72]. Below we present a slightly modified derivation of the non-informative prior for the irrational factor.
Let the related distribution function be denoted as ϕ(x). This distribution is normalized Then the average value of a positive irrational factor is defined as Respectively, the average value of a negative factor is Proposition 1. The non-informative priors for the average irrational factors are Proof. With the notation the normalization condition (103) takes the form In the average irrational factors (104) and (105), x is a monotonic function, while ϕ(x) is integrable. Therefore, employing the theorem of average yields where As a non-informative prior for the values of x ± and λ ± one takes the averages over the domains of their definition, that is, Substituting this into quantities (104) and (105) proves equalities (106).
This property is termed the quarter law. The mentioned above alternation law and the quarter law can be used for estimating the aggregate values of the behavioral probabilities according to the rule The sign of the irrational factor is prescribed in accordance to the alternative being attractive or repulsive [46,65,71,72].
This approach was employed for explaining a number of paradoxes in classical decision making [46,63,64,71,72,81,82] and was found to be in good agreement with a variety of experimental observations [65,71,72,83].

Quantum Intelligence Network
In the previous sections, we have considered quantum decision making by subjects that act independently of each other. Even then the probability of alternatives could vary with time due to the entangling properties of the evolution operator inducing entanglement in the decision space [50,51,84], in addition to that caused by the initial subject state of mind [85]. The situation becomes much more involved when subjects exchange their information with each other. When there is a society of subjects interacting through the exchange of information, we obtain a network. Since the agents make decisions, we have an intelligence network. The probabilities of alternatives will vary owing to the informational interaction. From empirical investigations, it is known that decision makers do alter their decisions as a result of mutual charing of information [86][87][88][89][90][91][92][93]. This type of intelligence networks plays an important role as prolegomena into the problem of artificial intelligence [94][95][96].
Assume that we are considering a society of N agents enumerated by the index i = 1, 2, . . . , N, who decide with respect to alternatives A n , with n = 1, 2, . . . , N A . Thus, the probability for an i-th agent deciding on an alternative A n at the moment of time t is p i (A n , t). Each probability satisfies the standard normalization Accordingly, the related rational fraction f i (A n , t) is normalized as And for the irrational factor, one has To give a realistic description for the temporal evolution of an intelligence network, it is required to take into account that decision making needs some time after receiving information. Denoting this delay time as τ, allows us to represent the dynamics of a behavioral probability as The rational fraction weakly varies with time, so that on the time scale shorter than the discounting time [97] it can be treated as constant. Its value is prescribed by the utilities of the given alternatives, as is explained in Section 8.
The time dependence of the irrational factor can be derived following Section 7, similarly to the derivation of the coherent interference term in the theory of quantum measurements, where the role of irrational subconscious feelings is played by the random influence of environment [35,76,78,79,98]. If a decision maker at time t accumulates the amount of information M i (t) from other members of the society, then the irrational factor is discounted from its initial value, becoming equal to The amount of the accumulated information composes [77,99] the information-memory functional in which J ij (t, t ) is the interaction-memory function and µ ij (t) is the information gain by a subject i from a subject j at time t. To exclude self-action, one has to set either J ii = 0 or µ ii = 0. At the initial moment of time there is no yet additional information, so that one has the initial condition The information gain can be modeled by the Kullback-Leibler [100,101] relative information The information gain (120) is semi-positive, µ ij ≥ 0, due to the inequality ln x ≥ 1 − 1/x. As is evident, µ ii = 0. Depending on the range of the interactions between the agents and the longevity of their memory, there can arise different situations whose typical examples are as follows.
Long-range interactions have the form Short-range interactions act only between the nearest neighbors, where δ <ij> equals one for i and j being the nearest neighbours and is zero otherwise.
Long term memory, that lasts forever, does not depend on time, Short-term memory, on the contrary, corresponds to the situation, when only the last step is remembered, Combining these ultimate cases gives us the following four possibilities for the information-memory functional.
Long-range interactions and long-term memory: Long-range interactions and short-term memory:

Short-range interactions and long-term memory:
Short-range interactions and short-term memory: Keeping in mind modern human societies, we have to accept that long-range interactions are more realistic. This is because the modern-day information exchange practically does not depend on the location of interacting agents who are able to exchange information through phone, Skype, Zoom, etc.

Case of Two Alternatives
A very frequent situation is when agents have to decide between two suggested alternatives, that is, when N A = 2. Then it is straightforward to simplify the notation by setting Now the dynamics is governed by the equation with the irrational factor The information gain takes the form As is seen, µ ii = 0. Setting initial conditions, it is necessary to obey the restriction Another realistic simplification could be when the considered society consists of two types of agents essentially differing by their initial decisions. Then the question is: How the initial decisions would vary with time being caused by the mutual exchange of information? Will agents with different initial decisions come to a consensus, as it often happens after a number of interactions [102,103]?
When the society can be divided into two parts of typical agents, with each part having similar initial decisions within the group, but essentially different initial decisions between the groups, the situation becomes equivalent to the consideration of two typical agents with these different initial decisions. As is explained above, long-range interactions are more realistic for the information exchange in the modern society. Below we present the results of numerical calculations for this type of interactions. The behavior of the society strongly depends on the type of memory the agents have.
Long-term memory. In the case of long-term memory, the information-memory functionals for two groups are The rational fractions are kept constant in time and the parameters are set J = 1 and τ = 1. The society dynamics is strongly influenced by the initial decisions. There can happen two types of behavior depending on the relations between the initial rational fractions and irrational factors.
(i) Rational group conventions. There is the rational-irrational accordance in the initial choice of both groups. Then at the initial moment of time, one group, say the first group, estimates the utility of the first alternative higher than the second group. Taking account of irrational feelings keeps the same preference with respect to behavioral probabilities: Recall that f i (t) ≡ f i (A 1 , t) and p i (t) ≡ p i (A 1 , t). Respectively, if the first group estimates the utility of the first alternative lower than the second group, the irrational feelings do not change this preference: In the case of this rational-irrational accordance, independently of initial conditions, the behavioral probabilities tend to the respective rational fractions: (ii) Common convention. At the initial time, the inequalities between the rational fractions of the groups and the inequalities between their behavioral probabilities are opposite with each other, that is, one has either or In this case, the behavioral probabilities tend with time to the common convention: approximately equal to These results can be interpreted in the following way. In the situation of the rational-irrational accordance, the decision makers are more rational, while irrational feelings, such as emotions, play less important role. Under the prevalence of rational arguments, the agents are inclined to choose the alternative with a higher utility.
In the case of the rational-irrational discordance, the decision makers are forced to more efficiently exchange information, as a result of which they manage to develop a mutual convention.
Short-term memory. In this case, the information-memory functionals are We set again J = 1 and τ = 1. Numerical solution of the evolution equations reveals the existence of two types of possible dynamics.
(i) Group conventions. The behavioral probabilities for each group tend with time to their own limits not coinciding with the corresponding rational fractions: (ii) Everlasting fluctuations. The behavioral probabilities for both groups do not tend to any fixed point, but demonstrate everlasting oscillations. The details of the above numerical solutions can be found in [99].
In a society with short-term memory, there is no enough accumulated information for the formation of a common convention. Each group in such a society either develops their own goal, not necessarily rational, or constantly fluctuates without elaborating a consensus.

Dynamic Decision Inconsistencies
There are several so-called dynamic or time inconsistencies in decision making characterizing situations in which a decision-maker's preferences change over time in such a way that a preference at one moment of time can become inconsistent with a preference at another point in time. Below we consider some of these inconsistencies and show that they find quite natural explanations in QDT.

Question Order Bias
The order that several questions are asked in a survey or study can influence the answers that are given as much as by 40% [104,105]. That is because the human brain has a tendency to organize information into patterns. The earlier questions may provide information that subjects use as context in formulating their subsequent answers, or affect their thoughts, feelings, and attitudes towards the questioned problem. Sociological research gives a number of illustrations of this bias [105]. Thus, from a December 2008 poll we know that when people were asked, "All in all, are you satisfied or dissatisfied with the way things are going in this country today?" immediately after having been asked, "Do you approve or disapprove of the way George W. Bush is handling his job as president?" 88 percent said they were dissatisfied, compared with only 78 percent without the context of the prior question.
In classical probability theory, the probability of joint events is symmetric. This is contrary to the quantum decision theory. Assume an interrogator first asks a question A, so that the answer suggests two alternatives: yes (A 1 ) or no (A 2 ). After this, the interrogator poses another question B, also with the possible answers: yes (B 1 ) or no (B 2 ). As follows from Section 5, the probability p(A i B j ) does not equal the probability p(B j A i ), since they are defined through different decision states. The situation is similar to that occurring in the problem of quantum contextuality [106][107][108], where two random variables (A i B j and B j A i ) cannot be characterized by a single density matrix.
In that way, taking account of dynamic evolution in QDT shows that, in general, the alternatives are not commutative, in the sense that p(A i B j ) = p(B j A i ), which is in agreement with the empirical data.

Planning Paradox
In classical decision theory, there is the principle of dynamic consistency according to which a decision taken at one moment of time should be invariant in time, provided no new information has become available and all other conditions are not changed. Then a decision maker, preferring an alternative at time t 1 should retain the choice at a later time t 2 > t 1 . However, this principle is often broken, which is called the effect of dynamic inconsistency.
A stylized example of dynamic inconsistency is the planning paradox, when a subject makes a plan for the future, while behaving contrary to the plan as soon as time comes to accomplish the latter. A typical case of this inconsistency is the stop-smoking paradox [109,110]. A smoker, well understanding the damage to health caused from smoking, plans to stop smoking in the near future, but time goes by, "the future" comes; however, the person does not stop smoking. Numerous observations [109] show that 85% of smokers do plan to stop smoking in the near future; however, only 36% really stop smoking during the next year after making the plan. It is possible to pose the question: would it be feasible to predict the percentage of subjects who will really stop smoking during the next year, knowing that at the present time 85% of them plan on doing this. Below we show that QDT allows us to make this prediction [63].
Let us denote by A 1 the alternative to stop smoking in the near future, and by A 2 , the alternative to not stop smoking. Let us denote by B 1 the decision to really stop smoking, and by B 2 , the decision of refusing to really stop smoking. According to QDT, the corresponding probabilities are The utility factors do not change in time, so that the utility of planning to stop in the near future is the same as the utility to stop in reality and the utility to not stop in the near future is the same as that of not stopping in reality: Planning to stop smoking, subjects understand the attractiveness of this due to health benefits. Hence the average attraction factors, according to Section 8, are But as soon as one has to stop smoking in reality, one feels uneasy from the necessity to forgo the pleasure of smoking, because of which the attraction factors become This is to be complemented by the normalization conditions Since 85% of subjects plan to stop smoking, we have Solving the above equations, the fraction of subjects who will really stop smoking is predicted to be p(B 1 ) = 0.35 , p(B 2 ) = 0.65 . This is in beautiful agreement with the observed fraction of smokers really stopping smoking during the next year after making the decision [109], Thus, knowing only the percentage of subjects planning to stop smoking, it is straightforward, by means of QDT, to predict the fraction of those who will stop smoking in reality. This case is also of interest because it gives an example of preference reversal: when planning to stop smoking, the relation between the probabilities is reversed as compared to the relation between the fractions of those who have really stopped smoking,

Disjunction Effect
The disjunction effect is the violation of the sure-thing principle [111]. This principle states: if the alternative A 1 is preferred to the alternative A 2 , when an event B 1 occurs, and it is also preferred to A 2 , when an event B 2 occurs, then A 1 should be preferred to A 2 , when it is not known which of the events, either B 1 or B 2 , has occurred. This principle is easily illustrated for classical probability. Let B = B 1 + B 2 be the alternative when it is not known which of the events, either B 1 or B 2 , has occurred. For a classical probability, one has f (A n B) = f (A n B 1 ) + f (A n B 2 ) (n = 1, 2) .
From here, it immediately follows that if However, empirical studies have discovered numerous violations of the sure-thing principle, which was called the disjunction effect [112]. Such violations are typical for two-step composite games of the following structure. First, a group of agents takes part in a game, where each agent can either win (event B 1 ) or lose (event B 2 ), with equal probability 0.5. They are then invited to participate in a second game, having the right either to accept the second game (event A 1 ) or to refuse it (event A 2 ). The second stage is realized in different variants: One can either accept or decline the second game under the condition of knowing the result of the first game. Or one can either accept or decline the second game without knowing the result of the first game. The probabilities, as usual, are understood in the frequentist sense as the fractions of individuals making the corresponding decisions [99].
From the experiment of Tversky and Shafir [112], we have This shows that the alternative A 1 B is more useful than A 2 B, since which seems to agree with the sure-thing principle. However, f (A n B) is not yet the whole probability that reads as p(A n B) = f (A n B) + q(A n B) (n = 1, 2) .
Making a decision to play, without knowing the result of the first game, is less attractive, because of which q(A 1 B) = −1/4, while q(A 2 B) = 1/4. As a result, we find which is in good agreement with the empirical data of Tversky and Shafir [112], The dynamic consistency of the disjunction effect can be analyzed as in Sections 9 and 10. Details can be found in [99], where it is shown that, when decision makers are allowed to exchange information, the absolute values of the attraction factors diminish with time. This conclusion also is in good agreement with empirical observations [113].

Dynamic Preference Intransitivity
Transitivity is of central importance to both psychology and economics. It is the cornerstone of normative and descriptive decision theories [9]. Individuals, however, are not perfectly consistent in their choices. When faced with repeated choices between alternatives A and B, people often choose in some instances A and B in others. Such inconsistencies are observed even in the absence of systematic changes in the decision maker's taste, which might be due to learning or sequential effects. The observed inconsistencies of this type reflect inherent variability or momentary fluctuation in the evaluative process. Then preference should be defined in a probabilistic fashion [80]. Nevertheless, there occur several choice situations where time transitivity may be violated even in a probabilistic form. Then one says that there is dynamic preference intransitivity [114][115][116][117][118][119][120].
The occurrence of preference intransitivity depends on the considered decision model and on the accepted definition of transitivity. But the general meaning is as follows. Suppose one evaluates three alternatives A, B, C, considering them in turn by pairs. One compares A and B, and, according to the selected definition of preference, concludes that A is preferred over B, which can be denoted as A > B. Then one compares B and C, finding that B > C. Finally, comparing C and A, one discovers that C > A. This results in the preference loop A > B > C > A signifying the intransitivity effect.
As a simple illustration of the intransitivity effect, we can adduce the Fishburn [116] example. Imagine that a person is about to change jobs. When selecting a job, the person evaluates the suggested salary and the prestige of the position. There are three choices: job A, with the salary 65,000$, but low prestige; job B, with the salary 58,000$ and medium prestige; and job C, with the salary 50,000$ and high prestige. The person chooses A over B because of the better salary, and with a small difference in prestige, B over C because of the same reason; and comparing C and A, the person prefers C because of the higher prestige, although a lower salary. Thus, one comes to the preference loop A > B > C > A.
Let us show how this problem can be resolved in QDT. Recall that the definition of the behavioral probability (72) is contextual, in the sense that it depends on the initial conditions for the decision state and on the given time. This means that the comparison of each pair of alternatives constitutes a separate contextual choice, even if the external conditions remain unchanged. The utility factors can be calculated as described in Section 8. Thus, considering the pair A and B in the Fishburn example, we have the utility factors f 1 (A) = 0.528 , where the label marks the moment of time t 1 and an initial conditionρ 1 (t 1 ) for the decision state. Due to the close prestige of the both jobs, their attraction factors coincide, which, as follows from the alternation law in Section 6, gives q 1 (A) = q 1 (B) = 0 .
Since p 1 (A) > p 1 (B), the job A at time t 1 is stochastically preferred over B.
Similarly, comparing the jobs B and C at time t 2 , we get the utility factors f 2 (B) = 0.537 , f 2 (C) = 0.463 , and the attraction factors q 2 (B) = q 2 (C) = 0 .
Hence, the probabilities are p 2 (B) = 0.537 , p 2 (C) = 0.463 , which implies that B at time t 2 is stochastically preferred over C.
In the comparison of the jobs C and A at time t 3 , we find the utility factors Then, at time t 3 , the job C is stochastically preferred over A. However, we should not forget that these comparisons were accomplished at different moments of time and under different initial conditions. Therefore, there is nothing extraordinary such that differently defined probabilities can be intransitive. Even more, there are arguments [121] that such intransitivities can be advantageous for alive beings in the presence of irreducible noise during neural information processing.
The arising preference loops can be broken in two ways. First, the process of comparison requires time during which there can appear additional information. The attraction factors, as is explained in Sections 9 and 10, vary with time, diminishing as time increases. When q 3 (C) and q 3 (A) tend to zero then p 3 (C) −→ f 3 (C) = 0.435 , Since now p 3 (C) < p 3 (A), A becomes preferred over C; hence, the preference loop is broken. The other very natural way is as follows. As soon as there appears a preference loop, this implies that decisions at different moments of time and under different contexts should not be compared. One has to reconsider the whole problem at one given moment of time t. Considering all three alternatives A, B, C in the frame of one given choice, we have

Conclusions
The basic ideas of quantum decision theory are presented, with the emphasis on the problems of time evolution of decision processes. The relations between operationally testable events and behavioral features of making decisions are elucidated. The interplay of rational and irrational sides of decision making is explained. The approach to describing quantum intelligence networks is developed. As illustrations, several time inconsistencies are analyzed.
The main points of quantum decision theory are based on the techniques used in the theory of quantum measurements. Therefore, the presented approach can be employed for characterizing evolutional processes in quantum measurements. The behavior of many self-organizing complex systems is similar to decision making [122], because of which the approach could be useful in applications to complex systems and to the problems of artificial intelligence.
Funding: This research received no external funding.