A wide consensus has emerged in recent years that successful policymaking and programming in conflict situations must start with an accurate understanding of local context, conflict actors, causes, and the dynamic relationships among them.
...[A]ccurate and stable neural networks exist, however, modern algorithms do not compute them.
1. Introduction
The Weber–Fechner law [3], which states that sensation perception is proportional to the logarithm of sensation energy, has been found to hold approximately across a surprising range of modalities in human enterprise [4,5]:
Weight perception.
Sound intensity perception.
Brightness perception.
Numerical cognition.
Dose–response chemoreception.
Public finance in mature democracies.
Emotional intensity.
The usual mathematical form of the Weber–Fechner (WF) law is that the strength of the perceived ‘sensation’ P varies with the ‘energy’ of the sensory impulse S according to

$$P = k \log\!\left(\frac{S}{S_0}\right), \qquad\qquad (1)$$

where k is an appropriate constant and $S_0$ a characteristic minimum detection level.
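A minimal numerical illustration of Equation (1), in the same MAPLE idiom as the Mathematical Appendix, with purely illustrative values k = 1 and $S_0$ = 1: each tenfold increase in stimulus energy adds only a fixed increment to perceived sensation.

P := (S, k, S0) -> k*ln(S/S0):        # Equation (1), natural-log form; k, S0 illustrative
evalf(P(10.0, 1, 1));                 # ~2.30
evalf(P(100.0, 1, 1));                # ~4.61: equal stimulus ratios give equal perceptual increments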
The WF law is most often applied to ‘local’ psychophysics, for which results can be found with only 50 percent accuracy. So-called ‘global’ psychophysical observations, made to higher precision, are often subsumed under the Stevens law [6]:

$$P \propto I^{a},$$

where I is an intensity measure and a an appropriate exponent. As Mackey [7] shows, however, the Stevens law can, in a sense, be derived from the WF law.
‘Sensory’ or ‘intelligence’ data compression are essential for cognitive systems that must act in real-world environments of fog, friction, and adversarial intent. Without some form of data-sorting, winnowing, or other form of compression, one is simply overwhelmed by unredacted sensation and left paralyzed. This is, indeed, a central purpose of misinformation in organized conflict.
From another perspective, the widely observed Hick–Hyman law [8] states that the response time of a cognitive agent to an incoming sensory or other data stream will be proportional to the Shannon uncertainty of that stream. That is, more complex information takes longer to process, in direct proportion to that complexity, as measured by the Shannon uncertainty expression

$$H = -\sum_{i} p_i \log(p_i),$$

where the $p_i$ are the (perceived) probabilities of the different possible options.
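A brief MAPLE sketch of the Shannon uncertainty that the Hick–Hyman law takes as its complexity measure; the probability lists are illustrative only.

H := p -> -add(q*log[2](q), q in p):   # Shannon uncertainty in bits
evalf(H([1/2, 1/2]));                  # 1 bit: two equally likely options
evalf(H([1/4, 1/4, 1/4, 1/4]));        # 2 bits: twice the uncertainty, hence proportionally longer response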
Likewise, Pieron’s law finds response time given as [5]

$$T \propto I^{-b},$$

where I is the intensity of the sensory signal and b a positive constant.
Reina et al. [5] characterize these matters as follows:
Psychophysics, introduced in the nineteenth century by Fechner, studies the relationship between stimulus intensity and its perception in the human brain. This relationship has been explained through a set of psychophysical laws that hold in a wide spectrum of sensory domains, such as sound loudness, musical pitch, image brightness, time duration, vibrotactile frequency, weight, and numerosity. More recently, numerous studies have shown that a wide range of organisms at various levels of complexity obey these laws. For instance, Weber’s law, which Fechner named after his mentor Weber, holds in humans as well as in other mammals, fish, birds, and insects. Surprisingly, also organisms without a brain can display such behaviour, for instance slime moulds and other unicellular organisms.
...[F]or the first time, we show that superorganismal behaviour, such as honeybee nest-site selection, may obey the same psychophysical laws displayed by humans in sensory discriminatory tasks.
Superorganismal cognitive behavior in organized conflict, however, has long been observed to obey something qualitatively similar to these psychophysics laws. Examples abound, e.g., during the Japanese attack on Pearl Harbor [9]:
On the morning of 7 December 1941, the SCR-270 radar at the Opana Radar Site on northern Oahu detected a large number of aircraft approaching from the north. This information was conveyed to Fort Shafter’s Intercept Center. The report was dismissed by Lieutenant Kermit Tyler who assumed that it was a scheduled flight of aircraft from the continental United States. The radar had in fact detected the first wave of Japanese Navy aircraft about to launch the attack on Pearl Harbor.
US radar subsequently—and accurately—traced Japanese aircraft in their post-attack return flights to their carriers.
Here, essential data—complex, and, at the time, highly novel, radar observations—were compressed to what in probability theory would be characterized as a set of measure zero. The result might well be described as a form of ‘hallucination’, the arbitrary assignment of meaning to some (or to the absence of) signal.
Half a year later, however, the US Navy’s institutional cognition had considerably progressed. Wirtz [10], in a prize-winning essay, writes:
The story of how Naval intelligence paved the way for victory at Midway is embedded in the culture of the U.S. Navy, but the impact of this narrative extends far beyond the service. Today, scholars use the events leading up to Midway to define intelligence success—an example of a specific event prediction that was accurate, timely, and actionable, creating the basis for an effective counterambush of the Imperial Japanese Navy. Yet an important element of the Midway intelligence story has been overlooked over the years: Those who received intelligence estimates understood the analysis and warnings issued and acted effectively on them.
Wirtz [10] emphasizes, for this success, the tight bonding of the newly enhanced intelligence component, under Lieutenant Commander Edwin Layton, with the overall Pacific Command, under Admiral Chester W. Nimitz. This bonding represented, in a formal sense, an enhanced coupling channel capacity.
As Wirtz [10] put the matter, “...[O]nly nine days elapsed between Layton’s forecast of coming events and the detection of Japanese carriers northwest of Midway”.
This was barely sufficient time for Nimitz to assemble the needed countermeasures. The rest, as they say, is history.
In short, a sufficiently large channel capacity linking intelligence with command obviated both Pearl Harbor-style institutional hallucination and panic.
There is yet another longstanding ‘psychophysics’ context to such matters.
With regard to the dynamics of hallucination and panic in embodied cognitive agents, the Yerkes–Dodson effect—studied as early as 1905 [11,12,13]—states that, for a simple task, ‘performance’ varies as an S-shaped curve with increasing ‘arousal’, while for a difficult task, the pattern is an inverse-U (see Figure 1).
Here, we will show that it is possible to derive versions of Figure 1 from the Weber–Fechner, Stevens, Hick–Hyman, and Pieron laws (and analogous de facto sensory compression modes), using probability models based on the asymptotic limit theorems of information and control theories. These are interpreted through the lenses of first- and second-order models abducted from statistical physics and the Onsager approximation to nonequilibrium thermodynamics. While (at least to the author) surprisingly straightforward, this effort is not without subtlety.
With some further—and necessarily considerable—investment of resources, the probability models explored here can be developed into robust statistical tools for application to a broad spectrum of embodied cognitive phenomena at and across various scales and levels of organization. Such tools would be particularly useful in systems where it is necessary to condense incoming ‘sensory’ and/or ‘intelligence’ information in real time. Possible applications to embodied entities and processes range from the cellular through the neural, individual, machine, institutional, and their many possible composites.
As Wirtz [10] implies, those familiar with the planning or direction of organized conflict are sadly familiar with the general dynamics studied here.
We begin with some necessary methodological boilerplate.
2. Rate Distortion Control Theory
French et al. [14] modeled the Yerkes–Dodson effect using a neural network formalism, interpreting the underlying mechanism in terms of a necessary form of real-time data compression that eventually fails as task complexity increases. Data compression implies matters central to information theory, for which there is a formal context, i.e., the Rate Distortion Theorem [15].
Information theory has three classic asymptotic limit theorems [15,16]:
The Shannon Coding Theorem. For a stationary transmission channel, a message recoded as ‘typical’ with respect to the probabilities of that channel can be transmitted without error at a rate C characteristic of the channel, i.e., its capacity. Vice versa, it is possible to argue for a tuning theorem variant in which a transmitting channel is tuned to be made typical with respect to the message being sent, so that, formally, the channel is ‘transmitted by the message’ at an appropriate dual-channel capacity.
The Shannon–McMillan or Source Coding Theorem. Messages transmitted by an information source along a stationary channel can be divided into sets: a very small one congruent with a characteristic grammar and syntax, and a much larger one of vanishingly small probability not so congruent. For stationary (in time), ergodic sources, where long-time averages are cross-sectional averages, the splitting criterion dividing the two sets is given by the classic Shannon uncertainty. For nonergodic sources, which are likely to predominate in biological and ecological circumstances, matters require a ‘splitting criterion’ to be associated with each individual high-probability message [16].
The Rate Distortion Theorem (RDT). This involves message transmission under conditions of noise for a given information channel. There will be, for that channel, assuming a particular scalar measure of average distortion, D, between what is sent and what is received, a minimum necessary channel capacity R(D). The theorem asks what is the ‘best’ channel for transmission of a message with the least possible average distortion. R(D) can be defined for nonergodic information sources via a limit argument based on the ergodic decomposition of a nonergodic source into a ‘sum’ of ergodic sources.
The RDT can be reconfigured in an inherently embodied control theory context if we envision a system’s topological information from the Data Rate Theorem (DRT) as ‘simply’ another form of noise, adding to the average distortion D (see Figure 2). The punctuation implied by the DRT [17] emerges from this model if there is a critical maximum average distortion that characterizes the system. Other systems may degrade more gracefully, or, as we show below, have even more complicated patterns of punctuation that, in effect, generalize the Data Rate Theorem.
We will expand perspectives on the dynamics of cognition/regulation dyads across rates of arousal, across the possibly manifold set of basic underlying probability distributions that characterize such dyads at different scales, levels of organization, and indeed, across various intrinsic patterns of arousal.
We reiterate that inherent to Figure 2 is a fundamental ‘real-time’ embodiment, directly instantiated by the Comparison and Control Channel feedback loop and by the definition of the average distortion D.
3. Scalarizing Essential Resource Rates
The RDT holds that, under a given scalar measure of average distortion D between the sequence of signals that has been sent and the sequence that has actually been received—measuring the difference between what was ordered and what was observed—there is a minimum necessary channel capacity R(D). That capacity is determined by the rate at which a set of essential resources, indexed by some scalar measure Z, can be provided to the system ‘sending the message’ in the presence of noise and an opposing rate of topological information characteristic of the inherent instability of the control system under study.
The Rate Distortion Function (RDF) R(D) is necessarily convex in D, so that $d^2R(D)/dD^2 \geq 0$ [15]. For a nonergodic process, in which the cross-sectional mean is not the same as the time-series mean, the RDF can still be defined as the average across the RDFs of the ergodic components of that process, and is thus convex in D.
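A standard concrete instance, offered only as an illustration of this convexity and not as the RDF of the systems modeled here: for a Gaussian source of variance s² under squared-error distortion, R(D) = (1/2) ln(s²/D) nats for 0 < D ≤ s². A quick MAPLE check of monotonicity and convexity:

R := (D, s) -> (1/2)*ln(s^2/D):       # Gaussian-source RDF, valid for 0 < D <= s^2
simplify(diff(R(D, s), D));           # -1/(2*D): decreasing in D
simplify(diff(R(D, s), D, D));        # 1/(2*D^2): convex in D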
The relations between the minimum necessary channel capacity R(D) and the rates of any essential resources will usually be quite subtle.
A ‘simple’ scalar resource rate index—say Z—which we adopt as a first approximation, must be composed of a minimum of three interacting components:
The rate at which subcomponents of the system of interest can communicate with each other, defined by an internal channel capacity.
The rate at which ‘sensory’ information is available from the embedding environment, associated with a sensory channel capacity.
The rate at which ‘metabolic’ or other free-energy ‘materiel’ resources can be provided to a subsystem of a full entity, organism, institution, machine, and so on.
These rates must be compounded into an appropriate scalar measure Z. Most simply, this might be taken as their product, the sum of their logarithms, or some other composite.
Following [13], ‘Z’ will most likely be a 3-by-3 matrix, including necessary interaction cross-terms. An n-dimensional matrix $\hat{Z}$ has n scalar invariants $r_1, \ldots, r_n$ under appropriate transformations, determined by the characteristic equation

$$\det(\hat{Z} - \lambda I_n) = \lambda^{n} - r_1\lambda^{n-1} + r_2\lambda^{n-2} - \cdots \pm r_n = 0,$$

where $I_n$ is the n-dimensional identity matrix, det is the determinant, and $\lambda$ is a real-valued parameter. $r_1$ is the matrix trace and $r_n$ the matrix determinant. Given these n scalar invariants, it will often be possible to construct a scalar index $Z = Z(r_1, \ldots, r_n)$ as their scalar function, analogous to the principal component analysis of a correlation matrix.
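A MAPLE sketch of the invariant construction, using a purely illustrative symmetric 3 x 3 resource-rate matrix; the entries are placeholders, not empirical values.

with(LinearAlgebra):
Zhat := Matrix([[1, 1/5, 1/10], [1/5, 2, 1/4], [1/10, 1/4, 3]]):   # illustrative cross-terms
CharacteristicPolynomial(Zhat, lambda);   # coefficients carry the n scalar invariants
Trace(Zhat), Determinant(Zhat);           # r1 = trace, r_n = determinant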
Wallace [18] provides an example in which two such indices are minimally necessary, a matter requiring sophisticated Lie Group methods.
4. The Fundamental Model
Feynman [19] and Bennett [20] argue that information is a form of free energy, not an ‘entropy’, in spite of information theory’s Shannon uncertainty taking the same mathematical form as entropy for a simple—indeed, simplistic—ergodic system. Feynman [19] illustrates this equivalence using a simple ideal machine to convert information from a message into useful work.
We next apply something much like the standard formalism of statistical mechanics—given a proper definition of ‘temperature’—for a cognitive, as opposed to a ‘simply’ physical, system.
We consider the full ensemble of high-probability developmental trajectories available to the system, writing these as $x_j$, $j = 1, 2, \ldots$. Each trajectory is associated with a Rate Distortion Function-defined minimum necessary channel capacity $R_j$ for a particular maximum average scalar distortion $D_j$. Then, assuming some basic underlying probability model having the distribution $\rho(R, c)$, where c is a parameter set, we can define a pseudoprobability for a meaningful ‘message’ $x_j$ sent into the system of Figure 2 as

$$P_j = \frac{\rho(R_j, g(Z))}{\int \rho(R, g(Z))\, dR},$$

since the normalized ratio is a probability density that integrates to unit value. Again, $x_j$ is a particular trajectory, while the (possibly generalized) integral is over all possible ‘high probability’ paths available to the system.
We implicitly impose the Shannon–McMillan Source Coding Theorem, so that the overall set of possible system trajectories can be divided into two distinct equivalence classes defining a ‘fundamental groupoid’ whose ‘symmetry-breaking’ imposes the essential two-fold structure. These are, first, a very large set of measure zero—of vanishingly low probability—that is not consistent with the underlying grammar and syntax of some basic information source, and a much smaller consistent set [16].
Again, $R_j$ is the RDT channel capacity keeping the average distortion less than a given limit $D_j$ for message $x_j$, and g(Z) is a yet-to-be-determined temperature analog depending on the scalar resource rate Z.
The distribution $\rho$, for physical systems, is usually taken as the Boltzmann distribution, $\rho(R, g(Z)) = \exp[-R/g(Z)]$. We suggest that, for cognitive phenomena—from the living and institutional to the machine and composite—it is necessary to move beyond analogs with physical systems. That is, it becomes necessary to explore the influence of a variety of different probability distributions, including those with ‘fat tails’ [21,22], on the dynamics of cognition/regulation stability.
The standard methodology from statistical physics [23] identifies the denominator of Equation (3) as a partition function. This allows for the definition of an iterated free-energy analog F as

$$\exp[-F/g(Z)] = \int \rho(R, g(Z))\, dR.$$
Once again, adapting a standard argument, now from chemical physics [24], we can define a cognition rate for the system as a reaction rate analog:

$$L = \frac{\int_{R_0}^{\infty} \rho(R, g(Z))\, dR}{\int_{0}^{\infty} \rho(R, g(Z))\, dR},$$

where $R_0$ is the minimum channel capacity needed to keep the average distortion below a critical value for the full system of Figure 2.
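A hedged MAPLE sketch of this reaction-rate reading under the Boltzmann assumption of the sections below; the trigger level R0 and temperature analog g are left symbolic, and the Boltzmann factor is an assumption, not the only possibility.

assume(g > 0, R0 > 0):
rho := R -> exp(-R/g):                                             # assumed Boltzmann factor
L := int(rho(R), R = R0 .. infinity) / int(rho(R), R = 0 .. infinity);
# returns exp(-R0/g): the familiar Arrhenius/reaction-rate form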
The basic underlying probability model of Equation (6)—via Equation (8)—determines system dynamics, but not system structure, and cannot be associated with a particular underlying network form, although they are related [25,26]. More specifically, a set of distinctly different networks may all be mapped onto any single given dynamic behavior pattern; and, indeed, vice versa, the same static network may display a spectrum of behaviors [27]. We focus on dynamics rather than network topology.
Abducting a canonical first-order approximation from nonequilibrium thermodynamics [28], we can now define a ‘real’ entropy—as opposed to a ‘Shannon uncertainty’ that is basically Feynman’s [19] free energy—from the iterated free energy F by taking the standard Legendre Transform [23], so that

$$S \equiv -F(Z) + Z\,\frac{dF(Z)}{dZ}.$$

The standard first-order Onsager approximation from nonequilibrium thermodynamics is then [28]

$$\frac{\partial Z}{\partial t} \approx \frac{dS}{dZ},$$

where the scalar diffusion coefficient has been set equal to one. A second-order model follows analogously.
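A one-line MAPLE check of the Legendre-transform bookkeeping: with the entropy analog S = -F(Z) + Z F'(Z), the gradient dS/dZ reduces to Z F''(Z), the ‘force’ appearing in the first-order Onsager approximation.

S := -F(Z) + Z*diff(F(Z), Z):
simplify(diff(S, Z));          # Z*diff(F(Z), Z, Z): the -F' and +F' terms cancel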
5. Two Probability Distributions
We consider rates of cognition for two different underlying probability models, according to their underlying ‘hazard rates’
, defined as
We choose hazard rates . The resulting integral equations lead to probability distributions . The first is simply the Boltzmann distribution and the second is ‘fat-tailed’, without finite mean or variance.
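A hedged MAPLE sketch of how a hazard rate determines its distribution; the specific hazard choices below (one constant, one declining) are illustrative stand-ins, not necessarily those used for the figures.

# Density from a hazard rate h(R): rho(R) = h(R)*exp(-int(h(r), r = 0..R)).
h1 := r -> 1/g:                                                   # constant hazard rate
rho1 := h1(R)*exp(-int(h1(r), r = 0 .. R));                       # exp(-R/g)/g: Boltzmann/exponential form
h2 := r -> a/(1 + r):                                             # declining hazard rate
rho2 := simplify(h2(R)*exp(-int(h2(r), r = 0 .. R)), symbolic);   # a*(1+R)^(-a-1): fat-tailed for small a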
For the Boltzmann distribution, the ‘temperature’ and cognition rate relations from Equations (7) and (8) are

$$g(Z) = -\frac{F(Z)}{W(n, -F(Z))}, \qquad L = \exp[-R_0/g(Z)],$$

where $W(n, x)$ is the Lambert W-function of order n that satisfies the relation

$$W(n, x)\exp[W(n, x)] = x.$$

It is real-valued only for $n = 0, -1$ and $x \geq -\exp(-1)$. This condition ensures the existence of Fisher zero phase transitions in the system [29,30,31].
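A quick MAPLE check of the branch structure invoked here: the defining relation of the Lambert W-function, and the loss of real-valuedness for the 0 and -1 branches once the argument falls below -exp(-1); the numerical arguments are illustrative.

evalf(LambertW(2.0)*exp(LambertW(2.0)));   # returns 2.0: W(x)*exp(W(x)) = x
evalf(LambertW(-1, -0.2));                 # real: -0.2 lies in [-exp(-1), 0)
evalf(LambertW(0, -0.5));                  # complex: -0.5 < -exp(-1), the Fisher-zero regime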
Analogous expressions follow for the fat-tailed distribution. Here, there is also a necessary condition for a real-valued temperature analog, leading again to the possibility of Fisher zero phase transition.
6. Weber–Fechner Implies Yerkes–Dodson
In addition to the noise inherent to the RDT approach, suppose that there is a ‘noise’ representing the difficulty of the problem addressed, which is what Figure 1 identifies as ‘impairment of divided attention, working memory, decision-making and multitasking’. That ‘noise’ is incorporated via the second term in the stochastic differential equations

$$dZ_t = f(Z_t)\, dt + \sigma Z_t\, dB_t,$$

where f is the first- or second-order drift, $\sigma$ is the magnitude of the difficulty noise, and $dB_t$ is ordinary Brownian noise. We next replace the drift terms $f(Z_t)$ with their full expressions in the respective free-energy constructs F, to first and second order.
It is here that the Weber–Fechner law forecloses possibilities. For Equation (1), we set $Y_t = \log(Z_t)$ and assume that perception must be stabilized under the Ito Chain Rule [32,33]. That is, we require that

$$\langle dY_t \rangle = 0$$

for both first- and second-order models.
Surprisingly, the first- and second-order expressions for F are ‘easily’ found, calculated, for example, using the elementary MAPLE computer algebra programs given by Cyganowski et al. [33] for the Ito Chain Rule of an SDE (see the mathematical appendix). Both expressions have potentially dominant terms in the noise magnitude $\sigma$ that ensure the punctuated emergence of imaginary-valued cognition rates via the algebraic forms of the temperature analogs g(Z).
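A hedged MAPLE sketch of the mechanism, not of Equation (16) itself: the Ito drift of the WF compression Y = ln(Z) under the assumed ‘ordinary volatility’ SDE dZ = f(Z) dt + sigma*Z dB_t picks up a -sigma^2/2 term, which is where the difficulty noise enters the steady-state condition. The drift f is a generic placeholder.

Y := ln(Z):
driftY := f(Z)*diff(Y, Z) + (1/2)*(sigma*Z)^2*diff(Y, Z, Z):   # Ito drift; Y has no explicit t-dependence
simplify(driftY);                                              # f(Z)/Z - (1/2)*sigma^2
# setting <dY_t> = 0 forces f(Z)/Z = (1/2)*sigma^2, tying the free-energy
# construct F to sigma once the Onsager drift is substituted for f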
In what follows, particular boundary conditions are imposed on the first- and second-order expressions for F. In the expressions for cognition rate L, we fix the trigger level $R_0$ and calculate cognition rates vs. Z and the noise magnitude $\sigma$ for both distributions and both orders of approximation. The results are shown in Figure 3. The top two panels are, respectively, from left to right, first and second order for the Boltzmann distribution; the bottom two, again left to right, first and second order for the fat-tailed distribution.
Although there are detailed differences between orders and distribution forms—the Boltzmann with finite mean and variance, and the other fat-tailed and without—the general pattern is that increasing ‘noise’ $\sigma$ transforms the cognition rate from the simple-task pattern of Figure 1 into the inverse-U of the Yerkes–Dodson relation. Note that sufficient ‘noise’ $\sigma$ fully collapses all systems. A subsequent section explores this dynamic in deeper detail, focusing on the character of the assumed underlying probability distribution.
In more detail, Figure 4 takes two-dimensional cross-sections across Figure 3a, fixing the noise magnitude $\sigma$ at two values, letting Z vary as ‘arousal’, and dividing the cognition rate into real- and imaginary-valued components. At low arousal, both systems suffer a nonzero imaginary-valued ‘hallucination’ mode. For the higher-noise case, the intermediate zone, with a zero imaginary-valued component, follows the classic inverse-U of the Yerkes–Dodson effect, and imaginary-valued ‘panic’ emerges at high Z. The lower-noise system corresponds to the ‘easy’ problem of the Yerkes–Dodson effect.
7. Stevens Implies Yerkes–Dodson
If, instead of WF data compression according to $Y = \log(Z)$, we impose a version of Stevens law compression [7,34,35], a power law in Z with exponent indexed by n, then ‘elementary’ calculation based on the Ito Chain Rule finds the corresponding first- and second-order expressions for F.
For large enough n, the results are similar to those following from Equation (16), producing, in Figure 5, close analogs to Figure 3 and Figure 4, given appropriate boundary conditions. In particular, the boundary condition for the term in Z is again taken as −1, and the constant boundary condition as 3, with the other two again set equal to one.
8. Hick–Hyman Implies Yerkes–Dodson
Recall the Hick–Hyman law’s [8] assertion that individual-level response time to a multimodal task challenge increases with the Shannon uncertainty across the modes of that challenge. For what we perform here, defining that uncertainty in terms of appropriate information and ‘materiel’ streams, the rate of response will be determined as inversely proportional to it. In the formalism of this study, we must now examine the corresponding nonequilibrium steady-state condition $\langle dY_t \rangle = 0$, leading to analogous first- and second-order expressions for F. These relations again produce analogs to Figure 3 and Figure 5 in first and second order across the Boltzmann and fat-tailed distributions.
9. Pieron Implies Yerkes–Dodson
The Pieron law [5] states that system response time will vary with the intensity I of the sensory input signal as $T \propto I^{-b}$. Calculation finds, for the first- and second-order Onsager approximations, the corresponding expressions for F. Given the same boundary conditions as above, one again obtains, for the Boltzmann and fat-tailed distributions, close analogs to Figure 3 and Figure 4.
10. Other Compression Schemes
Suppose that we are constructing a cognitive embodied machine, institution, or composite entity, and are able to choose a particular data compression scheme under some given environmental or other selection pressure, according to the first- and second-order ‘ordinary volatility’ schemes of Equation (15). Then, in the first and second orders, the Ito Chain Rule calculation gives the corresponding nonequilibrium steady-state constraints. For appropriate boundary conditions, in all cases, the system dynamics can collapse to the ‘easy’ Y-D problem. Details are left as an exercise.
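A hedged MAPLE sketch for a generic compression scheme Y = phi(Z) under the same assumed ‘ordinary volatility’ SDE; phi and f are placeholders, and the resulting drift is what the nonequilibrium steady-state condition sets to zero.

driftY := f(Z)*diff(phi(Z), Z) + (1/2)*(sigma*Z)^2*diff(phi(Z), Z, Z);
# <dY_t> = 0 requires f(Z)*phi'(Z) = -(1/2)*sigma^2*Z^2*phi''(Z):
# logarithmic, power-law, and entropy-like choices of phi impose different constraints on F.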
A mathematically sophisticated reader might derive general results across sufficiently draconian compression schemes using methods similar to those of Appleby et al. [36]. Indeed, Equations (16)–(20) can probably be used to bracket most reasonable approaches.
French et al. [14] were right: data compression provides a possible route to the Yerkes–Dodson effect.
11. The Ductile-Brittle Transition
Figure 3, Figure 4 and Figure 5, as constructed, display disconcerting patterns of instability at low and, under significant noise $\sigma$, at high levels of arousal Z. These patterns are, in a sense, ‘artifacts’ of the boundary conditions imposed in the definitions of the free-energy constructs F. Such ‘artifacts’ haunted materials science and engineering until the latter part of the 20th century, when a broad range of substances, from glasses to ceramics and metals, were understood to undergo sharp temperature-dependent phase transitions from ductile to brittle under sudden stress.
Many readers will be familiar with the standard laboratory demonstration of freezing a normally ductile substance in liquid nitrogen, and then easily shattering it with a light blow. Cognitive systems subject to the Yerkes–Dodson effect, it seems, are likewise subject to kinds of brittleness that can, perhaps, be more formally characterized.
Recall the first expression of Equation (16) for F under the Weber–Fechner law, and Figure 4a,b, which display the ‘easy’ and ‘difficult’ modes of the Yerkes–Dodson effect under particular boundary conditions. Figure 6 displays the analog to Figure 4 for which the noise settings are again the same, but the boundary conditions have been altered. Hallucination has disappeared.
Boundary conditions, it seems, together define another ‘temperature’, characterizing the onset of both ‘hallucination’ and ‘panic’ modes. Indeed, for Figure 4b, panic begins at a lower level of arousal, while in Figure 6b, in the absence of hallucination, panic onset is increased to a higher value of Z.
The keys to the matter are found in Equation (16), expressing the form of F under the WF law, and in the expressions for g and L in Equation (13) under the Boltzmann distribution. Both g and L depend on the Lambert W-function of orders 0 and −1. Recall that the natures of both g and L are determined by $W(n, -F)$, which is real-valued only for $-F \geq -\exp(-1)$. Some elementary algebra, based on the first form of Equation (16), then finds the Z-values determining the limits and onset of hallucination and panic.
Further, solving the limiting relation for $\sigma$ gives the maximum tolerable level of ‘noise’.
These relations can ‘easily’ be extended to various orders of F and to different underlying probability distributions, for example, the fat-tailed distribution leading to the expressions of Equation (14).
12. The Tyranny of Time
An extension of the formalism to a multiple-subsystem structure involves time as well as resource constraints. That is, not only are the individual components of the composite rate index Z constrained, but the time available for effective action is limited.
This can be addressed in first order as a classic Lagrangian optimization under environmental shadow price constraints abducted from economic theory [37,38]. Here, time and other resource limits are assumed to very strongly dominate system dynamics.
The aim is to maximize the total cognition rate across a full system of n subcomponents. Each subcomponent has an associated cognition rate $L_i$ and is allotted resources at the rate $Z_i$ for a time $t_i$. A Lagrangian optimization can then be conducted under the joint resource-rate and time constraints. To reiterate, from economic theory, the resulting Lagrange multipliers are shadow prices imposed by environmental externalities, including fog, friction, and usually skilled adversarial intent.
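A hedged, deliberately minimal MAPLE illustration of the shadow-price idea: two subcomponents, a single aggregate resource-rate constraint, and illustrative concave rates a1*ln(1+Z1), a2*ln(1+Z2) standing in for the actual cognition rates.

Lagr := a1*ln(1 + Z1) + a2*ln(1 + Z2) - lambda*(Z1 + Z2 - Ztot):
solve({diff(Lagr, Z1) = 0, diff(Lagr, Z2) = 0, diff(Lagr, lambda) = 0}, {Z1, Z2, lambda});
# lambda at the optimum is the shadow price: the marginal gain in total
# cognition rate per unit of additional aggregate resource rate Ztot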
Taking the ‘ordinary volatility’ form as the volatility function, the fundamental stochastic differential equation under draconian constraints of time and resource rates follows directly. If the data compression function is the Weber–Fechner logarithm, and so on, then application of the Ito Chain Rule to determine the nonequilibrium steady-state condition $\langle dY_t \rangle = 0$ gives, suppressing indices, the corresponding constraints on each subcomponent.
Typically, the allotted resource rates decline as ‘noise’ increases or as the environmental shadow price ratio declines. For the three forms of data compression considered above, given just the above and under ordinary volatility, the corresponding limiting expressions follow directly.
Declining environmental shadow price ratio is thus synergistic with rising ‘noise’ to drive essential resource rates below critical values for important subcomponents across a variety of data compression schemes.
13. The Tyranny of Adaptation
Another necessary extension is, however, much less direct, even in first order.
Wirtz [10] presents the matter thusly:
Standard operating procedures that continue for years or even decades give opponents the time they need to devise innovative tactics, technologies, and stratagems that are hard to detect, analyze, and forecast. Small changes can delay opponents’ schemes, while larger changes can invalidate their plans altogether. Routine can be exploited. It is possible for planners and commanders to minimize the challenge facing intelligence by providing opponents with a new problem to plan against before they devise a way to solve the old one. And rest assured, they are working on finding ways to sidestep current force postures.
That is, while institutional cognition rates are most often perceived by combatants/participants on tactical and operational levels and over relatively short time frames, strategic enterprise takes place over Darwinian and Lamarckian evolutionary spatial, social, and time scales. Wallace ([39], Chs. 1 and 9) explores the dynamics of such an evolutionary process in terms of ‘punctuated equilibrium’ phase transitions analogous to the Fisher zero phase changes studied here. Extension and application of the cognitive dynamics models studied here to evolutionary time scales, however, remains to be conducted.
14. The Compression/Sensitivity Explosion
We have, thus far, explored systems for which the underlying fundamental distribution was characterized by either a fixed or declining hazard rate function—Equation (12)—under Weber–Fechner or Stevens data compression schemes, to first and second orders. It is conceivable, perhaps even to be expected, however, that some systems may need—or be driven—to increase the rate of signal detection with rising system burden, so that, from Equation (12), the hazard rate rises with R. This diktat generates an extended version of the Rayleigh Distribution.
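A hedged MAPLE sketch of the simplest such case: a hazard rate rising linearly in R yields the Rayleigh density (the ‘extended’ version referred to here carries parameters not reproduced in this sketch).

h := R -> R/s^2:                                     # linearly rising hazard rate
rho := simplify(h(R)*exp(-int(h(r), r = 0 .. R)));   # (R/s^2)*exp(-R^2/(2*s^2)): Rayleigh form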
Given a free-energy measure F, calculation again finds the temperature analog and cognition rate in terms of the Lambert W-function W, with the ‘trigger level’ $R_0$ again fixed as above.
Recall, by contrast, the corresponding expressions for the Boltzmann distribution (Equation (13)).
We define F by again imposing Weber–Fechner compression, i.e., stability as $\langle dY_t \rangle = 0$ with $Y = \log(Z)$, according to the first-order relation of Equation (16). Figure 7a shows, for comparison, the Boltzmann-distribution form of the temperature analog, subsequently filtered through the cognition rate relation to produce the cognition rate model of Figure 3a.
Figure 7b shows the corresponding temperature analog from Equation (29), computed over a range of parameter values taken in fixed increments. These results are then fed through the second expression of Equation (29) to give highly complex and counterintuitive variants of Figure 3 and Figure 4 that are left as an exercise.
Requiring, or imposing, rising signal detection rates with increasing signal strength, under conditions of signal compression and ‘noise’, can produce incomprehensibly complicated cognitive dynamics.
Somewhat heuristically, in signal detection systems, sensitivity is often represented by a threshold value chosen above the mean noise level. As sensitivity increases, the system becomes more responsive to smaller changes in signal strength. This can be visualized as lowering the threshold for detection, allowing weaker signals to be recognized. Combining increasing sensitivity with data compression schemes like the Weber–Fechner law creates a system that is highly responsive to small changes in input while also compressing the range of those inputs. This combination can lead to instabilities for several reasons:
Amplification of Noise: As sensitivity increases, the system becomes more susceptible to detecting noise as signals, potentially leading to false positives.
Compression of Dynamic Range: The Weber–Fechner law—like similar schemes—compresses the perceived intensity of stimuli, which can make it difficult for the system to distinguish between important signals and background noise at higher intensities.
Feedback Loops: In complex systems with multiple interconnected modules, increased sensitivity can create feedback loops that amplify small fluctuations, potentially leading to system-wide instabilities that extend far beyond those illustrated in Figure 7.
15. Discussion
One art of science is the ability to infer the general from the particular. The probability models of inherently embodied cognitive systems studied here suggest that, in a surprisingly general manner, compression of sensory/intelligence and necessary internal data streams can drive some form of the Yerkes–Dodson effect’s contrast between ‘easy’ and ‘difficult’ tasks.
This occurs through such mechanisms as the increasing impairment of divided attention, limits on working memory, difficulties in decision-making, and the burdens of multitasking [11,12]. Further, the inverse-U patterns of difficult problems appear routinely bracketed by hallucination at low, and panic at high, arousal, depending critically on the interaction between ‘boundary conditions’ and the basic underlying probability distribution or distributions. A ‘ductile’, as opposed to a ‘brittle’, system, from these perspectives, emerges when boundary conditions have been adjusted to eliminate the hallucination mode. For noisy systems also burdened by the topological information stream of an embedding—and often adversarial—Clausewitzian ‘roadway’, however, panic remains a risk at sufficient arousal.
These general patterns were found across a variety of modeling modes, including different basic underlying probability distributions and two orders of approximation in an Onsager nonequilibrium approach.
Another important art of science, however, is recognizing the severe limitations of mathematical modeling in the study of complex real-world phenomena. As Pielou [40] argues in the context of theoretical ecology, the principal utility of mathematical models is speculation, the raising of questions to be answered by the analysis of observational and experimental data, the only sources of new knowledge.
This being said, similar arguments have often been made regarding the dynamics of organized conflict on Clausewitz landscapes of fog, friction, and deadly adversarial intent (e.g., Refs. [18,39,41,42] and the many classic and classical references therein). One underlying mechanism, then, seems related to sufficient—and usually badly needed—compression scaling of sensory/intelligence information data rates, enabling Maskirovka, spoofing, and related deceptions. Internal data streams are, likewise, often compressed, leading to the synergisms explored in Figure 3, Figure 4 and Figure 5, and suggesting the possibility of some mitigation in Figure 6 and Equations (21) and (22).
The Fisher zero phase transitions implied by Equations (13) and (14), and related distribution models, suggest, however, that inverse-U signal transduction and patterns of hallucination or panic will not be constrained to circumstances of data compression. We have explored only one tree in a very large forest.
In addition—and perhaps centrally—new probability models of poorly understood complex phenomena, if solidly based on appropriate asymptotic limit theorems, can serve as the foundation for building new and robust statistical tools useful in the analysis and modest control of those phenomena.
The development, testing, and validation of such statistical tools, however, is not a project for the timid.
16. Mathematical Appendix: Stochastic Differential Equations
We first recall Einstein’s 1905 [43] analysis of Brownian motion for a large number of particles N:

$$\frac{\partial f(x,t)}{\partial t} = \mu\,\frac{\partial^2 f(x,t)}{\partial x^2}, \qquad f(x,t) = \frac{N}{\sqrt{4\pi\mu t}}\exp\!\left[-\frac{x^2}{4\mu t}\right], \qquad \sqrt{\langle x^2\rangle} = \sqrt{2\mu t},$$

where t is the time, x is a location variate, and $\mu$ is a ‘diffusion coefficient’. The last relation represents the average (root-mean-square) displacement of a particle across the jittering system.
More generally, as a consequence of this insight, it is possible to write, for a Brownian ‘stochastic differential’ $dB_t$ that might be seen as perturbing some base function, the fundamental relation

$$(dB_t)^2 = dt,$$

and on this result hangs a considerable tale.
We follow something of Cyganowski et al. ([33], Section 8.4), who provide related programs in the computer algebra program MAPLE.
We are given a base ‘ordinary’ differential equation

$$\frac{dx_t}{dt} = a(x_t, t),$$

perturbed by a Brownian stochastic variate $dB_t$ that follows Equation (31). The ‘perturbed’ solution $X_t$ solves the stochastic differential equation (SDE)

$$dX_t = a(X_t, t)\, dt + b(X_t, t)\, dB_t.$$
We are then given a function Y that depends on both t and $X_t$. Some calculation—based on Equation (31)—finds that $Y(X_t, t)$ solves the SDE

$$dY(X_t, t) = L^{0}Y\, dt + L^{1}Y\, dB_t,$$

where $L^{0}$ and $L^{1}$ are the operators

$$L^{0} = \frac{\partial}{\partial t} + a\,\frac{\partial}{\partial x} + \frac{1}{2}\,b^{2}\frac{\partial^{2}}{\partial x^{2}}, \qquad L^{1} = b\,\frac{\partial}{\partial x}.$$

This is the famous Ito Chain Rule.
To prove this, as Cyganowski et al. ([33], Section 8.4) show, one expands $Y(X_t, t)$ to second order, using the SDE for $X_t$. Some tedious algebra produces a second-order term in $(dB_t)^2$ that, by Equation (31), is then brought into the ‘$dt$’ part of the expression. This is one source of the inherent strangeness of stochastic differential equations.
We are concerned with ‘nonequilibrium steady states’ averaged across the stochastic jitter and represented as $\langle dY_t \rangle = 0$.
A simple MAPLE computer algebra program for the Ito Chain Rule is given in Cyganowski et al. ([33], p. 238), adapted here as

L0 := proc(X, a, b) local Lzero, U;
Lzero := diff(U(x,t), t) + a*diff(U(x,t), x) +
1/2 * b*b * diff(U(x,t), x, x);
eval(subs(U(x,t) = X, Lzero));
end:
where X is the expression of Y in Equation (31) in terms of x, a is the drift term $a(x,t)$, and b is the volatility term $b(x,t)$ in Equation (31), likewise expressed in terms of the base variable x.
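As a hedged usage example of the L0 procedure above (not drawn from [33]): with the illustrative drift a = mu*x and volatility b = sigma*x of geometric Brownian motion, the Ito drift of Y = ln(x) exhibits the familiar -sigma^2/2 correction.

L0(ln(x), mu*x, sigma*x);
# returns mu - (1/2)*sigma^2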