Information Theory Meets Quantum Chemistry: A Review and Perspective

Zhao, Yilin; Zhao, Dongbo; Rong, Chunying; Liu, Shubin; Ayers, Paul W.

doi:10.3390/e27060644

Open AccessReview

Information Theory Meets Quantum Chemistry: A Review and Perspective

by

Yilin Zhao

¹

,

Dongbo Zhao

²

,

Chunying Rong

³,

Shubin Liu

^4,5,*

and

Paul W. Ayers

^1,*

¹

Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON L8S 4M1, Canada

²

Institute of Biomedical Research, Yunnan University, Kunming 650500, China

³

College of Chemistry and Chemical Engineering, Hunan Normal University, Changsha 410081, China

⁴

Department of Chemistry, University of North Carolina, Chapel Hill, NC 27599-3420, USA

⁵

Research Computing Center, University of North Carolina, Chapel Hill, NC 27599-3420, USA

^*

Authors to whom correspondence should be addressed.

Entropy 2025, 27(6), 644; https://doi.org/10.3390/e27060644

Submission received: 22 May 2025 / Revised: 10 June 2025 / Accepted: 13 June 2025 / Published: 16 June 2025

(This article belongs to the Special Issue The Information-Theoretic Approach in Density Functional Theory and Beyond)

Download

Browse Figures

Review Reports Versions Notes

Abstract

In this survey, we begin with a concise introduction to information theory within Shannon’s framework, focusing on the key concept of Shannon entropy and its related quantities: relative entropy, joint entropy, conditional entropy, and mutual information. We then demonstrate how to apply these information-theoretic tools in quantum chemistry, adopting either classical or quantum formalisms based on the choice of information carrier involved.

Keywords:

quantum chemistry; information theory; classical information theory; quantum information theory; Shannon entropy; von Neumann entropy; relative entropy; joint entropy; conditional entropy; mutual information; information-theoretic approach; orbital entanglement

Graphical Abstract

1. Motivation

Information theory was first established in the 1920s through the works of Harry Nyquist and Ralph Hartley [1,2] and propelled to prominence in the 1940s by Claude Shannon [3]. The concept of information theory, which encompasses the quantification, storage, and communication of information [4,5], is too broad to be simply described. Today, information theory serves as a versatile tool in statistics, natural sciences, machine learning, quantum computing, and numerous other fields.

In molecular electronic structure theory, we routinely encounter sets of non-negative values that sum to unity, corresponding to valid probability distributions. This includes properly normalized electron density distributions, the eigenvalues of the reduced density matrix, the squared modulus of wave function coefficients in an orthonormal basis, and more [6,7]. Information theory can be used to analyze these probability distributions, an approach that has been actively pursued by researchers with remarkable success since the 1970s [8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. Incorporating concepts from information theory into quantum chemistry has provided valuable insights into the nature and behavior of electronic systems.

The two most popular approaches to electronic structure theory are quantum many-body theory [35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50] and density functional theory (DFT) [51,52,53,54]. Integrating these strategies with classical information theory (CIT) leads to what is normally called the information-theoretic approach (ITA) [8,9,10,11,12,13,14,25,26,27,28,29,30,31,32,33,34]; integrating these strategies with quantum information theory (QIT) [15,16,17,18,19,20,21,22,23,24] leads to a different set of tools, with different interpretations and utility. Some key concepts of classical and quantum information theory in quantum chemistry that we will discuss in this article are summarized in Figure 1. Although information theory extends far beyond Shannon’s work, other frameworks such as Rényi and Fisher information have their own application domains [55,56,57,58]. Due to space limitations, our discussion in this paper will be confined to Shannon’s framework, although we note that the Rényi formulation and Tsallis formulations [55,57], in particular, reduce to Shannon’s in appropriate limits and, as such, the following analysis can be generalized to these quantities.

This review concisely introduces some fundamental aspects of information theory and its applications in quantum chemistry. Section 2 discusses the basic concepts of Shannon entropy and its related quantities: relative entropy, joint entropy, conditional entropy, and mutual information. To integrate quantum information theory into quantum chemistry, we introduce the reduced density matrix (RDM) in Section 3, which serves as a powerful tool to simplify the complexity of quantum states while retaining essential information about the state of a subsystem. Section 4 introduces the application of information theory in quantum chemistry; for the classical approach, we convert the reduced density matrix in the position representation to derive the electron density and pair density, which act as information carriers for classical information theory. We also explore the corresponding quantum concepts in Section 5, such as von Neumann entropy and quantum mutual information, which are used to analyze the entanglement in quantum many-body systems.

2. Brief Introduction to Information Theory

2.1. Shannon Entropy

One of the core concepts in information theory is the Shannon entropy, named after Claude Shannon, the founder of information theory [3]. Let X be a discrete random variable with alphabet

X

and probability distribution function

p (x) = Pr {X = x}

for

x \in X

. The Shannon entropy is defined as:

\begin{matrix} H_{(X)} & = - \sum_{x \in X} p (x) log p (x) \\ = E_{p} [log \frac{1}{p (X)}] \end{matrix}

(1)

where the expectation is denoted by

E

. Thus, if

X \sim p (x)

, the expected value of the random variable

g (X)

is written as:

E_{p} [g (X)] = \sum_{x \in X} g (x) p (x)

(2)

The Shannon entropy admits multiple interpretations, one of which states that it provides a mathematical framework to measure the uncertainty of a random event. The term entropy originates from the Greek word trope (meaning change) and was first introduced by Clausius in 1854 in the context of the second law of thermodynamics [59]. Shannon himself explained his rationale for adopting this term in a disarming way:

My greatest concern was what to call it. I thought of calling it ‘information’, but the word was overly used, so I decided to call it ‘uncertainty’. When I discussed it with John von Neumann, he had a better idea. Von Neumann told me, “You should call it entropy, for two reasons. In the first place your uncertainty function has been used in statistical mechanics under that name, so it already has a name. In the second place, and more important, nobody knows what entropy really is, so in a debate you will always have an advantage [60].

2.2. Relative Entropy

To quantify how distinguishable two probability distributions are, by introducing a second probability distribution

q (x) = Pr {X = x}

, where

x \in X

, we now look for a distinguishability measure of those two probability distributions [61].

Analogous to the concept of distance (norm) in Euclidean space, information theory defines the relative entropy (information divergence)

D_{(P | | Q)}

between two probability distributions

p (x)

and

q (x)

. Among the most popular divergence measures include the Bregman divergences [62,63] and f-divergences [64,65,66]. The f-divergence is defined as follows:

D_{f (P | | Q)} = \sum_{x \in X} f (\frac{p (x)}{q (x)}) q (x)

(3)

where

f (x)

is a convex function satisfying

f (1) = 0

. The Bregman divergence is defined as follows:

D_{F (P | | Q)} = F (P) - F (Q) - \sum_{x \in X} \frac{δ F [p (x)]}{δ q (x)} (p (x) - q (x))

(4)

where

F [p]

is a convex functional.

Most measures of relative entropy fit these frameworks, but the Kullback–Leibler (KL) divergence [67] is both an f-divergence and a Bregman divergence. Therefore, the Kullback–Leibler divergence is the most widely used definition of relative entropy, with its expression given by the following:

\begin{matrix} D_{K L (P | | Q)} & = \sum_{x \in X} p (x) log (\frac{\begin{matrix} p (x) \end{matrix}}{\begin{matrix} q (x) \end{matrix}}) \\ = E_{p} [log \frac{p_{(X)}}{q_{(X)}}] \end{matrix}

(5)

Formally, a metric (such as the Euclidean norm) must satisfy four axioms for all distributions P, Q, and R: [68]

Non-negativity: $d (P, Q) \geq 0$
Identity of indiscernibles: $d (P, Q) = 0 \Leftrightarrow P = Q .$
Symmetry: $d (P, Q) = d (Q, P)$
Triangle inequality: $d (P, Q) \leq d (P, R) + d (R, Q)$

Figure 2 provides an intuitive illustration of the non-symmetric nature of the Kullback–Leibler divergence. Additionally, counterexamples exist where

D_{K L (P | | Q)} > D_{K L (P | | R)} + D_{K L (R | | Q)}

(6)

demonstrating that the Kullback–Leibler divergence violates the triangle inequality. Thus, the Kullback–Leibler divergence is considered a premetric but not a full metric.

2.3. Bivariate Entropy

To study correlations between variables, it is useful to extend the Shannon entropy to the multivariate case. For simplicity, consider a pair of variables

(X, Y)

. Then, bivariate information-theoretic quantities such as joint entropy, conditional entropy, and mutual information can be defined, and their relationships are illustrated in Figure 3.

The most straightforward two-variable extension of Shannon entropy is the joint entropy

H_{(X, Y)}

, which measures the total uncertainty when considering a pair of variables together, and its corresponding expression is as follows:

\begin{matrix} H_{(X, Y)} & = - \sum_{x \in X} \sum_{y \in Y} p (x, y) log p (x, y) \\ = E_{p} [log \frac{1}{p_{(X, Y)}}] \end{matrix}

(7)

where

p (x, y)

is the joint probability distribution function. Since a pair of variables can be treated as a single vector of length two, this extension does not introduce fundamentally new concepts.

Other bivariate entropy measures include conditional entropy and mutual information, which can be expressed in the framework of the Kullback–Leibler divergence we introduced in Equation (5). The conditional entropy of one variable, given another, is defined as the expected value of the entropy of the conditional distributions, averaged over the conditioning variable.

\begin{matrix} H_{(Y | X)} & = \sum_{x \in X} p (x) H_{(Y | X = x)} \\ = - \sum_{x \in X} p (x) \sum_{y \in Y} p (y | x) log p (y | x) \\ = - \sum_{x \in X} \sum_{y \in Y} p (x, y) log p (y | x) \\ = E_{p} [log \frac{1}{p_{(Y | X)}}] \end{matrix}

(8)

Compared to the concept of entropy, which is a probabilistic measure of uncertainty; information is a measure of a reduction in that uncertainty. The mutual information is a measure of the amount of information that one variable contains about another. It is defined as follows:

\begin{matrix} I_{(X; Y)} & = D_{K L (p (x, y) | | p (x) p (y))} \\ = \sum_{x \in X} \sum_{y \in Y} p (x, y) log \frac{p (x, y)}{p (x) p (y)} \\ = E_{p} [log \frac{p_{(X, Y)}}{p_{(X)} p_{(Y)}}] \end{matrix}

(9)

The conditional and mutual information, defined using the Kullback–Leibler divergence, exhibit several important properties, enabling a clear and rigorous analysis.

The chain rule.

$H_{(X, Y)} = H_{(X)} + H_{(Y | X)}$

(10)
Subadditivity.

$H_{(X, Y)} \leq H_{(X)} + H_{(Y)}$

(11)
The relationship between different bivariate entropy,

$\begin{matrix} I_{(X; X)} & = H_{(X)} \end{matrix}$

(12)

$\begin{matrix} I_{(X; Y)} & = I_{(Y; X)} \end{matrix}$

(13)

$\begin{matrix} I_{(X; Y)} & = H_{(X)} - H_{(X | Y)} \end{matrix}$

(14)

$\begin{matrix} I_{(X; Y)} & = H_{(Y)} - H_{(Y | X)} \end{matrix}$

(15)

$\begin{matrix} I_{(X; Y)} & = H_{(X)} + H_{(Y)} - H_{(X, Y)} \end{matrix}$

(16)

$\begin{matrix} I_{(X; Y)} & = H_{(X, Y)} - H_{(X | Y)} - H_{(Y, X)} \end{matrix}$

(17)

3. Basic Ingredients of Information Theory in Quantum Chemistry: Reduced Density Matrix

First proposed by Paul Dirac in 1930 [69], the reduced density matrix (RDM) becomes a fundamental tool in quantum chemistry and many-body physics, enabling the analysis of subsystems within larger quantum systems [70,71,72,73,74]. Because electron–electron repulsion is a two-body operator, the one- and two-electron reduced density matrices (1-RDM and 2-RDM) are the quantities of greatest interest in quantum chemistry, as it suffices to determine most molecular properties, including the electronic energy [75,76,77]. Similarly, for the application of information theory in quantum chemistry, the reduced density matrix emerges as the natural mathematical framework for elucidating the correlations and entanglement [78,79].

3.1. Density Matrix and Reduced Density Matrix

For quantum many-body systems, other than the usual state vectors

| Ψ 〉

(mathematically described as rays in a projective Hilbert space), quantum states can also be represented by the density matrix

D

(density operator

\hat{D}

), which unifies the description of pure and mixed states. If a density matrix is obtained as

D = | Ψ 〉 〈 Ψ |

(18)

then the quantum state is a pure state. A mixed state cannot be expressed this way and instead requires a statistical mixture,

D = \sum_{n} ω_{n} | Ψ_{n} 〉 〈 Ψ_{n} | (ω_{n} \geq 0, \sum_{n} ω_{n} = 1)

(19)

where

{| Ψ_{n} 〉}

is an ensemble of pure states with weights (probability)

ω_{n} \geq 0

.

To further elucidate the concepts of pure states and mixed states, we employ a two-state quantum system as an illustrative example. It is often implemented using a spin

\frac{1}{2}

physical particle, with two basis states

{| 0 〉, | 1 〉}

. A pure state of such a two-state quantum system is represented by a state vector:

| ψ 〉 = α | 0 〉 + {β | 1 〉 | α |}^{2} + {| β |}^{2} = 1 α, β \in C

(20)

The constraint conditions in Equation (20) indicate that a pure state of a two-state quantum system can be equivalently expressed as

| ψ 〉 = cos (\frac{θ}{2}) | 0 〉 + e^{i φ} sin (\frac{θ}{2}) | 1 〉 θ \in [0, π], φ \in [0, 2 π]

(21)

Thus, the pure state of a two-state quantum system can be uniquely determined by two parameters

θ

and

φ

. Here, we can establish a geometric representation for this two-state quantum system: every pure state

| ψ 〉

defined in Equation (21) uniquely corresponds to a point (cos

φ

sin

θ

, sin

φ

sin

θ

, cos

θ

) on the surface of the Bloch sphere as shown in Figure 4.

For the mixed states (the same applies to pure states) of a two-state quantum system, it is evident that the density matrix can be represented by a 2 × 2 matrix, since

\begin{matrix} D & = \sum_{n} ω_{n} | ψ_{n} 〉 〈 ψ_{n} | \\ = \sum_{n} ω_{n} (| α_{n} |^{2} | 0 〉 〈 0 | + α_{n} β_{n}^{*} | 0 〉 〈 1 | + α_{n}^{*} β_{n} | 1 〉 〈 0 | + | β_{n} |^{2} | 1 〉 〈 1 |) \\ = [\begin{matrix} \sum_{n} ω_{n} {| α_{n} |}^{2} & \sum_{n} ω_{n} α_{n} β_{n}^{*} \\ \sum_{n} ω_{n} α_{n}^{*} β_{n} & \sum_{n} ω_{n} {| β_{n} |}^{2} \end{matrix}] \end{matrix}

(22)

It can be rigorously proven that every point within the Bloch sphere corresponds to a mixed state. In particular, the center point represents the maximally mixed state, as established in quantum information theory [80].

For conciseness, we will henceforth assume that the density matrix (DM) and the reduced density matrix (RDM) correspond to pure states, but most of the analysis extends directly to mixed states. The k-electron reduced density matrix (k-RDM) is defined as the partial trace, over

N - k

dimensions, of an N-electron density matrix; it is usually expanded on the basis of single-particle states (i.e., spin orbitals). Specifically,

{}^{k}{\bar{D}} = \sum_{p_{1}, \dots, p_{k}, q_{1}, \dots, q_{k}} | p_{1}, \dots, p_{k} 〉 〈 q_{1}, \dots, p_{k} | ({}^{k}{\bar{D}}_{q_{1}, \dots, q_{k}}^{p_{1}, \dots, p_{k}})

(23)

where

{}^{k}{\bar{D}}_{q_{1}, \dots, q_{k}}^{p_{1}, \dots, p_{k}} = 〈 Ψ | a_{p_{1}}^{†} \dots, a_{p_{k}}^{†} a_{q_{k}}, \dots, a_{q_{1}} | Ψ 〉

(24)

are elements of the k-RDM, which is Hermitian and positive semidefinite because the density matrix is Hermitian and positive semidefinite. We use the overbar above symbols when we need to explicitly distinguish quantities expressed in the spin-orbital, rather than the spatial orbital, basis.

3.2. 1-RDM and 2-RDM

The quantities of greatest interest to chemists are the one- and two-electron reduced density matrices. The 1-RDM is defined as follows:

\begin{matrix} {}^{1}{\bar{D}} & = \sum_{p q} | p 〉 〈 q | 〈 Ψ | a_{p}^{†} a_{q} | Ψ 〉 \\ = \sum_{p q} {}^{1}{\bar{D}}_{q}^{p} | p 〉 〈 q | \end{matrix}

(25)

and the 2-RDM is defined as follows:

\begin{matrix} {}^{2}{\bar{D}} & = \sum_{p q r s} | p q 〉 〈 r s | 〈 Ψ | a_{p}^{†} a_{q}^{†} a_{s} a_{r} | Ψ 〉 \\ = \sum_{p q r s} {}^{2}{\bar{D}}_{r s}^{p q} | p q 〉 〈 r s | \end{matrix}

(26)

In our convention, the trace of the 1-RDM is the number of electrons and the trace of the 2-RDM is the number of electron pairs:

\begin{matrix} Tr [{}^{1}{\bar{D}}] & = N \end{matrix}

(27)

\begin{matrix} Tr [{}^{2}{\bar{D}}] & = (\binom{N}{2}) = \frac{N (N - 1)}{2} \end{matrix}

(28)

The reader is cautioned that some authors use different normalization conventions (e.g., unit normalization or the number of non-unique electron pairs,

Tr [{}^{2}{\bar{D}}] = N (N - 1)

).

The reduced density matrix (RDM) can be represented in a spin block format because the projection of the spin vector onto a specified axis,

{\hat{S}}^{z}

, commutes with the molecular Hamiltonian.

[\hat{H}, {\hat{S}}^{z}] = \hat{H} {\hat{S}}^{z} - {\hat{S}}^{z} \hat{H} = 0

(29)

Thus, the one-particle reduced density matrix (1-RDM) will have a block-diagonal form.

{}^{1}{\bar{D}} = [\begin{matrix} {}^{1}{\bar{D}}_{α}^{α} & 0 \\ 0 & {}^{1}{\bar{D}}_{β}^{β} \end{matrix}]

(30)

Similarly, the spin-block format of the two-particle reduced density matrix (2-RDM) is as follows:

{}^{2}{\bar{D}} = [\begin{matrix} {}^{2}{\bar{D}}_{α α}^{α α} & 0 & 0 & 0 \\ 0 & {}^{2}{\bar{D}}_{α β}^{α β} & {}^{2}{\bar{D}}_{β α}^{α β} & 0 \\ 0 & {}^{2}{\bar{D}}_{α β}^{β α} & {}^{2}{\bar{D}}_{β α}^{β α} & 0 \\ 0 & 0 & 0 & {}^{2}{\bar{D}}_{β β}^{β β} \end{matrix}]

(31)

Note that knowledge of any one of the four opposite-spin blocks, e.g.,

{}^{2}{\bar{D}}_{β α}^{α β}

, suffices to determine the others using (anti)symmetry.

3.3. 3-RDM and 4-RDM

In general, reduced density matrices of order greater than two are essentially redundant for most quantum chemistry problems, as electrons interact only pairwise. However, for certain niche quantum chemistry applications, the information provided by the 3-RDM and 4-RDM is still required. The three-electron reduced density matrix (3-RDM) can be explicitly defined as follows:

\begin{matrix} {}^{3}{\bar{D}} & = \sum_{p q r s t u} | p q r 〉 〈 s t u | 〈 Ψ | a_{p}^{†} a_{q}^{†} a_{r}^{†} a_{u} a_{t} a_{s} | Ψ 〉 \\ = \sum_{p q r s t u} {}^{3}{\bar{D}}_{s t u}^{p q r} | p q r 〉 〈 s t u | \end{matrix}

(32)

and, similarly, the four-electron reduced density matrix (4-RDM) is defined as follows:

\begin{matrix} {}^{4}{\bar{D}} & = \sum_{p q r s t u v w} | p q r s 〉 〈 t u v w | 〈 Ψ | a_{p}^{†} a_{q}^{†} a_{r}^{†} a_{s}^{†} a_{w} a_{v} a_{u} a_{t} | Ψ 〉 \\ = \sum_{p q r s t u v w} {}^{4}{\bar{D}}_{t u v w}^{p q r s} | p q r s 〉 〈 t u v w | \end{matrix}

(33)

We can easily determine lower-order reduced density matrices (RDM) by taking the partial trace of a higher-order one. However, using higher-order reduced density matrices than required is undesirable, as the computational cost and memory requirements of high-order reduced density matrices are prohibitively large. For example, the storage required to store the complete 3-RDM and 4-RDM scales as

O (n^{6})

and

O (n^{8})

, respectively, where n is the number of basis functions. Higher-order reduced density matrices can be systematically, but approximately, expressed in terms of lower-order reduced density matrices using diagrammatic and statistical techniques [81,82,83,84,85,86,87,88,89,90,91].

Specifically, using the cumulant expansion one can decompose higher-order RDM into sums of products of lower-order quantities and nonreducible k-order cumulants

{}^{k}Δ

[81,82,88,92]. As a starting point, the 1-RDM can be expressed as the sum of a mean-field term and a correlated cumulant term:

{}^{1}D_{q}^{p} = {({}^{1}D_{0})}_{q}^{p} + {}^{1}Δ_{q}^{p}

(34)

where

{}^{1}D_{0}

is the known mean-field 1-RDM, the two-particle reduced density matrix (2-RDM) can then be expressed as the wedge (or Grassmann) product of two one-particle reduced density matrices (1-RDM) and a cumulant 2-RDM.

{}^{2}D_{r s}^{p q} = 2 {}^{1}D_{r}^{p} \land {}^{1}D_{s}^{q} + {}^{2}Δ_{r s}^{p q}

(35)

where the wedge product, denoted as ∧, is defined as an antisymmetric tensor product. Given two reduced density matrices of orders k and l, denoted

{}^{k}D

and

{}^{l}D

respectively, their wedge product yields a (k + l)-order RDM satisfying

\begin{matrix} {}^{k}D \land {}^{l}D = \frac{1}{(k + l)!} \sum_{π} sgn (π) P_{π} ({}^{k}D \otimes {}^{l}D) \end{matrix}

(36)

where

P_{π}

permutes the upper and lower indices according to the permutation

π

.

sgn (π)

is the parity of the permutation (+1 for even, −1 for odd). As the most elementary example, the wedge product of two 1-RDMs is given by

{}^{1}D_{r}^{p} \land {}^{1}D_{s}^{q} = {}^{1}D_{r}^{p} {}^{1}D_{s}^{q} - {}^{1}D_{s}^{p} {}^{1}D_{r}^{q}

(37)

The cumulant expansions of the three-particle and four-particle reduced density matrices (3-RDM and 4-RDM) are given by

{}^{3}D_{s t u}^{p q r} = 6 {}^{1}D_{s}^{p} \land {}^{1}D_{t}^{q} \land {}^{1}D_{u}^{r} + 9 {}^{2}D_{s t}^{p q} \land {}^{1}D_{u}^{r} + {}^{3}Δ_{s t u}^{p q r}

(38)

and

\begin{matrix} {}^{4}D_{t u v w}^{p q r s} & = 24 {}^{1}D_{t}^{p} \land {}^{1}D_{u}^{q} \land {}^{1}D_{v}^{r} \land {}^{1}D_{w}^{s} + 72 {}^{2}D_{t u}^{p q} \land {}^{1}D_{v}^{r} \land {}^{1}D_{w}^{s} \\ \begin{matrix} + \end{matrix} 24 {}^{2}D_{t u}^{p q} \land {}^{2}D_{v w}^{r s} + 16 {}^{3}D_{t u v}^{p q r} \land {}^{1}D_{w}^{s} + {}^{4}Δ_{t u v w}^{p q r s} \end{matrix}

(39)

The traces of the density matrices in these expressions give the number of distinguishable pairs, triples, and quartets of electrons, respectively. Dividing by

k!

, where k is the order of the reduced density matrix, recovers the definition we’ve used elsewhere.

4. Classical Information Theory in Quantum Chemistry

The Hohenberg–Kohn theorems [51,52,93,94,95] imply that all ground state properties are functionals of the electron density; this establishes the theoretical basis for using classical information theory that extracts chemical insights by treating electron densities as probability distributions.

4.1. Electron Density in Position Space

In Section 3, we establish reduced density matrices as the basic ingredients for information-theoretic analysis in quantum chemistry. For molecular systems, the concept of probability distribution

p (x)

in general information theory is specialized as an analytical functional of the electron density

ρ (r)

, which depends solely on the spatial coordinates,

r \in R^{3}

:

ρ (r) = N \int | Ψ (x_{1}, x_{2}, \dots, x_{N}) |^{2} d s_{1} d x_{2} \dots d x_{N}

(40)

By construction,

ρ (r) \geq 0, \int ρ (r) d r = N

(41)

Another fundamental information carrier in quantum chemistry is the two-electron distribution function, or pair density, defined as

Γ (r_{1}, r_{2}) = \frac{N (N - 1)}{2} \int {| Ψ (x_{1}, x_{2}, \dots, x_{N}) |}^{2} d s_{1} d s_{2} d x_{3} \dots d x_{N}

(42)

By definition,

Γ (r_{1}, r_{2}) \geq 0, \int \int Γ (r_{1}, r_{2}) d r_{1} d r_{2} = \frac{N (N - 1)}{2}

(43)

Analogous to density-functional theory, a complete description of electronic structure can be constructed based on the pair density (and also higher-order electron distribution functions) using appropriate generalizations of the Hohenberg–Kohn theorem [96,97,98,99,100,101,102,103,104].

The electron density and pair density can be computed from the reduced density matrices (RDMs) we introduced in Section 3. However, in that section, we considered the spin-resolved reduced density matrices, and it is more convenient in this context to trace out the spin coordinates and obtain a representation of the reduced density matrices in terms of spatial orbitals, i.e.,

\begin{matrix} {}^{1}D & = {}^{1}{\bar{D}}_{α}^{α} + {}^{1}{\bar{D}}_{β}^{β} \end{matrix}

(44)

\begin{matrix} {}^{2}D & = {}^{2}{\bar{D}}_{α α}^{α α} + {}^{2}{\bar{D}}_{α β}^{α β} + {}^{2}{\bar{D}}_{β α}^{β α} + {}^{2}{\bar{D}}_{β β}^{β β} \end{matrix}

(45)

Next, we transform the one- and two-electron reduced density matrices from the (second-quantized) spatial-orbital representation into the (first-quantized) position representation,

\begin{matrix} ρ (r; r^{'}) & = {}^{1}D_{ν}^{μ} ϕ_{μ} (r) ϕ_{ν} (r^{'}) \end{matrix}

(46)

\begin{matrix} Γ (r_{1}, r_{2}; r_{1}^{'}, r_{2}^{'}) & = {}^{2}D_{κ λ}^{μ ν} ϕ_{μ} (r_{1}) ϕ_{ν} (r_{2}) ϕ_{κ} (r_{1}^{'}) ϕ_{λ} (r_{2}^{'}) \end{matrix}

(47)

We use the indices

μ

,

ν

,

κ

, and

λ

to label atomic orbitals. The one-electron density

ρ (r)

(Equation (40)) and the pair-electron density

Γ (r_{1}, r_{2})

(Equation (42)) represent the diagonal components of the spinless one- and pair-electron reduced density matrix in position space, respectively, which is obtained by setting the un-primed spatial variables equal to the primed spatial variables,

r_{i} = r_{i}^{'}

. Note that off-diagonal elements of the orbital representation of the RDM contribute to diagonal elements of the spatial representation of the RDM and vice versa. This has significant implications for the N-representable of electron distribution functions [96,105].

4.2. Information-Theoretic Approach Chemical Descriptors

Taking electron density as the information carrier, classical information theory has been applied to DFT since the mid-20th century to study atoms and molecular systems within the framework of the information-theoretic approach (ITA) [8,9,10,11,12,13,14,25,26,27,28,29,30,31,32,33,34].

The Shannon entropy, with

ρ (r)

as its information carrier, measures the spatial delocalization of the electron density and is defined as follows:

S_{S} \equiv S_{S (X)} = - \int ρ (r) log ρ (r) d r

(48)

In accordance with the traditions of the information-theoretic approach, the base-10 logarithm is used throughout Section 4.2. In the information-theoretic approach, we employ a broader set of formulas beyond Shannon entropy. To maintain notational clarity, we adopt a systematic symbolic convention where Shannon’s formula is specifically denoted by the symbol

S_{S}

. When only the one-electron density is considered, the subscript_(X) in Equation (48) is typically omitted.

By introducing the reference density

ρ^{0} (r)

, which corresponds to the electron density of the promolecule constructed under the assumption that each atom retains its density as if it were isolated [106,107,108,109,110,111], we can define the relative Shannon entropy

S_{S}^{r}

. This quantity, also referred to as the Kullback–Leibler divergence, information gain, or information divergence [14,26,27,111,112,113,114,115], is defined by

S_{S}^{r} \equiv D_{K L (X | | X^{0})} = \int ρ (r) log \frac{ρ (r)}{ρ^{0} (r)} d r

(49)

The joint entropy

S_{S (X, Y)}

, which measures the localization of the pair of electrons in their respective spaces, is defined as follows:

S_{S (X, Y)} = - \int \int Γ (r_{1}, r_{2}) log Γ (r_{1}, r_{2}) d r_{1} d r_{2}

(50)

In addition to joint entropy, other bivariate entropy measures need to maintain the same normalization condition for the state density and the reference state density. Thus, the normalized one- and pair-electron densities

ρ^{σ} (r)

and

Γ^{σ} (r_{1}, r_{2})

are defined as

ρ^{σ} (r) \equiv σ (r) = ρ (r) / N

(51)

and

Γ^{σ} (r_{1}, r_{2}) = Γ (r_{1}, r_{2}) / (\binom{N}{2})

(52)

where N is the number of electrons. Following the definition of Parr and Bartolotti in 1983 [116], these unit-normalized densities are also referred to as shape functions

σ (r)

[116,117,118,119]. They exhibit the obvious non-negative properties

ρ^{σ} (r) \geq 0 \forall r

and

Γ^{σ} (r_{1}, r_{2}) \geq 0 \forall {r_{1}, r_{2}}

.

Using the normalized pair electron density as the distribution functions, the joint entropy

S_{S (X, Y)}^{σ}

is defined as

S_{S (X, Y)}^{σ} = - \int \int Γ^{σ} (r_{1}, r_{2}) log Γ^{σ} (r_{1}, r_{2}) d r_{1} d r_{2}

(53)

The conditional entropy is defined as the Kullback–Leibler divergence of the unit-normalized pair electron density from the unit-normalized electron densities.

\begin{matrix} S_{S (X | Y)}^{σ} & = - D_{K L (Γ^{σ} (r_{1}, r_{2}) | | ρ^{σ} (r_{2}))} \\ = - \int ρ^{σ} (r_{2}) d r_{2} \int Γ^{σ} (r_{1} | r_{2}) log Γ^{σ} (r_{1} | r_{2}) d r_{1} \\ = - \int \int Γ^{σ} (r_{1}, r_{2}) log \frac{Γ^{σ} (r_{1}, r_{2})}{ρ^{σ} (r_{2})} d r_{1} d r_{2} \end{matrix}

(54)

The mutual information

I_{(X; Y)}

, is defined as the Kullback–Leibler divergence of the unit-normalized pair electron density from the product of two unit-normalized electron densities. The mutual information then measures the divergence of the pair electron density from the value it would have if the electrons moved entirely independently.

\begin{matrix} S_{S (X; Y)}^{σ} \equiv I_{(X; Y)} & = D_{K L (Γ^{σ} (r_{1}, r_{2}) | | ρ^{σ} (r_{1}) ρ^{σ} (r_{2}))} \\ = \int \int Γ^{σ} (r_{1}, r_{2}) log \frac{Γ^{σ} (r_{1}, r_{2})}{ρ^{σ} (r_{1}) ρ^{σ} (r_{2})} d r_{1} d r_{2} \end{matrix}

(55)

Within the information theory framework introduced in Section 2, which utilizes the one- and pair-electron densities as fundamental information carriers, all the classical information-theoretic quantities we defined in this subsection preserve the essential mathematical properties we introduced in Section 2.3.

Examples and Illustrations

Using the information-theoretic approach, we can systematically interpret chemical concepts such as chemical bonds, chemical reactivity, electron shells, lone electron pairs, and more [8,9,10,12,13,14,120,121,122,123,124,125,126,127,128,129,130]. The information-theoretic approach descriptors can be categorized based on the scope of molecular features they capture:

Global Descriptors: Assign a value to the entire system.
Local Descriptors: Assign a value to each position in the system.
Non-local Descriptors: Assign a value to each pair of positions in the system.

The global ITA descriptors describe the overall properties of the system as a whole, enabling prediction of properties such as polarizability, aromaticity, acidity/basicity, and reactivity [9,10,12,120,121,122,123,131,132]. Figure 5 plots the correlation between Shannon entropy aromaticity (

Δ S_{S}

) [120] and several established aromaticity indices: the harmonic oscillator model of aromaticity (HOMA) [133,134], the aromatic fluctuation index (FLU) [135], and nucleus-independent chemical shifts (NICS) [136,137]. The strong correlations demonstrate that Shannon entropy effectively characterizes aromaticity.

The local ITA descriptors can serve as regioselectivity indicators, through a coarse-graining process, which involves integrating their values over atoms, functional groups, or fragment regions; condensed descriptors are obtained [131,132,133]. These descriptors help identify the most reactive atoms, functional groups, or bonds. Figure 6 provides an example of how local Shannon entropy is applied to reveal electron shell structures in noble gas atoms [125]. The radial distribution functions of the Shannon entropy densities are defined as

\begin{matrix} {\bar{h}}_{(X)} (r) & = \int r^{2} h_{(X)} (r, θ, ϕ) sin (θ) d θ d ϕ \\ = 4 π r^{2} h_{(X)} (r) \end{matrix}

(56)

It shows step-like increases in entropy at shell boundaries.

Non-local ITA descriptors quantify how the properties of a molecule at one location respond to changes at another distant point within the same molecule. These descriptors can also be condensed into response matrices, to quantify the correlation between fragments of the system [139]. Figure 7 illustrates the application of joint entropy, conditional entropy, and the mutual information kernel to analyze electron correlation in the krypton atom.

5. Quantum Information Theory in Quantum Chemistry

In this section, we transition our discussion of information theory to the quantum realm [21,23,24,47,78,79,80,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155]. As we delve deeper, it will become evident that the quantum case holds far richer possibilities, primarily due to the superposition principle.

5.1. Bipartite Entanglement

In quantum chemistry, for a system of basis set

{| n_{p} 〉}

with finite cardinality L, each basis

| n_{p} 〉

associates with a local Hilbert space

H_{i}

. Then, the total Hilbert space

H

spanned by the entire basis set is the tensor product of all local Hilbert spaces

H = \otimes_{p = 1}^{L} H_{p}

[24,142,143,144]. For an arbitrary state

| Ψ 〉 \in H

\begin{matrix} | Ψ 〉 & = \sum_{n_{1}, \dots, n_{L}} C_{n_{1}, \dots, n_{L}} | n_{1}, \dots, n_{L} 〉 \\ = \sum_{n_{1}, \dots, n_{L}} C_{n_{1}, \dots, n_{L}} | n_{1} 〉 \otimes \dots \otimes | n_{L} 〉 \end{matrix}

(57)

When the system is divided into two parts A and B, the composite Hilbert space

H \equiv H_{A B}

of the whole system is given as follows:

H \equiv H_{AB} = H_{A} \otimes H_{B}

(58)

where

H_{A}

and

H_{B}

are the Hilbert spaces of subsystems A and B, respectively. Only when there is no entanglement between the systems can the state

| Ψ_{A B} 〉

be represented as a tensor product of the states of the subsystems

| Ψ_{A} 〉

and

| Ψ_{B} 〉

, as shown in Figure 8a.

| Ψ 〉 \equiv | Ψ_{A B} 〉 = | Ψ_{A} 〉 \otimes | Ψ_{B} 〉

(59)

As a consequence of quantum entanglement among arbitrary subsystems, as shown in Figure 8b, for a generic

| Ψ 〉 \in H

, it should be represented as a series of tensor products of basis states of subsystems A and B,

| Ψ 〉 \equiv | Ψ_{A B} 〉 = \sum_{p q} C_{p q} | Ψ_{A}^{p} 〉 \otimes | Ψ_{B}^{q} 〉

(60)

The strategy for measuring bipartite entanglement emerges from the concept of partial measurements [145]. In Section 3.1, we have an introduction of the density matrix for both pure and mixed states; for the sake of brevity, here we only consider the case of pure states. A bipartite reduced density matrix of a pure state can be approached by tracing out (“averaging over”) one of the subsystems:

{}^{A}D = {Tr}_{B} | Ψ 〉 〈 Ψ | {}^{B}D = {Tr}_{A} | Ψ 〉 〈 Ψ |

(61)

thus,

{}^{A}D

and

{}^{B}D

are the reduced density matrices of each subsystem.

The quantification of bipartite entanglement can be approached through quantum information theory with the von Neumann entropy [141], which quantifies the degree of mixedness of a quantum state, measuring the departure of a given density operator from a pure state (for which

S = 0

, representing complete knowledge of the system) [80]. In the specific case of pure states in bipartite quantum systems, it quantifies the entanglement of the quantum system as

S (D) = - Tr (D ln D)

(62)

where

D

is the density matrix. As the quantum counterpart of Shannon entropy we introduced in Equation (1), the von Neumann entropy can be expressed as the Shannon entropy of the eigenvalues

λ_{p}

for the density matrix:

S (D) = - \sum_{p} λ_{p} ln λ_{p}

(63)

To measure the entanglement between the bipartite subsystems, the entanglement entropy

S_{A}

and

S_{B}

are defined as follows:

\begin{matrix} S_{A} & = - Tr ({}^{A}D ln {}^{A}D) \end{matrix}

(64)

\begin{matrix} S_{B} & = - Tr ({}^{B}D ln {}^{B}D) \end{matrix}

(65)

For pure states, the relation

S_{A} = S_{B}

holds, as can be proven via Schmidt decomposition [80].

5.2. Orbital Reduced Density Matrix

For a wavefunction expressed in terms of spatial orbitals, each orbital holds four possible occupations, which can be empty

| 0 〉

, singly occupied with a spin-up electron

| ↑ 〉

, singly occupied with a spin-down electron

| ↑ 〉

, and doubly occupied with an electron pair

| ↑ ↓ 〉

, i.e., the possible orbital occupations are

\begin{matrix} { \end{matrix} | n_{i} 〉 \begin{matrix} } \end{matrix} = {| 0 〉, | ↑ 〉, | ↓ 〉, | ↑ ↓ 〉}

(66)

As shown in Figure 9, if the system is divided into subsystem A composed of p-orbitals and complement subsystem B (orbital bath) composed of the remaining

L - p

orbitals, the RDM of subsystem A is then called the p-orbital reduced density matrix (p-orbital RDM), which can be defined in terms of the full, N-electron RDM or, equivalently, the N-electron wavefunction.

The 1-orbital RDM

{}_{o}^{1}D_{p}

corresponding to the one orbital partition in Figure 9a is expressed in the basis {-, ↑, ↓, ↑↓}, as shown in Table 1 [146,147,148,149], the elements of the matrix can be represented in terms of the spin-dependent 1-electron RDM

{}^{1}{\bar{D}}

and 2-electron RDM

{}^{2}{\bar{D}}

, where the indices p and

\bar{p}

indicate spin-up and spin-down electrons of p-th orbital respectively.

The two orbital partition is shown in Figure 9b, elements of the 2-orbital RDM

{}_{o}^{2}D_{p q}

are summarized in Table 2 [146,147,148,149]. Note that

{}_{o}^{2}D_{p q}

requires only some diagonal elements of the 3- and 4-electron reduced density matrices, as well as a few off-diagonal elements of the 1-, 2-, and 3-electron reduced density matrices.

We should note that the 1-orbital RDM

{}_{o}^{1}D_{p}

can be further simplified for seniority-zero state. Since it excludes singly occupied orbitals and we have the relations

{}^{1}{\bar{D}}_{p}^{p} = {}^{1}{\bar{D}}_{\bar{p}}^{\bar{p}}

, the corresponding basis {-, ↑↓} holds a 2 cardinality, thus the 1-orbital RDM of a seniority-zero state is simplified as a

2 \times 2

matrix:

{}_{o}^{1}D_{p} = [\begin{matrix} 1 - {}^{1}D_{p}^{p} & 0 \\ 0 & {}^{1}D_{p}^{p} \end{matrix}]

(67)

Following the same simplification method, the 2-orbital RDM

{}_{o}^{2}D_{p q}

of a seniority-zero state expressed in the basis {−, ↑↓, ↑↓, ~~↑↓↑↓~~} is reduced to a

4 \times 4

matrix:

{}_{o}^{2}D_{p q} = [\begin{matrix} 1 - {}^{1}D_{p}^{p} - {}^{1}D_{q}^{q} + {}^{2}D_{p \bar{q}}^{p \bar{q}} & 0 & 0 & 0 \\ 0 & {}^{1}D_{p}^{p} - {}^{2}D_{p \bar{q}}^{p \bar{q}} & {}^{2}D_{q \bar{q}}^{p \bar{p}} & 0 \\ 0 & {}^{2}D_{p \bar{p}}^{q \bar{q}} & {}^{1}D_{q}^{q} - {}^{2}D_{q \bar{p}}^{q \bar{p}} & 0 \\ 0 & 0 & 0 & {}^{2}D_{p \bar{q}}^{p \bar{q}} \end{matrix}]

(68)

5.3. Orbital Entanglement

With the preliminary knowledge of quantum information theory and orbital reduced density matrix, we can define the single-orbital entropy and mutual information from 1- and 2-orbital RDM. The first quantity, single–orbital entropy

s {(1)}_{p}

, measures the entanglement between a given orbital p and the complementary orbital bath, using the eigenvalues of the one–orbital reduced density matrix as information carriers. The single–orbital entropy

s {(1)}_{p}

is defined as follows:

\begin{matrix} s {(1)}_{p} & = - Tr ({}_{o}^{1}D_{p} ln {}_{o}^{1}D_{p}) \\ = - \sum_{α}^{M} λ_{α, p} ln λ_{α, p} \end{matrix}

(69)

where

λ_{α, p}

and M are the eigenvalues and dimension of the p-th one-orbital reduced density matrix respectively, with

M = 2

for seniority-zero state and

M = 4

for other states. The total quantum information encoded in the system is given by the sum of single-orbital entropy:

I_{t o t} = \sum_{p}^{L} s {(1)}_{p}

(70)

Given two states described by the one-orbital reduced density matrices

{}_{o}^{1}D_{p}

and

{}_{o}^{1}D_{q}

, one can define the relative orbital entropy by the KL divergence:

\begin{matrix} s {(1)}_{(p | | q)} & = D_{K L ({}_{o}^{1}D_{p} | | {}_{o}^{1}D_{q})} \\ = Tr ({}_{o}^{1}D_{p} ln \frac{{}_{o}^{1}D_{p}}{{}_{o}^{1}D_{q}}) \end{matrix}

(71)

It measures how much the entanglement of orbital p deviates from that of orbital q.

If the system is divided into two orbitals

(p, q)

and the remaining

L - 2

orbital bath as shown in Figure 9b, the entanglement between them is quantified by the two-orbital entropy

s {(2)}_{(p, q)}

defined as

\begin{matrix} s {(2)}_{(p, q)} & = - Tr ({}_{o}^{2}D_{p q} ln {}_{o}^{2}D_{p q}) \\ = - \sum_{α}^{M} λ_{α, p q} ln λ_{α, p q} \end{matrix}

(72)

where

λ_{α, p q}

and M are the eigenvalues and dimension of the two-orbital reduced density matrix respectively, with

M = 4

for seniority-zero state and

M = 16

for other states. The conditional orbital entropy can be written by the KL divergence as follows:

\begin{matrix} s {(2)}_{(p | q)} & = - D_{K L ({}_{o}^{2}D_{p q} | | {}_{o}^{1}I_{p} \otimes {}_{o}^{1}D_{q})} \\ = - Tr ({}_{o}^{2}D_{p q} ln \frac{{}_{o}^{2}D_{p q}}{{}_{o}^{1}I_{p} \otimes {}_{o}^{1}D_{q}}) \\ = s {(2)}_{(p, q)} - s {(1)}_{q} \end{matrix}

(73)

where

{}_{o}^{1}I_{p}

is the identity matrix with the same dimensions as

{}_{o}^{1}D_{p}

. The total amount of entanglement between two orbitals p and q can be measured by the orbital pair mutual information, written by the KL divergence as

\begin{matrix} I_{(p; q)} & = D_{K L ({}_{o}^{2}D_{p q} | | {}_{o}^{1}D_{p} \otimes {}_{o}^{1}D_{q})} \\ = Tr ({}_{o}^{2}D_{p q} ln \frac{{}_{o}^{2}D_{p q}}{{}_{o}^{1}D_{p} \otimes {}_{o}^{1}D_{q}}) \\ = s {(1)}_{p} + s {(1)}_{q} - s {(2)}_{(p, q)} \end{matrix}

(74)

As an application of the information theory framework introduced in Section 2 where the orbital reduced density matrix serves as the information carrier, those quantum information theory quantities naturally inherit key properties we introduced in Section 2.3, including but not limited to the chain rule and subadditivity.

Examples and Illustrations

The concept of orbital entanglement serves as a complementary tool for interpreting electronic structures, proving particularly valuable in strongly correlated systems. In this subsection, we will present several representative application examples.

Electron correlation effects are conventionally categorized as dynamic and nondynamic (also termed static), where in this classification static is used synonymously with nondynamic [156]. The dynamic correlation effect, though large, arise from relatively small contributions from a large number of configurations. In contrast, the nondynamic/static effect involving large contributions arising from a few configurations, which collectively address orbital (near-) degeneracy. In contexts demanding more nuanced differentiation, static electron correlation (strict degeneracy) and nondynamic correlation (near-degeneracy) are considered separate effects in our formalism [148]. As presented in Refs. [47,148], the strength of single-orbital entropy

s {(1)}_{p}

to measure orbital entanglement and the orbital-pair mutual information

I_{(p; q)}

can be associated with different types of electron correlation effects. In Table 3, we map the strength of orbital interactions onto a certain type of correlation effects. Computational investigations demonstrate that orbitals with nondynamic/static electron correlation effects signify substantial multireference character in the system. In contrast, weakly entangled orbitals predominantly exhibit dynamic correlation effects; systems where all orbitals are weakly entangled can usually be adequately treated by single-reference approaches.

Quantitative visualization of

s {(1)}_{p}

and

I_{(p; q)}

enables a more intuitive analysis. As shown in Figure 10, the strength of the orbital-pair mutual information classified in Table 3 is color-coded,

Blue lines: Nondynamic correlated orbital pairs.
Red lines: Static correlated orbitals.
Green: Dynamic correlated orbitals.

An important issue for the density matrix renormalization group (DMRG) method [157,158,159] is the order of orbitals in the one-dimensional matrix product state (MPS) wavefunction ansatz; an optimal ordering of orbitals corresponding to maximum entanglement will produce the most efficient results [21,160,161]. Since strong (nondynamic/static) electron correlation is essential for proper molecular dissociation into fragments, orbital entanglement provides both a fundamental framework for understanding bond formation/breaking processes [150] and a practical tool for analyzing chemical reactivity.

For many strongly correlated calculation methods such as complete active space self-consistent field (CASSCF) [41,162], the selection of the complete active space (CAS space) is a crucial prerequisite step to keep computational costs within feasible limits. As shown in Figure 11, for the CAS methodology, all the molecular orbitals are classified into three spaces:

Inactive space: Always doubly occupied.
Active space: All the possible configurations are allowed.
Virtual space: Always empty.

Specifically, the active space, usually denoted as CAS

(n, m)

where n and m are the number of electrons and orbitals respectively, should encompass orbitals and electrons essential for capturing strong electron correlation effects. Orbital entanglement serve as powerful tools for identifying critical orbitals in active space [163,164]. Comparing the entanglement diagrams as shown in Figure 10, we can evaluate the quality and convergence behavior of those active-space calculations.

6. Summary and Outlook

This review presents a unified perspective on information theory and its applications in quantum chemistry, integrating both classical and quantum frameworks. Beginning with fundamental concepts like Shannon entropy and its related concepts such as joint entropy, relative entropy, conditional entropy, and mutual information, we demonstrate how information theory can be applied to molecular systems through information carriers such as the electron density and orbital reduced density matrix. The discussion bridges classical concepts with their quantum counterparts, including classical Shannon entropy and quantum correlations to the quantum von Neumann entropy and entanglement. By tracing the historical development from early works by Shannon’s foundational contributions, we highlight how information theory has evolved into a versatile framework with broad applications in quantum chemistry, particularly for analyzing electronic structure and quantum phenomena in chemical systems.

This article only scratches the surface of the vast scope for existing and future applications of information theory in quantum chemistry. Notably, for the application of information theory in quantum chemistry, Shannon’s framework is not the only reasonable formula; the Rényi entropy, the Fisher information, and other f- and Bregman divergences can also be used as a measure of correlation and entanglement. Importantly, these concepts extend far beyond simple pairwise interactions: from single- and pair-electron densities to many-body electron distributions, and from bipartite systems to complex multipartite quantum entanglement. The physical manifestations of information carriers are likewise diverse, ranging from atoms in molecules (AIM) to localized molecular orbitals. By creatively combining the above extended concepts as well as other potential extensions, we can even derive additional novel concepts and tools for advancing our understanding of quantum chemistry.

Furthermore, other domains such as machine learning and quantum computing that represent one of the most active research frontiers in science, also possess profound foundations in information theory. The integration of quantum chemistry with information theory, machine learning, and quantum computing is establishing a transformative new paradigm for quantum chemical research.

Author Contributions

Conceptualization, Y.Z., D.Z., C.R., S.L. and P.W.A.; methodology, Y.Z., D.Z., C.R., S.L. and P.W.A.; software, Y.Z.; validation, Y.Z.; formal analysis, Y.Z.; investigation, Y.Z.; resources, Y.Z.; data curation, Y.Z.; writing—original draft preparation, Y.Z.; writing—review and editing, Y.Z. and P.W.A.; visualization, Y.Z.; supervision, S.L. and P.W.A.; project administration, Y.Z.; funding acquisition, D.Z., C.R., S.L. and P.W.A. All authors have read and agreed to the published version of the manuscript.

Funding

D.Z. is supported by the National Natural Science Foundation of China (Grant No. 22203071), the High-Level Talent Special Support Plan and the China Scholarship Council. C.R. acknowledges support from the National Natural Science Foundation of China (Grant No. 22373034). The authors appreciate financial support from the Natural Sciences and Engineering Research Council of Canada (NSERC) and the Canada Research Chairs.

Acknowledgments

The authors also acknowledge computational resources from the Digital Research Alliance of Canada.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$H, H_{(X)}$	Shannon entropy
$D_{K L (P \| \| Q)}$	Relative entropy, Kullback–Leibler divergence
$H_{(X, Y)}$	Joint entropy
$H_{(X \| Y)}$	Conditional entropy
$I_{(X; Y)}$	Mutual information
$D, \hat{D}$	Density matrix, Density operator
${}^{k}{\bar{D}}, {}^{k}D$	k-Electron reduced density matrix with spin and spatial orbital
$ρ (r)$	Electron density at position $r$
$ρ^{σ} (r), σ (r)$	Unit-normalized electron density (shape function) at position $r$
$Γ (r_{1}, r_{2})$	Pair-electron density at position $r_{1}$ and $r_{2}$
$Γ^{σ} (r_{1}, r_{2})$	Unit-normalized pair-electron density at position $r_{1}$ and $r_{2}$
$S_{S}, S_{S (X)}$	Shannon entropy with electron density
$S_{S}^{r}, S_{S (X \| \| X^{0})}^{r}$	Relative entropy with electron density
$S_{S (X, Y)}$	Joint entropy with electron density
$S_{S (X \| Y)}$	Conditional entropy with electron density
$S_{S (X; Y)}$	Mutual information with electron density
${}_{o}^{k}D$	k-Orbital reduced density matrix
$s {(1)}_{p}$	One-orbital entropy
$s {(2)}_{(p \| \| q)}$	Orbital relative entropy
$s {(2)}_{(p, q)}$	Two-orbital entropy
$s {(2)}_{(p \| q)}$	Orbital conditional entropy
$I_{(p; q)}$	Orbital mutual information

References

Hartley, R.V.L. Transmission of information. Bell Syst. Tech. J. 1928, 7, 535–563. [Google Scholar] [CrossRef]
Nyquist, H. Certain factors affecting telegraph speed. Bell Syst. Tech. J. 1924, 3, 324–346. [Google Scholar] [CrossRef]
Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd ed.; John Wiley & Sons, Ltd: New York, NY, USA, 2005. [Google Scholar] [CrossRef]
Witten, E. A mini-introduction to information theory. Riv. Nuovo Cim. 2020, 43, 187–227. [Google Scholar] [CrossRef]
Parr, R.G.; Yang, W. Density-Functional Theory of Atoms and Molecules; Oxford University Press: Oxford, UK, 1995. [Google Scholar] [CrossRef]
Helgaker, T.; Jørgensen, P.; Olsen, J. Molecular Electronic-Structure Theory; John Wiley & Sons, Ltd: Hoboken, NJ, USA, 2000. [Google Scholar] [CrossRef]
Liu, S. Information-Theoretic Approach in Density Functional Reactivity Theory. Acta Phys.-Chim. Sin. 2016, 32, 98–118. [Google Scholar] [CrossRef]
Zhao, D.; Zhao, Y.; He, X.; Li, Y.; Ayers, P.W.; Liu, S. Accurate and Efficient Prediction of Post-Hartree–Fock Polarizabilities of Condensed-Phase Systems. J. Chem. Theory Comput. 2023, 19, 6461–6470. [Google Scholar] [CrossRef]
Zhao, D.; Zhao, Y.; He, X.; Ayers, P.W.; Liu, S. Efficient and accurate density-based prediction of macromolecular polarizabilities. Phys. Chem. Chem. Phys. 2023, 25, 2131–2141. [Google Scholar] [CrossRef]
Zhao, D.; Zhao, Y.; Xu, E.; Liu, W.; Ayers, P.W.; Liu, S.; Chen, D. Fragment-Based Deep Learning for Simultaneous Prediction of Polarizabilities and NMR Shieldings of Macromolecules and Their Aggregates. J. Chem. Theory Comput. 2024, 20, 2655–2665. [Google Scholar] [CrossRef]
Zhao, Y.; Zhao, D.; Liu, S.; Rong, C.; Ayers, P.W. Why are information-theoretic descriptors powerful predictors of atomic and molecular polarizabilities. J. Mol. Model. 2024, 30, 361. [Google Scholar] [CrossRef]
Rong, C.; Zhao, D.; He, X.; Liu, S. Development and Applications of the Density-Based Theory of Chemical Reactivity. J. Phys. Chem. Lett. 2022, 13, 11191–11200. [Google Scholar] [CrossRef]
Liu, S. Identity for Kullback-Leibler divergence in density functional reactivity theory. J. Chem. Phys. 2019, 151, 141103. [Google Scholar] [CrossRef] [PubMed]
Wu, W.; Scholes, G.D. Foundations of Quantum Information for Physical Chemistry. J. Phys. Chem. Lett. 2024, 15, 4056–4069. [Google Scholar] [CrossRef] [PubMed]
Materia, D.; Ratini, L.; Angeli, C.; Guidoni, L. Quantum Information reveals that orbital-wise correlation is essentially classical in Natural Orbitals. arXiv 2024, arXiv:2404.14093. [Google Scholar] [CrossRef] [PubMed]
Aliverti-Piuri, D.; Chatterjee, K.; Ding, L.; Liao, K.; Liebert, J.; Schilling, C. What can quantum information theory offer to quantum chemistry? Faraday Discuss. 2024, 254, 76–106. [Google Scholar] [CrossRef]
Nowak, A.; Legeza, O.; Boguslawski, K. Orbital entanglement and correlation from pCCD-tailored coupled cluster wave functions. J. Chem. Phys. 2021, 154, 084111. [Google Scholar] [CrossRef]
Ding, L.; Mardazad, S.; Das, S.; Szalay, S.; Schollwöck, U.; Zimborás, Z.; Schilling, C. Concept of Orbital Entanglement and Correlation in Quantum Chemistry. J. Chem. Theory Comput. 2021, 17, 79–95. [Google Scholar] [CrossRef]
Ratini, L.; Capecci, C.; Guidoni, L. Natural Orbitals and Sparsity of Quantum Mutual Information. J. Chem. Theory Comput. 2024, 20, 3535–3542. [Google Scholar] [CrossRef]
Legeza, O.; Sòlyom, J. Optimizing the density-matrix renormalization group method using quantum information entropy. Phys. Rev. B 2003, 68, 195116. [Google Scholar] [CrossRef]
Convy, I.; Huggins, W.; Liao, H.; Birgitta Whaley, K. Mutual information scaling for tensor network machine learning. Mach. Learn. Sci. Technol. 2022, 3, 015017. [Google Scholar] [CrossRef]
Legeza, O.; Sólyom, J. Two-Site Entropy and Quantum Phase Transitions in Low-Dimensional Models. Phys. Rev. Lett. 2006, 96, 116401. [Google Scholar] [CrossRef]
Szalay, S.; Pfeffer, M.; Murg, V.; Barcza, G.; Verstraete, F.; Schneider, R.; Legeza, O. Tensor product methods and entanglement optimization for ab initio quantum chemistry. Int. J. Quantum Chem. 2015, 115, 1342–1391. [Google Scholar] [CrossRef]
Sears, S.B.; Parr, R.G.; Dinur, U. On the Quantum-Mechanical Kinetic Energy as a Measure of the Information in a Distribution. Isr. J. Chem. 1980, 19, 165–173. [Google Scholar] [CrossRef]
Nalewajski, R.F.; Parr, R.G. Information theory, atoms in molecules, and molecular similarity. Proc. Natl. Acad. Sci. USA 2000, 97, 8879–8882. [Google Scholar] [CrossRef] [PubMed]
Nalewajski, R.F.; Parr, R.G. Information Theory Thermodynamics of Molecules and Their Hirshfeld Fragments. J. Phys. Chem. A 2001, 105, 7391–7400. [Google Scholar] [CrossRef]
Levine, R.D.; Bernstein, R.B. Energy disposal and energy consumption in elementary chemical reactions. Information theoretic approach. Acc. Chem. Res. 1974, 7, 393–400. [Google Scholar] [CrossRef]
Procaccia, I.; Levine, R.D. The populations time evolution in vibrational disequilibrium: An information theoretic approach with application to HF. J. Chem. Phys. 1975, 62, 3819–3820. [Google Scholar] [CrossRef]
Dinur, U.; Kosloff, R.; Levine, R.; Berry, M. Analysis of electronically nonadiabatic chemical reactions: An information theoretic approach. Chem. Phys. Lett. 1975, 34, 199–205. [Google Scholar] [CrossRef]
Procaccia, I.; Levine, R.D. Vibrational energy transfer in molecular collisions: An information theoretic analysis and synthesis. J. Chem. Phys. 1975, 63, 4261–4279. [Google Scholar] [CrossRef]
Levine, R.D.; Manz, J. The effect of reagent energy on chemical reaction rates: An information theoretic analysis. J. Chem. Phys. 1975, 63, 4280–4303. [Google Scholar] [CrossRef]
Levine, R.D. Entropy and macroscopic disequilibrium. II. The information theoretic characterization of Markovian relaxation processes. J. Chem. Phys. 1976, 65, 3302–3315. [Google Scholar] [CrossRef]
Levine, R.D. Information Theory Approach to Molecular Reaction Dynamics. Ann. Rev. Phys. Chem. 1978, 29, 59–92. [Google Scholar] [CrossRef]
Slater, J.C. The Theory of Complex Spectra. Phys. Rev. 1929, 34, 1293–1322. [Google Scholar] [CrossRef]
Hartree, D.R. Some Relations between the Optical Spectra of Different Atoms of the same Electronic Structure. II. Aluminium-like and Copper-like Atoms. Math. Proc. Camb. Phil. Soc. 1926, 23, 304–326. [Google Scholar] [CrossRef]
Fock, V.A.Z. Näherungsmethode zur Lösung des quantenmechanischen Mehrkörperproblems. Z. Phys. 1930, 61, 126–148. [Google Scholar] [CrossRef]
Roothaan, C.C.J. New Developments in Molecular Orbital Theory. Rev. Mod. Phys. 1951, 23, 69–89. [Google Scholar] [CrossRef]
Koga, T.; Tatewaki, H.; Thakkar, A.J. Roothaan-Hartree-Fock wave functions for atoms with Z ≤ 54. Phys. Rev. A 1993, 47, 4510–4512. [Google Scholar] [CrossRef]
Purvis, G.D.; Bartlett, R.J. A full coupled-cluster singles and doubles model: The inclusion of disconnected triples. J. Chem. Phys. 1982, 76, 1910–1918. [Google Scholar] [CrossRef]
Shavitt, I. The history and evolution of configuration interaction. Mol. Phys. 1998, 94, 3–17. [Google Scholar] [CrossRef]
Shavitt, I.; Bartlett, R.J. Many-Body Methods in Chemistry and Physics: Theory and Applications; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar] [CrossRef]
Cooper, N.R.; Leese, M.R. Configuration interaction methods in molecular quantum chemistry. J. Mol. Struct.-Theochem. 2000, 94, 71–78. [Google Scholar]
Coester, F.; Kümmel, H. Short-range correlations in nuclear wave functions. Nucl. Phys. 1960, 17, 477–485. [Google Scholar] [CrossRef]
Ahlrichs, R. Many body perturbation calculations and coupled electron pair models. Comput. Phys. Commun. 1979, 17, 31–45. [Google Scholar] [CrossRef]
Bartlett, R.J. Many-body perturbation-theory and coupled cluster theory for electron correlation in molecules. Annu. Rev. Phys. Chem. 1981, 32, 359–401. [Google Scholar] [CrossRef]
Bartlett, R.J.; Musiał, M. Coupled-cluster theory in quantum chemistry. Rev. Mod. Phys. 2007, 79, 291–352. [Google Scholar] [CrossRef]
Asadchev, A.; Gordon, M.S. Fast and Flexible Coupled Cluster Implementation. J. Chem. Theory Comput. 2013, 9, 3385–3392. [Google Scholar] [CrossRef]
Møller, C.; Plesset, M.S. Note on an Approximation Treatment for Many-Electron Systems. Phys. Rev. 1934, 46, 618–622. [Google Scholar] [CrossRef]
Cremer, D. Møller–Plesset perturbation theory: From small molecule methods to methods for thousands of atoms. WIREs Comput. Mol. Sci. 2011, 1, 509–530. [Google Scholar] [CrossRef]
Hohenberg, P.; Kohn, W. Inhomogeneous Electron Gas. Phys. Rev. 1964, 136, B864–B871. [Google Scholar] [CrossRef]
Kohn, W.; Sham, L.J. Self-Consistent Equations Including Exchange and Correlation Effects. Phys. Rev. 1965, 140, A1133–A1138. [Google Scholar] [CrossRef]
Perdew, J.P.; Schmidt, K. Jacob’s ladder of density functional approximations for the exchange-correlation energy. AIP Conf. Proc. 2001, 577, 1–20. [Google Scholar] [CrossRef]
Engel, E.; Dreizler, R.M. Density Functional Theory: An Advanced Course; Theoretical and Mathematical Physics; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar] [CrossRef]
Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
Onicescu, O. Theorie de l’information energie informationelle. Comptes Rendus l’Acad. Sci. Ser. AB 1966, 263, 841–842. [Google Scholar]
Rényi, A. Probability Theory; North-Holland: Amsterdam, The Netherlands, 1970. [Google Scholar]
Fisher, R.A. Theory of Statistical Estimation. Math. Proc. Camb. Philos. Soc. 1925, 22, 700–725. [Google Scholar] [CrossRef]
Clausius, R. The Mechanical Theory of Heat—Scholar’s Choice Edition; Creative Media Partners, LLC: Burbank, CA, USA, 2015. [Google Scholar]
Accardi, L. Topics in quantum probability. Phys. Rep. 1981, 77, 169–192. [Google Scholar] [CrossRef]
Bengtsson, I.; Zyczkowski, K. Geometry of Quantum States: An Introduction to Quantum Entanglement; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar]
Bregman, L.M. The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Comput. Math. Math. Phys. 1967, 7, 200–217. [Google Scholar] [CrossRef]
Banerjee, A.; Merugu, S.; Dhillon, I.S.; Ghosh, J. Clustering with Bregman divergences. J. Mach. Learn. Res. 2005, 6, 1705–1749. [Google Scholar]
Ali, S.M.; Silvey, S.D. A general class of coefficients of divergence of one distribution from another. J. R. Stat. Soc. Ser. B Methodol. 1966, 28, 131–142. [Google Scholar] [CrossRef]
Csiszár, I. Information-type measures of difference of probability distributions and indirect observations. Stud. Sci. Math. Hung. 1967, 2, 299–318. [Google Scholar]
Liese, F.; Vajda, I. On divergences and informations in statistics and information theory. IEEE Trans. Inform. Theory 2006, 52, 4394–4412. [Google Scholar] [CrossRef]
Kullback, S.; Leibler, R.A. On Information and Sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
Burago, D.; Burago, J.D.; Ivanov, S. A Course in Metric Geometry; American Mathematical Society: Providence, RI, USA, 2001. [Google Scholar]
Dirac, P.A.M. Note on Exchange Phenomena in the Thomas Atom. Math. Proc. Camb. Philos. Soc. 1930, 26, 376–385. [Google Scholar] [CrossRef]
Mazziotti, D.A. (Ed.) Reduced-Density-Matrix Mechanics: With Application to Many-Electron Atoms and Molecules, 1st ed.; Advances in Chemical Physics; Wiley: Hoboken, NJ, USA, 2007; Volume 134. [Google Scholar] [CrossRef]
Gidopoulos, N.I.; Wilson, S.; Lipscomb, W.N.; Maruani, J.; Wilson, S. (Eds.) The Fundamentals of Electron Density, Density Matrix and Density Functional Theory in Atoms, Molecules and the Solid State; Progress in Theoretical Chemistry and Physics; Springer: Dordrecht, The Netherlands, 2003; Volume 14. [Google Scholar] [CrossRef]
Absar, I. Reduced hamiltonian orbitals. II. Optimal orbital basis sets for the many-electron problem. Int. J. Quantum Chem. 1978, 13, 777–790. [Google Scholar] [CrossRef]
Absar, I.; Coleman, A.J. Reduced hamiltonian orbitals. I. a new approach to the many-electron problem. Int. J. Quantum Chem. 2009, 10, 319–330. [Google Scholar] [CrossRef]
Coleman, A.J.; Absar, I. Reduced hamiltonian orbitals. III. Unitarily invariant decomposition of hermitian operators. Int. J. Quantum Chem. 1980, 18, 1279–1307. [Google Scholar] [CrossRef]
Mazziotti, D.A. Two-Electron Reduced Density Matrix as the Basic Variable in Many-Electron Quantum Chemistry and Physics. Chem. Rev. 2012, 112, 244–262. [Google Scholar] [CrossRef]
Mazziotti, D.A. Parametrization of the two-electron reduced density matrix for its direct calculation without the many-electron wave function: Generalizations and applications. Phys. Rev. A 2010, 81, 062515. [Google Scholar] [CrossRef]
Eugene DePrince III, A. Variational determination of the two-electron reduced density matrix: A tutorial review. WIREs Comput. Mol. Sci. 2024, 14, e1702. [Google Scholar] [CrossRef]
Barcza, G.; Legeza, O.; Marti, K.H.; Reiher, M. Quantum-Information Analysis of Electronic States of Different Molecular Structures. Phys. Rev. A 2011, 83, 012508. [Google Scholar] [CrossRef]
Szalay, S.; Barcza, G.; Szilvasi, T.; Veis, L.; Legeza, O. The Correlation Theory of the Chemical Bond. Sci. Rep. 2017, 7, 2237. [Google Scholar] [CrossRef]
Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information: 10th Anniversary Edition; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar] [CrossRef]
Kubo, R. Generalized Cumulant Expansion Method. J. Phys. Soc. Jpn. 1962, 17, 1100–1120. [Google Scholar] [CrossRef]
Ziesche, P. Cumulant Expansions of Reduced Densities, Reduced Density Matrices, and Green’s Functions. In Many-Electron Densities and Reduced Density Matrices; Cioslowski, J., Ed.; Kluwer: New York, NY, USA, 2000; pp. 33–56. [Google Scholar]
Alcoba, D.R.; Valdemoro, C. Family of modified-contracted Schrödinger equations. Phys. Rev. A 2001, 64, 062105. [Google Scholar] [CrossRef]
Valdemoro, C.; Alcoba, D.R.; Tel, L.M.; Perez-Romero, E. Imposing Bounds on the High-Order Reduced Density Matrices Elements. Int. J. Quantum Chem. 2001, 85, 214–224. [Google Scholar] [CrossRef]
Valdemoro, C. Spin-Adapted Reduced Hamiltonians 2. Total Energy and Reduced Density Matrices. Phys. Rev. A 1985, 31, 2123–2128. [Google Scholar] [CrossRef] [PubMed]
Valdemoro, C. Spin-Adapted Reduced Hamiltonians. 1. Elementary Excitations. Phys. Rev. A 1985, 31, 2114–2122. [Google Scholar] [CrossRef] [PubMed]
Mazziotti, D.A. Contracted Schrödinger equation: Determining quantum energies and two-particle density matrices without wave functions. Phys. Rev. A 1998, 57, 4219–4234. [Google Scholar] [CrossRef]
Kutzelnigg, W.; Mukherjee, D. Cumulant expansion of the reduced density matrices. J. Chem. Phys. 1999, 110, 2800–2809. [Google Scholar] [CrossRef]
Kutzelnigg, W.; Mukherjee, D. Irreducible Brillouin conditions and contracted Schrödinger equations for n-electron systems. II. Spin-free formulation. J. Chem. Phys. 2002, 116, 4787–4801. [Google Scholar] [CrossRef]
Kutzelnigg, W.; Mukherjee, D. Irreducible Brillouin conditions and contracted Schrödinger equations for n-electron systems. III. Systems of noninteracting electrons. J. Chem. Phys. 2004, 120, 7340–7349. [Google Scholar] [CrossRef]
Mukherjee, D.; Kutzelnigg, W. Irreducible Brillouin conditions and contracted Schrödinger equations for n-electron systems. I. The equations satisfied by the density cumulants. J. Chem. Phys. 2001, 114, 2047–2061. [Google Scholar] [CrossRef]
Nooijen, M.; Wladyslawski, M.; Hazra, A. Cumulant approach to the direct calculation of reduced density matrices: A critical analysis. J. Chem. Phys. 2003, 118, 4832–4848. [Google Scholar] [CrossRef]
Levy, M. Universal Variational Functionals of Electron-Densities, 1st- Order Density-Matrices, and Natural Spin-Orbitals and Solution of the V-Representability Problem. Proc. Natl. Acad. Sci. USA 1979, 76, 6062–6065. [Google Scholar] [CrossRef]
Kohn, W.; Becke, A.D.; Parr, R.G. Density Functional Theory of Electronic Structure. J. Phys. Chem. 1996, 100, 12974–12980. [Google Scholar] [CrossRef]
Ayers, P.W. Axiomatic Formulations of the Hohenberg-Kohn Functional. Phys. Rev. A 2006, 73. [Google Scholar] [CrossRef]
Ayers, P.W. Using classical many-body structure to determine electronic structure: An approach using k-electron distribution functions. Phys. Rev. A 2006, 74, 042502. [Google Scholar] [CrossRef]
Ziesche, P. Attempts toward a pair density functional theory. Int. J. Quantum Chem. 1996, 60, 1361–1374. [Google Scholar] [CrossRef]
Ziesche, P. Pair density functional theory—A generalized density functional theory. Phys. Lett. A 1994, 195, 213–220. [Google Scholar] [CrossRef]
Nagy, Á. Spherically and system-averaged pair density functional theory. J. Phys. Chem. 2006, 125, 184104. [Google Scholar] [CrossRef]
Nagy, Á. Time-Dependent Pair Density from the Principle of Minimum Fisher Information. J. Mol. Model. 2018, 24, 234. [Google Scholar] [CrossRef]
Levy, M.; Ziesche, P. The pair density functional of the kinetic energy and its simple scaling property. J. Chem. Phys. 2001, 115, 9110–9112. [Google Scholar] [CrossRef]
Ayers, P.W. Generalized Density Functional Theories Using the K-Electron Densities: Development of Kinetic Energy Functionals. J. Math. Phys. 2005, 46, 062107. [Google Scholar] [CrossRef]
Cuevas-Saavedra, R.; Ayers, P.W. Coordinate scaling of the kinetic energy in pair density functional theory: A Legendre transform approach. Int. J. Quantum Chem. 2009, 109, 1699–1705. [Google Scholar] [CrossRef]
Ayers, P.W.; Golden, S.; Levy, M. Generalizations of the Hohenberg-Kohn Theorem: I. Legendre Transform Constructions of Variational Principles for Density Matrices and Electron Distribution Functions. J. Chem. Phys. 2006, 124, 054101. [Google Scholar] [CrossRef]
Ayers, P.W.; Davidson, E.R. Linear Inequalities for Diagonal Elements of Density Matrices. In Reduced-Density-Matrix Mechanics: With Application to Many-Electron Atoms and Molecules; John Wiley & Sons, Ltd: Hoboken, NJ, USA, 2007; Chapter 16; pp. 443–483. [Google Scholar] [CrossRef]
Keyvani, Z.A.; Shahbazian, S.; Zahedi, M. To What Extent are “Atoms in Molecules” Structures of Hydrocarbons Reproducible from the Promolecule Electron Densities? Chem. Eur. J. 2016, 22, 5003–5009. [Google Scholar] [CrossRef]
Spackman, M.A.; Maslen, E.N. Chemical properties from the promolecule. J. Phys. Chem. 1986, 90, 2020–2027. [Google Scholar] [CrossRef]
Hirshfeld, F.L. Bonded-Atom Fragments for Describing Molecular Charge Densities. Theor. Chim. Act. 1977, 44, 129–138. [Google Scholar] [CrossRef]
Hirshfeld, F.L. XVII. Spatial Partitioning of Charge Density. Isr. J. Chem. 1977, 16, 198–201. [Google Scholar] [CrossRef]
Nalewajski, R.F.; Switka, E. Information Theoretic Approach to Molecular and Reactive Systems. Phys. Chem. Chem. Phys. 2002, 4, 4952–4958. [Google Scholar] [CrossRef]
Nalewajski, R.F. Information Principles in the Theory of Electronic Structure. Chem. Phys. Lett. 2003, 372, 28–34. [Google Scholar] [CrossRef]
Nagy, Á.; Romera, E. Relative Rényi entropy and fidelity susceptibility. Europhys. Lett. 2015, 109, 60002. [Google Scholar] [CrossRef]
Nagy, Á. Relative information in excited-state orbital-free density functional theory. Int. J. Quantum Chem. 2020, 120, e26405. [Google Scholar] [CrossRef]
Laguna, H.G.; Salazar, S.J.C.; Sagar, R.P. Entropic Kullback-Leibler Type Distance Measures for Quantum Distributions. Int. J. Quantum Chem. 2019, 119, e25984. [Google Scholar] [CrossRef]
Borgoo, A.; Jaque, P.; Toro-Labbe, A.; Van Alsenoy, C.; Geerlings, P. Analyzing Kullback-Leibler Information Profiles: An Indication of Their Chemical Relevance. Phys. Chem. Chem. Phys. 2009, 11, 476–482. [Google Scholar] [CrossRef] [PubMed]
Parr, R.G.; Bartolotti, L.J. Some remarks on the density functional theory of few-electron systems. J. Phys. Chem. 1983, 87, 2810–2815. [Google Scholar] [CrossRef]
Ayers, P.W. Density per particle as a descriptor of Coulombic systems. Proc. Natl. Acad. Sci. USA 2000, 97, 1959–1964. [Google Scholar] [CrossRef] [PubMed]
Ayers, P.W. Information Theory, the Shape Function, and the Hirshfeld Atom. Theor. Chem. Acc. 2006, 115, 370–378. [Google Scholar] [CrossRef]
Ayers, P.W.; Cedillo, A. The Shape Function. In Chemical Reactivity Theory: A Density Functional View; Chattaraj, P.K., Ed.; Taylor and Francis: Boca Raton, FL, USA, 2009; Chapter 19; p. 269. [Google Scholar] [CrossRef]
Noorizadeh, S.; Shakerzadeh, E. Shannon entropy as a new measure of aromaticity, Shannon aromaticity. Phys. Chem. Chem. Phys. 2010, 12, 4742–4749. [Google Scholar] [CrossRef]
Yu, D. Studying on Aromaticity Using Information-Theoretic Approach in Density Functional Reactivity Theory. Ph.D. Thesis, Hunan Normal University, Changsha, China, 2019. [Google Scholar]
Yu, D.; Stuyver, T.; Rong, C.; Alonso, M.; Lu, T.; De Proft, F.; Geerlings, P.; Liu, S. Global and local aromaticity of acenes from the information-theoretic approach in density functional reactivity theory. Phys. Chem. Chem. Phys. 2019, 21, 18195–18210. [Google Scholar] [CrossRef]
Li, M.; Wan, X.; Rong, C.; Zhao, D.; Liu, S. Directionality and additivity effects of molecular acidity and aromaticity for substituted benzoic acids under external electric fields. Phys. Chem. Chem. Phys. 2023, 25, 27805–27816. [Google Scholar] [CrossRef]
Rong, C.; Wang, B.; Zhao, D.; Liu, S. Information-theoretic approach in density functional theory and its recent applications to chemical problems. WIREs Comput. Mol. Sci. 2020, 10, e1461. [Google Scholar] [CrossRef]
Liu, S. On the relationship between densities of Shannon entropy and Fisher information for atoms and molecules. J. Chem. Phys. 2007, 126, 191107. [Google Scholar] [CrossRef]
Nalevajski, R.F. On phase/current components of entropy/information descriptors of molecular states. Mol. Phys. 2014, 112, 2587–2601. [Google Scholar] [CrossRef]
Nalewajski, R. Phase/current information descriptors and equilibrium states in molecules. Int. J. Quantum Chem. 2014, 115, 1274–1288. [Google Scholar] [CrossRef]
Nalewajski, R.F. Resultant Information Description of Electronic States and Chemical Processes. J. Phys. Chem. A 2019, 123, 9737–9752. [Google Scholar] [CrossRef]
Nalewajski, R.F. Information-Theoretic Descriptors of Molecular States and Electronic Communications between Reactants. Entropy 2020, 22, 749. [Google Scholar] [CrossRef] [PubMed]
Nalewajski, R.F. Information origins of the chemical bond: Bond descriptors from molecular communication channels in orbital resolution. Int. J. Quantum Chem. 2009, 109, 2495–2506. [Google Scholar] [CrossRef]
Liu, S.; Rong, C.; Lu, T. Information Conservation Principle Determines Electrophilicity, Nucleophilicity, and Regioselectivity. J. Phys. Chem. A 2014, 118, 3698–3704. [Google Scholar] [CrossRef]
Liu, S. Quantifying Reactivity for Electrophilic Aromatic Substitution Reactions with Hirshfeld Charge. J. Phys. Chem. A 2015, 119, 3107–3111. [Google Scholar] [CrossRef]
Kruszewski, J.; Krygowski, T. Definition of aromaticity basing on the harmonic oscillator model. Tetrahedron Lett. 1972, 13, 3839–3842. [Google Scholar] [CrossRef]
Krygowski, T.M. Crystallographic studies of inter- and intramolecular interactions reflected in aromatic character of .pi.-electron systems. J. Chem. Inf. Comput. Sci. 1993, 33, 70–78. [Google Scholar] [CrossRef]
Matito, E.; Duran, M.; Solà, M. The aromatic fluctuation index (FLU): A new aromaticity index based on electron delocalization. J. Chem. Phys. 2004, 122, 014109. [Google Scholar] [CrossRef]
Schleyer, P.v.R.; Maerker, C.; Dransfeld, A.; Jiao, H.; van Eikema Hommes, N.J.R. Nucleus-Independent Chemical Shifts: A Simple and Efficient Aromaticity Probe. J. Am. Chem. Soc. 1996, 118, 6317–6318. [Google Scholar] [CrossRef]
Chen, Z.; Wannere, C.S.; Corminboeuf, C.; Puchta, R.; Schleyer, P.v.R. Nucleus-Independent Chemical Shifts (NICS) as an Aromaticity Criterion. Chem. Rev. 2005, 105, 3842–3888. [Google Scholar] [CrossRef] [PubMed]
Zhao, Y.; Zhao, D.; Liu, S.; Rong, C.; Ayers, P.W. Extending the information-theoretic approach from the (one) electron density to the pair density. J. Chem. Phys. 2025. accepted for publication. [Google Scholar]
Sagar, R.P.; Guevara, N.L. Mutual information and correlation measures in atomic systems. J. Chem. Phys. 2005, 123, 044108. [Google Scholar] [CrossRef] [PubMed]
Heinosaari, T.; Ziman, M. The Mathematical Language of Quantum Theory: From Uncertainty to Entanglement, 1st ed.; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar] [CrossRef]
Bengtsson, I.; Życzkowski, K. Geometry of Quantum States: An Introduction to Quantum Entanglement, 2nd ed.; Cambridge University Press: Cambridge, UK, 2017. [Google Scholar] [CrossRef]
Ciarlet, P.G.; Lions, J.L. In Computational Chemistry: Reviews of Current Trends. In Computational Chemistry: Reviews of Current Trends; North-Holland: Amsterdam, The Netherlands, 2003. [Google Scholar]
Reed, M.; Simon, B. Methods of Modern Mathematical Physics. IV, Analysis of Operators; Academic Press: London, UK, 1978. [Google Scholar]
Yserentant, H. On the regularity of the electronic Schrödinger equation in Hilbert spaces of mixed derivatives. Numer. Math. 2004, 98, 731–759. [Google Scholar] [CrossRef]
Islam, R.; Ma, R.; Preiss, P.M.; Eric Tai, M.; Lukin, A.; Rispoli, M.; Greiner, M. Measuring entanglement entropy in a quantum many-body system. Nature 2015, 528, 77–83. [Google Scholar] [CrossRef]
Rissler, J.; Noack, R.M.; White, S.R. Measuring orbital interaction using quantum information theory. Chem. Phys. 2006, 323, 519–531. [Google Scholar] [CrossRef]
Mazziotti, D.A. Entanglement, Electron Correlation, and Density Matrices, 1st ed.; Wiley: Hoboken, NJ, USA, 2007; Volume 134, pp. 493–535. [Google Scholar] [CrossRef]
Boguslawski, K.; Tecmer, P.; Legeza, O.; Reiher, M. Entanglement Measures for Single- and Multireference Correlation Effects. J. Phys. Chem. Lett. 2012, 3, 3129–3135. [Google Scholar] [CrossRef]
Boguslawski, K.; Tecmer, P. Orbital entanglement in quantum chemistry. Int. J. Quantum Chem. 2015, 115, 1289–1295. [Google Scholar] [CrossRef]
Zhao, Y.; Boguslawski, K.; Tecmer, P.; Duperrouzel, C.; Barcza, G.; Legeza, O.; Ayers, P.W. Dissecting the bond-formation process of d 10-metal–ethene complexes with multireference approaches. Theor. Chem. Acc. 2015, 134, 120. [Google Scholar] [CrossRef]
Duperrouzel, C.; Tecmer, P.; Boguslawski, K.; Barcza, G.; Örs, L.; Ayers, P.W. A quantum informational approach for dissecting chemical reactions. Chem. Phys. Lett. 2015, 621, 160–164. [Google Scholar] [CrossRef]
Boguslawski, K.; Tecmer, P.; Legeza, O. Analysis of two-orbital correlations in wave functions restricted to electron-pair states. Phys. Rev. B 2016, 94, 155126. [Google Scholar] [CrossRef]
Boguslawski, K.; Réal, F.; Tecmer, P.; Duperrouzel, C.; Pereira Gomes, A.S.; Legeza, Ö.; Ayers, P.W.; Vallet, V. On the multi-reference nature of plutonium oxides: PuO22+, PuO₂, PuO₃ and PuO₂(OH)₂. Phys. Chem. Chem. Phys. 2017, 19, 4317–4329. [Google Scholar] [CrossRef] [PubMed]
Brandejs, J.; Veis, L.; Szalay, S.; Barcza, G.; Pittner, J.; Legeza, Ö. Quantum information-based analysis of electron-deficient bonds. J. Chem. Phys. 2019, 150, 204117. [Google Scholar] [CrossRef] [PubMed]
Wu, L.A.; Sarandy, M.S.; Lidar, D.A.; Sham, L.J. Linking entanglement and quantum phase transitions via density-functional theory. Phys. Rev. A 2006, 74, 052335. [Google Scholar] [CrossRef]
Raghavachari, K.; Anderson, J.B. Electron Correlation Effects in Molecules. J. Phys. Chem. 1996, 100, 12960–12973. [Google Scholar] [CrossRef]
White, S.R. Density matrix formulation for quantum renormalization groups. Phys. Rev. Lett. 1992, 69, 2863–2866. [Google Scholar] [CrossRef]
White, S.R. Density-matrix algorithms for quantum renormalization groups. Phys. Rev. B 1993, 48, 10345–10356. [Google Scholar] [CrossRef]
White, S.R.; Martin, R.L. Ab initio quantum chemistry using the density matrix renormalization group. J. Chem. Phys. 1999, 110, 4127–4130. [Google Scholar] [CrossRef]
Fiedler, M. Algebraic connectivity of graphs. Czech. Math. J. 1973, 23, 298–305. [Google Scholar] [CrossRef]
Fiedler, M. A property of eigenvectors of nonnegative symmetric matrices and its application to graph theory. Czech. Math. J. 1975, 25, 619–633. [Google Scholar] [CrossRef]
Levine, B.G.; Durden, A.S.; Esch, M.P.; Liang, F.; Shu, Y. CAS without SCF—Why to use CASCI and where to get the orbitals. J. Chem. Phys. 2021, 154, 090902. [Google Scholar] [CrossRef] [PubMed]
Stein, C.J.; Reiher, M. Automated Selection of Active Orbital Spaces. J. Chem. Theory Comput. 2016, 12, 1760–1771. [Google Scholar] [CrossRef] [PubMed]
Ding, L.; Knecht, S.; Schilling, C. Quantum Information-Assisted Complete Active Space Optimization (QICAS). J. Phys. Chem. Lett. 2023, 14, 11022–11029. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Illustration of some representative concepts of classical information theory (CIT) and quantum information theory (QIT) in quantum chemistry confined to Shannon’s framework.

Figure 2. Different measures of distinguishability between two probability distributions

P = (p, 1 - p)

and

Q = (q, 1 - q)

.

Figure 2. Different measures of distinguishability between two probability distributions

P = (p, 1 - p)

and

Q = (q, 1 - q)

.

Figure 3. A pseudo Venn diagram illustrating the relationships between Shannon entropy

H_{(X)}

or

H_{(Y)}

along with consequent concepts: joint entropy

H_{(X, Y)}

, conditional entropy

H_{(X | Y)}

or

H_{(Y | X)}

, and mutual information

I_{(X; Y)}

.

Figure 3. A pseudo Venn diagram illustrating the relationships between Shannon entropy

H_{(X)}

or

H_{(Y)}

along with consequent concepts: joint entropy

H_{(X, Y)}

, conditional entropy

H_{(X | Y)}

or

H_{(Y | X)}

, and mutual information

I_{(X; Y)}

.

Figure 4. For the Bloch sphere, every point on the surface corresponds to a pure state, every point inside corresponds to a mixed state, and the point located exact center of the Bloch sphere corresponds to a maximally mixed state.

Figure 6. Radial distribution functions of the Shannon entropy densities

4 π r^{2} s_{s (X)} (r)

Figure 6. Radial distribution functions of the Shannon entropy densities

4 π r^{2} s_{s (X)} (r)

Figure 7. Contour plots of radial distribution function for joint entropy kernel

r_{1}^{2} s_{s (X, Y)} (r_{1}, r_{2}) r_{2}^{2}

, conditional entropy kernel

r_{1}^{2} s_{s (X | Y)} (r_{1}, r_{2}) r_{2}^{2}

and mutual information kernel

r_{1}^{2} s_{s (X; Y)} (r_{1}, r_{2}) r_{2}^{2}

Figure 7. Contour plots of radial distribution function for joint entropy kernel

r_{1}^{2} s_{s (X, Y)} (r_{1}, r_{2}) r_{2}^{2}

, conditional entropy kernel

r_{1}^{2} s_{s (X | Y)} (r_{1}, r_{2}) r_{2}^{2}

and mutual information kernel

r_{1}^{2} s_{s (X; Y)} (r_{1}, r_{2}) r_{2}^{2}

Figure 8. Bipartite of quantum many-body system for the separable state (product state) and entangled state.

Figure 9. Graph illustration one- and two- orbital bipartite of the system and the corresponding vector state of the composite system

A B

is displayed.

Figure 9. Graph illustration one- and two- orbital bipartite of the system and the corresponding vector state of the composite system

A B

is displayed.

Figure 10. Single-orbital entropy and an alternative definition of orbital mutual information, as proposed in Ref. [148], is given by

I_{(p; q)} = \frac{1}{2} [s {(2)}_{p q} - s {(1)}_{p} - s {(1)}_{q}] (1 - δ_{p q})

, specifically applied to Ni(C2H4) through DMRG(36,33) calculations. The classified strengths of mutual information in Table 3 are represented using a color code, with the dynamic entanglement effects being disregarded. Reproduced with permission from Ref. [150] Copyright 2015, The Author.

Figure 10. Single-orbital entropy and an alternative definition of orbital mutual information, as proposed in Ref. [148], is given by

I_{(p; q)} = \frac{1}{2} [s {(2)}_{p q} - s {(1)}_{p} - s {(1)}_{q}] (1 - δ_{p q})

, specifically applied to Ni(C2H4) through DMRG(36,33) calculations. The classified strengths of mutual information in Table 3 are represented using a color code, with the dynamic entanglement effects being disregarded. Reproduced with permission from Ref. [150] Copyright 2015, The Author.

Figure 11. Orbital classification in CAS Methodology: Inactive space remain doubly occupied, active space allow all possible configurations, and virtual space are unoccupied.

Table 1. The matrix elements of 1-orbital RDM

{}_{o}^{1}D_{p}

expressed in terms of 1- and 2-electron RDM.

Table 1. The matrix elements of 1-orbital RDM

{}_{o}^{1}D_{p}

expressed in terms of 1- and 2-electron RDM.

	-	↑	↓	↑↓
-	$1 - {}^{1}{\bar{D}}_{p}^{p} - {}^{1}{\bar{D}}_{\bar{p}}^{\bar{p}} + {}^{2}{\bar{D}}_{p \bar{p}}^{p \bar{p}}$	0	0	0
↑	0	${}^{1}{\bar{D}}_{p}^{p} - {}^{2}{\bar{D}}_{p \bar{p}}^{p \bar{p}}$	0	0
↓	0	0	${}^{1}{\bar{D}}_{\bar{p}}^{\bar{p}} - {}^{2}{\bar{D}}_{p \bar{p}}^{p \bar{p}}$	0
↑↓	0	0	0	${}^{2}{\bar{D}}_{p \bar{p}}^{p \bar{p}}$

Table 2. The matrix elements of 2-orbital RDM

{}_{o}^{2}D_{p q}

expressed in terms of 1-, 2-, 3-, and 4-electron RDM.

Table 2. The matrix elements of 2-orbital RDM

{}_{o}^{2}D_{p q}

expressed in terms of 1-, 2-, 3-, and 4-electron RDM.

	-	↑	↑	↓	↓	↑↑	↓↓	↑↓	↑↓	↓↑	↑↓	~~↑↑↓~~	~~↑↓↑~~	~~↓↑↓~~	~~↑↓↓~~	~~↑↓↑↓~~
-	1,1	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0
↑	0	2,2	2,3	0	0	0	0	0	0	0	0	0	0	0	0	0
↑	0	3,2	3,3	0	0	0	0	0	0	0	0	0	0	0	0	0
↓	0	0	0	4,4	4,5	0	0	0	0	0	0	0	0	0	0	0
↓	0	0	0	5,4	5,5	0	0	0	0	0	0	0	0	0	0	0
↑↑	0	0	0	0	0	6,6	0	0	0	0	0	0	0	0	0	0
↓↓	0	0	0	0	0	0	7,7	0	0	0	0	0	0	0	0	0
↑↓	0	0	0	0	0	0	0	8,8	8,9	8,10	8,11	0	0	0	0	0
↑↓	0	0	0	0	0	0	0	9,8	9,9	9,10	9,11	0	0	0	0	0
↓↑	0	0	0	0	0	0	0	10,8	10,9	10,10	10,11	0	0	0	0	0
↑↓	0	0	0	0	0	0	0	11,8	11,9	11,10	11,11	0	0	0	0	0
↑↑↓	0	0	0	0	0	0	0	0	0	0	0	12,12	12,13	0	0	0
↑↓↑	0	0	0	0	0	0	0	0	0	0	0	13,12	13,13	0	0	0
↓↑↓	0	0	0	0	0	0	0	0	0	0	0	0	0	14,14	14,15	0
↑↓↓	0	0	0	0	0	0	0	0	0	0	0	0	0	15,14	15,15	0
↑↓↑↓	0	0	0	0	0	0	0	0	0	0	0	0	0	0	0	16,16

(1, 1) = 1 - {}^{1}D_{p}^{p} - {}^{1}D_{\bar{p}}^{\bar{p}} - {}^{1}D_{q}^{q} - {}^{1}D_{\bar{q}}^{\bar{q}} + {}^{2}D_{p \bar{p}}^{p \bar{p}} + {}^{2}D_{q \bar{q}}^{q \bar{q}} + {}^{2}D_{p q}^{p q} + {}^{2}D_{p \bar{q}}^{p \bar{q}} + {}^{2}D_{\bar{p} q}^{\bar{p} q} + {}^{2}D_{\bar{p} \bar{q}}^{\bar{p} \bar{q}} - {}^{3}D_{p q \bar{q}}^{p q \bar{q}} - {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} - {}^{3}D_{p \bar{p} q}^{p \bar{p} q} - {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}}

+ {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (2, 2) = {}^{1}D_{q}^{q} - {}^{2}D_{p q}^{p q} - {}^{2}D_{\bar{p} q}^{\bar{p} q} - {}^{2}D_{q \bar{q}}^{q \bar{q}} + {}^{3}D_{p \bar{q} q}^{p \bar{q} q} + {}^{3}D_{p \bar{p} q}^{p \bar{p} q} + {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (2, 3) = {(3, 2)}^{†} = {}^{1}D_{p}^{q} - {}^{2}D_{p \bar{p}}^{q \bar{p}} - {}^{2}D_{p \bar{q}}^{q \bar{q}}

+ {}^{3}D_{p \bar{p} \bar{q}}^{q \bar{p} \bar{q}} (3, 3) = {}^{1}D_{p}^{p} - {}^{2}D_{p \bar{p}}^{p \bar{p}} - {}^{2}D_{p q}^{p q} - {}^{2}D_{p \bar{q}}^{p \bar{q}} + {}^{3}D_{p q \bar{q}}^{p q \bar{q}} + {}^{3}D_{p \bar{p} q}^{p \bar{p} q} + {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (4, 4) = {}^{1}D_{\bar{q}}^{\bar{q}} - {}^{2}D_{p \bar{q}}^{p \bar{q}} - {}^{2}D_{\bar{p} \bar{q}}^{\bar{p} \bar{q}} - {}^{2}D_{q \bar{q}}^{q \bar{q}} +

{}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} + {}^{3}D_{p q \bar{q}}^{p q \bar{q}} + {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (4, 5) = {(5, 4)}^{†} = {}^{1}D_{\bar{p}}^{\bar{q}} - {}^{2}D_{p \bar{p}}^{p \bar{q}} - {}^{2}D_{q \bar{p}}^{q \bar{q}} + {}^{3}D_{p q \bar{p}}^{p q \bar{q}} (5, 5) = {}^{1}D_{\bar{p}}^{\bar{p}} - {}^{2}D_{\bar{p} q}^{\bar{p} q} - {}^{2}D_{\bar{p} \bar{q}}^{\bar{p} \bar{q}} - {}^{2}D_{p \bar{p}}^{p \bar{p}} +

{}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} + {}^{3}D_{p \bar{p} q}^{p \bar{p} q} + {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (6, 6) = {}^{2}D_{p q}^{p q} - {}^{3}D_{p \bar{p} q}^{p \bar{p} q} - {}^{3}D_{p q \bar{q}}^{p q \bar{q}} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (7, 7) = {}^{2}D_{\bar{p} \bar{q}}^{\bar{p} \bar{q}} - {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} - {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (8, 8)

= {}^{2}D_{q \bar{q}}^{q \bar{q}} - {}^{3}D_{p q \bar{q}}^{p q \bar{q}} - {}^{3}D_{p \bar{q} q}^{p \bar{q} q} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (8, 9) = {(9, 8)}^{†} = {}^{2}D_{p \bar{q}}^{q \bar{q}} - {}^{3}D_{p \bar{q} \bar{p}}^{q \bar{q} \bar{p}} (8, 10) = {(10, 8)}^{†} = - {}^{2}D_{q \bar{p}}^{q \bar{q}} + {}^{3}D_{p q \bar{p}}^{p q \bar{q}} (8, 11) = {(11, 8)}^{†}

= {}^{2}D_{p \bar{p}}^{q \bar{q}} (9, 9) = {}^{2}D_{p \bar{q}}^{p \bar{q}} - {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} - {}^{3}D_{p q \bar{q}}^{p q \bar{q}} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (9, 10) = {(10, 9)}^{†} = - {}^{2}D_{q \bar{p}}^{p \bar{q}} (9, 11) = {(11, 9)}^{†} = {}^{2}D_{p \bar{p}}^{p \bar{q}} - {}^{3}D_{p q \bar{p}}^{p q \bar{q}} (10, 10)

= {}^{2}D_{\bar{p} q}^{\bar{p} q} - {}^{3}D_{p \bar{p} q}^{p \bar{p} q} - {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (10, 11) = {(11, 10)}^{†} = - {}^{2}D_{p \bar{p}}^{q \bar{p}} + {}^{3}D_{p \bar{p} \bar{p}}^{q \bar{p} \bar{q}} (11, 11) = {}^{2}D_{p \bar{p}}^{p \bar{p}} - {}^{3}D_{p \bar{p} q}^{p \bar{p} q} - {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} + {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (12, 12)

= {}^{3}D_{p q \bar{q}}^{p q \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (12, 13) = {(13, 12)}^{†} = - {}^{3}D_{p q \bar{q}}^{p q \bar{q}} (13, 13) = {}^{3}D_{p \bar{p} q}^{p \bar{p} q} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (14, 14) = {}^{3}D_{\bar{p} q \bar{q}}^{\bar{p} q \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (14, 15) = {(15, 14)}^{†}

= - {}^{3}D_{\bar{p} p \bar{q}}^{\bar{p} q \bar{q}} (15, 15) = {}^{3}D_{p \bar{p} \bar{q}}^{p \bar{p} \bar{q}} - {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}} (16, 16) = {}^{4}D_{p \bar{p} q \bar{q}}^{p \bar{p} q \bar{q}}

.

Table 3. Relation between the strength of orbital entanglement and electron correlation effects.

Correlation Effects	Intensity	$s {(1)}_{p}$	$I_{(p; q)}$ ^a
Nondynamic	Strong	>0.5	≈10⁻¹
Static	Medium	0.5–0.1	≈10⁻¹
Dynamic	Weak	<0.1	≈10⁻¹

^a An alternative definition of orbital mutual information from Ref. [148] is employed:

I_{(p; q)} = \frac{1}{2} [s {(2)}_{p q} - s {(1)}_{p} - s {(1)}_{q}] (1 - δ_{p q})

.

δ_{p q}

is the Kronecker delta to ensure

I_{(p; q)} = 0

when

p = q

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhao, Y.; Zhao, D.; Rong, C.; Liu, S.; Ayers, P.W. Information Theory Meets Quantum Chemistry: A Review and Perspective. Entropy 2025, 27, 644. https://doi.org/10.3390/e27060644

AMA Style

Zhao Y, Zhao D, Rong C, Liu S, Ayers PW. Information Theory Meets Quantum Chemistry: A Review and Perspective. Entropy. 2025; 27(6):644. https://doi.org/10.3390/e27060644

Chicago/Turabian Style

Zhao, Yilin, Dongbo Zhao, Chunying Rong, Shubin Liu, and Paul W. Ayers. 2025. "Information Theory Meets Quantum Chemistry: A Review and Perspective" Entropy 27, no. 6: 644. https://doi.org/10.3390/e27060644

APA Style

Zhao, Y., Zhao, D., Rong, C., Liu, S., & Ayers, P. W. (2025). Information Theory Meets Quantum Chemistry: A Review and Perspective. Entropy, 27(6), 644. https://doi.org/10.3390/e27060644

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Information Theory Meets Quantum Chemistry: A Review and Perspective

Abstract

1. Motivation

2. Brief Introduction to Information Theory

2.1. Shannon Entropy

2.2. Relative Entropy

2.3. Bivariate Entropy

3. Basic Ingredients of Information Theory in Quantum Chemistry: Reduced Density Matrix

3.1. Density Matrix and Reduced Density Matrix

3.2. 1-RDM and 2-RDM

3.3. 3-RDM and 4-RDM

4. Classical Information Theory in Quantum Chemistry

4.1. Electron Density in Position Space

4.2. Information-Theoretic Approach Chemical Descriptors

Examples and Illustrations

5. Quantum Information Theory in Quantum Chemistry

5.1. Bipartite Entanglement

5.2. Orbital Reduced Density Matrix

5.3. Orbital Entanglement

Examples and Illustrations

6. Summary and Outlook

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI