1. Introduction
In classical mechanics, the virial theorem (VT) is well-known [
1]. It relates the total kinetic energy of a stable system of discrete particles bound by potential forces with that of the total potential energy of the system. The validity of this theorem for atoms was proven by Finkelstein [
2] for the Schrödinger equation in 1928. Fock [
3,
4], who used the variational method to derive the Hartree–Fock method for atoms, was the first to derive the theorem for the Dirac equations using a stretching method also referred to as a “scaling” [
5] or “dilation” procedure [
6].
The nonrelativistic wave equation for an N-electron atomic system,
, is often expressed in Cartesian coordinates along with approximate solutions,
, obtained with the variational Hartree–Fock method [
7]. In this approximation, the multielectron wave function is expressed as a determinant of one-electron wave functions or more generally as a linear combination of antisymmetrized products of orbitals, one for each electron, of the same form as wave functions of the one-electron case. As illustrated below, replacing
with an expansion in terms of Legendre polynomials, namely,
where
and
, introduces Slater integrals into the energy expression [
8,
9].
The virial theorem is a special test of the wave function. When the spatial variables of a normalized wave function are inflated or scaled by the transformation
for all electrons, then for a stationary solution, the energy is still stationary, namely,
This stationary condition holds for the exact solution of the wave equations, but it also holds for a solution of the variational equation itself. This stationary condition that leads to the virial theorem (VT) ratio. Thus, it is a property of a stationary solution along with other theorems such as Brillouin’s [
10,
11].
In atomic calculations,
is expressed in spherical coordinates as a vector-coupled configuration state function (CSF) for groups of equivalent electrons with given total spin-angular quantum numbers and parity in nonrelativistic theory or total
J and parity in relativistic theory, where equivalent electrons have the same radial functions. In spherical coordinates, the wave function for a group of equivalent electrons is a product of radial and spin-angular factors. In nonrelativistic theory, the radial factor is simply
,
, where
a identifies the group of equivalent electrons through the
quantum numbers, and
is the occupation number of the subshell. In Dirac theory, the radial factor is a
matrix with diagonal elements that are similar products of large (
) and small (
) components, along with a two-component vector of spin-angular components, where
a refers to the
quantum numbers, with
and
for
and
, respectively [
12]. Multiplying the wave equation by
and integrating over all coordinates, we obtain an energy functional (expression) in terms of the unknown radial functions. In both approximations, the energy functional for
is a list of one-electron integrals (
) and two-electron Slater integrals, so that
where
a refers to a subshell of
equivalent electrons, and
b also refers to a subshell. The direct
and exchange
integral are particular cases of the generalized Slater integral
defined in
Section 6.1Coefficients
and
of the direct and exchange radial integrals, respectively, result from the integration of Coulomb Operator (
1) over spin-angular coordinates. This integration is highly selective [
13] and limits the value of
k that defines the Slater integrals for a given state.
In this section, we only consider the Dirac–Hartree–Fock (DHF) case. In multiconfiguration expansions where is a linear combination of CSFs, Slater integrals of different symmetries also appear. Scaling affects only the radial factor of the CSF wave function and not the spin-angular factors.
For multielectron atoms, the variational method for relativistic orbitals leads to a system of equations, one for each orbital
a with quantum numbers
, whose radial functions are varied. In the numerical multiconfiguration Dirac–Hartree–Fock method (MCDHF), these equations have the following form:
where
c is the constant used for the speed of light,
is a two-component vector of large
and small
components of the radial functions, i.e.,
,
is the potential,
is the orbital energy, and
S is a
identity matrix in the
r variable. In numerical methods, for cases with two or more electrons, potential
includes only the direct interactions, whereas the exchange contributions are included in the two-component functions
along with contributions of the type
from off-diagonal Lagrange multipliers that ensure the orthogonality of orbitals of the same
symmetry. The numerical solution of these equations involves the matching of outward and inward integration procedures. For more details, see [
6,
12,
14,
15].
In B-spline methods in which the radial functions for large and small components of one-electron spinors are expanded in B-spline bases, all variational contributions to the energy expression are included in a matrix, orbitals rotated for stationary energy, and Lagrangian multipliers eliminated through the use projection operators [
16,
17]. Orbital energy
is then an eigenvalue of an interaction matrix.
It is not obvious from the form of Equation (
5) that, as
, the solution approaches the nonrelativistic limit with
. This is clearer if factor
c is included in the definition of the orbital, i.e., if
was replaced by
, so that the second column of the matrix was divided by
c and the second equation was also divided by
c. In the nonrelativistic limit, the equation becomes the radial Schödinger equation but with the difference that the Dirac equation includes a “mass energy” correction. Thus, the virial theorem (VT) ratio is
, in the relativistic case, and
in the nonrelativistic case for exact solutions of the variational equations, where
V is the potential energy, and
T the kinetic energy.
The virial theorem (VT) is a special test of the computed wave function involving the scaling of the radial functions. When numerical methods are used, the tests simply regard how well the differential equations are solved, but they are also a test of how accurately the algorithms have determined Lagrangian multipliers, and with which assumptions or simplifications. There are cases where Lagrangian multipliers can be set to zero, the wave function remains unchanged, and energy is stationary. At the same time, there are cases where Lagrangian multipliers are sufficiently large so that radial functions have extra nodes making node counting an art. A good case for study are VT results from the B-spline Dirac–Hartree–Fock program
DBSR_HF [
16] and the nonrelativistic B-spline Hartree–Fock
SPHF code [
18] for the
case shown in
Table 1. Orthogonal transformations can be expressed as orbital rotations. In the
DBSR_HF code, the default is no rotations and off-diagonal Lagrangian multipliers set to zero; however, as seen in
Table 1, this result has a rather poor ratio, indicating 2–3 digits of accuracy, whereas with rotation, the same numerical methods are almost as accurate as those for the nonrelativistic B-spline method.
Table 2 reports the total energy (
E) in units of
along with the VT sum
and VT ratio for the neutral atoms, He
, Be [He]
, Ne [He]
, Ar [Ne]
, Kr [Ar]
, and Rn [Xe]
using the
DBSR_HF program to high precision. To improve accuracy, the first nonzero grid point was set to be
, the exponential grid-step parameter was reduced to he = 0.125, and the convergence for the energy was set to scf_tol = 1.d-16, the change in the largest value of the radial function to orb_tol = 1.d-10, and the tolerance for the tail region to end_tol = 1.d-10. The first two tolerances were for a relative change. The entire calculation was performed in double precision arithmetic with 15–16 digits of numerical accuracy. In other words, the requested accuracy was machine precision for the total energy. We see immediately that the VT sum grew in magnitude as the number of shells and total energy increased, except for the value for Kr with the
subshell where the value was especially large. In atomic spectroscopy, the digits after the decimal point are important, as wavelengths depend on energy differences that are usually a fraction of the
unit. The
DBSR_HF also reports the VT ratio that is related to the accuracy of the total energy, which was small for Rn because the total energy was large.
Both Kim [
19], and Lindgren and Rosén [
20], used the
operator and associated Slater integrals in their analysis of the relativistic Hartree–Fock equations, but relied on the
operator for deriving the VT theorem. In this paper, we explore scaling to see how the VT results are achieved when Slater integrals are used. We first consider the scaling of the radial equations of associated subshells of equivalent electrons, including the scaling of Slater integrals, and then relate the sum of these equations to the scaling of the total energy. We refer to the stationary condition for the orbital equation as VT
, whereas VT refers to that of the total energy.
2. General Theory
In order to analyze the effect on the energy of a single electron in a potential from the scaling of the radial function, it is convenient to write Equation (
5) in operator form, namely,
where
where
is the kinetic energy operator,
the potential energy operator,
M is a constant matrix referred to as the mass operator, and
S is the identity matrix.
Consider scaling perturbation
of the radial function, so that Equation (
6) becomes
where
is now a function of
. Before deriving an expression for the energy parameter, let us descale the entire radial equation by replacing
r by
, so that the equation becomes
The scaled radial function went back to the original function, whereas the operators in the radial equation changed.
In Equation (
9), function
is normalized; so, by multiplying on the left by
and integrating, it follows that
For a stationary solution
with respect to scaling parameter
, the energy satisfies variational condition
In order to continue the analysis, let us consider a continuous operator
and define the expectation value of the operator to be
and
. Then, according to Equations (
8) and (
10),
and the variational condition of Equation (
11):
Thus, we show that the scaling of the energy depends only on the scaling of and .
The scaling of
is simple. Clearly
as
. Operator
scales in the same manner, so that
However, the scaling of
is more complicated. In general, since the differentiation with respect to
and the integration with respect to
r commute, we know that
Now let
. Then,
Hence, with expectation values,
where we express the final scaling as
and a correction,
, the deviation from linear scaling. The above
operator was first introduced by Fock [
3,
4], and we refer to it as the Fock rule. This rule is useful when scaling is not straightforward.
The discussion has so far assumed a single electron in a potential. We extend this to a single electron in a Hartree potential for equivalent electrons in a given subshell, and lastly an orbital in a Hartree–Fock potential for multiple subshells of equivalent electrons, all of which are scaled. We show that, in general, the scaling condition of Equation (
14) resulting from the stationary condition of orbital energy Equation (
11) with respect to the variation of
(at
) has the form
where
may be zero for exact solutions in the case of a single orbital, but may not be zero in the case where the variation includes multiple subshells. Expressions for
appearing in Equation (
20) are determined as a correction arising from the nonhydrogenic potential. We refer to this equation as the virial theorem of an orbital (VT
) to distinguish it from the VT for the total energy for systems with multiple orbitals and show that
for a solution that satisfies the virial theorem. Here,
is the occupation number of shell
a.
Nonrelativistic Case
In nonrelativistic theory, small component
. Consequently,
and can be omitted. Though the values of Slater integrals are different, the scaling is the same. The nonrelativistic equation for an electron in a potential is
Thus, the kinetic energy operator is
for which
As a result, the energy equation and the stationary condition for the dilation
are the equivalent of Equations (
13) and (
20), respectively. For the total energy, the VT is usually written as
6. Multielectron Systems with Two or More Subshells
In order to derive expressions for the various kinetic, potential, and other properties of subshells, let us now individually consider the scaling of each of the contributions from radial integrals and their coefficients that appear in the energy functional of a state, and their contribution to the radial equations, and then derive the scaling equation for the total energy by summing over all subshells. This approach includes both direct and exchange contributions to the potential. For an exact (self-consistent) solution for which
, we now have
by Equation (
32).
In our analysis, we consider Dirac–Hartree–Fock energy expressions that include only one-electron integrals
and Slater integrals
or
that are 2-electron contributions to
where
Given an orbital basis and an energy expression, the kinetic and potential energies for subshells can readily be computed directly without considering orbital potentials.
The one-electron integrals multiplied by their coefficients contribute directly to , , and , whereas Slater integrals contribute to two subshells, when or twice to subshell when .
6.1. Slater Integrals
The relativistic Slater integral, usually denoted as
, is a two-dimensional integral with two coordinates, e.g.,
. Let us denote radial factors in terms of vector products of large and small components
In this definition,
is defined by the orbitals of the first coordinate and the contribution to the potential by the orbital of the second coordinate. Slater integrals have many symmetries arising from the symmetry of a product and the symmetry of the coordinate system. In the canonical form, orbitals are in a designated order, such that
,
and, if
, then also
. Consequently, a typical canonical integral is
by the symmetry of the coordinate system. Because of this symmetry, we consider only the first case in our study.
Each
function itself is the sum of two integrals. In particular,
where one is integrated over
and one over
as indicated below:
Because variable
r only defines the range of integration of the
X integrals, the sum of derivatives of
and
obeys a useful relation
6.2. Scaling of the Operator
For the variation of a contribution to the potential from orbitals of the second coordinate, we need to consider the contribution to a potential, for example,
where
Factors
and
scale respectively as
and
, and given Equation (
36), direct differentiation is more straightforward than the Fock rule (
18) and yields
6.3. Symmetry of Slater Integrals in DHF Energy Expressions
The above derivation was a general derivation for any combination of orbitals. In DHF calculations, only
or
symmetries occur. Then, if
and
, the contribution to scaling
would be
Here, we introduce subscript
to denote the
function required for the calculation of the expectation value. Scaling in the other coordinate yields a contribution to
In the case of a
Slater integral, there is only one possibility:
and
, and integral
would contribute to both
and
, multiplied by the coefficient of the Slater integral.
In this notation, a Slater integral can be expressed as an expectation value of a potential function, namely,
6.4. Total Energy Virial Equation
Computationally, the list of one-electron integrals and Slater integrals along with their coefficients defines the total energy of a state and also determines the subshell quantities
,
,
, and
. Then,
are total values that contribute directly to the total energy and its dilation equation. However, the sum of potential energies of subshells is different. Because the Slater integrals with orbitals
from subshells
contribute to both
and
(or twice to
when
), we have
where
is the potential energy for the total energy and in both coordinates, and
represents the contribution from the 2-body Slater integrals. Then, we have two equations:
Consequently,
. Substituting into Energy Equation (
45), we obtain mass dilation condition
Thus, an exact solution requires that , in which case there are two virial conditions, namely, and
6.5. Slater Integral Scaling Condition
Condition is satisfied if for each Slater integral the dilation contributions from both coordinates, and the Slater integral is zero. There are three cases that need to be considered:
: In Equation (
38) the scaling is the same in both cooordinates and expressing
in terms of potentials; then, it follows that
by Equation (
38).
: In this case, the contribution is also the same for each coordinate, and the sum (involving both subshells) is
: In this case, the scaling depends on the coordinate, and the effect of scaling for orbital
a differs from that for orbital
b. However, the combined contribution to the dilation equation for the total energy should still be zero, namely,
In this case, the scaling within a given coordinate does not need to be zero, although the sum for the two coordinates should be zero.
In summary, the scaling or dilation conditions for (Dirac)–Hartree–Fock calculations for the three types of Slater integrals are:
Thus, the stationary conditions for dilation require a balance of the contributions to the potential from the inner () and outer () regions. Particularly intricate is the condition for the condition, involving two different charge densities.
7. Results and Analysis
Traditionally, the virial theorem for the total energy is computed (in sum form) simply as
, which would be
in the formalism of Equation (
46). In order to gain insight into how the Slater integral method achieved linear scaling, we predicted the scaling contribution to the stationary condition of each integral and computed
, which showed the extensive balancing of contributions of an orbital potential from the charge distribution. However, the most extensive analysis of the orbital equations for an atomic system is the study of the scaling of orbital equations leading to the calculation of the total energy.
Table 5 shows results for the ground states of ions of Fm (
) as more closed shells are added with values for He-like, Be-like, and Ne-like studies using radial functions from
DBSR_HF [
16] for standard (not high-precision) calculations, and Ar-like with radial functions from
grasp [
25]. In each case, radial functions were transformed to the numerical grid of the revised procedures [
15], and the various components were computed and analyzed. Displayed is information about the scaling of each individual orbital subshells, the most important being
. For He-like, the value is small but somewhat larger than the value reported in
Table 4 because the present values are not high-precision results.
Let us confirm some results for the first case of
. The formula for the energy is
Then, the orbital subshell equations for the energy and dilation are, respectively,
and
with
by Equation (
38). As a result, for an exact solution of
, where
and
, we have conditions
from which it follows that
Here,
is both a weighting factor for the expectation value and the charge density function that determines a contribution to the potential as a function of
r.
Another interesting case is Be-like
. The
Slater integral connects two different orbital equations and the dilation conditions (Equation (
51) are distributed to both equal size and opposite sign. When summed for the total energy, these contributions are cancelled out. In larger systems with multiple subshells, similar cancellations are involved.
Frequently, when spectral calculations are performed for multiple levels, a fixed core approximation is used where the orbitals for the core are not varied.
Table 6 shows high-precision virial results for the case where the
orbital is fixed at the value for the Be
, and only the
orbital is varied. In this fixed core approximation, the
values no longer cancel. The fixed core approximation is often desirable in the case of heavy elements. Codes such as
dbsr-hf should report the virial theorem results only for the orbitals that are varied, which means that the interaction of varied subshells with the fixed core subshells needs to be omitted in analysis.
Some spline methods for nonrelativistic variational equations, updating a few or all orbitals simultaneously, were investigated by Froese Fischer et al. [
26]. This is a quadratically convergent Newton–Raphson method that has achieved better performance. In their study, the calculation for Be
converged in 4 iterations with a virial theorem deviating from
by
, considerably fewer than the typical 12 SCF iterations. Newton–Raphson methods have not yet been applied to atomic relativistic equations
1 but are expected to perform similarly.