www.mdpi.org/ijms/ The State-Universal Multi-Reference Coupled-Cluster Theory: An Overview of Some Recent Advances

Abstract: Some recent advances in the area of multi-reference coupled-cluster the-ory of the state-universal type are overviewed. An emphasis is placed on the follow-ing new developments: (i) the idea of combining the state-universal multi-referencecoupled-cluster singles and doubles method (SUMRCCSD) with the multi-referencemany-body perturbation theory (MRMBPT), in which cluster amplitudes of the SUM-RCCSD formalism that carry only core and virtual orbital indices are replaced by theirﬁrst-order MRMBPT estimates; and (ii) the idea of combining the recently proposedmethod of moments of coupled-cluster equations with the SUMRCC formalism. It isdemonstrated that the new SUMRCCSD(1) method, obtained by approximating theSUMRCCSD cluster amplitudes carrying only core and virtual orbital indices by theirﬁrst-orderMRMBPTvalues, providestheresultsthatarecomparabletothoseobtainedwith the complete SUMRCCSD approach.Keywords: Coupled-cluster theory; Multi-reference formalism; State-universal multi-reference coupled-cluster method; Many-body perturbation theory; Method of mo-ments of coupled-cluster equations c 2002 by MDPI, Basel, Switzerland. Reproduction for noncommercial purposes permitted.

The genuine MRCC theories classify as either the Fock-space or valence-universal (VU) methods [4,6, or the Hilbert-space or state-universal (SU) approaches [4,6,.The VUMRCC methods, which require a simultaneous consideration of ground and excited states of a given many-electron system and its ions (different sectors of the Fock space), are excellent for describing vertical excitation energies, vertical ionization potentials, and vertical electron affinities.Unfortunately, the VUMRCC methods suffer from intruder states and unphysical multiple solutions [98,99], which, together with the requirement of considering different sectors of the Fock space, make accurate VUMRCC calculations of larger portions of molecular potential energy surfaces (PESs) rather difficult.In spite of these formal and practical complications, the recent large scale applications of the VUMRCC methods to realistic atomic and molecular problems (see, e.g., Refs.[88][89][90][91][92][94][95][96][97] and the interesting new ideas, such as the idea of exploiting similarity transformations to separate eigenvalue problems for different valence sectors of the Fock space [93] and the idea of using the VUMRCC concepts in the so-called similarity transformed equation-of-motion CC theory [100,101], show a lot of promise. The SUMRCC approaches, which do not require a consideration of different sectors of the Fock space, seem to be well suited for studies of molecular PESs.Indeed, as shown in Refs.41,43,44,[46][47][48][49][50]54, the PES and property function scans with the SUMRCC methods can be very successful and highly accurate results can be obtained.The SUMRCC method is also capable of providing an extremely accurate description of electronic energy separations in small molecular systems, as has been illustrated by the calculations of the singlet-triplet (A 1 A 1 − X 3 B 1 ) [51,52] and singlet-singlet (2 [50] energy gaps in methylene.For example, the orthogonally spin-adapted (OSA) [102][103][104][105][106] two-reference SUMRCCSD (SUMRCC singles and doubles) approach [39,41,[45][46][47][48][49][50]54] including all relevant direct and coupling terms [50], which contrary to the statements made in Ref. 107 represents the most complete formulation of this method to date, combined with the open-shell CCSD approach [108], gives 3133 cm −1 for the singlet-triplet separation T 0 .This compares extremely well with the spectroscopically derived value of T 0 of 3147 ± 5 cm −1 [109].There are open problems in the SUMRCC theory (cf. the remarks below), but this and other examples show that the SUMRCC theory is a highly promising formalism, which needs to be developed further.Unfortunately, apart from the earlier advances in formulating, implementing, and testing the spin-adapted and spin-orbital SUMRCCSD methods [4,6,[38][39][40][41][42][43][44][45][46][47][48][49][50][51][52][53][54][55][56] and apart from the recent activity in our group (see, e.g., Refs.[57][58][59] and the group of Pal [110], who formulated the response SUMRCC theory, the development of the genuine SUMRCC method has practically stopped, which is a situation that this paper hopes to change (at least, to some extent).
There are several open problems in the SUMRCC theory that must be addressed if we wish this method to become a useful alternative for routine and accurate ab initio calculations of ground and excited-state molecular PESs.There are, for example, problems related to the existence of multiple [47,57] and singular [41,[46][47][48][49]57] solutions, intruder states [46,47,49,57], and the existence of the so-called intruder solutions [57].The latter solutions are related to a specific algebraic nature of the generalized Bloch equation, on which the SUMRCC theory is based [111].The existence of multiple intruder solutions of the SUMRCCSD equations may cause severe convergence problems or, at the very least, a significant decrease in accuracy of the calculated electronic energies in regions of PESs where the electronic states of interest are no longer clearly separated from the rest of the electronic spectrum [47][48][49]57].There is also a long-standing problem of generalizing the existing two-reference OSA SUMRCCSD theory [39,41,[45][46][47][48][49][50]54] and its two electron/two orbital spin-orbital analog [43,44] to larger reference spaces, which has not been solved yet in a satisfactory manner.Our recent studies of the solutions of the generalized Bloch equation [111] clearly demonstrate that the SUMRCC calculations would benefit from using larger reference spaces.Unfortunately, it is not easy to propose an efficient computational scheme that would allow us to perform routine SUMRCC calculations for larger reference or model spaces.Part of the problem is the fact that the wave function ansatz of Jeziorski and Monkhorst [38], on which all SUMRCC approaches are based, requires that a separate cluster operator T (p) is assigned to each reference configuration |Φ p (p = 1, . . ., M).Aside from various mathematical difficulties that this assumption creates, the requirement of having a separate cluster operator T (p) for each reference configuration |Φ p leads to an excessively large number of cluster amplitudes when the dimension of model space (M ) is large and when we are only interested in a few low-lying states.
The present paper reviews some of our recent work that addresses the above problems.It is evident from the results described in Ref.48 and from the comments made in our recent work [111] that the inclusion of higher-than-doubly excited clusters in the SUMRCC formalism should help to eliminate at least some of the difficulties encountered in the SUMRCCSD calculations.The existence of multiple and pathological solutions of the SUMRCCSD equations is largely related to an asymmetric treatment of the excitation manifolds corresponding to different reference configurations and to a nonlinear nature of the generalized Bloch equation [57,111], so that the standard corrections due to triply [53] and other higher-than-doubly excited clusters, based on the multi-reference many-body perturbation theory (MRMBPT) [112][113][114], do not eliminate these problems.As a matter of fact, the use of the standard, MRMBPT-based estimates of higher-order corrections within the SUMRCC formalism is a rather risky procedure, since the MRMBPT approach suffers from intruder states in regions where the SUMRCCSD approach fails.In our view, it is more important to examine first the mathematical relationship between the approximate and exact SUMRCC formalisms, so that one can suggest new ways of systematically correcting the results of the SUMRCC calculations, particularly in all these difficult cases where the conventional MRMBPT arguments fail.Clearly, it would be highly desirable to have a simple method of correcting the energies of electronic states obtained in the SUMRCC calculations in a state-specific and non-iterative manner that would resemble the well-known noniterative a posteriori corrections due to triples or triples and quadruples characterizing, for example, the popular CCSD(T) method [115] and its CCSD(TQ f ) analog [116].
We have recently suggested an approach, termed the method of moments of the state-universal multi-reference coupled-cluster equations (MM-SUMRCC), which provides us with simple recipes for systematically improving the SUMRCCSD results by adding the state-specific noniterative corrections due to triples and other higher-than-doubly excited clusters to the energies obtained by solving the SUMRCCSD equations [117].The MM-SUMRCC approach differs from the standard multi-reference approaches in that, in computing the energy corrections due to higher-order clusters, in the MM-SUMRCC theory we rely on the explicit and rigorous relationship between the energies obtained in the SUMRCC (e.g., SUMRCCSD) calculations and the exact energy values.In the standard multi-reference methods, we can only hope that by adding sufficiently many higher-order terms to the equations representing low-order approaches, one obtains better results; the control over the choice of higher-order terms is limited by the fact that we use the MRMBPT or similar arguments, which may not be sufficiently transparent in situations where the MRMBPT approach suffers from intruder states.The MM-SUMRCC theory, which is a multireference analog of the recently developed method of moments of the single-reference [7,[118][119][120][121] and equation-of-motion [121][122][123] CC equations, is overviewed in this paper.
We also discuss the recently proposed SUMRCCSD(1) approach, which is based on the idea of combining the SUMRCCSD method with the MRMBPT approach [59].In the SUMRCCSD(1) method, we approximate the doubly excited cluster amplitudes that carry only core and virtual orbital indices by their first-order MRMBPT estimates.In addition, we assume that the singly excited cluster amplitudes corresponding to core-virtual excitations vanish.As we discuss it in this paper, both assumptions lead to a completely symmetric treatment of the core-virtual cluster amplitudes of the SUMRCCSD theory, which in the SUMRCCSD(1) method no longer depend on the reference label p.The more symmetric treatment of the excitation manifolds corresponding to different reference configurations in the SUMRCCSD(1) method may have a positive effect on the calculated energies, while offering us considerable savings in the computer effort.In the SUMRCCSD theory, each individual core-virtual excitation is represented by as many independent cluster amplitudes as the number of reference configurations, although in the intuitive description we would only use as many independent cluster amplitudes for a given core-virtual excitation as the number of electronic states of interest.In the SUMRCCSD(1) method, we simplify this situation to the utmost by replacing many sets of cluster amplitudes carrying core and virtual orbital indices, each labeled by the corresponding reference label p, by a single set.Moreover, instead of solving for cluster amplitudes carrying only core and virtual indices, in the SUMRCCSD(1) approach we approximate them by the simple first-order MRMBPT expressions.This leads to additional savings in the computer effort.Since core-virtual excitations are known to be rather insensitive to nuclear geometry [124], the molecular PESs resulting from the SUMRCCSD(1) calculations should be virtually parallel to the SUMRCCSD PESs.
Whenever possible, we illustrate our new ideas with the examples of numerical calculations.Some examples are taken from our recent work [59], while some other examples are new.We hope that this overview of our recent efforts in the area of MRCC theory will stimulate further activity aimed at the development of accurate and practical genuine MRCC methods of the state-universal type that may one day be used in routine ab initio calculations.

The State-Universal Multi-Reference Coupled-Cluster Theory
As mentioned in the Introduction, the SUMRCC theory belongs to a wider category of methods which are referred to as the genuine MRCC approaches.All genuine MRCC methods involve three basic concepts, namely, that of the multi-dimensional model or reference space M 0 , that of the wave operator U , and that of the effective Hamiltonian H eff [4,6].Alternative formulations of genuine multi-reference methods, in which the effective Hamiltonians are obtained by similarity transformations of the Hamiltonian (this frees us from the necessity of solving for the wave operator) can be developed [93,125] (cf., also, Refs.9, 10, 40), but we do not use them here, since the SUMRCC theory of Jeziorski and Monkhorst [38] is based on the exponential parameterization of the wave operator and solving the generalized Bloch equation for the wave operator represented by the Jeziorski-Monkhorst ansatz.
The model space M 0 , is spanned by a suitably chosen set of M configuration state functions |Φ p , p = 1, . . ., M, that provide a reasonable zero-order description of the target space, spanned by M quasi-degenerate eigenstates |Ψ µ , µ = 1, . . ., M, of the electronic Hamiltonian H.The wave operator U : M 0 → M is defined as a one-to-one mapping between M 0 and M. It is usually assumed that U satisfies the intermediate normalization condition, where P is the projection operator onto M 0 , It is also assumed that U annihilates states belonging to the orthogonal complement M ⊥ 0 , so that where Equations ( 3) and ( 5) immediately imply that U 2 = U , so that the wave operator U , just like operators P and Q, is idempotent.However, unlike P and Q, the wave operator U is not Hermitian, U = U † .The wave operator U is obtained by solving the equation, which is known in the literature as the generalized Bloch equation [4,6,[8][9][10]78].Once the wave operator is determined, the energies E µ of the electronic states |Ψ µ , µ = 1, . . ., M, are obtained by diagonalizing the effective Hamiltonian, within M 0 .The corresponding wave functions |Ψ µ are calculated using the formula, where the zero-order states, are the right eigenstates of H eff , In the SUMRCC formalism, we use the Jeziorski-Monkhorst ansatz for the wave operator U , i.e. [38], where T (p) is the cluster operator corresponding to reference configuration |Φ p .The intermediate normalization condition, Eq. ( 3), requires that each T (p) generates states belonging to M ⊥ 0 , when acting on the corresponding |Φ p .The wave operator U , Eq. ( 12), reduces to the well-known wave operator of the single-reference CC theory, U SRCC = e T |Φ Φ|, when model space M 0 is a one-dimensional space spanned by a single reference configuration |Φ .
In order to define the remaining elements of the SUMRCC theory, we must specify the method of constructing the model space M 0 and cluster operators T (p) .As in all multi-reference ab initio methods, in order to define the reference configurations |Φ p , we divide all molecular spin-orbitals into the three disjoint subsets of core, active, and virtual spin-orbitals.The core spin-orbitals (designated by i, j, . ..) are occupied and the virtual ones (designated by a, b, . ..) are unoccupied in all reference configurations |Φ p , p = 1, . . ., M. The reference configurations differ in the occupancies of active spin-orbitals (designated by I, J, . . ., for spin-orbitals occupied in a given reference determinant |Φ p , and by A, B, . . ., for spin-orbitals unoccupied in this |Φ p ).All possible distributions of active electrons among active spin-orbitals result in a complete model (or active) space (CAS).The use of CAS is essential to obtain size extensive results [38], although it is possible to formulate the size extensive SUMRCC method employing incomplete model spaces by relaxing the intermediate normalization condition [126,127].In this paper, we consider the CAS formulation of the SUMRCC theory.
In analogy to the standard single-reference CC theory, each cluster operator T (p) is a sum of its many-body components T (p)  n .In the exact SUMRCC formalism, where N is the number of electrons in a system under consideration.In the standard SUMRCC approximations, the many-body expansion of each cluster operator T (p) is truncated at some (usually low) excitation level.Thus, if m A is the excitation level defining a given standard SUMRCC approximation, referred here and elsewhere in this work to as method A, the corresponding cluster operators T (p) have the following form: Note that the value of m A is the same for all values of p.The SUMRCCSD method is obtained by setting m A = 2.In a conventional SUMRCC theory of Jeziorski and Monkhorst [38], the system of coupled nonlinear equations for the unknown cluster operators T (p)  m is obtained by replacing Eq. ( 7) by the equation HU|Φ p = UHU|Φ p , where U is defined by Eq. ( 12), premultiplying this equation on the left by e −T (p) , and projecting the resulting equation on the excited configurations relative to |Φ p belonging to M ⊥ 0 .The final equations for cluster operators T (p) m defining the exact SUMRCC theory can be written as follows: where the left-hand side direct term is defined as and the right-hand side coupling term, which reflects the multi-reference nature of the SUMRCC theory, takes the form The operators (p) E K are the excitation operators, generating the excited configurations relative to |Φ p belonging to M ⊥ 0 , when acting on |Φ p .These operators are also used to represent cluster operators where coefficients (p) t K are the corresponding cluster amplitudes.Because of this representation, the system of equations, Eq. ( 15), represents a system of coupled nonlinear algebraic equations for cluster amplitudes (p) t K .For the complete model space M 0 , the excitation operators (p) E K and cluster amplitudes (p) t K carry at least one inactive (i.e., core or virtual) spin-orbital index [38].
The above equations defining the exact SUMRCC theory, particularly Eqs. ( 15)-( 17) and ( 19), can be rewritten in a somewhat more symbolic form, namely, e −T (p) e T (q) P (q) H eff ]|Φ p = 0 (p = 1, . . ., M), (23) where subscript C designates the connected part of the corresponding operator expression.In deriving Eqs. ( 22) and ( 23), we used the well-known fact that [4,6,7] Operator P appearing in Eq. ( 22) is the projection operator onto M 0 defined by Eq. ( 4), whereas operators Q (p) appearing in Eq. ( 23) are the projection operators onto the manifolds of excited configurations relative to |Φ p belonging to M ⊥ 0 that are generated by excitation operators n designates a projection operator onto the subspace of M M ⊥ 0 spanned by the n-tuply excited configurations relative to |Φ p that belong to M ⊥ 0 , we can write In the exact theory, all manifolds of excitations M ⊥(p) 0 , used to define cluster operators T (p) , are identical, i.e., M ⊥(p) 0 = M ⊥ 0 for all p = 1, . . ., M. This is a consequence of the fact that independent of the value of p.Because of this symmetric treatment of the manifolds of excitations corresponding to different references |Φ p , the exact SUMRCC method is completely equivalent to an eigenvalue problem for eigenstates |Ψ µ , µ = 1, . . ., M [111] (see, also, Ref. 117).This is no longer the case, when cluster operators T (p) are truncated in a standard manner according to Eq. ( 14).Indeed, in the standard SUMRCC approximations, the system of equations ( 22) and ( 23) is replaced by a truncated system of equations for the many-body components where operators T (p) A are defined by Eq. ( 14), H eff A is the effective Hamiltonian of the approximate SUMRCC method A, and A is a projection operator onto the excited configurations used to define T (p) The manifolds of excitations M ⊥(p)

used to define the truncated cluster operators T (p)
A , are usually different for different reference configurations |Φ p (cf., e.g., the lists of excitations (p) E K , corresponding to different references |Φ p of the OSA SUMRCCSD formalism of Refs.39,41,[45][46][47][48][49][50]54,given in Ref. 47).As pointed out in Ref. 111, this asymmetric treatment of the manifolds of excitations corresponding to different reference configurations causes that the approximate SUMRCC schemes based on Eqs. ( 27) and ( 28) (including the existing SUMRCCSD methods) are not equivalent to any Hermitian eigenvalue problem.This significant distortion of the exact SUMRCC theory, resulting from the truncation of the many-body expansions of all operators T (p) at the same excitation level m A , leads to a number of pathologies in approximate SUMRCC calculations based on Eqs. ( 27) and (28).These pathologies include the existence of an excessive number of real and complex solutions that lack physical interpretation and the appearance of the intruder solution problem [57].The multi-reference extension of the MMCC theory, which we recently suggested in Ref. 117, and which we overview in the next section, offers a possibility of reducing the severity of problems encountered in the standard SUMRCC (e.g., SUMRCCSD) calculations by incorporating higher-order effects in the SUMRCC formalism and by reinforcing the symmetric treatment of the M ⊥(p) 0,A subspaces.As mentioned in the Introduction, inclusion of higher-than-doubly excited clusters in the SUM-RCC formalism should help to eliminate some pathologies that are encountered in the SUMRCCSD calculations.Although one can suggest ways of improving the SUMRCCSD results by adding the corrections due to triply excited clusters based on the standard MRMBPT theory [53], the use of the MRMBPT arguments in constructing such corrections may, in general, be a risky procedure, since the MRMBPT approach is often plagued by intruder states.Undoubtedly, it would be useful to have an alternative method of correcting the SUMRCCSD results, which does not necessarily rely on the standard MRMBPT arguments.
We have recently introduced a new, nonstandard method of correcting the results of various CC calculations, termed the method of moments of coupled-cluster equations (MMCC) [7,[117][118][119][120][121][122][123].The main idea of the MMCC formalism is that of the noniterative energy corrections which, when added to the energies obtained in approximate CC or equation-of-motion CC (EOMCC) [128][129][130] calculations, such as CCSD or EOMCCSD, recover the exact (full configuration interaction or full CI) energies of ground or excited states.It has been demonstrated that the MMCC formalism allows us to renormalize the existing noniterative single-reference CC approximations, such as CCSD(T) [115], CCSD(TQ f ) [116], and CCSDT(Q f ) [116], so that they can correctly describe entire ground-state PESs in situations where the standard arguments based on MBPT, on which the CCSD(T), CCSD(TQ f ), and similar approximations are based, completely fail [7,26,[117][118][119][120][121][131][132][133] (cf., also, Ref. 134 for a rederivation of the renormalized CCSD(T) expressions, published one year earlier in Refs.7, 118, 119, and for some additional tests).It has also been demonstrated that the EOMCC-based excited-state MMCC theory allows us to introduce a new hierarchy of simple noniterative EOMCC approximations that remove the pervasive failing of the EOMCCSD and perturbative EOMCCSDT approximations in describing excited-state PESs [122,123].Clearly, the MMCC methodology provides us with new ways of systematically improving the CC or EOMCC results by adding simple noniterative corrections to the CC or EOMCC energies.Thus, it might be useful to investigate the possibility of extending the MMCC formalism to a multi-reference case.
Encouraged by the remarkable performance of the single-reference MMCC approximations, we have recently generalized the MMCC formalism to a multi-reference case by proposing the method of moments of the state-universal multi-reference coupled-cluster equations (MM-SUMRCC) [117].The main idea of the MM-SUMRCC theory is that of the noniterative, state-specific energy corrections which, when added to energies E A µ , obtained by solving the approximate SUMRCC (e.g., SUMR-CCSD) equations, recover the exact (full CI) energies E µ of the electronic states of interest.The main purpose of the approximate MM-SUMRCC calculations is to estimate corrections δ µ , so that the resulting energies E A µ + δ µ remain very close to the corresponding exact energies E µ .Each correction δ µ is a nontrivial functional of the corresponding exact electronic wave function |Ψ µ and the generalized moments of the SUMRCC equations, i.e., the SUMRCC equations projected on the excited configurations whose excitation level exceeds that defining a given SUMRCC approximation.The precise mathematical definition of the generalized moments of the SUMRCC equations, which is consistent with Eq. ( 28) and which is used in the MM-SUMRCC formalism of Ref. 117, is as follows: where [. ..] m designates the m-body component of the corresponding operator expression.The generalized moments Γ (p) m (m A )|Φ p , Eq. ( 31), can be viewed as the most fundamental quantities of the Jeziorski-Monkhorst theory, as defined by Eqs. ( 27) and ( 28), since the system of equations for the many-body components of cluster operators T (p) A , p = 1, . . ., M, Eq. ( 28), is immediately obtained by imposing a requirement that the lowest moments Γ The generalized moments Γ (p) m (m A )|Φ p , which enter the formula for the noniterative corrections δ µ , Eq. ( 30), are those with m > m A .As shown in Ref. 117, the explicit formula for corrections δ µ , in terms of moments Γ (p)  m (m A )|Φ p , is as follows: where c A pµ = Φ p |χ A µ are the coefficients defining the model-space states obtained by diagonalizing the effective Hamiltonian of the approximate SUMRCC method A in M 0 , is an overlap of the exact wave function |Ψ µ and the SUMRCC wave function obtained with method A, and (e A .The derivation of Eq. ( 33) is based on considering the energy functional where [cf.Eq. ( 12)] is an approximate wave operator obtained by solving the SUMRCC equations of method A, |χ is a state belonging to M 0 , and |Ψ is an N -electron wave function.For |Ψ = |Ψ µ and for |χ equal to the corresponding eigenstate |χ A µ of the effective Hamiltonian H eff A , we immediately obtain The derivation of Eq. ( 33) proceeds as follows: First, we show the following general relationship [117]: where Next, we prove that quantities The substitution of Eq. ( 42) into Eq.( 40) and the use of the fact that in method A the lowest moments Γ (p) m (m A )|Φ p , with m = 1, . . ., m A , vanish [cf.Eq. ( 32)] allow us to write Ψ|(e By setting |Ψ = |Ψ µ and |χ = |χ A µ in Eq. ( 43) and using Eqs.(38) and (39), we obtain the desired Eq. (33).
Equation (33) is the basic equation of the MM-SUMRCC theory.We can use it to improve the results of approximate SUMRCC calculations in the following way: First, we solve the equations of a given SUMRCC method, Eqs. ( 27) and (28), to determine the cluster operators, T (p) A , p = 1, . . ., M, and the corresponding effective Hamiltonian, H eff A .Next, we construct the generalized moments Γ (p)  m (m A )|Φ p with m > m A using Eq.(31).Finally, we use cluster operators A , generalized moments Γ (p)  m (m A )|Φ p , and the right eigenvectors of H eff A , i.e., c A µ = (c A 1µ , . . ., c A Mµ ), to calculate corrections δ µ with the help of Eq. (33).
We should also note the formal similarity of Eq. ( 33) and the single-reference MMCC energy formula discussed in Refs.7, 118, 119 (for a review, see Ref. 121).Indeed, in the single-reference (M = 1) case, the d A µ denominator term in Eq. ( 33) reduces to Ψ µ |e T (1) A |Φ 1 , since the model-space state |χ A µ becomes proportional to the reference configuration |Φ 1 .In addition, when M = 1, the generalized moments of the SUMRCC equations, Eq. ( 31), reduce to the generalized moments of the single-reference CC equations defined in Refs.7, 118, 119.We obtain [cf.Eq. ( 31)], Γ (1)  m (m A )|Φ 1 = Q (1)  m (He where M m (m A )|Φ 1 are the generalized moments of the single-reference CC equations.The substitution of Eq. ( 44) into the M = 1 variant of Eq. ( 33) leads to the following result: which clearly is the single-reference MMCC energy formula, provided that we identify T (1) A and |Φ 1 with, respectively, the cluster operator and the reference configuration of the standard singlereference CC method.
The exact form of Eq. ( 33) cannot be used in practical calculations, since we usually do not know the exact (full CI) wave functions |Ψ µ , µ = 1, . . ., M. However, we can calculate the approximate values of corrections δ µ which, when added to the SUMRCC energies E A µ , may give very good estimates of the exact energies E µ , if we use simple estimates of wave functions |Ψ µ , provided by one of the relatively inexpensive ab initio methods.Independent of the approximate form of |Ψ µ chosen for such calculations, corrections δ µ can be calculated in a state-specific manner.Our belief that simple estimates of wave functions |Ψ µ may be sufficient to obtain accurate δ µ values is based on the success of the single-reference MMCC methods and their renormalized CC analogs [7,26,[117][118][119][120][121][122][123][131][132][133][134], in which simple perturbative or CI wave functions are used to construct the relevant energy corrections.
For example, we can use wave functions |Ψ µ obtained in truncated multi-reference CI (MRCI) calculations (using, e.g., the popular MRCISD method or one of its approximate variants) and use the resulting corrections δ µ to improve the results of the SUMRCCSD calculations (the m A = 2 case).We can also think of using the CISDt or CISDtq [7,120,122,123] approaches, in which triply and quadruply excited configurations of the single-reference CI method are selected via active orbitals, to construct wave functions |Ψ µ in Eq. (33).In either case, we should be able to significantly improve the quality of the SUMRCCSD results and reinforce a fully symmetric treatment of the manifolds of excitations corresponding to different reference configurations, which is broken by the SUMRCCSD and other SUMRCC approximations.Indeed, when m A < N, the M  27) and ( 28) are not equivalent to any Hermitian eigenvalue problem which, in turn, leads to various problems in SUMRCC calculations.However, if we do not truncate the summations over n and m in Eq. ( 33) in any arbitrary manner and if we simply let the projection onto a suitably chosen approximate wave function |Ψ µ select terms in the summations over p, n, and m in the numerator of Eq. ( 33), we will obtain a fully symmetric treatment of the M ⊥(p) 0,A subspaces corresponding to different references |Φ p .In order for this scheme to work, we only have to assume that the CI expansions of wave functions |Ψ µ contain some M ⊥ 0 configurations whose excitation level relative to at least one of the M references |Φ p exceeds m A .This is certainly true for the MRCISD wave functions and their CISDt and CISDtq analogs if we are interested in correcting the SUMRCCSD results.The projection onto |Ψ µ in the numerator of Eq. (33) will select precisely those subsets of the generalized moments Γ (p)  m (m A )|Φ p (usually, different subsets of Γ (p) 's for different values of p) that are needed to restore a symmetric treatment of the manifolds of excitations in the approximate SUMRCC (e.g., SUMRCCSD) calculations.Although this particular way of improving the SUMRCCSD results by using the MRCISD, CISDt, or CISDtq wave functions |Ψ µ in Eq. ( 33) has not been tested yet in actual numerical calculations, we believe that we should be able to obtain significant improvements in the calculated SUMRCC energies, particularly in regions plagued by intruder states or intruder solutions, where there is an apparent need to incorporate higher-than-doubly excited clusters and have a more symmetric treatment of the M ⊥(p) 0,A subspaces in the SUMRCC calculations.We can also contemplate other ways of using Eq.(33).We can, for example, introduce the multi-reference analogs of the MMCC(m A , m B ) approximations suggested in Refs.energy formula can be given the following form [117]: where As implied by Eq. ( 46), the MM-SUCC(m A , m B ) method uses moments Γ (p) m (m A )|Φ p with m = m A + 1, . . ., m B .For typical applications of Eq. ( 46) (e.g., m A = 2 and m B = 3 or 4), the Γ (p) m (m A )|Φ p moments with m = m A + 1, . . ., m B form a small subset of all Γ (p) 's.
The simplest example of the MM-SUCC(m A , m B ) approximation is the MM-SUCC(2,3) scheme, in which we use Eq. ( 46) to correct the results of the SUMRCCSD calculations (the m A = 2 case).In this case, the only generalized moments of the SUMRCC equations that need to be considered are the Γ where representing an overlap of the "trial" wave function |Ψ µ and the SUMRCCSD wave function [cf.Eq. ( 36)] The coefficients c SUMRCCSD pµ , p = 1, . . ., M, are the components of the right eigenvector of the SUMRCCSD effective Hamiltonian, whose matrix elements, designated here by H eff qp (2), are defined as follows [cf.Eqs. ( 19) and ( 24)]: where and are the singly and doubly excited clusters of the SUMRCCSD approach.The Γ (p) 3 (2)|Φ p moments that appear in Eq. ( 49) can be expressed in terms of the projections of the SUMRCCSD equations on the triply excited configurations relative to |Φ p .If i, j, k, . . .(a, b, c, . ..) represent the spin-orbitals that are occupied (unoccupied) in the reference configuration |Φ p and if (p) E abc ijk = X a X i X b X j X c X k are the excitation operators that generate the triply excited configurations relative to |Φ p (X a and X i are the usual creation and annihilation operators, respectively), we can write where 2 ) e T (q) 1 +T (q) 2 |Φ q H eff qp (2) (54) are the projections of the SUMRCCSD equations on triexcited configurations ( (p) E ijk abc represents the hermitian adjoint to (p) E abc ijk ).In the case of the complete model space M 0 , considered here, at least one index among i, j, k, a, b, c in Eqs. ( 53) and ( 54) must be inactive.
The MM-SUCC(2,3) scheme described above represents a multi-reference analog of the singlereference MMCC(2,3) method introduced in Refs.7, 118, 119 (cf., also, Refs.[120][121][122][123].In analogy to the latter method, it might be useful to consider the second-order MRMBPT [MRMBPT(2)] wave function or, perhaps even better, an analog of the MRMBPT(2) wave function, obtained by replacing the lowest-order T  48) and ( 49) would lead to a multi-reference extension of the recently proposed completely renormalized CCSD(T) [CR-CCSD(T)] method [7, 26, 117-119, 121, 132, 133].The spectacular successes of the single-reference CR-CCSD(T) approach in calculations of groundstate PESs involving bond breaking, where the standard CCSD(T) approach completely fails, suggest that the multi-reference analog of the CR-CCSD(T) approach, obtained by inserting the MRMBPT(2)-like wave functions |Ψ µ in Eqs.(48) and (49), may provide excellent results, particularly in difficult situations in which the use of the low-order MRMBPT theory alone to estimate the higher-order (e.g., T (p) 3 ) effects is not entirely appropriate due to the presence of intruder states.
We realize, of course, that using the approximate CI or MBPT methods to calculate wave functions |Ψ µ in Eq. ( 33) and considering the truncated MM-SUCC(m A , m B ) schemes will cause the resulting energies to be no longer strictly size extensive (in a sense of introducing the unlinked terms into the MM-SUMRCC energies).However, our experience with the CI-based singlereference MMCC methods [7,120,122,123] and the MBPT-based renormalized CC approaches, such as CR-CCSD(T) [7, 26, 117-119, 121, 132, 133], demonstrates that the presence of unlinked terms in the MMCC approximations does not have an effect on the excellent performance of the approximate MMCC schemes.In fact, we have recently performed a number of calculations showing that the CR-CCSD(T) approach provides approximately size extensive results, as long as we remain within the range of general applicability of this approach, which is a single bond breaking (cf., e.g., Refs.133,135).Moreover, a number of studies by the Paldus Waterloo group involving the so-called reduced MRCCSD (RMRCCSD) approach indicate that using the relatively inexpensive MRCI wave functions to estimate the higher-order contributions of the CC theory [6,[136][137][138][139][140][141], at the risk of introducing unlinked terms into the calculations, tremendously benefits the CC results.The direct use of the final energy expressions in CC calculations, as is done in our MMCC theory [7,[118][119][120][121][122][123] and its multi-reference extension discussed here and in Ref. 117, which may result in the introduction of unlinked terms, is also exploited in the Brillouin-Wigner MRCC method [32][33][34][35][36][37].As in the approximate MMCC case, the Brillouin-Wigner MRCC approach is not size extensive.However, the Brillouin-Wigner MRCC results are excellent (even for molecular systems containing heavier atoms), which is again suggesting to us that the presence of unlinked terms in the approximate MM-SUMRCC energy expressions may not have a detrimental effect on the final results.We should also keep in mind that all approximate MMCC methods (just like the RMRCCSD approach of Paldus and Li) introduce unlinked terms in very high orders, so that it is quite likely that the results of approximate MM-SUMRCC calculations will be very good, in spite of the presence of unlinked terms in the approximate MM-SUMRCC energy expressions.
Clearly, the new ideas described in this section need to be implemented and tested numerically.Work is under way in our laboratory towards implementing various MM-SUMRCC approximations.The results of this effort will be reported as soon as they become available.

The State-Universal Multi-Reference Coupled-Cluster Method with Perturbative Description of Core-Virtual Excitations: The SUMRCCSD(1) Approach
One of the main problems that slows down further development of the SUMRCC method in the direction of extending it to larger reference spaces is the fact that the Jeziorski-Monkhorst ansatz requires that a separate cluster operator T (p) is assigned to each reference configuration |Φ p (p = 1, . . ., M).This requirement leads to an excessively large number of cluster amplitudes when the number of reference configurations is large.In the Jeziorski-Monkhorst formalism, we are forced to solve for all M T (p) cluster operators, even if we are interested in calculating a few low-lying states.In particular, each individual core-virtual excitation is in the SUMRCC theory represented by as many independent cluster amplitudes as the number of references.This is somewhat counterintuitive, since ideally we should only be required to determine as many cluster amplitudes for a given core-virtual excitation as is the number of electronic states under consideration.For example, in the popular MRCISD approach, we are required to determine as many CI coefficient vectors as is the number of calculated states.The latter number is usually much smaller than the number of reference configurations used in such calculations.As a matter of fact, at least in the first-order MRMBPT, the values of the core-virtual amplitudes representing the doubly excited clusters T (p) 2 do not depend on the reference label p [38].It is, therefore, quite reasonable to introduce an approximation, in which all core-virtual amplitudes of the SUMRCC theory are approximated by their first-order MRMBPT estimates.This new approximation, referred to as the SUMRCCSD(1) method [59], is discussed in this section.
As shown below, the SUMRCCSD(1) approach provides the results of the SUMRCCSD quality at the fraction of the computer cost associated with the SUMRCCSD calculations.Alternative ways of simplifying the SUMRCCSD scheme are possible by considering the state-selective methods employing the Jeziorski-Monkhorst ansatz (cf., e.g., Refs.[30][31][32][33][34][35][36][37].Although these new methods are highly promising, in this paper we focus on simplifying the original SUMRCC method within its conventional effective Hamiltonian formulation, which is exactly what the SUMRCCSD(1) approximation offers by reducing the number of cluster amplitudes that need to be determined by the iterative procedure and by reducing the number of nonlinear equations that need to be solved.
be the excitation operators generating the singly and doubly excited configurations relative to reference |Φ p .In terms of these operators, the singly and doubly excited clusters of the SUMRCCSD approach take the usual form, where (p) t i a and (p) t ij ab are the singly and doubly excited cluster amplitudes obtained by solving the SUMRCCSD equations.Since we are assuming here that M 0 is complete, the operators (p) E a i and (p) E ab ij and the corresponding cluster amplitudes (p) t i a and (p) t ij ab must carry at least one inactive (i.e., core or virtual) spin-orbital label.
In general, the (p) t i a and (p) t ij ab values depend on label p. Indeed, suppose the spin-orbitals i, j, . . .and a, b, . . .(commonly designated as the ρ, σ, . . .spin-orbitals) are obtained in the canonical Hartree-Fock calculations and suppose the reference |Φ 1 is the ground-state Hartree-Fock determinant.Let us construct the remaining reference configurations |Φ p , p = 2, . . ., M, in a usual way by choosing some occupied and some unoccupied spin-orbitals in |Φ 1 as active spin-orbitals and by promoting active electrons (electrons occupying active spin-orbitals in |Φ 1 ) to active spin-orbitals that are unoccupied in |Φ 1 .Let us also introduce the Fock matrix elements where z σ ρ = ρ|z|σ and v τ ω ρσ = ρσ|v|τ ω − ρσ|v|ωτ are the one-electron and antisymmetrized two-electron molecular integrals defining the electronic Hamiltonian, and let us partition the Hamiltonian into the unperturbed operator and the perturbation where in Eq. ( 59) are the molecular orbital energies obtained in canonical Hartree-Fock calculations.With this partitioning of the Hamiltonian, one can perform the usual MRMBPT analysis of the wave function cluster expansions and show, for example, that the first-order MRMBPT estimates for the singly and doubly excited cluster amplitudes, (p) t i a (1) and (p) t ij ab (1), respectively, are where in Eq. ( 62) we used the fact that (1) f i a = 0 (in general, for the canonical Hartree-Fock spinorbitals, the p = 1 matrix elements (1) f σ ρ vanish unless ρ = σ).Clearly, the values of (p) t i a (1) and (p) t ij ab (1) depend on p, since different spin-orbitals are occupied in different reference configurations |Φ p .In addition, the (p) t i a (1) amplitudes vanish for p = 1 and are, in general, nonzero for p > 1.The order-by-order analysis of the SUMRCCSD method demonstrates that differences between cluster amplitudes associated with different references |Φ p are even larger in higher orders of MRMBPT [38].
Although the (p) t i a and (p) t ij ab amplitudes are, in general, p-dependent and should be treated as independent parameters in the process of solving the SUMRCCSD equations, the first-order core-virtual biexcited amplitudes, (p) t ij ab (1), do not depend on label p.We have, This observation suggests an approximation in which, instead of solving for the large number of core-virtual amplitudes (p) t ij ab , we simply estimate their values using the first-order MRMBPT expression, Eq. ( 64) [59].The first-order core-virtual monoexcited amplitudes (p) t i a (1) depend on p, but they are usually much smaller than the (p) t ij ab (1) amplitudes, so that we may simply neglect the core-virtual (p) t i a (1) amplitudes altogether and invoke an additional approximation in which [59] (p) t i a (1) ≡ t i a (1) = 0.
In this way, we can reinforce the fully symmetric treatment of both singly and doubly excited core-virtual cluster amplitudes.The significance of the symmetric treatment of the manifolds of excitations corresponding to different reference configurations |Φ p , which are not treated symmetrically in the standard SUMRCCSD scheme, has been discussed in the previous sections.Equations (64) and ( 65) define the SUMRCCSD(1) method [59].Since the core-virtual amplitudes are determined a priori by Eqs. ( 64) and ( 65), in the SUMRCCSD(1) approach we no longer consider the amplitude equations corresponding to core-virtual excitations.In other words, Eqs. ( 15)-( 17) of the standard SUMRCCSD theory in which (p) E K = (p) E a i , (p) E ab ij are simply ignored, so that the number of equations and the number of unknown cluster amplitudes that need to be determined in the iterative procedure are identical.As shown in Table 1, this considerably reduces the computer effort involved, since in the SUMRCCSD(1) method we only consider a relatively small subset of all SUMRCCSD amplitude equations corresponding to single and double excitations that carry at least one active index, i.e., (p) We use these equations to determine the relatively small subset of all (p) t i a and (p) t ij ab amplitudes that carry at least one active index.Even for small systems, such as the methylene molecule, and even for the simple two-reference (M = 2) singlet case, the savings in the computer effort are substantial.For example, the total number of spin-and symmetry-adapted cluster amplitudes used in the two-reference OSA SUMRCCSD calculations for the the lowest two 1 A 1 states of methylene, as described by the double-zeta plus polarization (DZP) basis set of Refs.142, 143, is 1341 (720 for |Φ 1 and 621 for |Φ 2 ; see Table 1).The number of core-virtual amplitudes for each reference is in this case 318, so that the total number of core-virtual amplitudes considered in the two-reference OSA SUMRCCSD calculations for the DZP methylene molecule is 636.These 636 amplitudes are estimated in the SUMRCCSD(1) method by the first-order MRMBPT expression, Eq. ( 64), and by Eq. ( 65), and only the remaining 705 cluster amplitudes that carry at least one active orbital index are determined iteratively by solving the relevant subset of all SUMRCCSD equations.A similar ∼ 50 % reduction in the number of amplitudes that have to be determined in the SUM-RCCSD(1) iterative procedure characterizes the calculations for methylene employing a larger [5s4p3d2f 1g/3s2p1d] basis set of Ref. 144 (cf.,also,Ref. 52).In this case, the total number of core-virtual amplitudes considered in the two-reference OSA SUMRCCSD calculations is 10432, while the number of all singly and doubly excited cluster amplitudes is 22611 (see Table 1).The 10432 core-virtual amplitudes are estimated in the SUMRCCSD(1) calculations via Eqs.(64) and (65), whereas the remaining 12179 amplitudes are determined iteratively.This significant reduction in the number of cluster amplitudes that have to be determined in the SUMRCCSD(1) iterative procedure would be even larger for larger many-electron systems, since the number of core-virtual amplitudes increases with the number of core electrons.
In order to test the SUMRCCSD(1) method, we performed a few benchmark calculations using the two-reference OSA SUMRCCSD(1) approach, which we implemented by modifying the two-reference OSA SUMRCCSD code described in Ref. 50 (see Refs. 58, 145 for a parallel version).Let us recall that in the two-reference OSA SUMRCCSD method of Ref. 50 (see Refs. 39, 41, 45-49 for earlier developments), the model space M 0 is spanned by two closed-shell configurations |Φ 1 and |Φ 2 involving two active electrons and two active orbitals that belong to different symmetry species of the spatial symmetry group of the system.The M = 2 model space, M 0 = span{|Φ 1 , |Φ 2 }, is complete if we are only interested in the totally symmetric singlet eigenstates of the Hamiltonian [39,41,[45][46][47][48][49][50]54].The explicit equations of the two-reference OSA SUMRCCSD(1) theory, in terms of the OSA mono-and biexcited cluster amplitudes, are a straightforward modification of the two-reference OSA SUMRCCSD equations presented elsewhere [39,41,45,50].
The results of the SUMRCCSD(1) calculations discussed in this section include the lowest two 1 A 1 states of the DZP H4 model [146,147] (see Table 2 and Fig. 1), the lowest two 1 Σ + g states of the Li 2 molecule, as described by the double zeta (DZ) basis set of Refs.148, 149 (see Table 3), and the lowest two 1 A 1 states of methylene, as described by the DZP basis set [142,143] and the [5s4p3d/3s2p] [52] and [5s4p3d2f 1g/3s2p1d] atomic natural orbital [144] basis sets (see Table 4).The results of the SUMRCCSD(1) calculations for H4 and CH 2 were reported earlier [59].The results for Li 2 are new.
We compare the SUMRCCSD(1) results with those obtained with the two-reference SUMR-CCSD and MRMBPT(2) methods.The MRMBPT(2) results were obtained with the method described in Ref. 150 and implemented in gamess [151].It is useful to compare the MRMBPT(2) and SUMRCCSD(1) results, since both approaches rely on the first-order MRMBPT corrections to the wave function.Whenever possible, we also compare the SUMRCCSD(1) results with the results of full CI calculations.The results of the two-reference SUMRCCSD and full CI calculations for the DZP H4 model can be found in Ref. 50.The SUMRCCSD results for the lowest two 1 A 1 states of the DZP model of methylene can be found in Refs.50, 58 (recall that the lowest 1 A 1 state represents in this case the first-excited state; the ground state of CH 2 is 3 B 1 ; cf. the Introduction).The results of the two-reference SUMRCCSD calculations for methylene for the [5s4p3d/3s2p] and [5s4p3d2f 1g/3s2p1d] basis sets were reported in Refs.52, 58.We begin our discussion with the H4 model, which consists of two slightly stretched H 2 molecules arranged in an isosceles trapezoidal configuration, with all nearest-neighbor H-H separations fixed at 2.0 bohr.The geometry of the H4 model is determined by a single parameter α ∈ [0, 1  2 ].The α = 0 and α = 1 2 limits correspond to square and linear conformations, respectively [146].Table 1.Numbers of the spin-and symmetry-adapted singly and doubly excited cluster amplitudes used in the two-reference OSA SUMRCCSD calculations for molecular systems discussed in this work.By changing parameter α, we can continuously vary the degree of configurational quasi-degeneracy involving the ground-state RHF (restricted Hartree-Fock) reference configuration and the doubly excited configuration obtained by promoting two electrons from the highest-energy occupied orbital (1b 2 ) to the lowestenergy unoccupied orbital (2a 1 ).Indeed, in the α = 0 limit, the absolute values of the coefficients at configurations |Φ 1 and |Φ 2 , which dominate the full CI expansions of the lowest two 1 A 1 states in this region, are identical.For α = 0.5, the ground-state wave function becomes essentially nondegenerate, with |Φ 1 representing the leading configuration in the corresponding full CI expansion.The results of our SUMRCCSD(1) calculations for the lowest two 1 A 1 states of the DZP H4 model (the 1 1 A 1 and 2 1 A 1 states) are shown in Table 2 and Fig. 1.In the SUMRCCSD(1), SUMRCCSD, and MRMBPT(2) calculations, we chose orbitals 1b 2 and 2a 1 as active.This choice of active orbitals is justified by the dominant role of the |Φ 1 and |Φ 2 configurations, Eqs. ( 66) and (67), respectively, in the full CI expansions of the lowest two 1 A 1 states of the H4 system.
As can be seen from Table 2 and Fig. 1, the SUMRCCSD(1) potential energy curves representing the lowest two 1 A 1 states of are shifted by ∼ 1 millihartree or less relative to the corresponding SUMRCCSD curves.The errors in the SUMRCCSD(1) results, relative to full CI, range between 0.5 (α ≈ 0) and 2.7 (α ≈ 0.5) millihartree for the ground state and between 4.1 (α ≈ 0) and 3.0 (α ≈ 0.5) millihartree for the first-excited 1 A 1 state.The errors obtained with the SUMRCCSD method are virtually identical to those obtained with the simpler SUMRCCSD(1) approximation.The SUMRCCSD(1) and SUMRCCSD vertical excitation energies ∆ corresponding to the 1 1 A 1 → 2 1 A 1 transition agree to within 0.5-1.5 millihartree for all values of α.For comparison, the MRMBPT(2) approach completely fails for α > 0.15 due to intruder states.The errors in the MRMBPT(2) results for the 1 1 A 1 → 2 1 A 1 excitation energies, relative to full CI, are as large as 172.8 millihartree in the α ≈ 0.5 region.Even in the region of small α values, where the MRMBPT(2) approach works best, the errors in the MRMBPT(2) results for the 1 1 A 1 → 2 1 A 1 excitation energies are much larger than those obtained with the SUMRCCSD(1) method (cf., e.g., the 9.4 millihartree error in the MRMBPT(2) value of ∆ at α = 0 with the 3.6 millihartree error obtained with the SUMRCCSD(1) approach for the same value of α).
The excellent performance of the SUMRCCSD(1) approach is also observed in the calculations for the DZ model of Li 2 .We considered three geometries in this case: the equilibrium geometry, R = R e = 5.051 bohr (R is the internuclear separation), and two stretches of the Li-Li bond, R = 1.5R e and R = 2R e .For larger distances R, the ground and the first-excited states of the 1 Σ + g symmetry are dominated by the RHF configuration and the doubly excited configuration so that it is prudent to choose the highest-energy occupied and the lowest-energy unoccupied molecular orbitals, 2σ g and 2σ u , respectively, in our SUMRCCSD(1), SUMRCCSD, and MRMBPT(2) calculations.The results of the two-reference SUMRCCSD(1), SUMRCCSD, and  68), and |Φ 2 , Eq. ( 69), as references, along with the corresponding full CI results, are shown in Table 3.
The results in Table 3 show that the SUMRCCSD(1) approach provides the potential energy curves of Li 2 that are in excellent agreement with the high quality curves obtained with the SUMRCCSD method.The SUMRCCSD(1) energies for the lowest two 1 Σ + g states of Li 2 are only slightly above the corresponding SUMRCCSD energies [the difference between the SUMRCCSD(1) and SUMRCCSD energies is ∼ 2.5 millihartree, independently of the value of R].In consequence, the errors in the SUMRCCSD(1) results, relative to full CI, are very small (2.7-2.8 millihartree for the ground state and 3.1-3.2millihartree for the first-excited 1 Σ + g state).The errors in the 1 1 Σ + g → 2 1 Σ + g vertical excitation energies (designated by ∆) obtained with the SUMRCCSD(1) method, relative to full CI, are 0.4-0.5 millihartree, independently of the value of R. The SUMRCCSD results for the 1 1 Σ + g → 2 1 Σ + g excitation energies are virtually identical to those obtained with the simpler SUMRCCSD(1) approach.The two-reference MRMBPT(2) method works reasonably well in the quasi-degenerate region corresponding to larger R values, where the lowest two 1 Σ + g states are dominated by configurations ( 68) and ( 69), but the SUMRCCSD(1) results are much more accurate [cf., e.g., the 12.8 millihartree error in the MRMBPT(2) result for the vertical excitation energy ∆ at R = 2R e with the 0.3-0.4millihartree errors obtained at the same value of R with the SUMRCCSD and SUMRCCSD(1) approaches].In the R ≈ R e region, the ground state is dominated by the RHF configuration |Φ 1 , Eq. ( 68), but the first-excited state of the 1 Σ + g symmetry has significant contributions from configurations belonging to M ⊥ 0 .As a result, the error in the MRMBPT(2) value for the vertical excitation energy ∆ is as large as 54.0 millihartree at R = R e .Remarkably enough, the errors in the SUMRCCSD and SUMRCCSD(1) results for the 1 1 Σ + g → 2 1 Σ + g excitation energy are as little as 0.5 millihartree at R = R e .This clearly shows that we can tremendously benefit from incorporating the MRMBPT ideas into the SUMRCCSD scheme.
Our final example is methylene [59].As mentioned in the Introduction, the OSA SUMRCCSD approach provides excellent (∼ spectroscopic) results for the very small energy gap between the ground state, X 3 B 1 , and the lowest excited state of the 1 A 1 symmetry, A 1 A 1 ≡ 1 1 A 1 [51,52].The OSA SUMRCCSD method is also capable of providing the excellent description of the singletsinglet (2 1 A 1 − 1 1 A 1 ) energy separation [50].For example, the full CI value of the singletsinglet energy gap in the DZP methylene molecule is 168.907millihartree [143].The two-reference SUMRCCSD method gives 169.885 millihartree [50], in excellent agreement with full CI.
The success of the two-reference SUMRCCSD method in describing the 2 1 A 1 − 1 1 A 1 energy gap in methylene is largely (but not entirely) related to the quasi-degenerate character of the lowest two 1 A 1 states, which are both dominated by two closed-shell configurations,  Basis set and geometries taken from Ref. [52].
d Basis set and geometries taken from Refs.[52,144].
and |Φ 2 = |(1a 1 ) 2 (2a 1 ) 2 (1b 2 ) 2 (1b 1 ) 2 |, (71) involving active orbitals 3a 1 and 1b 1 .Thus, by choosing orbitals 3a 1 and 1b 1 as active orbitals in the SUMRCCSD calculations, we should (and we do) obtain excellent results for the singletsinglet energy separation.It is interesting to note though that the two-reference MRMBPT(2) method does not provide very good results in this case, in spite of the apparently two-reference character of the lowest two 1 A 1 states.For example, the error in the 1 1 A 1 → 2 1 A 1 excitation energy obtained with the MRMBPT(2) approach for the DZP methylene model, relative to full CI, is 11.344 millihartree.This should be compared to a much smaller, 0.978, millihartree error obtained with the two-reference SUMRCCSD method (see Table 4).
The results in Table 4 show that the description of the lowest two 1 A 1 states of methylene by the two-reference SUMRCCSD(1) approach is almost as good as that provided by its parent SUMRCCSD analog.Although the differences between the SUMRCCSD(1) and SUMRCCSD individual energies are somewhat larger than in the case of the H4 and Li 2 systems, the vertical excitation energies corresponding to the 1 1 A 1 → 2 1 A 1 transition in methylene (the ∆ values in Table 4) obtained in the SUMRCCSD(1) and SUMRCCSD calculations are essentially identical, independent of the basis set employed.Indeed, the differences between the ∆ values resulting from the SUMRCCSD(1) and SUMRCCSD calculations are 0.958 millihartree for the DZP basis set, 1.389 millihartree for the [5s4p3d/3s2p] basis set, and 1.503 millihartree for the largest [5s4p3d2f 1g/3s2p1d] basis set.The difference between the ∆ values obtained in the SUMR-CCSD(1) and full CI calculations with the DZP basis set is 1.936 millihartree, which should be compared to a 0.978 millihartree difference between the SUMRCCSD and full CI values of ∆.
Thus, we can summarize this section by stating that the SUMRCCSD(1) approach provides a viable alternative to the SUMRCCSD method.At least for the small molecular systems tested in this work and in our original study [59], the SUMRCCSD(1) approach is capable of providing the results of the SUMRCCSD quality at the fraction of the computer effort associated with the SUMRCCSD calculations.We are encouraged by the preliminary results overviewed in this section and we will work on generalizing the two-reference SUMRCCSD(1) approach of Ref. 59 to larger model spaces.

Summary
We overviewed our recent results in the area of the genuine SUMRCC theory.We focused on a new type of the noniterative corrections to the SUMRCC energies, obtained by extending the MMCC formalism of Refs.7, 117-123 to a multi-reference case, and on combining the SUMRCCSD and MRMBPT approaches by replacing the cluster amplitudes of the SUMRCCSD method that carry only core and virtual orbital indices by their first-order MRMBPT estimates.We believe that the multi-reference variant of the MMCC theory provides us with a systematic way of improving the SUMRCC results by adding relatively simple, noniterative, state-specific corrections to the SUMRCC energies.In particular, the multi-reference extension of the MMCC formalism suggests simple ways of restoring the symmetric treatment of manifolds of excitations corresponding to different reference configurations, broken by the conventional SUMRCC approximations, and simple ways of incorporating higher-than-double excitations in the SUMRCCSD method.We demonstrated that the SUMRCCSD(1) approach of Ref. 59, in which the core-virtual cluster amplitudes of the SUMRCCSD theory are approximated by the first-order MRMBPT expressions, gives the results of the SUMRCCSD quality at the fraction of the computer effort associated with the SUMRCCSD calculations.We believe that the SUMRCCSD(1) approach may allow us to extend the existing two-orbital/two-electron SUMRCCSD approaches to larger active spaces, since the SUMRCCSD(1) method eliminates one of the main bottlenecks of the SUMRCC theory, which is the requirement of assigning a separate large set of core-virtual cluster amplitudes to each reference configuration.Clearly, there may exist other ways of estimating the SUMRCC cluster amplitudes via low-order MRMBPT expressions.We are currently exploring other MRMBPTbased SUMRCC approximations.The results of these studies will be reported as soon as they become available.

3 A
New Type of the Noniterative Corrections to Multi-Reference Coupled-Cluster Energies: The Method of Moments of the State-Universal Multi-Reference Coupled-Cluster Equations 7, 118, 119 (see, also, Refs.120-123).The multi-reference MMCC(m A , m B ) approximations [referred to as the MM-SUMRCC(m A , m B ) or MM-SUCC(m A , m B ) schemes] are obtained by truncating the summation over n in Eq. (33) at n = m B , where m A < m B < N. The multi-reference MMCC(m A , m B ) |Φ p moments.The MM-SUCC(2,3) energy expression is[117] MRMBPT(2)  formula by their SUMRCCSD values, as a source of wave function |Ψ µ in the MM-SUCC(2,3) energy expressions, Eqs.(48) and(49).The MRMBPT(2) wave function is the lowest-order MRMBPT wave function that contains information about the T (p) 3 cluster components.The use of wave functions |Ψ µ of this type in Eqs. ( c spanned by the excited configurations relative to |Φ p are usually different for different p values.As mentioned earlier, this asymmetric treatment of manifolds of excitations corresponding to different references |Φ p causes that the conventional SUMRCC approaches based on Eqs. ( 149nd N D designate, respectively, the numbers of all singly and doubly excited cluster amplitudes used in the two-reference OSA SUMRCCSD calculations.N S(CV ) and N D(CV ) designate, respectively, the numbers of singly and doubly excited cluster amplitudes corresponding to core-virtual excitations.The [4s2p] contraction of the Dunning-Hay basis set of Ref. 148 (see Ref.149).The lowest-energy molecular orbital (1a 1 ) is kept frozen and the Cartesian components of the d and (if present in a basis set) f and g orbitals are employed.
a N b The H4 model of Ref. 146.c The [2s1p] basis set taken from Ref. 147.d e f The DZP basis set taken from Refs.142, 143.g Basis set taken from Ref. 52. h Basis set taken from Refs.52, 144.

Table 4 .
59eBased on the results reported in Ref.59.The lowest-energy molecular orbital was kept frozen and the Cartesian components of the d and (if present in a basis set) f [142,143]set, geometries, and full CI results taken from Refs.[142,143].