Abstract
We describe a method for computing transport coefficients from the direct evaluation of large deviation functions. This method is general, relying on only equilibrium fluctuations, and is statistically efficient, employing trajectory based importance sampling. Equilibrium fluctuations of molecular currents are characterized by their large deviation functions, which are scaled cumulant generating functions analogous to the free energies. A diffusion Monte Carlo algorithm is used to evaluate the large deviation functions, from which arbitrary transport coefficients are derivable. We find significant statistical improvement over traditional Green–Kubo based calculations. The systematic and statistical errors of this method are analyzed in the context of specific transport coefficient calculations, including the shear viscosity, interfacial friction coefficient, and thermal conductivity.
1. Introduction
The evaluation of transport coefficients from molecular dynamics simulations is a standard practice throughout physics and chemistry. Despite significant interest and much study, such calculations remain computationally demanding. Traditional methods exploit Green–Kubo relationships [1,2] and rely on integrating equilibrium time correlation functions [3,4,5,6]. While general, depending only on identifying a relevant molecular current, these methods often suffer from large statistical errors due to finite time averaging, making them cumbersome to converge [7,8]. Alternative methods exist that directly drive a current through the system by the application of specific boundary conditions [9,10] or by altering the equations of motion [11,12,13,14]. These direct methods typically mitigate sampling difficulties by requiring that only the current is averaged rather than its time correlation function. However, such methods are generally not transferable to different transport processes. Moreover, as a nonequilibrium simulation, the details of how the current is generated can affect their convergence [15] and fidelity [16,17]. Here, we propose a new way to compute transport coefficients that utilizes only equilibrium fluctuations, as in Green–Kubo calculations, but is evaluated by averaging a current, as in direct methods. Rather than applying a physical field to drive a current, we apply a statistical bias to the system’s dynamics and measure the resultant response. The response is codified in the relative probability of a given current fluctuation, so this calculation is identical to the evaluation of a free energy, albeit in a path ensemble [18]. Such path ensemble free energies are known as large deviation functions [19], and with trajectory based importance sampling methods to aid in their calculation, we arrive at a method to evaluate transport coefficients that is both general and quickly convergent.
Large deviation theory has emerged as a useful formalism for considering the fluctuations of time integrated observables [20]. In fact, large deviation theory underpins many recent developments in nonequilibrium statistical mechanics, including generalized fluctuation theorems [21,22] and thermodynamic uncertainty principles [23,24]. The large deviation function is a scaled cumulant generating function and, like its equilibrium counterpart, the free energy, it codifies the stability and response of dynamical systems. While these advances in nonequilibrium statistical mechanics have yielded important relationships for systems far from equilibrium, they have also brought new insight into near-equilibrium phenomena. Andrieux and Gaspard have illustrated this especially clearly, resolving how Onsager’s reciprocal relations and their generalizations beyond linear response follow from the large deviation function for the total entropy production and its symmetry provided by the fluctuation theorem [25]. They have shown the connection between the moments of a large deviation function for a time integrated current and phenomenological transport coefficients within both linear and nonlinear response regimes [26,27]. We use this insight—that large deviation functions can encode the dynamical response of a system driven away from equilibrium—to construct an efficient method for the evaluation of transport coefficients from molecular dynamics simulations.
To compute a large deviation function for a time integrated molecular current, we employ a trajectory based importance sampling procedure. Beginning with transition path sampling [28], Monte Carlo algorithms have been derived to uniformly sample path space for systems evolving with detailed balanced dynamics. These methods have found application in computing rate constants and finding reaction pathways for complex condensed phase processes spanning autoionization to viral capsid assembly [29,30,31,32,33,34,35,36]. Indeed, it was identified in early work that a reaction rate constant could be computed from a thermodynamic-like integration along path space, resulting in a relative free energy in trajectory space [18,28]. This observation was never generalized to other dynamical responses, like phenomenological transport coefficients. With the development of diffusion Monte Carlo algorithms like the cloning algorithm that directly target large deviation functions [37,38], such generalization is possible. Recent extensions of diffusion Monte Carlo algorithms that incorporate importance sampling using an iterative feedback approach [39], cumulant expansions [40], or approximate auxiliary processes [41] have improved the efficiency of these algorithms enough to make calculations for complex, high-dimensional systems possible. In this way, we proceed numerically by computing directly an effective thermodynamic potential like that Onsager identified when he first formulated his thermodynamic theory of linear response [42], with the large deviation function serving to characterize this potential. This conceptually distinct approach from traditional methodologies provides new ways of thinking about transport processes that are amenable to the kinds of analysis typically reserved for static equilibrium observables, such as their dependence on ensemble and generalization to nonlinear regimes [43]. While we are restricted to linear response coefficients in this article, generalization to nonlinear response is straightforward [27].
The rest of the paper is organized in the following manner. In Section 2, we summarize important results of the large deviation theory, and illustrate its connection to phenomenological transport coefficients. We also outline the simulation methodology used to compute large deviation functions. In Section 3, we test our method by comparing it with calculations using the Green–Kubo formalism. We study three specific cases: the shear viscosity of TIP4P/2005 water [44], the interfacial friction coefficient between a Lennard–Jones fluid and a Lennard–Jones wall, and the thermal conductivity of a Weeks–Chandler–Anderson [45] solid. We use these models to frame a discussion of the relative systematic and statistical errors associated with our new methodology in comparison to traditional Green–Kubo calculations. We provide some final remarks on our method in Section 4.
2. Theory and Methodology
We consider systems evolving according to a Markovian stochastic dynamics, though generalization to deterministic dynamics is possible. In the absence of an external stimulus, these dynamics obey microscopic reversibility and thus will sample a Boltzmann distribution [46]. Under a bias, applied either at the boundaries of the system or through an external field, a current is expected to arise. If the bias is small, the response of the system can be linearized and a transport coefficient, L, is defined through
where J is a current and X is its conjugate generalized force, which could be proportional to a temperature or concentration gradient. The entropy production for this process is equal to the product of the force and the current, or [47]. The transport coefficient, L, is thus a response function relating the applied force to the generated current, , in the limit that . This is the object we aim to compute.
2.1. Transport Coefficients from Large Deviation Functions
To compute the response coefficient, L, we must identify a corresponding dynamic variable whose fluctuations will report on the system’s response to the bias. Specifically, we define a time averaged current as
where is some observation time, and is a fluctuating variable computable from the molecular configuration, , at time t. If j is correlated over a finite amount of time, then the fluctuations of J can be studied by computing its cumulant generating function,
where is known as the large deviation function and is a statistical field conjugate to J [19]. Here, the average is taken within an ensemble of paths of length , denoted as a vector of all the configurations visited over that time, or . The probability of observing such a path is given by
where represents the distribution of initial conditions, and are the transition probabilities between time-adjacent configurations.
As a cumulant generating function, the derivatives of report on the fluctuations of the current J. For example, the first two derivatives yield
where is the average current and is its deviation from the mean, whose squared average yields the variance of J. Because the dynamics we consider obey microscopic reversibility, obeys a generalized fluctuation theorem, where X is the generalized force as in Equation (1). This symmetry relates the likelihood of a current to its time-reversed conjugate [26,48], and implies a fluctuation-dissipation relationship, or a relation between the second derivative of the large deviation function
and the transport coefficient, L. Here, denotes the average in the biased path ensemble. In the limit of ,
where, as previously observed [26], we find that the curvature of the large deviation function around is equal to the response function L up to a factor of 2. Analogously, higher order derivatives can be related to nonlinear transport coefficients. For small values of , the large deviation function can be expanded as
which is parabolic, and completely determined by L. This implies the distribution of J is Gaussian, with a variance of . This inversion is a direct reflection of Onsager’s notion of an effective thermodynamic potential, where the probability of a current is given by the exponential of the entropy production.
The connection between the large deviation result and the Green–Kubo formalism can be understood by expanding the definition of J in Equation (5). Without loss of generality, within Green–Kubo theory, a transport coefficient can be computed from
where is an integral over the time correlation function of , and the long time limit is equal to L [47]. As for an equilibrium system, where , it is straightforward to relate the second derivative of the large deviation function with respect to evaluated at , to L as
where we have made use of the time-translational invariance of the equilibrium averaged time correlation function and assumed that decays faster than . This equation is known as the Einstein–Helfand relation and is well known to yield an equivalent expression for transport coefficients [49]. Provided an estimate of accurate enough to compute , we thus have a means of evaluating L.
2.2. Calculation of Large Deviation Functions
To evaluate the large deviation function for J, we use a variant of path sampling known as the cloning algorithm [37,38]. The cloning algorithm is based on a diffusion Monte Carlo procedure [50] where an ensemble of trajectories are integrated in parallel. Each individual trajectory is known as a walker, and collectively the walkers undergo a population dynamics whereby short trajectory segments are augmented with a branching process that results in walkers being pruned or duplicated in proportion to a weight. This algorithm has been used extensively in the study of driven lattice gases [51] and models of glasses [52,53]. Alternative methods for importance sampling trajectories, such as transition path sampling [31] or forward flux sampling [54], could be used similarly.
Generally, to importance sample large deviation functions, the original trajectory ensemble, , can be biased to the form [55]
where the large deviation function is the normalization constant computable as in Equation (3). Ensemble averages for an arbitrary observable, , within the unbiased distribution and the biased one, are related by
where the denominator is in the limit of large . If we choose in Equation (12), then we find a familiar relationship between biased ensembles
where is the probability of observing a given value of the current J in the biased ensemble, and is that in the unbiased ensemble. This demonstrates that is computable as a change in normalization through histogram reweighting [55].
In order to arrive at a robust estimate for , the two distributions, and , must have significant overlap. However, for large systems or long observation times, each distribution narrows, and sampling by brute force is exponentially difficult. To evaluate the large deviation function, the cloning algorithm samples by noting that it can be expanded to
where we have discretized the integral for J with timestep , and are the corresponding transition probabilities per . The argument of the product is the transition probability times a bias factor that is local in time. This combination of terms cannot be lumped together into a physical dynamics, as it is unnormalized. However, it can be interpreted as a population dynamics where the nonconservative part proportional to the bias is represented by adding and removing walkers. In particular, in the cloning algorithm, trajectories are propagated in two steps. First, walkers are integrated according to the normalized dynamics specified by for a trajectory of length . Over this time, a bias is accumulated according to
where, due to the multiplicative structure of the Markov chain, the bias is simply summed in the exponential. After the trajectory integration, identical copies of the ith trajectory are generated in proportion to ,
where is a uniform random number between 0 and 1 and is the floor function. This process will result in a different number of walkers, and thus each walker in the new population is copied or deleted uniformly until are left. With this algorithm, the large deviation function can be evaluated after each branching step as the deviation of the normalization
which is an exponential average over the bias factors of each walker. In the limit of a large number of walkers, this estimate is unbiased [56]. The local estimate can be improved by averaging over the observation time
which, upon repeated cycles of integration and population dynamics, yields a statistically converged estimate of . Alternatively, can be computed from histogram reweighting using Equation (13) from the distribution of Js generated from each walker. In the preceding, all calculations are integrated with LAMMPS [57] and where specified, combined with a diffusion Monte Carlo code called the Cloning Algorithm for Nonequilibrium Stationary States (CANSS) [41]. A detailed description of convergence criteria for this algorithm can be found in Reference [58].
3. Results and Discussion
To illustrate the utility of our method, we have tested it in three model transport processes. In Table 1, we list all the transport coefficients considered in this section, along with their corresponding Green–Kubo relations and the dynamical variables whose large deviation function we compute. For all of the models studied, we generate trajectories by integrating a Langevin equation of motion, as a Markovian, stochastic equation is needed for the calculation of the large deviation function using the method presented in the previous section. For the position of particle i, denoted , this equation has the form
where the dots denote time derivatives, is the total intermolecular potential from all N particles at position , is the particle’s mass, is the frictional coefficient, and is a random force. The statistics of the random force is determined by the fluctuation-dissipation theorem, which, for each component, is
where is Boltzmann’s constant times temperature, is Dirac’s delta function and is the Kronicker delta. For all our calculations, we have chosen carefully so that the thermostat has little effect on the transport properties of the system and we are able to recover response coefficients consistent with calculations done using Newtonian trajectories, when possible.
Table 1.
Transport coefficients with corresponding Green–Kubo formula and dynamical observable.
3.1. Validation of Methodology: Shear Viscosity
To illustrate our methodology, we first consider the evaluation of the shear viscosity, , which is typically easy to compute with traditional methods. The phenomenological law that defines the shear viscosity is Newton’s law of viscosity, which relates the shear stress of a fluid to an imposed shear rate
where is the -component of the stress tensor, and is the gradient of the x component of velocity in the y-direction. The relevant molecular current for this process is the momentum flux, which is equivalent to . The stress tensor is computable as
where V is the constant volume of the system, and and are the velocity and force exerted on particle i in the k direction, respectively. Given this identification of the current, its associated thermodynamic force is . From Equations (1) and (27), we can identify the relation between the shear viscosity and L as .
We compute the shear viscosity for the TIP4P/2005 model of water [44], which has been reported previously using Green–Kubo theory [59]. Our simulation system consists of 216 water molecules with density = 1 g/ and temperature T = 298 K, integrated with the Langevin equation in Equation (19) with . The simulation is thus done in an ensemble of constant number of molecules N, volume V, and temperature T (NVT). We have verified that, for , the shear viscosity computed is the same as that from an ensemble with constant energy (NVE) or an NVT ensemble using a Nosé–Hoover thermostat [60]. The molecules are held rigid with the SHAKE algorithm [61] and we employ a timestep of 1 fs. For all of the calculations, we first equilibrate the simulation for 20 ns.
First, we compute using the Green–Kubo formula in Equation (21). Note that other elements of the stress tensor can be averaged over to achieve better statistics. In both the Green–Kubo method and the new proposed calculation, the statistical benefit would be identical, so for notational clarity we will consider only the component. We average the stress autocorrelation function over 20 ns, and this function is shown in Figure 1a. The time correlation function is oscillatory due to the inertial recoil of the dense fluid, and has largely decayed within 1 ps, though there is a slow component to the decay from the approximate conservation of momentum for times shorter than the timescale for the Langevin thermostat. From Green–Kubo theory, the viscosity is the integral of this function. Shown in the main part of Figure 1a, is as a function of the upper limit of the integral as in Equation (9), which has plateaued by ps. Also shown are the associated statistical errors, which grow with . The calculated shear viscosity from five independent simulations and a cutoff time of 20 ns is ·. This value is in good agreement with that previously reported [59].
Figure 1.
(a) shear viscosity of the TIP4P/2005 water model as a function of the integration time . Shading indicates the error bar computed from the standard error. The inset is the normalized autocorrelation function . (b) large deviation function for , as a function of the biasing parameter . The error bars indicate the standard error of the mean from five individual samples. The red line shows the parabolic fit of the data. The inset is the original and the biased probability distribution of at .
Alternatively, we can compute the shear viscosity from the large deviation function for defined in Equation (22). As the shear viscosity decays quickly for this model, importance sampling is unnecessary, so we illustrate the basic principle by brute force reweighting. Specifically, we generate an estimate of with ps, from a 20 ns long equilibrium trajectory. Then, we reweight the distribution to compute according to Equation (13). Examples of the equilibrium and biased distributions are shown in the inset of Figure 1b. The added bias shifts the distribution to a different mean, and the overlap between these two distributions determines the efficiency of our sampling. The large deviation function , shown in the main panel in Figure 1b, is evaluated by Equation (3) for different s. The parabolic form of is in agreement with the Gaussian distribution of the fluctuation in in the linear response regime. Given that is a parabola centered at the origin, it is straightforward to compute from fitting the curve in Figure 1b to Equation (8) over a range of . From this, we obtain an estimate of the viscosity ·, which is in agreement with our Green–Kubo result. Both errors reported are the standard error of the mean.
3.2. Analysis of Systematic Error: Interfacial Friction Coefficient
Having validated the basic methodology, we next focus on the systematic errors determining its convergence. As a case study, we consider computing the interfacial friction coefficient between a liquid–solid interface. This friction coefficient is defined by the linear relationship,
where is the total lateral force exerted on the solid wall on the x direction, A is the lateral area of the interface, and is the tangential velocity of the fluid relative to the solid. As before, we can identify a relevant molecular current as the momentum flux along the wall, in this case proportional to
the sum of the x component of the forces of all liquid particles on the wall particles, where the force is given by the gradient of the liquid–solid interaction potential, . Given this current, we can identify its conjugate force as , and, consequently, the friction coefficient is given by .
The system is modeled as a fluid of monatomic particles confined between two stationary atomistic walls parallel to the plane. The fluid particles interact through a Lennard–Jones (LJ) potential with characteristic length scale d, energy scale , time with m as the mass of the fluid particle, and is truncated at 2.5d. Reduced units will be used throughout this and the following section. The walls are separated by a distance along the z-axis. Periodic boundary conditions are imposed along x- and y-directions, with the lateral dimensions of the simulation domain . Each wall is constructed with 1568 atoms distributed as (111) planes of face-centered-cubic lattice with density , while the fluid density is . The wall atoms do not interact with each other, but are allowed to oscillate about their equilibrium lattice sites under the harmonic potential , with a spring constant . The mass of the wall atoms is chosen to be . The interaction between the wall and the fluid atoms is also modeled by a LJ potential with the same length scale d and truncation, but a slightly smaller energy , to model the solvophobicity of the wall [62]. Only the wall particles are thermostatted by the Langevin equations in Equation (19) using .
Previous studies have recognized that is difficult to compute due to the confinement of the corresponding hydrodynamic fluctuations [63,64], which results in a large systematic error. This difficulty has led to some questioning the reliability and applicability of Green–Kubo calculations, such as the one derived in [65] and shown in Equation (23), to compute . Indeed, we have found that the details of the simulation, such as the ensemble, system geometry and used in the Langevin thermostat, all have an important influence on the calculation of . This sensitivity is because the fluctuations that determine the friction are largely confined to two spatial dimensions, which is well known to result in correlations that have hydrodynamic long time tails, whose integral may be divergent [66]. However, both our large deviation function method and the Green–Kubo calculations are based on equilibrium fluctuations. Provided an ensemble, simulation geometry, and equation of motion, the system samples the exact same trajectories, so we expect the friction coefficient computed in both ways to agree. Shown in the inset of Figure 2a is the Green–Kubo correlation function, which includes a very slow decay extending to at least 100 , following short time oscillatory behavior from the layered density near the liquid–solid interface. The main panel of Figure 2a shows computed with increasing integration time, . Averaging over four independent samples with a cutoff , our estimation of the friction coefficient is . The interfacial friction coefficient is also computed from the large deviation function, with , using the time integrated force, Equation (24), as our dynamical observable. The large deviation function and the average time integrated force, , are shown in the main panel and inset of Figure 2b, respectively, demonstrating that within the range of we consider the system still responds linearly. With and , importance sampling gives us an estimate of the friction coefficient as , in reasonable agreement with the Green–Kubo estimate and with previous reports [63].
Figure 2.
(a) interfacial friction coefficient computed from Green–Kubo method as a function of the integration time . Inset is the normalized force autocorrelation function . (b) large deviation function of the dynamical observable with . The red line is the parabolic fit. The inset is the average observable in the biased ensemble, with the linear fit in red.
In both the Green–Kubo and the large deviation function calculations, the main source of systematic error is from finite time. This error is especially highlighted in this example, where the time correlation function decays very slowly. We consider the systematic errors in the estimate of by defining a relative error as
where is the finite time value of the friction coefficient, and its asymptotic value at . The form of the time dependent systematic error is different in the Green–Kubo method compared to the large deviation estimate. In the Green–Kubo method, systematic errors come from truncating the integral before the correlation function has decayed, and we denote this time, , as the cutoff time in the integral of the correlation function. In the large deviation calculation, systematic errors come from both truncating the integral as well as sub-time-extensive contributions to the exponential expectation value, which are more analogous to finite size effects in normal free energy calculations. These contributions are both determined by the path length . The relative systematic error is shown in Figure 3 for both methods. For this case, it appears that the Green–Kubo method always has a smaller error than the large deviation function method, though their magnitudes are comparable.
Figure 3.
Relative systematic error due to finite time in the estimation of in the Green–Kubo method (black) and the large deviation function (LDF) method (blue). The time, t, in the x-axis denotes the upper limit of the integral, , in the Green–Kubo method, or the total length of the trajectory, , in computing the large deviation function. The red line is a fit to the function , and the orange line is a fit to .
In the Green–Kubo method, it follows that, if we know the analytical form of the correlation function, we can determine the scaling of the relative error. In the case of interfacial friction, Barrat and Boquet have proposed that for a cylindrical geometry where the dimension on the confined direction is much smaller than the other two directions, the force autocorrelation should decay asymptotically as ∼ using hydrodynamic arguments [65]. This is a direct consequence of the fact that the velocity autocorrelation function decays as ∼ in a two-dimensional system [66], neglecting the self-consistent mode coupling correction that adds an imperceptible correction [67,68]. This is confirmed in our simulation result in Figure 3 (orange line), where the integral of the force correlation function decays as ∼.
Since the large deviation function has a Gaussian form, we can analyze the form of the finite time correction exactly as
where is the long time limit of the large deviation function, and is its finite time estimate. This follows from a fluctuation correction about a saddle point integration. Physically, this correction arises from a that is too short, such that is not the dominant contribution to the tilted propagator, but rather includes temporal boundary terms from the overlap of the distribution of initial conditions and the steady state distribution generated under finite [56]. If we expand the first term, we arrive at
which consists of the term included in the Green–Kubo expression, as well as an additional term modulated by a factor of . Given that the correlation decays as ∼, the first term on the right hand side scales as ∼, as in the Green–Kubo method, while the second term scales as ∼. This form is shown in Figure 3 and agrees very well with our data. These additional terms explain why the magnitude of the systematic error is larger for the large deviation function. In cases where the Green–Kubo correlation function decays faster than , we expect that the dominant contribution to the error will come from the last term in Equation (32).
3.3. Analysis of Statistical Error: Thermal Conductivity
We finally discuss the statistical error of our method by studying the thermal conductivity, , of a solid system with particles that interact via the Weeks–Chandler–Anderson (WCA) potential [45]. The thermal conductivity is defined through Fourier’s law,
where is the energy current per unit area and is the temperature gradient. From the expression for entropy production, the thermodynamic force is given by , and so the thermal conductivity . As the relevant molecular current, we study the fluctuations of the heat flux given by
where is the per-particle energy, is the force on atom i due to its neighbor k from the pair potential, and is the coordinate vector between the two particles. We use a system size of unit cells, with lattice spacing 1.49d. A Langevin thermostat with maintains the system at the state point , , which yields identical results for as an NVE calculation. We focus on the diagonal component, , of the thermal conductivity tensor.
Within Green–Kubo theory, the thermal conductivity can be computed by integrating the autocorrelation function of the x component of the heat flux, , as in Equation (25). The inset of Figure 4a is the decay of the autocorrelation function, which comprises a fast decay from the high-frequency vibrational modes, followed by a slower decay that contributes most to the thermal conductivity and arises due to the low frequency acoustic modes [69]. To compute from the integral, as shown in the main part of Figure 4a, the upper time limit is chosen as , though the relaxation of the correlation extends only to around 5. To compute from the large deviation function, we study fluctuations in the time averaged heat flux, , defined in Equation (26). The transport coefficient, , is again calculated using Equation (10) by assuming the large deviation function as a parabola, which is justified in Figure 4b. The inset there shows clearly the linear response of the biased ensemble average, , computed from Equation (12). Given sufficient statistics the two methods converge to the same value. The estimate of thermal conductivity from the Green–Kubo method using a long trajectory of is , while the estimate from the large deviation function using walkers and is .
Figure 4.
Calculation of the thermal conductivity, , of a WCA solid at , . (a) calculated by integrating the heat flux correlation function up to time . The data are averaged from four samples and the error bars are standard deviations, which are smaller than the symbols. The inset shows the normalized heat flux correlation function, . (b) large deviation function of dynamical observable, , as a function of the bias . The red line is the parabolic fit. The inset is the average observable in the biased ensemble, , as a function of with the linear fit in red.
While the average values of agree between the two methods, the statistical convergence varies significantly. To make a fair comparison, we set the total observation time of the trajectories to the same time as the upper limit of the Green–Kubo integral, i.e., , which is much longer than the characteristic decay of the current autocorrelation function. To compensate for computational overhead of propagating trajectories in parallel in the cloning algorithm, the total averaging time of the Green–Kubo method is chosen as , and equals the walker number, , so that the two methods require approximately the same computational effort. Both and will be denoted as reflecting the number of independent samples of each fluctuating quantity. We measure the statistical error by the relative error
which is plotted in Figure 5 for both methods. As usual, the statistical error depends on both the relative size of observable fluctuations, and the number of independent samples. We find that as the standard deviations of both methods scale as as expected, our importance sampling clearly helps to suppress the statistical error compared to the Green–Kubo method with similar computational effort, decreasing the magnitude of the error by an order of magnitude at fixed . Even though we have to choose a bias small enough to guarantee a linear response, we do see that larger bias helps to yield statistically reliable results.
Figure 5.
Relative statistical error in the measurement of , the Green–Kubo method (black) and the large deviation function method with (red) and (blue). denotes the number of walkers used in evaluating the large deviation function, or , an indicator of the total averaging time in the Green–Kubo method. The solid lines are fits of function .
Jones and Mandadapu have performed a rigorous error analysis on the estimates of Green–Kubo transport coefficients with the assumption that the current fluctuations follow a Gaussian process [6]. They found that the variance of is a monotonically increasing function of , and arrived at an upper bound for the relative error
which depends only on the number of trajectory segments of length . As a consequence, the statistics become worse when the system has longer correlation times, and there are no ways of controlling the intrinsic variance of the observable. On the other hand, in the large deviation method, the relative error in the large deviation function is
which depends not only on the number of samples, in this case , but also has a dependence on and L. In general, as increases, the walkers will become more correlated. However, within the regime of linear response, or to first order in , the number of uncorrelated walkers should be . Because the large deviation function, , scales as while its second derivative, , has no dependence on , the relative size of the fluctuations can be tuned by changing away from 0. This is verified in Figure 5, where increased generates an order of magnitude reduction in the statistical error relative to the Green–Kubo calculation. This decrease in the statistical error is also confirmed for a series of s. This tunability afforded by the large deviation function calculation is the same advantage afforded by direct simulation of transport processes where the relative size of fluctuations is determined by the size of the average current produced by driving the system away from equilibrium. Instead of evaluating from the large deviation function directly, we could have derived it from the change in the average current produced at a given . However, in such a case, the relative error would only scale as rather than .
4. Conclusions
In this paper, we have explored the possibility of calculating transport coefficients from a large deviation function or a path ensemble free energy. The robustness of our method is tested by a variety of model systems ranging in composition and complexity of molecular interactions. Our method is general, and we expect the addition of importance sampling to be beneficial in instances where statistical errors are dominant. More precisely, our analysis shows that the systematic errors for both the Green–Kubo calculation and the large deviation calculation are asymptotically the same if the time correlation function decays faster than . If the correlation function decays slower, than there will be a larger systematic error for the large deviation function calculation that will need to be converged at large . In such cases, the form of this error follows from Equation 32 and scales as . Such slow decay is expected for low-dimensional systems where the current includes hydrodynamic modes. Our analysis of the relative statistical errors between the Green–Kubo and the large deviation function calculations show that our method requires generically fewer statistically uncorrelated samples for comparable statistical accuracy. This is a consequence of the importance sampling employed. The magnitude of this statistical efficiency, defined as , or the number of independent samples needed for a given error, increases linearly with the size of the transport coefficient, L, and increases rapidly with the increasing bias, as .
While we have considered only linear response coefficients, our method can be easily extended to the nonlinear regime or to off-diagonal entries in the Onsager matrix, where Green–Kubo formulas are even more cumbersome to evaluate and few direct methods exist or can be formulated. These extensions are possible since the diffusion Monte Carlo algorithm is capable of sampling rare fluctuations in the non-Gaussian tails of the distribution. Moreover, it is also possible to probe the response around nonequilibrium steady states, as the method presented here does not rely on an underlying Boltzmann distribution.
Acknowledgments
D.T.L. and C.Y.G. was supported by the UC Berkeley College of Chemistry. The authors would like to thank Ushnish Ray for useful discussions and for developing the use of LAMMPS with the CANSS package, available at https://github.com/ushnishray/CANSS for the calculation of nonequilibrium properties of complex systems.
Author Contributions
C.Y.G. and D.T.L. conceived and designed the simulations; C.Y.G. performed the simulations; C.Y.G. and D.T.L. analyzed the data; and C.Y.G. and D.T.L. wrote the paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Green, M.S. Markoff random processes and the statistical mechanics of time-dependent phenomena. II. Irreversible processes in fluids. J. Chem. Phys. 1954, 22, 398–413. [Google Scholar] [CrossRef]
- Kubo, R. Statistical-mechanical theory of irreversible processes. I. General theory and simple applications to magnetic and conduction problems. J. Phys. Soc. Jpn. 1957, 12, 570–586. [Google Scholar] [CrossRef]
- Levesque, D.; Verlet, L.; Kürkijarvi, J. Computer “experiments” on classical fluids. IV. Transport properties and time-correlation functions of the Lennard–Jones liquid near its triple point. Phys. Rev. A 1973, 7, 1690. [Google Scholar] [CrossRef]
- Schelling, P.K.; Phillpot, S.R.; Keblinski, P. Comparison of atomic-level simulation methods for computing thermal conductivity. Phys. Rev. B 2002, 65, 144306. [Google Scholar] [CrossRef]
- Galamba, N.; de Nieto Castro, C.A.; Ely, J.F. Thermal conductivity of molten alkali halides from equilibrium molecular dynamics simulations. J. Chem. Phys. 2004, 120, 8676–8682. [Google Scholar] [CrossRef] [PubMed]
- Jones, R.E.; Mandadapu, K.K. Adaptive Green–Kubo estimates of transport coefficients from molecular dynamics based on robust error analysis. J. Chem. Phys. 2012, 136, 154102. [Google Scholar] [CrossRef] [PubMed]
- Evans, D.J.; Streett, W.B. Transport properties of homonuclear diatomics: II. Dense fluids. Mol. Phys. 1978, 36, 161–176. [Google Scholar] [CrossRef]
- Hess, B. Determining the shear viscosity of model liquids from molecular dynamics simulations. J. Chem. Phys. 2002, 116, 209–217. [Google Scholar] [CrossRef]
- Tenenbaum, A.; Ciccotti, G.; Gallico, R. Stationary nonequilibrium states by molecular dynamics. Fourier’s law. Phys. Rev. A 1982, 25, 2778. [Google Scholar] [CrossRef]
- Baranyai, A.; Cummings, P.T. Steady state simulation of planar elongation flow by nonequilibrium molecular dynamics. J. Chem. Phys. 1999, 110, 42–45. [Google Scholar] [CrossRef]
- Hoover, W.G.; Evans, D.J.; Hickman, R.B.; Ladd, A.J.C.; Ashurst, W.T.; Moran, B. Lennard–Jones triple-point bulk and shear viscosities. Green–Kubo theory, Hamiltonian mechanics, and nonequilibrium molecular dynamics. Phys. Rev. A 1980, 22, 1690. [Google Scholar] [CrossRef]
- Evans, D.J. Homogeneous NEMD algorithm for thermal conductivity—Application of non-canonical linear response theory. Phys. Lett. A 1982, 91, 457–460. [Google Scholar] [CrossRef]
- Mandadapu, K.K.; Jones, R.E.; Papadopoulos, P. A homogeneous nonequilibrium molecular dynamics method for calculating thermal conductivity with a three-body potential. J. Chem. Phys. 2009, 130, 204106. [Google Scholar] [CrossRef] [PubMed]
- Müller-Plathe, F. A simple nonequilibrium molecular dynamics method for calculating the thermal conductivity. J. Chem. Phys. 1997, 106, 6082–6085. [Google Scholar] [CrossRef]
- Zhou, X.W.; Aubry, S.; Jones, R.E.; Greenstein, A.; Schelling, P.K. Towards more accurate molecular dynamics calculation of thermal conductivity: Case study of GaN bulk crystals. Phys. Rev. B 2009, 79, 115201. [Google Scholar] [CrossRef]
- Tuckerman, M.E.; Mundy, C.J.; Balasubramanian, S.; Klein, M.L. Modified nonequilibrium molecular dynamics for fluid flows with energy conservation. J. Chem. Phys. 1997, 106, 5615–5621. [Google Scholar] [CrossRef]
- Tenney, C.M.; Maginn, E.J. Limitations and recommendations for the calculation of shear viscosity using reverse nonequilibrium molecular dynamics. J. Chem. Phys. 2010, 132, 014103. [Google Scholar] [CrossRef] [PubMed]
- Geissler, P.L.; Dellago, C. Equilibrium time correlation functions from irreversible transformations in trajectory space. J. Phys. Chem. B 2004, 108, 6667–6672. [Google Scholar] [CrossRef]
- Touchette, H. The large deviation approach to statistical mechanics. Phys. Rep. 2009, 478, 1–69. [Google Scholar] [CrossRef]
- Touchette, H. Introduction to dynamical large deviations of Markov processes. arXiv, 2017; arXiv:1705.06492. [Google Scholar]
- Jarzynski, C. Nonequilibrium equality for free energy differences. Phys. Rev. Lett. 1997, 78, 2690. [Google Scholar] [CrossRef]
- Crooks, G.E. Entropy production fluctuation theorem and the nonequilibrium work relation for free energy differences. Phys. Rev. E 1999, 60, 2721–2726. [Google Scholar] [CrossRef]
- Barato, A.C.; Seifert, U. Thermodynamic uncertainty relation for biomolecular processes. Phys. Rev. Lett. 2015, 114, 158101. [Google Scholar] [CrossRef] [PubMed]
- Gingrich, T.R.; Horowitz, J.M.; Perunov, N.; England, J.L. Dissipation bounds all steady-state current fluctuations. Phys. Rev. Lett. 2016, 116, 120601. [Google Scholar] [CrossRef] [PubMed]
- Gaspard, P. Multivariate fluctuation relations for currents. New J. Phys. 2013, 15, 115014. [Google Scholar] [CrossRef]
- Andrieux, D.; Gaspard, P. Fluctuation theorem and Onsager reciprocity relations. J. Chem. Phys. 2004, 121, 6167–6174. [Google Scholar] [CrossRef] [PubMed]
- Andrieux, D.; Gaspard, P. A fluctuation theorem for currents and non-linear response coefficients. J. Stat. Mech. Theory Exp. 2007. [Google Scholar] [CrossRef]
- Dellago, C.; Bolhuis, P.G.; Csajka, F.S.; Chandler, D. Transition path sampling and the calculation of rate constants. J. Chem. Phys. 1998, 108, 1964–1977. [Google Scholar] [CrossRef]
- Geissler, P.L.; Dellago, C.; Chandler, D. Kinetic pathways of ion pair dissociation in water. J. Phys. Chem. B 1999, 103, 3706–3710. [Google Scholar] [CrossRef]
- Geissler, P.L.; Dellago, C.; Chandler, D.; Hutter, J.; Parrinello, M. Autoionization in liquid water. Science 2001, 291, 2121–2124. [Google Scholar] [CrossRef] [PubMed]
- Bolhuis, P.G.; Chandler, D.; Dellago, C.; Geissler, P.L. Transition path sampling: Throwing ropes over rough mountain passes, in the dark. Annu. Rev. Phys. Chem. 2002, 53, 291–318. [Google Scholar] [CrossRef] [PubMed]
- Radhakrishnan, R.; Schlick, T. Orchestration of cooperative events in DNA synthesis and repair mechanism unraveled by transition path sampling of DNA polymerase β’s closing. Proc. Natl. Acad. Sci. USA 2004, 101, 5970–5975. [Google Scholar] [CrossRef] [PubMed]
- Basner, J.E.; Schwartz, S.D. How enzyme dynamics helps catalyze a reaction in atomic detail: A transition path sampling study. J. Am. Chem. Soc. 2005, 127, 13822–13831. [Google Scholar] [CrossRef] [PubMed]
- Hagan, M.F.; Chandler, D. Dynamic pathways for viral capsid assembly. Biophys. J. 2006, 91, 42–54. [Google Scholar] [CrossRef] [PubMed]
- Peters, B. Recent advances in transition path sampling: Accurate reaction coordinates, likelihood maximisation and diffusive barrier-crossing dynamics. Mol. Simul. 2010, 36, 1265–1281. [Google Scholar] [CrossRef]
- Limmer, D.T.; Chandler, D. Theory of amorphous ices. Proc. Natl. Acad. Sci. USA 2014, 111, 9413–9418. [Google Scholar] [CrossRef] [PubMed]
- Giardina, C.; Kurchan, J.; Peliti, L. Direct evaluation of large-deviation functions. Phys. Rev. Lett. 2006, 96, 120603. [Google Scholar] [CrossRef] [PubMed]
- Giardina, C.; Kurchan, J.; Lecomte, V.; Tailleur, J. Simulating rare events in dynamical processes. J. Stat. Phys. 2011, 145, 787–811. [Google Scholar] [CrossRef]
- Nemoto, T.; Bouchet, F.; Jack, R.L.; Lecomte, V. Population-dynamics method with a multicanonical feedback control. Phys. Rev. E 2016, 93, 062123. [Google Scholar] [CrossRef] [PubMed]
- Klymko, K.; Geissler, P.L.; Garrahan, J.P.; Whitelam, S. Rare behavior of growth processes via umbrella sampling of trajectories. arXiv, 2017; arXiv:1707.00767. [Google Scholar]
- Ray, U.; Chan, G.K.-L.; Limmer, D.T. Exact fluctuations of nonequilibrium steady states from approximate auxiliary dynamics. arXiv, 2017; arXiv:1708.09482. [Google Scholar]
- Onsager, L. Reciprocal relations in irreversible processes. I. Phys. Rev. 1931, 37, 405. [Google Scholar] [CrossRef]
- Palmer, T.; Speck, T. Thermodynamic formalism for transport coefficients with an application to the shear modulus and shear viscosity. J. Chem. Phys. 2017, 146, 124130. [Google Scholar] [CrossRef] [PubMed]
- Abascal, J.L.F.; Vega, C. A general purpose model for the condensed phases of water: TIP4P/2005. J. Chem. Phys. 2005, 123, 234505. [Google Scholar] [CrossRef] [PubMed]
- Weeks, J.D.; Chandler, D.; Andersen, H.C. Role of repulsive forces in determining the equilibrium structure of simple liquids. J. Chem. Phys. 1971, 54, 5237–5247. [Google Scholar] [CrossRef]
- Chandler, D. Introduction to Modern Statistical Mechanics; Oxford University Press: London, UK, 1987. [Google Scholar]
- Morriss, G.P.; Evans, D.J. Statistical Mechanics of Nonequilbrium Liquids; ANU Press: Canberra, Australia, 2013. [Google Scholar]
- Lebowitz, J.L.; Spohn, H. A Gallavotti–Cohen-type symmetry in the large deviation functional for stochastic dynamics. J. Stat. Phys. 1999, 95, 333–365. [Google Scholar] [CrossRef]
- Helfand, E. Transport coefficients from dissipation in a canonical ensemble. Phys. Rev. 1960, 119, 1. [Google Scholar] [CrossRef]
- Foulkes, W.M.C.; Mitas, L.; Needs, R.J.; Rajagopal, G. Quantum Monte Carlo simulations of solids. Rev. Mod. Phys. 2001, 73, 33. [Google Scholar] [CrossRef]
- Hurtado, P.I.; Garrido, P.L. Spontaneous symmetry breaking at the fluctuating level. Phys. Rev. Lett. 2011, 107, 180601. [Google Scholar] [CrossRef] [PubMed]
- Garrahan, J.P.; Jack, R.L.; Lecomte, V.; Pitard, E.; van Duijvendijk, K.; van Wijland, F. First-order dynamical phase transition in models of glasses: An approach based on ensembles of histories. J. Phys. A Math. Theor. 2009, 42, 075007. [Google Scholar] [CrossRef]
- Bodineau, T.; Lecomte, V.; Toninelli, C. Finite size scaling of the dynamical free-energy in a kinetically constrained model. J. Stat. Phys. 2012, 147, 1–17. [Google Scholar] [CrossRef]
- Allen, R.J.; Valeriani, C.; ten Wolde, P.R. Forward flux sampling for rare event simulations. J. Phys. Condens. Matter 2009, 21, 463102. [Google Scholar] [CrossRef] [PubMed]
- Frenkel, D.; Smit, B. Understanding Molecular Simulation: From Algorithms to Applications; Academic Press: San Diego, CA, USA, 2001; Volume 1. [Google Scholar]
- Nemoto, T.; Hidalgo, E.G.; Lecomte, V. Finite-time and finite-size scalings in the evaluation of large-deviation functions: Numerical approach in continuous time. Phys. Rev. E 2017, 95, 062134. [Google Scholar] [CrossRef] [PubMed]
- Plimpton, S. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 1995, 117, 1–19. [Google Scholar] [CrossRef]
- Ray, U.; Chan, G.K.-L.; Limmer, D.T. Importance sampling large deviations in nonequilibrium steady states: Part 1. arXiv, 2017; arXiv:1708.00459. [Google Scholar]
- González, M.A.; Abascal, J.L.F. The shear viscosity of rigid water models. J. Chem. Phys. 2010, 132, 096101. [Google Scholar] [CrossRef] [PubMed]
- Nosé, S. A unified formulation of the constant temperature molecular dynamics methods. J. Chem. Phys. 1984, 81, 511–519. [Google Scholar] [CrossRef]
- Ryckaert, J.P.; Ciccotti, G.; Berendsen, H.J.C. Numerical integration of the cartesian equations of motion of a system with constraints: Molecular dynamics of n-alkanes. J. Comput. Phys. 1977, 23, 327–341. [Google Scholar] [CrossRef]
- Sendner, C.; Horinek, D.; Bocquet, L.; Netz, R.R. Interfacial water at hydrophobic and hydrophilic surfaces: Slip, viscosity, and diffusion. Langmuir 2009, 25, 10768–10781. [Google Scholar] [CrossRef] [PubMed]
- Petravic, J.; Harrowell, P. On the equilibrium calculation of the friction coefficient for liquid slip against a wall. J. Chem. Phys. 2007, 127. [Google Scholar] [CrossRef] [PubMed]
- Huang, K.; Szlufarska, I. Green–Kubo relation for friction at liquid–solid interfaces. Phys. Rev. E 2014, 89, 032119. [Google Scholar] [CrossRef] [PubMed]
- Bocquet, L.; Barrat, J.L. On the Green–Kubo relationship for the liquid–solid friction coefficient. J. Chem. Phys. 2013, 139, 044704. [Google Scholar] [CrossRef] [PubMed]
- Alder, B.J.; Wainwright, T.E. Decay of the velocity autocorrelation function. Phys. Rev. A 1970, 1, 18–21. [Google Scholar] [CrossRef]
- Wainwright, T.; Alder, B.; Gass, D. Decay of time correlations in two dimensions. Phys. Rev. A 1971, 4, 233. [Google Scholar] [CrossRef]
- Isobe, M. Long-time tail of the velocity autocorrelation function in a two-dimensional moderately dense hard-disk fluid. Phys. Rev. E 2008, 77, 021201. [Google Scholar] [CrossRef] [PubMed]
- Che, J.; Cagin, T.; Goddard, W.A., III. Thermal conductivity of carbon nanotubes. Nanotechnology 2000, 11, 65–69. [Google Scholar] [CrossRef]
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).