A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics

Federica Pes, Dipartimento di Chimica e Chimica Industriale, Università di Pisa, Via G. Moruzzi 13, 56124 Pisa, Italy
Étienne Polack, CERMICS, École des Ponts and Inria Paris, 6 & 8 avenue Blaise Pascal, 77455 Marne-la-Vallée, France
Patrizia Mazzeo, Dipartimento di Chimica e Chimica Industriale, Università di Pisa, Via G. Moruzzi 13, 56124 Pisa, Italy
Geneviève Dusson, Laboratoire de Mathématiques de Besançon, UMR CNRS 6623, Université de Franche-Comté, 16 route de Gray, 25030 Besançon, France
Benjamin Stamm, Institute of Applied Analysis and Numerical Simulation, University of Stuttgart, 70569 Stuttgart, Germany
Filippo Lipparini, Dipartimento di Chimica e Chimica Industriale, Università di Pisa, Via G. Moruzzi 13, 56124 Pisa, Italy

Abstract

This article introduces the so-called Quasi Time-Reversible (QTR G-Ext) scheme based on Grassmann extrapolation of density matrices for an accurate calculation of initial guesses in Born-Oppenheimer Molecular Dynamics (BOMD) simulations. The method shows excellent results on four large molecular systems that are representative of real-life production applications, with 21 to 94 atoms treated with Kohn-Sham (KS) density functional theory and embedded in a classical environment of 6k to 16k atoms. Namely, it clearly reduces the number of self-consistent field iterations while at the same time achieving energy-conserving simulations, resulting in a considerable speed-up of BOMD simulations even when tight convergence of the KS equations is required.

Ab-initio Born-Oppenheimer Molecular Dynamics (BOMD) is a very powerful and versatile tool to simulate molecular processes where the quantum nature of the system is not negligible. Unfortunately, this comes at a high computational price, which stems from the necessity of solving the quantum mechanical (QM) equations, typically the Kohn-Sham Density Functional Theory (KS-DFT) equations, to compute the energy and forces at every time step. Such equations are nonlinear and are solved using a fixed-point iterative method known as Self-Consistent Field (SCF) [33]. Achieving SCF convergence typically requires, in a standard single-point run, up to 20 iterations, which makes the MD simulation very expensive, as the SCF has to be performed tens of thousands of times. Two main families of methods have been developed to address this limitation. In extended Lagrangian methods, such as Car-Parrinello Molecular Dynamics (CPMD)[6] or Atom-centered Density Matrix Propagation (ADMP)[35], the electronic degrees of freedom are propagated, thus avoiding the need to solve the SCF problem. This requires one to endow the electronic degrees of freedom with a fictitious mass, which needs to be small enough to keep the trajectory close to its Born-Oppenheimer counterpart. As a consequence, rather short time steps need to be used. A different strategy relies on developing extrapolation techniques[8, 1, 2, 31, 12, 13, 36, 37, 22, 23, 21, 39] for BOMD that allow one to converge the SCF in a limited number of iterations. In this work, we choose the second strategy, which is particularly effective for calculations using localized basis sets, e.g., Gaussian-type orbitals. The extrapolation techniques used in BOMD use converged solutions from previous MD steps to compute an accurate guess for the SCF, thus limiting the number of iterations required to achieve convergence.
A significant contribution to this field was made by Niklasson and co-workers in 2006, with their work on time-reversible extrapolation for Born-Oppenheimer Molecular Dynamics [22]. The core concept involves generating a guess density matrix by combining the density matrices from previous steps in a symmetric and time-reversible manner. However, numerical applications showed that enforcing exact time-reversibility can lead to errors accumulating in long-time simulations, thus spoiling the convergence properties of the algorithm in the long run. This led to the development of the Extended Lagrangian Born-Oppenheimer approach (XLBO) in 2008 [23, 21, 20, 24, 39]. In this case, the time-reversible extrapolation is augmented by a dissipative term, which serves to reduce numerical fluctuations. XLBO can be seen as an intermediate strategy between Car-Parrinello-like approaches and extrapolation techniques for BOMD, as it propagates an auxiliary density matrix that can either be used directly in a CPMD spirit[25, 14], possibly after refining the density using an approximate SCF solver, or used as a guess for the SCF[23]. Here, we focus on the latter approach.

In Niklasson’s XLBO scheme, the guess density is propagated in time subject to a potential that forces it to be close to the converged density. The result is a guess density that is accurate enough to achieve reasonable SCF convergence (e.g., $10^{-5}$ RMS norm of the density matrix change) in as few as four iterations: Niklasson’s pioneering work has therefore been crucial in extending the applicability of BOMD. However, the XLBO method suffers from a few shortcomings. First, the guess density obtained with XLBO is not exactly idempotent [23], unless it is postprocessed using, e.g., McWeeny purification[19, 29]. Second, its performance degrades if a tightly converged SCF solution is required, as is the case when a post Hartree-Fock method is used to compute the energy and forces (e.g., in a time-dependent DFT excited state simulation).

Recently, we proposed a different strategy to compute a guess density using linear extrapolation. This is non-trivial, because in general a linear combination of density matrices does not preserve idempotency; in other words, density matrices belong to a differentiable manifold called the Grassmann manifold and not to a vector space. Our approach uses tools from differential geometry to map the Grassmann manifold onto its tangent space, which is a vector space. It then performs a linear extrapolation on the tangent space and maps the extrapolated density back to the manifold. We named this method Grassmann extrapolation (G-Ext) [29, 28]. G-Ext is an accurate and efficient strategy for ab-initio MD simulations that has been shown to outperform XLBO, especially if a tight SCF convergence is required[29]. It has been successfully adopted in the Pisa group for both ground- and excited-state SCF-based BOMD simulations in a polarizable multiscale framework [4, 10, 18, 26]. Unfortunately, G-Ext suffers from a serious shortcoming. Numerical experiments have shown that the extrapolation introduces a bias that causes a drift in the total energy for NVE simulations[29]. Such an energy drift is modest in absolute terms (a few kcal/mol in 10 ps, to be compared with total energies of hundreds of thousands of kcal/mol), but large if compared with the energetics of typical chemical processes. While using a tight convergence criterion for the SCF solves the problem[29], this is not an option for expensive production simulations, thus limiting the gains introduced by the overall approach.

In this contribution, we not only address this limitation by introducing a new strategy to perform the extrapolation, but also further improve the performance of the method. We name the new strategy Quasi Time-Reversible Grassmann extrapolation (QTR G-Ext). This approach leverages the principles of differential geometry, similarly to the previous method, but offers enhanced accuracy, improved performance, and excellent energy conservation properties.

Given an $\mathcal{N}$-dimensional atomic orbital (AO) basis, the SCF solves the following nonlinear eigenvalue problem, which consists in finding a matrix $C$ and a diagonal matrix $E$ such that

\begin{cases}F(D)C=SCE\\ C^{T}SC=I_{N}\\ D=CC^{T},\end{cases}

where $C\in\mathbb{R}^{\mathcal{N}\times N}$ contains the $\mathcal{N}$ coefficients of each of the $N$ occupied molecular orbitals, $D\in\mathbb{R}^{\mathcal{N}\times\mathcal{N}}$ is the density matrix, $E\in\mathbb{R}^{N\times N}$ is a diagonal matrix whose entries are the energy levels, $F$ denotes the DFT operator, $S\in\mathbb{R}^{\mathcal{N}\times\mathcal{N}}$ is the overlap matrix, and $I_{N}$ denotes the identity matrix of order $N$.

We assume that the density matrix is an orthogonal projector. If it is not, it can be transformed into one by considering the Löwdin factorization of the overlap matrix $S$ and, consequently, the modified coefficient matrix $\widetilde{C}=S^{1/2}C$. Then the normalized density matrix $\widetilde{D}=\widetilde{C}\widetilde{C}^{T}=S^{1/2}DS^{1/2}$ belongs to the manifold

\mathcal{G}r(N,\mathcal{N})=\left\{D\in\mathbb{R}^{\mathcal{N}\times\mathcal{N}}\;\middle|\;D^{2}=D=D^{T},\ \operatorname{Tr}(D)=N\right\},

which is isomorphic to the so-called “Grassmann manifold”, therefore we identify $\mathcal{G}r$ by this name. From now on, we assume that the density matrix has been orthonormalized and we denote it by $D$ .
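As a minimal NumPy sketch of the orthonormalization described above, one may form $S^{1/2}$ from the eigendecomposition of $S$ and verify that the transformed density satisfies the defining properties of $\mathcal{G}r(N,\mathcal{N})$. The function names and the random test matrices are illustrative assumptions and are not taken from the implementation used in this work.

```python
import numpy as np

def sqrt_spd(S):
    """Symmetric square root of a symmetric positive-definite matrix."""
    w, V = np.linalg.eigh(S)
    return (V * np.sqrt(w)) @ V.T

def lowdin_density(D, S):
    """Map an AO density matrix D to S^{1/2} D S^{1/2}."""
    S_half = sqrt_spd(S)
    return S_half @ D @ S_half

def on_grassmann(D, N, tol=1e-8):
    """Check D^2 = D = D^T and Tr(D) = N up to a tolerance."""
    return bool(np.allclose(D @ D, D, atol=tol)
                and np.allclose(D, D.T, atol=tol)
                and abs(np.trace(D) - N) < tol)

# Illustrative data only: a random overlap-like matrix and S-orthonormal coefficients.
rng = np.random.default_rng(0)
n_basis, n_occ = 8, 3
A = rng.standard_normal((n_basis, n_basis))
S = A @ A.T + n_basis * np.eye(n_basis)        # symmetric positive definite
Q, _ = np.linalg.qr(rng.standard_normal((n_basis, n_occ)))
C = np.linalg.solve(sqrt_spd(S), Q)            # C = S^{-1/2} Q, hence C^T S C = I
D_tilde = lowdin_density(C @ C.T, S)
```

In this toy setting `on_grassmann(D_tilde, n_occ)` holds, whereas the untransformed `C @ C.T` is in general not idempotent when $S\neq I$.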

Since $\mathcal{G}r$ is a differentiable manifold, given a point $D_{0}\in\mathcal{G}r$, there exists a tangent space $\mathcal{T}_{D_{0}}\subset\mathbb{R}^{\mathcal{N}\times N}$ such that tangent vectors $\Gamma(D)\in\mathcal{T}_{D_{0}}$ can be associated with nearby points $D\in\mathcal{G}r$.

In MD, $t\to\bm{R}(t)$ represents the trajectory of the nuclei. The transformation of the electronic structure can be interpreted as a trajectory denoted by $t\to D_{\bm{R}(t)}$ on the manifold. In order not to burden the notation, we simply indicate $D$ in place of $D_{\bm{R}(t)}$ . The objective is to determine a suitable approximation for the density matrix at the next step of the molecular dynamics trajectory by extrapolating the densities from previous steps. Since the tangent space $\mathcal{T}_{D_{0}}$ is a vector space, we approximate the density matrix on $\mathcal{T}_{D_{0}}$ . In order to solve the extrapolation problem, we decompose the mapping $\bm{R}\to D$ as a composition of several maps

\begin{array}{ccccccc}\mathbb{R}^{3M}&\longrightarrow&\mathcal{D}&\longrightarrow&\mathcal{T}_{D_{0}}&\longrightarrow&\mathcal{G}r\\ \bm{R}&\longmapsto&d&\longmapsto&\Gamma&\longmapsto&D,\end{array}

(1)

where the first function $\bm{R}\mapsto d$ is a map from atomic positions to molecular descriptors. Here, as a descriptor, we use the Coulomb matrix [34] $d\in\mathbb{R}^{N_{\rm QM}\times N_{\rm QM}}$ ,

(d)_{kl}=\begin{cases}0.5z_{k}^{2.4}&k=l,\\ \dfrac{z_{k}z_{l}}{\|\bm{R}_{k}-\bm{R}_{l}\|}&k\neq l,\end{cases}

(2)

where $N_{\rm QM}$ is the number of atoms treated quantum mechanically, and $z_{k}$ and $\bm{R}_{k}$ denote the nuclear charge and the position of the $k$th atom, respectively. Note that other descriptors can also be considered. We will detail the crucial mapping $d\mapsto\Gamma$ below. The mapping $\Gamma\mapsto\operatorname{Exp}(\Gamma)=D$ is the so-called Grassmann exponential, which maps tangent vectors in $\mathcal{T}_{D_{0}}$ to $\mathcal{G}r$ and is locally bijective in a neighborhood of $D_{0}$. Its inverse $D\mapsto\operatorname{Log}(D)=\Gamma(D)$ is the Grassmann logarithm. These mappings are computed by means of the singular value decomposition (SVD). For mathematical details, the interested reader is referred to [28, 40, 7]. In our method, during the MD, we use a fixed reference point $D_{0}$ to construct the tangent space $\mathcal{T}_{D_{0}}$.
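To make the role of the SVD concrete, the following sketch implements the Grassmann logarithm and exponential in terms of orthonormal coefficient matrices, using the standard SVD-based formulas of the type discussed in [7]. It is a sketch under stated assumptions: the function names are hypothetical, the reference point is represented by an orthonormal matrix `C0` with $D_{0}=C_{0}C_{0}^{T}$, and the conventions of the actual implementation may differ.

```python
import numpy as np

def grassmann_log(C0, C):
    """Tangent vector at span(C0) pointing to span(C); both have orthonormal columns."""
    # Requires C0^T C to be invertible, i.e. C close enough to the reference C0.
    L = C @ np.linalg.inv(C0.T @ C) - C0
    U, s, Vt = np.linalg.svd(L, full_matrices=False)
    return (U * np.arctan(s)) @ Vt

def grassmann_exp(C0, Gamma):
    """Density matrix reached from span(C0) along the tangent vector Gamma."""
    U, s, Vt = np.linalg.svd(Gamma, full_matrices=False)
    C = ((C0 @ Vt.T) * np.cos(s)) @ Vt + (U * np.sin(s)) @ Vt
    return C @ C.T

# Illustrative data only: a random reference point and a nearby point.
rng = np.random.default_rng(1)
n_basis, n_occ = 10, 4
C0, _ = np.linalg.qr(rng.standard_normal((n_basis, n_occ)))
C, _ = np.linalg.qr(C0 + 0.1 * rng.standard_normal((n_basis, n_occ)))
Gamma = grassmann_log(C0, C)       # shape (n_basis, n_occ), as for the tangent space above
D_back = grassmann_exp(C0, Gamma)  # should reproduce C C^T
```

In this representation the tangent vector is an $\mathcal{N}\times N$ matrix orthogonal to `C0`, and applying the exponential to the logarithm recovers the original density matrix for points close to the reference.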

Let $n$ be the current time step of the MD. Given the $q$ previous snapshots $\Gamma_{n-i}=\operatorname{Log}(D_{n-i})$, for $i=1,\ldots,q$, the approximation of the density matrix representative on the tangent space is written as

\widetilde{\Gamma}_{n}=-\Gamma_{n-q}+\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left(\Gamma_{n-i}+\Gamma_{n-q+i}\right),

(3)

where $\widetilde{q}=q/2$ if $q$ is even, while $\widetilde{q}=(q-1)/2$ if $q$ is odd. We remark that if in Eq. (3), the term $\Gamma_{n-q}$ is substituted by $\widetilde{\Gamma}_{n-q}$ , a “fully” time-reversible approach (instead of quasi time-reversible) is obtained. Numerical experiments with the fully time-reversible approach, that are reported in the Supporting Information (SI), showed good behavior for total energy conservation, but unfortunately a strong increase in the number of performed SCF iterations. This is consistent with what has been observed by Niklasson and coworkers[21], who remark that exact time-reversibility under noisy conditions (e.g., not fully converged SCF) can lead to error accumulations and significantly worse SCF convergence.
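For illustration, writing out Eq. (3) explicitly for $q=4$ and $q=5$ (both of which give $\widetilde{q}=2$) yields:

```latex
\begin{aligned}
q=4:\quad \widetilde{\Gamma}_{n} &= -\Gamma_{n-4}+\alpha_{1}\left(\Gamma_{n-1}+\Gamma_{n-3}\right)+2\alpha_{2}\,\Gamma_{n-2},\\
q=5:\quad \widetilde{\Gamma}_{n} &= -\Gamma_{n-5}+\alpha_{1}\left(\Gamma_{n-1}+\Gamma_{n-4}\right)+\alpha_{2}\left(\Gamma_{n-2}+\Gamma_{n-3}\right).
\end{aligned}
```

In both cases, moving $\Gamma_{n-q}$ to the left-hand side gives an expression that is unchanged under the exchange $n-i\leftrightarrow n-q+i$, that is, under reflection of the time window about its midpoint. The symmetry is only approximate ("quasi") because the converged $\Gamma_{n-q}$ appears in place of the extrapolated $\widetilde{\Gamma}_{n-q}$.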

The descriptors are involved in the computation of the coefficients $\bm{\alpha}=[\alpha_{1},\ldots,\alpha_{\widetilde{q}}]^{T}$ appearing in Eq. (3). Indeed, they are computed by solving the least-squares problem with Tikhonov regularization

\min_{\bm{\alpha}\in\mathbb{R}^{\widetilde{q}}}\left\{\left\|d_{n}+d_{n-q}-\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left(d_{n-i}+d_{n-q+i}\right)\right\|^{2}+\varepsilon^{2}\left\|\bm{\alpha}\right\|^{2}\right\},

(4)

where $\|\cdot\|$ denotes the $\ell^{2}$ -norm and $\varepsilon>0$ is the regularization parameter. Since the Coulomb matrix (2) is symmetric, in the above formula $d_{j}$ represents the vectorized Coulomb matrix considering the lower triangle. In matrix form, it corresponds to solving the following least-squares problem

\min_{\bm{\alpha}\in\mathbb{R}^{\widetilde{q}}}\left\|\begin{bmatrix}\bm{b}\\ \bm{0}\end{bmatrix}-\begin{bmatrix}A\\ \varepsilon I_{\widetilde{q}}\end{bmatrix}\bm{\alpha}\right\|^{2},

(5)

where the vector $\bm{b}=d_{n}+d_{n-q}$ is padded with $\widetilde{q}$ zeroes, $A\in\mathbb{R}^{N_{d}\times\widetilde{q}}$ is the matrix whose columns are defined as $A_{\cdot,i}=d_{n-i}+d_{n-q+i}$, $N_{d}$ is the size of the vectorized descriptor, and $I_{\widetilde{q}}$ is the identity matrix of order $\widetilde{q}$. Then the initial guess for the density matrix is obtained as the composition of the three maps in (1), where the second map $d\mapsto\Gamma$ is given by (3). Note that if this second map, denoted by $f$, were linear, then the guess would be close to exact, namely

\begin{aligned}\Gamma_{n}&=f(d_{n})\approx f\left(-d_{n-q}+\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left(d_{n-i}+d_{n-q+i}\right)\right)\\ &=-f\left(d_{n-q}\right)+\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left[f\left(d_{n-i}\right)+f\left(d_{n-q+i}\right)\right]\\ &=-\Gamma_{n-q}+\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left(\Gamma_{n-i}+\Gamma_{n-q+i}\right)=\widetilde{\Gamma}_{n}.\end{aligned}

(6)

After computing the coefficients $\alpha_{i}$ by solving (4) and the tangent vector $\widetilde{\Gamma}_{n}$ by Eq. (3), we obtain the sought guess density matrix for the SCF iterative method as $\widetilde{D}_{n}=\operatorname{Exp}(\widetilde{\Gamma}_{n})$ .
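The steps of Eqs. (2) to (5) can be sketched in a few lines of NumPy. This is an illustrative sketch, not the Tinker implementation: the function names are hypothetical, histories are passed as Python lists ordered from the most recent step ($n-1$) to the oldest ($n-q$), and including the diagonal in the vectorized lower triangle is an assumption.

```python
import numpy as np

def coulomb_descriptor(Z, R):
    """Vectorized lower triangle (diagonal included, by assumption) of Eq. (2)."""
    Z = np.asarray(Z, dtype=float)
    R = np.asarray(R, dtype=float)
    dist = np.linalg.norm(R[:, None, :] - R[None, :, :], axis=-1)
    np.fill_diagonal(dist, 1.0)                # placeholder; the diagonal is overwritten below
    M = np.outer(Z, Z) / dist
    np.fill_diagonal(M, 0.5 * Z**2.4)
    return M[np.tril_indices(len(Z))]

def qtr_coefficients(d_now, d_prev, eps):
    """Solve the regularized problem (4)/(5); d_prev[i-1] holds d_{n-i}, i = 1..q."""
    q = len(d_prev)
    qt = q // 2                                # q/2 if q is even, (q-1)/2 if q is odd
    A = np.column_stack([d_prev[i - 1] + d_prev[q - i - 1] for i in range(1, qt + 1)])
    b = d_now + d_prev[q - 1]
    A_reg = np.vstack([A, eps * np.eye(qt)])   # stacked system of Eq. (5)
    b_reg = np.concatenate([b, np.zeros(qt)])
    alpha, *_ = np.linalg.lstsq(A_reg, b_reg, rcond=None)
    return alpha

def qtr_tangent_guess(alpha, G_prev):
    """Evaluate Eq. (3); G_prev[i-1] holds Gamma_{n-i}, i = 1..q."""
    q = len(G_prev)
    guess = -G_prev[q - 1]
    for i, a in enumerate(alpha, start=1):
        guess = guess + a * (G_prev[i - 1] + G_prev[q - i - 1])
    return guess
```

The guess density is then obtained by applying the Grassmann exponential to the output of `qtr_tangent_guess`. Solving the stacked system with a least-squares routine is equivalent to solving the normal equations $(A^{T}A+\varepsilon^{2}I_{\widetilde{q}})\bm{\alpha}=A^{T}\bm{b}$.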

The number $q$ of density matrices taken at previous steps and the value of the regularization parameter $\varepsilon$ are chosen heuristically: we computed the error $\|\Gamma_{n}-\widetilde{\Gamma}_{n}\|$ for different values of $q$ and $\varepsilon$, specifically $q=3,4,\ldots,20$ and $\varepsilon=0.001,0.002,0.005,0.01,0.02,0.05$, and we selected the combination $(q,\varepsilon)$ corresponding to the minimal error. When the SCF convergence threshold is $10^{-5}$, we found that good values are $q=5$ and $\varepsilon=0.005$, while for a threshold of $10^{-7}$ we found $q=4$ and $\varepsilon=0.001$ or $0.002$. Additional details on the selection of $q$ and $\varepsilon$ can be found in Section S1 of the SI. The computational cost of computing the extrapolation coefficients $\bm{\alpha}$ is negligible compared to the time for a single MD step. Thanks to the symmetry of the coefficients, the size of the system (5) is $(N_{d}+\widetilde{q})\times\widetilde{q}$, and $\widetilde{q}$ is a small number (in our simulations $\widetilde{q}=2$, as $q=4$ or $q=5$).

The QTR G-Ext approach is tested on four different systems. The first system is dimethylaminobenzonitrile (DMABN) in methanol. The second system is 3-hydroxyflavone (3HF) in acetonitrile. The last two systems (OCP and AppA) are chromophores embedded in a biological matrix, namely a carotenoid in the orange carotenoid protein (OCP) and a flavin in the AppA Blue-Light Using Flavin photoreceptor [5, 4, 10]. Some information on the systems is reported in Table 1.

Table 1: Summary of systems’ size: number of QM atoms $N_{\text{QM}}$, number of MM atoms $N_{\text{MM}}$, number of QM basis functions $\mathcal{N}$, number of occupied orbitals $N$, and size of descriptors $N_{d}$.

System	$N_{\text{QM}}$	$N_{\text{MM}}$	$\mathcal{N}$	$N$	$N_{d}$
DMABN	21	6843	185	39	234
3HF	28	15046	290	62	409
AppA	31	16449	309	67	468
OCP	94	6058	734	154	4468

KS-DFT has been adopted to describe the QM subsystem, with the B3LYP hybrid functional [3] and the Pople 6-31G(d) basis set [11]. This is coupled with a polarizable description of the environment, using the AMOEBA force field [30]. For each system, we performed a QM/AMOEBA geometry optimization until a root-mean-square norm of the forces of 4 kcal/mol/Å was reached, followed by a 2 ps QM/AMOEBA NVT equilibration to obtain the starting point of the simulations presented in this work.

All simulations have been performed using the Gaussian-Tinker interface [27, 16, 17, 9]. We implemented the QTR G-Ext extrapolation approach in Tinker [32, 15].

To assess the quality of the guess density obtained by the QTR G-Ext extrapolation, we performed 10 ps BOMD simulations, with a 0.5 fs time step, in the NVE ensemble, using the velocity Verlet integrator [38]. All systems were tested with SCF convergence thresholds of $10^{-5}$ and $10^{-7}$ with respect to the RMS variation of the density. We compare our approach, in terms of energy stability and number of iterations required to reach convergence, with two other extrapolation schemes: the G-Ext scheme [29]

\widetilde{\Gamma}_{n}=\sum_{i=1}^{q}\alpha_{i}\Gamma_{n-i},\qquad q=6,

where the $\alpha_{i}$ are computed by solving

\min_{\bm{\alpha}\in\mathbb{R}^{q}}\left\{\left\|d_{n}-\sum_{i=1}^{q}\alpha_{i}d_{n-i}\right\|^{2}+\varepsilon^{2}\left\|\bm{\alpha}\right\|^{2}\right\},

where $\varepsilon=0.01$, and XLBO[23, 21]

\begin{aligned}\widetilde{D}_{n}&=2\widetilde{D}_{n-1}-\widetilde{D}_{n-2}+\kappa\left(D_{n-1}-\widetilde{D}_{n-1}\right)\\ &\phantom{=}+c\sum_{i=1}^{8}\alpha_{i}\widetilde{D}_{n-i},\end{aligned}

with fixed parameters $\kappa=1.86$ , $c=0.0016$ , and $\bm{\alpha}=(-36,99,-88,11,32,-25,8,-1)$ .
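For comparison, the XLBO update above is simple to write down. The sketch below uses the parameter values quoted in the text; the function name and the list-based history layout (most recent auxiliary density first) are assumptions made for illustration.

```python
import numpy as np

KAPPA = 1.86
C_DISS = 0.0016
ALPHA = np.array([-36.0, 99.0, -88.0, 11.0, 32.0, -25.0, 8.0, -1.0])

def xlbo_guess(D_conv_prev, D_aux_prev):
    """XLBO guess; D_aux_prev[i-1] holds the auxiliary density of step n-i, i = 1..8.

    D_conv_prev is the converged SCF density of step n-1.
    """
    verlet = 2.0 * D_aux_prev[0] - D_aux_prev[1]
    restoring = KAPPA * (D_conv_prev - D_aux_prev[0])
    dissipation = C_DISS * sum(a * D for a, D in zip(ALPHA, D_aux_prev))
    return verlet + restoring + dissipation
```

Since the coefficients $\alpha_{i}$ sum to zero, the dissipative term vanishes for a constant history, so a stationary density is left unchanged by the update.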

Figure 1: Total energy as a function of simulation time for DMABN, using a $10^{-5}$ convergence threshold for the SCF.

Figure 1 shows the total energy along the DMABN simulation, with a $10^{-5}$ SCF convergence threshold. Although our newly implemented approach is not fully time-reversible, we observe a great improvement with respect to the G-Ext scheme. In particular, the behaviour of the QTR G-Ext method resembles that of the fully time-reversible XLBO scheme. The difference between the methods is almost imperceptible when the SCF convergence threshold is set to $10^{-7}$ (Figure 2): the accumulation of errors that generates the energy drift with G-Ext is smaller, so all the extrapolation schemes follow the same trend. Analogous figures are reported in Section S2 of the SI for all tested systems. To better evaluate the energy stability, we consider the average short-time fluctuation (STF) of the energy, which is computed by taking the RMS of the energy fluctuation every 50 fs and averaging over the trajectory, and, for a long-time analysis, the long-time drift (LTD), which is the slope of the linear regression line of the energy. Tables 2 and 3 report STF and LTD for convergence thresholds $10^{-5}$ and $10^{-7}$, respectively. QTR G-Ext, G-Ext, and XLBO show comparable STF, which is specific to the system and is related to the integration time step. On the other hand, the absolute value of LTD is in general higher for $10^{-5}$ simulations, in particular for G-Ext. We can state that the QTR G-Ext method solves the energy-drift issue of G-Ext, showing an LTD that is always similar to that of XLBO, which again suggests a good time-reversible behaviour.
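The two stability measures can be computed from an energy time series as in the following sketch. Two points are assumptions rather than statements from the text: the fluctuation in each 50 fs window is taken relative to the window mean, and the drift is expressed per picosecond. The function names are hypothetical.

```python
import numpy as np

def short_time_fluctuation(energy, dt_fs, window_fs=50.0):
    """Average over consecutive windows of the RMS deviation from the window mean."""
    energy = np.asarray(energy, dtype=float)
    n_win = int(round(window_fs / dt_fs))      # samples per window
    n_full = len(energy) // n_win              # an incomplete trailing window is dropped
    blocks = energy[: n_full * n_win].reshape(n_full, n_win)
    return float(np.mean(blocks.std(axis=1)))

def long_time_drift(energy, dt_fs):
    """Slope of the least-squares line through the energy, per picosecond (assumed unit)."""
    energy = np.asarray(energy, dtype=float)
    t_ps = np.arange(len(energy)) * dt_fs * 1e-3
    slope, _intercept = np.polyfit(t_ps, energy, 1)
    return float(slope)
```

With a 0.5 fs time step, each window contains 100 samples and a 10 ps trajectory contributes 200 windows to the STF average.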

Table 2: Short- and Long-Time Stability Analysis of the QTR G-Ext, G-Ext, and XLBO methods. SCF convergence threshold $10^{-5}$.

	DMABN		3HF		AppA		OCP
	STF	LTD	STF	LTD	STF	LTD	STF	LTD
QTR G-Ext	0.33	-0.01	0.62	-0.40	0.57	-0.08	0.36	-0.23
G-Ext	0.35	-0.43	0.61	-0.94	0.56	-0.93	0.38	-1.38
XLBO	0.32	0.01	0.57	-0.42	0.59	0.14	0.39	-0.28

Table 3: Short- and Long-Time Stability Analysis of the QTR G-Ext, G-Ext, and XLBO methods. SCF convergence threshold $10^{-7}$.

	DMABN		3HF		AppA		OCP
	STF	LTD	STF	LTD	STF	LTD	STF	LTD
QTR G-Ext	0.37	0.01	0.59	-0.30	0.53	0.18	0.38	-0.16
G-Ext	0.33	0.04	0.60	-0.27	0.54	0.06	0.38	-0.20
XLBO	0.32	0.13	0.64	-0.37	0.56	0.06	0.38	-0.08

The gain of our new methodology is not only in terms of accuracy (energy stability), but also in terms of the computational time of the simulation. Tables 4 and 5 report the average number of SCF iterations required to achieve convergence, as well as the standard deviation, for $10^{-5}$ and $10^{-7}$ SCF thresholds, respectively. We remark that each strategy requires $q$ previous density matrices; until these are available, a standard SCF is performed. Therefore, for the computation of the average and standard deviation, we discard the first $q$ points. The two tables show that, for all the tested systems, the QTR G-Ext method requires the lowest number of SCF iterations for both convergence thresholds. Moving averages of the number of SCF iterations during the simulations, for all systems and both SCF convergence thresholds, are reported in Section S2 of the SI.

Table 4: Performance of the QTR G-Ext method compared with the G-Ext method and XLBO algorithm. Average $\overline{k}$ and standard deviation $\sigma$ of SCF iterations. Convergence threshold $10^{-5}$.

	DMABN		3HF		AppA		OCP
	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$
QTR G-Ext	3.04	0.22	2.98	0.21	3.00	0.02	2.96	0.31
G-Ext	3.55	0.85	3.16	0.69	3.03	0.54	2.91	0.41
XLBO	4.00	0.05	4.00	0.00	4.00	0.07	4.00	0.01

Table 5: Performance of the QTR G-Ext method compared with the G-Ext method and XLBO algorithm. Average $\overline{k}$ and standard deviation $\sigma$ of SCF iterations. Convergence threshold $10^{-7}$.

	DMABN		3HF		AppA		OCP
	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$
QTR G-Ext	5.42	0.69	5.42	0.80	5.37	0.84	4.86	0.83
G-Ext	7.33	0.63	6.96	0.79	6.56	0.75	5.83	0.87
XLBO	7.51	0.65	7.45	0.65	7.43	0.80	7.21	0.75

The performance of the QTR G-Ext guess is also maintained for larger and smaller time steps. We compared QTR G-Ext and XLBO for time steps of 0.1, 0.25, 0.75, and 1 fs by running MD simulations for the DMABN system with an SCF convergence threshold of $10^{-5}$. All the results can be found in Section S3 of the SI. Both methods show excellent energy conservation for the smaller time steps and afford reasonably stable simulations even for the larger ones. This is remarkable, as such simulations employ a time step that is too large to accurately sample molecular vibrations involving protons and are in general very noisy. For all time steps, QTR G-Ext requires a smaller average number of SCF iterations than XLBO. Finally, we tested the method for a looser SCF convergence threshold of $10^{-4}$. Again, this is a value that should not be used for production applications, as the error on the SCF solution transfers to the forces, thus affecting the quality of the dynamics. The results are reported in Section S4 of the SI. Good energy conservation is again shown for both methods, and QTR G-Ext outperforms XLBO in terms of the average number of SCF iterations required.

In conclusion, we presented the Quasi Time-Reversible Grassmann Extrapolation scheme, a new extrapolation method for ab-initio molecular dynamics that not only allows for energy-conserving simulations, but also exhibits excellent overall performance. Our numerical tests, performed on large, complex systems described with a polarizable multiscale strategy and taken from real-life production applications, show that QTR G-Ext is able to provide a guess density to BOMD simulations that allows the SCF procedure to converge in about 3 iterations on average for convergence thresholds that are typical of ground-state production runs, which is a 25% gain with respect to the state-of-the-art XLBO method. Tighter convergence, which is required for, e.g., time-dependent DFT excited state simulations, can also be achieved in as few as 5-6 iterations. Furthermore, our numerical tests show that the new method does not introduce any significant bias in the guess density and thus exhibits very good energy conservation properties. This can be clearly seen by comparing the long-time drift observed in simulations for the two different SCF convergence thresholds used in our tests: while the previous G-Ext method shows a sharp increase in the drift going from a $10^{-7}$ to a $10^{-5}$ SCF convergence threshold, this is not the case for the QTR G-Ext method. We stress here that, due to the cost of BOMD simulations, every gain in performance is important, as it can easily translate into hundreds or thousands of saved CPU hours. The QTR G-Ext method is easy to implement and does not introduce any significant computational overhead, and therefore represents an effective strategy to extend the applicability of BOMD simulations to larger and more complex systems.

Acknowledgements

This work was supported by the Italian Ministry of University and Research under grant 2020HTSXMA_002 (PSI-MOVIE) and by the French ‘Investissements d’Avenir’ program, project Agence Nationale de la Recherche (ISITE-BFC) (contract ANR-15-IDEX-0003). ÉP also acknowledges support from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement No 810367–project EMC2) as well as from the Simons Targeted Grant Award No. 896630. Funded by Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC 2075 – 390740016 (BS). FP is a member of the GNCS group of INdAM.

Supporting Information Available

Determination of the optimal parameters, plots of the total energy and number of SCF iterations along the dynamics for all tested systems, and results for the fully time-reversible algorithm.

References

[1] D. Alfè (1999) Ab initio molecular dynamics, a simple algorithm for charge extrapolation. Comput. Phys. Commun. 118 (1), pp. 31–33. External Links: ISSN 0010-4655, Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[2] T. A. Arias, M. C. Payne, and J. D. Joannopoulos (1992) Ab initio molecular-dynamics techniques extended to large-length-scale systems. Phys. Rev. B 45 (4), pp. 1538–1549. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[3] A. Becke (1993) Density-functional thermochemistry. 3. the role of exact exchange. J. Chem. Phys. 98 (7), pp. 5648–5652. Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[4] M. Bondanza, L. Cupellini, P. Faccioli, and B. Mennucci (2020) Molecular mechanisms of activation in the orange carotenoid protein revealed by molecular dynamics. J. Am. Chem. Soc. 142 (52), pp. 21829–21841. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics , A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[5] M. Bondanza, L. Cupellini, F. Lipparini, and B. Mennucci (2020) The Multiple Roles of the Protein in the Photoactivation of Orange Carotenoid Protein. Chem 6 (1), pp. 187–203. External Links: Document, ISSN 24519294 Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[6] R. Car and M. Parrinello (1985-11) Unified approach for molecular dynamics and density-functional theory. Phys. Rev. Lett. 55, pp. 2471–2474. External Links: Document, Link Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[7] Alan. Edelman, T. A. Arias, and S. T. Smith (1998) The Geometry of Algorithms with Orthogonality Constraints. SIAM J. Matrix Anal. Appl. 20 (2), pp. 303–353. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[8] J. Fang, X. Gao, H. Song, and H. Wang (2016) On the existence of the optimal order for wavefunction extrapolation in Born-Oppenheimer molecular dynamics. J. Chem. Phys. 144 (24), pp. 244103. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[9] M. J. Frisch, G. W. Trucks, H. B. Schlegel, G. E. Scuseria, M. A. Robb, J. R. Cheeseman, G. Scalmani, V. Barone, G. A. Petersson, H. Nakatsuji, X. Li, A. V. Marenich, M. Caricato, J. Bloino, B. G. Janesko, J. Zheng, R. Gomperts, B. Mennucci, H. P. Hratchian, J. V. Ortiz, A. F. Izmaylov, J. L. Sonnenberg, D. Williams-Young, F. Ding, F. Lipparini, F. Egidi, J. Goings, B. Peng, A. Petrone, T. Henderson, D. Ranasinghe, V. G. Zakrzewski, J. Gao, N. Rega, G. Zheng, W. Liang, M. Hada, M. Ehara, K. Toyota, R. Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai, T. Vreven, K. Throssell, Jr. J. A. Montgomery, J. E. Peralta, F. Ogliaro, M. J. Bearpark, J. J. Heyd, E. N. Brothers, K. N. Kudin, V. N. Staroverov, T. A. Keith, R. Kobayashi, J. Normand, K. Raghavachari, A. P. Rendell, J. C. Burant, S. S. Iyengar, J. Tomasi, M. Cossi, J. M. Millam, M. Klene, C. Adamo, R. Cammi, J. W. Ochterski, R. L. Martin, K. Morokuma, O. Farkas, J. B. Foresman, and D. J. Fox (2020) Gaussian Development Version, Revision J.16. Note: Gaussian, Inc., Wallingford CT, 2020. Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[10] S. Hashem, V. Macaluso, M. Nottoli, F. Lipparini, L. Cupellini, and B. Mennucci (2021) From crystallographic data to the solution structure of photoreceptors: the case of the AppA BLUF domain. Chem. Sci. 12, pp. 13331–13342. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics , A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[11] W. J. Hehre, R. Ditchfield, and J. A. Pople (1972) Self-Consistent Molecular Orbital Methods. XII. Further Extensions of Gaussian-Type Basis Sets for Use in Molecular Orbital Studies of Organic Molecules.. J. Chem. Phys. 56 (5), pp. 2257–2261. Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[12] J. M. Herbert and M. Head-Gordon (2005) Accelerated, energy-conserving Born–Oppenheimer molecular dynamics via Fock matrix extrapolation. Phys. Chem. Chem. Phys. 7 (18), pp. 3269–3275. External Links: Document Cited by: A Quasi Time-Reversible scheme based on density matrix extrapolation on the Grassmann manifold for Born-Oppenheimer Molecular Dynamics .
[13] J. Hutter, M. Parrinello, and S. Vogel (1994) Exponential transformation of molecular orbitals. J. Chem. Phys. 101 (5), pp. 3862–3865. ISSN 0021-9606.
[14] M. Kulichenko, K. Barros, N. Lubbers, N. Fedik, G. Zhou, S. Tretiak, B. Nebgen, and A. M. N. Niklasson (2023) Semi-empirical shadow molecular dynamics: a PyTorch implementation. J. Chem. Theory Comput. 19 (11), pp. 3209–3222.
[15] L. Lagardère, L. Jolly, F. Lipparini, F. Aviat, B. Stamm, Z. F. Jing, M. Harger, H. Torabifard, G. A. Cisneros, M. J. Schnieders, N. Gresh, Y. Maday, P. Y. Ren, J. W. Ponder, and J. Piquemal (2018) Tinker-HP: a massively parallel molecular dynamics package for multiscale simulations of large complex systems with advanced point dipole polarizable force fields. Chem. Sci. 9, pp. 956–972.
[16] D. Loco, L. Lagardère, S. Caprasecca, F. Lipparini, B. Mennucci, and J. Piquemal (2017) Hybrid QM/MM molecular dynamics with AMOEBA polarizable embedding. J. Chem. Theory Comput. 13 (9), pp. 4025–4033.
[17] D. Loco, L. Lagardère, G. A. Cisneros, G. Scalmani, M. Frisch, F. Lipparini, B. Mennucci, and J. Piquemal (2019) Towards large scale hybrid QM/MM dynamics of complex systems with advanced point dipole polarizable embeddings. Chem. Sci. 10 (30), pp. 7200–7211.
[18] P. Mazzeo, S. Hashem, F. Lipparini, L. Cupellini, and B. Mennucci (2023) Fast method for excited-state dynamics in complex systems and its application to the photoactivation of a blue light using flavin photoreceptor. J. Phys. Chem. Lett. 14 (5), pp. 1222–1229.
[19] R. McWeeny (1960) Some recent advances in density matrix theory. Rev. Mod. Phys. 32, pp. 335–369.
[20] A. M. N. Niklasson (2017) Next generation extended Lagrangian first principles molecular dynamics. J. Chem. Phys. 147 (5), pp. 054103.
[21] A. M. N. Niklasson, P. Steneteg, A. Odell, N. Bock, M. Challacombe, C. J. Tymczak, E. Holmström, G. Zheng, and V. Weber (2009) Extended Lagrangian Born–Oppenheimer molecular dynamics with dissipation. J. Chem. Phys. 130 (21), pp. 214109.
[22] A. M. N. Niklasson, C. J. Tymczak, and M. Challacombe (2006) Time-reversible Born-Oppenheimer molecular dynamics. Phys. Rev. Lett. 97, pp. 123001.
[23] A. M. N. Niklasson (2008) Extended Born-Oppenheimer Molecular Dynamics. Phys. Rev. Lett. 100 (12), pp. 123004.
[24] A. M. N. Niklasson (2020) Density-matrix based extended Lagrangian Born–Oppenheimer molecular dynamics. J. Chem. Theory Comput. 16 (6), pp. 3628–3640.
[25] A. M. N. Niklasson (2020) Extended Lagrangian Born–Oppenheimer molecular dynamics using a Krylov subspace approximation. J. Chem. Phys. 152 (10), pp. 104103.
[26] M. Nottoli, M. Bondanza, P. Mazzeo, L. Cupellini, C. Curutchet, D. Loco, L. Lagardère, J. Piquemal, B. Mennucci, and F. Lipparini. QM/AMOEBA description of properties and dynamics of embedded molecules. WIREs Comput. Mol. Sci. n/a (n/a), pp. e1674.
[27] M. Nottoli, B. Mennucci, and F. Lipparini (2020) Excited state Born-Oppenheimer molecular dynamics through a coupling between time dependent DFT and AMOEBA. Phys. Chem. Chem. Phys. 22, pp. 19532–19541.
[28] É. Polack, A. Mikhalev, G. Dusson, B. Stamm, and F. Lipparini (2020) An approximation strategy to compute accurate initial density matrices for repeated self-consistent field calculations at different geometries. Mol. Phys. 118 (19-20), pp. e1779834. ISSN 0026-8976.
[29] É. Polack, G. Dusson, B. Stamm, and F. Lipparini (2021) Grassmann extrapolation of density matrices for Born–Oppenheimer molecular dynamics. J. Chem. Theory Comput. 17 (11), pp. 6965–6973.
[30] J. W. Ponder, C. Wu, P. Ren, V. S. Pande, J. D. Chodera, M. J. Schnieders, I. Haque, D. L. Mobley, D. S. Lambrecht, R. A. DiStasio, M. Head-Gordon, G. N. I. Clark, M. E. Johnson, and T. Head-Gordon (2010) Current status of the AMOEBA polarizable force field. J. Phys. Chem. B 114 (8), pp. 2549–2564.
[31] P. Pulay and G. Fogarasi (2004) Fock matrix dynamics. Chem. Phys. Lett. 386 (4), pp. 272–278.
[32] J. A. Rackers, Z. Wang, C. Lu, M. L. Laury, L. Lagardère, M. J. Schnieders, J. Piquemal, P. Ren, and J. W. Ponder (2018) Tinker 8: software tools for molecular design. J. Chem. Theory Comput. 14 (10), pp. 5273–5289.
[33] C. C. J. Roothaan (1951) New developments in molecular orbital theory. Rev. Mod. Phys. 23, pp. 69–89.
[34] M. Rupp, A. Tkatchenko, K. Müller, and O. A. von Lilienfeld (2012) Fast and accurate modeling of molecular atomization energies with machine learning. Phys. Rev. Lett. 108, pp. 058301.
[35] H. B. Schlegel, J. M. Millam, S. S. Iyengar, G. A. Voth, A. D. Daniels, G. E. Scuseria, and M. J. Frisch (2001) Ab initio molecular dynamics: Propagating the density matrix with Gaussian orbitals. J. Chem. Phys. 114 (22), pp. 9758–9763.
[36] J. VandeVondele and J. Hutter (2003) An efficient orbital transformation method for electronic structure calculations. J. Chem. Phys. 118 (10), pp. 4365–4369.
[37] J. VandeVondele, M. Krack, F. Mohamed, M. Parrinello, T. Chassaing, and J. Hutter (2005) Quickstep: Fast and accurate density functional calculations using a mixed Gaussian and plane waves approach. Comput. Phys. Commun. 167 (2), pp. 103–128.
[38] L. Verlet (1967) Computer “experiments” on classical fluids. I. Thermodynamical properties of Lennard-Jones molecules. Phys. Rev. 159, pp. 98–103.
[39] V. Vitale, J. Dziedzic, A. Albaugh, A. M. N. Niklasson, T. Head-Gordon, and C. Skylaris (2017) Performance of extended Lagrangian schemes for molecular dynamics simulations with classical polarizable force fields and density functional theory. J. Chem. Phys. 146 (12), pp. 124115.
[40] R. Zimmermann (2019) Manifold interpolation and model reduction. http://arxiv.org/abs/1902.06502

S1 Determination of optimal $q$ and $\varepsilon$ values

The parameters $q$ and $\varepsilon$ of the QTR G-Ext method were determined by computing the error $\|\Gamma_{n}-\widetilde{\Gamma}_{n}\|$, averaged over the whole simulation, for different values of $q$ and $\varepsilon$, specifically $q=3,4,\ldots,20$ and $\varepsilon=0.001, 0.002, 0.005, 0.01, 0.02, 0.05$. We selected the combination $(q,\varepsilon)$ that gives the minimal average error.

Table S1: Optimal $q$ and $\varepsilon$ for each system and SCF tolerance.

	DMABN		3HF		AppA		OCP
SCF tolerance	$10^{-5}$	$10^{-7}$	$10^{-5}$	$10^{-7}$	$10^{-5}$	$10^{-7}$	$10^{-5}$	$10^{-7}$
$q$	5	4	5	4	5	4	5	4
$\varepsilon$	0.005	0.001	0.005	0.001	0.005	0.002	0.005	0.002
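
The selection procedure described in this section amounts to a grid search over $(q,\varepsilon)$. The following Python sketch illustrates it under stated assumptions: `select_parameters` and `toy_errors` are hypothetical names, and `toy_errors` is a synthetic stand-in for the per-step errors $\|\Gamma_{n}-\widetilde{\Gamma}_{n}\|$ that would in practice be collected from an actual BOMD run. It does not reproduce the data behind Table S1.

```python
from itertools import product
from statistics import mean

# Candidate grids as listed in Section S1.
Q_VALUES = range(3, 21)
EPS_VALUES = (0.001, 0.002, 0.005, 0.01, 0.02, 0.05)


def select_parameters(errors_by_params):
    """Return the (q, eps) pair with the smallest trajectory-averaged error.

    errors_by_params maps (q, eps) to the list of per-step errors
    ||Gamma_n - Gamma_tilde_n|| recorded along one simulation.
    """
    averaged = {params: mean(errs) for params, errs in errors_by_params.items()}
    best = min(averaged, key=averaged.get)
    return best, averaged[best]


def toy_errors(q, eps, n_steps=50):
    """Synthetic errors (assumption, for illustration only).

    Built so that the minimum sits at q = 5, eps = 0.005; a real workflow
    would replace this with errors measured during the MD simulation.
    """
    return [1e-4 * (q - 5) ** 2 + abs(eps - 0.005) + 1e-6 * (step % 3)
            for step in range(n_steps)]


errors = {(q, eps): toy_errors(q, eps) for q, eps in product(Q_VALUES, EPS_VALUES)}
best_params, best_error = select_parameters(errors)
print(best_params, best_error)
```

Averaging before taking the minimum mirrors the text: the criterion is the mean error over the whole trajectory, not the error at any single step.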

S2 Supplementary figures: Energy stability and number of SCF iterations

In the following, we report the total energy profile and number of SCF iterations per step for all the simulations performed in this work using the XLBO, G-Ext, QTR G-Ext, and TR schemes. The time-reversible scheme is obtained by computing the guess density as follows:

$$\widetilde{\Gamma}_{n}=-\widetilde{\Gamma}_{n-q}+\sum_{i=1}^{\widetilde{q}}\alpha_{i}\left(\Gamma_{n-i}+\Gamma_{n-q+i}\right). \tag{S1}$$

Eq. S1 is manifestly symmetric, and thus fully time-reversible. As mentioned in the main text, the fully TR scheme exhibits excellent stability but poor performance: the number of SCF iterations tends to increase quickly along the simulation, to the point that the extrapolation is no longer beneficial.
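
The symmetry of Eq. S1 can be checked numerically with a minimal Python sketch. The function name `tr_guess`, the coefficient values, and the scalar stand-ins for the matrices $\Gamma$ are assumptions made for illustration only; the coefficients $\alpha_{i}$ used in the actual simulations are not reproduced here.

```python
def tr_guess(guess_n_minus_q, converged, alphas):
    """Evaluate the right-hand side of Eq. S1.

    guess_n_minus_q : previous guess, Gamma_tilde_{n-q}
    converged       : [Gamma_{n-q+1}, ..., Gamma_{n-1}], oldest first
    alphas          : coefficients alpha_1 .. alpha_{q_tilde}
    Any objects supporting + and scalar * work (floats here; NumPy
    arrays would behave the same way).
    """
    total = 0.0 * guess_n_minus_q  # zero of the same type as the inputs
    for i, alpha in enumerate(alphas, start=1):
        # converged[-i] is Gamma_{n-i}; converged[i - 1] is Gamma_{n-q+i}
        total = total + alpha * (converged[-i] + converged[i - 1])
    return -guess_n_minus_q + total


# Illustrative values (assumptions): q = 5, q_tilde = 2.
alphas = (0.6, 0.4)
converged = [1.0, 1.5, 2.5, 2.0]
old_guess = 0.9

forward = tr_guess(old_guess, converged, alphas)
# Run the same rule on the time-reversed history, starting from `forward`.
backward = tr_guess(forward, converged[::-1], alphas)
print(forward, backward)
```

Because each term pairs $\Gamma_{n-i}$ with $\Gamma_{n-q+i}$, reversing the order of the history leaves the sum unchanged, so applying the rule backwards recovers the starting guess up to round-off.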

S2.1 DMABN

S2.2 3HF

S2.3 AppA

S2.4 OCP

S3 Supplementary tests: time step dependence

In the following, we report the results obtained for 20000 MD steps on the DMABN system with the QTR G-Ext and XLBO approaches using different time steps: 0.1, 0.25, 0.75, and 1 fs. For completeness, the tables also show the results for a time step of 0.5 fs. In these simulations, the SCF convergence threshold is $10^{-5}$. For the QTR G-Ext method, we estimated the optimal values of the parameters $q$ and $\varepsilon$ as explained in Section S1:

Table S2: Optimal $q$ and $\varepsilon$ for each time step.

time step (fs)	0.1	0.25	0.75	1
$q$	4	5	7	4
$\varepsilon$	0.01	0.002	0.001	0.01

Table S3: DMABN: Short- and Long-Time Stability Analysis of the QTR G-Ext and XLBO methods for molecular dynamics with different time steps. SCF convergence threshold: $10^{-5}$.

time step	0.1		0.25		0.5		0.75		1
	STF	LTD	STF	LTD	STF	LTD	STF	LTD	STF	LTD
QTR G-Ext	0.02	-0.04	0.09	-0.01	0.33	-0.01	0.82	-0.04	1.39	-0.14
XLBO	0.02	-0.14	0.08	0.00	0.32	0.01	0.74	0.10	1.34	0.01

Table S4: DMABN: Performance of the QTR G-Ext method compared with the XLBO algorithm for molecular dynamics with different time steps. Average $\overline{k}$ and standard deviation $\sigma$ of SCF iterations. SCF convergence threshold: $10^{-5}$.

time step	0.1		0.25		0.5		0.75		1
	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$
QTR G-Ext	2.95	0.65	2.66	0.81	3.04	0.22	3.68	0.61	4.02	0.20
XLBO	3.57	0.91	3.94	0.25	4.00	0.05	4.03	0.18	4.73	0.44
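
As a reading aid for Table S4, the relative reduction in the average number of SCF iterations obtained with QTR G-Ext with respect to XLBO can be computed directly from the tabulated $\overline{k}$ values. The short Python sketch below does only this arithmetic; the variable names are illustrative and no data beyond Table S4 is used.

```python
# Average SCF iterations per step from Table S4, keyed by time step in fs.
k_qtr = {0.1: 2.95, 0.25: 2.66, 0.5: 3.04, 0.75: 3.68, 1.0: 4.02}
k_xlbo = {0.1: 3.57, 0.25: 3.94, 0.5: 4.00, 0.75: 4.03, 1.0: 4.73}

# Fractional saving of QTR G-Ext relative to XLBO for each time step.
saving = {dt: 1.0 - k_qtr[dt] / k_xlbo[dt] for dt in k_qtr}

for dt in sorted(saving):
    print(f"time step {dt} fs: {100.0 * saving[dt]:.1f}% fewer SCF iterations")
```

With these numbers the saving is positive for every time step considered, and is largest at 0.25 fs.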

S4 Supplementary tests: lower SCF convergence threshold

In this section, we report the results obtained on the DMABN system with the QTR G-Ext and XLBO approaches when the SCF convergence threshold is set to $10^{-4}$. For the QTR G-Ext method, the parameters are $q=5$ and $\varepsilon=0.005$. For completeness, the tables also show the results for convergence thresholds of $10^{-5}$ and $10^{-7}$. We performed 10 ps BOMD simulations with a 0.5 fs time step.

Table S5: DMABN: Short- and Long-Time Stability Analysis of the QTR G-Ext and XLBO methods for molecular dynamics with different SCF convergence thresholds.

conv. threshold	$10^{-4}$		$10^{-5}$		$10^{-7}$
	STF	LTD	STF	LTD	STF	LTD
QTR G-Ext	0.37	-0.27	0.33	-0.01	0.37	0.01
XLBO	0.45	-0.00	0.32	0.01	0.32	0.13

Table S6: DMABN: Performance of the QTR G-Ext method compared with the XLBO algorithm for molecular dynamics with different SCF convergence thresholds. Average $\overline{k}$ and standard deviation $\sigma$ of SCF iterations.

conv. threshold	$10^{-4}$		$10^{-5}$		$10^{-7}$
	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$	$\overline{k}$	$\sigma$
QTR G-Ext	2.68	0.97	3.04	0.22	5.42	0.69
XLBO	3.86	0.75	4.00	0.05	7.51	0.65

S1 Determination of optimal $q$ and $\varepsilon$ values