Abstract
Recently, we have proposed a new diffusive representation for fractional derivatives and, based on this representation, suggested an algorithm for their numerical computation. From the construction of the algorithm, it is immediately evident that the method is fast and memory-efficient. Moreover, the method’s design is such that good convergence properties may be expected. In this paper, we commence a systematic investigation of these convergence properties.
Keywords:
fractional derivative; Caputo derivative; diffusive representation; numerical method; convergence MSC:
primary 65D25; secondary 26A33; 65L05; 65R20
1. Introduction
Fractional differential operators have proven to be very useful tools for the numerical modeling of various phenomena in science, engineering, and economics, cf., e.g., [1] and the literature cited therein. Since differential equations with fractional derivatives are very difficult, and often impossible, to solve exactly in closed form, there is a great need for efficient numerical methods.
A large number of different approaches has been proposed but many of these are not sufficiently fast or require too much memory whereas others are more efficient in these respects but their convergence behaviour is not well understood (see Section 2 for more details on this topic). In this paper, we pick up a recently developed approach [2] that is very efficient with respect to both runtime and memory requirements and that has explicitly been designed in a way that allows a thorough analysis. While [2] merely indicated that such an error analysis is possible, in the present paper we actually describe this analysis and its outcomes. Our main result, Theorem 1, contains an error estimate that clearly shows the influence of all the parameters on which the algorithm depends. Therefore, in combination with the results of [2], we now have a complete picture of the method’s properties, both from the theoretical and from the practical perspective.
We explicitly stress that this paper is exclusively devoted to (theoretical) error analysis. Numerical aspects of the algorithm under consideration have already been discussed in [2]; that paper in particular also contains some numerical examples. Therefore, it is not necessary to repeat these here.
2. A Diffusive-Representation-Based Numerical Scheme for Computing Fractional Derivatives
The topic of this paper is the analysis of a recently developed numerical method for the approximate calculation of fractional derivatives [2]. Specifically, we assume a function to be given with some and some , and the task is to compute the Caputo derivative of order of the function y ([3], Chapter 3) at some points , with and being the two end points of the interval of interest. The algorithm is based on the diffusive representation
where the function in the integrand on the right-hand side of Equation (1) is the solution to the initial value problem
cf. ([2], Theorem 1). In Equation (2), ; thus, if , i.e., in the case that is prevalent in most known technical applications of fractional derivatives, we have .
More precisely, there are two variants of the approach. Both variants use the quadrature formula
with
where and for denote the weights and nodes, respectively, of the K-point Gauss–Laguerre quadrature formula, i.e., the Gaussian quadrature formula for the weight function . The difference between the two methods is that, in order to compute the function values for some given via the representation (4), one of them uses the backward Euler formula for the solution of the initial value problem (2) whereas the other one employs the trapezoidal method.
The basic motivation for the development of this technique was to use it as a building block for a numerical solver for fractional differential equations. If this is done, then, like other numerical schemes based on diffusive representations or similar techniques (cf., e.g., [4,5,6,7,8,9,10]) this approach has some significant advantages over traditional schemes such as the fractional Adams method [11,12] or fractional linear multistep methods [13,14]:
- (1)
- The computational complexity is where N is the number of time steps over which the solution to the fractional differential equation is sought, whereas traditional methods usually have a cost of when implemented in a straightforward manner [11,13] or of or [15,16,17,18] if more sophisticated implementations are used.
- (2)
- Due to the completely different way in which the inherent memory of the fractional differential operators is dealt with, the active memory requirements of the method are only and not for the traditional methods or at best for their modified versions [15,19].
- (3)
- Whereas some (but not all) traditional schemes require the use of a uniform mesh, this approach gives the user complete freedom to use any discretization whatsoever of the interval on which the fractional differential equation is to be solved.
3. Convergence Properties of the Numerical Method
The paper [2] in which the abovementioned algorithm was developed contains some numerical results. It also provides a qualitative convergence analysis and a heuristic argumentation as to why the method should have satisfactory convergence properties, and it describes how to avoid certain potential pitfalls in the implementation of the algorithm in finite precision arithmetic. However, it does not provide a thorough convergence analysis in the quantitative sense, so we shall now address this matter here. To this end, we assume that the function values of are to be approximately computed at the points , , with . We then proceed as follows.
First of all, we note that our approximation formula takes the final form
where denotes the approximation for obtained by the ODE solver in question using the grid . It thus follows that the error of this approximation is
where
is the error induced by the numerical quadrature, and
is the error induced by the ODE solver.
Remark 1.
In the decomposition (6) of the total error, it is clear from Equation (7) that the component —as indicated by the notation—depends only on the function y whose fractional derivative we want to compute, the order α of this derivative, the number K of nodes of the Gauss–Laguerre quadrature formula that we use, and the point . Thus, there is an indirect dependency on the exact solution of the initial value problem (2) at the point but not on the numerical solution of this problem. Hence, for the analysis of , it does not matter by which numerical method and with which grid we solve this initial value problem.
The component , on the other hand, does not only depend on the ODE solver and its grid but also on the number K of quadrature nodes and on the location of these nodes because these quantities appear as parameters in the differential equation and thus have an influence on the ODE solver’s error.
Our main goal now is to analyze and estimate the expressions and under reasonable assumptions on the given data. The first result in this context reads as follows.
Lemma 1.
Let for some and some . Then, for all , we have uniformly for all that
as .
Therefore, we can conclude that the quadrature error converges to zero with a faster-than-algebraic rate.
For the proof of Lemma 1, we require an auxiliary result on the asymptotic behaviour of for that refines the findings of ([2], Theorem 1(e)):
Lemma 2.
Let for some and some . Then, for all , we have
and
The implied constants in the O-terms can be bounded uniformly for all and all .
Proof.
The claims for have already been shown in ([2], Theorem 1(e)). We thus only discuss the cases here. To this end, we first recall from ([2], Theorem 1) that
holds for any fixed . This implies, in particular, that for all w, and so the claims clearly hold in the case .
For the last factor in each summand, we can again use the Leibniz rule and find
with
and
It is then immediately clear that
so
Furthermore, by de l’Hospital’s rule,
Together with the fact that is monotonically decreasing in w, this shows that
Moreover, a straightforward mathematical induction yields for that there exist polynomials of degree such that
Denoting the polynomials by , it can also be seen via the induction process that the coefficients depend on in a continuous way and so they may be bounded in absolute value independently of t. Hence, we conclude
Thus,
and
because (since we only need to deal with those values of t for which ) the argument of each of the exponential functions is less than for sufficiently large w. Plugging (13) and (16) into (12), we then derive
and inserting (14) and (15) into (12), we find
The two claims of the Lemma then follow upon combining these two relations, both of which evidently hold uniformly for all admissible values of t and , with Equation (11) and the observation that with some constant . □
Remark 2.
Clearly, the bounds of Lemma 2 imply the slightly simpler relationships
and
uniformly for all and all that, however, provide weaker bounds than Lemma 2 itself if α is close to an integer.
Proof of Lemma 1.
We begin by noting that (this is an immediate consequence of the representation (9); see also ([2], Theorem 1(d))). Therefore, it follows from Equation (4) that . Moreover, defining the function for on by
we can see by definition of , cf. Equation (4), that . In addition, by the Leibniz formula we conclude
An application of Lemma 2 then tells us that, for arbitrary and ,
with some constant c that is independent of t and . Therefore, for all , so we may invoke ([20], Theorem 1) with any such r, and the claim of Lemma 1 follows. □
Remark 3.
An inspection of the proof of Lemma 1 reveals that the implied constant in the O-term of the final estimate cannot be bounded uniformly for all . This is due to the presence of the positive powers of and in the denominators of the right-hand side of Equation (17) and the fact that tends to zero for and that tends to zero for . The factor on the right-hand side of Equation (17) (that originates from the bounds of Lemma 2 in its present form) can only compensate a first power of these expressions but not a higher one. Whether or not a uniform bound can be obtained with a more sophisticated estimation technique remains an open question. The uniformity of the bound with respect to n is no problem though.
For the other component of the total error, we may also derive a bound. We assume here that the backward Euler formula is chosen as the numerical solver for the initial value problems.
Lemma 3.
Assume that the grid is uniform, i.e., that with . If and the differential equations are solved by means of the backward Euler method, then
with
Proof.
According to Equations (3) and (4), we need to solve the differential equation in Equation (2) for where
and
Let us introduce the notation
so that the differential equation in question takes the form
For any , we have and so, for all , we can see that
Therefore, in the terminology of ([21], Definition 8.58), the initial value problem satisfies an upper Lipschitz condition with constant , and hence the initial value problem is dissipative.
This property allows us to estimate the local truncation error of the backward Euler method in the following essentially standard way: By our smoothness assumption on the function y, a Taylor expansion shows that the local truncation error in the n-th step has the form
where is the approximation for (so that ) and . Hence,
in view of Equation (20). A rearrangement of terms then yields
Evidently, the left-hand side of this inequality is nonnegative, and so the right-hand side must be nonnegative too. Hence, we may replace both sides of the inequality by their respective absolute values without changing anything. Having done that, we may divide both sides by to obtain
In view of the chain rule, the concrete structure of the function f and the differential equation under consideration, we have
Using the explicit representation of given in Equation (9), it can be shown that the last expression in this inequality satisfies
Thus, we can continue the estimation of Equation (21) as
Since the second factor on the left-hand side is greater than 1, we deduce
for all admissible values of the parameters.
This estimate, together with the dissipativity of the differential equation, allows us to derive our claim from ([21], Theorem 8.68). □
Thus, combining Lemmas 1 and 3 with the fact that
(cf. [22], Equation (6.32.2)), we obtain the following overall error estimate:
Theorem 1.
If , if the grid is uniform and if the differential equations are solved using the backward Euler method, then, for all ,
as and/or , where .
Remark 4.
If the backward Euler method for solving the differential Equation (2) for is replaced by the trapezoidal method, a similar analysis may be performed. It is then possible to replace the factor h in the error estimate by , reflecting the higher order of the trapezoidal scheme, but this requires even stronger differentiability assumptions on the function y.
4. Comments and Further Remarks
The results indicated above give rise to a number of observations that raise new questions and indicate some directions for additional research work. We intend to address these issues in the future.
4.1. The Stiffness of the Differential Equation for the Integrand
A short look at the differential equation in the initial value problem (2) reveals that it is an inhomogeneous linear differential equation with constant coefficients (note that it is a dfferential equation with respect to the variable t, so the value w that arises in the coefficient of on the right-hand side is a constant for any given differential equation of this form). The Lipschitz constant of the function on its right-hand side is with some (cf. Equations (18) and (19)). Since for all k and , it is clear that , and so for . Therefore, the differential equations associated to these values of w have small Lipschitz constants and thus do not exhibit any stiffness which, in turn, means that they do not pose significant challenges for the numerical solvers.
For the case , however, the situation is completely different. Here, we encounter Lipschitz constants of up to . Bearing in mind the well known result ([22], Equation (6.32.8)) that
it is clear that this Lipschitz constant may be an extremely large number if is close to 1 (i.e., if with some and a positive number close to 0) or if K is large. Therefore, these differential equations may be extremely stiff and hence difficult to handle numerically. This, in fact, also explains why one should only use A-stable solvers for the differential equation.
4.2. The Error Bounds for the ODE Solver
The error estimates of Lemma 3 and hence also of Theorem 1 indicate that the error of the ODE solver depends on the maximum of the Lipschitz constants of the differential equations under consideration and hence, in view of the observation from Section 4.1, on the number K of quadrature nodes or, more precisely, on the location of the largest node of the quadrature formula in use. Although this is only an upper bound that in fact may drastically overestimate the true error ([23], p. 7), it nevertheless clarifies the importance of keeping the value as small as possible.
4.3. Choice of the Quadrature Formula
From the results of Mastroianni and Monegato [24] one can conclude that it might be useful to modify the quadrature formula in use in our algorithm. To be precise, they suggest truncating the summation in Equation (5) prematurely, i.e., letting the summation index k run only from 1 to some number with . This concept has two obvious advantages, namely, it reduces the computational cost and it improves the approximation quality of the ODE solver (due to the fact that the largest nodes of the quadrature formula, i.e., the nodes of which the associated differential equations have the right-hand sides with the largest Lipschitz constants and are thus most difficult so solve accurately—cf. Section 4.2—are left out). Intuitively, one is likely led into the belief that one has to pay for this improvement on the ODE solver component that this approach generates in terms of a loss of accuracy on the numerical integration component. It has been shown in [24], however, that, in reality, the convergence rate of the quadrature formula actually becomes better, at least when the number is chosen in a proper way.
4.4. The Smoothness of the Function y
The assumptions of Lemma 3 are relatively strong in the sense that they require some degree of smoothness of the function y whose fractional derivative we want to compute. If an algorithm of the type under consideration is to be used in the context of numerically solving fractional initial value problems of the form
with some , it is well known ([3], Theorem 6.27) that these smoothness properties can only be expected to be present in rare and exceptional situations. If the function is less smooth, then the convergence order of Lemma 3 and Theorem 1 cannot be achieved. This expectation was confirmed by the numerical experiments in [2] for this particular algorithm and in [25] for a different algorithm based on the same fundamental principle (i.e., on diffusive representations). A precise estimation of the convergence behaviour under less restrictive (and hence more realistic, in the context of solutions to fractional differential equations) conditions remains a topic for future research.
4.5. Diffusive Representations for Fractional Integrals
As a possible remedy for the problem discussed in Section 4.4, one may rewrite the given initial value problem (23) in the equivalent form ([3], Lemma 6.2)
where is the Riemann–Liouville integral operator of order with starting point a, find a diffusive representation analog to (1) for this integral operator and proceed in a corresponding manner. It is conceivable that the functions needing to be approximated in this approach possess more favorable smoothness properties, thus possibly leading to better convergence properties of the ODE solver.
4.6. The Discretization of the Interval
In the error analysis of Theorem 1, we had assumed a uniform discretization of the interval on which the fractional derivative of y was to be computed. However, as indicated above, the basic construction principle of our approach actually admits a completely arbitrary discretization . In such a case, the methods applied in the proof of Theorem 1 can still be applied and the same result may be obtained, wherein the parameter h must then be interpreted as .
If one chooses a graded mesh of the form with a mesh grading coefficient (most commonly actually with ), then we conjecture that it may be possible to obtain an error bound of the form given in Theorem 1 under weaker smoothness conditions on the function y.
5. Conclusions
In [2], a new algorithm for the approximate calculation of fractional derivatives of Caputo’s type has been developed. The algorithm is based on a novel diffusive representation of such operators. By construction, with respect to run time and memory requirements the method described in [2] behaves in the same way as other methods based on diffusive representations, i.e., in order to compute the fractional derivative at N points, the computational complexity is and the memory requirements are . This is optimal and, in particular, significantly smaller than the corresponding requirements of traditional methods which are not based on diffusive representations.
The main difference between the approach of [2] and other diffusive-representation-based schemes is that, also by construction, the numerical integration phase of the algorithm has been specifically designed in a way that allows to invoke traditional well understood, fast and highly accurate quadrature methods. There is no need to use ad hoc quadrature methods with many numerical parameters whose behaviour is hardly understood. Rather, the only parameters of the algorithm that have an influence on the error are the discretization grid and the number of quadrature nodes. The simplicity of this structure has enabled us in this paper to provide a comprehensive error analysis which clearly exhibits the influence of each parameter on the accuracy of the final result and which is much more complete than the error analysis that has been conducted for many other methods based on diffusive representations.
Combining the new error analysis developed here with the numerical results shown in [2], we can conclude that the algorithm described in Equation (5)—which is based on the new diffusive representation, comprising Equations (1) and (2)—provides a promising tool for the approximate calculation of Caputo-type fractional derivatives and that, as such, it is also a good candidate to be a fundamental building block in numerical schemes for solving fractional differential equations.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The author declares no conflict of interest.
References
- Diethelm, K.; Kiryakova, V.; Luchko, Y.; Machado, J.A.T.; Tarasov, V.E. Trends, directions for further research, and some open problems of fractional calculus. Nonlinear Dyn. 2022, 107, 3245–3270. [Google Scholar] [CrossRef]
- Diethelm, K. A new diffusive representation for fractional derivatives, Part I: Construction, implementation and numerical examples. In Fractional Differential Equations: Modeling, Discretization, and Numerical Solvers; Springer: Heidelberg, Germany, 2022. [Google Scholar]
- Diethelm, K. The Analysis of Fractional Differential Equations; Springer: Berlin, Germany, 2010. [Google Scholar]
- Baffet, D. A Gauss-Jacobi kernel compression scheme for fractional differential equations. J. Sci. Comput. 2019, 79, 227–248. [Google Scholar] [CrossRef] [Green Version]
- Hinze, M.; Schmidt, A.; Leine, R.I. Numerical solution of fractional-order ordinary differential equations using the reformulated infinite state representation. Fract. Calc. Appl. Anal. 2019, 22, 1321–1350. [Google Scholar] [CrossRef]
- Li, J.-R. A fast time stepping method for evaluating fractional integrals. SIAM J. Sci. Comput. 2010, 31, 4696–4714. [Google Scholar] [CrossRef] [Green Version]
- McLean, W. Exponential sum approximations for t−β. In Contemporary Computational Mathematics; Dick, J., Kuo, F.Y., Woźniakowski, H., Eds.; Springer: Cham, Switzerland, 2018; pp. 911–930. [Google Scholar]
- Singh, S.J.; Chatterjee, A. Galerkin projections and finite elements for fractional order derivatives. Nonlinear Dyn. 2006, 45, 183–206. [Google Scholar] [CrossRef]
- Yuan, L.; Agrawal, O.P. A numerical scheme for dynamic systems containing fractional derivatives. J. Vib. Acoust. 2002, 124, 321–324. [Google Scholar] [CrossRef] [Green Version]
- Zhang, W.; Capilnasiu, A.; Sommer, G.; Holzapfel, G.A.; Nordsletten, D. An efficient and accurate method for modeling nonlinear fractional viscoelastic biomaterials. Comput. Methods Appl. Mech. Eng. 2020, 362, 112834. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Diethelm, K.; Ford, N.J.; Freed, A.D. A predictor-corrector approach for the numerical solution of fractional differential equations. Nonlinear Dyn. 2002, 29, 3–22. [Google Scholar] [CrossRef]
- Diethelm, K.; Ford, N.J.; Freed, A.D. Detailed error analysis for a fractional Adams method. Numer. Algorithms 2004, 36, 31–52. [Google Scholar] [CrossRef] [Green Version]
- Lubich, C. Fractional linear multistep methods for Abel-Volterra integral equations of the second kind. Math. Comput. 1985, 45, 463–469. [Google Scholar] [CrossRef]
- Lubich, C. Discretized fractional calculus. SIAM J. Math. Anal. 1986, 17, 704–719. [Google Scholar] [CrossRef]
- Ford, N.J.; Simpson, A.C. The numerical solution of fractional differential equations: Speed versus accuracy. Numer. Algorithms 2001, 26, 333–346. [Google Scholar] [CrossRef]
- Garrappa, R. Numerical solution of fractional differential equations: A survey and a software tutorial. Mathematics 2018, 6, 16. [Google Scholar] [CrossRef] [Green Version]
- Hairer, E.; Lubich, C.; Schlichte, M. Fast numerical solution of nonlinear Volterra convolution equations. SIAM J. Sci. Stat. Comput. 1985, 6, 532–541. [Google Scholar] [CrossRef]
- Hairer, E.; Lubich, C.; Schlichte, M. Fast numerical solution of weakly singular Volterra integral equations. J. Comput. Appl. Math. 1988, 23, 87–98. [Google Scholar] [CrossRef]
- Diethelm, K.; Freed, A.D. An efficient algorithm for the evaluation of convolution integrals. Comput. Math. Appl. 2006, 51, 51–72. [Google Scholar] [CrossRef] [Green Version]
- Mastroianni, G.; Monegato, G. Error Estimates for Gauss-Laguerre and Gauss-Hermite Quadrature Formulas. In Approximation and Computation; Zahar, R.V.M., Ed.; Int. Ser. Numer. Math. 119; Birkhäuser: Boston, MA, USA, 1994; pp. 421–434. [Google Scholar]
- Plato, R. Concise Numerical Mathematics; American Mathematical Society: Providence, RI, USA, 2003. [Google Scholar]
- Szego, G. Orthogonal Polynomials, 4th ed.; American Mathematical Society: Providence, RI, USA, 1975. [Google Scholar]
- Iserles, A. A First Course in the Numerical Analysis of Differential Equations; Cambridge University Press: Cambridge, UK, 1996. [Google Scholar]
- Mastroianni, G.; Monegato, G. Truncated quadrature rules over (0,∞) and Nyström-type methods. SIAM J. Numer. Anal. 2003, 41, 1870–1892. [Google Scholar] [CrossRef]
- Diethelm, K. Fast solution methods for fractional differential equations in the modeling of viscoelastic materials. In Proceedings of the 9th International Conference on Systems and Control (ICSC 2021), Caen, France, 24–26 November 2021. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).