Abstract
A probabilistic approach is developed for the exact solution u to a deterministic partial differential equation as well as for its associated approximation performed by Lagrange finite element. Two limitations motivated our approach: On the one hand, the inability to determine the exact solution u relative to a given partial differential equation (which initially motivates one to approximating it) and, on the other hand, the existence of uncertainties associated with the numerical approximation . We, thus, fill this knowledge gap by considering the exact solution u together with its corresponding approximation as random variables. By a method of consequence, any function where u and are involved are modeled as random variables as well. In this paper, we focus our analysis on a variational formulation defined on Sobolev spaces and the corresponding a priori estimates of the exact solution u and its approximation in order to consider their respective -norm as a random variable, as well as the approximation error with regards to finite elements. This will enable us to derive a new probability distribution to evaluate the relative accuracy between two Lagrange finite elements and .
MSC:
65N15; 65N30; 65N75
1. Introduction
We recently proposed new perspectives on relative finite element accuracy [1,2] by using a mixed geometrical-probabilistic interpretation of the error estimate in the case of finite element approximation (see for example [3] or [4]) derived from the Bramble–Hilbert lemma [5].
In [6], we further extended the results we had derived in the case of to Sobolev spaces . To this end, we had to consider a more general framework, which mainly relied on the Banach–Nečas–Babusȟka abstract problem [7] devoted to Banach spaces.
This enabled us to obtain two new probability distributions that estimate the relative accuracy between two Lagrange finite elements and , (), by considering it as a random variable.
Thus, we obtained new results that show, among others, which of or is the most likely accurate, depending on the value of the mesh size h; this value is not considered anymore as proceeding towards zero, as in the standard procedure.
However, while obtaining these probability distributions, we only considered the standard error estimate dedicated to Lagrange finite elements, which approximates solution u to a variational problem , u being formulated in the present case in Sobolev space .
In the current study, we enrich the model published in [6] in multiple manners. Indeed, considering the functional framework in the case of the Sobolev spaces, we take into account the available a priori estimates one can deal with with respect to the solution of this kind of problem as well as for its approximation, together with the approximation error which corresponds to Lagrange finite elements (see Section 2.2). This will enable us to derive a new probabilistic model applied to the relative accuracy between two finite elements and . To this end, we also generalize the discrete probabilistic framework we considered in [1,6] by introducing continuous probabilistic formalism based on an appropriate density of probability (see Section 3.1).
The paper is organized as follows. In Section 2, we recall the mathematical problem we considered and introduce the basic definitions of functional tools to consider different estimations in Sobolev spaces. Section 3.1 is dedicated to the analysis of the relative finite elements accuracy based on a probabilistic approach. In Section 4, we detail the contribution of the a priori estimates and their interactions with the error estimate in the probability distributions we derived in Section 3.1. Concluding remarks follow.
2. The Functional and Approximation Frameworks and Their Corresponding Estimates
2.1. Abstract Problem in Banach Spaces and Corresponding Fundamental Results
In this section, we define an abstract framework that will enable us to consider the solution of a variational problem in Banach Sobolev spaces when and its corresponding approximation computed by Lagrange finite elements.
In order to conduct this, we follow the presentation of A. Ern and J. L. Guermond [7], where two general Banach spaces W and V are involved where V is reflexive. We also recall the different assumptions needed in order to apply the Theorem valid in Banach spaces.
Let be the solution of the variational formulation defined by the following:
where the following is the case:
- W and V are two Banach spaces equipped with norms denoted by and , respectively; moreover, V is reflexive;
- a is a continuous bilinear form on , i.e,with .;
- l is a continuous linear form on V, i.e, .
We further make the two following assumptions.
- (BNB1)
- (BNB2)
Then, one can prove the (BNB) Theorem ([7], Theorem 2.6), which claims that the variational problem (VP) has one and only one solution in W and that the following a priori estimate holds.
We also define the approximation of u solution to the approximate variational formulation:
where we assume that and are two finite-dimensional subsets of W and V, respectively.
Moreover, as observed in [7] (Remark 2.23, p.92), neither condition (BNB1) nor condition (BNB2) imply its discrete counterpart. Then, the well-posedness of (3) is equivalent to the two following discrete conditions.
- (BNB1h)
- (BNB2h)
If we further assume that = , a direct application of Theorem 2.2 in [7] enables us to write the following a priori estimate.
In the next section, we will apply these results to the particular case where the exact solution u belongs to Sobolev spaces and the approximation is computed by the help of Lagrange finite elements.
2.2. Application to Sobolev Spaces and the Corresponding Error Estimate
We introduce an open-bounded subset exactly recovered by a mesh composed by n-simplexes which respect the classical rules of regular discretization (see, for example, [4] or [8]). Moreover, we denote by h the mesh size of (the largest diameter in the mesh ) and the space of polynomials defined on a given n-simplex of degree less than or equal to k ( 1) by .
Henceforth, we assume that approximate spaces and satisfy = and that they are included in the space of functions defined on and composed of polynomials belonging to .
Finally, we also specify the functional framework of the abstract problem (VP) by introducing Sobolev spaces as follows.
For any integer and any , we denote by the Sobolev space of (class of) real-valued functions that, together with all their partial distributional derivatives of order less or equal to m, belongs to :
being a multi-index for which its length is given by , and is the the partial derivative of order defined by the following.
We also consider norm and semi-norms , which are, respectively, defined by the following:
where denotes the standard norm in .
Then, in order to fulfill the conditions of the (BNB) Theorem, particularly so that W is a Banach space and V a reflexive one, in the sequel of the paper, the following definitions of spaces W and V hold:
where m and are two non zero integers, and p and two real positive numbers that satisfy and such that the following is the case.
Regarding these choices, Sobolev’s space is a Banach space and is a reflexive one [9]. Moreover, we have: and .
We can now recall the a priori error estimate for Lagrange finite elements we derived in [6]:
where is a positive constant independent of h.
Remark 1.
Since we noticed that is included in , by considering for the topology of that was induced from thanks to the triangle inequality, (2)–(4) and (10) result in the following error estimate:
where is the duality of .
As one can observe, the right-hand side of (11) contains for which its dependency on h is usually unknown, except for particular cases. This is the reason why we will assume it is bounded from below by a positive constant δ independent of h:
This uniform boundedness property of , crucial to guaranteeing optimal error estimates [7], is valid in multiple cases (see Chapters 4 and 5 in [7] or [10]) but not systematically (see for example first-order PDEs in [7]).
Next, from (11), we obtain the following:
where .
In the sequel, we will denote by the following expression.
Finally, we also observe that there exists a critical value of h defined by the following:
such that the error estimate (12) can be split according to the following.
Based on this error estimate, the following section is devoted to deriving a probabilistic model applied to the relative accuracy between two Lagrange finite elements and when the mesh size h has a given and fixed value.
3. The Probabilistic Analysis of the Relative Finite Elements Accuracy
In [6], we proposed two probability distributions that enabled us to obtain an evaluation of the more likely accurate between two Lagrange finite elements and , . These distributions were essentially derived by considering the error estimate (10).
In the present paper, we generalize these probability distributions by introducing two new inputs, which are the following:
- The error estimate (10) is enriched by the a priori estimates (2) and (4) finally considers (15).
- The probabilistic approach we develop is an extension of those we considered in [1,6]. More precisely, by the help of an ad hoc density probability, we derive the probability distribution of a suitable random variable so that we can compare two approximation errors and , considered as random variables, as will be introduced now.
3.1. Random Solution and Random Approximations of Deterministic Partial Differential Equation
The purpose of this section is based of the following fundamental remark: the solution u to the variational problem (VP), except for particular cases, is totally unknown (being impossible to calculate it analytically); this motivates the numerical schemes one will choose to implement.
This inability to determine, in most cases, the exact solution u is mainly due to the complexity of the involved PDE’s operator; it indeed depends on complex combinations of integrals, partial derivatives and boundary conditions, as well as on the bent geometrical shape of the domain of integration . All of these ingredients hence participate in the incapability to analytically determine exact solution u as their relationship with u is inextricable and unknown.
As a consequence, this lack of knowledge and information regarding the dependency between these ingredients and solution u motivates us to consider u as a random variable, as well as any function of u. This paper is dedicated to the approximation error of , considered as a random variable.
In this frame, we view solution u with respect to variational formulation (VP) defined by (1) in the same manner as it normally is viewed to consider the trajectory and contact point with the ground of any solid body that is thrown, i.e., as random. Indeed, in this case, due to the lack of information concerning the initial conditions of the trajectory of the body, the solution of the concerned inverse kinematic operator is inaccessible and is, thus, observed as a random variable.
In the case analyzed in this paper, the situation is much worse. Indeed, we investigate a general variational formulation (VP) where the analog of the inverse kinematic operator is too complex to enable us to analytically determine the corresponding solution of (VP) by any mathematical expression. It is one of the reasons that motivate us to view solution u and its approximation as random variables, since the corresponding approximate operator conserves the complexity of the original one described above.
3.2. The Probabilistic Distribution of the Relative Finite Elements Accuracy
In the previous section, we expressed the reason for why we consider solution u and its approximation as random variables.
In order to complete the description of the randomness feature of approximation error , we also remark that since the manner a given mesh grid generator will produce any mesh is random, the corresponding approximation is random too.
For all of these reasons, based on the error estimate (12) and (13), we can only affirm that the value of approximation error is somewhere within interval .
As a consequence, we decide to view norm as a random variable defined as follows.
Let and p be fixed. We introduce the random variable defined by the following.
Thus, space product plays the role of the usual probability space introduced in this context.
Now, regarding the absence of information concerning the more likely or less likely values of norm within interval , we assume that random variable is uniformly distributed over interval with the following meaning.
Equation (18) means that if one slides interval anywhere in , the probability of event does not depend on where interval is located in , but only on its length; this corresponds to the property of uniformity of random variable .
Let us now consider two families of Lagrange finite elements and corresponding to a set of values such that .
The two corresponding inequalities given by (12) and (13), assuming that solution u to (VP) belong to , are as follows:
where and , respectively, denote and Lagrange finite element approximations of u and as defined by (13).
Remark 2.
If one considers a given mesh for finite element that contains that of , then for the particular class of problems where is equivalent to a minimization formulation (see for example [11]), one can show that the approximation error of the finite element is always smaller than that of , and is more accurate than for all values of the mesh size h.
Therefore, in order to avoid this situation, for a given value of h, we consider two independent meshes built by a mesh generator for and . Now, usually, in order to compare the accuracy between these two finite elements, one asymptotically considers inequalities (19) and (20) to conclude that, when h proceeds to zero, is more accurate than , since proceeds faster to zero than .
However, for a given application, h has a given and fixed value; thus, this method of comparison is not valid anymore. For this reason, our purpose is to determine the relative accuracy between two finite elements and for a fixed value of h corresponding to two independent meshes.
Moreover, since we chose to consider two random variables as uniformly distributed on their respective interval of values , we also assume that they are independent. This assumption is, once again, the result of the lack of information which led us to model the relationship between these two variables as independent, since any knowledge is available for more precisely localizing the value of if the value of was known and vice versa.
By the following result, we establish the density of probability of the random variable Z defined by the following: .
Theorem 1.
Let be the two uniform and independent random variables defined by (16) and (17): , where is defined by (13).
Then, random variable Z defined on has the following density of probability.
Proof.
Let us now remark that since the support of two random variables is , the support of density is, therefore, , which corresponds to (21).
Let us consider the case where .
If denotes the density of probability defined on associated to each random variable , since we assume that they are independent variables, density is given by the following:
where
and is the indicator function of the interval .
Furthermore, due to the definition (27) of the density for each variable the integrand of (26) can be expressed as follows:
As a consequence, , which leads one to consider the five following cases corresponding to the significant relative positions between intervals and :
- Let us assume that . If , then and , which is again a result of (21);
- We consider now the values of z such that and .If , then and by (26) we obtain the expected expression of (25) since we have the following.
- The following case concerns the values of z such that and . If , then and we obtain (23) since the following is the case.
- If and , then , which is impossible since .
- We consider now the values of z such that and .If , then and since we have the following:we obtain the expected expression of (22).
- Finally, if , then and , which corresponds to (21).
The other cases corresponding to can be deduced by using the same arguments. □
From Theorem 1, one can infer the entire cumulative distribution function defined by the following.
However, since we are interested in determining the more likely finite element between and , we focus the following corollary to the value of , which corresponds to .
Corollary 1.
Let be the two uniform and independent random variables defined by (16)–(17): , where is defined by (13). Then, we have:
Proof.
- —
- Let us consider the case where . Then, by definition (33) of the entire cumulative distribution function , at , we have the following:where we used the values of the density given by (22) and (23).
- —
- In the same manner, when , we have the following.
□
We can now explicate the probability distribution of event given by (34) and (35) as a function of the mesh size h.
To this end, we remark that, since each has two possible values depending on the relative position between h and defined by (14), the probability distribution we are looking for must be splitted in the corresponding cases as well.
Let us, hence, introduce constants defined by the following:
and the specific value of h which corresponds to the intersection of the curves defined by (see Figure 1).
Figure 1.
Relative positions between the curves and the line .
Then, we have the following.
We notice that and strongly depend on m and p, since and depend on these two parameters as well. As a consequence, in the following theorem, the different formulas of will contain this dependency on m and p.
Theorem 2.
Let be the two uniform and independent random variables defined by (16) and (17). Then, is determined by the following.
If , then and we have:
If , then and the following.
Proof.
In order to establish the proof of Theorem 2, we will consider a geometrical interpretation of error estimate (19) and (20).
These two inequalities can indeed be geometrically viewed with the help of the relative position between the two curves introduced above and the horizontal line defined by (see Figure 1).
Then, depending on the position of the horizontal line with particular value we have to consider the two following cases:
- —
- If , then ;
- —
- If , then .
Thus, let us consider the first case when or equivalently . Therefore, due to the relative positions between the three curves and , we have the following results:
- and with . Then, from (35), we obtain (40);
- and with and . Then, again from (35), we obtain (41);
- and . Then, from (34) or (35), we obtain (42);
- and . Then, again from (34) or (35), we also obtain (42).
We now consider the case where which corresponds to . Therefore, using once again the relative positions between curves and , we deduce the following results:
- and with . Then, from (35), we obtain (43);
- and with . Then, from (34), we obtain (44);
- and with and . Then, again from (34), we obtain (45);
- and . Then, from (34) or (35), we obtain (46).
□
Remark 3.
We notice that the probability distribution given by Theorem 2 generalizes those that we found in [1,6]. Indeed, when we derived probability distribution without taking into account the a priori estimates (2) and (4), we obtained the following.
In this case, we showed in [12] two numerical examples based on PDE’s formulation where exact solutions may be computed to illustrate probabilistic law (47) and (48).
In particular, we highlighted the statistical methodology that enabled us to determine a statistical estimator of the threshold given by (39).
Based on this study, one may adapt it in order to obtain applications that will illustrate the new probabilistic value we derived in Theorem 2.
Moreover, in Theorem 2, the contribution of a priori estimates (2) and (4) modifies the probability distribution given by (47) and (48), since the horizontal line interferes with the two polynomials (see Figure 1).
This phenomenon will be analyzed in the following section.
4. Discussion
First of all, let us plot the shape of the two cases of the probability distribution we derived in Theorem 2 and those we obtained in [1,6] that we recalled in (47) and (48).
As one can observe, the distribution (47) and (48) plotted in Figure 4 resembles more of the second case of Theorem 2 plotted in Figure 3 than its first case, as plotted in Figure 2.
Figure 2.
Probability distribution when .
In fact, in the latter case, when , line interacts with two polynomials before critical value . As a consequence, its contribution correspondingly takes place in the expression of the probability distribution for .
On the contrary, when , the contribution of line only has to be taken into account after the value of . However, in this case, when , once again line interacts with two polynomials and the equivalent distribution we found in (47) and (48) is cut to provide the corresponding one in (45) and (46) plotted in Figure 3.
Figure 3.
Probability distribution when .
This shows the importance of the a priori estimates (2) and (4), which are not considered in the classical point of view limited to the asymptotic behavior of error estimate (10), when the mesh size h proceeds to zero. The reason for this is the fact that the right hand sides of (2) and (4) do not depend on h; it does not bring any influence for the desired asymptotic behavior.
The goal of the probabilistic approach we developed in this paper is to evaluate, for any fixed value of the mesh size h, the relative accuracy between two Lagrange finite elements and . The probability distributions of Theorem 2 adjust those that we had found in [1,6].
Indeed, probability distribution (47) and (48) claim that finite element is more likely to be accurate than with a high level of probability when . Actually, this probability can reach the value of one, as it can be observed in Figure 4 and could be almost surely more accurate than . Thus, distributions (40)–(42) and (43)–(46) limit the value of this probability.
Figure 4.
Probability distribution published in [1].
More precisely, if , then the probability such that is more likely accurate than will never be greater than 0.5 (see Figure 2), and if , then for the corresponding probability will decrease to the limit value of 0.5, even if it increases with values greater than 0.5 for h values between and (see Figure 3).
5. Conclusions
5.1. The Relative Accuracy between Two Finite Elements
In this paper, we presented a generalized probability distribution that takes into account the three available standard estimates one can confront with respect to solution u to a variational formulation together with its approximation performed by Lagrange finite elements .
Those include the a priori estimates for solution u and its approximation carried out by finite elements, on the one hand, and the error estimate, on the other hand.
More precisely, by taking into account a priori estimates (2) and (4), we enriched probability distribution (47) and (48) that we obtained in [1,6] to find new ones (40)–(46), which were derived in Theorem 2.
As we observed in the previous paragraph, if the contribution of a priori estimates (2) and (4) are not considered when one limits the analysis of the relative accuracy between two finite elements to the asymptotic rate of convergence when the mesh size h proceeds to zero, one must deal with these two estimates in order to conclude about the more likely accurate one when h has a fixed value.
Moreover, regarding the probabilistic approach we developed in this paper, the a priori estimates (2) and (4) brought a more realistic behavior of distributions (40)–(46) in comparison with (47) and (48). Indeed, the probability given by (40)–(46) cannot reach a value of one since it was quite a surprise in (47) and (48) to obtain this theoretical value. This result meant that event “ is more accurate than ” is an almost certain event when .
5.2. Generalization to Other Numerical Methods
Finally, we would like to mention that the probabilistic approach we proposed in this study is not restricted to the finite elements method but may be extended to other types of approximation: given a class of numerical schemes and their corresponding approximation error estimates, one is able to order them not only in terms of asymptotic rate of convergence but also by evaluating the most probably accurate one.
It is, for example, the case of the numerical integration where the composite quadrature error has a mathematical structure, which looks similar to error estimate (10) we considered in the present study.
More precisely, as an example, for a composite quadrature of order k on a given interval , if is a given function, the corresponding composite quadrature error can be written [13] as follows:
where h denotes the size of the equally spaced panels that discretized the interval , are given numbers such that all belong to , and is a constant independent of h that mainly depends on f and k.
As a consequence, due to the similar mathematical structure between (10) and (49), with the same arguments we introduced to compare the accuracy between two Lagrange finite elements and one can evaluate the probability of the more accurate numerical composite quadrature associated to two different parameters and for a fixed value of h.
This will make sense because, normally, to bypass the lack of information associated with the unknown value of the left hand side of (49) in interval , only the asymptotic convergence rate comparison is concerned for evaluating the relative accuracy between two numerical quadratures of order and .
Nevertheless, this procedure is no longer available when one wants to compare two composite quadratures in the case when the size of the equally panels h is fixed, as it is for any application. Thus, the probabilistic approach we propose here could be a relevant alternative.
The same consideration may be developed in order to compare the accuracy between two numerical schemes of order and , which are performed to approximate the exact solution of ordinary differential equations. To make precise these ideas, let us consider solution y of the first order ordinary differential initial value problem defined on a given interval :
where is given.
Let us also restrict ourselves by considering one-step numerical methods to approximate function y solution to problem (CP).
Namely, if we introduce a constant mesh size , where denotes the sequence of values of t within the interval , then the corresponding one-step numerical scheme is given by the following:
where is a given function “sufficiently” smooth that characterizes the numerical scheme (CP)h.
Moreover, (CP)h is called a numerical scheme of order k if [13]:
where is a constant independent of h that depends on y, and k (see [14] for the dependency on k).
Thus, when considering two one-step numerical methods of order and defined by two functions and , and due to the similar structure between (10) and (52), one would be able to evaluate the probability of the more accurate scheme with the same arguments we implemented when comparing accuracy between and Lagrange finite elements.
In summary, when one wants to evaluate the relative accuracy between two numerical methods that belong to a given family of approximations, the probabilistic approach we propose in this study essentially depends on the ability to determine the constant , which appears in different corresponding approximation errors (10), (49) or (52).
Indeed, for each of these errors of approximation, the complexity of the constant will suggest appropriate investigations. In the current study, we were devoted to the finite elements method, and we pointed out the important role of the a priori estimates (2) and (4), which are usually neglected, since they do not bring any asymptotic information/behavior when mesh size h proceeds to zero.
As a consequence, in order to render probability distribution (40)–(46) derived in Theorem 2, one will have to consider appropriate techniques that will make the determination (or at least the approximation) of different constants that are involved in (40)–(46) possible. This is mainly due to the contribution of the three estimates (2), (4) and (10), which results in (12), namely, and .
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
The author wants to warmly dedicate this research to pay homage to the memory of Professors André Avez and Gérard Tronel, who broadly promoted my passion of research and teaching in mathematics.
Conflicts of Interest
The author declares no conflict of interest.
References
- Chaskalovic, J.; Assous, F. A new probabilistic interpretation of Bramble-Hilbert lemma. Comput. Methods Appl. Math. 2019, 20, 79–87. [Google Scholar] [CrossRef] [Green Version]
- Chaskalovic, J.; Assous, F. A new mixed functional-probabilistic approach for finite element accuracy. arXiv 2018, arXiv:1803.09552. [Google Scholar] [CrossRef]
- Ciarlet, P.G. Basic error estimates for elliptic problems. In Handbook of Numerical Analysis; Ciarlet, P.G., Lions, J.L., Eds.; North Holland: Amsterdam, The Netherlands, 1991; Volume II. [Google Scholar]
- Raviart, P.A.; Thomas, J.M. Introduction à L’analyse Numérique des Équations aux Dérivées Partielles; Masson: Issy-les-Moulineaux, France, 1982. [Google Scholar]
- Bramble, J.H.; Hilbert, S.R. Estimation of linear functionals on Sobolev spaces with application to Fourier transforms and spline interpolation. SIAM J. Numer. Anal. 1970, 7, 112–124. [Google Scholar] [CrossRef]
- Chaskalovic, J.; Assous, F. Explicit k-dependenc for Pk finite elements in Wm,p error estimates: Application to probability distributions for accuracy analysis. Appl. Anal. 2019, 100, 2825–2843. [Google Scholar] [CrossRef]
- Ern, A.; Guermond, J.L. Theory and Practice of Finite Elements; Springer: New York, USA, 2004. [Google Scholar]
- Ciarlet, P.G. The Finite Element Method for Elliptic Problems; Classics In Applied Mathematics; SIAM: Philadelphia, PA, USA, 2002. [Google Scholar]
- Brezis, H. Analyse Fonctionnelle—Théorie et Applications; Masson: Paris, France, 1992. [Google Scholar]
- Toundykov, D.; Avalos, G. A uniform discrete inf-sup inequality for finite element hydro-elastic models. Evol. Equ. Control Theory 2016, 5, 515–531. [Google Scholar] [CrossRef] [Green Version]
- Chaskalovic, J. Mathematical and Numerical Methods for Partial Differential Equations; Springer: Basel, Switzerland, 2013. [Google Scholar]
- Chaskalovic, J.; Assous, F. Numerical validation of probabilistic laws to evaluate finite element error estimates. Math. Model. Anal. 2021, 26, 684–694. [Google Scholar] [CrossRef]
- Crouzeix, M.; Mignot, A.L. Analyse Numérique des Équations Diffécentielles; Masson: Issy-les-Moulineaux, France, 1984. [Google Scholar]
- Stroud, A.H. Numerical Quadrature and Solution of Ordinary Differential Equations; Springer: Basel, Switzerland, 1974. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).



