1. Introduction
In the following paper, we discuss a couple of results obtained in [
1,
2] concerning a system of elliptic partial differential equations of interest in physics that present interesting mathematical challenges.
Over the past years, there has been great interest in the study of systems of equations that model the interaction of matter with electromagnetic fields. In fact many theories have been considered that incorporate matter equations, such as the Klein–Gordon or Schrödinger equation, coupled with equations of the electromagnetic field, for example the Maxwell equations, or even equations that provide a more accurate description of the electromagnetic field, like the Bopp–Podolsky or Born–Infeld equation. We do not enter into physical details here, but it is worth noting that the well-known Maxwell and Bopp–Podolsky equations can be seen as approximations of the more sophisticated Born–Infeld theory introduced in the seminal papers [
3,
4,
5,
6].
In essence, the coupling mentioned above of a matter field with the electromagnetic field is a model for a physical situation in which a charged particle interacts with its own electromagnetic field generated by the motion. This interaction can be rigorously described within the framework of gauge theories and involves, from a practical point of view, the replacing of the usual derivatives that appear in the Lagrangian with the so-called covariant derivatives or Weyl derivatives. Then, looking for a standing wave solution in equilibrium with its own electromagnetic field in a purely electrostatic situation, one arrives at an elliptic system of two coupled equations: the first equation is related to the matter field and the second one is related to the electrostatic potential. In fact, according to the theory considered for the electromagnetic field, different equations appear.
Recently, we have studied these types of systems and demonstrated the existence and multiplicity of solutions in various situations. The interested reader may refer to the pioneering paper [
7], starting from which many papers appeared: for example, the Maxwell and Klein–Gordon equations are combined in [
8,
9,
10,
11], the Born–Infeld and Klein–Gordon equations are considered in [
12,
13], the Schrödinger and Maxwell equations are considered in [
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24], and the Schrödinger and Bopp–Podolsky equations are considered in [
25,
26,
27,
28,
29]. These are just few references, since it is almost impossible to cite all the interesting papers on the subject, but they are sufficient to give to the reader an idea of the mathematics involved, the techniques developed and the surprising results obtained.
In this paper we treat the Schrödinger equation coupled with the electromagnetic field described by the Born–Infeld equation. The derived system has a Lagrangian , which provides a more accurate description of the physical situation compared to the Lagrangian employed in the Maxwell theory of the electromagnetic field, . In contrast, the electrostatic potential equation in the Born–Infeld theory is nonlinear, unlike the Maxwell theory, where the Poisson equation appears and is more studied and understood mathematically.
More specifically, our aim involves the search for standing waves of the Schrödinger equation that are in equilibrium with the electrostatic field generated by the motion within the Born–Infeld theory for the electric field. Then we are reduced to looking for solutions
to the following system of partial differential equations:
This system is referred to as the Schrödinger–Born–Infeld system. Here represents the nonlinearity, evidently of power type, which simulates the interaction between numerous particles. Of course solves the system, but we do not care about the zero solution.
In our discussions we will highlight the major difficulties in dealing with the above system.
Let us comment, however, a little bit on the meaning of the equations.
The first one is a nonlinear Schrödinger equation for u, the modulus of the wave function, in the presence of a potential . The point is that this electrostatic potential is not assigned and is not known a priori; in other words it cannot be classified as an external potential. In fact, it is generated by the wave function due to the fact that its “gradient” field has as a source: this is the second equation. Moreover the second equation is not linear in since a mean curvature operator in the Minkowski metric appears and is related, evidently, to the structure of the Born–Infeld Lagrangian. This is the first apparent difficulty. The system is accompanied with boundary conditions at infinity, which are quite natural and mean that the solutions vanish at large distances, as should be the case in any reasonable physical model.
For interesting papers about the mean curvature operator in the Minkowski metric as well the Born–Infeld equations, see, e.g., [
12,
30,
31,
32,
33,
34,
35,
36,
37,
38].
A second mathematical difficulty related to the problem is that the problem is situated in the entire space , where compact embeddings of Sobolev spaces are not valid anymore. From a variational point of view, this presents certain challenges in establishing the compactness of Palais–Smale sequences for the energy functional: it can happen that “quasi-solutions” escape at infinity. Indeed an indirect approach in order to control these quasi-solutions is implemented.
Despite these challenges, we find some existence results, and depending on the values of p two different approaches are developed.
2. Materials and Methods
As with many interesting physical phenomena, the problem under study is variational: this means that the equations are obtained as critical points of a certain functional. This is something more general than Maupertuis’s principle or the minimal action principle, according to which natural phenomena appear and are stable when the energy is minimal. In fact the solutions could be also other types of critical points of the energy functional: not only minima, but also saddle-like, and they give rise in this case to “less stable” phenomena.
The advantage of working with a variational system is that its solutions are exactly the critical points of the energy functional; then a problem in partial differential equations is transferred to a problem in Critical Point Theory. This theory deals with very abstract and general functionals and a robust theory is established, although continuous advances are being made by many authors around the world.
Also our system is variational: we can write a functional of two variables whose critical points give the solutions. What is not immediate is how to deal with a functional that is unbounded, so no notion of the minimum can be given. Nevertheless it can be reduced using a general method applied to this kind of problem to a functional I of a single variable, which is easier to deal with and for which we will find a minimum; in other words we will find a minimal-energy solution for the problem, or ground state. In order to apply the general theory to this new functional, some aspects about its geometry and compactness have to be explored. Generally speaking, the minimum, even for a functional that is bounded from below, is not guaranteed.
To this aim, an interesting notion is introduced in Critical Point Theory, which is the Palais–Smale condition: a smooth functional I is said to satisfy the Palais–Smale compactness condition if any sequence is such that is bounded and admits a convergence subsequence (eventually up to subsequences). From this it is very easy to check that, for example, the functional , although bounded below, does not satisfy the Palais–Smale condition: the sequence is such that is bounded, goes to zero, but is unbounded so cannot have any convergent subsequence.
Now this study is divided into two cases depending on the values of p. In fact the geometry of the functional is very different, and the estimate we find for is not valid for .
We say here that the range is natural for this kind of problem in order to have a well-defined energy functional. So we are able to cover the entire range of ’s; this is not trivial: many interesting problems are solved just for suitable values of the parameter p.
In both cases, we will argue by studying not directly our functional I, but a nearby one. More specifically, we perform the following:
We introduce a parameter , which measures in some sense the distance from the original functional to the new one;
We find critical points of this new functional, which then solve not exactly our problem but a near one;
We take these approximating solutions and show that they actually form a sequence converging to a solution of the original problem, or to a minimum of the original functional.
In showing the steps above we need to find suitable estimates, prove the right geometry of the functionals, and prove the right compactness properties in order to avoid, as we said before, the escaping of the quasi solutions. All these concepts will be made rigorous in the next sections, where the implementation of suitable modifications of the classical Mountain Pass theorem is carried out.
3. Results
Before stating the results, we show in which spaces the unknowns have to be found. They seems to be the right ones since they combine and merge the needs of the mathematical process (the rigor of the statement and the justification of the steps) and the physical result (its reasonability).
Looking at the first equation in the system, we see that the unknown u has to be found in the usual Sobolev space , namely the functions are square summable joint with their gradients. This is quite reasonable even from a physical point of view since the Hilbert space appears in many problems in Quantum Mechanics. The norm in this space is denoted by .
The difficult point is the understanding of the right space in which to find the second unknown
. In fact, looking at the second equation, it appears in a nonlinear fashion, although it is interesting from a mathematical point of view since it is the mean curvature operator in the Minkowski metric. In fact we are forced to find a different space in which to look at the unknown
. The right setting for the second unknown has to be
where
is the completion of
with respect to the
norm of the gradient. We denote by
the norm in
for
.
Some interesting properties of the space are listed below:
has continuous embedding in for every ;
has continuous embedding in ;
If , then ;
is weakly closed;
If is bounded, there exists such that, up to a subsequence, weakly in and uniformly on compact sets.
For a proof see Lemma 2.1 in [
33].
3.1. Statement of the Results
Once we have the functional framework, the definition of weak solution we give is the following. A couple
is a weak solution of (
1) if
for all
.
It would be interesting, however, to find such a solution, namely working in the spaces and . In fact, due to technical difficulties, we are able to find solutions in some subspace of them, where the functions have a symmetry property, which helps in the computations.
Then we introduce the spaces of radial functions
The main result we are going to discuss is the following.
Theorem 1. Let . Problem (1) possesses a radial ground state solution, that is, a solution whose energy is minimal among all the other nontrivial radial solutions. We point out that different approaches are used according to the values of or . In any case, the solutions we find, u and , satisfy the integral identities as stated above. But actually they are classical solutions, namely of class , and then the equations are also solved pointwise. The proof of this last fact relies on classical bootstrap arguments.
This paper is organized as follows. In
Section 3.2 some preliminaries are given. They are used to give the right variational framework for the problem. In
Section 3.3 we recall some well-known arguments that are used to prove Theorem 1 in case
. In
Section 3.4 we give some ideas of how to obtain the result stated in the case
. All the details, for the interested reader, are in [
1,
2].
3.2. Functional Setting and Preliminary Results
Our problem is variational in the sense that it comes from the action functional
F defined by
The verification of this is straightforward, and it happens that the partial derivatives of F give the operators appearing in the two equations. So we need to find critical points of F, and a first variational principle holds.
Proposition 1. The pair is a weak solution of (1) if and only if it is a critical point of F. In this way we are changing the problem of finding solutions to the system into one of finding critical points of a function of two variables that is differentiable. Then Critical Point Theory enters the game, and we can take advantage of the abstract results, which ensures the existence of critical points for functionals.
Some difficulties, however, arise now. First of all, the space is not a vector space, implying among other things that not all the directions are admissible for computing the derivative of the functional; even the notion of the critical point of F requires some care. Note that for this aim, radial symmetry is not yet necessary.
Moreover the functional F is unbounded from above and from below, in the sense that there is a sequence such that and, with a fixed function , a sequence such that .
The next step is then trying to deal with a better functional. This is performed by solving the second equation for every fixed
. In fact considering the functional
defined as
the following holds.
Lemma 1. For any fixed , there exists a unique such that the following properties hold:
- 1.
is the unique minimizer of and , namely - 2.
and if and only if ;
- 3.
If ϕ is a weak solution of the second equation of system (
1)
, then , and it satisfies the following equality:
Moreover, if , then is the unique weak solution of the second equation of system (
1).
As we can see, here enters radial symmetry: only in the radial case are we able to prove that the minimum of the functional
(which also exists in the non-radial setting) solves the second equation of (
1). Due to this fact,
u has to be radial (and in this case
also is).
By Lemma 1, we are able to define the following functional on
of a single variable by
to show that the functional
is not immediate since we do not know if the map
is. And in fact the smoothness of
I is proved by hand using the definition. See [
1] for the details.
The advantage now is that this functional is no longer strongly indefinite.
As we have seen, we have to restrict the functional to the subspace of radial functions
. The important point now is that
is a natural constraint for
I in the sense that
Indeed
I is invariant under the action of
on
, that is
This can be seen by using the previous Lemma, which gives
and so
by virtue of the uniqueness of the minimizer of
. Once we have
the invariance of
I is easy to check. The Palais Principle of Symmetric Criticality can be applied and implies that we can find critical points of
I on
. In other words we arrive at a second variational principle:
Proposition 2. If is a weak nontrivial solution of (1), then and u is a critical point of I. On the other hand, if is a critical point of I, then is a weak nontrivial (radial) solution of (1). Moreover, u is a ground state of I if and only if is a ground state of F.
Put in other terms, to find solutions of (
1) we are reduced to finding critical points of
I on
. Moreover the solution with minimal energy of the system is exactly the ground state of
I.
Observe that searching for critical points of
I means solving
With the aim of looking at critical points of
I, we immediately observe that, even in presence of an acceptable power nonlinearity, the behavior of the functional is not clear the under the rescaling
This means that classical scaling arguments are not applicable (as, e.g., for the Schrödinger–Maxwell system). This is indeed another difficulty in the problem.
However some advantages of working with variational problems is that we can find a relation that any solution has to satisfy. In Physics this is also called
Virial Theorem and in Mathematics the Pohozaev (or Derrick’s) Theorem: if
is a smooth solution of (
1), then the following identity is satisfied:
The proof of this identity is carried out by multiplying the first equation by
, the second one by
, integrating on a ball
, and making standard computations. Of course the boundary terms that appear have to be carefully estimated. From a purely formal point of view, it can also be derived by the equation
We omit the technical details here.
Let us describe now the approach we use to find critical points of I. In fact it is based on variants and on suitable modifications of Mountain Pass-type arguments.
3.3. Sketch of Proof of Theorem 1: The Case
In order to find critical points of I we have to study its properties, especially with regard to the geometry and the compactness. The next technical lemma is useful for studying the shape and the behavior of the functional I. We also present the short but interesting proof. This result permits control of the gradient of with the norm of the same u. This will be useful later; however it is an independent result that is interesting on its own.
Lemma 2. Let q be in Then there exist positive constants C and such that, for any , we have Here we denote by the critical Sobolev exponent related to q, and is its conjugate exponent, namely Proof. Since
and
we have
so, by (
2) and
, we have
which gives the conclusion. □
For the case
the proof of the main result uses the following monotonicity trick from Struwe [
39,
40].
Proposition 3. Let be a Banach space and an interval. Consider a family of functionals on X defined bywith B being non-negative and either or as such that . For any , we set Assume that for every , the set is non-empty and Then for almost every , there is a sequence such that
- (i)
is bounded in X;
- (ii)
, as ;
- (iii)
in the dual space of X, as .
The importance of the previous result is clear: it allows us to obtain a critical point of our original functional I by means of “approximation”. In other words, instead of dealing with the functional I we work with a slight perturbation for which the proposition furnishes a bounded Palais–Smale sequence and then a critical point . Of course the critical point does not need to solve our equation, but just an approximated one, due to the presence of the parameter (see below).
The great advantage of this procedure is that our “approximating sequence” satisfies the identity , which proves very useful during the computations.
In our specific case
and by (
2),
as
.
Then our aim is to look for bounded Palais–Smale sequences of the following perturbed functionals:
for almost all
near 1. The idea is that for
we recover the functional
I.
At this point we need to show that the above abstract Proposition 3 can be applied. And this fact is a consequence of the next result.
Proposition 4. The following facts hold:
- (i)
for every ;
- (ii)
For every , there is and , suitably small, such that , for all , with . It follows that .
Proof. We just show the first point
. Indeed for a fixed
and
, by Lemma 2 and for
and standard inequalities we have
So, whenever
and
, we get
From the fact that
, it is possible to find
such that, for
t sufficiently large,
. □
Then by a standard but straightforward proof we obtain the following.
Proposition 5. For almost every , there exists , , such that and .
With this procedure we have found a nontrivial solution
of the following perturbed (namely depending on
) equation:
for almost any value of
near 1. This is not exactly (
3); indeed we need to pass to the limit as
in order to recover a nontrivial critical point for
I. This is performed by using the relevant fact that
satisfies the equation
and so Nehari- and Pohozaev-type identities are available
Indeed as we write down and combine the above identities together and then pass to the limit as we obtain a nontrivial critical point of I.
Up to now, we do not know if it is actually a ground state for
I. But we define
The above infimum is strictly positive. In fact, any
satisfies
and therefore
meaning that the set
is bounded away from zero. However since
for all
we conclude. As a final step, one easily shows that the infimum is achieved, proving the existence of a ground state solution
for (
3). Having
, we recover
as given by Lemma 1, namely as the unique minimizer of
.
3.4. Sketch of Proof of Theorem 1: The Case
The case is more involved and indeed a different approach is needed. In fact just the monotonicity trick proves not to be useful, since it was based on the fundamental Lemma 2 used at the end of the proof of Proposition 4.
We now use a different approximating problem: instead of (
4) we consider the following one:
where
As we can see, two parameters appear in the perturbation. We naturally have the associated energy functional
where
I is the original functional defined before. As a first step we aim to find a solution of the above perturbed equation (or a critical point of
) and then pass to the limit as
, instead of
.
For such a functional the following properties hold.
There is a Mountain Pass Geometry for , which is uniform (i.e., independent) in . This is something like Proposition 4 of the previous case, but with a proof that is a little bit more involved. After all, two additional terms in the functional are present with respect to the previous functional.
The Mountain Pass level
satisfies
Indeed this is a fundamental step in order to take the limit, since we do not want to go to zero; otherwise we have the risk of finding the zero solution.
satisfies the Palais–Smale condition: it is important to avoid escaping at infinity, as we commented before.
The result of these facts are very technical and quite long. We do not think that they are necessary here, but the interested reader can refer to [
2]. We just say that in proving such properties, the “new” perturbation, which involves
and
q, has been extensively used. We suspect that also in the first case we could have used the double-parameter perturbation, but we would have obtained the same result without any real advantage and at the expense of a more complicated proof.
Then, with the Palais–Smale condition also satisfied, there is an analogous of Proposition 5.
Proposition 6. For any , there is , , such that and .
The nontrivial part now is to show that the family gives a bounded Palais–Smale sequence for the original functional I. In fact after some computations, one effectively shows the existence of a critical point for I at the level .
Finally, the fact that I possesses a ground state is addressed as before.
Remark 1. Actually in [2] we considered a nonlinearity that is still more general. Indeed we studied the problemwith the following assumptions on the nonlinearity f: - 1.
and ;
- 2.
for ;
- 3.
For any , , where and .
They are quite natural when one deals with variational methods. Actually we also obtain a multiplicity result of solutions; in fact the problem admits infinitely many solutions in which the energy functional tends to infinity. In this case a symmetric version of the Mountain Pass theorem is used.
Finally it is worth noting that the critical case is also treated. However a further assumption due to compactness issues is necessary:
- 4.
There exist and such that for .
We prove then that under the set of assumption (1)–(4) the system has a ground state solution if (i) , or (ii) and D is sufficiently large.
4. Discussion
The results discussed here are interesting since they show how many techniques can be combined to solve an elliptic system: variational tools, fine estimates, and the perturbation approach. The results that this paper discussed are new; however some open questions remain, like the following:
How to treat the nonradial case?
What about the problem with a priori given the norm of u?
What about different kinds of nonlinearity, like, e.g., of logarithmic type?
We think they are all challenging and nothing is known in this direction.
5. Conclusions
We have seen how an approximating procedure can be useful in different contexts. Actually the type of approximation is dictated by the problem or by its difficulties. We have to say that in many problems, an approximating procedure can be implemented and usually depends on the nature of the problem. We have seen that just the range where the parameter p varies led us to choose different approximations of the original problem.
In our case, to prove the result in the case we have approximated in some way the problem with a parameter , for which it was easier to find a solution. Then sending to 1, we recovered the original problem and its solution. However in doing that, an auxiliary result, not available for the second case, , was used. And in fact the approximating procedure for this second case, , is very different and indeed more complex, and for this reason we have skipped many details.