Next Article in Journal
A New Identity Relating Weighted Sums of Primes and Sums of Reciprocals of Riemann Zeros
Previous Article in Journal
Compact Finite Difference Schemes: A Review of Fundamentals, Applications, and Practical Implementation
Previous Article in Special Issue
Zeno and Anti-Zeno Effects in Dark-State Dynamics Under Thermal Dephasing: A Numerical Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Foundations of Quantum Mechanics: Generalizations of the Mathematical Axiomatic Derivation of the Schrödinger Equation

by
Olavo L. Silva Filho
Institute of Physics, University of Brasilia, Brasilia 70910-900, Brazil
Current address: Central Institute of Sciences, Darcy Ribeiro Campus, University of Brasilia, Entrance BT 297, Asa Norte, Brasilia 70910-900, Brazil.
Mathematics 2026, 14(11), 1961; https://doi.org/10.3390/math14111961
Submission received: 6 May 2026 / Revised: 28 May 2026 / Accepted: 30 May 2026 / Published: 3 June 2026
(This article belongs to the Special Issue Mathematics Methods in Quantum Physics and Its Applications)

Abstract

An axiomatic approach to quantum mechanics that has, as a theorem, the Schrödinger equation may be of enormous value to cope with interpretation issues, since all the interpretation constructs must be present in the axioms, or directly derived by them from their mathematical unfolding. Thus, it is critical to show that this axiomatic derivation is reliable beyond any possible doubt. To show this, it is possible to make generalizations and extensions of the axioms to derive the Schrödinger equation in the underlying generalized or extended formats. In previous papers, we have shown that the axiomatic approach we propose can be used to derive the Schrödinger equation as a direct axiom. Since then, we have also shown that it was possible to generalize that derivation to coordinate systems other than the Cartesian, as well as its relativistic extensions that lead to the relativistic wave equations. An extension to dissipative systems was also performed, allowing us to mathematically derive the Caldirola–Kanai equation from first principles. All these derivations were performed using pure states and in the absence of the electromagnetic field. This means that we can further generalize the approach to embrace these two possibilities. Being an axiomatic approach, we show that we need only to slightly modify the axioms to derive the Schrödinger equation for these two contexts. Despite being quite direct, the algebraic complexity of these derivations should give the reader the desired confidence in the proposed axioms.

1. Introduction

It is known that there are a huge number of types of interpretations of the Schrödinger equation, most of them incompatible with each other. They all assign different meanings to the function ψ ( x , t ) . Some, as in Bohm’s approach [1,2], call it the generator of a concrete quantum mechanical field; others, such as Ballentine [3,4], call it a representative of an ensemble of identical physical systems; some, such as Everett/DeWitt [5,6,7], say that it shows the necessity of considering the outcomes of experiments in different universes. We also have the suggestion that ψ ( x , t ) represents consistent histories [8,9]; Bayesian calculations [10,11]; or it is assumed, as Copenhagen does [12,13], to be part of a reduction mechanism in the act of observation because it represents single systems. The function ψ ( x , t ) may also be considered as codifying averages taken from a stochastic support [14,15,16]; these are but a few interpretations spanning almost a century.
Perhaps the issue of the interpretation of the Schrödinger equation would be clarified if one took a “step back” and presented a set of axioms by means of which the Schrödinger equation becomes a theorem, such that the interpretation of its symbols would be a mere extension of interpretations already contained in the axioms. This “step back” can be rephrased as the need for a mathematically sound quantization process.
Indeed, if one wants to pass from the classical to the quantum domain, one uses the rules
p p ^ = i ħ ; x x ^ = x .
This is the heuristic of the quantization process currently accepted. These rules come with some limitations with respect to their generalizations. For example, generalization to general curvilinear coordinates is not prescribed, and we always adopt the strategy of quantization in Cartesian coordinates, as in (1), and then change the differential operators to the desired curvilinear system [17,18]. We want to move from a heuristic approach to a formal, mathematically sound process.
In previous papers [16,19,20], we have found a different quantization process based on two fairly simple axioms, of which the Schrödinger equation is just a theorem. These axioms must, of course, pass all generalization tests if we want to assume their adequacy. For example, if we generalize them to a configuration space of greater dimension, they must yield the Schrödinger equation in this new context. Furthermore, they must allow us to derive the Schrödinger equation in any coordinate system by simply writing these axioms in the desired system. At the outset, the axioms must give us the relativistic equations when written in their relativistic generalization [21]. All these results have already been mathematically shown to be the case [16,19,20,21]. We have also shown that our derivation is equivalent to Feynman’s path integral, except that it is carried out in phase space with the Hamiltonian, rather than in configuration space with the Lagrangian. We have also shown that our derivation is equivalent to stochastic derivations [22].
An axiomatic approach may present many advantages. One of them is uncovering unsuspected relations. Thus, with the characteristic function derivation, we have shown that the Bohr–Sommerfeld rules are inscribed in quantum mechanics’ formalism [23]. The usual criticisms presented for this approach were as follows: (a) it does not take half-integral numbers into account; and (b) it cannot be applied to some physical systems, such as the Helium atom. We have mathematically shown that these rules are a direct consequence of the axioms (they are theorems), just as the Schrödinger equation is, and that the half-integral numbers appear naturally in this mathematical context. Moreover, we have also shown that these rules can be used only for systems that present some spatial symmetry (which the Helium atom does not present) [23]. This is a fairly unsuspected result. Another unsuspected result was the mathematical connection between the Schrödinger equation and the central limit theorem, which was clarified in [24].
Furthermore, in paper [16], we proposed a new stochastic interpretation that crucially depends on the strength of the axioms. In fact, in paper [25], we explicitly advanced an epistemological constraint that argues that the better any interpretation is, the closer it keeps its semantic propositions to its syntactic apparatus; that is, its formalism.
Thus, strengthening our confidence in the axioms ensures the adequacy of our interpretation. We may now strengthen our confidence in the axioms if we show two other generalizations that were not previously envisaged: the extension of the derivation of the Schrödinger equation to deal with electromagnetic fields, known as “minimal coupling”, and the extension of this derivation to encompass superpositions, since all previous derivations were based on pure states. As will be clear in what follows, both these derivations are very algebraically involved, and the fact that they provide us with the correct derivation of the underlying Schrödinger equations must increase our confidence in the adequacy of the axioms, which, in turn, should improve our confidence in the interpretations already proposed in [16].
The objective of this paper is to present the two generalizations mentioned above: a mathematical derivation of the Schrödinger equation related to the prescription of minimal coupling for electromagnetic potentials, and the generalization of the derivation for superposed states, all of which are expressed in terms of the generalization of the axioms presented in [19,20].
The paper is organized as follows: in the next section, we briefly present the usual derivation process using pure states in one dimension. In the third section, we generalize the axioms presented in section two to show that they allow us to derive the Schrödinger equation in its minimum coupling format. In section four, we present the generalization of the axioms to cope with superposed states. In the last section, we present our conclusion.

2. Previous Derivation of the Usual Schrödinger Equation

In a previous paper [19,20], we mathematically derived the Schrödinger equation from two simple axioms, which were stated as
Axiom  1.
The marginal characteristic function of the phase space probability density function F ( x , p ; t ) , defined by
Z x , δ x ; t = + F x , p ; t exp i ħ p · δ x d p ,
where ℏ is a universal parameter with dimensions of angular momentum, is such that it can be written as
Z x , δ x ; t = ψ x δ x 2 ; t ¯ ψ x + δ x 2 ; t ,
and should be expanded to second order in the real parameter δ x .
Axiom  2.
For an isolated system, the joint phase space probability density function related to any quantum-mechanical phenomenon obeys the Fourier-transformed Liouville equation
exp i ħ p · δ x d F x , p ; t d t d p = 0 ,
to second order in δ x .
Before proceeding with the derivation, it is interesting to comment on some important issues related to these axioms. First of all, Equation (3) may give the impression to the reader that this is just the usual Wigner phase space distribution approach. However, the two approaches are different in some aspects: Wigner departs from the representation for the phase space distribution, given as the Fourier inversion of (2), where it is already assumed that the functions ψ are solutions of the Schrödinger equation. From this expression, one can show that the distribution must satisfy a Wigner–Moyal equation in phase space, which resembles the Liouville equation but has terms of order greater than or equal to ħ 2 in an infinite expansion. The present approach takes the inverse path: it defines the characteristic function in (2) as the Fourier transform of the probability density function (not a distribution, since it is positive-definite) — in this sense, it begins from the “classical side” and postulates that this characteristic function must be expanded to second order in δ x , something completely absent from Wigner’s approach. From this assumption and the assumption of Axiom 2, we can mathematically derive the Schrödinger equation. The assumption of an expansion to second order in δ x is crucial for the derivation process. Moreover, one can show [19] that this approach is mathematically equivalent to Feynman’s path integral derivation of the Schrödinger equation exactly because the expansion to second order in δ x corresponds to an expansion to second order in ϵ = δ t in Feynman’s formalism. The difference between this approach and Feynman’s is that the present one is conducted in phase space, with the Hamiltonian function, while Feynman’s is carried out in configuration space, with the classical Lagrangian, as we have already shown [16,19]. As a last comment, Wigner’s approach gives us a phase space distribution function that can present negative values for some phase space regions, while the present approach provides a positive definite phase space probability density.
Moreover, it is of utmost importance to note that, just as with Feynman’s approach (for the time), the present expansion to second order in δ x is not an approximation. In our case, we were able to show this result mathematically. Indeed, we have shown in [26] that this expansion to second order in the characteristic function and its representation as in (3) implies that quantum mechanics has the central limit theorem implicit in it. In other words, the function F ( x , p ; t ) is represented, for each x in configuration space, as a Gaussian function. Thus, the present derivation is mathematically exact.
As a result of these developments, the characteristic function becomes the density matrix (for a pure state, a 1 × 1 “matrix”) in the space representation, as its expression clearly indicates. In fact, Equations (2) and (3) give the method to obtain averages using the probability density function F ( x , p ; t ) . For example, one can simply take the “trace” of this 1 × 1 matrix by making δ x = 0 to obtain the probability density function on configuration space, as can be easily verified from the definition (2). The generalization for superposed states, to be made later in this paper, will introduce a full density matrix.
Thus, the derivation of the Schrödinger equation is as follows. We can write (4) as
d F ( x , p ; t ) d t = F t + p m · F x V ( x ) x · F p = 0
The equation for the characteristic function is obtained by applying the Fourier kernel in (4) and using that
e i p · δ x / ħ F t d p = Z t e i p · δ x / ħ F d p = Z t e i p · δ x / ħ p m · F x d p = i ħ m 2 x ( δ x ) e i p δ x / ħ F d p = i ħ m 2 Z x ( δ x ) e i p · δ x / ħ V ( x ) x · F p d p = V ( x ) x · e i p · δ x / ħ F p d p = i ħ δ x V ( x ) x Z ,
where we used integration by parts in the last integration and the fact that F ( x , p ; t ) = 0 , which means that the divergence of it in the entire momentum space is zero. Note that we are using / x = x and / p = p , as usual. If we multiply the terms in (6) by i ħ and use (5), we obtain the equation for the characteristic function as
ħ 2 m 2 Z x δ x + δ x · V x Z = i ħ Z t .
We then write
ψ ( x ; t ) = R ( x ; t ) exp i S ( x ; t ) ħ ,
and use this result and the expression (3) expanded to the second order in δ x , as imposed by the Axiom 1, to write
Z x , δ x ; t = R 2 + δ x 2 2 R 2 R ( R ) 2 exp i δ x ħ · S .
We now substitute this expression into Equation (7) and separate the real and imaginary parts to obtain the equations
ρ x ; t t + · p x ; t m ρ x ; t = 0 ,
and
S ( x ; t ) t + 1 2 m S ( x ; t ) 2 + V x ħ 2 2 m R x ; t 2 R ( x ; t ) = 0 ,
where
ρ ( x ; t ) = R ( x ; t ) 2 ; p ( x ; t ) = S ( x ; t ) .
are the probability density function and the average momentum, respectively. Equations (10) and (11) are known to be equivalent to the Schrödinger equation, since we obtain these equations if we write the probability amplitude as in (8) and substitute it into the Schrödinger equation, separating the real and imaginary parts. In fact, this is just Bohm’s decomposition, introduced in 1952 [1,27].
Let us now extend this derivation to encompass the electromagnetic potentials from a mathematically sound, axiomatic approach.

3. Electromagnetic Fields and Minimal Coupling

If one is now interested in making a quantization process that encompasses electromagnetic fields, in what is called the minimum coupling approach, the usual approach prescribes that it is only necessary to write rules (1) as follows:
p p ^ = i ħ q A ( x ; t ) ; x x ^
where A ( x ; t ) is the vector potential. In this case, the potential in the Schrödinger equation is simply the electrostatic potential ϕ ( x ; t ) .
The rationale for applying these rules comes from writing the Hamiltonian classical function in terms of the electromagnetic potentials. Thus, one starts by calculating the Lagrangian L of a particle in an electromagnetic field [28]. The acting force is the Lorentz force
f = q ( E + v × B )
Since F depends on velocity, we have to find a generalized potential U that satisfies the following equation
f j = U x j + d d t U x ˙ j .
From Maxwell’s equations B = × A , × E = B / t , we receive
E = ϕ A t .
Now, by plugging our results into the Lorentz force equation and carrying out some vector calculus, we end up with
f = q ϕ ( v · A ) d A d t .
where we make use of the fact that
v × ( × A ) = ( v · A ) ( v · ) A d A / d t = ( v · ) A + A / t
To obtain the generalized potential U, a final observation is needed, namely,
d A d t = d d t v ( v · A q ϕ ) ,
which is true since the electrostatic potential ϕ does not depend on the velocity.
Comparing
f = ( q ϕ q ( v · A ) ) + d d t v ( v · A q ϕ ) ,
with the equation for the generalized potential, we receive
U = q ϕ q ( v · A ) ,
which allows us to write down the Lagrangian as
L = T V = 1 2 m v · v U
To derive the minimal coupling Hamiltonian, one has to transform the classical kinetic momentum m v to the canonical momentum
p = L v = m v + q A .
In quantum mechanics, the kinetic momentum corresponds to the momentum operator p ^ , so the canonical momentum operator becomes p ^ q A . Thus, the Hamiltonian may be obtained by performing the Legendre transformation on L
H = p · v L = 1 2 m m v q A 2 + q ϕ .
Being able to write down the Schrödinger equation for the electromagnetic potentials is particularly important given the role that these potentials play in quantum mechanics [29]. In the next subsection, we provide this derivation.

3.1. Derivation of the Schrödinger Equation for the Minimal Coupling

In what follows, we will present two different (and not equivalent) ways to derive a Schrödinger equation from the two basic postulates of the derivation of Section 2, now extended to include the electromagnetic potentials (or fields). This, of course, implies the question of which one is correct. We will show that the usual way of postulating the minimum coupling in quantum mechanics is not correct, and we will present the correct one.

3.2. Derivation Leading to the Usual Result

Let us begin by considering the scalar electric potential ϕ and the vector potential A in relation to the Lorentz force (we will use c = 1 to simplify notation)
f k = e k ϕ t A k + 1 m p j G k j ,
in which we are using Einstein’s summation convention, and where
G k j = k A j j A k .
Of course, this is a derivation made in a non-relativistic context, which implies that we are not assuming Lorentz invariance. However, the same derivation can be made in a relativistic context using four vectors and the relativistic scalar product to show that the relativistic quantum mechanical equations are also derived as an extension of the present result [21]; in fact, the relativistic derivation can be shown in a way that is somewhat easier than the derivation we will present in the following.
Now, to encompass the electromagnetic field, our axioms become
Axiom  3.
The marginal characteristic function of the phase space probability density function F ( x , p ; t ) , is defined by
Z x , δ x ; t = + F x , p ; t exp i ħ ( p i + e A i ) δ x i d p ,
where ℏ is a universal parameter with dimensions of angular momentum and where A ( x ) is the vector potential and e is the electric charge. Moreover, this characteristic function must be written as
Z x , δ x ; t = ψ x δ x 2 ; t ¯ ψ x + δ x 2 ; t ,
and should be expanded to second order in the real parameter δ x .
Axiom  4.
For an isolated system, the joint phase space probability density function related to any quantum-mechanical phenomenon obeys the Fourier-transformed Liouville equation
exp i ħ ( p i + e A i ) δ x i d F x , p ; t d t d p = 0 ,
to second order in δ x i , for all i = 1 , 2 , 3 .
Our Liouville equation becomes
F t A + p k m F x k B + e ϕ x k A k t F p k C + 1 m p j G k j F p k D = 0 .
When we apply the Fourier kernel to the Liouville Equation (30) and perform the integrations, we obtain the following results:
A = Z t i e ħ δ x k A k t Z
B = 1 m i ħ ( δ x k ) e A k x k i e ħ δ x λ A λ x k Z
C = i e ħ δ x k ϕ x k + A k t Z
For (D), we have
( D ) = e m G k j p j F p k exp i p λ + e A λ δ x λ ħ d p =                                                           = e m p k G k j p j F exp i p λ + e A λ δ x λ ħ d p D 1                                                                       e m G k j p j p k F exp i p λ + e A λ δ x λ ħ d p D 2 +                                                                                                                                           i e m ħ δ x k G k j p j F exp i p λ + e A λ δ x λ ħ d p D 3
Thus, ( D 1 ) 0 , since it represents a divergence of a probability density function, and we know that a probability function must be zero at infinity; the term ( D 2 ) 0 , since p j / p k = δ j k , and δ j k G j k = 0 , since G j k is an antisymmetric tensor. Thus, we end with
D = i e m ħ δ x k G k j i ħ ( δ x j ) e A j Z
With these results, (30) becomes, by explicitly writing the tensor G i j ,
i ħ Z t ħ 2 m 2 Z x k δ x k + i ħ e m A k x k Z + i ħ e m A k Z x k + e 2 m δ x j A j x k A k Z + i e ħ m δ x j A j x k Z ( δ x k )                                                                                                       + i ħ e m δ x j G j k Z δ x k + e 2 m δ x k G k j A j Z + e δ x k ϕ x k Z = 0 ,
where we note that the two terms having A k / t cancel out; this being the reason why the induction part of the electric potential does not appear in the Hamiltonian in the minimal coupling.
Now, we expand the characteristic function to the second order in δ x k to obtain
Z x , δ x ; t = R 2 + δ x m δ x n 4 R m n R m R n R exp i ħ δ x λ λ S
We now take (36) into (35) to receive (up to the first order in δ x )
i ħ t R 2 + k R 2 m ( k S e A k ) +                                                                                       + R 2 δ x j j t S + k S e A k 2 2 m + e ϕ ħ 2 2 m R k k R = 0
to receive the two equations ( ρ = R 2 )
t ρ + k ρ m k e A k = 0 ,
which is the continuity equation, and
t S + k S e A k 2 2 m + e ϕ ħ 2 2 m R k k R = 0 .
The last two equations are equivalent to the equation
1 2 m i ħ k e A k ( x , t ) 2 ψ ( x ; t ) + e ϕ ( x ; t ) ψ ( x ; t ) = i ħ t ψ ( x ; t ) ,
which is the Schrödinger equation for minimal coupling, as we wanted to show.

4. The Generalization of the Derivation to Superposed States

We begin with the same axioms presented in the previous section but introduce a generalization in the way the characteristic function is written. Axiom 1 now reads:
Axiom  5.
The marginal characteristic function of the phase space probability density function F ( x , p ; t ) , defined by
Z x , δ x ; t = + F x , p ; t e i p δ x / ħ d p ,
where ℏ is a universal parameter with dimensions of angular momentum; this characteristic function is such that it can be written as
Z x , δ x ; t = μ ν ψ ν * x δ x 2 ; t g ν μ ψ μ x + δ x 2 ; t ,
where g μ ν is a matrix of constant parameters, which we will henceforth call “metric”; this characteristic function must be expanded to second order in the parameter δ x .
The introduction of the metric g μ ν is necessary to correctly substitute each pure state ψ ( x , t ) in (3) with a superposed state ν c ν ψ ν ( x , t ) .
Note that this is consistent with our previous understanding of the characteristic function as being the density matrix. Indeed, in the case of a superposed state, g μ , ν is exactly the form of the density matrix in the configuration space representation.
Thus, we generalize from the characteristic function for pure states used in our previous derivations. Given the structure of the metric, we can have mixed states if g μ ν = d i a g ( p 1 , p 2 , , p n ) or superposed states for a self-adjoint matrix g μ ν .
Indeed, for a superposition given by
Ψ ( x ; t ) = μ c μ ψ μ ( x ; t ) ,
we find, using (42), that the elements of the metric matrix for this case can be written as g μ ν = c ν * c μ .
We may now repeat the steps made in the previous section to obtain the equation for the characteristic function given by (7), since this equation does not depend on how we write this function.
Thus, we write the probability amplitude vector components  ψ μ ( x , t ) as
ψ μ ( x , t ) = R μ ( x , t ) exp ( i S μ ( x , t ) / ħ ) ,
since each ψ μ refers to a pure state. Note that we are assuming one-dimensional physical systems; the generalization to more dimensions in the configuration space is straightforward.
Expanding each probability amplitude vector μ up to second order, as we have carried out in the previous section, we receive (we used symbolic computation throughout, and these results can be find in the Supplementary Material as an algebraic computation file).
Z x , δ x ; t = μ ν R μ g μ ν R ν + δ x 2 R μ x g μ ν R ν R μ g μ ν R ν x + + δ x 2 2 1 2 R μ g μ ν 2 R ν x 2 + 1 2 2 R μ x 2 g μ ν R ν R μ x g μ ν R ν x × × exp i δ x 2 ħ ( S μ + S ν ) x exp i 2 ħ ( S μ S ν ) + ( δ x ) 2 8 2 S μ x 2 2 S ν x 2 .
Note that if we are assuming mixed states, then g μ ν becomes diagonal, and we may write g μ ν = p μ δ μ ν , where all p μ are real and δ μ ν is the Kronecker delta; in this case, the expression simplifies to
Z x , δ x ; t = μ p μ ( R μ ) 2 + δ x 2 2 R μ 2 R μ x 2 R μ x 2 exp i δ x ħ S μ x ,
Equation (46) refers to mixed states, while (45) refers to superposed states in general. In the next subsection, we will derive the Schrödinger equation for mixed states and then move on to the more involved superposed states.

4.1. Mixed States Derivation

Mixed states are represented by the characteristic function
Z ( x , δ x ; t ) = μ p μ ψ μ * x δ x 2 ; t ψ μ * x + δ x 2 .
We begin with the characteristic function defined in (46) and substitute it into Equation (7) to find
μ p μ i ħ R μ t + 1 2 m R μ 2 S μ x 2 + 1 m R μ x S μ x +                                     + ( δ x ) R μ 2 x S μ t + 1 2 m S μ x 2 ħ 2 2 m R μ 2 R μ x 2 exp i δ x ħ S μ x = 0 .
Now, since the exponential terms are all linearly independent from each other, we must have, as in the previous section, the set of equation pairs (separating the real and imaginary parts in each pair in (48))
R μ t + 1 2 m R μ 2 S μ x 2 + 1 m R μ x S μ x = 1 2 R μ ρ μ x ; t t + x ρ μ x ; t p μ x ; t m = 0 ,
and
S μ x ; t t + 1 2 m S μ x ; t x 2 + V x ħ 2 2 m R μ x ; t 2 R μ x ; t x 2 = 0 ,
where
ρ μ ( x ; t ) = R μ ( x ; t ) 2 ; p μ ( x ; t ) = S μ ( x ; t ) x .
From what we have shown in Section 2, this result means that a Schrödinger equation must be valid for each probability amplitude ψ μ ( q , t ) that constitutes the mixed state.

4.2. Superposed States

Now, we must insert the characteristic function in (45) into Equation (7) and keep only terms up to the first order in δ q . If we proceed in this way, we will be left with a result that gives exactly the same expression as shown in (48) for the diagonal terms.
Each of the non-diagonal terms is multiplied by an exponential term given by
e i 8 ħ ( δ x ) 2 2 S μ x 2 S ν x 2 + 4 ( δ x ) S μ x + S ν x + 8 ( S μ S ν ) .
Because of the linearly independent exponential functions in the expression obtained for different values of μ and ν , each term multiplying these exponential terms must be satisfied independently; that is, each one of them must be equal to zero, given the results of the diagonal terms.
The terms corresponding to the diagonal elements reproduce exactly what we obtained in the previous subsection for mixed states. Thus, they suffice to show that each state of the superposition must satisfy a Schrödinger equation. Thus, we must show that the non-diagonal terms also give zero, assuming that the states of which they are composed satisfy a Schrödinger equation.
Since the non-diagonal terms are pairwise similar (apart from a complex conjugation, which renders the involved exponential functions linearly independent) and the functions R μ , R ν , S μ , S ν are real, we may treat only one term without loss of generality. We thus must focus on the substitution of
t μ ν = e i 8 ħ ( δ x ) 2 2 S μ x 2 S ν x 2 + 4 ( δ x ) S μ x + S ν x + 8 ( S μ S ν ) × × R μ R ν + δ x 2 R ν R μ x R μ R ν x + ( δ x ) 2 8 R ν 2 R μ x 2 2 R ν x R μ x + R μ 2 R ν x 2
into (7). Let us write
K α = S α t + 1 2 m S α x 2 + V ( x ) ħ 2 2 m R α 2 R α x 2 0 ,
which we already know is equivalent to the Schrödinger equation for ψ α ( x , t ) = R α ( x , t ) exp ( i S α ( x , t ) / ħ ) .
If we make that substitution and collect only terms up to the first order in δ x , we obtain the real part of the result as (see the Supplemental Material)
0 = R μ R ν K μ K ν + 1 2 δ x x R μ R ν K μ + K ν δ x R μ R ν x K μ δ x R ν R μ x K ν
while, for the imaginary part, if we write
T α = R α t + 1 2 m R α 2 S α x 2 + 1 m R α x S α x = 1 2 R α ρ α t + x ρ α m S α x 0 ,
we obtain (see the Supplemental Material)
0 = ( R μ T ν + R ν T μ ) δ x 2 R μ x T ν R ν x T μ R μ T ν x + R ν T μ x
It should be noted that all terms of the real part (because of K μ and K ν ) reduce to some form of Equation (50) for the functions R μ ( x , t ) , S μ ( x , t ) and R ν ( x , t ) , S ν ( x , t ) . In the same fashion, all terms of the imaginary part (because of T μ and T ν ) reduce to some form of the continuity Equation (49) for the functions R μ ( x , t ) , S μ ( x , t ) and R ν ( x , t ) , S ν ( x , t ) .
Given the results for the diagonal elements, we know that K μ , K ν , T μ , T ν = 0 and Equations (55) and (57) are equal to zero. This means that the characteristic function built from a superposition of states also allows us to derive the Schrödinger equation (for each component of the superposition), despite this being a much more involved calculation.

5. Conclusions

Adopting an axiomatic approach to some theory entails the burden of demonstrating that the axioms withstand all possible generalizations necessary to derive the theorems of the theory in broader contexts, the Schrödinger equation being one of these theorems.
Thus, we have already shown [16,19] that the axioms allow us to directly derive the Schrödinger equation in any coordinate system; that our axioms are equivalent to Feynman’s path integral method (now represented in phase space); that their relativistic extension gives the relativistic counterparts of the Schrödinger equation [21]; and that they can be extended to encompass non-Hamiltonian dissipative systems [30].
In this paper, we have shown that the usual minimal coupling approach to quantizing physical systems encompassing electromagnetic fields can be directly obtained, along with the generalization of the results for mixed and superposed states. Unlike the generalization for the usual Schrödinger equation in many dimensions, which is quite direct, all those other derivations are algebraically involved, as shown in the present paper for two of them (for the derivation of the Schrödinger equation in generalized coordinates, see [19]).
It should be noted that it was also possible to derive a dissipative Schrödinger equation, represented by the Caldirola–Kanai equation [31,32,33], from a slight modification of the axioms [30], since the present derivation method can be easily extended to non-Hamiltonian contexts.
These extensions provide additional support for the consistency and applicability of the proposed axiomatic framework. Note that we do not need to show that the axioms are unique. It is only necessary that they “do the job”, that is, they actually allow us to derive the Schrödinger equation. Note that it was mentioned that this derivation was already shown to be equivalent to Feynman’s path integral approach, and also many stochastic derivations of the Schrödinger equation. Another derivation of the Schrödinger equation from a different set of axioms was already made [23]. The important fact here is that all these derivations were shown to be equivalent [22,23]. We are not assuming that these axioms form a minimal set, although it is important to notice that there are only two axioms, which means that it is possible that this is, indeed, a minimal set, even if this is not relevant to judge the adequacy of the derivation. What is being presented here is a formal derivation (and its extensions) that encompasses, as equivalent, many other derivations already presented in the literature over the course of decades. In this sense, this derivation also performs a great synthesis of the whole field of derivations of the Schrödinger equation.
Thus, as mentioned in the introduction of this paper, in matters of interpretation, if we keep ourselves within the boundaries established by the axioms [25], we should be concerned about a number of semantic constructs that do not appear in these axioms but are still present in many interpretations. Active observers, reduction of the wave packet, duality, complementarity, many worlds, consciousness, and wholeness, to cite but a few, should be considered in this context as helpful historical attempts to understand a field that has shown itself impervious to an interpretation devoid of extraneous semantic constructs. However, those semantic constructs that the axiomatic derivation does not introduce must be discarded at some point in the theory’s development. This interpretation is presented in [16] and is epistemologically justified in [25].
As a final note, we stress that all the previous developments were made within the context of the Schrödinger equation. It is not easy, if feasible at all, to think of extensions of the approach to include quantum field theory.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/math14111961/s1, (I) Derivation of the Schrödinger equation for superposed states: This derivation is rather long, but its development gives much confidence in the adequacy of the axioms of the approach. To make it easy for the reader to follow the derivation process, which is algebraically cumbersome, we provide an algebraic computer program using Maple 17®. To access the program, click the link https://www.dropbox.com/scl/fi/y1gkxzle4ynhk85aeqnwc/Formal_developments.mw?rlkey=b8gclf3o8qplvbx4ocmt3thhl&st=sf3zkoha&dl=0 (accessed on 1 May 2026).

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Bohm, D. A Suggested Interpretation of the Quantum Theory in Terms of “Hidden” Variables. I. Phys. Rev. 1952, 85, 166–179. [Google Scholar] [CrossRef]
  2. Bohm, D.; Hiley, B. The Undivided Universe, 1st ed.; Routledge: London, UK, 1983. [Google Scholar]
  3. Takabayasi, T. Remarks on the hydrodynamical representation and certain generalization of quantum mechanies. Il Nuovo Cimento (1955–1965) 1955, 1, 257–259. [Google Scholar] [CrossRef]
  4. Ballentine, L.E. The Statistical Interpretation of Quantum Mechanics. Rev. Mod. Phys. 1970, 42, 358–381. [Google Scholar] [CrossRef]
  5. Everett, H. “Relative State” Formulation of Quantum Mechanics. Rev. Mod. Phys. 1957, 29, 454–462. [Google Scholar] [CrossRef]
  6. Vaidman, L. Why the many-worlds interpretation? Quantum Rep. 2022, 4, 264–271. [Google Scholar] [CrossRef]
  7. DeWitt, B.S.; Graham, N. The Many-Worlds Interpretation of Quantum Mechanics: A Fundamental Exposition by Hugh Everett, III, with Papers by JA Wheeler, BS DeWitt, LN Cooper and D. Van Vechten, and N. Graham; Princeton University Press: Princeton, NJ, USA, 2025. [Google Scholar]
  8. Griffiths, R.B. Consistent histories and the interpretation of quantum mechanics. J. Stat. Phys. 1984, 36, 219–272. [Google Scholar] [CrossRef]
  9. Rocha, G.R.; Rickles, D.; Boge, F.J. A brief historical perspective on the consistent histories interpretation of quantum mechanics. arXiv 2021, arXiv:2103.05280. [Google Scholar] [CrossRef]
  10. Fuchs, C.A.; Schack, R. Quantum-Bayesian coherence. Rev. Mod. Phys. 2013, 85, 1693–1715. [Google Scholar] [CrossRef]
  11. Benavoli, A.; Facchini, A.; Zaffalon, M. Quantum mechanics: The Bayesian theory generalized to the space of Hermitian matrices. Phys. Rev. A 2016, 94, 042106. [Google Scholar] [CrossRef][Green Version]
  12. Heisenberg, W. The Copenhagen interpretation of quantum theory. Phys. Philos. 1958, 19, 44–58. [Google Scholar]
  13. Faye, J. Copenhagen Interpretation of Quantum Mechanics. In The Stanford Encyclopedia of Philosophy, Summer 2024 ed.; Zalta, E.N., Nodelman, U., Eds.; Metaphysics Research Lab, Stanford University: Stanford, CA, USA, 2024. [Google Scholar]
  14. De la Peña, L.; Cetto, A.M. Does quantum mechanics accept a stochastic support? Found. Phys. 1982, 12, 1017–1037. [Google Scholar] [CrossRef]
  15. De la Peña, L.; Cetto, A.M.; Valdés-Hernández, A. The emerging quantum. In The Physics Behind Quantum Mechanics; Springer International Publishing: Cham, Switzerland, 2015. [Google Scholar]
  16. Silva Filho, O.L.; Ferreira, M. A Bird’s-Eye View on a New Stochastic Interpretation of Quantum Mechanics. Mathematics 2025, 13, 3571. [Google Scholar] [CrossRef]
  17. Gruber, G.R. Quantization in generalized coordinates. Found. Phys. 1971, 1, 227–234. [Google Scholar] [CrossRef]
  18. Gruber, G.R. Quantization in generalized coordinates—II. Int. J. Theor. Phys. 1972, 6, 31–35. [Google Scholar] [CrossRef]
  19. Olavo, L. Foundations of quantum mechanics: Non-relativistic theory. Phys. A Stat. Mech. Its Appl. 1999, 262, 197–214. [Google Scholar] [CrossRef]
  20. Silva Filho, O.L.; Ferreira, M. Derivation of the Schrödinger equation I: The characteristic function. Rev. Bras. Ensino Fis. 2024, 46, e20240183. [Google Scholar] [CrossRef]
  21. Olavo, L.; Ferreira, M. Derivation of the Schrödinger equation V: Relativistic Equations of Quantum Mechanics and its epistemological impacts on teaching. Rev. Bras. Ensino Fis. 2025, 47, e20240426. [Google Scholar] [CrossRef]
  22. da Silva Filho, O.L.; Ferreira, M. Derivation of the Schrödinger equation IV: Stochastic and the Langevin Equations for Quantum Mechanics. Rev. Bras. Ensino Fis. 2024, 46, e20240234. [Google Scholar] [CrossRef]
  23. Olavo, L. Foundations of quantum mechanics (II): Equilibrium, Bohr–Sommerfeld rules and duality. Phys. A Stat. Mech. Its Appl. 1999, 271, 260–302. [Google Scholar] [CrossRef]
  24. Silva Filho, O.L.d.; Ferreira, M. Derivation of the Schrödinger equation III: The Central Limit Theorem. Rev. Bras. Ensino Fis. 2024, 46, e20240228. [Google Scholar] [CrossRef]
  25. Silva Filho, O.L.; Simon, S.J.; Ferreira, M. Semiotics and Epistemology of Physics: Reflections on Language and the Interpretation of Quantum Mechanics. Mathematics 2026, 14, 550. [Google Scholar] [CrossRef]
  26. Olavo, L. Foundations of quantum mechanics: The connection between QM and the central limit theorem. Found. Phys. 2004, 34, 891–935. [Google Scholar] [CrossRef]
  27. Bohm, D.; Hiley, B.J. The Undivided Universe: An Ontological Interpretation of Quantum Theory; Routledge: London, UK, 2006. [Google Scholar]
  28. Goldstein, H.; Poole, C.P.; Safko, J. Classical Mechanics; Addison-Wesley: Reading, MA, USA, 1950; Volume 2. [Google Scholar]
  29. Aharonov, Y.; Bohm, D. Significance of electromagnetic potentials in the quantum theory. Phys. Rev. 1959, 115, 485. [Google Scholar] [CrossRef]
  30. Gonçalves, L.; Olavo, L. Foundations of Quantum Mechanics: Derivation of a dissipative Schrödinger equation from first principles. Ann. Phys. 2017, 380, 59–70. [Google Scholar] [CrossRef]
  31. Caldirola, P. Quantum theory of nonconservative systems. Il Nuovo Cimento B (1971–1996) 1983, 77, 241–262. [Google Scholar] [CrossRef]
  32. Brittin, W.E. A note on the quantization of dissipative systems. Phys. Rev. 1950, 77, 396. [Google Scholar] [CrossRef]
  33. Greenberger, D.M. A new approach to the problem of dissipation in quantum mechanics. J. Math. Phys. 1979, 20, 771–780. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Silva Filho, O.L. Foundations of Quantum Mechanics: Generalizations of the Mathematical Axiomatic Derivation of the Schrödinger Equation. Mathematics 2026, 14, 1961. https://doi.org/10.3390/math14111961

AMA Style

Silva Filho OL. Foundations of Quantum Mechanics: Generalizations of the Mathematical Axiomatic Derivation of the Schrödinger Equation. Mathematics. 2026; 14(11):1961. https://doi.org/10.3390/math14111961

Chicago/Turabian Style

Silva Filho, Olavo L. 2026. "Foundations of Quantum Mechanics: Generalizations of the Mathematical Axiomatic Derivation of the Schrödinger Equation" Mathematics 14, no. 11: 1961. https://doi.org/10.3390/math14111961

APA Style

Silva Filho, O. L. (2026). Foundations of Quantum Mechanics: Generalizations of the Mathematical Axiomatic Derivation of the Schrödinger Equation. Mathematics, 14(11), 1961. https://doi.org/10.3390/math14111961

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop