Unveiling the Role of Vector Potential in the Aharonov–Bohm Effect

Wakamatsu, Masashi

doi:10.3390/sym17060935

by

Masashi Wakamatsu

KEK Theory Center, Institute of Particle and Nuclear Studies, High Energy Accelerator Research Organization (KEK), Oho 1-1, Tsukuba 305-0801, Ibaraki, Japan

Symmetry 2025, 17(6), 935; https://doi.org/10.3390/sym17060935

Submission received: 1 May 2025 / Revised: 30 May 2025 / Accepted: 5 June 2025 / Published: 12 June 2025

(This article belongs to the Special Issue Feature Papers in 'Physics' Section 2025)

Abstract

The most popular interpretation of the Aharonov–Bohm (AB) effect is that the electromagnetic potential locally affects the complex phase of a charged particle’s wave function in the magnetic-field-free region. However, since the vector potential is a gauge-variant quantity, multiple researchers suspect that it is just a convenient tool for calculating the force field. This motivates them to explain the AB effect without using the vector potential, which inevitably leads to some sort of non-locality. This frustrating situation is succinctly summarized by the statement of Aharonov et al. that the AB effect may be due to a local gauge potential or due to non-local gauge-invariant fields. In the present paper, we give several convincing arguments which support the viewpoint that the vector potential is not just a convenient mathematical tool with little physical substance. Despite its gauge arbitrariness, the vector potential certainly contains a gauge-invariant piece, which by itself explains the observed AB phase shift. Importantly, this component is essentially unique and cannot be eliminated by any regular gauge transformation. To complete the discussion, we also discuss the role of the remaining gauge arbitrariness still contained in the entire vector potential.

Keywords: Aharonov–Bohm effect; role of vector potential; gauge ambiguity; transverse–longitudinal decomposition

1. Introduction

Since its first theoretical prediction [1,2], the Aharonov–Bohm effect has continued to provoke innumerable debates concerning the deep foundations of modern physics (for reviews, see, for example, [3,4,5,6]). A central question is whether the electromagnetic potential is a physical entity or just a convenient mathematical tool [7,8,9]. Without doubt, the most popular interpretation of the AB effect is that the magnetic vector potential locally affects the complex phase of the electron wave function, thereby causing a change in phase that can be verified through interference experiments [10,11]. Nonetheless, many researchers are not completely satisfied with the vector potential explanation of the AB effect [12,13,14,15]. This is because the vector potential is a gauge-variant quantity with inherent arbitrariness. For this reason, they believe that the vector potential is just a convenient mathematical tool for calculating the electromagnetic force field, as is in fact the case in classical electromagnetism. This motivates them to look for explanations which do not use the gauge-dependent vector potential [12,13,14]. One of the most influential studies along this line is the work by Vaidman [16,17]. He proposed an explanation of the AB effect via the force between the solenoid current and the moving electron rather than via the electromagnetic potential. Later, Vaidman’s paper was criticized by Aharonov, Cohen, and Rohrlich [18]. Through six thought experiments, these authors concluded that the attempt to dispense with scalar and vector potentials is at least incompatible with the attempt to interpret the AB effect as a local effect. They also pointed out a potential problem inherent in the local force explanation of the AB effect. According to this force interpretation, the change in the phase of the electron’s wave function implies a change in the electron’s velocity, which appears to be incompatible with the so-called dispersionless nature of the AB effect [19,20], i.e., the fact that the observed AB phase shift is independent of the velocity of the electron.

Partially motivated by Vaidman’s work, several authors have focused on the interaction energy between the solenoid current and the moving electron rather than the force between them [21,22,23,24,25]. (See also a related older work by Boyer [26]). They postulate that the change in the phase of the electron wave function along a path is proportional to the change in the above interaction energy along the same path. Then, by explicitly evaluating the change in the interaction energy along a closed path of the electron, they showed that it reproduces the standard answer for the AB phase shift.

These authors make a further claim whose validity is highly nontrivial. According to them, since the change in the above interaction energy along the path of the electron is a gauge-invariant quantity, the partial AB phase shift along a non-closed path is also a gauge-invariant quantity, which in turn implies that it can in principle be observed. However, in a recent paper [27], based on a self-contained quantum mechanical treatment of the combined system of a solenoid, an electron, and the quantized electromagnetic field, we showed that there is no evidence to support their central claim, i.e., the assumed proportionality between the interaction energy of the solenoid current and the moving electron and the corresponding partial AB phase shift. To summarize the situation to date, we feel that none of the attempts to dispense with the vector potential in the explanation of the AB effect has been completely successful. On the other hand, there already exist several works which support the physically reasonable nature of the vector potential interpretation [28,29,30]. The only drawback of this theoretical approach is the fact that the vector potential is a gauge-dependent quantity, which makes it difficult to completely dispel some researchers’ suspicion regarding its physical reality. The purpose of the present paper is to address and eliminate these doubts in the most convincing manner.

The remainder of this paper is structured as follows. First, in Section 2, we analyze the nature of the vector potential generated by an infinitely long solenoid with the specific intention of confirming its physically substantial nature. It is demonstrated that the vector potential generated by an infinitely long solenoid can be uniquely decomposed into a transverse component and a longitudinal component, provided that physically unacceptable multi-valued gauge transformations are excluded. In Section 3, this decomposition is used to clarify the role of the vector potential in the AB effect. To help our understanding of the significance of the discussion in Section 3, we provide in Section 4 a brief introduction to some past works which tried to explain the AB effect without using the standard vector potential interpretation. Also discussed in that section is the highly nontrivial claim made in several recent studies that the partial AB phase shift corresponding to a non-closed path of the electron can in principle be observed, a claim accompanied by proposals of several concrete measurement settings for verifying it [22,23,24,25]. Section 5 summarizes the essential points of our vector-potential-based interpretation of the AB effect as refined in the present paper.

2. On the Vector Potential Generated by an Infinitely Long Solenoid

In order to discuss the essence of the AB effect in the simplest possible form, it is customary to consider an idealized setting of an infinitely long solenoid with radius $R$ directed in the $z$-direction. The stationary and uniform surface current distribution of the solenoid is represented as

$$\mathbf{j}_{\mathrm{ext}}(\mathbf{x}) = B \, \delta(\rho - R) \, \mathbf{e}_{\phi} . \tag{1}$$

Here, we use the cylindrical coordinates $\mathbf{x} = (\rho, \phi, z)$ with $\rho = \sqrt{x^{2} + y^{2}}$ and $\phi = \arctan(y/x)$. Our aim is to find the vector potential generated by the above solenoid current distribution. There are various routes to reach this goal, but probably the most instructive way is to start with the familiar Biot–Savart law represented as (note that we are basically handling magnetostatics)

$$\mathbf{B}(\mathbf{x}) = \frac{1}{4\pi} \nabla \times \int \frac{\mathbf{j}_{\mathrm{ext}}(\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|} \, d^{3}x' . \tag{2}$$

For simplicity’s sake, we use Heaviside–Lorentz units combined with natural units, $\hbar = c = 1$. The importance of this formula is that the physical quantities contained in it (these are the external current distribution $\mathbf{j}_{\mathrm{ext}}(\mathbf{x})$ and the generated magnetic field $\mathbf{B}(\mathbf{x})$) are all gauge-invariant quantities. The form of the Biot–Savart law naturally leads us to introduce the vector field $\mathbf{A}(\mathbf{x})$ by the relation

$$\mathbf{B}(\mathbf{x}) = \nabla \times \mathbf{A}(\mathbf{x}) . \tag{3}$$

This quantity $\mathbf{A}(\mathbf{x})$ is nothing but what we call the (magnetic) vector potential. Naturally, the vector potential introduced in the above way is not unique. Its general form is given as

$$\mathbf{A}(\mathbf{x}) = \mathbf{A}^{(S)}(\mathbf{x}) + \nabla \chi(\mathbf{x}), \tag{4}$$

with the definition

$$\mathbf{A}^{(S)}(\mathbf{x}) \equiv \frac{1}{4\pi} \int \frac{\mathbf{j}_{\mathrm{ext}}(\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|} \, d^{3}x' , \tag{5}$$

while $\chi(\mathbf{x})$ is an arbitrary scalar function [31]. The superscript $(S)$ on $\mathbf{A}^{(S)}$ designates that it is the part of the vector potential $\mathbf{A}(\mathbf{x})$ which is uniquely determined by the solenoid current distribution $\mathbf{j}_{\mathrm{ext}}(\mathbf{x})$, provided that the relevant spatial integral in (5) converges (naively, this integral diverges, but it is known to converge when an appropriate limiting procedure is used [31]). The arbitrary nature of the part $\nabla \chi(\mathbf{x})$ is interpreted as the gauge degrees of freedom of the vector potential. This is of course a well-known story, but we point out that there exists a physically very important constraint on the scalar function $\chi(\mathbf{x})$ given by

$$\nabla \times \nabla \chi(\mathbf{x}) = 0, \tag{6}$$

which is often forgotten when the gauge ambiguity issue of the vector potential in the AB effect is discussed. As we shall soon argue in more detail, if this condition is not satisfied, the part $\nabla \chi(\mathbf{x})$ of $\mathbf{A}(\mathbf{x})$ would generate a new magnetic field distribution, which necessarily alters the original distribution $\mathbf{B}(\mathbf{x})$.

At the moment, let us go ahead by assuming that the condition (6) is satisfied. Then, if it is combined with the easily verified relation $\nabla \cdot \mathbf{A}^{(S)}(\mathbf{x}) = 0$, Equation (4) just gives the transverse–longitudinal decomposition of the vector potential [28,29,30]

$$\mathbf{A}(\mathbf{x}) = \mathbf{A}_{\perp}(\mathbf{x}) + \mathbf{A}_{\parallel}(\mathbf{x}), \tag{7}$$

with the following identification

$$\mathbf{A}_{\perp}(\mathbf{x}) \equiv \mathbf{A}^{(S)}(\mathbf{x}), \qquad \mathbf{A}_{\parallel}(\mathbf{x}) \equiv \nabla \chi(\mathbf{x}) . \tag{8}$$

In fact, these two components certainly satisfy the transverse condition and the longitudinal condition, respectively:

$$\nabla \cdot \mathbf{A}_{\perp}(\mathbf{x}) = 0, \qquad \nabla \times \mathbf{A}_{\parallel}(\mathbf{x}) = 0 . \tag{9}$$

Note that the derivation above indicates that, in our setting of an infinitely long solenoid, the transverse–longitudinal decomposition of the vector potential is unique [28,29,30]. (Note that this is equivalent to saying that the transverse part of the vector potential is unique. To avoid misunderstanding, however, we recall in Appendix A that there is one familiar physical system in which the transverse–longitudinal decomposition of the vector potential is not unique at all).
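The transverse condition in Equation (9) and the relation $\mathbf{B} = \nabla \times \mathbf{A}^{(S)}$ can be cross-checked numerically. The following is only a minimal sketch and not part of the original analysis: it uses the well-known closed form of $\mathbf{A}^{(S)}$ for an infinitely long solenoid (azimuthal, proportional to $\rho/R^{2}$ inside and $1/\rho$ outside), and the values $\Phi = 1$, $R = 1$, the sample points, and the helper names `a_s` and `div_and_curl_z` are assumptions chosen purely for illustration.

```python
import math

PHI = 1.0  # total flux Phi (illustrative value, not taken from the paper)
R = 1.0    # solenoid radius (illustrative value)


def a_s(x, y):
    """Cartesian components (A_x, A_y) of A^(S) for the infinitely long solenoid."""
    rho2 = x * x + y * y
    # A_phi / rho equals Phi/(2 pi R^2) inside and Phi/(2 pi rho^2) outside.
    k = PHI / (2.0 * math.pi * (R * R if rho2 < R * R else rho2))
    # e_phi = (-y, x) / rho, hence A = k * (-y, x).
    return (-k * y, k * x)


def div_and_curl_z(field, x, y, h=1e-5):
    """Central-difference estimates of div A and (curl A)_z at the point (x, y)."""
    dax_dx = (field(x + h, y)[0] - field(x - h, y)[0]) / (2.0 * h)
    dax_dy = (field(x, y + h)[0] - field(x, y - h)[0]) / (2.0 * h)
    day_dx = (field(x + h, y)[1] - field(x - h, y)[1]) / (2.0 * h)
    day_dy = (field(x, y + h)[1] - field(x, y - h)[1]) / (2.0 * h)
    return dax_dx + day_dy, day_dx - dax_dy


# One sample point inside the solenoid (rho = 0.5) and one outside (rho = 2.5).
div_in, curl_in = div_and_curl_z(a_s, 0.3, 0.4)
div_out, curl_out = div_and_curl_z(a_s, 1.5, 2.0)

print("inside :", div_in, curl_in, "expected curl:", PHI / (math.pi * R * R))
print("outside:", div_out, curl_out)
```

Within finite-difference accuracy, the divergence vanishes at both points, while the $z$-component of the curl equals $\Phi/(\pi R^{2})$ inside and zero outside, as expected for the uniform field of the solenoid.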

Unfortunately, the uniqueness of the transverse–longitudinal decomposition of the vector potential has often been questioned, probably because of the existence of the following gauge transformation [32]:

$$\mathbf{A}'(\mathbf{x}) = \mathbf{A}(\mathbf{x}) + \nabla \chi^{\mathrm{sing}}(\mathbf{x}), \tag{10}$$

which is specified by the multi-valued gauge function

$$\chi^{\mathrm{sing}}(\mathbf{x}) = - \frac{1}{2\pi} \Phi \, \phi = - \frac{1}{2\pi} \Phi \arctan\left(\frac{y}{x}\right) . \tag{11}$$

Here, $\Phi = \pi R^{2} B$ is the total magnetic flux penetrating the solenoid. As can be easily verified, the curl of $\nabla \chi^{\mathrm{sing}}(\mathbf{x})$ does not vanish, i.e., $\nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x}) \neq 0$, whereas it does satisfy the transverse condition $\nabla \cdot \nabla \chi^{\mathrm{sing}}(\mathbf{x}) = 0$. At first glance, this observation appears to show that the transverse–longitudinal decomposition, or equivalently, the identification of $\mathbf{A}^{(S)}(\mathbf{x})$ as the transverse component, is not unique at all, once a multi-valued gauge transformation such as the one above is permitted.

We, however, recall that our discussion of the relation (4) based on the Biot–Savart law already indicates that the scalar function $\chi(\mathbf{x})$ in this equation must satisfy the rotation-free condition $\nabla \times \nabla \chi(\mathbf{x}) = 0$. Otherwise, the term $\nabla \chi(\mathbf{x})$ in $\mathbf{A}(\mathbf{x})$ inevitably alters the magnetic field distribution of the system. Let us look into this state of affairs in a more concrete manner. As is well-known, the vector potential $\mathbf{A}^{(S)}(\mathbf{x})$ obtained from the integral (5) is given by (see page 208 of [31], for example)

$$\mathbf{A}^{(S)}(\mathbf{x}) = \begin{cases} \dfrac{\Phi}{2\pi} \dfrac{\rho}{R^{2}} \, \mathbf{e}_{\phi} & (\rho < R), \\[1ex] \dfrac{\Phi}{2\pi} \dfrac{1}{\rho} \, \mathbf{e}_{\phi} & (\rho \geq R) . \end{cases} \tag{12}$$

It is an elementary exercise to obtain the explicit form of the gauge-transformed vector potential $\mathbf{A}'(\mathbf{x})$ given by (10). First, for $\rho \geq R$, i.e., in the outer region of the solenoid, we get

$$\nabla \chi^{\mathrm{sing}}(\mathbf{x}) = - \frac{\Phi}{2\pi} \frac{1}{\rho} \, \mathbf{e}_{\phi}, \tag{13}$$

which in turn gives

$$\mathbf{A}'(\mathbf{x}) = \frac{\Phi}{2\pi} \frac{1}{\rho} \, \mathbf{e}_{\phi} - \frac{\Phi}{2\pi} \frac{1}{\rho} \, \mathbf{e}_{\phi} = 0 \quad (\text{for } \rho \geq R) . \tag{14}$$

This means that, by the above multi-valued gauge transformation, the vector potential outside the solenoid can be completely eliminated. At first glance, this appears to show the expulsion of the AB effect, as advocated by Bocchieri and Loinger many years ago [32]. However, as was shown later by several researchers [33,34], the AB effect continues to exist even after such a multi-valued gauge transformation if one properly takes account of the change in the $2\pi$-periodic boundary condition of the electron wave functions (see also [35] concerning the general consideration of multi-valued wave functions).
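The cancellation of the vector potential outside the solenoid, together with the multi-valued character of the gauge function that produces it, can be illustrated numerically. The snippet below is a rough sketch under stated assumptions, not part of the original analysis: the gauge function $-(\Phi/2\pi)\,\phi$ is represented on the principal branch by `math.atan2`, its gradient is estimated by central differences away from the branch cut, and $\Phi = 1$, $R = 1$, the sample point, and all helper names are illustrative choices.

```python
import math

PHI = 1.0  # total flux Phi (illustrative value)
R = 1.0    # solenoid radius (illustrative value)


def a_s(x, y):
    """Cartesian components of A^(S): rho/R^2 profile inside, 1/rho outside."""
    rho2 = x * x + y * y
    k = PHI / (2.0 * math.pi * (R * R if rho2 < R * R else rho2))
    return (-k * y, k * x)


def chi_sing(x, y):
    """Principal branch of the multi-valued gauge function -(Phi / 2 pi) * phi."""
    return -PHI / (2.0 * math.pi) * math.atan2(y, x)


def grad(f, x, y, h=1e-6):
    """Central-difference gradient of a scalar function f at (x, y)."""
    return ((f(x + h, y) - f(x - h, y)) / (2.0 * h),
            (f(x, y + h) - f(x, y - h)) / (2.0 * h))


def a_prime(x, y):
    """Gauge-transformed potential A' = A^(S) + grad(chi_sing)."""
    ax, ay = a_s(x, y)
    gx, gy = grad(chi_sing, x, y)
    return (ax + gx, ay + gy)


# Outside the solenoid (rho = 2.5) the transformed potential cancels.
ax_out, ay_out = a_prime(1.5, 2.0)
print("A' outside:", ax_out, ay_out)

# The price: chi_sing is not single-valued. Across the negative x-axis it
# jumps by -PHI (value just above the cut minus value just below it).
jump = chi_sing(-1.0, 1e-12) - chi_sing(-1.0, -1e-12)
print("jump of chi_sing across the cut:", jump)
```

The vanishing of $\mathbf{A}'$ outside and the jump of the gauge function by $-\Phi$ are two sides of the same coin: the discontinuity is what has to be compensated by the modified boundary condition on the electron wave function.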

Let us next examine what happens with the transformed vector potential inside the solenoid. First, in the domain excluding the origin ($\rho = 0$), we find that

$$\nabla \chi^{\mathrm{sing}}(\mathbf{x}) = - \frac{\Phi}{2\pi} \frac{1}{\rho} \, \mathbf{e}_{\phi} \quad (\rho \neq 0), \tag{15}$$

$$\mathbf{A}'(\mathbf{x}) = \frac{\Phi}{2\pi} \left( \frac{\rho}{R^{2}} - \frac{1}{\rho} \right) \mathbf{e}_{\phi} \quad (\rho \neq 0), \tag{16}$$

$$\nabla \times \mathbf{A}'(\mathbf{x}) = \frac{\Phi}{\pi R^{2}} \, \mathbf{e}_{z} \quad (\rho \neq 0) . \tag{17}$$

The form of $\nabla \chi^{\mathrm{sing}}(\mathbf{x})$ above indicates that $\nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x})$ has a singularity at the origin. To confirm this, let us consider a circle $C_{\varepsilon}$ around the origin with an infinitesimally small radius $\varepsilon \, (\to 0^{+})$. The area surrounded by $C_{\varepsilon}$ is denoted as $S_{\varepsilon}$. If we evaluate the surface integral of $\nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x})$ over $S_{\varepsilon}$ with the use of the Stokes theorem, we obtain

$$\int_{S_{\varepsilon}} \left( \nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x}) \right) \cdot d\mathbf{S} = \oint_{C_{\varepsilon}} \nabla \chi^{\mathrm{sing}}(\mathbf{x}) \cdot d\mathbf{x} = - \frac{\Phi}{2\pi} \int_{0}^{2\pi} \frac{1}{\varepsilon} \, \varepsilon \, d\phi = - \Phi . \tag{18}$$

This indicates that, in the vicinity of the origin, the following relation holds:

$$\nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x}) = - \frac{\Phi}{2\pi} \frac{\delta(\rho)}{\rho} \, \mathbf{e}_{z} . \tag{19}$$

In fact, it can be verified from the following manipulation:

$$\int_{S_{\varepsilon}} \left( \nabla \times \nabla \chi^{\mathrm{sing}}(\mathbf{x}) \right) \cdot d\mathbf{S} = - \frac{\Phi}{2\pi} \int_{0}^{2\pi} d\phi \int_{0}^{\varepsilon} \frac{\delta(\rho)}{\rho} \, \rho \, d\rho = - \Phi . \tag{20}$$

To sum up, inside the solenoid, we find that

$$\mathbf{B}'(\mathbf{x}) = \nabla \times \mathbf{A}'(\mathbf{x}) = \frac{\Phi}{\pi R^{2}} \, \mathbf{e}_{z} - \frac{\Phi}{2\pi} \frac{\delta(\rho)}{\rho} \, \mathbf{e}_{z} \equiv \mathbf{B}(\mathbf{x}) + \mathbf{B}^{\mathrm{string}}(\mathbf{x}) \quad (\rho < R) . \tag{21}$$

The first term of the above equation is nothing but the original uniform magnetic field $\mathbf{B}(\mathbf{x})$ inside the solenoid. On the other hand, the second term shows that the multi-valued gauge transformation generates a string-like magnetic field in the direction of the negative $z$-axis, which is opposite to the direction of the original uniform magnetic field. In this way, as pointed out before, we confirm that the multi-valued gauge transformation specified by (10) and (11) generates an extra magnetic field distribution which is originally absent. If we evaluate the total flux of $\mathbf{B}'(\mathbf{x})$ penetrating the solenoid (with the radius $R$), we obtain

$$\int_{S(\rho \leq R)} \mathbf{B}'(\mathbf{x}) \cdot \mathbf{n} \, dS = \int_{S(\rho \leq R)} \frac{\Phi}{\pi R^{2}} \, \mathbf{e}_{z} \cdot \mathbf{e}_{z} \, dS - \int_{S(\rho \leq R)} \frac{\Phi}{2\pi} \frac{\delta(\rho)}{\rho} \, \mathbf{e}_{z} \cdot \mathbf{e}_{z} \, dS = \Phi - \Phi = 0, \tag{22}$$

which means that the new net magnetic flux penetrating the solenoid becomes precisely zero. Undoubtedly, this is the reason why the gauge-transformed vector potential $\mathbf{A}'(\mathbf{x})$ entirely vanishes outside the solenoid.
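The flux bookkeeping above can be checked through the circulation of the transformed potential, which by the Stokes theorem equals the flux enclosed by the integration circle. The following minimal sketch is an illustration only and not part of the original analysis: it uses the closed form $\mathbf{A}' = \mathbf{A}^{(S)} - (\Phi/2\pi\rho)\,\mathbf{e}_{\phi}$ away from the origin, a midpoint rule on circles of radius $r$, and the assumed values $\Phi = 1$, $R = 1$; the names `a_prime` and `circulation` are hypothetical helpers.

```python
import math

PHI = 1.0  # total flux Phi (illustrative value)
R = 1.0    # solenoid radius (illustrative value)


def a_prime(x, y):
    """Transformed potential A' = A^(S) - Phi/(2 pi rho) e_phi for rho != 0."""
    rho2 = x * x + y * y
    k_s = PHI / (2.0 * math.pi * (R * R if rho2 < R * R else rho2))  # from A^(S)
    k_g = -PHI / (2.0 * math.pi * rho2)                             # from grad chi_sing
    k = k_s + k_g
    return (-k * y, k * x)


def circulation(field, r, n=2000):
    """Midpoint-rule line integral of field . dx around a circle of radius r."""
    dphi = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        phi = (i + 0.5) * dphi
        s, c = math.sin(phi), math.cos(phi)
        fx, fy = field(r * c, r * s)
        # dx = (-sin(phi), cos(phi)) * r * dphi along the circle
        total += (-fx * s + fy * c) * r * dphi
    return total


circ_tiny = circulation(a_prime, 1e-3)  # only the string flux is enclosed
circ_half = circulation(a_prime, 0.5)   # string flux plus part of the uniform flux
circ_out = circulation(a_prime, 2.0)    # whole solenoid enclosed

print(circ_tiny, circ_half, circ_out)
```

For $r < R$ the circulation is $\Phi\,(r^{2}/R^{2} - 1)$: it tends to $-\Phi$ as $r \to 0$, where only the string contributes, and it reaches zero at $r = R$, where the uniform flux $\Phi$ exactly compensates the string flux. For $r \geq R$ it stays at zero, in line with the vanishing of $\mathbf{A}'$ outside the solenoid.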

The unphysical nature of such a singular gauge transformation can also be demonstrated if we evaluate the curl of $\mathbf{B}'(\mathbf{x}) \equiv \mathbf{B}(\mathbf{x}) + \mathbf{B}^{\mathrm{string}}(\mathbf{x})$. We find that

$$\nabla \times \mathbf{B}(\mathbf{x}) = B \, \delta(\rho - R) \, \mathbf{e}_{\phi} = \mathbf{j}_{\mathrm{ext}}(\mathbf{x}), \tag{23}$$

$$\nabla \times \mathbf{B}^{\mathrm{string}}(\mathbf{x}) = \frac{\Phi}{2\pi} \frac{\partial}{\partial \rho} \left( \frac{\delta(\rho)}{\rho} \right) \mathbf{e}_{\phi} \equiv \mathbf{j}_{\mathrm{string}}(\mathbf{x}) . \tag{24}$$

This means that the new magnetic field $\mathbf{B}'(\mathbf{x})$ satisfies the following equation:

$$\nabla \times \mathbf{B}'(\mathbf{x}) = \mathbf{j}_{\mathrm{ext}}(\mathbf{x}) + \mathbf{j}_{\mathrm{string}}(\mathbf{x}) . \tag{25}$$

This is clearly different from the original Maxwell equation for the magnetic field $\mathbf{B}(\mathbf{x})$ given by

$$\nabla \times \mathbf{B}(\mathbf{x}) = \mathbf{j}_{\mathrm{ext}}(\mathbf{x}) . \tag{26}$$

Beyond doubt, all these observations reveal the physically unacceptable nature of the above multi-valued gauge transformation. We therefore conclude that, as long as such an unphysical gauge transformation is excluded, the transverse–longitudinal decomposition of the vector potential is unique at least in our setting of an infinitely long solenoid.

3. Unveiling the Role of Vector Potential in the Aharonov–Bohm Effect

Through the discussion in the previous section, we have verified that, at least in the setting of an infinitely long solenoid, the generated vector potential can uniquely be decomposed into a transverse component and a longitudinal component, provided that the possibility of multi-valued gauge transformations is excluded. The point is that a multi-valued gauge transformation may be mathematically allowed, but it is physically unacceptable because it alters the magnetic field distribution of the system or even the form of the basic Maxwell equation.

Most importantly, the transverse component of the vector potential, i.e., $\mathbf{A}_{\perp}(\mathbf{x}) = \mathbf{A}^{(S)}(\mathbf{x})$, is unique and gauge-invariant, and it cannot be eliminated by any regular gauge transformation that leaves the magnetic field distribution intact. This strongly indicates that this transverse component of the vector potential is not just a convenient mathematical tool but rather carries definite physical content. Nevertheless, it is also true that the entire vector potential still contains the longitudinal part, which cannot be free from gauge ambiguity. How one should confront this puzzling situation has already been discussed by several researchers [28,29,30]. Unfortunately, in none of these previous investigations was the uniqueness argument for the transverse–longitudinal decomposition completed at the level presented in the present paper. This is probably the reason why such past analyses could not completely dispel the misbelief that the vector potential is just a mathematical tool with little physical substance. Now, we are ready to make a more definitive statement on this long-standing frustrating situation.

First, let us consider the most fundamental AB phase shift measured through the interference of the two electron beams. (See the schematic picture illustrated in Figure 1). According to the standard analysis, the phase change of the electron wave function along the path $C_{1}$ is given by

$$\Delta \phi_{AB}(C_{1}) = e \int_{C_{1}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x}, \tag{27}$$

while the phase change along the path $C_{2}$ is given by

$$\Delta \phi_{AB}(C_{2}) = e \int_{C_{2}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} . \tag{28}$$

Since the observable AB phase shift corresponds to the difference between the above two phase shifts, it is eventually given by the following expression, which is proportional to the closed line integral of the vector potential $\mathbf{A}(\mathbf{x})$:

$$\phi_{AB} = e \int_{C_{1}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} - e \int_{C_{2}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} = e \oint_{C_{1} - C_{2}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} . \tag{29}$$

By now, we know that the vector potential is generally given as $\mathbf{A}(\mathbf{x}) = \mathbf{A}^{(S)}(\mathbf{x}) + \nabla \chi(\mathbf{x})$. The above closed line integral of the vector potential is then given by

$$\oint_{C_{1} - C_{2}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} = \oint_{C_{1} - C_{2}} \mathbf{A}^{(S)}(\mathbf{x}) \cdot d\mathbf{x} + \oint_{C_{1} - C_{2}} \nabla \chi(\mathbf{x}) \cdot d\mathbf{x} . \tag{30}$$

Here, we have

$$\oint_{C_{1} - C_{2}} \mathbf{A}^{(S)}(\mathbf{x}) \cdot d\mathbf{x} = \int_{S} \mathbf{B}(\mathbf{x}) \cdot d\mathbf{S} = \Phi, \tag{31}$$

and

$$\oint_{C_{1} - C_{2}} \nabla \chi(\mathbf{x}) \cdot d\mathbf{x} = \int_{S} \left( \nabla \times \nabla \chi(\mathbf{x}) \right) \cdot d\mathbf{S} = 0, \tag{32}$$

since the gauge function $\chi(\mathbf{x})$ is demanded to satisfy the constraint $\nabla \times \nabla \chi(\mathbf{x}) = 0$. This means that the transverse component $\mathbf{A}^{(S)}(\mathbf{x})$ solely explains the Aharonov–Bohm phase shift, and the gauge-dependent longitudinal component never contributes to it.
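The statement that only the transverse part contributes to the closed-loop phase can be made concrete with a small numerical experiment. The sketch below is illustrative only and rests on several assumptions that are not in the original text: the loop $C_{1} - C_{2}$ is replaced by an ellipse with semi-axes 3 and 2 encircling a solenoid of radius $R = 1$, the regular gauge function is the arbitrarily chosen $\chi = x^{2} y + \sin x$, and $e = \Phi = 1$.

```python
import math

PHI = 1.0       # total flux Phi (illustrative value)
R = 1.0         # solenoid radius (illustrative value)
E_CHARGE = 1.0  # charge e in natural units (illustrative value)


def a_s(x, y):
    """Transverse part A^(S): rho/R^2 profile inside, 1/rho outside."""
    rho2 = x * x + y * y
    k = PHI / (2.0 * math.pi * (R * R if rho2 < R * R else rho2))
    return (-k * y, k * x)


def grad_chi(x, y):
    """Gradient of the hypothetical single-valued gauge function chi = x^2 y + sin x."""
    return (2.0 * x * y + math.cos(x), x * x)


def a_total(x, y):
    """Full potential A = A^(S) + grad(chi)."""
    ax, ay = a_s(x, y)
    gx, gy = grad_chi(x, y)
    return (ax + gx, ay + gy)


def loop_integral(field, a=3.0, b=2.0, n=4000):
    """Midpoint-rule integral of field . dx around the ellipse (a cos t, b sin t)."""
    dt = 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        s, c = math.sin(t), math.cos(t)
        fx, fy = field(a * c, b * s)
        total += (fx * (-a * s) + fy * (b * c)) * dt
    return total


phase_transverse = E_CHARGE * loop_integral(a_s)
phase_full = E_CHARGE * loop_integral(a_total)
phase_gauge = E_CHARGE * loop_integral(grad_chi)

print(phase_transverse, phase_full, phase_gauge)
```

Both `phase_transverse` and `phase_full` come out equal to $e\Phi$ up to numerical error, while `phase_gauge` vanishes. Any other smooth single-valued choice of $\chi$ should behave in the same way, since the line integral of a gradient around a closed loop is zero.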

Importantly, however, if we consider the phase change corresponding to a non-closed path connecting the two spatial points $\mathbf{x}_{i}$ and $\mathbf{x}_{f}$, it is given by

$$\Delta \phi_{AB} = e \int_{\mathbf{x}_{i}}^{\mathbf{x}_{f}} \mathbf{A}(\mathbf{x}) \cdot d\mathbf{x} . \tag{33}$$

This quantity is also divided into two pieces as

$$\Delta \phi_{AB} = e \int_{\mathbf{x}_{i}}^{\mathbf{x}_{f}} \left( \mathbf{A}^{(S)}(\mathbf{x}) + \nabla \chi(\mathbf{x}) \right) \cdot d\mathbf{x} = e \int_{\mathbf{x}_{i}}^{\mathbf{x}_{f}} \mathbf{A}^{(S)}(\mathbf{x}) \cdot d\mathbf{x} + e \left( \chi(\mathbf{x}_{f}) - \chi(\mathbf{x}_{i}) \right) . \tag{34}$$

Although the first term of the above equation is gauge-invariant, the second term is not, because of the arbitrariness of the gauge function $\chi(\mathbf{x})$ contained in the longitudinal part. This means that such a partial AB phase shift is a gauge-dependent quantity. Hence, as long as we believe in the widely accepted gauge principle, we must conclude that it does not correspond to any measurable quantity. The statement above may sound self-evident to some researchers. We, however, recall that this conclusion contradicts the recent claims by several authors that such a partial Aharonov–Bohm phase shift can in principle be observed. In the next section, we briefly introduce and comment on these challenging claims.
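The gauge dependence of the partial phase can likewise be seen in a short numerical sketch. As before, this is an illustration under assumed inputs rather than part of the original analysis: the non-closed path is taken to be the upper semicircle of radius 2 from $(2, 0)$ to $(-2, 0)$ around a solenoid of radius $R = 1$, the gauge function is the hypothetical $\chi = x^{2} y + \sin x$, and $e = \Phi = 1$.

```python
import math

PHI = 1.0       # total flux Phi (illustrative value)
R = 1.0         # solenoid radius (illustrative value)
E_CHARGE = 1.0  # charge e in natural units (illustrative value)


def a_s(x, y):
    """Transverse part A^(S): rho/R^2 profile inside, 1/rho outside."""
    rho2 = x * x + y * y
    k = PHI / (2.0 * math.pi * (R * R if rho2 < R * R else rho2))
    return (-k * y, k * x)


def chi(x, y):
    """Hypothetical single-valued gauge function (an arbitrary smooth choice)."""
    return x * x * y + math.sin(x)


def a_total(x, y):
    """Full potential A = A^(S) + grad(chi), with the gradient written analytically."""
    ax, ay = a_s(x, y)
    return (ax + 2.0 * x * y + math.cos(x), ay + x * x)


def path_integral(field, r=2.0, n=20000):
    """Midpoint-rule integral of field . dx along the upper semicircle of radius r."""
    dt = math.pi / n
    total = 0.0
    for i in range(n):
        t = (i + 0.5) * dt
        s, c = math.sin(t), math.cos(t)
        fx, fy = field(r * c, r * s)
        total += (fx * (-r * s) + fy * (r * c)) * dt
    return total


partial_transverse = E_CHARGE * path_integral(a_s)
partial_full = E_CHARGE * path_integral(a_total)
gauge_shift = E_CHARGE * (chi(-2.0, 0.0) - chi(2.0, 0.0))

print(partial_transverse, partial_full, partial_full - partial_transverse, gauge_shift)
```

The transverse contribution along this half loop is $e\Phi/2$, whereas the full partial phase differs from it by $e\,(\chi(\mathbf{x}_{f}) - \chi(\mathbf{x}_{i}))$, a number that can be changed at will by choosing a different $\chi$.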

4. On Some Attempts to Explain the AB Effect Without Using the Gauge-Variant Electromagnetic Potential

It is widely accepted that the vital importance of the vector potential in quantum mechanics was established in the paper by Aharonov and Bohm on the quantum phenomenon associated with their names [2]. However, it appears that, because of the gauge-dependent nature of the vector potential, Aharonov himself is not completely satisfied with the vector-potential-based interpretation of the AB effect. This has motivated him and his collaborators to look for explanations which do not use the gauge-dependent vector potential [13,14]. Along this line of exploration, Vaidman argued that when the source of the electromagnetic potential is treated in the framework of quantum theory, the AB effect can be explained without the notion of vector potential [16]. To be more concrete, he considered the following setup. The solenoid consists of two cylinders with opposite charges $Q$ and $-Q$ spread on their surfaces. The cylinders rotate in opposite directions with a certain surface velocity. The electron is supposed to encircle the solenoid with some velocity in a superposition of being on the left and on the right side of the circular trajectory. When the electron enters one arm of the circle, it changes the magnetic flux through a cross-section of the solenoid and thereby induces an electromotive force acting on the solenoid. Vaidman explicitly calculated the shift of the wave packet of each cylinder during the motion of the electron. Combining this result with the information on the relevant wavelength of the de Broglie wave of each cylinder, he eventually showed that this analysis precisely reproduces the familiar AB phase shift.

Although Vaidman emphasized the quantum mechanical nature of his analysis, there appear to be close connections between his analysis and Boyer’s semiclassical analysis of the AB effect [36], especially as to the basic interaction dynamics of the solenoid and electron. Boyer considered a solenoid as a stack of electric current loops. He calculated Lorentz force due to the electron acting on charge carriers flowing in each current loop. This Lorentz force was shown to generate the change in velocity and the electron paths. By calculating the difference in path length for electron paths passing on either side of the solenoid, Boyer demonstrated that the resultant path length difference leads to a semiclassical phase shift which reproduces the known AB phase shift.

In any case, a common ingredient in the explanations of Vaidman and of Boyer is the presence of force acting on the solenoid induced by the motion of the electron. However, we recall that such an explanation of the AB phase shift due to force has been believed to be incompatible with the dispersionless nature of the AB effect, which means that the magnitude of the AB phase shift is independent of the electron velocity [19,20]. Unfortunately, no decisive experiment to verify the dispersionless nature of the AB effect has been carried out for a long time. Some years ago, however, Caprez, Barwick, and Batelaan carried out a crucial time delay measurement of the electron beam and verified that no time delay was observed, thereby concluding that all force explanations of the AB phase shift are ruled out [37].

Also worthy of mention is yet another explanation of the AB phase shift. The basic postulate of this approach is that the AB phase shift is proportional to the change in the interaction energy between the charged particle and the solenoid along the path of the moving charge. This idea can be traced back to Boyer’s older work [26], which is based on the framework of classical electrodynamics. He assumed that the AB phase shift for a given path of the moving charge is proportional to the change in the interaction energy between the magnetic field $\mathbf{B}^{s}$ generated by the current of an infinitely long solenoid and the magnetic field $\mathbf{B}'$ generated by a charge moving with a constant velocity $\mathbf{v}$ as

$$\Delta \phi_{AB} \propto \Delta \varepsilon(\mathrm{Boyer}) = \int \mathbf{B}^{s}(\mathbf{x}') \cdot \mathbf{B}'(\mathbf{x}', t) \, d^{3}x' , \tag{35}$$

where $\mathbf{B}^{s}(\mathbf{x})$ is the magnetic field generated by the surface current $\mathbf{j}_{\mathrm{ext}}(\mathbf{x})$ of the solenoid according to the Maxwell equation,

$$\nabla \times \mathbf{B}^{s}(\mathbf{x}) = \mathbf{j}_{\mathrm{ext}}(\mathbf{x}) . \tag{36}$$

After transforming the above expression by making full use of the knowledge of classical electrodynamics, Boyer arrived at a remarkable relation

$$\Delta \varepsilon(\mathrm{Boyer}) = e \, \mathbf{v} \cdot \mathbf{A}^{(S)}(\mathbf{x}), \tag{37}$$

with

$$\mathbf{A}^{(S)}(\mathbf{x}) = \frac{1}{4\pi} \int \frac{\mathbf{j}_{\mathrm{ext}}(\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|} \, d^{3}x' . \tag{38}$$

As emphasized by Boyer, the above $\Delta \varepsilon(\mathrm{Boyer})$ is free from the gauge choice. This is because, in the above expression, the quantity $\mathbf{A}^{(S)}(\mathbf{x})$ is uniquely determined by the surface current $\mathbf{j}_{\mathrm{ext}}(\mathbf{x})$ of the solenoid, which is gauge-invariant. This claim sounds reasonable, because the interaction energy is likely to be a gauge-invariant quantity.

Motivated by the work of Boyer, several researchers have investigated the interaction energy between the solenoid and a moving charge within the framework of quantum electrodynamics [21,22,23,24]. (Also noteworthy is a related but slightly different approach discussed in [38]). They evaluated the interaction energy between the solenoid current and the charged particle mediated by the exchange of a virtual photon, thereby arriving at the following answer:

$$\Delta \varepsilon(\text{virtual photon exchange}) = - e \, \mathbf{v} \cdot \mathbf{A}^{(S)}(\mathbf{x}), \tag{39}$$

where $\mathbf{A}^{(S)}(\mathbf{x})$ is the same quantity as that appearing in the corresponding interaction energy obtained by Boyer [26].

Important messages from the authors of the above investigations are as follows. The change in interaction energy between the solenoid and the charged particle along the path of the moving charge is a gauge-invariant quantity. Therefore, if one accepts the above-mentioned postulation that the phase change of the electron wave function is proportional to the change in interaction energy along the path of the moving charge, the AB phase shift for a non-closed path is also a gauge-invariant quantity. This appears to indicate that it can in principle be observed. In fact, based on this belief, several authors proposed some concrete measurements for extracting the partial AB phase shift corresponding to a non-closed path [22,23,24].

This claim was, however, criticized in our recent paper [27]. It was pointed out that, very strangely, the expressions for the interaction energy of Boyer and for that due to virtual-photon exchange are identical in magnitude but opposite in sign (this remarkable fact was never noticed before, since the above researchers paid attention only to the absolute magnitude of the predicted AB phase shift). It was further shown that, within the framework of a self-contained quantum mechanical treatment of the combined system of a solenoid, a charged particle, and the quantized electromagnetic field, the interaction energy of Boyer and that due to virtual-photon exchange exactly cancel each other out. (Since this demonstration requires fairly careful preparation, interested readers are referred to the original paper [27]). The analysis there rather shows that the origin of the AB phase shift can be traced back to another part of the self-contained treatment above, which is, after all, identical to the standard mechanism explained in Section 3 of the present paper. This means that the AB phase shift corresponding to a non-closed path is not a gauge-invariant quantity, so that its observation would most likely contradict the celebrated gauge principle. In any case, it seems to us that all attempts at explaining the AB phase shift without using the notion of the vector potential have been unsuccessful up to the present. At this point in time, the vector potential interpretation seems to be the simplest and most reasonable physical explanation of the AB effect.

5. Summary and Conclusions

The vector-potential-based interpretation of the AB effect is not universally accepted because of the gauge-variant nature of the vector potential. Even now, multiple researchers seem to believe that the vector potential is just a convenient tool for obtaining the electromagnetic field, and they are searching for an explanation of the AB effect without using the vector potential concept. In the present paper, we tried to demonstrate that the vector potential is not just a convenient mathematical tool with little physical substance. The argument proceeds as follows. Employing the simplest setting of the system, i.e., an infinitely long solenoid, we have shown the following facts:

The vector potential generated by the infinitely long solenoid is given as a sum of the transverse part and the longitudinal part.
The above decomposition is unique as long as the multi-valued gauge transformation is excluded. In particular, the transverse part of the vector potential is uniquely determined by the surface electric current distribution of the solenoid.
The multi-valued (and singular) gauge transformation is not allowed from the physical point of view, because it inevitably generates a new or extra magnetic field distribution which is originally absent in the system.
The transverse part of the vector potential solely explains the standard Aharonov–Bohm effect corresponding to a closed path of the electron’s trajectory.
Nevertheless, one should not forget the fact that the vector potential still contains the longitudinal part which has inherent gauge arbitrariness. It seems to us that this gauge arbitrariness forbids the observation of the partial AB phase shift corresponding to a non-closed path, which was recently claimed to be possible by several researchers.

To sum up, we conclude that the vector potential contains a piece which is unique, gauge-invariant, and cannot be eliminated by any regular gauge transformation. This part of the vector potential solely explains the standard Aharonov–Bohm effect. However, the remaining ambiguity of the longitudinal part of the vector potential is thought to forbid the observation of the partial Aharonov–Bohm phase shift corresponding to a non-closed path, because such a phase shift is gauge-dependent and its observation would contradict the celebrated gauge principle. Conversely, if the AB phase shift corresponding to a non-closed path were observed, it would provide the first counterexample to the gauge principle. Undoubtedly, this last statement is closely tied to the validity of the vector-potential-based interpretation of the Aharonov–Bohm effect, which has been considerably refined in the present paper.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. One Familiar Physical System in Which the Transverse-Longitudinal Decomposition of the Vector Potential Is Not Unique

In Section 2, we have shown that the magnetic vector potential generated by an infinitely long solenoid is uniquely decomposed into transverse and longitudinal components, once we exclude physically unacceptable multi-valued or singular gauge transformations. Naturally, whether the transverse–longitudinal decomposition of the vector potential is unique or not depends on the physical system under consideration. One interesting example is provided by the familiar Landau problem, which deals with the quantum mechanical motion of an electron in an infinitely extended uniform magnetic field. As is well known, there are three typical choices of gauge potential which reproduce the uniform magnetic field. These are the symmetric gauge potential $A^{(S)} (x)$, the first Landau gauge potential $A^{(L_{1})} (x)$, and the second Landau gauge potential $A^{(L_{2})} (x)$, respectively given as

\begin{matrix} A^{(S)} (x) & = & \frac{1}{2} (- B y e_{x} + B x e_{y}) = \frac{1}{2} B r e_{ϕ}, \end{matrix}

(A1)

\begin{matrix} A^{(L_{1})} (x) & = & - B y e_{x}, \end{matrix}

(A2)

\begin{matrix} A^{(L_{2})} (x) & = & + B x e_{y} . \end{matrix}

(A3)

These potentials are related through the following gauge transformations

\begin{matrix} A^{(S)} (x) & = & A^{(L_{1})} (x) + \nabla χ_{1} (x) \quad \text{with} \quad χ_{1} (x) = + \frac{1}{2} B x y, \end{matrix}

(A4)

\begin{matrix} A^{(S)} (x) & = & A^{(L_{2})} (x) + \nabla χ_{2} (x) \quad \text{with} \quad χ_{2} (x) = - \frac{1}{2} B x y . \end{matrix}

(A5)
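The gauge relations (A4) and (A5) can be checked numerically. The following is only a minimal sketch: the field strength `B`, the sample points, and all helper names (`grad`, `curl_z`, `gauge_residual`) are illustrative assumptions and do not appear in the paper. It uses central finite differences to confirm that both Landau potentials, shifted by the corresponding gradient, reproduce the symmetric potential, and that all three potentials have the same curl.

```python
# Numerical sanity check of (A1)-(A5) in the x-y plane.
# B and the sample points are arbitrary illustrative choices.
B = 2.0

def A_S(x, y):   # symmetric gauge, Eq. (A1)
    return (-0.5 * B * y, 0.5 * B * x)

def A_L1(x, y):  # first Landau gauge, Eq. (A2)
    return (-B * y, 0.0)

def A_L2(x, y):  # second Landau gauge, Eq. (A3)
    return (0.0, B * x)

def chi1(x, y):  # gauge function of Eq. (A4)
    return 0.5 * B * x * y

def chi2(x, y):  # gauge function of Eq. (A5)
    return -0.5 * B * x * y

def grad(f, x, y, h=1e-5):
    """Central-difference gradient of a scalar function f(x, y)."""
    return ((f(x + h, y) - f(x - h, y)) / (2 * h),
            (f(x, y + h) - f(x, y - h)) / (2 * h))

def curl_z(A, x, y, h=1e-5):
    """z-component of the curl of a planar vector field A(x, y)."""
    dAy_dx = (A(x + h, y)[1] - A(x - h, y)[1]) / (2 * h)
    dAx_dy = (A(x, y + h)[0] - A(x, y - h)[0]) / (2 * h)
    return dAy_dx - dAx_dy

def gauge_residual(A_from, chi, x, y):
    """Largest component of A_S - (A_from + grad chi) at one point."""
    a, g, s = A_from(x, y), grad(chi, x, y), A_S(x, y)
    return max(abs(s[0] - (a[0] + g[0])), abs(s[1] - (a[1] + g[1])))

points = [(0.3, -1.2), (1.5, 0.7), (-2.0, 2.5)]
max_residual = max(
    max(gauge_residual(A_L1, chi1, x, y), gauge_residual(A_L2, chi2, x, y))
    for x, y in points
)
curls = [curl_z(A, x, y) for A in (A_S, A_L1, A_L2) for x, y in points]

print("max gauge residual:", max_residual)
print("curl values (each should be close to B):", curls)
```

Because the potentials are linear and the gauge functions bilinear in x and y, the finite differences are exact up to rounding, so the residual is expected to sit at the level of floating-point noise.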

We start with the fact that any gauge potential reproducing the uniform magnetic field can be expressed in the form

A (x) = A^{(S)} (x) + \nabla χ (x),

(A6)

where $χ (x)$ is an arbitrary scalar function subject to the constraint $\nabla \times \nabla χ (x) = 0$. With the identification $A_{⊥} (x) = A^{(S)} (x)$ and $A_{‖} (x) = \nabla χ (x)$, Equation (A6) certainly gives a transverse–longitudinal decomposition of the vector potential as

A (x) = A^{(S)} (x) + \nabla χ (x) \equiv A_{⊥} (x) + A_{‖} (x) .

(A7)

However, the vector potential $A (x)$ can also be expressed in any of the following three forms

\begin{matrix} A (x) & = & A^{(S)} (x) + \nabla χ (x), \end{matrix}

(A8)

\begin{matrix} & = & A^{(L_{1})} (x) + \nabla χ_{1}^{'} (x) \quad \text{with} \quad χ_{1}^{'} \equiv χ + χ_{1}, \end{matrix}

(A9)

\begin{matrix} & = & A^{(L_{2})} (x) + \nabla χ_{2}^{'} (x) \quad \text{with} \quad χ_{2}^{'} \equiv χ + χ_{2} . \end{matrix}

(A10)

Each of these three expressions is a transverse–longitudinal decomposition, since the following relations hold:

\begin{matrix} \nabla \cdot A^{(S)} (x) & = & \nabla \cdot A^{(L_{1})} (x) = \nabla \cdot A^{(L_{2})} (x) = 0, \end{matrix}

(A11)

\begin{matrix} \nabla \times \nabla χ (x) & = & \nabla \times \nabla χ_{1}^{'} (x) = \nabla \times \nabla χ_{2}^{'} (x) = 0 . \end{matrix}

(A12)
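For reference, the divergence-free conditions in (A11) follow directly from the Cartesian components in (A1)–(A3). The short worked check below is a sketch in standard notation, where $\partial_{x}$ and $\partial_{y}$ denote partial derivatives (a shorthand not used elsewhere in the text):

```latex
\begin{aligned}
\nabla \cdot A^{(S)}     &= \partial_{x} \left( - \tfrac{1}{2} B y \right) + \partial_{y} \left( + \tfrac{1}{2} B x \right) = 0, \\
\nabla \cdot A^{(L_{1})} &= \partial_{x} \left( - B y \right) = 0, \\
\nabla \cdot A^{(L_{2})} &= \partial_{y} \left( + B x \right) = 0 .
\end{aligned}
```

Consistently with this, the gauge functions in (A4) and (A5) are harmonic, $\nabla^{2} χ_{1} = \nabla^{2} χ_{2} = 0$, so adding their gradients does not spoil the transversality of the first term.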

Undoubtedly, the transverse–longitudinal decomposition of the vector potential is not unique in the setting of the Landau system.

Also worthy of mention is the existence of a multi-valued gauge transformation in the Landau system [39]. Let us consider the gauge potential $A^{(B B)} (x)$ obtained from the symmetric gauge potential $A^{(S)} (x)$ by the following multi-valued gauge transformation:

A^{(B B)} (x) = A^{(S)} (x) + \nabla \tilde{χ} (x),

(A13)

with

\tilde{χ} (x) = - \frac{1}{2} B r^{2} ϕ .

(A14)

An explicit calculation gives

A^{(B B)} (x) = - B r ϕ e_{r},

(A15)

which was called the vector potential in the Bawin–Burnel gauge [33,39]. Unlike in the infinitely long solenoid problem, the above multi-valued gauge transformation does not generate an extra magnetic field distribution, because the gradient $\nabla \tilde{χ} (x)$ satisfies the rotation-free condition

\nabla \times \nabla \tilde{χ} (x) = 0 .

(A16)
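The explicit calculation leading from (A13) and (A14) to (A15) can be spelled out as follows. This is a sketch using the standard gradient in plane polar coordinates $(r, ϕ)$, with $A^{(S)} = \frac{1}{2} B r e_{ϕ}$ taken from (A1):

```latex
\begin{aligned}
\nabla \tilde{χ}
  &= e_{r} \, \frac{\partial \tilde{χ}}{\partial r}
   + e_{ϕ} \, \frac{1}{r} \frac{\partial \tilde{χ}}{\partial ϕ}
   = - B r ϕ \, e_{r} - \tfrac{1}{2} B r \, e_{ϕ}, \\
A^{(B B)}
  &= A^{(S)} + \nabla \tilde{χ}
   = \tfrac{1}{2} B r \, e_{ϕ} - B r ϕ \, e_{r} - \tfrac{1}{2} B r \, e_{ϕ}
   = - B r ϕ \, e_{r} .
\end{aligned}
```

The azimuthal component of the symmetric gauge potential is thus cancelled exactly, leaving a purely radial potential that depends explicitly on the multi-valued angle $ϕ$.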

We emphasize that this situation is significantly different from the case of the multi-valued gauge transformation in the infinitely long solenoid system, which inevitably generates a new or extra magnetic field distribution of string type.
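The statement that the Bawin–Burnel potential still reproduces the uniform magnetic field can be illustrated numerically. The sketch below makes several assumptions that are not part of the paper: the angle $ϕ$ is represented by the principal branch `math.atan2(y, x)`, the sample points are chosen away from the origin and from the branch cut on the negative x-axis, and `B` and the helper names are illustrative. At such points it compares the curl of $A^{(B B)}$ with that of $A^{(S)}$; by (A13), their difference is the curl of $\nabla \tilde{χ}$.

```python
import math

B = 2.0  # illustrative field strength (assumption)

def A_S(x, y):
    """Symmetric gauge potential, Eq. (A1)."""
    return (-0.5 * B * y, 0.5 * B * x)

def A_BB(x, y):
    """Bawin-Burnel potential, Eq. (A15): -B r phi e_r, with r e_r = (x, y).

    phi is taken on the principal branch (-pi, pi], an assumption of this sketch.
    """
    phi = math.atan2(y, x)
    return (-B * phi * x, -B * phi * y)

def grad_chi_tilde(x, y):
    """Gradient of the gauge function, obtained from Eq. (A13) as A_BB - A_S."""
    a, s = A_BB(x, y), A_S(x, y)
    return (a[0] - s[0], a[1] - s[1])

def curl_z(A, x, y, h=1e-5):
    """z-component of the curl via central finite differences."""
    dAy_dx = (A(x + h, y)[1] - A(x - h, y)[1]) / (2 * h)
    dAx_dy = (A(x, y + h)[0] - A(x, y - h)[0]) / (2 * h)
    return dAy_dx - dAx_dy

# Points away from the origin and from the cut of atan2 (negative x-axis).
points = [(1.0, 0.5), (0.4, -1.3), (-0.8, 1.1)]

curl_BB = [curl_z(A_BB, x, y) for x, y in points]
curl_grad_chi = [curl_z(grad_chi_tilde, x, y) for x, y in points]

print("curl of A_BB (should be close to B):", curl_BB)
print("curl of grad chi-tilde (should be close to 0):", curl_grad_chi)
```

This check only probes points where the chosen branch of $ϕ$ is smooth; it says nothing about the behavior on the cut itself.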

Finally, for reference, we reiterate that the non-uniqueness of the transverse–longitudinal decomposition of the vector potential in the Landau problem is related to a special feature of the Landau system, in which the uniform and constant magnetic field is spread over the whole x-y plane. Such a field obviously violates the validity conditions of the famous Helmholtz theorem, which ensures the uniqueness of the transverse–longitudinal decomposition of a vector field (see, for example, Appendix B of the book [40]).

References

1. Ehrenberg, W.; Siday, R.E. The refractive index in electron optics and the principles of dynamics. Proc. Phys. Soc. Lond. B 1949, 62, 8–21.
2. Aharonov, Y.; Bohm, D. Significance of electromagnetic potentials in quantum theory. Phys. Rev. 1959, 115, 485–491.
3. Peshkin, M. The Aharonov-Bohm effect: Why it cannot be eliminated from quantum mechanics. Phys. Rep. 1981, 80, 375–386.
4. Olariu, S.; Popescu, I.I. The quantum effects of electromagnetic fluxes. Rev. Mod. Phys. 1985, 57, 339–436.
5. Peshkin, M.; Tonomura, A. The Aharonov-Bohm effect. Lect. Notes Phys. 1989, 340, 1–152.
6. Wakamatsu, M.; Kitadono, Y.; Zou, L.; Zhang, P.-M. The role of electron orbital angular momentum in the Aharonov-Bohm effect revisited. Ann. Phys. 2018, 397, 259–277.
7. Feynman, R.P.; Leighton, R.B.; Sands, M. The Feynman Lectures on Physics, Vol. II: Mainly Electromagnetism and Matter; Basic Books, A Member of the Perseus Books Group: New York, NY, USA, 2011; Chapter 15.
8. Konopinski, E.J. What the electromagnetic potential describes? Am. J. Phys. 1978, 46, 499–502.
9. Semon, M.D.; Taylor, J.R. Thoughts on the magnetic vector potential. Am. J. Phys. 1996, 64, 1361–1369.
10. Tonomura, A.; Osakabe, N.; Matsuda, T.; Kawasaki, T.; Endo, J. Evidence for Aharonov-Bohm effect with magnetic field completely shielded from electron wave. Phys. Rev. Lett. 1986, 56, 792–795.
11. Osakabe, N.; Matsuda, T.; Kawasaki, T.; Endo, J.; Tonomura, A.; Yano, S.; Yamada, H. Experimental confirmation of Aharonov-Bohm effect using a toroidal magnet field confined by a superconductor. Phys. Rev. A 1986, 34, 815–822.
12. Healey, R. Nonlocality and the Aharonov-Bohm Effect. Philos. Sci. 1997, 64, 18–41.
13. Aharonov, Y.; Vaidman, L. Nonlocal aspects of a quantum wave. Phys. Rev. A 2000, 61, 052108.
14. Aharonov, Y.; Cohen, E.; Rohrlich, D. Nonlocality of the Aharonov-Bohm effect. Phys. Rev. A 2016, 93, 042110.
15. Heras, J.A.; Heras, R. Topology, nonlocality and duality in classical electrodynamics. Eur. Phys. J. Plus 2022, 137, 157.
16. Vaidman, L. Role of potentials in the Aharonov-Bohm effect. Phys. Rev. A 2012, 86, 040101(R).
17. Vaidman, L. Reply to “Comment on ‘Role of potentials in the Aharonov-Bohm effect’”. Phys. Rev. A 2015, 92, 026102.
18. Aharonov, Y.; Cohen, E.; Rohrlich, D. Comment on “Role of potentials in the Aharonov-Bohm effect”. Phys. Rev. A 2015, 92, 026101.
19. Zeilinger, A. Generalized Aharonov-Bohm Experiments with Neutrons. In Fundamental Aspects of Quantum Theory, Como 1985; Gorini, V., Frigerio, A., Eds.; Plenum Press: New York, NY, USA, 1986; pp. 311–318.
20. Peshkin, M. Force Free Interactions and Nondispersive Phase Shifts in Interferometry. Found. Phys. 1999, 29, 481–489.
21. Santos, E.; Gonzalo, I. Microscopic theory of the Aharonov-Bohm effect. Europhys. Lett. 1999, 45, 418–423.
22. Kang, K. Proposal for locality test of the Aharonov-Bohm effect via Andreev interferometer without a loop. J. Korean Phys. Soc. 2017, 71, 565–570.
23. Marletto, C.; Vedral, V. Aharonov-Bohm Phase is Locally Generated Like All Other Quantum Phases. Phys. Rev. Lett. 2020, 125, 040401.
24. Saldanha, P.L. Local Description of the Aharonov-Bohm effect with a Quantum Electromagnetic Field. Found. Phys. 2021, 51, 6.
25. Li, X.; Hansson, T.H.; Ku, W. Gauge-independent description of the Aharonov-Bohm effect. Phys. Rev. A 2022, 106, 032217.
26. Boyer, T.H. Classical Electromagnetic Interaction of a Charged Particle with a Constant-Current Solenoid. Phys. Rev. D 1973, 8, 1667–1679.
27. Wakamatsu, M. Is the Aharonov-Bohm phase shift for a non-closed path a measurable quantity? Eur. Phys. J. Plus 2024, 139, 112.
28. Adachi, T.; Inagaki, T.; Ozaki, M.; Sasabe, K. The Vector Potential Revisited. Electr. Eng. Jpn. 1993, 114, 11–16, Translated from Denki Gakkai Ronbunnshi A 1992, 112, 763–767.
29. Stewart, A.M. Vector potential of the Coulomb gauge. Eur. J. Phys. 2003, 24, 519–524.
30. Li, J.-F.; Jiang, Y.; Sun, W.-M.; Zong, H.-S.; Wang, F. New application of decomposition of U(1) gauge potential: Aharonov-Bohm effect and Anderson-Higgs mechanism. Mod. Phys. Lett. B 2012, 26, 1250124.
31. Shadowitz, A. The Electromagnetic Field; Dover Publication, Inc.: New York, NY, USA, 1975.
32. Bocchieri, P.; Loinger, A. Nonexistence of the Aharonov-Bohm Effect. Nuovo Cimento 1978, 47, 475–482.
33. Bawin, M.; Burnel, A. Aharonov-Bohm effect and gauge invariance. J. Phys. A Math. Gen. 1983, 16, 2173–2177.
34. Miyazawa, H.; Miyazawa, T. The Physical Meaning of the Ehrenberg-Siday-Aharonov-Bohm Effect (In Japanese). Available online: http://www.miyazawa1.sakura.ne.jp/papers/esba.pdf.
35. Kretzschmar, M. Must Quantal Wave Functions be Single-Valued? Z. Phys. 1965, 185, 73–83.
36. Boyer, T.H. Semiclassical Explanation of the Matteucci-Pozzi and Aharonov-Bohm Phase Shifts. Found. Phys. 2002, 32, 41–49.
37. Caprez, A.; Barwick, B.; Batelaan, H. Macroscopic Test of the Aharonov Bohm Effect. Phys. Rev. Lett. 2007, 99, 210401.
38. Kholmetskii, A.I.; Missevitch, O.; Yarman, T. Role of electromagnetic energy and momentum in the Aharonov-Bohm effect. Proc. R. Soc. 2024, 480, 20230286.
39. Wakamatsu, M.; Kitadono, Y.; Zhang, P.-M. The issue of gauge choice in the Landau problem and the physics of canonical and mechanical orbital angular momentum. Ann. Phys. 2018, 392, 287–322.
40. Griffiths, D.J. Introduction to Electrodynamics; Prentice Hall: Upper Saddle River, NJ, USA, 1999.

Figure 1. Schematic picture showing two paths connecting the initial point $P (x_{i})$, where the electron beam is ejected, and the final point $Q (x_{f})$ on the screen.


Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wakamatsu, M. Unveiling the Role of Vector Potential in the Aharonov–Bohm Effect. Symmetry 2025, 17, 935. https://doi.org/10.3390/sym17060935
