Abstract
Quaternions have an (over a century-old) extensive and quite complicated interaction with special relativity. Since quaternions are intrinsically 4-dimensional, and do such a good job of handling 3-dimensional rotations, the hope has always been that the use of quaternions would simplify some of the algebra of the Lorentz transformations. Herein we report a new and relatively nice result for the relativistic combination of non-collinear 3-velocities. We work with the relativistic half-velocities w defined by , so that , and promote them to quaternions using , where is a unit quaternion. We shall first show that the composition of relativistic half-velocities is given by , and then show that this is also equivalent to . Here as usual we adopt units where the speed of light is set to unity. Note that all of the complicated angular dependence for relativistic combination of non-collinear 3-velocities is now encoded in the quaternion multiplication of with . This result can furthermore be extended to obtain novel elegant and compact formulae for both the associated Wigner angle and the direction of the combined velocities: , and . Finally, we use this formalism to investigate the conditions under which the relativistic composition of 3-velocities is associative. Thus, we would argue, many key results that are ultimately due to the non-commutativity of non-collinear boosts can be easily rephrased in terms of the non-commutative algebra of quaternions.
1. Introduction
Hamilton first described the quaternions in the mid-1800s, primarily with a view to finding algebraically simple ways to handle 3-dimensional rotations. With the advent of special relativity in 1905, and noting the manifestly 4-dimensional nature of quaternions once one adds a real part, multiple authors have tried to interpret special relativity in an intrinsically quaternionic fashion [1,2,3,4,5,6,7,8,9].
Despite technical success in applying quaternions to special relativity, the use of quaternions in this subject has never really gained all that much traction in the physics community. Perhaps one of the reasons for this is that there are a number of sub-optimal notational choices in Silberstein’s original work [1,2,3], and the fact that there is no generally accepted way of using quaternions to represent Lorentz transformations, with many different authors employing their own quite distinct methods [1,2,3,4,5,6,7,8,9]. Even in more recent, post-millennial, articles on “quaternionic special relativity” there is considerable disagreement on notational choices [10,11,12,13].
Below we shall introduce what we feel is a particularly simple and straightforward method for combining relativistic 3-velocities using quaternions. In particular, we shall present some new and compact formulae for computing the Wigner angle [14]. All of the interesting features due to non-commutativity properties of non-collinear boosts are implicitly and rather efficiently dealt with by the non-commutative algebra of quaternions. The method is based on an extension of an analysis by Giust, Vigoureux, and Lages [15,16], who (because they were working with the usual complex numbers) were essentially limited to motion in 2-space; their formalism is not really well-adapted to general motions in 3-space. Related constructions can also be found in [10,11].
Observe that there is a representation of pure quaternions in terms of a subset of matrices, specifically the anti-hermitian matrices, essentially . (The factor of is important.) However this does not mean that replacing quaternions by Pauli matrices in any way simplifies our results below; it just complicates the formalism. Neither does this mean that any of our results below are at all “well-known” in this alternate notation. We have carefully checked the relevant literature, (roughly speaking, 2-spinor representations of the Lorentz group). There is much more than pedagogy going on—the results reported in our article are (apart from a consistency check or two) both novel and interesting (see also [13].)
2. Preliminaries
2.1. Lorentz Transformations
The set of all Lorentz transformations of space-time form a group called the Lorentz group. Mathematically, the Lorentz group is isomorphic to , the orthogonal group of one time and three space dimensions that preserves the space-time interval
Here and hereafter, as usual we adopt units where the speed of light is set to unity. It is clear from this description that rotations of space-time are included in the Lorentz group, as well as the more familiar pure Lorentz transformations (boosts). In fact, the pure Lorentz transformations do not even form a subgroup of the Lorentz group as, in general, the composition of two boosts and is not another boost but in fact a boost and a rotation ; while . This rotation, known as the Wigner rotation, was first discovered by Llewellyn Thomas in 1926 whilst trying to describe the Zeeman effect from a relativistic view-point [17], and was more fully analyzed by Eugene Wigner in 1939 [14]. (For more recent discussions see [18,19,20,21,22]).
It is well–known that the composition of Lorentz transformations is non-commutative. That is, applying two successive boosts and in different orders results in the same final boost, , but different rotations, . In the context of the combination of two velocities and , this means that the final speed is the same no matter the order we combine the velocities, , but the final directions they point in are different . Although not immediately obvious, the angle between and is in fact the Wigner angle , see Reference [22]. The Lorentz group has very many different representations, one of which is formulated by using the quaternions [1,2,4].
One could instead try to deal with the non-commutativity of the Lorentz transformations by adapting the general formalism of the Baker–Campbell–Hausdorff theorem [23,24,25,26,27]. Unfortunately the general BCH formalism applied to this problem very quickly becomes intractable, and we have found that the specifics of the quaternion formalism yield much more useful and tractable results.
Since the full symmetry group of the Maxwell equations is the conformal extension of the Poincare group, it is sometimes useful, (when looking at pure electromagnetic effects), to work with this conformal extension. However physical observers, (physical clocks and physical rulers), break the conformal invariance, and to even meaningfully define 3-velocities one needs to restrict attention to the Poincare group. We shall go even further and take translation invariance (spatial and temporal homogeneity) for granted, and focus more specifically on the Lorentz group.
2.2. Quaternions
The quaternions are numbers that can be written in the form , where , , , and d are real numbers, and , , and are the quaternion units which satisfy the famous relation
They form a four–dimensional number system that is generally treated as an extension of the complex numbers. We shall define the quaternion conjugate of the quaternion to be , and define the norm of to be . This allows us to evaluate the quaternion inverse as .
Trying to define a “norm” as , while superficially more “relativistic”, violates the usual mathematical definition of “norm”, and furthermore is not useful when it comes to evaluating the quaternion inverse .
For current purposes we focus our attention on pure quaternions. That is, we consider quaternions of the form . Many quaternion operations become much simpler when we are dealing with pure quaternions. For example, the product of two pure quaternions and is given by , where, in general, we shall set . From this, we obtain the useful relations
A notable consequence of (3) is . There is a natural isomorphism between the space of pure quaternions and given by
where and are the standard unit vectors in .
One of the most common uses for quaternions today (2020) is in the computer graphics community, where they are used to compactly and efficiently generate rotations in 3-space. Indeed, if is an arbitrary unit quaternion and is the image of a vector in under the isomorphism (4), then the mapping rotates through an angle about the axis defined by . The mapping is called quaternion conjugation by .
3. Combining Two 3-Velocities
In the paper by Giust, Vigoureux, and Lages [15], see also Reference [16], (and the somewhat related discussion in Reference [10]), a method is developed to compactly combine relativistic velocities in two space dimensions, and by extension, coplanar relativistic velocities in 3 space dimensions. In the following subsection, we first provide a short summary of their approach, and then in the next subsection extend their method to general non-coplanar 3-velocities.
3.1. Velocities in the (x,y)-Plane
The success of this Giust, Vigoureux, and Lages approach relies on the angle addition formula for the hyperbolic tangent function,
The tanh function is a natural choice for combining relativistic velocities since it is limited to the interval . Indeed, using the rapidity defined by , we can easily combine collinear relativistic speeds using Equation (5). In order to use this for the combination of non-collinear relativistic 2-velocities, we replace each 2-velocity by the complex number
Here is the rapidity of the velocity , and gives the orientation of according to some observer in the plane defined by and . Giust, Vigoureux, and Lages then define the composition law ⊕ for coplanar velocities and by
where is the standard complex conjugate of V. By using instead of in Equations (6) and (7), we are actually dealing with the “relativistic half–velocities”, , (sometimes called the “symmetric velocities”), where
That is:
Using Equations (5) and (7) we can easily retrieve the real velocity from the half-velocity by using the ⊕ operator: . In terms of the half velocities
The ⊕ addition law is non-commutative, which is most easily seen by first setting , then , and finally observing that the ratio
is not equal to unity for non–zero , meaning that is non-zero.
The angle is in fact the Wigner angle , so an expression for this angle can be obtained by taking the real and imaginary parts of Equation (11):
This expression does not explicitly appear in Reference [15] though something functionally equivalent, in the form , appears in Reference [16].
The ⊕ law can be applied to any number of coplanar velocities by iteration:
Thus it would be desirable to cleanly extend this formalism to general three-dimensional velocities. Note that the order of composition is important, as we shall see in more detail below, the ⊕ operation is in general not associative.
3.2. General 3-Velocities
We now extend the result of Giust, Vigoureux, and Lages to arbitrary 3-velocities in three dimensions.
3.2.1. Algorithm
Suppose we have a velocity in the -plane, represented by the pure quaternion . Using the rules for quaternion multiplication, we can write this as . The term inside the brackets now looks very similar to what would be a natural extension of the exponential function to the quaternions, . To formalise this, we define the exponential of a quaternion by the power series
To calculate an explicit formula for Equation (14), we first consider the case of a pure quaternion . We know from Section 2.2 that for a pure quaternion we have , and so we find and so on. Thus, we can compute
Following the same procedure above, we find the exponential of a pure unit quaternion and real number to be
This nice result reflects the expression for the exponential of a complex number.
We can now extend this result to any arbitrary quaternion by noting that the real number a commutes with all the terms in , thereby allowing us to write , where has the same form as Equation (15). Explicitly,
The exponential of a quaternion possesses many of the same properties as the exponential of a complex number. Two particularly useful ones we use below are
Using these results, we are now justified in writing
for our velocity in the -plane.
Building on this result, we now find it appropriate to define the ⊕ operator for general 3-velocities, and , by the novel formula:
The usefulness of this novel definition is best understood by looking at a few examples.
3.2.2. Example: Parallel Velocities
We consider two parallel velocities and represented by the quaternions
respectively. Our composition law (20) then gives
which is equivalent to
and hence, also equivalent to the well–known result for the relativistic composition of two parallel velocities,
3.2.3. Example: Perpendicular Velocities in the x–y Plane
We now consider two perpendicular velocities in the x–y plane. By rotating around the z axis, without loss of generality they can be taken to be given by
where we have written and for brevity.
Our composition law then gives a combined velocity of
which is definitely not commutative. In contrast the norm is symmetric:
Here the are the “relativistic half–velocities” , so the full velocities are
and so give a final speed of
The non-quaternionic result for the composition of two perpendicular velocities is [22]
Thus, we find
And so our composition law ⊕ gives the standard result for the composition of two perpendicular velocities in the x–y plane.
3.2.4. Example: Perpendicular Velocities in General
For general perpendicular velocities and the easiest way of proceeding is to simply rotate to point along the x-axis and along the y-axis, and just copy the argument above. If one wishes to be more direct then simply define
In view of the mutual orthogonality of the vectors , , and , the unit quaternions obey exactly the same commutation relations as . Thence
This now leads to exactly the same results as above; there was no loss of generality inherent in working in the x–y plane.
3.2.5. Example: Reduction to Giust–Vigoureux–Lages Result in the x–y Plane
It is important to note that our composition law ⊕ reduces to the composition law of Giust, Vigoureux, and Lages [15] when dealing with planar velocities in the x–y plane. As above, we define general velocities in the ()-plane by , and , then, using our composition law (20), we find
But, noting that and , we can re-write this as
Now writing
we can cancel out the trailing , to obtain
This expression now only contains , so everything commutes, and we can write
which is equivalent to the result of Giust, Vigoureux, and Lages [15].
3.2.6. Example: Composition in General Directions
For general velocities and the easiest way of proceeding is to simply rotate to put and in the the x–y plane, and just copy the Giust–Vigoureux–Lages argument [15] above. If one wishes to be more direct then simply define
As long as is not parallel to , then is well defined and perpendicular to both and . With these definitions one can now write
Then, following the discussion above, we see
From this we can extract
Thence
This finally is a fully explicit result for general velocities and , which is manifestly in agreement with the Giust–Vigoureux–Lages results [15].
3.2.7. Uniqueness of the Composition Law
Finally, we might note that the expression for the composition law (20) is not unique. For example, by considering the power-series of , we can re-write Equation (20) as
But, as and are pure quaternions, both and are real numbers, and so commute with and . Thus,
Consequently we find that our composition law can also be written as
3.3. Calculating the Wigner Angle
In this section we obtain an expression for the Wigner angle for general 3-velocities using our composition law (20). Our calculations are obtained using the result that the Wigner angle is the angle between the velocities and . We first note
Thence, setting we explicitly verify
Now note that because it follows that is a unit norm quaternion. In fact it is related to the Wigner angle by
Then
But since for a product of quaternions this reduces to
Now
Let us define
Then setting so that we have:
Consequently the Wigner angle satisfies
Equivalently,
Taking the scalar and vectorial parts of Equation (56), we finally obtain
as an explicit expression for the Wigner angle .
The simplicity of Equation (57) compared to existing formulae for in the literature, shows how the composition law (20) can lead to much tidier and simpler formulae than other methods allowed for. This can be seen as the extension of the result (12) to more general velocities.
We can write Equation (57) in a perhaps more familiar (though possibly more tedious) form by first noting that from Equation (28) we have
and so
We can check two interesting cases of Equation (57) for when (parallel velocities) and when (perpendicular velocities). We can see directly that, for parallel velocities, the associated Wigner angle is given by , so that for ; whilst for perpendicular velocities, the associated Wigner angle is simply given by .
It is easiest to check our results against the literature using the somewhat messier Equation (59), in which case parallel velocities again give , whilst perpendicular velocities give
which agrees with the results given in Reference [22].
4. Combining Three 3-Velocities
Let us now see what happens when we relativistically combine 3 half-velocities.
We shall calculate, compare, and contrast with .
4.1. Combining 3 Half-Velocities:
Start from our key result
and iterate it to yield
It is now a matter of straightforward quaternionic algebra to check that
Ultimately we have the novel result
An alternative formulation starts from
which when iterated yields
Thence a little straightforward quaternionic algebra verifies that
Ultimately we have the novel result
4.2. Combining 3 Half-Velocities:
In contrast, the situation for is considerably more subtle. Start from the key result that
and iterate it to yield
The relevant quaternionic algebra is now a little trickier
To proceed we note that
Thence we have the novel result
4.3. Combining 3 Half-Velocities: (Non)-Associativity
From (64) and (68) for , and (73) for , it is clear that relativistic composition of velocities is in general not associative. (See for instance the discussion in References [28,29], commenting on Reference [29].) A sufficient condition for associativity, , is to enforce
That is, a sufficient condition for associativity is
But note and . Since , we then have . This now implies that these two sufficiency conditions are in fact identical; so a sufficient condition for associativity is
This sufficient condition for associativity can also be written as the vanishing of the vector triple product
Equivalently
4.4. Specific Non-Coplanar Example
As a final example of the power of the quaternion formalism, let us consider a specific intrinsically non-coplanar example. Let , , and be three mutually perpendicular half-velocities. (So this configuration does automatically satisfy the associativity condition discussed above.) Then we have already seen that:
Furthermore, since is perpendicular to , we have
and
A little algebra now yields the manifestly non-commutative result
In this particular case we can also explicitly show that
though (as discussed above) associativity fails in general.
5. Conclusions
Herein we have provided a simple, elegant, and novel algebraic method for combining special relativistic 3-velocities using quaternions:
The construction also leads to a simple, elegant, and novel formula for the Wigner angle:
in terms of which
All of the non-commutativity associated with non-collinearity of 3-velocities is automatically and rather efficiently dealt with by the quaternion algebra.
Author Contributions
Conceptualization, T.B. and M.V.; methodology, T.B. and M.V.; software, T.B. and M.V.; validation, T.B. and M.V.; formal analysis, T.B. and M.V.; resources, M.V.; writing—original draft preparation, T.B. and M.V.; writing—review and editing, T.B. and M.V.; supervision, M.V.; project administration, M.V.; funding acquisition, M.V. All authors have read and agreed to the published version of the manuscript.
Funding
T.B. was supported by a Victoria University of Wellington MSc scholarship, and was also indirectly supported by the Marsden Fund, via a grant administered by the Royal Society of New Zealand. M.V. was directly supported by the Marsden Fund, via a grant administered by the Royal Society of New Zealand.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Silberstein, L. Quaternionic form of relativity. Philisophical Mag. 1912, 23, 790–809. [Google Scholar] [CrossRef]
- Silberstein, L. The Theory of Relativity; Macmillan and Co.: London, UK, 1914. [Google Scholar]
- Silberstein, L. Available online: https://en.wikipedia.org/wiki/Ludwik_Silberstein (accessed on 8 December 2020).
- Dirac, P.A.M. Application of quaternions to Lorentz transformations. Proc. R. Ir. Acad. 1944, 50, 261–270. [Google Scholar]
- Rastall, P. Quaternions in Relativity. Rev. Mod. Phys. 1964, 36, 820. [Google Scholar] [CrossRef]
- Girard, P.R. The quaternion group and modern physics. Eur. J. Phys. 1984, 5, 25–32. [Google Scholar] [CrossRef]
- Ungar, A.A. The relativistic velocity composition paradox and the Thomas rotation. Found. Phys. 1989, 19, 1385–1396. [Google Scholar] [CrossRef]
- Mocanu, C.I. On the relativistic velocity composition paradox and the Thomas rotation. Found. Phys. Lett. 1992, 5, 443–456. [Google Scholar] [CrossRef]
- De Leo, S. Quaternions and Special Relativity. J. Math. Phys. 1996, 37, 2955–2968. [Google Scholar] [CrossRef]
- Friedman, Y. Physical Applications of Homogeneous Balls; Progress in Mathematical Physics; Springer: Birkhäuser, Basel, 2005; Volume 40. [Google Scholar] [CrossRef]
- Friedman, Y.; Semon, M.D. Relativistic acceleration of charged particles in uniform and mutually perpendicular electric and magnetic fields as viewed in the laboratory frame. Phys. Rev. E 2005, 72, 026603. [Google Scholar] [CrossRef]
- Greiter, M.; Schuricht, D. Imaginary in all directions: An Elegant formulation of special relativity and classical electrodynamics. Eur. J. Phys. 2003, 24, 397–401. [Google Scholar] [CrossRef][Green Version]
- Yefremov, A.P. Theory of relativity in quaternion spinors. Gravit. Cosmol. 2016, 22, 97–106. [Google Scholar] [CrossRef]
- Wigner, E. On unitary representations of the inhomogeneous Lorentz group. Ann. Math. 1939, 40, 149–204. [Google Scholar] [CrossRef]
- Giust, R.; Vigoureux, J.-M.; Lages, J. Generalized composition law from 2 × 2 matrices. Am. J. Phys. 2009, 77, 1068–1073. [Google Scholar] [CrossRef]
- Lages, J.; Giust, R.; Vigoureux, J.-M. Composition law for polarizers. Phys. Rev. 2008, 78, 033810. [Google Scholar] [CrossRef]
- Thomas, L.H. The motion of the spinning electron. Nature 1926, 117, 514. [Google Scholar] [CrossRef]
- Fisher, G.P. Thomas precession. Am. J. Phys. 1972, 40, 1772. [Google Scholar] [CrossRef]
- Ferraro, M.; Thibeault, R. Generic composition of boosts: An elementary derivation of the Wigner rotation. Eur. J. Phys. 1999, 20, 143. [Google Scholar] [CrossRef]
- Malykin, G.B. Thomas precession: Correct and incorrect solutions. Phys. Uspekhi 2006, 49, 837–853. [Google Scholar] [CrossRef]
- Ritus, V.I. On the difference between Wigner’s and Møller’s approaches to the description of Thomas precession. Phys. Uspekhi 2007, 50, 95–101. [Google Scholar] [CrossRef]
- O’Donnell, K.; Visser, M. Elementary analysis of the special relativistic combination of velocities, Wigner rotation, and Thomas precession. Eur. J. Phys. 2011, 32, 1033–1047. [Google Scholar] [CrossRef]
- Achilles, R.; Bonfiglioli, A. The early proofs of the theorem of Campbell, Baker, Hausdorff, and Dynkin. Arch. Hist. Exact Sci. 2012, 66, 295–358. [Google Scholar] [CrossRef]
- Goldberg, K. The formal power series for log(ex ey). Duke Math. J. 1956, 23, 13–21. [Google Scholar] [CrossRef]
- Van-Brunt, A.; Visser, M. Special-case closed form of the Baker-Campbell-Hausdorff formula. J. Phys. A 2015, 48, 225207. [Google Scholar] [CrossRef]
- Van-Brunt, A.; Visser, M. Simplifying the Reinsch algorithm for the Baker-Campbell-Hausdorff series. J. Math. Phys. 2016, 57, 023507. [Google Scholar] [CrossRef]
- Van-Brunt, A.; Visser, M. Explicit Baker-Campbell-Hausdorff Expansions. Mathematics 2018, 6, 135. [Google Scholar] [CrossRef]
- Ungar, A.A. Thomas precession: A kinematic effect of the algebra of Einstein’s velocity addition law. Comments on ‘Deriving relativistic momentum and energy: II. Three-dimensional case’. Eur. J. Phys. 2006, 27, L17. [Google Scholar] [CrossRef]
- Sonego, S.; Pin, M. Deriving relativistic momentum and energy: II. Three-dimensional case. Eur. J. Phys. 2005, 26, 851–856, Correction in 2006, 27, 685. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).