Abstract
Based upon the vector representation of complex numbers and the vector exponential function, we introduce the vector representation of characteristic functions and consider some of its elementary properties such as its polar representation and a vector power expansion.
Keywords:
complex number; imaginary number; vector representation; vector exponential function; characteristic function; Fourier transformation; polar representation; vector power expansion MSC:
60E10; 42A38
1. Introduction
Basic mathematical techniques in probability theory and statistics are associated with characteristic functions and complex numbers. Innovations in the latter areas are therefore also reflected in the former and are to be presented here.
In [], p. 24, Cramér says about the earliest origins of characteristic functions that “the first use of an analytical instrument substantially equivalent to the characteristic function seems to be due to Lagrange []” and that “similar functions where then systematically employed by Laplace in his great work []”. Expressions like also occur in []. Further, significant deeping and expansion, as well as the firm anchoring in modern probability theory, are due to Lévy [], Cramer [], Esseen [], Gnedenko and Kolmogoroff [], Ibragimov and Linnik [], Ramachandran [], Feller [], Lukacs [], Petrov [] and Bhattacharya and Ranga Rao [].
The characteristic function of a random variable X is usually defined as
where means mathematical expectation and i means the so-called imaginary unit, which is formally dealt with in the series like a constant. It is customary to define the quantity i by saying that it is not a real number but a “formal quantity” or “number” that satisfies the equation and assuming at the same time that it allows an interpretation as an element of the two-dimensional Gaussian number plane, which makes the range of values of the function appear pretty unclear. This long-standing apparent lack of mathematical rigor and some consequences resulting from this for characteristic functions will be addressed here.
To start right away, the vectors, or complex numbers, and are considered here as elements of the complex algebraic structure where V is a two-dimensional vector space, which is chosen here as for simplicity, ⊕ means usual vector addition and the product of two complex numbers and the th power of such number are accordingly explained as
while multiplication of vector z by scalar is denoted . The vector could be called an imaginary unit for historical reasons, but it has no imaginary character at all. The reader is encouraged to distinguish here and in the following carefully between the not really comprehensible symbol i and the well-defined vector Obviously, , and solves the quadratic vector equation
For more details about the complex algebraic structure and its non-classical generalizations, we refer to [,,,,,]. The rest of the paper is organized as follows. In Section 2, we consider vector powers and the vector exponential function. The vector representation of characteristic functions and further aspects concerning it as well as several examples are studied in Section 3. A final discussion in Section 4, which includes some historical remarks and an outlook on possible further work, closes this paper.
2. Vector-Valued Exponential Function
The vector-valued vector powers can be derived directly from the definition (1) or alternatively using binomial formulas as the following example implies:
Lemma 1.
The k-th power of z allows for the representation
Let us say that this representation starts in the term proceeds alternating in the lower and upper rows, respectively, and finally ends in the term , which is on the top row if k is an even number and on the bottom row if k is odd. The is plus if k admits one of the representations or , respectively, for an n in , and it is minus if or holds for an n in .
Proof.
By starting from
and making use of the equations
it follows that allows the claimed representation. □
Example 1.
The following equation shows that the multiplication of elements from the unit circle again results in elements from just there:
In the next step, the vector exponential function is defined as the convergent vector series
In particular, we have the Euler-type formula
Remark 1.
(a) Many more general exponential functions were introduced in [,,,] with reference to general functionals and a look forward to non-classically generalized complex algebraic structures (while exp refers to the Euclidean norm).
(b) Corresponding generalized trigonometric functions lead there to generalized Euler-type formulas.
(c) With the functional
defined for every and the product
introduced in [] instead of (1), for example, the p-generalized Euler formula
with
results where means the central projection of vector onto the -unit circle.
(d) Visualizations of the functions with and any fixed and the functions with and any function can help gain a deeper understanding of the vector-valued function .
3. Characteristic Functions
3.1. An Update
The characteristic function of a random variable X, , is formally completely correct defined by However, for the sake of convenience, we will write the multiplication of the scalar and the vector differently, such that
looks more similar to the classical definition. The characteristic function of the random variable is then
that is, the complex conjugate of or the x-axis mirrored version of If X has a finite k-th order moment, then its characteristic function has derivatives up to order k,
where, for
Taylor’s theorem now allows for an expansion of corresponding order for
where
and means Landau’s symbol. Thus, allows for the vector power expansion
starting in the term 1 and ending in the Landau-type term , which is on the top row if k is an even number and on the bottom row if k is odd. The following theorem is thus proved.
Theorem 1.
If X has a finite k-th order moment, then satisfies the following vector power expansion for ,
starting in the term 1 and ending in the Landau-type term , which is at the end of the expression in brackets in front of the vector if k is an even number and otherwise at the end of the expression in the other brackets.
The following corollary is an immediate conclusion from Theorem 1.
Corollary 1.
If X has a finite variance , then the polar representation of its characteristic function can be written for as
If X has a finite fourth-order moment, then the following refinement holds:
and
Now, let and be independent random variables. Then, as shown in [], the following vector multiplication formula applies:
The following remark is intended to stimulate further investigations.
Remark 2.
(a) One might think about introducing and studying the general notion of a functional -related characteristic function or Fourier transformation . In the particular case of discussed in Remark 1c, this would mean
(b) Many statements of asymptotic probability theory are proved using characteristic functions. Strengthening of certain proofs and detection of some additional aspects could be stimulated by the present work.
(c) Visualizations of various challenging issues related to complex numbers and functions are given in [,]. Different figures of certain characteristic functions can be found in [,].
(d) As long as the representation of a characteristic function makes use of the classic imaginary unit i, it is problematic or even unsuitable for visualizing this function, since one would not know what to use for i. The vector representations of characteristic functions given here, however, can be the basis of further visualizations. In addition, the following examples show that in some cases the dependence on the imaginary unit only seems to exist.
We now continue with some examples.
3.2. Normal Distribution
The characteristic function of a standard Gaussian distributed random variable X is
There does not appear to be anything new in this example: the imaginary part is zero and the characteristic function is real, one might think at first glance. However, complex numbers, that is, vectors from whose imaginary part is zero, are not real numbers but vectors having one zero component. For this reason, the set of real numbers is not a subset of the set of complex numbers, although the opposite is often claimed in the literature. The dimension distinguishes the real numbers from the complex numbers. Numerous physical facts are determined by the interaction of two or more quantities, even if one should have originally only been interested in a part of these variables. Complex numbers of dimensions two, three or higher, or their non-classical generalizations like those presented in [,,,,,], are then adequate description tools and could possibly even provide information about hidden variables.
The polar representation of this characteristic function is
3.3. Binomial Distribution
We recall that the characteristic function of a random variable X following the Binomial distribution with parameters is usually written as
Because it is not clearly said where i belongs to, it is unknown where function takes values. The following theorem, however, presents a well-interpretable vector-valued update of this formula.
Theorem 2.
The characteristic function of the random variable X satisfies the vector representation
Proof.
Let denote the probability that X attains the value .
It follows from (4) and Example 1 that
□
Remark 3.
(a) The polar representation of this characteristic function is
where
(b) Note that and that the inequality suggests to define an estimator for p from solving the equation .
(c) An alternative proof of this theorem makes use of Formula (5).
3.4. Poisson Distribution
We are looking now for the vector representation of the characteristic function of a random variable X following the Poisson distribution with parameter , which is usually written as
The unknown quantity i occurring here is replaced by the known two-dimensional vector in the following representation. This is made possible by the additional double use of the vector exponential function (3).
Theorem 3.
The characteristic function of X is
Proof.
This assertion follows with Example 1 from
□
3.5. Uniform Distribution
For real numbers , let X follow the uniform distribution on . Then, its characteristic function is known to attain the value and is usually written for non-zero t as:
This representation can apparently easily be converted to a vector representation if one ’interprets’ or ’substitutes’
Such a simple rewriting today, and in particular its admissibility, was historically obscure, arguably because of the incompletely defined character of i. One might prefer, therefore, the completely correct derivation of this vector representation starting from
The usual complex representation of using the unknown quantity i has thus been updated to a purely real representation as a two-dimensional vector.
3.6. Exponential Distribution
Something similar to the case of a uniform distribution applies to an exponentially distributed random variable X where
The corresponding polar representation is
3.7. Gamma Distribution
We now assume that X has a Gamma distribution with parameters and , Then its characteristic function is
where, according to Lemma 1,
This purely real two-dimensional vector representation updates the usual complex one which makes use of the unknown quantity i,
As in the examples above, Gauss’s [] interpretation of complex numbers as points in the two-dimensional plane can be seen particularly well here,
where “” η means “interpret as ”. Beginning with the work in [], this status of interpretation was transformed into the status of an axiom, and at the same time, the equation , of which it is not said by which quantity i and which operation of squaring it can be fulfilled, was replaced with Equation (2). The point of view of a purely formal handling of the unexplained (imaginary or mystical) quantity i is thus not further pursued here.
4. Discussion
Fourier transformations are among the most commonly used mathematical methods. With respect to probability theory, a comprehensive theory of characteristic functions is based upon the further development of some of the examples considered here. We have just started this journey here and have not followed it far, but the reader is invited to follow this thought further. Our approach is based upon a new understanding of the role that the so-called imaginary unit plays as a vector from a space of dimension two. For the cases of two imaginary units in a three-dimensional space or such units in a space of dimension k, reference is made to the papers [,].
For a closely related discussion of the question of which rich variety of quadratic vector equations can be solved by the known formulas, we refer to [,].
In [], p. 14, the role played by imaginary numbers is described as follows: “In algebra, when trying to find a formula for the solution of the equation of third degree, one came up with the initially meaningless expression . But if you calculated with it as you are used to with the usual square roots, for example, or , something sensible always came out. This strengthens the belief in the right to exist of this structure, for which the designation i has meanwhile become common. But it was almost 300 years before Gauss showed that what had been achieved so far can reasonably be interpreted as an extension of the range of real numbers in which there is a new number whose square is ”. In this regard, it is said in [] that “the role that Euler assigns to accountability shows his algorithmic-analytical thinking and structural understanding because Euler did not have a geometric illustration of imaginary quantities at his disposal…” However, there is an unsatisfactory inaccuracy which, from a historical perspective, was not eliminated immediately after vector calculus was established in [,]. Regarding a further discussion of early statements on this, see []. Even if, as in [,], a complex structure is initially introduced in a seemingly correct way, later it happens in these and several other publications that is visibly or indirectly equated in one way or another with the real number , although, obviously, this square equals the vector .
A question that has probably not yet been dealt with comprehensively in the history of science is that authors on probability-theoretical questions such as in [,,,,,,,,,] do not refer to the mentioned gap in mathematical rigor and say nothing about the range of values of a characteristic function. Significantly, in [], the characteristic function is given contradictingly on page 94 as and on page 104 as .
“How to choose a good research problem?” is the title of the article []. The answer to this question can be very diverse. Originally in part intended just as a didactic self-study for a pensioner, the question of the completely exact treatment of complex numbers has mutated into the development of a great variety of new, non-classically generalized complex numbers and has found expression in [,,,,,]. It also discusses, in particular, the aspects of feasibility and importance with regard to the choice of problem. While the question of feasibility was clarified through the development of the common basic geometric idea behind the papers [,,,,,], the answer to the question of the meaning will hopefully be completed in the future with more practical applications and such as here.
We close this paper with an outlook on another possible research question. Let be a positively homogeneous and bounded functional such that the disc is star-shaped with respect to the additive neutral element , ⊙ a vector-valued vector product generated by this functional according to Definition 1 in [], the boundary of B or unit circle, the central projection of vector onto S and
If further
denote generalized trigonometric functions with respect to the functional then it is known from Corollary 1 in [] that
How useful is this generalized Euler formula when signals resemble generalized trigonometric functions rather than classical ones and the idea of a generalized Fourier transformation suggests itself? If, in particular, , and
then is multiplicative neutral, satisfies the equation and with
and the elliptic Euler-type formula reads
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Conflicts of Interest
The author declares no conflict of interest.
References
- Cramér, H. Random Variables and Probability Distributions; University Printing House: Cambridge, UK, 1937. [Google Scholar]
- Lagrange, J.L. Mémoire sur l’utilité de la méthode de prende le milieu entre les résultats de plusieurs observations. Misc. Taur. 1770–1773, 5, 167–232. [Google Scholar]
- Laplace, P.S. Théorie Analytique des Probabilités; Encyclopedia Universalis: Paris, France, 1812. [Google Scholar]
- Tchebychef, P.L. Note sur la convergence de la série de Taylor. Crelle J. Die Reine Angew. Math. B 1844, 28, 279–283. [Google Scholar]
- Lévy, P. Calcul des Probabilités; Gauthier-Villars: Paris, France, 1925. [Google Scholar]
- Esseen, K.G. Fourier analysis of distributions. Acta Math. 1945, 77, 1–125. [Google Scholar] [CrossRef]
- Gnedenko, B.V.; Kolmogorov, A.N. Limit Distributions for Sums of Independent Random Variables; Addison-Wesley Mathematics Series; Addison-Wesley: Cambridge, MA, USA, 1954. [Google Scholar]
- Ibragimov, I.A.; Linnik, Y.V. Independent and Stationary Connected Variables; Nauka: Moscow, Russia, 1965. (In Russian) [Google Scholar]
- Ramachandran, B. Advanced Theory of Characteristic Functions; Statistical Publishing Society: Calcutta, India, 1967. [Google Scholar]
- Feller, W. An Introduction to Probability Theory and Its Applications; Wiley: Hoboken, NJ, USA, 1970; Volume I–II. [Google Scholar]
- Lukacs, E. Characteristic Functions; Charles Griffin and Company: London, UK, 1970. [Google Scholar]
- Petrov, V.V. Sums of Independent Random Variables; Springer: Berlin/Heidelberg, Germany, 1972. (In Russian) [Google Scholar]
- Bhattacharya, R.N.; Ranga Rao, R. Normal Approximation and Asymptotic Expansions; John Wiley Sons: Hoboken, NJ, USA, 1975. [Google Scholar]
- Richter, W.-D. On lp-complex numbers. Symmetry 2020, 12, 877. [Google Scholar] [CrossRef]
- Richter, W.-D. Three-complex numbers and related algebraic structures. Symmetry 2021, 13, 342. [Google Scholar] [CrossRef]
- Richter, W.-D. Complex numbers related to semi-antinorms, ellipses or matrix homogeneous functionals. Axioms 2021, 10, 340. [Google Scholar] [CrossRef]
- Richter, W.-D. On complex numbers in higher dimensions. Axioms 2022, 11, 22. [Google Scholar] [CrossRef]
- Richter, W.-D. On hyperbolic complex numbers. Appl. Sci. 2022, 12, 5844. [Google Scholar] [CrossRef]
- Richter, W.-D. Deterministic and random generalized complex numbers related to a class of positively homogeneous functionals. Axioms 2023, 12, 60. [Google Scholar] [CrossRef]
- Needham, T. Visual Complex Analysis; Oxford University Press: New York, NY, USA, 1997. [Google Scholar]
- Wegert, E. Visual Complex Functions. An Introduction with Phase Portraits; Springer: Basel, Switzerland, 2012. [Google Scholar]
- Sasvári, Z. Multivariate Characteristic and Correlation Functions; De Gruyter: Berlin, Germany, 2013. [Google Scholar]
- Witkovský, V. Numerical inversion of a characteristic function. Acta Imeco 2016, 5, 32–44. [Google Scholar]
- Gauss, C.F. Theoria Residuorum Biquadraticorum: Commentatio Secunda; Werke 2; Chelsea Publishing Company: New York, NY, USA, 1965; pp. 93–148. [Google Scholar]
- Gellert, W.; Küstner, H.; Hellwich, M.; Kästner, H. Kleine Enzyklopädie Mathematik; VEB Bibliographisches Institut: Leipzig, Germany, 1967. [Google Scholar]
- Thiele, R. Leonhard Euler. 15.April 1707-18. September 1783. Zur Erinnerung an seinen 300. Geburtstag. Mitteilungen Dtsch. Math. Ver. 2007, 15, 93–103. [Google Scholar]
- Grassmann, G. Die Lineale Ausdehnungslehre ein neuer Zweig der Mathematik; Cambridge University Press: Cambridge, UK, 1844. [Google Scholar]
- Hamilton, W.R. On quaternions. Proc. R. Ir. Acad. 1847, 3, 89–92. [Google Scholar]
- Walz, G. Lexikon der Mathematik; Spektrum Akademischer Verlag: Heidelberg/Berlin, Germany, 2001. [Google Scholar]
- Bremaud, P. An Introduction to Probabilistic Modeling; Springer: New York, NY, USA, 1988. [Google Scholar]
- Along, U. Wie wählt man ein gutes Forschungsproblem? Mitteilungen Dtsch. Math. Ver. 2010, 18, 160–163. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).