1. Introduction
The problem of obtaining regions of the complex plane containing all eigenvalues is an old subject that still leads to an important and active research topic (cf. [
1,
2,
3,
4]). If the matrix satisfies special properties, the eigenvalue localization problem has also special properties. For instance, if the matrix is symmetric, then all its eigenvalues are real. In this paper, we consider the eigenvalue localization problem for symmetric positive Toeplitz matrices, being aware that a huge body of literature exists starting from the early years of the past century, when Toeplitz matrices began to be associated to a generating function; see [
5,
6,
7,
8,
9,
10,
11,
12] and the references therein.
Toeplitz matrices arise in many applications of different fields, such as signal processing, physics, differential and integral equations or statistics (see [
9,
10] and references therein). Our approach to study the eigenvalue localization problem for these matrices is based on classical Gerschgorin circles as well as on an alternative approach presented in [
13,
14]. In contrast to Gerschgorin circles, which are sharper when the matrix is close to a diagonally dominant matrix, this alternative approach has a complementary behavior, in the sense that it is sharper when the matrix is far from a diagonally dominant matrix. In fact, it is sharper when the off-diagonal entries have close sizes.
Another topic considered in this paper is total positivity. This is also a field with more than a century of history and that is now receiving a lot of attention, leading to a very active research area. Totally positive matrices have many application to combinatorics, economics, approximation theory, finances, differential equations, statistics, mechanics, quantum groups, computer-aided geometric design, Markov chains, and biomathematics (cf. [
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]). Moreover, the relationship of Toeplitz matrices with total positivity is also an old and important subject, where this paper also presents some new results.
The paper is organized in the following way.
Section 2 presents the main classes of matrices used in this paper. In addition to the classes mentioned above, circulant matrices are defined. They form a subclass of Toeplitz matrices.
Section 3 recalls some inclusion and exclusion eigenvalues regions obtained with the approaches commented previously, and it particularizes them for the case of symmetric positive matrices. In
Section 4, we present an inclusion interval for the eigenvalues of these matrices and, under an additional hypothesis, it also provides two disjoint intervals contained in the previous one and containing all the eigenvalues. As we show with some examples, these two intervals are necessary, and they can provide sharp information on the localization of the eigenvalues. Some sufficient conditions for the positive definiteness of these matrices are also provided.
Section 5 discusses the relationship of Toeplitz matrices with total positivity and presents some necessary and sufficient conditions of total positivity for these matrices. It also characterizes when a symmetric circulant matrix is totally positive. Another related commented field is that of obtaining algorithms with high relative accuracy for linear algebra problems. In this field, totally positive matrices and Toeplitz matrices have a different behavior in general, although there are connections in some tridiagonal Toeplitz matrices.
Section 6 revisits another interval for the eigenvalues localization by using generating functions of Toeplitz matrices and their powerful associated tools. Finally,
Section 7 presents the main conclusions of the paper.
2. Some Classes of Matrices
This section is devoted to introducing some of the classes of matrices that will be used in this paper and to recall some of their properties, in particular, their eigenvalues.
In this paper, we say that a matrix is positive (respectively, non-negative) if (respectively, ) for all . We shall consider symmetric matrices whose eigenvalues are all real, and positive definite matrices whose eigenvalues are all positive.
A diagonal matrix with equal diagonal entries is a
scalar matrix. A matrix where all the entries are the same number is called a
constant matrix. A constant matrix with all entries equal to zero is called the null matrix. An
Toeplitz matrix is a real matrix such that all its diagonals are constant. These matrices can be defined through a sequence of
real numbers
with
which are called the
parameters associated to the Toeplitz matrix
A. Thus, a Toeplitz matrix
A with parameters
is of the form
If the Toeplitz matrix is symmetric, then it can be defined by (
1) for
, and it only uses the
n parameters
. Toeplitz matrices arise in many applications: in signal processing, in physics, in solutions to differential and integral equations, and in statistics.
A common special case of Toeplitz matrices is formed by the
circulant matrices, where every row of the matrix is a right cyclic shift of the row above: the parameters satisfy
for all
. Eigenvalues and eigenvectors of circulant matrices are known (cf. [
26]). For symmetric circulant matrices, we have the extra condition
for all
i. Thus, they have the form
In addition to Toeplitz matrices, we consider in this paper another important class of matrices. A real matrix is
totally positive (TP) if all its minors are non-negative (see [
21,
24,
25]). These matrices are also called totally non-negative matrices, and they arise in many fields such as combinatorics, economics, approximation theory, finances, differential equations, statistics, mechanics, quantum groups, computer-aided geometric design, Markov chains, and biomathematics (cf. [
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]). By Corollary 6.6 of [
25], all eigenvalues of a TP matrix are non-negative numbers. When the matrix has all its minors as strictly positive, it is called
strictly totally positive (STP). By Theorem 6.2 of [
25], a STP matrix has all its eigenvalues as positive and distinct.
Finally, we present a weaker property than total positivity. A real matrix is TP
2 if all its
submatrices have a non-negative determinant. Many properties and applications of TP
2 matrices can be seen in [
21].
3. Eigenvalue Localization of Symmetric Positive Matrices
Given a real matrix
, several eigenvalue regions are obtained. First, let us mention Gerschgorin circles. For each
, we denote by
the
ith row Gerschgorin circle. Analogously, the column Gerschgorin circles can be defined. By Gerschgorin’s theorem, all eigenvalues belong to the union of all row Gerschgorin circles, and also to the union of all column Gerschgorin circles. An excellent book on this and related topics is [
4].
For the real parts of the eigenvalues, alternative inclusion regions were obtained in [
13]. Let us now recall them. For this purpose, let us define for each
and
Analogously, we define for each
and
Number
leads (see Proposition 3.3 of [
13]) to a characterization of the class of
-matrices, which is a class of matrices used in [
13] to obtain eigenvalue localization regions known as
-intervals and that will be recalled in the next result. Let us mention that some extensions of
-matrices to tensors were obtained in [
27,
28,
29,
30].
By Theorem 3.5 (i) of [
13] (applied to
A and
), we have the following inclusion regions for the real eigenvalues of a real matrix
A.
Theorem 1. Let be a real matrix, and let be as in (3) and (5). If λ is a real eigenvalue of A, then and .
We shall call the intervals of the first set of Theorem 1 row -intervals. When we use of (5) instead of (3), we call the corresponding intervals column -intervals.
Moreover,
-intervals can be also used to localize the real parts of all eigenvalues of a real matrix as Theorem 4.3 (i) of [
13] shows.
Theorem 2. Let be a real matrix, and let be as in (3) and (5). If λ is an eigenvalue of A, then , where for each Since we are now interested in the localization of the eigenvalues of a symmetric positive matrix
A, the row Gerschgorin circles reduce to the following intervals containing all eigenvalues of
A: for each
, the interval
As for the column Gerschgorin circles, for each
, the interval
The next result simplifies the previous notations when the matrix is symmetric positive and shows that, in this case, each of the previous row and column Gerschgorin intervals can only differ from the corresponding -interval in the left endpoints.
Proposition 1. If is a symmetric positive matrix and are given in (3)–(6), then the eigenvalues of A belong to the intersection of the following two sets: the union of row Gerschgorin intervals and the union of row -intervals. For each , , , and the right endpoint of each row (respectively, column) Gerschgorin interval (7) (respectively, (8)) coincides with the right endpoint in the corresponding row (respectively, column) -interval. Proof. Since
A is symmetric, the row
-intervals and row Gerschgorin intervals coincide with the corresponding column
-intervals and column Gerschgorin intervals, and the first part of the result is a consequence of Gerschgorin’s theorem and of Theorem 1. Observe that, since
A is positive, by (3) and (
4),
and
for all
, and, by the symmetry of the matrix, we also have
and
for all
. The second part of this proposition follows from Theorem 1 and the properties
for all
. □
Until now, we only considered inclusion regions for the eigenvalues. We now recall an exclusion region for the real eigenvalues of a real matrix derived in [
14]. Analogously to the class of
-matrices used for the
-intervals, in this case, we use the class of
-matrices. The extension of
-matrices to tensors was obtained in [
31]. We first need some auxiliary notations. Given a real matrix
, for each
, we define
Proposition 2.6 of [
14] provides the following exclusion interval for the real eigenvalues of a real matrix.
Theorem 3. Let be a real matrix, and let be as in (9). If λ is a real eigenvalue of A, then The interval of (
10) is called the
row exclusion interval.
Using instead of A, we can obtain an exclusion column interval, and a result analogous to Theorem 3 for the exclusion column interval also holds.
The following result simplifies the previous notations and the row exclusion interval when the matrix is symmetric positive.
Proposition 2. If is a symmetric positive matrix and are given in (9), then, for each , and , and the eigenvalues of A cannot belong to the interval Proof. Observe that since
A is positive, by (
9),
and
for all
. The second part of this proposition follows from Theorem 3 and the properties
for all
. □
The next section considers the special case of symmetric positive Toeplitz matrices.
4. Inclusion Intervals for Symmetric Positive Toeplitz Matrices
Let us consider a symmetric positive Toeplitz matrix
A with parameters
. We shall use the following notations:
and the following notations for the minimal and maximal row sums of
A:
The following result first provides a simple eigenvalue inclusion interval for all eigenvalues of a symmetric positive Toeplitz matrix, and, under an additional condition, it also provides two disjoint intervals contained in the previous one and containing all the eigenvalues. As we shall see in Remark 1 for some examples, these two intervals are necessary, and they can provide sharp information on the localization of the eigenvalues.
Theorem 4. Let be a symmetric positive Toeplitz matrix with parameters and let be given by (12) and (13). Then, all eigenvalues of A belong to the interval Moreover, if in addition , then all eigenvalues of A belong to the following union of two disjoint intervals: Proof. By Proposition 1, all eigenvalues of A belong to the intersection of the union of row Gerschgorin intervals and the union of row -intervals, and the right endpoint of each row -interval coincides with the the right endpoint of the corresponding row Gerschgorin interval. Since A is positive, right endpoints of all row Gerschgorin intervals are given by the corresponding row sums, and so they are bounded above by . Again by the positivity of A, all left endpoints of all row Gerschgorin intervals are given by minus the sum of the entries of the corresponding row, and so they are bounded below by . By Theorem 1 and the positivity of A, all left endpoints of all row -intervals are given by the correspond row sum minus and since the row sum is bounded below by and for all , we conclude that all left endpoints of all row -intervals are bounded below by .
Let us now prove the second part of the result so that we also assume that
. If
is an eigenvalue of
A, then by Proposition 2,
Since
for all
,
for all
. Therefore,
By our hypothesis, the exclusion interval of (
17) is nonempty and the result follows. □
Observe that symmetric positive circulant matrices obviously satisfy all hypotheses of Theorem 4. In particular,
because, for these matrices,
. The next remark will show examples of these matrices for which both disjoint intervals of (
15) are necessary and, in fact, they provide exactly the eigenvalues of the matrices. In this sense, Theorem 4 cannot be improved.
Remark 1. Let us consider the () symmetric Toeplitz matrix A with n positive parameters , that is, Clearly, this matrix has the eigenvalues associated to the eigenvector and the eigenvalue with multiplicity . A satisfies all hypotheses of Theorem 4. Its two disjoint intervals of (15) areandwhich exactly provide the two eigenvalues of A. As a consequence of the first part of Theorem 4, we can also obtain a sufficient condition for the positive definiteness of a symmetric positive Toeplitz matrix.
Corollary 1. Let be a symmetric positive Toeplitz matrix and let be given by (12) and (13). If A satisfies thatthen A is positive definite. Proof. By Theorem 4, all eigenvalues are greater than or equal to the left endpoint of the interval (
14), which is positive by our hypothesis. □
Let us illustrate the application of Corollary 1 with two extreme matrices.
Example 1. First, let us consider the () symmetric Toeplitz matrix B with n positive parameters with . In this case, the maximum of Corollary 1 is . The second example is given by the () symmetric Toeplitz matrix A with n positive parameters with . In this case, the maximum of Corollary 1 is .
The previous examples are only illustrative. In fact, since the matrices are circulant, the eigenvalues and the eigenvectors are explicitly known. The following example shows a family of matrices depending on
for which the two intervals of (
15) converge to the two eigenvalues of the limit matrix of this family as
.
Example 2. Let us consider the family of symmetric Toeplitz matrices with n positive parameters with , . As , matrices tend to the matrix with all entries 1, whose eigernvalues are 0 and n. Observe that, for matrices , σ and Σ given by (13) satisfy , which is an eigenvalue for all matrices associated to the eigenvector . So, the second interval of Theorem 4 reduces to this eigenvalue. On the other hand, one can check for all matrices that and so the first interval of Theorem 4 iswhich tends to (the other eigenvalue of the limit matrix) as . The next section considers TP Toeplitz matrices, which have non-negative eigenvalues as recalled in the Introduction.
5. Total Positivity and Toeplitz Matrices
Definitions and some properties of TP matrices are recalled in
Section 2. There are old and deep connections between Toeplitz matrices and total positivity.
Let us start by recalling the connection with the study of generating functions for TP infinite Toeplitz matrices (see Section 7 III of [
25] and Section 4.7 of [
24]). Given a bi-infinite sequence
the matrix
is called its
associated Toeplitz matrix, and the function
its
generating function. The study and characterization of the total positivity Toeplitz matrix in terms of the generating functions was performed in [
32,
33,
34,
35,
36]. We now recall the particular case of a one-sided infinite sequence
where we suppose that
for
in the previous definition. Assuming the normalization
, the previous sequence gives rise to a TP Toeplitz matrix
if and only if its corresponding generating function
has the form
where
, and
. Sequences leading to TP Toeplitz matrices were recalled in Chapter VIII of [
21]
Polya frequency sequences (PF-sequences), where many examples were presented. Examples of PF-sequences arising in combinatorics can be seen in [
37]. Let us also recall that results concerning the total positivity of the Hadamard product of TP Toeplitz matrices can be seen in [
38].
Now we come back to finite symmetric positive Toeplitz matrices, and we shall present necessary and sufficient conditions related to total positivity. We first give a necessary condition for TP
2 in a very simple way: the parameters form a monotone decreasing sequence. If a stronger condition is required, then we obtain a sufficient condition for strict total positivity. This last sufficient condition uses a corresponding condition obtained in [
39], which slightly refined a previous condition of [
40] (see Corollary 2.17 of [
24]).
Proposition 3. Let A be a symmetric positive Toeplitz matrix with parameters . If A is TP2, then . If and for all , then A is STP.
Proof. Let us first assume that A is TP2. If we consider the submatrix formed by rows 1, 2 and columns 1, 2, we obtain and so the positivity of the parameters implies . With the submatrix formed by rows 1, 2 and columns 2, 3, we obtain and so the positivity of the parameters implies . Continuing in an analogous way, we obtain .
For the second part of the result, let us assume that
and
for all
. By applying Corollary 2.17 of [
24] to
A, for its strict total positivity, it is sufficient to prove that
, which holds by our hypothesis, and that
for all
, which also holds by our hypothesis. □
A symmetric positive Toeplitz matrix is TP if and only if its associated parameters satisfy . Let us now consider symmetric circulant matrices of the form (5). In this case, () TP symmetric circulant matrices are characterized in the following result.
Theorem 5. An () symmetric circulant matrix A with parameters is TP if and only if it is either a non-negative constant matrix or a scalar matrix with a positive diagonal.
Proof. Trivially, any non-negative constant matrix is TP and any scalar matrix with positive diagonal is TP. Let us see the converse, and so we assume that A is a symmetric TP circulant matrix.
Since A is a circulant, it is in particular a Toeplitz matrix, and, since it is TP, it is also TP2. If it is positive, then by Proposition 3, the parameters satisfy , which also holds when A is the null matrix. Since A is symmetric circulant, it also satisfies that for all i. Then, we have and so, . The non-negative determinant of the submatrix formed by rows 2, 3 and columns 1, 2 is given by , which implies that either (that is, A is a non-negative constant matrix) or (that is, A is a scalar matrix with a positive diagonal). □
We finish this section by commenting on the problem of finding algorithms with high relative accuracy (cf. [
41,
42,
43]. For some subclasses of TP matrices (cf. [
44,
45,
46,
47]), algorithms with high relative accuracy have been found for solving linear algebra problems, such as the computation of the inverse, of all singular values, of all eigenvalues, and the solution of some linear systems. This can be performed by starting from an accurate bidiagonal decomposition of a TP matrix and using the algorithms presented in [
48] (and in [
49] for the inverse). In contrast, accurate linear algebra for the problem of calculating determinants or minors is impossible on the class of Toeplitz matrices (see corollaries 3.43 and 3.45 of [
41]). However, for special classes of Toeplitz matrices, such as some tridagonal Toeplitz matrices, algorithms with high relative accuracy for some linear algebra problems were obtained in [
50].
6. An Interval Using Generating Functions of Toeplitz Matrices
We already commented in
Section 2 that eigenvalues and eigenvectors of circulant matrices are known. In fact, many spectral properties of Toeplitz matrices associated to a generating function (see
Section 5) are known. Following p. 95 of [
11], a Toeplitz matrix
A given by (
1) with parameters
can be written as
where
is the
matrix whose
th entry equals 1 if
and 0 otherwise.
Given a function
belonging to
, its Fourier coefficients are given by
where
.
The
nth Toeplitz matrix associated with
f is defined as
The sequence of Toeplitz matrices
is called the Toeplitz sequence generated by
f, and
f is called the generating function of
. The generating function can be periodically extended on the real line and, after the change of variable
, is the generating function given in
Section 5. As shown in Section 6.2 of [
11], if
f is real-valued, then
is Hermitian for every
n, and all its eigenvalues are real.
As shown in Theorem 6.1 of [
11], taking
as the essential infimum of
f and
as its essential supremum, then the interval
contains all eigenvalues of
for all
n. Furthermore, if
is the maximal order of the zeros of
and
is the maximal order of the zeros of
, then the minimal eigenvalue of
converges monotonically from above to
with a convergence speed asymptotic to
, and the maximal eigenvalue of
converges monotonically from below to
with a convergence speed asymptotic to
; see [
6,
11,
12] and the references therein.
These results can be applied to specific examples of real symmetric Toeplitz matrices with non-negative entries and arising in concrete applications. For instance, if one takes
with stencil
(and with all the other Fourier coefficients equal to zero), then all the eigenvalues of
belong to
, and the minimal eigenvalue is equal to
and the maximal eigenvalue is equal to
, with
and where the value of
can be explicitly identified; see [
6,
7] and the references therein. This example comes from applications because
represents a canonical Finite Difference discretization of
on
with equispaced grid and proper homogeneous boundary conditions. Since
has zero entries for
, we cannot use our results from the previous sections. However, for
, it is a positive Toeplitz matrix and the intervals obtained in (
15) are
.
In general, if one considers
, with
k as a positive integer, then one ends up again with real symmetric Toeplitz matrices with non-negative entries and discretizing
on
with equispaced grid and proper homogeneous boundary conditions, via canonical Finite Differences. The same very beautiful localization and asymptotic results can be obtained. These results go beyond the case of real Fourier coefficients, beyond the Hermitian setting, and beyond the case of zeros of integer orders like in the approximation of fractional operators; see [
5] and the references therein including [
6,
7]. And for the banded case, also asymptotic expansions of the eigenvalues can be found; see [
8].
7. Conclusions
This paper studies the eigenvalue localization for a symmetric positive Toeplitz matrix. An inclusion interval for its eigenvalues is provided. With one additional restriction, we also give two disjoint intervals contained in the previous one and containing all the eigenvalues. We include examples showing that these two intervals are necessary, and they can provide sharp information on the localization of the eigenvalues. In fact, they can even provide exactly the eigenvalues of the matrices. Sufficient conditions to check if a given symmetric positive Toeplitz matrix is positive definite are provided. Illustrative examples are included.
The relationship between TP matrices and Toeplitz matrices is also analyzed. A necessary condition for the total positivity of a symmetric positive Toeplitz matrix in terms of its parameters is obtained. A sufficient condition for its strict total positivity is also presented. A characterization of symmetric totally positive circulant matrices is also obtained. They are formed by non-negative constant matrices and scalar matrices with positive diagonal.
Finally, the tools associated with generating functions and mentioned in
Section 6 and the references therein are very general, and also provide an eigenvalue inclusion interval and apply to any kind of real symmetric (and also complex Hermitian, or non-Hermitian) Toeplitz matrices.