1. Introduction
This paper addresses two related linear algebra-based graph theory problems involving a binary 0-1 symmetric adjacency matrix, say C, representing connected undirected planar graphs with no self-loops, relevant to spatial statistics/econometrics and generalizable to other, especially applied, matrix and linear algebra situations. Both pertain to the inertia of a matrix (i.e., its number of negative, zero, and positive—respectively, n−, n0, and n+—eigenvalues, say λi(C), i = 1, 2, … , n; e.g., see [1,2]; for a “matrix inertia theory and its applications” exposition, see Chapter 13 in Lancaster and Tismenetsky [3]). Its novelty is twofold within the context of planar graph theory: it contributes knowledge to help fill an existing gap in graph adjacency matrix nullity―the multiplicity of the zero eigenvalue in a graph adjacency matrix spectrum―theory (e.g., [4,5,6,7,8]); and, it adds to the scant literature published to date about matrix inertia (e.g., [9]). The first problem, which goes beyond the well-known solutions for simply calculating zero eigenvalues, concerns enumeration of the linearly dependent adjacency matrix column/row subsets affiliated with these zero eigenvalues (e.g., see [10]), a single challenge here as a result of symmetry. The second problem concerns approximation of the real eigenvalues (e.g., see [11]) of these adjacency matrices, particularly their row-standardized version, say W, a linearly transformed version of a Laplacian matrix that enjoys extremely popular usage in spatial statistics/econometrics applications and conceptualization nomenclature.
With regard to the first dilemma, discussion posted on the internet (see, for example: https://www.mathworks.com/matlabcentral/answers/574543-algorithm-to-extract-linearly-dependent-columns-in-a-matrix (accessed on 24 October 2023); https://stackoverflow.com/questions/28816627/how-to-find-linearly-independent-rows-from-a-matrix (accessed on 24 October 2023); and, https://stackoverflow.com/questions/28928893/linear-dependent-rows-huge-sparse-matrix (accessed on 24 October 2023)) suggests inadequate understanding about identifying a complete suite of (most likely non-unique) linearly dependent p-tuple, p ≥ 2, subsets of adjacency (or other square) matrix columns/rows (a finite collection of matrix columns/rows in the same vector space is linearly dependent if some non-zero scalar multiples of them sum to the zero vector; for the simplest 2-tuple case and a graph theoretic adjacency matrix, identical pairs are easily identified by a brute force non-zero element-by-element comparison algorithm), even if the number of zero eigenvalues is known or calculated. The proposed solution here exploits statistical linear regression estimation theory. Meanwhile, the second difficulty spotlights the desire for a refinement of an approximation offered by Griffith and Luhanga [12], paralleling the regular square lattice (e.g., see [13]) solution by Griffith [14]. Partly stimulating this inclination is the increasing availability of readily accessible, high quality, fine geographic resolution data for national landscapes partitioned into irregular tessellations, such as census tract or block group polygons for the United States (US), and dissemination area and block polygons for Canada (see https://www.census.gov/geographies/reference-files/time-series/geo/tallies.html (accessed on 24 October 2023) and https://www12.statcan.gc.ca/census-recensement/2021/geo/sip-pis/boundary-limites/index2021-eng.cfm?Year=21 (accessed on 15 June 2022)), to name some of the germane countries, whose polygon counts are in the tens-of-thousands to millions; e.g., 84,414 census tracts and 8,132,968 block groups for the US in 2020, and 57,932 dissemination areas and 498,547 blocks for Canada in 2021. In other words, a substantial applied mathematics demand already exists for solving these two problems.
2. Background to the Pair of Problems
Let Gn = (V, E) be a simple undirected connected graph with n vertices V = {v1, … , vn}, and m ≤ n(n − 1)/2 edges E = {eij linking vertices vi and vj: i = 1, 2, … , n and j = 1, 2, … , n; i ≠ j due to the absence of self-loops}. Often in spatial statistics/econometrics, Gn is both complete and planar, and hence m ≤ 3(n − 2); sometimes Gn is near-planar, with m ≤ 8n. Several well-known properties of the sparse adjacency matrices for these graphs pertaining to both their zero and non-zero eigenvalues include:
- (1) the number of zero eigenvalues counts the linearly dependent column/row subsets (e.g., [15]);
- (2) the interval [MIN{nj|i/ni}, MAX{nj|i/ni}] (e.g., [16,17]) contains the principal eigenvalue (i.e., spectral radius) of the graph Gn adjacency matrix C, where ni is the ith row sum, and nj|i is the sum of vertex i’s linked row sums nj;
- (3) adding a link to graph Gn strictly increases its adjacency matrix C principal eigenvalue (e.g., [18]), whereas removing a link decreases the magnitude of this eigenvalue (e.g., [19]);
- (4) the principal eigenvalue of matrix W is one (via the Perron-Frobenius theorem (e.g., [20]));
- (5) the extreme maximum eigenvalue of matrix C, and the extreme negative eigenvalue of matrix W, for graph Gn compute very quickly [21];
- (6) the variance of the eigenvalues of matrix C is 1TC1/n, and of matrix W is 1TD−1CD−11/n, where 1 is an n-by-1 vector of ones, superscript T denotes the matrix transpose operator, and D is a diagonal matrix whose (i, i) cell entry is ni [21] (see the numerical sketch following this list);
- (7) the sum of the positive eigenvalues equals minus the sum of the negative eigenvalues [all diagonal entries of matrix C are zero in the absence of self-loops, implying that trace(C) = 0];
- (8) the sum of the k largest eigenvalues of matrix C is at most (√k + 1)n/2 [22];
- (9) the line graph, L(Gn), furnishes a lower bound, whereas the maximally connected graph [23], L(G2) + L(Gn−2), furnishes an upper bound configuration for numerical planar graph eigenfunction analysis;
- (10) for irregular graphs Gn (i.e., ni has a relatively small modal frequency coupled with a relatively large range), the theoretical maximum number of negative eigenvalues is 3n/4 [24], whereas empirically this maximum number almost always is less than 2n/3, and usually between n/2 and 3n/5 [25], even in the presence of numerous completely connected K4 (in graph theory parlance) subgraphs (i.e., the maximum fully connected subgraph Kuratowski’s theorem allows to exist in a planar graph);
- (11) a bipartite graph is always 2-colorable (i.e., one only needs to assign at most two different colors to all graph vertices such that no two adjacent vertices have the same color), and vice-versa (see [26]), implying knowledge about the number and positioning of K4 subgraphs potentially is informative; and,
- (12) if the maximum degree Δ (i.e., the maximum ni; see property #2) exceeds 3, η denotes nullity, and Gn is not complete bipartite, then η < (Δ − 2)n/(Δ − 1) [5]―this is not a very useful spatial statistics/econometrics upper bound since most irregular surface partitionings have at least one polygon areal unit with Δ sufficiently large (e.g., in the range 10–28) that (Δ − 2)/(Δ − 1) is close to one.
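As a quick numerical illustration of properties #6 and #7, the following Python sketch builds a small planar graph (a 3-by-3 rook-adjacency lattice, an arbitrary choice for illustration) and compares the eigenvalue variances of matrices C and W with 1TC1/n and 1TD−1CD−11/n; it is a verification aid only, not part of the reported analyses.

```python
import numpy as np

# 3-by-3 rook-adjacency lattice (9 nodes), an arbitrary small planar example
P = 3
n = P * P
C = np.zeros((n, n))
for r in range(P):
    for c in range(P):
        i = r * P + c
        if c + 1 < P:            # east neighbour
            C[i, i + 1] = C[i + 1, i] = 1
        if r + 1 < P:            # south neighbour
            C[i, i + P] = C[i + P, i] = 1

d = C.sum(axis=1)                # row sums n_i
D_inv = np.diag(1.0 / d)
W = D_inv @ C                    # row-standardized adjacency matrix

lam_C = np.linalg.eigvalsh(C)    # C is symmetric, so eigvalsh applies
# W is similar to the symmetric matrix D^(-1/2) C D^(-1/2), so its eigenvalues are real
lam_W = np.linalg.eigvalsh(np.diag(d**-0.5) @ C @ np.diag(d**-0.5))

one = np.ones(n)
print("property #7: sum of eigenvalues of C =", round(lam_C.sum(), 12))   # trace(C) = 0
print("property #6: var(lam_C) =", lam_C.var(), " 1'C1/n =", one @ C @ one / n)
print("property #6: var(lam_W) =", lam_W.var(), " 1'D^-1 C D^-1 1/n =",
      one @ D_inv @ C @ D_inv @ one / n)
```

Because trace(C) = 0, the eigenvalue mean is zero, so each variance reduces to the mean of the squared eigenvalues, which is what the two quoted quadratic forms compute.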
Although this list is not exhaustive, and hence could contain many more eigenvalue properties, these twelve provide a foundation for the findings summarized in this paper. They also highlight that missing from this list is an effective method for determining the set of zero eigenvalues for an adjacency matrix, one of this matrix’s three inertia counts.
Mohar’s [22] result (i.e., property #8) needs modification in order to translate it from matrix C to matrix W. Although the line graph, L(Gn) (i.e., property #9), gives a lower bound configuration for practical spatial statistical problems, an undirected star graph (i.e., K1,n in graph theory parlance) gives its absolute lower bound for the sum of positive eigenvalues, which is one for matrix W; this particular graph also highlights the importance of accounting for zero adjacency matrix eigenvalues. In other words, given that the largest eigenvalue, λ1(W), equals 1, the upper bound for the sum of the k positive eigenvalues is k, with the count of positive eigenvalues denoted here by the matrix inertia notation n+; realization of this value is infeasible for a connected L(Gn) since all positive eigenvalues other than λ1(W) are guaranteed to be < 1. Thus, for other symmetric eigenvalue distributions (e.g., those characterizing a regular square tessellation lattice forming a complete rectangular region), this loose upper bound also is n/2 at most, and actually is closer to n+/2; such a tessellation forming a complete √n-by-√n square region, for a perfect square number n, with links in its dual planar graph defined by pixels sharing a non-zero length boundary, also has √n zero eigenvalues (e.g., see Comellas et al., 2008), slightly reducing this upper bound for it by a factor of (1 − 1/√n). Meanwhile, the sum of positive L(Gn) eigenvalues is approximately equal to n/π (the exact value depending upon whether n is even or odd), implying a sharper upper bound on k for the sum of its first k positive eigenvalues, and n/3—which can be < k—for the sum of all of its positive eigenvalues. This outcome suggests the following more general proposition:
Conjecture 1. Let Gn = (V, E) be a simple connected undirected planar graph with n vertices V = {v1, … , vn}, and m ≤ n(n − 1)/2 edges E = {eij linking vertices vi and vj: i = 1, 2, … , n and j = 1, 2, … , n; i ≠ j}. For n ≥ 100 nodes and no self-loops, k is an upper bound for the sum of the first k positive eigenvalues of its row-standardized adjacency matrix W.
Rationale. By the Perron-Frobenius theorem, λ1(W) = 1, and all other positive eigenvalues in this case are such that 0 < λj(W) < 1, j = 2, 3, … , k. Taliceo and Griffith [25] synthesize simulation experiment calculations that suggest a minimum value of around 100 for n, the part of this conjecture requiring proof. Tait and Tobin [23] underline the potential for Gn displaying anomalous qualities when n is small, noting that the maximum connectivity case requires n to exceed 8 to be true. Furthermore, in their extensive investigation of planar graphs housing embedded K4 subgraphs, Taliceo and Griffith [25] reveal that 3n/4 negative eigenvalues occur, at least asymptotically with increasing n, for certain supra-structure wheel and line graphs organizing a substructure of only K4 subgraphs, which is a peculiarity vis-à-vis empirical surface partitionings. Their empirical inventory, together with the observational roster appearing in Table 1 (also see Appendix A), implies that most irregular surface partitions have a relatively small percentage of their edges arranged in the precise formation of these subgraphs. The rare-event nature of this K4 situation underscores the need to adequately differentiate theoretical from practical consequences.
The principal eigenvalue λ1(C) upper bound result proved and reported by Wu and Liu [17] can extend this conjecture to binary 0-1 graph adjacency matrices, with an improved bound of λ1(C)n/2 < (√k + 1)n/2. This value outperforms even the extreme negative eigenvalue case of k = n/4 for all positive eigenvalues: for example, using the true n+ value, (7.59)(839)/2 (≈3184.05) is closer than (√393 + 1)(839)/2 (≈8738.76) to 656.86, the corresponding value for the ensuing empirical England illustration.
Finally, while a λn(W) value of −1 implies the foregoing loose upper bound of n/2, as λn(W) goes toward its smallest absolute value of roughly −1/2 (e.g., [27]), property #10 implies that this loose upper bound goes to n/4. One implication here is that λn(W) can either directly or indirectly contribute to establishing sharper upper bounds. Taliceo and Griffith [25] reveal that values greater than the extreme case of −1 move the loose upper bound toward 2n/5. This hypothesis merits future research attention. Therefore, as the discussion in this section reflects, a prominent theme in the existing literature concerns various aspects of positive eigenvalues, relegating negative eigenvalues to a secondary topic, while nearly ignoring zero eigenvalues altogether.
3. Specimen Empirical Surface Partitionings and Their Dual Graphs
Although the first problem this paper addresses, entailing subsets of linearly dependent columns/rows, appears to be of interest for any size matrix (e.g., Appendix B furnishes an easily replicated and controlled example, suitable for launching exploratory work, with much known about its zero eigenvalues for any size adjacency matrix), the second, entailing eigenvalue approximations, is of pragmatic interest—beyond verification exercises—especially in spatial statistics/econometrics, and only for extremely large matrices involving n in the 100,000s or greater (i.e., matrices for which numerical computation of eigenvalues is prohibitive in terms of time or computer memory requirements that surpass ultramodern resource capabilities). This section treats a relatively large n that is much smaller than this restrictive magnitude, but sufficiently large to engage computational challenges while allowing comparisons with accurate numerical solutions, namely n between 839 and 7249. The seven selected cases are actual real-world surface partitionings, with their selection decisions partly attributable to their relative and absolute sizes.
Table 1 summarizes some of their relevant features. Its upper bound for λ1(C) exhibits a marked decrease—by an average of over 50%—from the maximum ni values delivered by the Perron-Frobenius theorem.
Figure 1 portrays the selected empirical eigenvalue distributions. The complete sets of eigenvalues tend to resemble, and somewhat align with, truncated gamma probability density function distributions. The positive eigenvalue subsets more closely resemble skewed Weibull distributions (with most of the analyzed cases here displaying conspicuous tail deviations). These subset graphics help visualize the aforementioned conjecture.
Table 2 presents a second tabulation with entries for the five selected empirical adjacency matrices containing zero eigenvalues. These recordings insinuate that 2-tuples may be the dominant source of zero eigenvalues in empirical graph adjacency matrices (also see Appendix Figure A1), an anticipation consistent with intuition. The affiliated zero eigenvalues are a straightforward consequence:
Lemma 1. If each of two nodes in a graph has a single link that connects them only with a common other node, then this graph’s binary 0-1 adjacency matrix C has a zero eigenvalue.
Proof. Without loss of generality, arrange the adjacency matrix such that the two nodes in question occupy the first and second column/row positions of the adjacency matrix under study. Utilizing elementary matrix determinant column/row operations (e.g., [28,29]), subtracting the second row from the first row of det(C − λI), where det denotes the matrix determinant, and then adding the first column to the new second column, yields a first row whose only non-zero entry is −λ, in cell (1, 1). Next, implementing the Laplace expansion by minors for cell entry (1, 1) renders det(C − λI) = −λ × det(M11), where M11 denotes the corresponding (n − 1)-by-(n − 1) minor, and hence λ = 0 is an eigenvalue. This result is corroborated by the determinant property that the eigenvalues of a block diagonal matrix are the union of the individual block matrix eigenvalue sets. □
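A minimal numerical check of Lemma 1 follows (an illustrative sketch; the six-node graph is a hypothetical example, with nodes 0 and 1 as the two pendant vertices sharing node 2):

```python
import numpy as np

# Lemma 1 check: nodes 0 and 1 each have a single link, both to common node 2;
# the remaining nodes form an arbitrary path.
edges = [(0, 2), (1, 2), (2, 3), (3, 4), (4, 5)]
n = 6
C = np.zeros((n, n))
for i, j in edges:
    C[i, j] = C[j, i] = 1

lam = np.linalg.eigvalsh(C)
print(np.round(lam, 6))
print("number of (numerically) zero eigenvalues:",
      int(np.sum(np.abs(lam) < 1e-8)))   # Lemma 1 guarantees at least one
```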
Corollary 1. Lemma 1 applies to K > 1 separate distinct pairs of nodes in a graph that have a single link connecting each of a duo only with another common node.
Proof. Without loss of generality, arrange an adjacency matrix such that the K pairs of nodes in question contiguously occupy its first 2K column/row positions, reducing the core adjacency matrix to Cn−2K,n−2K. Recursively applying the Lemma 1 proof technique to the K twosomes yields K zero eigenvalues. □
Lemma 2. If each of K nodes in a graph has a single link that connects them only with a common other node (i.e., a star graph, a special case of a complete bipartite graph), then this graph’s binary 0-1 adjacency matrix C has at least K − 1 zero eigenvalues.
Proof. Without loss of generality, arrange the adjacency matrix such that the K nodes in question contiguously occupy its last K column/row positions, reducing the core adjacency matrix to Cn−K,n−K. Then, each of the last K columns/rows of (C − λI) contains only the diagonal entry −λ together with a one in the position of the common node. Next, implementing the Laplace expansion by minors for the ending (K − 1)-by-(K − 1) diagonal cell entries, which contributes the factor (−λ)^(K−1), is equivalent to applying it to a star graph for which one of its nodes is contained in the core adjacency matrix, Cn−K,n−K, rendering at least K − 1 zero eigenvalues (see [30], p. 266). □
Corollary 2. If each of K nodes in a square mesh lattice (i.e., regular square tessellation) graph has a single link that connects them only with a common other node (i.e., a star graph), then this graph’s binary 0-1 adjacency matrix C has at least P + K − 2 zero eigenvalues.
Proof. Extending the proof for Lemma 2, the core adjacency matrix Cn−K,n−K for a P-by-P regular square tessellation in isolation has P zero eigenvalues for its rook adjacency definition―its eigenvalues are given by 2{COS[πj/(P + 1)] + COS[πk/(P + 1)]}, j = 1, 2, … , P and k = 1, 2, … , P [31], with the P cases of k = (P + 1) − j yielding zero values. One of these matches occurs for each integer j, yielding a total of P zero eigenvalues. But the single arbitrary node to which the star graph attaches has its column/row modified by its additional column and row entries, rendering a matrix determinant that removes one of the P zero eigenvalues. Hence, the number is P − 1 + K − 1 = P + K − 2. □
Remark. This corollary produces an exact expectation for algorithm and conjecture verification exercises like the simulation specimens outlined in Appendix B.
Corollary 3. If each of H sets of Kh nodes forms a separate, distinct star subgraph, then this graph’s binary 0-1 adjacency matrix C has at least ΣKh − H zero eigenvalues.
Proof. Recursively applying the Lemma 2 proof technique to the H collections of Kh nodes yields ΣKh − H zero eigenvalues. □
Remark. These star subgraphs need to attach to nodes that contribute to linear combinations of the core adjacency matrix in order to eliminate one of its zero eigenvalues (see the Corollary 2 proof).
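In the spirit of the preceding remark, the following sketch offers a small verification exercise for Corollary 2: it attaches a K-leaf star to one (arbitrarily chosen corner) node of a P-by-P rook-adjacency lattice and reports the computed nullity alongside the P + K − 2 count; P = 4 and K = 3 are illustrative values, not taken from the paper.

```python
import numpy as np

def rook_lattice(P):
    """Binary 0-1 rook adjacency matrix for a P-by-P regular square tessellation."""
    n = P * P
    C = np.zeros((n, n))
    for r in range(P):
        for c in range(P):
            i = r * P + c
            if c + 1 < P:
                C[i, i + 1] = C[i + 1, i] = 1
            if r + 1 < P:
                C[i, i + P] = C[i + P, i] = 1
    return C

P, K = 4, 3                       # arbitrary illustrative sizes
core = rook_lattice(P)
n = P * P + K                     # attach a K-leaf star to lattice node 0
C = np.zeros((n, n))
C[:P * P, :P * P] = core
for leaf in range(P * P, n):
    C[0, leaf] = C[leaf, 0] = 1

lam = np.linalg.eigvalsh(C)
nullity = int(np.sum(np.abs(lam) < 1e-8))
print("computed nullity:", nullity, "  Corollary 2 count P + K - 2:", P + K - 2)
```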
Table 2 entries also raise questions about the claim by Feng et al. [18] that adding links strictly increases the matrix C principal eigenvalue’s magnitude (e.g., the US HSA specimen graph), which is the reason this table’s entries show so many decimal places; rather, this spectral radius looks to be simply monotonically non-decreasing. This assertion may well apply to the extreme negative eigenvalues, too: they also appear to be monotonically non-decreasing in absolute value.
The p-tuples acknowledged in Table 2 are the linearly dependent subsets of columns/rows producing zero eigenvalues in their respective adjacency matrices. The next section discusses a statistical regression estimation operator mechanism for uncovering these linear combinations. Their important attributes alluded to in this table include overlaps (i.e., a column/row can be a member of more than one linearly dependent subset), size variation (i.e., the number of columns/rows in a linear combination equation), and a frequency that seems to decrease with increasing size, perhaps mirroring a Poisson distribution. Overlaps emphasize that the discovered p-tuples acknowledged in Table 2 are not necessarily unique; those removed for the construction of this table are ones that maintain a connected graph [documented by λ2(W) ≠ 1] as well as contain the smallest possible ni values (most often just one).
4. Identifying Linearly Dependent Column/Row Subsets in Adjacency Matrices
Hawkins [32] gives an insightful clue to identifying linearly dependent column/row subsets in an adjacency matrix by noting that eigenvectors of (near-)zero principal components extracted from a correlation matrix constructed by amalgamating a dependent variable and its accompanying set of covariates suggest possible alternative sub-regressions. One weakness of his conceptualization occurs with multiple zero eigenvalues: their eigenspace is not unique [33,34], and hence any reported set of eigenvectors is an arbitrary choice among all of the possible correct eigenvectors for a repeated (i.e., degenerate) eigenvalue of a real symmetric matrix. Consequently, if only a few zero eigenvalues exist, then multivariate principal components analysis (PCA) tends to recover p-tuples like the simple structure ones referenced in Table 2 acquired with the SAS PROC TRANSREG [35] procedure. Otherwise, it can confuse even the stand-alone 2-tuple sources. Accordingly, still pursuing this regression approach, standard linear regression procedures designed to detect and convey a user message equivalent to “model is not of full rank” (e.g., the SAS PROC REG [35] message) furnish a process that yields the desired p-tuples.
Overlapping linear combinations constitute a complication whose astute management and treatment remain both elusive and a fruitful subject of future research. The Syracuse entry in Table 2 as well as Appendix C underscore some of the data analytic complexities that can arise in the presence of this phenomenon.
4.1. Competing Algorithms
SAS champions the sweep algorithm (aka SWP [35,36], [37] (pp. 95–99), [38]; Tsatsomeros [39] furnishes an excellent literature review and historical overview for it), whereas much contemporary mathematical work advocates for the classical QR with column pivoting solution (e.g., [40,41,42]). As an aside, Lange ([37], pp. 93–94) notes that “[t]he sweep operator … is the workhorse of computational statistics,” and that “[a]lthough there are faster and numerically more stable algorithms for … solving a least-squares problem, no algorithm matches the conceptual simplicity and utility of sweeping” due to its superior abilities to efficiently and effectively exploit linear regression matrix symmetries; he then notes that its historical competitors were/are the Cholesky decomposition, the Gram-Schmidt orthogonalization, and Woodbury’s formula. Meanwhile, generally speaking, this former sweep algorithm, whose most serious contemporary competitor is the QR algorithm, performs elementary row operations on a system of linear equations, which for least squares regression is the set of normal equations. It enables the execution of this linear regression by sweeping particular rows of the sums-of-squares-and-cross-products matrix XTX in or out, where sweeping is similar to executing a Gauss-Jordan elimination. This sweep procedure involves the following sequence of matrix row adjustments (the basic row operations, pivoting exclusively on diagonal matrix elements, are the multiplication of a row by a constant, and the addition of a multiple of one row to another) based upon diagonal element h in matrix XTX = <ajk> whose nonzero value is ahh:
j = k = h: matrix entry XTX[h, h] becomes −1/ahh;
j = h and k ≠ h (or j ≠ h and k = h): matrix entry XTX[h, k] becomes ahk/ahh; and
j ≠ h and k ≠ h: matrix entry XTX[j, k] becomes ajk − ajh·ahk/ahh.
The row operations generating these quantities simultaneously update estimates for the regression coefficients, error sum of squares, and generalized inverse of the given system of equations. Matrix XTX symmetry requires calculations for only its upper triangle, reaping execution time savings. It also achieves in-place mapping with minimal storage. Meanwhile, sweeping in and out refers to the reversibility of this operation. The preceding added effects constitute “sweeping in”; they can be undone by a second sweep, constituting “sweeping out.” This sweep operator is quite simple, making it an extremely useful tool for computational statisticians.
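The following Python sketch implements the sweep update rules just listed (using the Lange-style convention with −1/ahh on the pivot); it is an illustrative analog, not the SAS SWP routine, and the regression demonstration, random data, and 1.0 × 10−12 pivot tolerance are assumptions chosen for this example. A near-zero pivot is exactly how a linearly dependent column announces itself.

```python
import numpy as np

def sweep(A, h, tol=1e-12):
    """Sweep a symmetric matrix A on diagonal pivot h, following the update
    rules listed above (Lange-style convention); returns a new array."""
    A = np.asarray(A, dtype=float)
    d = A[h, h]
    if abs(d) < tol:
        # a (near-)zero pivot flags column h as (nearly) linearly dependent
        raise ValueError(f"near-zero pivot at column {h}")
    n = A.shape[0]
    B = np.empty_like(A)
    for j in range(n):
        for k in range(n):
            if j == h and k == h:
                B[j, k] = -1.0 / d
            elif j == h or k == h:
                B[j, k] = A[j, k] / d
            else:
                B[j, k] = A[j, k] - A[j, h] * A[h, k] / d
    return B

# Demonstration: sweeping the augmented cross-products matrix
# [[X'X, X'y], [y'X, y'y]] on its first three pivots leaves the OLS
# coefficients in the last column and the error sum of squares in the corner.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.1, size=20)
M = np.block([[X.T @ X, (X.T @ y)[:, None]],
              [(X.T @ y)[None, :], np.array([[y @ y]])]])
for h in range(3):
    M = sweep(M, h)
print("OLS coefficients:", np.round(M[:3, 3], 4))
print("error sum of squares:", round(M[3, 3], 6))
```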
In contrast, a QR algorithm, which tends to be slow but accurate for ill-conditioned systems, applies the QR matrix decomposition (aka QR or QU matrix factorization) XTX = QR, where Q is an orthonormal matrix and R is an upper triangular matrix. The appeal of such an orthogonal matrix is that its transpose and inverse are equal. Thus, the linear regression system of equations XTXb = XTY reduces to a triangular system Rb = QT(XTY), which is much easier to solve. Implementation of this QR algorithm can be carried out with column pivoting, which bolsters its ability to solve rank-deficient systems of equations while tending to provide better numerical accuracy. However, by doing so, it solves the different system of equations XTXP = QR, or XTX = QRPT, where P is a permutation matrix linked to the largest remaining column (i.e., column pivoting) at the beginning of each new step. The selection of matrix P usually is such that the diagonal elements of matrix R are non-increasing. This system of equations switch potentially adds further complexity to handling this algorithm. Nevertheless, column pivoting is useful when eigenvalues of a matrix actually approach, or are expected to approach, zero, the topic of this paper. Consequently, the relative simplicity of the sweep algorithm frequently makes it preferable to a QR algorithm.
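For comparison, here is a brief sketch of QR with column pivoting (via SciPy, purely for illustration) applied to a small random matrix whose fourth column is constructed as the sum of its first two; the rank-tolerance rule shown is a common heuristic rather than a prescribed standard.

```python
import numpy as np
from scipy.linalg import qr

rng = np.random.default_rng(1)
X = rng.normal(size=(10, 4))
X[:, 3] = X[:, 0] + X[:, 1]               # deliberate linear dependency

Q, R, piv = qr(X, pivoting=True)          # X[:, piv] = Q @ R
tol = np.abs(R[0, 0]) * max(X.shape) * np.finfo(float).eps
rank = int(np.sum(np.abs(np.diag(R)) > tol))
print("numerical rank:", rank)            # 3, not 4
print("pivot order:", piv)                # a member of the dependent trio lands last
```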
A systematic comparative review or numerical examination between the sweep operator and its handful of prominent competitors mentioned here is beyond the scope of this paper. The conceptual discussion in this section identifies certain strengths and weaknesses of this sweep operator, which is a reasonable starting point for embarking upon such a future assessment.
4.2. SAS Procedures: REG and TRANSREG
PROC REG [35] sequentially applies a sweep algorithm to the mathematical statistics normal equations formed from the standard linear regression cross-products covariate matrix XTX, according to the order of the covariates appearing in its SAS input statement. It always begins by sweeping the first column in matrix X, followed by the next column in this matrix if its pivot is not less than a near-zero value whose default SAS threshold magnitude is 1.0 × 10−9 for most machines (i.e., if that column is not a linear function of the preceding column), then continuing sequentially to each of the next columns if their respective pivots are not less than this threshold amount, until it passes through all of the columns of matrix X. This method is accurate for most undirected connected planar graph adjacency matrices since they are reasonably scaled and not too collinear. Given this setting, this SAS procedure can uncover linearly dependent subsets of adjacency matrix columns by specifying its first column, C1, as a dependent variable, and its remaining (n − 1) columns, C2–Cn, as covariates. SAS output from this artificial specification includes an enumeration of the existing linearly dependent subsets of columns. A second regression that is stepwise in its nature and execution can check whether or not C1 itself is part of a linear combination subset. Simplicity dissipates here when n becomes too large, with prevailing numerical precision resulting in some linear combinations embracing numerous superfluous columns with computed near-zero regression coefficients (e.g., 1.0 × 10−11). This rounding error corruption emerges in PROC REG [35] between n = 2100 (e.g., the Chicago empirical example) and n = 3408 (e.g., the US HSA empirical example).
In contrast, PROC TRANSREG [35] executes nonsequential sweeps, finding the best column to sweep first, followed by the second best, then the third best, and so on. This deviation from a sequential SAS input ordering tends to be more numerically accurate. Again, simplicity dissipates here when n becomes too large, although at a much larger n than that for PROC REG [35], emerging here somewhere between n = 5265 (e.g., the Texas empirical example) and n = 7249 (e.g., the Syracuse empirical example). In such messy situations, which usually involve the larger values of p, ignoring the near-zero regression coefficients exposes the correct latent subset linear combinations. Therefore, PROC TRANSREG [35] is the SAS procedure most often utilized in this paper, with a few of its smaller n results checked or carried out with PROC REG [35].
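The sketch below is a simplified Python analog of this regression-based scan—not the SAS PROC REG/TRANSREG implementation—in which columns are processed left to right (mimicking sequential sweeping), each new column is regressed on the columns already accepted as linearly independent, and a near-zero residual exposes a linearly dependent p-tuple; the toy adjacency matrix and the 1.0 × 10−8 tolerance are illustrative assumptions.

```python
import numpy as np

def dependent_subsets(C, tol=1e-8):
    """Greedy, regression-style scan of the columns of C: each new column is
    regressed on the columns already accepted as independent; a (near-)zero
    residual exposes a linearly dependent p-tuple and its coefficients."""
    n = C.shape[1]
    independent = [0]                      # start, as in the text, with column C1
    found = []
    for j in range(1, n):
        B = C[:, independent]
        coef, _, _, _ = np.linalg.lstsq(B, C[:, j], rcond=None)
        resid = C[:, j] - B @ coef
        if np.linalg.norm(resid) < tol:    # column j is a linear combination
            members = [independent[k] for k in range(len(independent))
                       if abs(coef[k]) > tol]
            found.append((j, members, np.round(coef[np.abs(coef) > tol], 4)))
        else:
            independent.append(j)
    return found

# toy adjacency matrix: columns 1 and 3 coincide (nodes 1 and 3 share the same
# neighbours), as do columns 4 and 5 (two pendants sharing node 0)
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 4), (0, 5)]
n = 6
C = np.zeros((n, n))
for i, j in edges:
    C[i, j] = C[j, i] = 1
for col, members, coefs in dependent_subsets(C):
    print("column", col, "= combination of columns", members, "with coefficients", coefs)
```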
4.3. Disclosing Subset Linear Combinations for the Specimen Empirical Surface Partitionings
We denote the columns of a given n-by-n matrix by COLj, j = 1, 2, … , n (i.e., number them from left to right in a matrix with consecutive positive integers). Among other listings, Table 2 details the numbers and sizes of subset linear combinations of the specimen graph adjacency matrix columns, once more understanding that symmetry means row outcomes are exact parallels. Neither the Chicago nor the North Carolina empirical examples contain zero eigenvalues, and their PROC TRANSREG [35] results yield no “less than full rank model” (the PROC TRANSREG message) warning.
Identifying 2-tuples is relatively easy and efficient for an undirected connected planar graph Gn. Since its sparse binary 0-1 adjacency matrix may be stored as a list of no more than 6(n − 2) pairs of edges/links, rather than either a full n2 or no-self-loops n(n − 1) set of pairings, a fast comparison of the position of ones in two columns is possible. These revelations, which primarily are the ones Griffith and Luhanga [12] report, can serve as a check for generated regression disclosures. Sylvester’s [43] matrix algebra inertia theorem transfers these linear combinations from matrix C to matrix W.
Figure 2 portrays the SAS output for the Texas empirical example. The order of column appearances in an equation is irrelevant, although switching them between the right- and left-hand sides of the equations can alter the itemized linear combination coefficients, if only in terms of their signs. The seven 2-tuples here corroborate the fast brute force pairwise comparison matchings outcome based upon a sparse adjacency matrix file. In addition, PROC TRANSREG [35] identifies the columns constituting the exclusive 3-tuple (i.e., {C1594, C1668, C2451}) and the two 4-tuple (i.e., {C3263, C3264, C3265, C3270} and {C4842, C4847, C4848, C4849}) subsets.
Figure 3 portrays part of the SAS output for the Syracuse empirical example. The near-zero coefficients (e.g., all ±0.000197 or ±0.000031) are unessential (i.e., attributable to numerical imprecision), whereas the few ±1.0000 coefficients indicate the latent subset: C55, C97, and C141. The term “Identity” appears in this computer hardcopy since PROC TRANSREG [35] enables and implements variable transformations, with this particular formulation option instructing this procedure to retain the initial untransformed variables themselves.
Figure 4 demonstrates the failure of PCA to always produce simple linear combinations (i.e., simple structure) due to the non-uniqueness of the eigenspaces involved. Multiple linear regression allocates the repeated adjacency matrix column C1460 in this example to only one subset linear combination, whereas PCA introduces some confusion by allocating it to the various subsets to which it also could belong. In other words, the linear regression approach employing a sweep operator furnishes a more parsimonious solution here. One important aspect of this exemplification is that PCA muddling still materializes in the presence of a relatively large n (i.e., 3408) coupled with a quite small number of redundant columns (i.e., eight).
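The eigenspace non-uniqueness behind this PCA muddling is easy to demonstrate with a hypothetical eight-node graph containing two separate 2-tuples (an illustrative construction, not one of the paper's specimens): any rotation of the two simple-structure null vectors is an equally valid pair of zero-eigenvalue eigenvectors, so an eigen-decomposition is free to return mixed vectors that obscure the simple 2-tuples a regression scan reports directly.

```python
import numpy as np

# Two separate duplicate-column pairs: (4, 5) share node 0 and (6, 7) share node 2.
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 4), (0, 5), (2, 6), (2, 7)]
n = 8
C = np.zeros((n, n))
for i, j in edges:
    C[i, j] = C[j, i] = 1

# Two "simple structure" null vectors, one per 2-tuple
u = np.zeros(n); u[4], u[5] = 1, -1
v = np.zeros(n); v[6], v[7] = 1, -1
# Any rotation of them is an equally valid null-space basis ...
theta = np.pi / 4
u2 = np.cos(theta) * u + np.sin(theta) * v
v2 = -np.sin(theta) * u + np.cos(theta) * v
print(np.linalg.norm(C @ u2), np.linalg.norm(C @ v2))   # both ~0
# ... so an eigen-decomposition may return the mixed pair (u2, v2) instead of (u, v).
```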
4.4. Discussion: Selected Sweep Operator Properties
Computational complexity and robustness are two salient properties, beyond the advantages established via the competitor algorithm comparisons, that demand commentary here supplementing that already appearing in Section 4.1, if for no other reason than to better contextualize the sweep operator within the existing literature. In statistics, although Goodnight [36] popularized this operator, Dempster [44] essentially introduced it to this discipline, with Andrews [45] evaluating it in terms of robustness; noteworthy is that non-statisticians call it by other names, including the principal pivot transform [46], gyration [47], and exchange [48]. Two of its close relatives are the (previously mentioned) Gauss-Jordan elimination ([42], p. 166) and the forward Doolittle procedure ([36], pp. 151–152), an execution step in the LU decomposition. Thus, on the one hand, its computational complexity—the solution time and memory/storage space resources required by it to solve a specific zero eigenvalue problem—is not extreme, relatively speaking. For example, SAS reports a total processing CPU execution time of 5:22.59 for n = 7249 (the largest graph analyzed in this paper) on a desktop computer with four CPUs. It quickly regresses all variables, such as the second through nth columns, against one specified variable, such as the first column, of a graph adjacency matrix, obtaining in a single execution linear regression computations and all perfect linear combinations of columns that are present. Therefore, it is relatively efficient and scalable to the same degree as any linear regression algorithm; linear regression scalability is not computationally heavy.
On the other hand, computer algorithm robustness here—a desirable characteristic ensuring the sweep algorithm maintains reliability and resilience across differing circumstances—refers to its ability to consistently and accurately uncover zero eigenvalue linear combinations of covariates in the presence of various types of disturbances, errors, and/or unexpected inputs (e.g., the random planar graphs of Appendix C). Accordingly, the criteria it must satisfy should ensure that it is able to identify perfect linear combinations that are: (1) for multiple replications of a single adjacency matrix column/row (see Corollary 2); (2) smaller ones embedded in larger ones (e.g., see Table 2); (3) different but sharing columns/rows without double-counting them (e.g., see Figure A1); (4) not confused by numerical precision/rounding error (e.g., see Figure 3); and, (5) for excessively complicated outlier graph structures (e.g., Corollary 2) involving very many columns/rows (see Figure A3). In other words, the sweep operator is resistant since it continues to identify all latent zero eigenvalue linear combinations even if a small fraction of a graph adjacency matrix undergoes detrimental alterations, or the recovered linear combination is tremendously sophisticated (see Appendix C). With regard to the preceding fourth point, PROC TRANSREG [35] often appears to be more resilient than PROC REG [35], at least in their SAS enactment. However, the reported trivial regression coefficients are so small that they become obvious precision errors (see Section 4.2) to the naked eye.
5. Approximating the Eigenvalues of Matrix W
The spatial statistical problem of interest here is estimation of the auto-normal model normalizing constant (e.g., [49]), the Jacobian of the transformation from a spatially autocorrelated to a spatially unautocorrelated mathematical space, using calculus terminology. This quantity almost always is a function of the eigenvalues of the row-standardized adjacency matrix W for the undirected connected planar graph dual to a configuration of polygons forming a two-dimensional surface partitioning. Such eigenvalue calculations are possible for quite large n, in the lower 10,000s, but become impossible beyond the frontiers of computing power, a constantly expanding resource size. This normalizing constant is the existing work adaptation target for the matrix inertia refinement this paper disseminates.
An undirected star graph, K1,n, perceptively illustrates the problem here in its extreme: its eigenvalues are 1, n − 2 0s, and −1 for matrix W, whereas they are ±√(n − 1) and n − 2 0s for matrix C. In this kind of situation, Bindel and Dong [15], for example, show how eigenvalue frequency distributions can become dominated by a spike at zero (also see Figure 5b,d). A null eigenvalue fails to add any amount to the sum of the n eigenvalues, regardless of the value of n for star graph Gn. For all n > 1, the variance of these eigenvalues is 1TD−1CD−11/n = 2/n. More realistic graphs show that although λ1(W) = 1 remains unchanged, adding redundant links that introduce zero eigenvalues can alter λn(C) and λn(W) as well as some or all of the intermediate eigenvalues (see Table 2). Furthermore, the matrix W eigenvalues have a more limited range (Table 2) that causes their density to increase faster than it does for their binary 0-1 parent matrix C eigenvalues.
Estimating a complete set of n eigenvalues with a method of moments type technique means spreading the variance over all n values while constraining their sum to equal zero. Griffith and Luhanga [12] address the need to include matrix inertia information in this process: zero eigenvalues should not have non-zero allocations during an approximation routine, and the number of positive and negative eigenvalues needs differentiation in asymmetric eigenvalue distributions since the same cumulative sum almost certainly is spread over fewer individual λs in one of the two instances (almost always the positive domain). Taliceo and Griffith [25] explore this latter trait. Incorporating these two facets for remotely sensed data (e.g., see [13]) analyses renders remarkably accurate eigenvalue approximations for that spatial statistical landscape [14], in part due to its symmetric eigenvalue distribution for the often employed rook adjacency definition. In contrast, ignoring these two influential facets tends to cause the approximated eigenvalues for typical administrative districts (e.g., counties, census tracts) forming irregular surface partitionings to display a discontinuity separation gap at zero (e.g., Figure 5a,c; a finding reported by Griffith in earlier publications). Improved adoption of these two eigenvalue properties should enable a better approximation fabrication for them that exhibits a smoother transition between positive and negative λs.
5.1. Preliminary Eigenvalue Approximation Steps
Eigenvalue approximation can follow a three-step sequence of directives. The first step approximates the extreme eigenvalues of matrices C and W [16,50]. A formulated algorithm employing solely the sparse version of an adjacency matrix (i.e., a sequential listing of only the row-column cells containing a one) quickly produces these values, with extreme accuracy for both λ1(C) and λn(W) (Table 3); the most relevant of these two values for spatial statistics/econometrics is λn(W). This algorithm builds its λn(C) approximation with the normalized principal eigenvector, E1, estimated during the iterative calculation of λ1(C). Next, this algorithm builds its λn(W) estimate with the normalized eigenvector En approximated during the iterative calculation of λn(C). Given that λ1(W) ≡ 1 is known theoretically, the appealing outcome is that λn(W) is knowable for massively large matrices.
The second step of the proposed eigenvalue approximation procedure determines the number of zero eigenvalues. The sparse version of an adjacency matrix also allows a quick identification of all 2-tuples generating zero eigenvalues (Table 4), which in general, and as a rational expectation, seem to account for a preponderance of zero eigenvalues (e.g., [12], Table 1 and Table 2). However, even the triplet computations—executing n(n − 1)(n − 2) trivariate regressions—for the larger of the seven specimen landscapes studied in this paper are too computer intensive for practical purposes. Moreover, presently the best estimate of n0 is the number of 2-tuples for massively large square matrices. In addition, this approach fails to be effective when extended to more general real matrices since their non-sparseness requires a replacement of each of the n(n − 1) simple comparisons with a bivariate regression; columns/rows can be proportional rather than just equal. This daunting computational demand argues for keeping the original PROC TRANSREG [35] operationalization with its n(n − 1) sweep operations.
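A compact sketch of this 2-tuple identification step using only the sparse edge list follows (an illustrative stand-in for the brute-force pairwise comparison described in the text): hashing each vertex's neighbour set groups identical adjacency matrix columns, and each group of g identical columns contributes g − 1 zero eigenvalues. The seven-node example graph is hypothetical.

```python
from collections import defaultdict

def two_tuples(edges, n):
    """Group vertices whose adjacency-matrix columns are identical, working
    directly from a sparse edge list by hashing neighbour sets."""
    nbrs = defaultdict(set)
    for i, j in edges:
        nbrs[i].add(j)
        nbrs[j].add(i)
    groups = defaultdict(list)
    for v in range(n):
        groups[frozenset(nbrs[v])].append(v)
    # each group of size g contributes g - 1 zero eigenvalues
    return [vs for vs in groups.values() if len(vs) > 1]

edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 4), (0, 5), (0, 6)]
print(two_tuples(edges, 7))   # e.g., [1, 3] share {0, 2}; pendants [4, 5, 6] share node 0
```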
The third step approximates the non-zero eigenvalues with a method of moments type of calibration methodology, given that their mean (i.e., zero) and variance (i.e., 1TD−1CD−11/n) are either known or easily computable, and hence available. A favored specification for this task exploits eigenvalue ordering by calibrating rank-based expressions such as Equation (4), where ri denotes the descending ordering rank of the ith eigenvalue after setting aside the set of zero eigenvalues [which subsequently are merged with the approximations generated by Equation (4)], and the exponents γ and θ match the approximated eigenvalue mean and square root of its sum of squares as closely as possible (guided by the method of moments estimation strategy, and using a mean squared error criterion); all manufactured values of expression (4) belong to the interval [0, 1], as do the absolute values of their corresponding actual eigenvalues, with γ > 1 and θ > 1 indicative of variance inflation needing removal, and γ < 1 and θ < 1 indicative of deflated variance needing to be restored. The differences between γ and θ appearing in Table 4 confirm an asymmetric distribution of eigenvalues, whereas the positive eigenvalues bivariate regressions corroborate a well-aligned linear trend with their adjusted rankings. This linear trend tendency is capable of facilitating the drafting of an improved upper bound for the sum of k positive eigenvalues.
5.2. An Instructive Eigenvalue Approximation Assessment
This section summarizes evaluations of two specimen landscapes, namely North Carolina (no zero and 41.4% positive eigenvalues) and Syracuse (48 of 70 easily detectable zero and 45.2% positive eigenvalues). The working assumptions are: (1) the number of 2-tuple zero eigenvalues accurately counts n0 (the calculable count for massively large Gn); (2) the nearest integer to 45% of the non-zero eigenvalues accurately counts n+ (roughly the mid-point of large-landscape simulation findings reported in [25]); and (3) Equation (4) approximates the total set of n eigenvalues.
Figure 6 portrays the Jacobian plots across the auto-normal model spatial autocorrelation parameter values contained in its feasible parameter space interval, namely 1/λn(W) < ρ < 1. These graphical findings imply that the enhanced (i.e., refined) but relatively simple eigenvalue approximations proposed in this paper are sufficiently accurate to support sound spatial autoregressive analyses. More comprehensive appraisals of this proposition merit future research attention.
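For completeness, the Jacobian term itself is a simple function of the matrix W eigenvalues, ln|det(I − ρW)| = Σ ln(1 − ρλi(W)). The sketch below evaluates it from the exact eigenvalues of a small row-standardized rook lattice (an arbitrary stand-in; in practice the approximated eigenvalues from Section 5.1 would be substituted, and the ρ grid shown is illustrative).

```python
import numpy as np

def log_jacobian(lam_W, rho):
    """ln|det(I - rho*W)| computed from the eigenvalues of W (a standard identity)."""
    return np.sum(np.log(1.0 - rho * lam_W))

# exact eigenvalues of a small row-standardized rook-adjacency lattice W
P = 10
n = P * P
C = np.zeros((n, n))
for r in range(P):
    for c in range(P):
        i = r * P + c
        if c + 1 < P:
            C[i, i + 1] = C[i + 1, i] = 1
        if r + 1 < P:
            C[i, i + P] = C[i + P, i] = 1
d = C.sum(axis=1)
lam_W = np.linalg.eigvalsh(np.diag(d**-0.5) @ C @ np.diag(d**-0.5))  # = eigenvalues of W

for rho in (-0.9, -0.5, 0.0, 0.5, 0.9):      # inside 1/lambda_n(W) < rho < 1
    print(rho, round(log_jacobian(lam_W, rho), 4))
```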
6. Discussion
The following two topics warrant further discussion: identifying linearly dependent subsets of columns/rows in more general matrices, and application of the eigenvalue approximation technique to a very sizeable adjacency matrix.
Table 5 presents a posed problem, one that contains a column, c6, that is a trivial zero eigenvalue source, and hence ignored in terms of constructing linear combinations with it. The non-square matrix requires analysis of both its original columns and its original rows (i.e., the columns of its transpose). PROC TRANSREG [35] identifies the linearly dependent subsets of columns and rows appearing in Figure 7, with a count of five. Column c1 and transposed column (i.e., row) r1 serve as the dependent variables in their respective linear regression analyses. Accordingly, since PROC TRANSREG [35] only uncovers linearly dependent subsets in a collection of covariates, both c1 and r1 also have to be analyzed with a stepwise PROC REG [35], which exposes the {r1, r2, r5} linearly dependent subset (e.g., the root mean squared error, _RMSE_, is zero, and the linear regression multiple correlation, _RSQ_, is 1).
The 2010 US census tract polygons covering the coterminous continental part of that country furnish a sizeable empirical example of an undirected connected planar Gn, with n = 72,538, 1TC1 = 402,830, MAX{ni} = 48, 1TD−1CD−11 = 12,985.54, λ1(C) = 8.63582, and λn(W) = −0.89732; on a desktop computer (Intel(R) Xeon(R) CPU E5640 @ 2.67 GHz and 2.66 GHz (2 processors), 24.0 GB RAM, 64-bit OS, x64-based processor, Windows 10 Enterprise), execution time for calculating these last two quantities was less than 40 s. Crafting the polygon configuration graph for this coterminous US census tract irregular surface partitioning involved modifying an ESRI ArcMAP .shp file, with such manual adjustments as linking islands to the mainland through ferry routes. The simple 2-tuple matching analysis retrieves 32 zero eigenvalue sources. Consequently, the basic input for approximating the matrix W eigenvalues, and hence the needed Jacobian term for spatial statistical/econometric autoregressive analysis, is not only feasible, but also available here. This exemplification once again illustrates how the matrix inertia refinement promulgated in this paper is adaptable to existing work.
7. Conclusions and Implications
In conclusion, this paper set out to establish solutions to two seemingly unsolved or poorly solved linear algebra problems contained in topics of more general interest, one frequently mentioned on the internet, and the other becoming of increasing interest and concern as geospatial analysts push their empirical spatial statistical/econometric analyses to larger and larger georeferenced datasets (sometimes in the realm of machine learning and data mining) and contemporary social network analysts continue to adapt spatial analysis techniques to their often massively larger—the KONECT archive project (http://konect.cc/ (accessed on 24 October 2023)) makes 1326 social networks available to the public, ranging in node count from 16 to 105.2 million, of which just 543 have fewer than 10,000 nodes, and whose size frequencies closely conform to a log-normal distribution—denser, and almost certainly non-planar graph generated adjacency matrices. The first is the enumeration of all non-trivial linearly dependent column/row subsets in a matrix. One on-line commentator mentions that this problem most likely requires a supercomputer for even modestly large matrices. The regression-based solution promoted in this paper successfully, efficiently, and effectively handles matrices with dimensions in the 1000s, employing a laptop personal computer using the following processor: Intel(R) Core(TM) i5-2520M CPU @ 2.50 GHz; this machine has 8.00 GB of installed RAM, and runs with a 64-bit operating system, clearly not state-of-the-art computer or supercomputer technology.
The second problem of approximating eigenvalues for massively large matrices builds upon an evolving foundation, with demand for its solution being prompted by the existence of a parallel solution for remotely sensed satellite data whose Gn adjacency matrix size often is in the 10,000,000s [14]. This magnitude echoes that for the aforementioned KONECT project social network entries in excess of one million nodes (reflecting a practical upper limit size), which number 18 and have a median of 30 million nodes, and whose Jacobian term requires the irregular tessellation category of eigenvalue approximations outlined in this paper. Another imperative application arena is the present-day expansion of data mining and machine learning analyses to georeferenced, social media, and other substantially large datasets. The solution advocated for in this paper streamlines earlier results by exploiting extremely large adjacency matrix inertia.
The fundamental implications launched by this paper may be summarized as a combination of the preceding and following conjectures:
Conjecture 2. Let Gn = (V, E) be a simple connected undirected planar graph with n vertices V = {v1, … , vn}, and m ≤ n(n − 1)/2 edges E = {eij linking vertices vi and vj: i = 1, 2, … , n and j = 1, 2, … , n; i ≠ j}. If Gn has no K4 subgraphs, then the maximum value of n− is 2n/3.
Conjecture 3. Let Gn = (V, E) be a simple connected undirected planar graph with n vertices V = {v1, … , vn}, and m ≤ n(n − 1)/2 edges E = {eij linking vertices vi and vj: i = 1, 2, … , n and j = 1, 2, … , n; i ≠ j}. The sum of the k largest matrix W positive eigenvalues least upper bound for a self-loopless Gn’s adjacency matrix is + ε for some suitably small ε.
Finally, future research needs to upgrade the estimation of λn(C), enhance the accurate determination of n+ (and hence n−), expand the quick calculation of p-tuple linearly dependent subsets of adjacency matrix columns/rows greater than p = 2, accurately quantify the negative eigenvalue part of Equation (4) with a nonlinear regression equation specification, and refine and then convert the three stated conjectures into theorems with proofs. Future research also should explore furthering the effectiveness of the proposed sweep-algorithm-based solution utilized here, especially for much larger n, by reformulating it to be optimal for the nullity problem addressed here, rather than for its current generic multiple linear regression implementation, allowing a meaningful reduction in its customized computational complexity. One way to achieve this particular goal might be by more accurately addressing the underlying structure of empirical planar graphs, perhaps utilizing special properties of the Laplacian version of their adjacency matrices to attain faster execution times.