The Geometry of Signal Detection with Applications to Radar Signal Processing

: The problem of hypothesis testing in the Neyman–Pearson formulation is considered from a geometric viewpoint. In particular, a concise geometric interpretation of deterministic and random signal detection in the philosophy of information geometry is presented. In such a framework, both hypotheses and detectors can be treated as geometrical objects on the statistical manifold of a parameterized family of probability distributions. Both the detector and detection performance are geometrically elucidated in terms of the Kullback–Leibler divergence. Compared to the likelihood ratio test, the geometric interpretation provides a consistent but more comprehensive means to understand and deal with signal detection problems in a rather convenient manner. Example of the geometry based detector in radar constant false alarm rate (CFAR) detection is presented, which shows its advantage over the classical processing method.


Introduction
Signal detection in the presence of noise is a basic issue in statistical signal processing and is fundamental to target detection applications in modern radar signal processing systems.Classical approaches to binary hypothesis testing are usually in the form of a likelihood ratio test.It has been proved that the optimal detector for a known deterministic signal in Gaussian noise is termed the matched filter, while for a random signal with known statistics, it is named the estimator-correlator in which the signal estimate is correlated with the data [1].The essence of a basic detection problem can be regarded as decision or discrimination between two probability distributions with different models or parameters.While the likelihood ratio test provides an optimal solution to hypothesis testing problems, its meaning is not obvious in terms of the discrimination nature of a detection problem.The term "discrimination" is usually related to a decision or comparison in terms of distance between two items.There is no doubt that a better understanding of the detection problem will benefit from interpretations of its discrimination nature.Furthermore, it is promising to develop new approaches revealed by such interpretations to deal with the detection problem itself.
In statistics and information theory, a parameterized family of identically distributed probability distributions is usually regarded as a statistical manifold [2].The statistical manifold of probability distributions has a natural geometrical structure that characterizes the intrinsic statistical properties of the family of distributions.Information geometry is the study of intrinsic properties of manifolds of probability distributions [3], where the ability of the data to discriminate those distributions is translated into a Riemannian metric [4].The main tenet of information geometry is that many important notions in probability theory, information theory and statistics can be treated as structures in differential geometry by regarding a space of probabilities as a differentiable manifold endowed with a Riemannian metric and a family of affine connections [5].By providing a means to analyze the Riemannian geometric properties of various families of probability density functions, information geometry offers comprehensive results about statistical models simply by considering them as geometrical objects.
The geometric theory of statistics, pioneered in the 1940s by Rao [4] and further developed by Chentsov [6], Efron [7,8] and Amari [5,9], has found a wide range of applications in the theory of statistical inference [10,11], the study of Boltzmann machine [12], the learning of neural networks [13], the Expectation-Maximization (EM) algorithm [14], all with certain degrees of success.In the last two decades, its application has spanned several discipline areas such as information theory [15,16], systems theory [17,18], mathematical programming [19], and statistical physics [20].As this new general theory reveals the capability of defining a new perspective on existing questions, many researchers are extending their work on this geometric theory of information to new areas of application and interpretation.For example, in the area of communication coding/decoding and channel analysis, the most important milestone is the work of Richardson where the geometric perspective clearly indicates the relationship between turbo decoding and maximum-likelihood decoding [21], while a geometric characterization of multiuser detection for synchronous direct-sequence code division multiple access (DS/CDMA) channels is presented in [22]; In the area of estimation and filtering, Smith [23] studied the intrinsic Cramér-Rao bounds on estimation accuracy for the estimation problems on arbitrary manifolds, where the set of intrinsic coordinates are not apparent.Srivastava et al. [24,25] addressed the geometric subspace estimation and target tracking problems under a Bayesian framework.In particular, in the area of hypothesis testing and signal detection, the geometrical interpretation of multiple hypothesis testing problems in the asymptotic limit was developed by Westover [26], while the interpretation of Bayes risk error in Bayesian hypothesis testing was demonstrated by Varshney [27] and it has been proved that the Bayes risk error is a member of the class of Bregman divergences.Furthermore, Barbaresco et al. [28][29][30] developed a new Matrix constant false alarm rate (CFAR) detection method based on Bruhat-Tits complete metric space and upper-half Siegel space in symplectic geometry, which improves the detection performance with respect to the classical CFAR detection approach based on Doppler Filter Bank or fast Fourier transform (FFT).
Information geometry opens a new prospective to study the intrinsic geometrical nature of information theory and provides a new way to deal with existing statistical problems.For signal detection in statistical signal processing, information geometry can also provide new viewpoints and better understandings in the analysis of the detection process where both hypotheses and detector can be regarded as geometrical objects on the statistical manifold.In this paper, geometrical interpretations of deterministic and random signal detection are presented from the viewpoint of information geometry.Under such interpretations, the optimal detector is a "generalized minimum distance detector" where the decision is made by comparing distances from the signal distribution estimates to two hypotheses in the sense of the Kullback-Leibler divergence (KLD).Results, presented in this paper, illustrate that the generalized minimum distance detector provides a consistent but more comprehensive means to understand signal detection problems compared to the likelihood ratio test.An example of the geometry based detector in radar CFAR detection is presented, which shows its advantage over the classical processing method.
In the next section, the equivalence between likelihood ratio test and the Kullback-Leibler divergence is discussed, followed by the geometric interpretation of classical likelihood ratio test from the viewpoint of information geometry.In Section 3 and Section 4, the geometry of deterministic and random signal detection as well as the detection performance is elucidated geometrically, respectively.An example of the geometry based detector in radar CFAR detection is presented in Section 5. Finally, conclusions are made in Section 6.

Equivalence between Likelihood Ratio Test and Kullback-Leibler Divergence
In statistics, the likelihood ratio test is a basic form for hypothesis testing.Suppose x 1 , x 2 , . . ., x N are N independent and identically distributed (i.i.d.) observations from a statistical model whose probability density function is q(x), i.e., x 1 , x 2 , . . ., x N i.i.d.
∼ q(x).Consider two hypotheses for q(x), denoted by p 0 (x) and p 1 (x), which are referred to as the null hypothesis and alternative hypothesis, respectively.Then, the binary hypothesis testing problem is to decide between two possible hypotheses based on the observed data x 1 , x 2 , . . ., x N [1].The likelihood ratio for the underlying hypothesis test problem is The normalized log likelihood ratio can be represented by The normalized log likelihood ratio l is itself a random variable, and is an arithmetic average of N i.i.d.random variables l i = ln p 1 (x i ) p 0 (x i ) .According to the strong law of large numbers [31] that for large N, as where P denotes the convergence in probability, E is the expectation of random variables.As l i , i = 1, . . ., N are i.i.d.random variables and asymptotically l = E l i = q(x) ln p 1 (x) p 0 (x) dx = q(x) ln p 1 (x) p 0 (x) q(x) q(x) dx = q(x) ln q(x) p 0 (x) The quantity q(x) ln q(x) p(x) dx is known as the Kullback-Leibler divergence (KLD) [32] from q to p, which can be denoted by The Kullback-Leibler divergence is widely used to measure the similarity (distance) between two distributions and is also termed the relative entropy [33] in information theory.It should be noted that the Kullback-Leibler divergence is not a genuine metric of distance as it fails to be symmetric or satisfy the triangle inequality, which is justified by the Pinsker-Csiszar Inequality [34].However, it has been proved that the KLD plays a central role in the theory of statistical inference [8,9].
The Kullback-Leibler divergence can be regarded as the expected log-likelihood ratio.The equivalence between the likelihood ratio and Kullback-Leibler divergence is indicated by Equation (4) as l = D(q p 0 ) − D(q p 1 ).
Therefore, for large N, the classical log likelihood ratio test ln L γ can be performed by D(q p 0 ) − D(q p 1 ) In particular, for γ = 0 where the prior probabilities of two hypotheses are equal, the decision becomes In such a case, the likelihood ratio test is equivalent to choosing the model that is "closer" to q in the sense of the Kullback-Leibler divergence.This is referred to as a minimum distance detector.For the general case given by Equation (7), the test can be regarded as a generalized minimum distance detector with a threshold γ .Similar results can also be found in [26,33], where the analysis is performed on a probability simplex.In practical applications, typically in radar and sonar systems, the Neyman-Pearson criterion [35] is employed so that the detector is designed to maximize the probability of detection P D for a given probability of false alarm P F .In such a case, the probability of detection, i.e., the miss probability P M of the detector, is especially concerned as a performance index of the detector.Generally speaking, the miss probability of the Neyman-Pearson detector decays exponentially with a fixed level as the sample size increases, and the error exponent K is a widely used measure as the rate of exponential decay [36], i.e., under the given false-alarm constraint.Stein's lemma [33,37] shows that the best error exponent under a fixed false-alarm constraint is given by the Kullback-Leibler divergence D(p 0 p 1 ) from the distribution of hypothesis p 0 to p 1 , i.e., where .
= denotes equality to first order in the exponent, i.e., a n .= b n means lim n→∞ 1 n log a n b n = 0.The error exponent provides a good performance index for Neyman-Pearson detectors in the large sample regime.Furthermore, discussions of the error exponent and its properties are addressed in [36].In the following sections of this paper, the above notions will be demonstrated through discussions on the geometry of deterministic and random signal detection.

Geometric Interpretation of the Classical Likelihood Ratio Test
The previous analysis of the geometry of hypothesis testing is performed on a probability simplex [38], which is a generalization of the notion of a triangle or tetrahedron to arbitrary dimension.The vertex labels of a probability simplex correspond to a single probability set to 1, while edge labels correspond to a sum of two probabilities set to 1 and triangular faces correspond to a sum of three probabilities set to 1.The probability simplex provides a simple structure to analyze testing problems of probability distributions by mapping the type (empirical histogram) of data to a point on the simplex.However, it is not accomplished in dealing with continuous probability distributions as well as a complete structure to perform higher level of analysis.In a sense, information geometry provides a more powerful mathematical tool to deal with statistical problems in a geometrical manner, in which a parameterized family of probability distributions can be conveniently referred to as a statistical manifold with precise geometrical structure.
As mentioned earlier, the essence of a basic hypothesis testing problem can be regarded as discrimination between two identically distributed distributions with different parameters.From the viewpoint of information geometry, probability distributions of hypotheses can be conveniently regarded as two elements of a statistical manifold.Consider the parameterized family of probability distributions S = { p(x|θ )}, where x is a random variable and θ = (θ 1 , . . ., θ n ) is a parameter vector specifying the distribution.The family S is regarded as a statistical manifold with θ as its (possibly local) coordinate system [2].
Figure 1 illustrates the definition of a statistical manifold.For a given state of interest θ in the parameter space Θ ∈ R n , the measurement x in the sample space X ∈ R m is a realisation of to a probability distribution p(x|θ ).Each probability distribution p(x|θ ) is labelled by a point s(θ) on the manifold S. The parameterized family of probability distributions S = { p(x|θ )} forms an n-dimensional statistical manifold, where θ plays the role of a coordinate system of S.
For a sequence of observations x = (x 1 , x 2 , . . ., x n ) T , the binary hypothesis testing problem is that of deciding whether this sequence is originated from a source (hypothesis H 1 ) with a probability distribution p(x|θ 1 ) or from a source (hypothesis H 0 ) with a probability distribution p(x|θ 0 ), based on observations x.The probability distribution p(x|θ i ), i = 1, 2, associated with the hypothesis H i , is a element of a parametric family of probability density functions S = { p(x|θ ), θ ∈ Θ}, where Θ ∈ R n is the parameter set.This family of distributions S = { p(x|θ )} makes up of a statistical manifold, where the parameter θ is the coordinate of the measurement distribution on the statistical manifold.Distributions of the two model (hypotheses) p(x|θ 0 ) and p(x|θ 1 ) can be regarded as two "datum marks" on the manifold, while the distribution p(x|θ) summarized from observations x is an estimated distribution on the manifold.Then, the above hypothesis testing problem becomes a discrimination problem where the decision is made by comparing distances from the signal distribution estimates to two hypotheses in the sense of the Kullback-Leibler divergence, i.e., selecting the model that is "closer" to the signal distribution estimates.The geometric interpretation of classical likelihood ratio test from the viewpoint of information geometry can be illustrated by Figure 2. The above analysis can be extended to the complex parameter θ and/or complex measurement x.

Signal Model and Likelihood Ratio Test
The detection of a known deterministic signal in Gaussian noise is a basic detection problem encountered in statistical signal processing.Meanwhile, in radar applications, it usually involves the complex data.Consider the detection of a known deterministic signal in complex Gaussian noise.The hypothesis testing problem is [1] H 0 : where s(n) is a known complex signal and w(n) is a complex correlated Gaussian noise with zero mean.The above discrete quantities can be written as the vector forms, , . . ., x(N − 1) T , s = s(0), s(1), . . ., s(N − 1) T , w = w(0), w(1), . . ., w(N − 1) T .
Then, under the two hypotheses, the distributions of measurement error can be represented by where Σ is the covariance of noise vector w and CN denotes the complex Gaussian distribution.Consequently, the underlying detection problem for a known deterministic signal can be regarded as discrimination between two distributions with the same covariance but different means.According to the Neyman-Pearson criterion, the likelihood ratio test reduces to deciding H 1 if where T(x) is the test statistic and H denotes the complex conjugate transpose.The detector described by Equation ( 14) is referred to as a generalized replica-correlator or generalized matched filter [1].

Geometry of Deterministic Signal Detection
As the vectorial measurement x is from either of two hypotheses described in Equation ( 13), its covariance is Σ for either hypothesis.However, the mean of the measurements x is unknown and should be estimated in order to decide which hypothesis is true.The measurement distribution can be summarized (estimated) by where the mean is estimated by the mean of measurements, e.g., x.
The two datum marks on the manifold can be respectively denoted by For two multivariate Gaussian distributions p 0 = CN 0 µ 0 , Σ 0 and p 1 = CN 1 µ 1 , Σ 1 , the Kullback-Leibler divergence from p 0 to p 1 can be calculated via the closed form as [32] where "tr" and "det" denote the trace and determinant of a matrix, respectively.According to Equation ( 17), the KLDs from the measurement distribution p(x|θ) to hypotheses p(x|θ 1 ) and p(x|θ 0 ) are given by Thus, the distance difference is By incorporating the non-data-dependent term into the threshold we decide H 1 if where D(x) is the test statistic of the generalized minimum distance detector.By comparing the results given by Equation (14) and Equation ( 21), the detector defined by the generalized minimum distance detector coincides with the result given by the likelihood ratio test.However, the former illustrates a clear geometric interpretation of the detection problem in terms of its discrimination nature.
For above detectors, the test statistic is a complex Gaussian random variable under either hypothesis, being a linear transformation of x.Its statistics can be represented by where we have noted that s H Σ −1 s is real.
For a given probability of false alarm P F , the threshold is given by where Q(•) is the right tail cumulative distribution function of the standard Gaussian distribution.
The corresponding probability of detection is given by [1] where the deflection coefficient is defined by d 2 = 2s H Σ −1 s, which can be regarded as the signal-to-noise ratio (SNR) of the underlying detection problem.Equation (24) indicates that the probability of detection increases monotonically with s H Σ −1 s.
From the geometrical viewpoint, as shown in Equations ( 10) and ( 11), the performance of the Neyman-Pearson detector under a given false-alarm constraint can be measured by the error exponent, which is associated with the Kullback-Leibler divergence between two hypothesis distributions.In fact, in the deterministic signal detection example, the deflection coefficient d 2 is equivalent to the KLD D(p 0 p 1 ) from p(x|θ 0 ) to p(x|θ 1 ), which denotes distributions of the two hypotheses, as The result in Equation (25) indicates that the detection performance depends on the distance between two hypothesis distributions, which gives a clear geometric meaning of the detection performance by viewing the detection process as a discrimination between two hypotheses.

Geometry of Random Signal Detection
In the last section, we discussed the deterministic signal detection problem as well as its geometry.It is able to detect signals in the presence of noise by detecting the change in the mean of a test statistic.This is because the presence of a deterministic signal alters the mean of the received data.However, in some cases, the signal is more appropriately to be modeled as a random process with a known covariance structure.In this section, we consider the detection from the random signal model as well as the geometry of random signal detection.

Signal Model and Likelihood Ratio Test
The signal model of a random signal with known statistics is similar to the deterministic case given by Equation (12).The only difference is that the signal s(n) is assumed as a zero-mean complex Gaussian random process, and under the two hypotheses, the distributions of measurement error can be represented by where Σ s and Σ w denote the covariance matrices of the random signal and noise, respectively.Consequently, the underlying detection problem for a random signal with known statistics in the presence of noise can be regarded as discrimination between two distributions with the same mean but different covariance matrices.The Neyman-Pearson detector decides H 1 if the likelihood ratio exceeds a threshold or if where T(x) is the test statistic.

Geometry of Random Signal Detection
As measurements x are from either of the two hypotheses described in Equation ( 26), its mean is zero for either hypothesis.However, the covariance of x is uncertain and should be estimated in order to decide which hypothesis is true.Assume that both the signal and noise are zero mean wide sense stationary (WSS) random process; then, the covariance of data x is defined by where r k = E[x n x n+k ] is called the correlation coefficient and x denotes the complex conjugate of x.R is a Toeplitz Hermitian Positive Definite matrix with R H = R.According to the ergodicity of a WSS random process, the correlation coefficients of the signal can be calculated by averaging over time instead of its statistical expectation, as Thus, the measurement error distribution can be summarized (estimated) by where the covariance R is estimated from measurements x.
Then, the two datum marks on the manifold can be, respectively, denoted by Using Equation ( 17), the KLDs from the measurement distribution p(x|θ) to hypotheses p(x|θ 1 ) and p(x|θ 0 ) are given by Consequently, the distance difference is By incorporating the non-data-dependent term into the threshold, we decide H 1 if where D(x) is the test statistic of the generalized minimum distance detector.
As the test statistic in Equation ( 27) is a quadratic form in the data and thus will not be a Gaussian random variable, the detection performance of the detector is quite often difficult to determine analytically [1], especially when the signal is a correlated random process with arbitrary covariance matrix.While the detector given in Equation ( 35) is associated with an estimated covariance R based on measurements, analysis of its detection performance is also non-trivial.Thus, here, the performance comparison of two detectors is performed via Monte Carlo simulations.In addition, 1,000,000 Monte Carlo runs are performed and the performances of two detectors are illustrated in Figure 3.In the simulation, the number of measurement samples is N = 100 and the probabilities of false alarm P F are assumed P F = 10 −4 , 10 −5 , respectively.Figure 3 illustrates that the detection performance of two detectors coincide with each other, i.e., the likelihood ratio-based detector is equivalent to the geometry-based detector.The error exponent of the above Neyman-Pearson detector under the fixed false-alarm constraint is given by the Kullback-Leibler divergence from p 0 to p 1 , as As a special case, when the signal and noise are both white random processes with variances σ 2 s and σ 2 w , the error exponent is simplified by where SNR = σ 2 s σ 2 w denotes the signal-to-noise ratio.As the error exponent is monotonically increasing as SNR increases.Moreover, as shown in Figure 4, the error exponent K increases linearly with respect to SNR at high SNR.The result indicates that the detection performance improves as the divergence between two hypotheses increases, which gives a clear geometric interpretation of the detection performance of the Neyman-Pearson detector.

Application to Radar Target Detection
In this section, the application of geometry based detector to radar constant false alarm rate (CFAR) detection with pulse Doppler radar is presented.The performance between the geometry based CFAR detector and the classical cell-averaging CFAR (CA-CFAR) detector is compared.As illustrated in Figure 5, in the classical CA-CFAR detector for pulse Doppler radar, the data z i in each range resolution cell is obtained from a square-law detector of Doppler filter banks, where the Doppler power spectral density of the sample data x is estimated by fast Fourier transform (FFT).The CA-CFAR detector performs by comparing the data z D in the test cell to an adaptive detection threshold γ such that a CFAR is maintained.The detection threshold γ is determined by estimating the background clutter power of the test cell and multiplying a scaling factor T based on the desired probability of false alarm P F .The power of the background clutter is estimated by the arithmetical mean of the data in the background range cells, which is given by Due to the poor Doppler resolution as well as the energy spread of the Doppler filter banks, the classical CA-CFAR detector based on Doppler spectral estimation suffers from a severe performance degradation.Barbaresco et al. [28] developed a matrix CFAR detector based on Cartan's geometry of the symmetric positive-definite (SPD) matrices space, which improves the detection performance of the classical CA-CFAR detector based on Doppler filter banks.As illustrated in Figure 6, the data R i in each resolution cell is a covariance matrix estimated by the sample data x according to Equation (28).Then, calculate the distance between the covariance matrix R D of the cell under test and the mean matrix R of the reference cells around the cell under test.Finally, the detection is made by comparing the distance between R D and R with an adaptive detection threshold γ, which is related to the clutter power level as well as a multiplier corresponding to the desired probability of false alarm P F .In [28], the Riemannian distance measure is used when calculating the distance as well as the mean matrix, as the geometry of the manifold of SPD matrices is considered.The Riemannian distance between the covariance matrix of the cell under test and the mean matrix of reference cells is calculated by where λ k is the k th eigenvalue of R −1/2 D RR −1/2 D .The Riemannian mean of a set of SPD matrices {R 1 , R 2 , . . ., R M } is calculated via the minimum sum of distance between R and R i as [39] where d(•, •) represents the Riemannian distance between two matrices.One approach to solve Equation ( 41) is to use an iterative gradient algorithm that is based on the Jacobi field and the exponential map.In summary, the Karcher barycenter based Riemannian mean is given by the following iteration [40] where η k denotes the step-length of iterations.Then, the matrix CFAR detector based on Riemannian distance can be performed as The matrix CFAR detector proposed in [28] improves the performance of the classical CA-CFAR detector.However, as discussed in Section 2, the likelihood ratio test is equivalent to the generalized minimum distance detector in terms of the Kullback-Leibler divergence, which indicates that the Kullback-Leibler divergence is suitable for detection applications.Thus, it is necessary to compare the performance of matrix CFAR detector in terms of the Riemannian distance measure and Kullback-Leibler divergence.The Kullback-Leibler divergence between two matrix is given by Equation (17), where the mean of a set of SPD matrices {R 1 , R 2 , . . ., R M } is calculated by The performance of: (i) the classical CA-CFAR detector based on Doppler spectral estimation, (ii) the Riemannian distance-based matrix CFAR detector, and (iii) the Kullback-Leibler divergence-based matrix CFAR detector are compared via Monte Carlo simulations.The data samples are from N = 7 received pulses.A total of 50 range cells are considered, while M = 14 range cells are used for averaging.One moving target with Doppler frequency f d = 2.65 Hz is located at the 19th range cell.As the Gaussian assumption is no longer met in many situations of practical interest, such as the high-resolution radar and sea clutter, a compound-Gaussian model is more appropriate to describe the non-Gaussian radar clutter.The compound-Gaussian clutter can be written as the product of two mutually independent random processes, where the fast fluctuating 'speckle' component is a zero-mean complex Gaussian process and the comparatively slow varying 'texture' component is a nonnegative real random process that describes the underlying mean power level of the resultant clutter process [41,42].The most common clutter models, such as the Weibull and K-distribution, are compatible with the compound-Gaussian model.In the simulation, the clutter is assumed as a Weibull distribution with scale parameter α = 1 and shape parameter β = 3.
The performance of the three detectors are presented in Figure 7.In addition, 100,000 Monte Carlo runs are performed in the simulation.The probabilities of false alarm P F are assumed to be 10 −3 , 10 −4 , respectively.For the underlying detectors, the detection thresholds are not only related to the background clutter power but are also associated with the threshold multiplier, and a closed form expression of the threshold multiplier does not usually exist.Moreover, for Weibull distributed clutter, the threshold multiplier of the underlying detector depends on the desired probability of false alarm P F , the number of bins averaged M, as well as the shape parameter β [43].Consequently, it is difficult to determine the detection threshold analytically.In the simulation, the detection threshold is obtained via Monte Carlo runs.The test statistic in the absence of target is calculated via 100,000 Monte Carlo runs.As the effects of the clutter power, the number of bins averaged M and the shape parameter β are included in the test statistic; then, the detection threshold is determined according to the given probability of false alarm P F .The result illustrates that both the two geometry based detectors outperform the classical CA-CFAR detector.Moreover, the performance of the Kullback-Leibler divergence-based matrix CFAR detector is better than the Riemannian distance-based matrix CFAR detector with 4-6 dB signal-to-clutter ratio (SCR) improvement.
Figure 8 illustrates a geometrical interpretation of the classical and geometry-based detection procedures.The classical CA-CFAR detector performs in the Euclidean space, where the Euclidean distance measure is used to define the divergence between two elements.However, the geometry based detector performs in the statistical manifold, in which the intrinsic divergence of the covariance matrices can be properly exploited and utilized, thus improving the detection performance.Different distance measures adopted in detectors may result in different performance of detection.The Riemannian distance and Kullback-Leibler divergence are two measures used to evaluate the divergence between the data under test and clutter.The quality of the estimated Riemannian mean and Kullback-Leibler mean of SPD matrices are analyzed as follows.As the accuracy of the estimated mean matrix is a function of the number of samples (SPD matrices) used for averaging [40], when the number of samples M tends to infinity, the true mean matrix is obtained.In the simulation, 1000 samples are used to approximately estimate the true mean matrix.Then, for the case of a few number of samples available in detection, the distances between the estimated mean of few samples and the true mean matrix in terms of the Riemannian distance as well as Kullback-Leibler divergence are calculated via 300 Monte Carlo runs.The average of the 300 distances is presented as a function of the number of samples in Figure 9.It represents the distance (divergence) between the estimated mean matrix and the true mean matrix.
It can be observed that both of the Riemannian mean and Kullback-Leibler mean asymptotically approach the true mean matrix as the number of samples for averaging increases.It seems that the Kullback-Leibler mean is of smaller divergence to the true mean matrix than Riemannian mean in quantity.However, as Riemannian distance and Kullback-Leibler divergence are two different distance measures, the values of the two measures are incomparable.As discussed in the geometric interpretation of classical likelihood ratio test presented in Section 2, the likelihood ratio test is equivalent to the generalized minimum distance detector in the sense of Kullback-Leibler divergence, rather than Riemannian distance, which provides evidence that the KLD plays a central role in the theory of statistical inference.Therefore, Kullback-Leibler divergence is probably more appropriate to the case of hypothesis testing than Riemannian distance measure, which brings a performance improvement in detection.

Conclusions
In this paper, the problem of hypothesis testing in the Neyman-Pearson formulation is considered from a geometric viewpoint.In particular, we have described a concise geometric interpretation of deterministic and random signal detection in the philosophy of information geometry.In such a framework, both hypotheses and detectors can be regarded as geometrical objects on the statistical manifold of a parameterized family of probability distributions.Both the detector and detection performance are elucidated geometrically in terms of the Kullback-Leibler divergence.Compared to the likelihood ratio test, the geometric interpretation provides a consistent but more comprehensive means to understanding signal detection problems.Such methodology may play an important role in the development and understanding of the theoretical aspects of algorithms in radar signal processing.

Figure 2 .
Figure 2. Geometry of the classical likelihood ratio test.

Figure 5 .
Figure 5. Classical cell-averaging constant false alarm rate (CA-CFAR) detector based on Doppler spectral estimation.

Figure 8 .
Figure 8. Geometrical interpretation of the classical CA-CFAR detector and geometry-based detector.