1. Introduction
Nonparametric association measures serve as indispensable tools in statistical analysis, especially when handling non-Gaussian distributions or intricate dependence structures. Spearman’s footrule rank correlation coefficient [
1], a rank-based statistic, has recently regained prominence for its resilience to parametric assumptions and ease of interpretation [
2,
3,
4]. By aggregating absolute differences between paired ranks, this metric captures permutation-based disorder while circumventing limitations of linear correlation measures like Spearman’s rho correlation coefficient.
Four key attributes solidify Spearman’s footrule as a versatile analytical instrument: (i) computational efficiency (
complexity), surpassing quadratic-time alternatives like Kendall’s tau; (ii) sensitivity to positional deviations, crucial for applications prioritizing top-ranked items; (iii) intuitive interpretation through normalized rank displacement metrics; and (iv) enhanced outlier resistance compared to Euclidean-based counterparts. These properties enable diverse applications: genomic reproducibility analysis under noisy conditions [
5], ranked list comparison in information retrieval [
6,
7], and uncertainty-aware consensus ranking in preference learning [
8]. Its adaptability further extends to gene expression studies [
9] and bioinformatics workflows [
10], demonstrating broad interdisciplinary utility.
Recent decades have witnessed considerable efforts to extend Spearman’s footrule to multivariate contexts. Úbeda-Flores (2005) [
11] introduced a copula-based multivariate generalization that preserves interpretability, though its computational complexity limits its practical utility. Genest et al. (2010) [
12] further analyzed the theoretical properties of Spearman’s footrule and Gini’s gamma, emphasizing persistent challenges in developing efficient multivariate tests and establishing tight bounds. While the range of the lower bound was theoretically established, the complete characterization of copulas achieving this range remained unresolved until [
13] identified sparse copula structures attaining its minimum value.
Despite these advancements, significant limitations persist. Current multivariate extensions of Spearman’s footrule, while theoretically robust, often depend on intricate copula formulations (e.g., [
11,
14]), compromising their accessibility and practical implementation. To address this gap, we propose a novel multivariate testing approach prioritizing simplicity and computational feasibility. Our approach begins by constructing a
correlation matrix through pairwise computation of Spearman’s footrule coefficients between all components of two
p- and
q-dimensional random vectors. This matrix encapsulates rank-based dependencies in a structure analogous to classical correlation matrices. We formally demonstrate that, under independence assumptions, its elements jointly converge to a multivariate normal distribution using the Cramér–Wold device and asymptotic representation techniques, thereby extending univariate normality results to multivariate settings. A global test statistic derived from the Frobenius norm of this matrix asymptotically follows a weighted sum of chi-square distributions. However, recognizing the impracticality of critical value tabulation for this complex distribution, we recommend a permutation-based testing procedure. This method empirically approximates the null distribution, offering enhanced robustness and scalability in finite-sample applications.
Simulation studies confirm that the permutation approach maintains well-controlled Type I error rates and demonstrates superior power compared to existing methods, while applications to two real-world datasets illustrate its practical effectiveness in detecting dependencies. This work bridges the gap between theoretical development and pragmatic implementation, providing a scalable and robust framework for multivariate nonparametric inference.
The remaining sections are organized as follows:
Section 2 formally introduces the multivariate footrule correlation matrix, establishes its joint asymptotic normality, and outlines the permutation-based testing procedure.
Section 3 and
Section 4 evaluate the method through simulations and real-data demonstrations, respectively, while
Section 5 concludes with a discussion of implications and potential extensions. Technical proofs are deferred to
Appendix A and
Appendix B to provide additional simulations for covariance estimation. The codes implementing the simulation studies are available online.
2. Multivariate Extension of Independence Test
Consider two real-valued, continuous random vectors, and , with fixed dimensions of p and q, respectively. Suppose n independent and identically distributed (i.i.d.) observations are from , where , for .
We begin by revisiting the bivariate Spearman’s footrule. For a sample
from the paired variables
with their marginal distribution functions
and
(
), let
and
denote the ranks of
and
, respectively. The bivariate Spearman’s footrule, tailored for the two scalars,
and
, is then given by the following expression:
Thus, based on Equation (
1), the Spearman’s footrule correlation matrix applied to vectors
and
can be defined as
In this study, we will concentrate on the utilization of
in conducting independence tests. To be specific, we will explore the following null and alternative hypotheses based on
n i.i.d. observations.
For ease of notation, we denote the set of observations as
. To examine Equation (
2), the multivariate Spearman’s footrule rank test statistic, which is obtained by aggregating the
squared and standardized bivariate Spearman’s footrules from the correlation matrix
, takes the specific form
where
is the variance of
under the independence assumption, which can be directly calculated using the results from [
15]. The notation
represents the Frobenius norm of a matrix; specifically, for a
matrix
with elements
and
, the Frobenius norm is computed as
.
Furthermore, for any matrix , is employed to denote the column vector by vertically stacking the columns of matrix . The theorem presented below provides the asymptotic joint normality property of the elements in matrix .
Theorem 1. Under the null hypothesis , converges in distribution to a -dimensional multivariate normal distribution with mean vector and covariance matrix Σ,
i.e., as , In the covariance matrix Σ, the diagonal entries are all equal to , whereas the off-diagonal elements are determined by , given that with and for , .
The derivation of Theorem 1 primarily relies on the Hájek asymptotic representation of the following form presented in [
16]:
where
is asymptotically equidistributed with
when
and
are independent, and it effectively removes the dependence among ranks, greatly facilitating the further development of the theory.
For the plug-in estimation of covariance matrix
, one may substitute the expectation of off-diagonal elements in
with corresponding sample means and replace the population distribution function with the corresponding empirical distribution function. The resulting estimator
has the following specific form: diagonal elements equal 2/5, and off-diagonal elements equal
for
,
, where
and
denote the empirical distribution functions of
and
, respectively, defined as
and
. The performance of this estimator is demonstrated through simulations in
Appendix B, which reveal that practical within-group dependencies and sample size significantly impact estimation accuracy. Consequently, we employ the random permutation technique (described later) when conducting hypothesis tests in real applications.
By leveraging the joint normality property of all elements within , as established by Theorem 1, we can readily derive the asymptotic null distribution of the multivariate Spearman’s footrule statistic.
Corollary 1. Under the null hypothesis , as ,where represents the eigenvalues of matrix Σ,
and for are independent standard normal random variables. Although Corollary 1 furnishes the asymptotic result for determining the critical values of the proposed test statistic, the unknown joint distribution of every pair of components within or complicates the intricate expectations in covariance matrix and renders the task of finding suitable estimates challenging. Although the commonly used plug-in estimation technique can serve as an alternative estimation method, its performance is susceptible to both the sample size and the complex within-group dependencies. Consequently, the critical values derived in Corollary 1 are impractical and cannot serve as valid critical values for the testing procedure. To address this issue, we can employ random permutation to conduct the test. The specific procedure and algorithm (Algorithm 1) are outlined as follows:
- Step 1:
Given the dataset of observations specify the number of permutations, denoted as B.
- Step 2:
For each , randomly generate two permutations, and , of the index set .
- Step 3:
Construct the b-th permuted dataset based on the generated permutations.
- Step 4:
Calculate the b-th permutation-based statistics using the permuted dataset.
- Step 5:
Repeat Steps 2–4
B times. Utilize the collection of statistics
to approximate the
p-value of the test as follows:
- Step 6:
For a prespecified significance level , if , reject the null hypothesis .
3. Simulations
To evaluate the performance of the multivariate footrule test statistic proposed in this paper (denoted as Mfootrule and given in Equation (
3)), we conduct a series of simulations in this section using four synthetic examples. These examples encompass 2 models for investigating the validity of tests under the null hypothesis, 12 models for examining the power, and 4 varying models designed to visualize the trend of test power when data are generated under the alternative hypothesis.
For the purpose of comparison, we select several commonly employed methods to test the independence of two multivariate vectors. These include distance covariance (DCOV) [
17] and its marginal rank-based version (RDCOV) ([
18]), the Hilbert–Schmidt Independence Criteria (HSIC) [
19], along with two additional methods that bear similarity to the construction of our proposed statistic (they are, respectively, based on Spearman’s
and Kendall’s
, as referenced in [
20,
21], denoted as Mrho and Mtau, respectively). All methods perform tests using permutation-based approaches with 1000 permutations, including our proposed method which specifically utilizes the permutation testing procedure described in Algorithm 1. Two sample sizes are set,
and
, with dimensions of
. A significance level of 0.05 is set for all scenarios, with 1000 replicate simulations conducted. Detailed information on data generation can be found in the following four examples.
Algorithm 1: Permutation-based algorithm for multivariate Spearman’s footrule test |
![Mathematics 13 01527 i001]() |
Example 1 (Data generated under
).
In this example, in order to assess the validity of various testing methods, Gaussian distribution and non-Gaussian heavy-tailed distribution are employed to generate data under the null hypothesis. Additionally, we introduce within-group dependence to examine its impact on the empirical size of the tests. These two distributions are similar to the setups in Examples 6.1 and 6.2 of [22]. All of the empirical sizes are presented in Table 1.- (a)
(Gaussian) ∼, where the entry of the covariance matrix is defined as with and representing the strengths of within-group and without-group dependence, respectively. In this model, data under the null hypothesis are generated by setting while allowing τ to vary.
- (b)
(Heavy-tail) Data generation is conducted independently from , such that the components of and are given by for , and by for . In this context, represents the quantile function of the t-distribution with one degree of freedom, Φ is the cumulative distribution function of the standard Gaussian distribution, and is generated in the same manner as described in Example 1(a).
From
Table 1, it is evident that the proposed method, along with all of the methods employed for comparison, demonstrates good control over the empirical size across various levels of within-group dependence and different sample sizes, whether under Gaussian or heavy-tailed distributions. This is because both the proposed method and the comparative approaches utilize techniques based on random permutation or resampling to accurately estimate the null hypothesis distribution, thereby preventing distortion of the empirical size and confirming the validity of all tests.
Example 2 (Data generated under in Example 1 with ). In this example, the data generation for the two models follows the same procedure as in Example 1(a) and Example 1(b), except that ρ is set to 0.2 to generate data under the alternative hypothesis.
The data presented in
Table 2 indicate that as the within-group dependence increases, all empirical rejection rates decrease, which is a normal phenomenon, since within-group dependence interferes with between-group dependence. Another notable observation is that all rank-based tests (including our proposed Mfootrule) demonstrate clear advantages under both Gaussian and heavy-tailed distributions. This is because rank-based tests, being insensitive to outliers, exhibit superior robustness in heavy-tailed distributions. Although our Mfootrule exhibits negligible disadvantages, these are inconsequential, especially considering that methods like Mrho and Mtau specialize in linear dependency detection for Gaussian models. In contrast, non-rank-based methods (DCOV and HSIC) perform poorly in heavy-tailed models due to their lack of robustness. The rank-based DCOV (RDCOV) also performs remarkably well but shows slight power loss compared to the original DCOV under Gaussian models, likely because it uses marginal ranks without fully accounting for interactions between within-group ranks. HSIC exhibits the lowest power in Gaussian models due to its limited linear dependency detection capability, and while it shows marginal advantages over DCOV in heavy-tailed models, its performance still lags behind our proposed methods and specialized linear dependency tests.
Example 3 (Data generated from various general alternative models under
).
In this example, additional general alternative models are generated to examine the capability of the proposed methods and the comparison methods in rejecting the null hypothesis. The construction involves , where for . There are 10 specific models that generate the distribution of , and some of these distributions are also taken into account in [23] for testing the independence of two vectors. These distributions can be categorized into four types: the first type is distributions with heavy-tailed dependence (V1–V3), the second type is distributions where the dependence follows a Gaussian or non-Gaussian mixture (V4–V6), the third type exhibits noisy functional dependence (V7–V8), and the fourth type features shape-based dependence (V9–V10). The detailed model settings are as follows, and all of the results are presented in Table 3.- (V1)
(Heavy-tailed): Let , , and . Then, and , with V, , and being mutually independent.
- (V2)
(Heavy-tailed): Let , , and . Then, and , with V, , and being mutually independent.
- (V3)
(Heavy-tailed): Let , , and . Then, and , with V, , and being mutually independent.
- (V4)
(Mixture): Let , and . Then, , with X, E, and V being mutually independent.
- (V5)
(Mixture): Let , and . Then, and . Finally, and , with W, , , and A being mutually independent.
- (V6)
(Mixture): Let and , where these two vectors are independent. The covariance matrix Σ has entries such that and if or . Further, and . Then, and .
- (V7)
(Quadratic function): Let and . Then, , with X and ϵ being independent.
- (V8)
(Fractional exponential function): Let and . Then, , with X and ϵ being independent.
- (V9)
(Semicircle): Let . Then, and .
- (V10)
(Two parabolas): Let . Then, , , with V and X being independent.
According to the results in
Table 3, except for Models V7 and V8, where our proposed test is less proficient (although Model V8 only shows a slight disadvantage), our Mfootrule test outperforms all competitors by a significant margin among the remaining eight models. Among the two tests based on classical coefficients across all models, the Mtau test slightly outperforms the Mrho test, which is reasonable given their well-established performance in the bivariate case.
In heavy-tailed models V1-V3, which differ from the Gaussian-transformed heavy-tailed models in Example 2, the DCOV-based test completely fails even with increased sample sizes, while the marginal rank-based RDCOV maintains robust performance. This reflects the inherent robustness of rank-based methods against heavy-tailed distributions. Our Mfootrule outperforms classical methods (Mrho and Mtau) due to its absolute distance property and rank-based advantage, whereas HSIC shows limited power only in Model V1 and fails entirely in V2–V3 compared to rank-based approaches. For mixture distributions in Models V4–V6, RDCOV retains moderate performance in V4 and V6. Notably, in Gaussian and non-Gaussian mixture models (V5), the remaining four tests (Mtau, Mrho, DCOV, HSIC) perform poorly. Some methods (Mrho, DCOV, HSIC) show no power improvement with larger samples, while Mfootrule demonstrates consistent advantages over competitors through its unique design. In the non-monotonic functional dependence model V7, classical rank-based coefficients (Mtau, Mrho) perform poorly, as expected—since they specialize in linear/monotonic relationships—but our method surprisingly outperforms them. Meanwhile, underperforming HSIC excels here, while RDCOV and DCOV show moderate performance. In the monotonic model V8, all tests perform adequately without notable differences. In the shape-dependent models V9–V10, our proposed method excels, whereas other approaches show minimal effectiveness. Even renowned methods (Mrho, Mrho, DCOV) demonstrate no power improvement with larger samples, or, at best, marginal gains, with only HSIC showing slight advantages.
Example 4 (Data generated from four varying alternative models). In this example, we generate four varying models to examine the trend of test power for all methods as the between-group dependence changes. Here, we set the sample size to , and the number of simulations to 500, with the other settings remaining the same as before. The specific data-generating models are as follows:
- (a)
(Gaussian) This model is identical to Example 1(a), but with and ρ varying from to 0.3.
- (b)
(Heavy-tailed) This model is identical to Example 1(b), but with and ρ varying from to 0.3.
- (c)
(Mixture) , and . Then, , with X, E, and V being mutually independent. This model extends the mixture model V4 from Example 3, where λ is a noise parameter. A larger λ implies weaker dependence between and , with representing noise.
- (d)
(Semicircle) . Then, and . This model extends semicircular model V9 from Example 3, where λ is a noise parameter. A larger λ implies weaker dependence between and , with representing noise.
The power curves of the four models are displayed in
Figure 1. As shown in
Figure 1a, all methods exhibit nearly identical performance for the Gaussian model, though our Mfootrule and HSIC exhibit virtually imperceptible slight disadvantages. In the Gaussian-transformed heavy-tailed model of
Figure 1b, all rank-based tests (including our proposed methods) demonstrate strong robustness, while non-rank-based DCOV and HSIC show inferior performance. These findings align with the results from Example 2. For the Gaussian mixture model in
Figure 1c, our method outperforms all competitors, as Mfootrule’s absolute difference-based distance metric effectively captures the mixture signals. In the semicircular model of
Figure 1d, all tests exhibit extremely low power, whereas our method maintains a clear lead, with power degradation remaining gradual even as between-group dependency increases.