1. Introduction
In reliability, the 
k-out-of-
n system forms an important technical structure that comprises series and parallel systems as special cases. In such a system, 
n components work simultaneously and may fail over time. The system, however, is operating as long as at least 
k of the components are operating, such that the 
-th component failure time coincides with the lifetime of the system. In a simple model, the component lifetimes are described, say, by 
n non-negative, independent, and identically distributed (iid) random variables with cumulative distribution function (cdf) 
F. The 
-th order statistic based on 
F then represents the system’s lifetime. To provide a more flexible modelling of the lifetimes of 
k-out-of-
n systems and their components, sequential order statistics (SOSs) were introduced in References [
1,
2], which allow to describe the impacts of failed components on the residual lifetimes of the remaining components. In particular, so called load-sharing effects can be modelled arising in systems, where all components (equally) share the total load of the system, and every component failure is likely to increase the stress put on the surviving ones. Obviously, modelling with order statistics is not adequate in applications with relevant impacts.
When SOSs are used as a load-sharing model for a 
k-out-of-
n system, statistical inference for unknown underlying quantities typically has to be performed, aiming at a good model fit. In a proportional hazard rate setup of SOSs, the uncertainty of the model is captured within some baseline cdf or within a finite number of positive model parameters, for which a variety of inferential results have been derived. Estimators of model parameters or distribution parameters of the baseline cdf of SOSs are provided in, e.g., References [
3,
4,
5,
6,
7,
8,
9,
10]. Statistical tests for single or vectors of the model or baseline-distribution parameters of SOSs are proposed in References [
3,
6,
7,
11,
12,
13], which, in particular, allow for model selection in the sense of whether order statistics have to be rejected for modelling in a given situation. More inferential results, along with the detailed model and structural properties, can be found in Reference [
14]. For nonparametric estimation and testing with SOSs, we refer to References [
15,
16,
17]. Bayesian inference for SOSs is discussed, for instance, in References [
18,
19,
20].
In this paper, continuing existing work on inference with SOSs, we develop statistical tests for the comparison of multiple k-out-of-n systems in the sense of whether they have model parameters or baseline-distribution parameters in common. In reliability and the particular context of load-sharing systems, sample sizes are often small going along with, e.g., low estimator accuracy. The proposed tests may be applied, for instance, to decide for a meta-analysis, i.e., a joint analysis of the data to improve the performance of subsequent inferential procedures, such as maximum-likelihood estimation of the unknown parameters.
The remainder of this article is as follows. First, we introduce the SOS model and review some basic properties (
Section 2). Based on multiple samples of SOSs modelling possibly differently structured systems, we then derive exact statistical tests to check for common load-sharing parameters (
Section 3.1) and for common baseline-distribution parameters (
Section 4.1). In the particular case of two samples, critical values for test statistics are provided for small sample sizes and, moreover, asymptotic results are addressed (
Section 3.2 and 
Section 4.2). A simulation study is carried out to compare the proposed tests in terms of power (
Section 5), and a concluding section finally highlights the main findings (
Section 6).
  2. Model and Basic Properties
SOSs were introduced in References [
1,
2] by means of a triangular scheme of independent random variables and certain recursive formulas. Meanwhile, other possible definitions have been proposed in the literature, for instance, based on independent power-function-distributed random variables (see Reference [
21]) or on counting processes (see Reference [
15]; cf. References [
22,
23]). For our purposes here, a likelihood approach to the model is sufficient.
Let 
 be absolutely continuous cdfs with 
 and corresponding density functions 
. Ordered random variables 
 (defined on a probability space (
) are called SOSs based on 
 if their joint density function with respect to (wrt) the Lebesgue measure is given by
      
      for all real numbers 
, where here and in the following, 
 for the sake of a simple representation. In that case, 
 form a Markov chain with transition probabilities
      
      for 
; see References [
1,
24]. In the distribution-theoretical sense, order statistics (based on 
) are included in the model and result by setting 
.
When choosing
      
      for some absolutely continuous cdf 
F, which is referred to as a baseline cdf, and positive numbers 
, we arrive at a semiparametric SOS model, in which 
 have proportional hazard rates, i.e.,
      
The conditional hazard rate of 
, given 
, is then
      
      for 
. In a load-sharing context, model parameters 
 may describe increasing stress put on surviving components in case of component failures in the following sense. All components start working at hazard rate 
. Then, upon the 
j-th component failure for 
, the hazard rate of any still-operating component changes from 
 to 
. An extensive account on the SOS model, including motivational aspects, structural results, and inference, is provided by Reference [
14].
When the component lifetimes of a 
k-out-of-
n system are recorded, the data are type-II right-censored if 
, such that marginal densities are naturally of some interest. Here, the marginal density of the first 
 SOSs based on 
 is given by
      
      which, by assuming Formula (
1), simplifies to
      
      where 
f denotes the density function of the baseline distribution; see, e.g., Reference [
14]. For brevity, let 
 denote the sample space in what follows.
  3. Testing for Equal Load-Sharing Parameters
For 
, we assume to have 
 observations of the first 
r component-failure times in an 
-out-of
 system. In sample 
, failure times are described by iid vectors 
, where 
 is distributed as the first 
r SOSs based on a known absolutely continuous cdf 
 with density function 
 and unknown model parameters 
 for 
 (see Formula (
2) and 
Table 1).
As an example, 
 may be specified as the standard exponential distribution for all 
 to assume exponential component lifetimes in all systems with varying scale parameters upon failures. Moreover, 
, 
, 
, are supposed to be independent, the joint density function of which can then be represented as
      
      where, for 
 and 
,
      
      and
      
      for 
 with 
, 
, 
. Here, 
 denotes the vector of all model parameters 
, 
, 
. It is well known that the statistics 
, 
, 
, are independent with 
, i.e., a gamma distribution with shape parameter 
 and scale parameter 
; see, e.g., Reference [
5]. Moreover, for 
 and 
, the maximum-likelihood estimator (MLE) of 
 is given by
      
      which can directly be seen by computing the first two derivatives of the log-likelihood function corresponding to Formula (
3); cf. References [
5,
7]. As a consequence, 
, 
, 
, are independent and inverse-gamma-distributed. In what follows, the vector of these MLEs is denoted by 
.
  3.1. Exact Tests
For 
, let 
 be nonempty index sets forming a partition of 
, and let 
 for at least one index 
. Then, Equation (
3) can be rewritten as
        
We aim at testing the null hypothesis:
		where, here and in the following, the alternative hypothesis is the negation of the null hypothesis (in the SOS model). Under null hypothesis (
5), the 
j-th model parameter is assumed to be constant in blocks, where blocks 
, refer to sample numbers, 
. 
Figure 1 illustrates such a situation. In the particular situation 
, the 
r model parameters are the same in all samples. For 
 and 
, null hypothesis (
5) refers to the two-sample case with 
 and 
 observed systems, which then simply reads
        
Note that by setting  for some , there is no assumption for model parameters  under the null hypothesis.
To derive the MLE of 
, 
, 
, under 
, let 
 be any index in 
 for 
 and 
. Then, we have to maximize expression
        
       wrt 
, 
, 
. Analysis of the first two derivatives yields that the maximum is attained at
        
        for 
 and 
. Hence, the MLE of 
, 
, 
, under 
 is given by
        
        and the corresponding vector of estimators is denoted by 
.
Now, the likelihood-ratio statistic 
 for testing null hypothesis (
5) is seen to be
        
As a competing test statistic to 
, we address the Rao score statistic 
 for testing null hypothesis (
5), which involves the score statistic 
 and the Fisher information matrix 
, where superscript 
t denotes transposition, and integration is done wrt 
; see, e.g., Reference [
25]. In our model, 
 is a diagonal 
-matrix with blocks 
, 
, since 
, 
, 
, are independent. Hence, by using Formula (
4),
        
The likelihood-ratio test and the Rao score test then reject the null hypothesis if the corresponding test statistics exceed certain critical values, which have to be determined from the distributions of  and R under the null hypothesis and from the significance level of the tests. For this, the following theorem is useful.
Theorem 1. For testing null hypothesis (5), Λ and R, given by Formulas (6) and (7), have single null distributions; i.e., under the null hypothesis, the distributions of both test statistics do not depend on the specific parameters.  Proof.   and 
R depend on the data only through the ratios
          
          with independent statistics 
, 
, 
. Evidently, 
, 
, 
, are, in turn, independent. Moreover, under 
, the statistics 
, 
, have a common scale parameter for 
 and 
, which implies that the distribution of 
 is free of 
 for 
 and 
. This yields the assertion. □
 As a consequence of Theorem 1, the exact critical values for 
 and 
R subject to a desired significance level can be obtained via Monte Carlo simulation by independently sampling from gamma distributions with scale parameters all equal to 1. For 
 systems, small sample sizes 
, and number 
 of pairs with matching model parameters under the null hypothesis, critical values are shown in 
Table 2 and 
Table 3 for a significance level of 5%. Note that these values do not depend on 
, 
, 
, or on 
r, and may, in particular, be used for a full statistical comparison of two arbitrary 
-out-of-
 systems, 
, with 
.
  3.2. Asymptotic Tests
We address asymptotic results for 
 systems and null hypothesis
        
        for some nonempty index set 
, in the case of which Formulas (
6) and (
7) simplify to
        
        where 
, 
, are independent statistics. Note that for 
 and 
, the likelihood-ratio test and the Rao score test are equivalent, since then
        
        and, hence, 
 and 
R are both strictly monotone functions of statistic 
.
As an overall assumption in this section, let 
 when the total sample size increases. Then, by the strong law of large numbers,
        
Moreover, the central limit theorem yields that
        
        where 
 is the unity (or identity) matrix in 
, 
 denotes the 
k-dimensional normal distribution with mean 
 and positive definite covariance matrix 
, and 
 means convergence in distribution.
Lemma 1. Under null hypothesis (8),  Proof.  Since 
 are independent, it is sufficient to show that, under 
, 
 for 
. Let 
 and 
, say. Then, we have
          
Since 
 and 
 a.s. by the strong law of large numbers, Formula (
12) along with the multivariate Slutsky theorem (see, e.g., Reference [
25], Theorem 3.4.3) then yield the assertion. □
 Theorem 2. Under null hypothesis (8), Λ 
and R, given by Formulas (9) and (10), are asymptotically -distributed, i.e., chi-square-distributed with p degrees of freedom.  Proof.  Since 
 are independent, it is sufficient to show that, under 
, any term of the sum in Formulas (
9) and (10), respectively, is asymptotically chi-square-distributed with one degree of freedom. The assertion then follows by application of the continuous mapping theorem (see, e.g., Reference [
26], Theorem 1.10). To this end, let 
 be true and 
. Moreover, let 
 to simplify notation. From Taylor’s theorem, we have for every 
 the identity
          
          where 
 lies in the interval with boundary points 1 and 
x. Application to both logarithmic arguments in Formula (
9) yields
          
          with
          
          where 
 and 
 lie in the interval with boundary points 1 and 
, respectively 
. By Formula (
11) with 
, the term in brackets in Formula (
13) as well as 
 and 
 converge to 1 a.s.. Since
          
Lemma 1 and Formula (
11), together with Slutsky’s theorem, then yield that 
B converges to 0 in distribution, which implies this convergence in probability, and, moreover,
          
Application of Slutsky’s theorem to Formula (
13), twice, then yields the assertion for the likelihood-ratio statistic.
To show the assertion for the Rao score statistic, we rewrite the terms in Formula (10) as
          
By Formula (
11), the term in square brackets converges to 
 a.s.. Application of Lemma 1 and Slutsky’s theorem, again, then completes the proof. □
   4. Testing for Equal Baseline-Distribution Parameters
We assume to have the sample situation as introduced at the beginning of 
Section 3 with the difference that model parameters 
, 
, and 
 are known, and parameters of the baseline cdfs 
 are unknown. More precisely, for 
, let baseline cdf in sample 
k be of the form
      
      for some unknown positive scale parameter 
 and a known increasing function 
 with 
 and 
, which is differentiable on 
; cf. References [
7,
14,
27]. As two examples, choices 
 and 
 for 
 correspond to an exponential and a Pareto baseline distribution, respectively. Hence, the uncertainty of the model is totally captured within the vector 
 of baseline-distribution parameters. Here, the joint density function of 
, 
, 
, can be written as
      
      where, for 
,
      
      and
      
      for 
 with 
, 
, 
 (cf. Formula (
3)). Statistics 
, 
, are independent, with 
 for 
. Moreover, for 
, the MLE of 
 is given by
      
      as analysis of the first two derivatives of the corresponding log-likelihood function shows. In the following, let 
.
  4.1. Exact Tests
Let 
 with 
 be nonempty index sets forming a partition of 
. Equation (4) can be rewritten as
        
We consider the test problem with null hypothesis
        
        and develop the corresponding likelihood-ratio test and Rao score test as in 
Section 3.1. To derive the MLE of 
, 
, under 
, let 
 be any index in 
 for 
. Then, the aim is to maximize the term
        
        wrt 
, the maximum of which is attained at
        
Hence, the MLE of 
, 
, under 
 is given by
        
Proceeding along the lines in 
Section 3.1, the likelihood-ratio statistic and Rao score statistic for testing null hypothesis (
15) turn out to be
        
Theorem 3. For testing null hypothesis (15),  and , given by Formulas (16) and (17), have single null distributions (cf. Theorem 1).  Proof.  The assertion can be shown by using similar arguments as in the proof of Theorem 1. □
 Theorem 3 allows for computing exact critical values for 
 and 
, subject to a desired significance level, by using Monte Carlo simulations and independently sampling from gamma distributions with scale parameters all equal to 1. For 
 systems, small sample sizes 
, and 
, 
Table 4 and 
Table 5 show such critical values for a significance level of 5%. Note that these values depend neither on 
 or 
, 
, nor on prespecified model parameters 
, 
, 
.
  4.2. Asymptotic Tests
Finally, we provide asymptotic results for 
 systems and null hypothesis
        
From Formulas (
16) and (17), the corresponding test statistics are seen to be
        
        with statistic 
; cf. Formulas (
9) and (10). Note that for 
, the likelihood-ratio test and the Rao score test are equivalent, since 
 and 
 are both strictly monotone functions of statistic 
; see also 
Section 3.2.
Again, we assume that 
 when the total sample size tends to infinity. Then, by the strong law of large numbers and the central limit theorem,
        
        which implies that, under null hypothesis (
18),
        
        (cf. Lemma 1 and its proof). From this, the following theorem can be shown in analogy to the proof of Theorem 2.
Theorem 4. Under null hypothesis (18),  and , given by Formulas (19) and (20), are asymptotically -distributed.    5. Power Study
We perform a simulation study to investigate and compare the power of the tests derived in 
Section 3 and 
Section 4, respectively.
For , let  vectors of r component failure times of some -out-of- system be observed, where  are arbitrary integers. In sample , any vector of component failure times is described by the first  SOSs based on the cdf  and model parameters . Moreover, all  vectors are assumed to be independent.
First, let 
 and 
 be known (but arbitrary), such that the uncertainty of the model is captured within 
, 
, and let 
. To decide whether both systems are subject to the same load-sharing effects, we consider null hypothesis
      
      and apply the exact and asymptotic likelihood-ratio test and Rao score test of 
Section 3. For a significance level of 5%, different samples sizes 
, and three vectors of model parameters describing either no, a linear, or an even faster increase in stress upon failures, 
Table 6 shows numerical power values at the corresponding pairs.
It is seen from 
Table 6 that the power of all tests increases when sample sizes increase or the vectors of the model parameters differ more. For small sample sizes, the exact Rao score test turns out to be biased, whereas the exact likelihood-ratio test seems to be unbiased (at least over the alternatives considered). Here, a test is said to be unbiased if its power function, defined on the set of all alternatives, is bounded from below by the significance level of the test; otherwise, the test is called biased. None of the exact tests dominates the other in terms of power. While the power of the likelihood-ratio tests seems to be almost unaffected when sample sizes are interchanged, the Rao score tests have greater power when more observations are recorded from the system with larger load-sharing effects. Moreover, the table indicates that the asymptotic Rao score test is conservative, i.e., its actual level is smaller than the nominal one (of 5%), which implies that the test is biased; on the other hand, the asymptotic likelihood-ratio test seems to be nonconservative and unbiased. Somewhat surprisingly, the actual levels of both asymptotic tests are already close to the nominal one for moderate sample sizes.
Now, suppose that model parameters 
, 
, are known (but arbitrary), and let baseline cdfs 
 and 
 be as stated in Formula (
14) with unknown parameters 
, and known (but arbitrary) functions 
. To check for common baseline-distribution parameters, we consider null hypothesis
      
      and apply the exact and asymptotic likelihood-ratio test and Rao score test of 
Section 4. For a significance level of 5% and different values of 
 and 
r, 
Table 7 shows numerical power values at several alternatives.
From 
Table 7, it is found that the power of all tests increases with increasing sample sizes, with increasing 
r, or with growing distance 
 of the baseline-distribution parameters. Again, the exact and asymptotic Rao score tests turn out to be biased for small sample sizes, and the likelihood-ratio tests seem to be unbiased (over the considered alternatives). None of the exact tests uniformly has greater power than the other. Interchanging sample sizes has an impact on the power of the Rao score tests at a given alternative, whereas the power of the likelihood-ratio tests seems to be nearly invariant. The actual levels of the asymptotic tests are all close to the nominal one (of 5%), where those of the likelihood-ratio test slightly exceed 5% while the Rao score test is conservative.
  6. Conclusions
In a setup of multiple samples of sequential order statistics modelling the component lifetimes of possibly differently structured 
k-out-of-
n systems, we provided exact and asymptotic statistical tests with flexible hypotheses to check for common load-sharing parameters as well as for common baseline-distribution parameters. The corresponding test statistics are shown to have single null distributions, i.e., they each have only one distribution under all parameters specified by the null hypothesis, such that exact critical values subject to a desired significance level are readily obtained by using Monte Carlo simulations. The proposed tests can also be used to decide whether a meta-analysis of the underlying data is reasonable. If, based on some dataset, the null hypothesis of common load-sharing parameters (or common baseline-distribution parameters) is not rejected, the performance of statistical procedures as, for instance, the accuracy of estimators, may be increased when applied to the whole dataset; this is, in particular, relevant for small sample sizes that are prevalent in reliability. Finally, the derived results might also be useful in other reliability applications. On the one hand, by appropriately setting the model parameters, the presented tests may be applied to check for identical scale parameters of underlying lifetime distributions in differently designed progressively type-II censored lifetime experiments ( see, e.g., Reference [
28]). On the other hand, by choosing a standard exponential baseline distribution, we may test for equality of parameters associated with stress levels in multiple repeated type-II censored exponential step-stress experiments (see Reference [
29]).