Next Article in Journal
Chirped Soliton Perturbation and Benjamin–Feir Instability of Chen–Lee–Liu Equation with Full Nonlinearity
Previous Article in Journal
A Quasi-Convex RKPM for 3D Steady-State Thermomechanical Coupling Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Non-Parametric Inference for Multi-Sample of Geometric Processes with Application to Multi-System Repair Process Modeling

Department of Statistics and Computer Sciences, Bilecik Şeyh Edebali University, Bilecik 11100, Turkey
Mathematics 2025, 13(14), 2260; https://doi.org/10.3390/math13142260 (registering DOI)
Submission received: 25 May 2025 / Revised: 8 July 2025 / Accepted: 9 July 2025 / Published: 12 July 2025
(This article belongs to the Section D1: Probability and Statistics)

Abstract

The geometric process is a significant monotonic stochastic process widely used in the fields of applied probability, particularly in the failure analysis of repairable systems. For repairable systems modeled by a geometric process, accurate estimation of model parameters is essential. The inference problem for geometric processes has been well-studied in the case of single-sample data. However, multi-sample data may arise when the repair processes of multiple systems are observed simultaneously. This study addresses the non-parametric inference problem for geometric processes based on multi-sample data. Several non-parametric estimators are proposed using the linear regression method, and their asymptotic properties are established. In addition, test statistics are introduced to assess sample homogeneity and to evaluate the significance of the trend observed in the process. The performance of the proposed estimators is evaluated through a comprehensive simulation study under small-sample settings. An artificial data analysis is conducted to model the repair processes of multiple repairable systems using the geometric process. Furthermore, a real-world dataset consisting of multi-sample failure data from two shared memory processors of the Blue Mountain supercomputer is analyzed to demonstrate the practical applicability of the method in multi-sample failure data analysis.

1. Introduction

In statistical modeling of datasets that include inter-arrival times of consecutive random events, a common approach is to use counting process models. When the data shows no trend, the homogeneous Poisson process (HPP) or its generalization, the renewal process (RP), may be applied. If the data exhibits a trend, the non-homogeneous Poisson process (NHPP) is often preferred. Two key NHPP models are the Cox––Lewis (log-linear) process and the Weibull (power–law) process. These models are typically used when the data demonstrates a monotonic trend, as their intensity functions are monotonic. For more details on NHPPs, refer to Ascher and Feingold [1] and Cox and Lewis [2]. However, Lam [3] introduced an alternative counting process, the geometric process (GP), to directly model the monotonic trend in the data. Before formally defining the GP, it is essential to revisit the concept of stochastic monotonicity.
Let { X k , k = 1 , 2 , } be the inter-arrival times of a counting process. The process { X k ,   k = 1 , 2 , } is said to be stochastically increasing (decreasing), if P X k > t   P ( X k + 1 > t ) for all k = 1 , 2 , and t > 0 . As a simple example of a monotone stochastic process, the GP is defined as follows. Let { X k ,   k = 1 , 2 , } be non-negative random variables representing the inter-arrival times of a counting process. Then, the process { X k ,   k = 1 , 2 , } is said to be a GP with trend parameter a   ( a > 0 ) if the random variables a k 1 X k ,   k = 1 , 2 , are independent and identically distributed (iid) with a common distribution function F ; see Lam [3]. Clearly, a GP is stochastically increasing when a < 1 , and stochastically decreasing when a > 1 . Furthermore, the GP reduces to an RP when a = 1 . Thus, the GP generalizes the RP while incorporating stochastic monotonicity.
Since its introduction, the GP has been widely used as a suitable model across various fields of applied probability. In particular, it has been applied to model the repair processes of repairable systems. In such systems, it is reasonable to assume that the successive operating times tend to decrease, while the successive repair times tend to increase due to aging effects and accumulated wear. These monotonic patterns make the GP an appropriate model for analyzing such systems. The GP has been extensively studied in the context of reliability, maintainability, failure, and warranty analysis. For instance, Lam [4,5] utilized the GP to determine the optimal replacement policy for repairable systems. Lam and Zhang [6] applied the GP model to analyze a two-component series system with a single repairman. Lam [7], Tang and Lam [8], and Lam and Zhang [9] explored the GP in the context of maintenance analysis for various deteriorating systems. For other recent applications of the GP, see Chan et al. [10,11,12], Wan and Chan [13], Zhang and Wang [14], and Arnold et al. [15]. Notable contributions have also been made by Aydoğdu and Altındağ [16], Pekalp et al. [17], Pekalp and Aydoğdu [18,19,20], and Rasay et al. [21].
For a repairable system modeled by a GP, estimating the expected operating time after each repair is crucial. Let X k ,   k = 1 , 2 , denote the successive operating times after each repair. Assume that the successive operating times follow a GP with trend parameter a , so that the process { X k ,   k = 1 , 2 , } is a GP. Then, the expected operating time and variance are given by E X k = μ / a k 1 , V a r X k = σ 2 / a 2 ( k 1 ) for k = 1 , 2 , , where μ = E ( X 1 ) and σ 2 = V a r ( X 1 ) . Thus, the expected operating time of a repairable system after a given number of repairs depends on the parameters a , μ , and σ 2 . Accurate statistical estimation of these parameters is critical. The inference problem for a , μ , and σ 2 has been thoroughly studied in the literature. Lam [22] proposed some non-parametric estimators using the linear regression method. Parametric estimation approaches have also been developed by Lam and Chan [23], Aydoğdu et al. [24], Chan et al. [25], Kara et al. [26], assuming that X 1 follows common failure distributions such as lognormal, gamma, and Weibull. Additional contributions to statistical inference on GP parameters include the works of Biçer et al. [27], Biçer et al. [28], Lone et al. [29], Usta [30], and Yılmaz [31]. Notably, all existing studies regarding the statistical inference of the parameters a , μ , and σ 2 have been based on single-sample data. However, in practical applications, multi-sample data may be required or may naturally arise over multiple repairable units. This can be explained as follows: A repairable system initially begins to function and is repaired when a failure occurs. After the repair, it returns to the functioning state and is repaired again if another failure occurs. This repair process is repeated until the system becomes ineffective. Most systems tend to become inefficient after only a few repairs due to increased repair times and decreased operating times. Once the system becomes ineffective, it should be replaced with a new one. In such cases, only a few successive operating times are observed. These few observations are often insufficient for reliable statistical inference. To address this limitation and obtain enough observations, the repair processes of multiple identical systems should be observed simultaneously. Such observations constitute multi-sample data. Several studies have addressed the statistical analysis of multi-sample data for modeling the failure/repair processes of multiple repairable systems. For example, Garmabaki et al. [32,33] studied the statistical modeling of multiple repairable systems using RP, HPP, or NHPP as alternative repair process models. Na and Chang [34] analyzed multi-sample failure data from multiple repairable systems using an NHPP model, applying it to the Korean National Army tank failure data. Wang et al. [35] analyzed multi-sample data from multiple repairable systems using a specific NHPP model. In these studies, the authors utilized NHPP models with monotonic intensity functions to analyze multi-sample data collected from the repair processes of multiple repairable systems. The GP model should also be considered as an alternative for analyzing multi-sample failure data due to its straightforward implementation for processes exhibiting a monotonic trend. Recently, Altındağ [36] evaluated the statistical analysis of multi-sample GPs with exponential failure times and demonstrated its application in modeling multiple repairable systems. In certain cases, especially when the samples include only a few observations, the assumption of parametric distribution for failure times may be inappropriate. To address this limitation, we investigate the non-parametric inference problem of GPs for multi-sample data and explore its application in modeling failure data from multiple repairable systems.
The remainder of the paper is organized as follows: Section 2 introduces the concept of multi-sample GPs, proposes several non-parametric estimators, and investigates their theoretical properties. It also presents test statistics to assess sample homogeneity and the significance of monotonic trends. Section 3 provides a comprehensive simulation study to evaluate and compare the performance of the proposed estimators. Section 4 presents illustrative examples based on both simulated and real data analyses. Finally, Section 5 summarizes the conclusions.

2. Inference for Multi-Sample of GPs

2.1. Multi-Sample of GPs

Let { X k , k = 1 , 2 , } be a GP with trend parameter a . Denote the mean and variance of the random variable X 1 by E X 1 = μ and V a r X 1 = σ 2 , respectively. Then, for k = 1 , 2 , , it follows that E X k = μ / a k 1 , V a r X k = σ 2 / a 2 ( k 1 ) . To statistically estimate the parameters a , μ , and σ 2 , a realization of the process { X k , k = 1 , 2 , } must be observed. Let X 1 , X 2 , , X n denote the first n inter-arrival times of the GP { X k , k = 1 , 2 , } . Then the sample X 1 , X 2 , , X n is considered a single realization. If n is not sufficiently large, statistical inference for the parameters a , μ , and σ 2 may be unreliable. To overcome this limitation and improve estimation efficiency, multiple realizations of the process should be observed. Let { X j k , k = 1 , 2 , } , j = 1 , 2 , , r denote r homogeneous GPs sharing a common trend parameter a . Suppose that for each j , a realization X j 1 , X j 2 , , X j n j of size n j is observed. If all these GPs are identical, so that they have the same trend parameter a and their initial variables X j 1 are iid with E X j 1 = μ and V a r X j 1 = σ 2 for all j = 1 , 2 , , r , then the collection { X j i ,   i = 1 , 2 , , n j ,   j = 1 , 2 , , r } is referred to as a multi-sample of the GP { X k , k = 1 , 2 , } .
A multi-sample from a GP can be illustrated in the context of repair processes for repairable systems. Consider r identical repairable systems operating simultaneously, where each system undergoes immediate repair upon failure. It is commonly observed that the successive operating times of such systems tend to decrease after each repair. As a result, the sequence of operating times for each system can be suitably modeled by a GP. Assuming that all repair processes are homogeneous across all systems, the combined dataset of operating times from the r systems constitutes a multi-sample from a GP. This data structure, observed throughout the repair processes of the r identical systems, is illustrated in Figure 1.

2.2. Non-Parametric Inference

Let { X j i ,   i = 1 , 2 , , n j ,   j = 1 , 2 , , r } be a multi-sample from r independent GPs with a common trend parameter a . Assume that all processes are homogeneous; that is, the random variables X j 1 are iid with E X j 1 = μ , and V a r X j 1 = σ 2 for j = 1 , 2 , , r . To make inferences about the parameters a , μ , and σ 2 , we first remind the non-parametric estimation approach based on single-sample data, as proposed by Lam [22] and Lam et al. [37]. Let { X j i ,   i = 1 , 2 , , n j } represent r independent samples from a GP with a trend parameter a , for j = 1 , 2 , , r . The parameters a , μ , and σ 2 can be estimated separately from each sample as follows. Define Y j i = a i 1 X j i . By the definition of the GP, the random variables Y j i for i = 1 , 2 , , n j and j = 1 , 2 , , r , are iid. Let us take the logarithm of each Y j i . The transformed random variables l n Y j i for i = 1 , 2 , , n j and j = 1 , 2 , , r are also iid. Denote the mean and variance of l n Y j i as E l n Y j i = λ and V a r l n Y j i = τ 2 for i = 1 , 2 , , n j and j = 1 , 2 , , r . By taking the logarithm of the random variable Y j i , we obtain for each j { 1 , 2 , , r } that
l n Y j i = i 1 l n a + l n X j i ,     i = 1 , 2 , , n j .
Let us write for each j { 1 , 2 , , r } that
l n Y j i = λ + ϵ j i ,     i = 1 , 2 , , n j
where ϵ j i ’s are iid with mean 0 and variance τ 2 . Rearranging Equation (1) yields for each j { 1 , 2 , , r } that
l n X j i = λ i 1 l n a + ϵ j i ,     i = 1 , 2 , , n j .
Therefore, Equation (3) can be interpreted as a simple linear regression model for each j { 1 , 2 , , r } . By utilizing the least squares method, the least squares estimators of β = l n a , λ , and τ 2 based on the j th sample { X j i ,   i = 1 , 2 , , n j } are obtained as follows:
β ^ j = l n a ^ j = 6 n j 1 n j ( n j + 1 ) i = 1 n j n j 2 i + 1 l n X j i ,
τ ^ j 2 = 1 n j         2 i = 1 n j l n X j i 2 1 n j i = 1 n j l n X j i 2     β ^ j i = 1 n j n j 2 i + 1 l n X j i / 2 ,    
λ ^ j = 2 n j ( n j + 1 ) i = 1 n j 2 n j 3 i + 2 l n X j i .
Hence, the least squares estimator of a for the j th sample is given by α ^ j = e x p { β ^ j } . For the details, see Lam [22]. Additionally, Lam [22] proposed several non-parametric estimators for the parameters μ and σ 2 as follows. Let us consider Equations (1) and (2). Lam [22] showed that
σ 2 e 2 λ τ 2
Thus, the parameter σ 2 can be estimated from the j th sample as:
σ ^ j , 1 2 = e x p 2 λ ^ j τ ^ j 2 .
Alternatively, as suggested by Lam [22], define Y ^ j i = a ^ j i 1   X j i , for i = 1 , 2 , , n j and j = 1 , 2 , , r . Then, a modified moment estimator for σ 2 , based on the j th sample, as defined as:
σ ^ j , 2 2 = 1 n j 1 i = 1 n j Y ^ j i Y ^ ¯ j 2 ,  
where Y ^ ¯ j = i = 1 n j Y ^ j i / n j . Furthermore, it can be written for each j { 1 , 2 , , r } that:
Y j i = μ 1 + ε j i ,   i = 1 , 2 , . . . , n j ,  
where E ε j i = 0 and V a r ε j i = σ 2 / μ 2 ; see Lam [22]. Taking the logarithm of both sides of Equation (10) yields l n Y j i = l n μ + l n ( 1 + ε j i ) . If we take the expectations of both sides, then we have from Equation (7) that:
λ l n μ V a r ( ε j i ) / 2 .  
Since V a r ε j i = σ 2 / μ 2 , the parameter μ satisfies the equation 2 μ 2 l n μ 2 λ μ 2 σ 2 = 0 . Therefore, the parameter μ can be estimated from the j th sample as:
μ ^ j , 1 Positive   Solution   of   2 μ 2 l n μ 2 λ ^ j μ 2 σ ^ j , 1 2 = 0 ;  
μ ^ j , 2 Positive   Solution   of   2 μ 2 l n μ 2 λ ^ j μ 2 σ ^ j , 2 2 = 0 .  
Other estimators for μ from the j th sample can be defined as follows:
μ ^ j , 3 = ( 1 + τ ^ j 2 / 2 ) e x p { λ ^ j } ,  
μ ^ j , 4 = Y ^ ¯ j .  
For further details, refer to Lam [22]. Note that the subscript j in the above estimators indicates that the estimator is based solely on the j th sample. The theoretical properties of some estimators based on each single sample are provided below.
Theorem 1.
If E l n Y j i 2 < , then
n j 3 / 2 β ^ j β d N 0 , 12 τ 2 ,
as n j for each j = 1 , 2 , , r ; see Theorem 3 of Lam et al. [37].
Theorem 2.
If E l n Y j i 2 < , then
n j 3 / 2 a ^ j a           d           N 0 , 12 a 2 τ 2 ,
as n j for each j = 1 , 2 , , r ; see Theorem 4 of Lam et al. [37].
Theorem 3.
If E Y j i 2 < and E l n Y j i 2 < , then
n j 1 / 2 μ ^ j , 4 μ           d           N 0 , σ 2 + 3 μ 2 τ 2 I a ,
as n j for each j = 1 , 2 , , r ;   w h e r e   I a = 0 if a = 1 , otherwise I ( a ) = 1 ; see Theorem 5 of Lam et al. [37].
Theorem 4.
If E Y j i 4 < and E l n Y j i 2 < , then
n j 1 / 2 σ ^ j , 2 2 σ 2           d           N 0 , ω 2 + 12 σ 4 τ 2 I a ,
as n j for each j = 1 , 2 , , r ; where ω 2 = V a r ( Y j i μ ) ; see Theorem 6 of Lam et al. [37].
Note that the estimators μ ^ j , 4 and σ ^ j , 2 2 are called modified moment estimators, as they correspond to the sample mean and variance of the set { Y ^ j i ,   i = 1 , 2 , , n j } , respectively, for each j { 1 , 2 , , r } . It follows directly from Theorems 3 and 4 that the least squares estimator a ^ j , along with the modified moment estimators μ ^ j , 4 and σ ^ j , 2 2 , are consistent estimators of the parameters a , μ , and σ 2 , respectively. We are now ready to study non-parametric inference based on the multi-sample, i.e., the combined set of all samples. Let { X j i ,   i = 1 , 2 , , n j ,   j = 1 , 2 , , r } be a multi-sample consisting of r independent GPs. A simple linear regression model, based on the multi-sample, can be formulated as
l n X j i = λ + β 1 i + ϵ j i ,         i = 1 , 2 , , n j ,   j = 1 , 2 , , r ,
where β = l n a . Then, the least squares estimators of β , λ , and τ 2 can be derived from Equation (20) as follows:
β ^ = 6 j = 1 r n j 4 j = 1 r n j 3 j = 1 r n j 3 j = 1 r n j 2 2 j = 1 r n j 2 j = 1 r i = 1 n j n * 2 i + 1 l n X j i ,
λ ^ = j = 1 r i = 1 n j l n X j i j = 1 r n j + β ^ n * 1 2 ,
τ ^ 2 = 1 ( j = 1 r n j ) 2 j = 1 r i = 1 n j l n X j i λ ^ β ^ i 1 2 ,
where n * = ( j = 1 r n j 2 ) / ( j = 1 r n j ) . Consequently, the estimator for a is given by a ^ = e x p { β ^ } . Analogous to the single-sample case, the following estimators for μ and σ 2 can be proposed:
σ ^ 1 2 = e x p 2 λ ^ τ ^ 2 ,
σ ^ 2 2 = 1 ( j = 1 r n j ) 1 j = 1 r i = 1 n j Y ^ j i Y ^ ¯ 2 ,       Y ^ ¯ = j = 1 r i = 1 n j Y ^ j i j = 1 r n j ,       Y ^ j i = a ^ i 1 X j i ,
μ ^ 1 Positive   Solution   of   2 μ 2 l n μ 2 λ ^ μ 2 σ ^ 1 2 = 0 ,  
μ ^ 2 Positive   Solution   of   2 μ 2 l n μ 2 λ ^ μ 2 σ ^ 2 2 = 0 ,
μ ^ 3 = ( 1 + τ ^ 2 / 2 ) e x p { λ ^ } ,  
μ ^ 4 = Y ^ ¯ .  
If the sample sizes are equal, i.e., n 1 = n 2 = = n r = n , the estimator β ^ given in Equation (21) simplifies to:
β ^ = 6 r n 1 n ( n + 1 ) j = 1 r i = 1 n n 2 i + 1 l n X j i ,
since in this case n * = n , and the coefficient simplifies as 6 j = 1 r n j / 4 j = 1 r n j 3 j = 1 r n j 3 j = 1 r n j 2 2 j = 1 r n j 2 = 6 / r n n 1 n + 1 . Moreover, under equal sample sizes, the estimator β ^ can also be expressed as the average of the individual estimators β ^ j given in Equation (4):
β ^ = 1 r j = 1 r β ^ j .
Furthermore, a similar formulation can be established for the modified moment estimator μ ^ 4 , even when the sample sizes are unequal. Specifically,
μ ^ 4 = Y ^ ¯ = j = 1 r i = 1 n j Y ^ j i j = 1 r n j = j = 1 r w j Y ^ ¯ j ,  
where w j = n j / ( j = 1 r n j ) . If n 1 = n 2 = = n r = n , it reduces to
μ ^ 4 = 1 r j = 1 r Y ^ ¯ j .
In contrast, the estimator σ ^ 2 2 cannot be expressed as a linear combination of σ ^ j , 2 2 for j = 1 , 2 , , r , since Y ^ ¯ j may not be equal to Y ^ ¯ . However, it can be approximated asymptotically as Y ^ ¯ j and Y ^ ¯ tend to the same value. Utilizing these results, the asymptotic properties of the estimators are provided below.
Theorem 5.
Suppose that all sample sizes are asymptotically equal, i.e., n 1 n 2 n r n   a n d   E l n Y j i 2 < , then,
n 3 / 2 β ^ β           d           N 0 , 12 τ 2 / r ,
as n .
Theorem 6.
Suppose that n 1 n 2 n r n   a n d   E l n Y j i 2 < , then,
n 3 / 2 a ^ a           d           N 0 , 12 a 2 τ 2 / r ,
as n .
Theorem 7.
Suppose that n 1 n 2 n r n   a n d   E Y j i 2 < and E l n Y j i 2 < , then,
n 1 / 2 μ ^ 4 μ           d           N 0 , σ 2 + 3 μ 2 τ 2 I a / r ,
as n .
Theorem 8.
Suppose that n 1 n 2 n r n   a n d   E Y j i 4 < and E l n Y j i 2 < , then,
n 1 / 2 σ ^ 2 2 σ 2           d           N 0 , ω 2 + 12 σ 4 τ 2 I a / r ,
as n .
Proof of Theorems 5–8.
The proofs of Theorem 5–7 are straightforward by considering linear combinations of β ^ and μ ^ 4 , and applying Cramer’s delta method (for the asymptotic distribution of a ^ ). To prove Theorem 8, we start by expressing σ ^ 2 2 as follows:
σ ^ 2 2 1 r n j = 1 r i = 1 n Y ^ j i Y ^ ¯ Y ^ ¯ j + Y ^ ¯ j 2 ,
which can be expanded as
σ ^ 2 2 1 r n j = 1 r i = 1 n Y ^ j i Y ^ ¯ j 2 2 j = 1 r i = 1 n Y ^ j i Y ^ ¯ j Y ^ ¯ j Y ^ ¯ + j = 1 r i = 1 n Y ^ ¯ j Y ^ ¯ 2 .
This simplifies to
σ ^ 2 2 1 r n j = 1 r i = 1 n Y ^ j i Y ^ ¯ j 2 + R n ,
where R n = 1 r n 2 j = 1 r i = 1 n Y ^ j i Y ^ ¯ j Y ^ ¯ j Y ^ ¯ + j = 1 r i = 1 n Y ^ ¯ j Y ^ ¯ 2 . Then, we have
σ ^ 2 2 1 r j = 1 r σ ^ j , 2 2 + R n .
It is straightforward to show that R n converges in probability to zero, since both Y ^ ¯ j for j = 1 , 2 , , r , and Y ^ ¯ converge in probability to μ by the law of large numbers. Therefore, the result follows directly from Theorem 4 and the independence of samples. □
For a multi-sample from a GP, it is important to statistically test whether the samples are homogeneous. When the samples are obtained from the repair processes of identical units, as illustrated in Figure 1, it is reasonable to assume that the random variables X j 1 for j = 1 , 2 , , r are iid. However, the trends exhibited in each process may differ. In such a case, unifying all samples becomes inappropriate, and each process should be analyzed separately. Therefore, the homogeneity of trends should be tested. The asymptotic results of the least squares estimators derived above can be utilized to test whether the trend parameters across all samples are homogeneous and to assess whether the multi-sample exhibits a significant trend.
Proposition 1.
Let { X j i ,     i = 1 , 2 , , n j } be r independent samples of a GP with trend parameter a j for j = 1 , 2 , , r . Assume that the random variables X j 1 are iid, and the variances of their logarithms are V a r l n X j 1 = τ 2 for j = 1 , 2 , , r . To test the homogeneity of trend parameters, consider the following hypotheses:
H 0 : a j = a ,   j = 1 , 2 , , r H 1 :   a i a j ,   i , j 1 , 2 , , r ,   i j ,
and define the test statistic as
S = 1 r         1 j = 1 r a ^ j a ^ ¯ 12 a ^ 2 τ ^ 2 / n j 3 2 ,
where a ^ j is the least squares estimator of a j based on the j th sample, a ^ ¯ = ( j = 1 r a ^ j ) / r , a ^ 2 and τ ^ 2 are least squares estimators of a and τ 2 based on the multi-sample { X j i ,     i = 1 , 2 , , n j ,   j = 1 , 2 , , r } . Then, the null hypothesis is rejected at a significance level α if S > χ r 1 2 α , since S ~ χ r 1 2 asymptotically under H 0 . Here, χ r 1 2 denotes the chi-square distribution with degrees of freedom r 1 , and χ r 1 2 α denotes its upper α quantile.
Alternatively, the hypothesis in Equation (42) can be tested using the following test statistic:
T = 1 r         1 j = 1 r β ^ j β ^ ¯ 12 τ ^ 2 / n j 3 2 ,
where β ^ j is the least squares estimator based on the j th sample, β ^ ¯ = ( j = 1 r β ^ j ) / r . Note that testing a j = a for j = 1 , 2 , , r is equivalent to testing β j = β for j = 1 , 2 , , r , since l n a = β . The statistic T may be more convenient to test homogeneity, as it can be computed more simply. The null hypothesis is rejected at the significance level α if T > χ r     1 2 α , since T ~ χ r     1 2 asymptotically under H 0 . Once the null hypothesis is not rejected, the common trend parameter of all samples, say a , can be estimated from the multi-sample. We can now test whether the common trend of the multi-sample is statistically significant.
Proposition 2.
Let X j i ,   i = 1 , 2 , , n j be r independent samples of a GP with common trend parameter a. Assume that the random variables X j 1 are iid, and the variances of their logarithms are V a r l n X j 1 = τ 2 for j = 1 , 2 , , r . To test whether the common trend is statistically significant, consider the hypothesis H 0 : a = 1 vs. H 1 : a 1 , and define the test statistic as
R = j = 1 r n j 3 12 a ^ 2 τ ^ 2 ( a ^ 1 ) ,
where a ^ 2 and τ ^ 2 are the least squares estimators of a and τ 2 based on the multi-sample { X j i ,     i = 1 , 2 , , n j ,   j = 1 , 2 , , r } . The null hypothesis is rejected at the significance level α if R > z α / 2 , since R ~ N ( 0 , 1 ) asymptotically under H 0 . Here, z α / 2 denotes the upper α / 2 quantile of the standard normal distribution.
Note that if the sample sizes are equal, i.e., n j = n for j = 1 , 2 , , r , the test statistic R can be computed as r n 3 / ( 12 a ^ 2 τ ^ 2 ) ( a ^ 1 ) . Once the null hypothesis is rejected, it can be concluded that the multi-sample { X j i ,   i = 1 , 2 , , n j ,   j = 1 , 2 , , r } has a statistically significant common trend, which distinguishes the GP from an RP. Conversely, if the null hypothesis is not rejected, the multi-sample can be regarded as a sample from an RP.

3. Simulation Study

In this section, we conduct a simulation study to investigate the small sample properties of the estimators. The samples are generated under the assumption that the initial random variables X j 1 , for j = 1 , 2 , , r , follow one of the following commonly used non-negative distributions:
  • Exponential distribution E x p ( θ ) with a probability density function
    f x = 1 θ e x p x θ ,     x 0 ;   θ > 0 .
The mean and variance of X j 1 are μ = E X j 1 = θ and σ 2 = V a r X j 1 = θ 2 , respectively.
2.
Weibull distribution W ( α , η ) with a probability density function
f x = α η x η α 1 e x p x η α ,     x 0 ;   α > 0 ,     η > 0 .
The mean and variance of X j 1 are μ = E X j 1 = η Γ ( 1 + 1 / α ) and σ 2 = V a r X j 1 = η 2 [ Γ ( 1 + 2 / α ) Γ ( 1 + 1 / α ) 2 ] , respectively.
3.
Lognormal distribution L N ( δ , ν ) with a probability density function
f x = ( x ν 2 π )   1 e x p l n x δ 2 2 ν 2 ,   x 0 ;   < δ < , ν > 0 .
The mean and variance of X j 1 are μ = E X j 1 = e x p { δ + ν 2 / 2 } , σ 2 = V a r X j 1 = μ 2 ( e x p ν 2 1 ) .
In the simulation study, each repetition is conducted by following the steps outlined below.
Step 1. Set the number of samples r and the sample sizes n j for j = 1 , 2 , , r .
Step 2. Generate r independent samples of sizes n j , say { Y j 1 , Y j 2 , , Y j n j } for j = 1 , 2 , , r , from the specified distributions given above.
Step 3. Set the trend parameter a and transform each sample using the relation X j i = Y j i / a i 1 for i = 1 , 2 , , n j and j = 1 , 2 , , r . As a result, the collection { X j i , i = 1 , 2 , , n j , j = 1 , 2 , , r } forms a multi-sample from a GP with a common trend parameter a .
Step 4. Finally, calculate the estimators a ^ , μ ^ i for i = 1 , 2 , 3 , 4 , and σ ^ i 2 for i = 1 , 2 .
The simulation study is carried out using MATLAB 2021a, with the number of replications set to 10 4 . The function fzero is employed to numerically solve Equations (26) and (27), for the estimators μ ^ 1 and μ ^ 2 , respectively. Simulations are carried out for varying values of the parameters a , μ , and σ 2 , as well as for different numbers of samples and sample sizes. For simplicity, the results are reported for selected values of a , μ , and σ 2 , as the findings are qualitatively similar across other parameter configurations.
First, the performance of the estimators a ^ , μ ^ i for i = 1 , 2 , 3 , 4 , and σ ^ i 2 for i = 1 , 2 is evaluated for each distribution. Subsequently, the effectiveness of the least squares estimator a ^ , and the modified moment estimators μ ^ 4 and σ ^ 2 2 is compared under varying sample sizes and numbers of samples. The other estimators of μ and σ 2 are excluded from the comparative analysis, as the modified moment estimators generally yield superior performance.
Table 1, Table 2 and Table 3 present the simulation results for the estimators a ^ , μ ^ i for i = 1 , 2 , 3 , 4 , and σ ^ i 2 for i = 1 , 2 , under exponential, Weibull, and lognormal distributions, respectively. In each table, the first row indicates the true parameter values used in the simulation, while the second row specifies the sample sizes for each multi-sample. ‘Mean’, ‘Var’, and ‘MSE’ represent the simulated mean, variance, and mean squared error of the corresponding estimators, respectively.
An analysis of the results reveals that the least squares estimator a ^ performs consistently well across all scenarios. Both its bias and variance are relatively small and decrease as the sample size increases. In contrast, the estimator σ ^ 1 2 exhibits substantial bias in all cases, and its variance and MSE are unacceptably high; therefore, these values have been marked as “*” in the tables. The estimator σ ^ 2 2 provides reliable estimates for the parameter σ 2 in all cases. Although its variance is relatively large under the exponential distribution, due to the higher value of σ 2 in this setting, both its bias and variance decrease with larger sample sizes. Although μ ^ 1 may appear reasonable, it is not recommended due to its reliance on the highly biased estimator σ ^ 1 2 . The other estimators, μ ^ 2 , μ ^ 3 , and μ ^ 4 , demonstrate good performance, with relatively low and comparable bias and variance. Moreover, their MSE values decrease as the sample size increases, suggesting that they are consistent estimators. It should be noted, however, that only the consistency of μ ^ 4 has been established theoretically. Overall, these findings suggest that the estimators a ^ , μ ^ 4 , and σ ^ 2 2 can be confidently recommended for estimating the parameters a , μ , and σ 2 in a multi-sample of GPs.
Next, we compare the effectiveness of multi-sample settings with varying numbers of samples and sample sizes, using the estimators a ^ , μ ^ 4 , and σ ^ 2 2 . For this purpose, the total number of observations is held constant, while the number of samples is systematically increased. This procedure is carried out for total sample sizes of 30 (small), 50 (moderate), and 100 (large) observations. Table 4 summarizes the corresponding sample configurations. Table 5, Table 6 and Table 7 present the simulation results of the estimators under these different multi-sample settings.
The estimators a ^ , μ ^ 4 , and σ ^ 2 2 exhibit strong performance across various multi-sample settings, characterized by low bias and variance in each case. As the total number of observations increases, both bias and variance decrease, which is consistent with the theoretical property of consistency proven for all three estimators. When the number of samples increases while keeping the total number of observations fixed, a slight increase is observed in the bias and variance of a ^ . However, the biases of μ ^ 4 and σ ^ 2 2 remain relatively stable, and their variances decrease significantly. These findings indicate that inference based on multi-sample settings can be as effective as inference based on single-sample settings of GPs.

4. Data Analysis

4.1. Simulated Data Analysis

We now present a numerical study to illustrate the application of the proposed inference procedures in data analysis. We consider a scenario involving 10 identical repairable systems, where both the consecutive operating times and repair times are modeled by a GP. As noted by Lam et al. [37], the trend parameter generally lies between 0.95 and 1.05 in practical applications. Accordingly, the trend parameter is set to 1.05 for consecutive operating times, reflecting the decreasing trend in performance after each repair. The trend parameter is chosen as 0.95 for consecutive repair times, capturing the increasing trend caused by the aging effect and accumulated wear. The multi-sample of operating times is generated by assuming X j 1 follows a Weibull distribution with parameters α = 2 and η = 10 for j = 1 , 2 , , 10 . Similarly, the multi-sample of repair times is generated by assuming X j 1 follows a Weibull distribution with parameters α = 2 and η = 2 for j = 1 , 2 , , 10 . For each system, it is assumed that only a limited number of operating and repair times can be observed. The resulting multi-sample of operating times is provided in Table 8, and the multi-sample of repair times is shown in Table 9, separately.
For the multi-sample of operating times, we first test whether the samples are homogeneous, i.e., whether the trend parameters are equal across all samples. The test statistics have been computed as S = 1.4408 with p-value p = 0.9976 , and T = 1.6815 with p-value p = 0.9956 . Both results strongly suggest that the trend parameters are homogeneous, justifying the use of the combined multi-sample for inference. The corresponding estimators are computed as a ^ = 1.0552 , μ ^ 4 = 8.5991 , and σ ^ 2 2 = 18.9345 . Similarly, for the multi-sample of repair times, the test statistics are computed as S = 0.9453 with a p-value of p = 0.9995 , and T = 0.8929 with a p-value of p = 0.9996 , again indicating strong evidence for the homogeneity of trends across all samples. Based on the multi-sample of repair times, the estimators are computed as a ^ = 0.9604 , μ ^ 4 = 1.8688 , and σ ^ 2 2 = 1.1202 . In both cases, the estimators provide reliable estimates of the parameters, even when each individual sample contains only a small number of observations.
Suppose now that we aim to estimate the expected lifetime of the system after the fourth repair, i.e., E ( X j 5 ) . Two estimators can be considered. The first is the sample mean of X j 5 , j = 1 , 2 , , 10 , given by ( j = 1 10 X j 5 ) / 10 = 7.7220 . The second is based on the model relationship E ( X j i ) = μ / a i 1 , yielding the estimator μ ^ 4 / a ^ 4 = 7.0155 . Notably, the second estimator provides a value that is closer to the true expected value, E X j 5 = 7.2910 , demonstrating the advantage of model-based estimation. Moreover, if the goal is to estimate E ( X j 7 ) , the first estimator would rely on only three observations (assuming X j 7 is available for only three systems), whereas the model-based estimator utilizes all available information from the multi-sample. This highlights the benefit of model-based estimation.

4.2. Real Data Analysis

In this section, a real dataset is analyzed non-parametrically using the GP to illustrate the applicability of the proposed inferential procedures for modeling multi-system repair processes. For comparison, the dataset is also analyzed under the RP, NHPP, and GP with a parametric assumption on the inter-arrival times. The dataset represents failure processes of two identical shared memory processors (SMPs) in the Blue Mountain supercomputer at Los Alamos National Laboratory. It consists of the consecutive failure times of each SMP and can be found in Wang et al. [35]. The inter-arrival times between consecutive failures for each system are presented in Table 10.
Before modeling the dataset with the GP non-parametrically, we first give the results of existing studies in the literature. Wang et al. [35] denoted that each SMP is restarted upon failure, allowing each restart to be interpreted as a system repair. They also denoted that the failure processes of two SMPs exhibit reliability growth after each restart/repair. Consequently, the data exhibit a trend, and the failure processes of two SMPs can be appropriately modeled using monotonic stochastic processes. Wang et al. [35] proposed to model the data with the power–law process (PLP), a monotonic NHPP with intensity function λ t = β 0 β 1 t β 1 1 ,   t 0 . It is well-known that the failures tend to decrease in time, so the inter-arrival times of consecutive failures tend to increase when β 1 < 1 . Conversely, the failures tend to increase, so the inter-arrival times of consecutive failures tend to decrease when β 1 > 1 . If β 1 = 1 , the PLP reduces to an HPP, which means the data has no trend. Altındağ [36] computed the maximum likelihood estimates (MLEs) of the PLP parameters as β ^ 0 = 0.2496 and β ^ 1 = 0.7794 . In the same study, the author also examined whether the data could be modeled using a GP under the assumption that the inter-arrival times follow an exponential distribution. The results indicated that the data are compatible with the GP model and that the failure processes of the two SMPs can be considered homogeneous. Moreover, the GP model with exponential inter-arrival times (GP-EXP) provided a better fit to the data than the PLP model, as evidenced by a lower Akaike Information Criterion (AIC) value. For detailed analysis, see Altındağ [36]. Additionally, the MLEs of the common trend parameter a and the mean θ of the exponential distribution were estimated as a ^ = 0.9654 and θ ^ = 9.0295 , respectively.
After reviewing the existing studies that model the dataset parametrically, we outline the steps necessary to validate the GP model on the dataset using a non-parametric approach:
Step 1. Test for the presence of a trend in the dataset.
Step 2. Assess the compatibility of the dataset with the GP model.
Step 3. Examine whether the samples (i.e., trends exhibited within each sample) are homogeneous.
Step 4. Estimate the model parameters.
Step 5. Evaluate the fitting performance of the model.
Evaluation of Step 1. To test for the presence of a trend in the dataset, the combined Laplace trend test—an adaptation of the well-known Laplace test for multi-sample settings—can be employed. For further details on multi-sample trend testing, see Kvaløy and Lindqvist [38] and Garmabaki et al. [32]. The combined Laplace test statistic has been computed as L c = 15.3731 with a p-value of p < 0.0001 , indicating a statistically significant trend in the SMPs failure dataset (Altındağ [36]).
Evaluation of Step 2. To assess whether the dataset is compatible with the GP, the following auxiliary variables proposed by Lam [22] are considered. Let U j i = X j 2 i / X j ( 2 i 1 ) , i = 1 , 2 , , ( n j 1 ) / 2 for j = 1 , 2 . If the dataset follows a GP model, these variables are expected to be iid for each j { 1 ,   2 } . The iid property of the auxiliary variables can be tested by the well-known turning-point (TP) test. The test statistics for the TP test have been computed as T P 1 = 0.4354 with a p-value of p = 0.66 for j = 1 , and T P 2 =   1.0885 , with a p-value of p = 0.27 for j = 2 . These results indicate that each sample is compatible with the GP model. Therefore, the entire dataset can be reasonably modeled using the GP (Altındağ [36]).
Evaluation of Step 3. To examine the homogeneity of the samples under the GP model, the non-parametric test proposed in Proposition 1 is applied. The test statistics have been computed as S = 0.0712 with a p-value of p = 0.9920 , and T = 0.0735 , with a p-value of p = 0.9915 . Both results indicate that the trend parameters of each sample are homogeneous under the GP model. Consequently, we can non-parametrically estimate the common trend parameter a and the mean of inter-arrival times μ .
Evaluation of Step 4. The non-parametric least squares and modified moment estimators of the parameters a and μ have been computed as a ^ = 0.9762 and μ ^ 4 = 10.5272 . Note that these values are very close to the parametric counterparts under the GP-EXP model provided above. Furthermore, the result a ^ = 0.9762 < 1 indicates that failures tend to decrease over time, so the consecutive operating times tend to increase. This observation is consistent with the results obtained from both the NHPP-PLP and the GP-EXP models.
Remark 1.
In the first step, the presence of a trend has been evaluated using the combined Laplace test. Additionally, the significance of the trend parameter estimated under the GP model can be assessed using the test procedure described in Proposition 2. Based on this approach, the test statistic has been computed as R =   1.3519 with a p-value of p = 0.1764 , suggesting that the trend may not be statistically significant. This result appears to contradict the finding from Step 1. However, it is important to note that the critical value for the test statistic R is valid asymptotically. As such, the power of the test may be undesirable for small or moderately sized samples. For instance, if the trend parameter was estimated as a ^ = 0.965 , the test statistic would be R =   2.0112 with a p-value of p = 0.0443 , which would indicate statistical significance at the α = 0.05 level. If we assume that the data exhibit no trend, then the appropriate model for the dataset would be the RP. Under the RP model, the mean of the inter-arrival times can be estimated by the sample mean, yielding μ ^ = 15.0917 .
Evaluation of Step 5. We now proceed to analyze the dataset by comparing the fitting performances of various models. The RP, the GP-EXP, and the NHPP-PLP are compared with the non-parametric GP (GP-NP) model. For this purpose, the mean squared error (MSE*) of predictions for each model can be used as the evaluation criterion. The MSE* for predictions can be defined as:
M S E * = 1 n 1 + n 2 j = 1 2 i = 1 n j X ^ j i X j i 2 ,
where X ^ j i denotes the predicted value of X j i for i = 1 , , n j and j = 1 , 2 according to the model under consideration. Specifically:
  • For the RP model: X ^ j i = μ ^ .
  • For the GP-EXP model: X ^ j i = a ^ 1 i θ ^ .
  • For the GP-NP model: X ^ j i = a ^ 1 i μ ^ 4 .
  • For the NHPP-PLP model: X ^ j i = S ^ j i S ^ j ( i 1 ) where
    S ^ j i = e x p { β ^ 0 S ^ j i 1 β ^ 1 } β ^ 0 1 /   β ^ 1 β ^ 0 S ^ j i 1 β ^ 1 t 1 /   β ^ 1 e t d t ,
    and S ^ j 0 = 0 . See Kara et al. [39] for the details. The parameter estimates and corresponding MSE* values for all models are presented in Table 11.
Remark 2.
All MSE* values appear to be relatively high. This can be attributed to the comparatively large magnitudes of certain X j i observations, for example X 1 ( 31 ) = 79.43 and X 2 ( 22 ) = 62.91 .
Upon examining the results, the highest MSE* value has been observed for the RP model. This indicates that assuming a trend in the failure data for multi-system SMPs is a more realistic approach. The MSE* values for the exponential failure times and non-parametric GP models have been found to be lower than those of the NHPP-PLP model, suggesting that the GP model performs a better fit for this dataset compared to the NHPP-PLP model. Additionally, when comparing the parametric and non-parametric GP models, the estimated trend parameters and the mean failure (operating) times have been found to be similar for both models. However, the parametric GP model yielded a lower MSE*, indicating a better overall fit. It is reasonable to expect that, given a sufficient number of observations, the parametric model would yield a better fit compared to the non-parametric model.

5. Conclusions and Discussion

In this paper, we address the problem of non-parametric inference for GPs based on multi-sample data. We propose several estimators by using the linear regression method to estimate the parameters a , μ , and σ 2 . The asymptotic properties of the least squares estimator a ^ , as well as the modified moment estimators μ ^ 4 and σ ^ 2 2 , are investigated. Their asymptotic distributions are derived. To evaluate the small sample performance of the estimators, an extensive simulation study is conducted. The results demonstrate that the least squares estimator a ^ closely estimates the true value of a . For the parameters μ , and σ 2 , the modified moment estimators μ ^ 4 and σ ^ 2 2 yield accurate and robust estimates across varying numbers of samples and sample sizes, outperforming the other estimators considered.
The effectiveness of the multi-sample approach has been illustrated through an artificial data analysis involving the repair process data of multiple repairable systems. Furthermore, a real-world dataset representing the failure processes of two SMPs from the Blue Mountain supercomputer has been analyzed as a multi-sample of GP. The GP model, incorporating non-parametric estimates, provided a better fit to the data compared to the NHPP-PLP and RP models. While many previous studies in the literature have relied on NHPP models for analyzing the repair processes of multiple repairable systems, this study illustrates the potential of the GPs for modeling multi-sample data obtained from such systems. However, the study can be extended in several directions, as discussed below.
In the study, we have assumed that the initial means and variances of the inter-arrival times are identical across all samples. While this assumption is reasonable when the samples are obtained from identical systems, it is nonetheless strong and may not hold in some cases. In practice, systems are often exposed to varying environmental and operational conditions, which may lead to differences in variability, even when the trend parameters appear homogeneous. The non-identicality of the initial inter-arrival times across samples should be addressed for future research on multi-sample settings of GPs. The study has dealt with the statistical modeling of datasets consisting of multiple samples using the GP, in the context of repair process modeling for multiple repairable systems. After appropriately modeling the repair process using the GP, further analyses for the repair process itself should be considered from different perspectives, such as system reliability evaluation, maintenance cost analysis, determination of optimal maintenance policies, etc. For this purpose, in addition to the estimation of model parameters, the estimation problem of some key functions related to the GP, such as the distribution function of inter-arrival times, the mean value and variance functions of the GP that correspond to the expected value and variance of the repairs realized in a certain period, should be addressed. This requires some further development based on this study. Furthermore, in the study, statistical inference has been conducted using non-parametric techniques. This can be extended by assuming parametric distributions for the inter-arrival times of GPs. Addressing such scenarios represents valuable directions for future research.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study.

Acknowledgments

The author thanks to the anonymous reviewers for their careful reading and valuable suggestions.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Ascher, H.; Feingold, H. Repairable Systems Reliability: Modeling, Inference, Misconceptions and Their Causes; Wiley: New York, NY, USA, 1984. [Google Scholar]
  2. Cox, D.R.; Lewis, P.A. The Statistical Analysis of Series of Events; Springer: Berlin/Heidelberg, Germany, 1966. [Google Scholar]
  3. Lam, Y. Geometric processes and replacement problem. Acta Math. Appl. Sin. 1988, 4, 366–377. [Google Scholar]
  4. Lam, Y. A note on the optimal replacement problem. Adv. Appl. Probab. 1988, 20, 479–482. [Google Scholar]
  5. Lam, Y. An optimal repairable replacement model for deteriorating systems. J. Appl. Probab. 1991, 28, 843–851. [Google Scholar]
  6. Lam, Y.; Zhang, Y.L. Analysis of a two-component series system with a geometric process model. Naval Res. Logist. 1996, 43, 491–502. [Google Scholar] [CrossRef]
  7. Lam, Y. A geometric process maintenance model with preventive repair. Eur. J. Oper. Res. 2007, 182, 806–819. [Google Scholar] [CrossRef]
  8. Tang, Y.Y.; Lam, Y. A δ-shock maintenance model for a deteriorating system. Eur. J. Oper. Res. 2006, 168, 541–556. [Google Scholar] [CrossRef]
  9. Lam, Y.; Zhang, Y.L. A geometric-process maintenance model for a deteriorating system under a random environment. IEEE Trans. Reliab. 2003, 52, 83–89. [Google Scholar] [CrossRef]
  10. Chan, J.S.; Yu, P.L.; Lam, Y.; Ho, A.P. Modelling SARS data using threshold geometric process. Stat. Med. 2006, 25, 1826–1839. [Google Scholar] [CrossRef]
  11. Chan, J.S.K.; Lam, C.P.Y.; Yu, P.L.H.; Choy, S.B.; Cohen, C.W. A Bayesian conditional autoregressive geometric process model for range data. Comput. Stat. Data Anal. 2012, 56, 3006–3019. [Google Scholar] [CrossRef]
  12. Chan, J.S.K.; Wan, W.Y.; Yu, P.L.H. Poisson geometric process approach for predicting drop-out and committed first-time blood donors. J. Appl. Stat. 2014, 41, 1486–1503. [Google Scholar] [CrossRef]
  13. Wan, W.Y.; Chan, J.S.K. A new approach for handling longitudinal count data with zero-inflation and overdispersion: Poisson geometric process model. Biom. J. 2009, 51, 556–570. [Google Scholar] [CrossRef] [PubMed]
  14. Zhang, Y.L.; Wang, G.J. A geometric process warranty model using a combination policy. Commun. Stat. Theory Methods 2019, 48, 1493–1505. [Google Scholar] [CrossRef]
  15. Arnold, R.; Chukova, S.; Hayakawa, Y.; Marshall, S. Geometric-like processes: An overview and some reliability applications. Reliab. Eng. Syst. Saf. 2020, 201, 106990. [Google Scholar] [CrossRef]
  16. Aydoğdu, H.; Altındağ, Ö. Computation of the mean value and variance functions in geometric process. J. Stat. Comput. Simul. 2016, 86, 986–995. [Google Scholar] [CrossRef]
  17. Pekalp, M.H.; Aydoğdu, H.; Türkman, K.F. Discriminating between some lifetime distributions in geometric counting processes. Commun. Stat. Simul. Comput. 2020, 51, 715–737. [Google Scholar] [CrossRef]
  18. Pekalp, M.H.; Aydoğdu, H. An integral equation for the second moment function of a geometric process and its numerical solution. Naval Res. Logist. 2018, 65, 176–184. [Google Scholar] [CrossRef]
  19. Pekalp, M.H.; Aydoğdu, H. Power series expansions for the probability distribution, mean value and variance functions of a geometric process with gamma interarrival times. J. Comput. Appl. Math. 2021, 388, 113287. [Google Scholar] [CrossRef]
  20. Pekalp, M.H.; Aydoğdu, H. Parametric estimations of the mean value and variance functions in geometric process. J. Comput. Appl. Math. 2024, 449, 115969. [Google Scholar] [CrossRef]
  21. Rasay, H.; Azizi, F.; Naderkhani, F. A mathematical maintenance model for a production system subject to deterioration according to a stochastic geometric process. Ann. Oper. Res. 2024, 340, 451–478. [Google Scholar] [CrossRef]
  22. Lam, Y. Nonparametric inference for geometric processes. Commun. Stat. Theory Methods 1992, 21, 2083–2105. [Google Scholar]
  23. Lam, Y.; Chan, J.S.K. Statistical inference for geometric processes with lognormal distribution. Comput. Stat. Data Anal. 1998, 27, 99–112. [Google Scholar] [CrossRef]
  24. Aydoğdu, H.; Şenoğlu, B.; Kara, M. Parameter estimation in geometric process with Weibull distribution. Appl. Math. Comput. 2010, 217, 2657–2665. [Google Scholar] [CrossRef]
  25. Chan, J.S.K.; Lam, Y.; Leung, D.Y. Statistical inference for geometric processes with gamma distributions. Comput. Stat. Data Anal. 2004, 47, 565–581. [Google Scholar] [CrossRef]
  26. Kara, M.; Güven, G.; Şenoğlu, B. Estimation of the parameters of the gamma geometric process. J. Stat. Comput. Simul. 2022, 92, 2525–2535. [Google Scholar] [CrossRef]
  27. Biçer, C.; Bakouch, H.S.; Biçer, H.D. Inference on parameters of a geometric process with scaled Muth distribution. Fluct. Noise Lett. 2021, 20, 2150006. [Google Scholar] [CrossRef]
  28. Biçer, H.D.; Biçer, C.; Bakouch, H.S. A geometric process with Hjorth marginal: Estimation, discrimination, and reliability data modeling. Qual. Reliab. Eng. Int. 2022, 38, 2795–2819. [Google Scholar] [CrossRef]
  29. Lone, S.A.; Alam, I.; Rahman, A. Statistical analysis under geometric process in accelerated life testing plans for generalized exponential distribution. Ann. Data Sci. 2023, 10, 1653–1665. [Google Scholar] [CrossRef]
  30. Usta, I. Bayesian estimation for geometric process with the Weibull distribution. Commun. Stat. Simul. Comput. 2024, 53, 2527–2553. [Google Scholar] [CrossRef]
  31. Yılmaz, A. Bayesian parameter estimation for geometric process with Rayleigh distribution. Bitlis Eren Univ. J. Sci. Technol. 2024, 13, 482–491. [Google Scholar] [CrossRef]
  32. Garmabaki, A.H.S.; Ahmadi, A.; Block, J.; Pham, H.; Kumar, U. A reliability decision framework for multiple repairable units. Reliab. Eng. Syst. Saf. 2016, 150, 78–88. [Google Scholar] [CrossRef]
  33. Garmabaki, A.H.S.; Ahmadi, A.; Mahmood, Y.A.; Barabadi, A. Reliability modelling of multiple repairable units. Qual. Reliab. Eng. Int. 2016, 32, 2329–2343. [Google Scholar] [CrossRef]
  34. Na, I.Y.; Chang, W. Multi-system reliability trend analysis model using incomplete data with application to tank maintenance. Qual. Reliab. Eng. Int. 2017, 33, 2385–2395. [Google Scholar] [CrossRef]
  35. Wang, Y.; Lu, Z.; Xiao, S. Parametric bootstrap confidence interval method for the power law process with applications to multiple repairable systems. IEEE Access 2018, 6, 49157–49169. [Google Scholar] [CrossRef]
  36. Altındağ, Ö. Statistical evaluation of multiple process data in geometric processes with exponential failures. Hacet. J. Math. Stat. 2025, 54, 738–761. [Google Scholar] [CrossRef]
  37. Lam, Y.; Zhu, L.X.; Chan, J.S.K.; Liu, Q. Analysis of data from a series of events by a geometric process model. Acta Math. Appl. Sin. 2004, 20, 263–282. [Google Scholar] [CrossRef]
  38. Kvaløy, J.T.; Lindqvist, B.H. TTT-based tests for trend in repairable systems data. Reliab. Eng. Syst. Saf. 1998, 60, 13–28. [Google Scholar] [CrossRef]
  39. Kara, M.; Altındağ, Ö.; Pekalp, M.H.; Aydoğdu, H. Parameter estimation in α-series process with lognormal distribution. Commun. Stat. Theory Methods 2019, 48, 4976–4998. [Google Scholar] [CrossRef]
Figure 1. Repair processes of r identical repairable systems.
Figure 1. Repair processes of r identical repairable systems.
Mathematics 13 02260 g001
Table 1. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ E x p ( θ ) .
Table 1. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ E x p ( θ ) .
a   =   0.95 ,   θ   =   5 ,   μ   =   5 ,   σ 2   =   25
r   =   3 ,   n j   =   10 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   15 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   20 ,   j   =   1 , 2 , 3
MeanVarMSEMeanVarMSEMeanVarMSE
a ^ 0.95260.00630.04890.95170.00190.00190.94970.00070.0007
μ ^ 1 5.999531.48632.4855.249212.43812.5004.971610.52310.524
μ ^ 2 5.12894.69074.70735.07823.20303.20924.97512.53112.5318
μ ^ 3 5.52855.66465.94405.41023.59583.76405.30242.83742.9289
μ ^ 4 5.30524.90004.99325.24043.24313.30095.14402.56872.5894
σ ^ 1 2 145.55**63.635**55.200**
σ ^ 2 2 32.7531251.61311.731.511794.89837.2829.471588.56608.55
* Unreasonable value due to highly biased estimation.
Table 2. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ W ( α , η ) .
Table 2. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ W ( α , η ) .
a   =   0.95 ,   α   =   2 ,   η   =   5 ,   μ   =   4.4311 ,   σ 2   =   5.3650
r   =   3 ,   n j   =   10 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   15 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   20 ,   j   =   1 , 2 , 3
MeanVarMSEMeanVarMSEMeanVarMSE
a ^ 0.95040.00150.00150.95000.00040.00040.94990.00020.0002
μ ^ 1 5.34253.96094.79165.21242.52533.13575.14281.81012.3166
μ ^ 2 4.40750.83420.83474.37980.58660.58924.36550.45040.4547
μ ^ 3 4.59680.90900.93644.57040.64430.66374.55570.49260.5081
μ ^ 4 4.48260.85340.85614.46530.60470.60584.45610.46460.4652
σ ^ 1 2 26.404**21.553**19.198**
σ ^ 2 2 5.67858.51528.61345.59025.39935.45005.54273.99414.0257
* Unreasonable value due to highly biased estimation.
Table 3. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ L N ( δ , ν ) .
Table 3. Simulated mean, variance, and MSE values for the estimators under different sample settings when X j 1 ~ L N ( δ , ν ) .
a   =   0.95 ,   δ   =   0 ,   ν   =   1 ,   μ   =   1.6487 ,   σ 2   =   4.6708
r   =   3 ,   n j   =   10 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   15 ,   j   =   1 , 2 , 3 r   =   3 ,   n j   =   20 ,   j   =   1 , 2 , 3
MeanVarMSEMeanVarMSEMeanVarMSE
a ^ 0.95280.00370.00370.95080.00110.00110.95060.00040.0004
μ ^ 1 1.44110.13400.17711.40650.07870.13741.39030.05650.1233
μ ^ 2 1.83100.61870.65191.84130.44400.48111.85700.35570.3991
μ ^ 3 1.59590.33080.33361.56370.21450.22171.55420.16010.1690
μ ^ 4 1.69750.41340.41581.68090.27640.27741.68010.20960.2106
σ ^ 1 2 1.3592**1.2147**1.1499**
σ ^ 2 2 5.079992.78892.9564.927256.04756.1134.895938.53138.582
* Unreasonable value due to highly biased estimation.
Table 4. Sample numbers and sizes of multi-sample.
Table 4. Sample numbers and sizes of multi-sample.
Sample 1 r = 1 ,   n 1 = 30
Sample 2 r = 2 ,   n 1 = 15 ,   j = 1 , 2
Sample 3 r = 3 ,   n j = 10 ,   j = 1 , 2 , 3
Sample 4 r = 5 ,   n j = 6 , j = 1 , 2 , 3 , 4 , 5
Sample 5 r = 1 ,   n 1 = 50
Sample 6 r = 2 ,   n j = 25 , j = 1 , 2
Sample 7 r = 3 ,   n j = 17 , j = 1 , 2 , , n 3 = 16
Sample 8 r = 5 ,   n j = 10 , j = 1 , , 5
Sample 9 r = 7 ,   n j = 7 , j = 1 , , 6 ,   n 7 = 8
Sample 10 r = 1 ,   n 1 = 100
Sample 11 r = 2 ,   n j = 50 , j = 1 , 2
Sample 12 r = 3 ,   n j = 33 , j = 1 , 2 , n 3 = 34
Sample 13 r = 4 ,   n j = 25 , j = 1 , 2 , 3 , 4
Sample 14 r = 10 ,   n j = 10 , j = 1 , , 10
Table 5. Comparison of estimators for different multi-sample settings when X j 1 ~ E x p ( θ ) .
Table 5. Comparison of estimators for different multi-sample settings when X j 1 ~ E x p ( θ ) .
a   =   0.95 ,   θ   =   5 ,   μ   =   5 ,   σ 2   =   25
a ^ μ ^ 4 σ ^ 2 2
MeanVarMSEMeanVarMSEMeanVarMSE
Sample 10.95020.00070.00075.35845.78655.915035.5132313.82424.3
Sample 20.95020.00270.00275.28085.26335.342133.9272131.12210.8
Sample 30.95480.00610.00615.34925.34355.465535.7763961.44077.5
Sample 40.96020.01710.01725.28824.33614.419133.2091602.81670.2
Sample 50.95000.00010.00015.21413.32803.373831.2311104.31143.1
Sample 60.94990.00060.00065.19183.13233.169130.514865.74896.15
Sample 70.95080.00130.00135.22563.08223.133130.830808.23842.23
Sample 80.95120.00370.00375.16922.81382.842530.111718.97745.09
Sample 90.95340.00720.00725.16982.66772.696629.894657.20681.15
Sample 100.95000.00000.00005.12531.61001.625827.944277.79286.46
Sample 110.95000.00010.00015.11631.55581.569327.960268.44277.20
Sample 120.95000.00020.00025.10411.46761.478427.587251.82258.52
Sample 130.95030.00030.00035.11651.48731.500827.850263.79271.92
Sample 140.95090.00180.00185.09301.35141.360027.565241.49248.07
Table 6. Comparison of estimators for different multi-sample settings when X j 1 ~ W ( α , η ) .
Table 6. Comparison of estimators for different multi-sample settings when X j 1 ~ W ( α , η ) .
a   =   0.95 ,   α   =   2 , η   =   5 ,   μ   =   4.4311 ,   σ 2   =   5.3650
a ^ μ ^ 4 σ ^ 2 2
MeanVarMSEMeanVarMSEMeanVarMSE
Sample 10.95030.00020.00024.52030.97300.98095.824110.28810.499
Sample 20.95020.00070.00074.49220.92840.93215.73199.66019.7948
Sample 30.95110.00150.00154.49180.87140.87515.72599.02879.1590
Sample 40.95240.00420.00424.48010.74360.74605.66387.41147.5007
Sample 50.94990.00000.00004.46280.57930.58035.57834.91314.9586
Sample 60.95020.00010.00014.47610.57310.57515.62905.12835.1980
Sample 70.95010.00030.00034.46810.53380.53525.59634.76134.8147
Sample 80.95050.00090.00094.46810.50750.50895.56994.50034.5423
Sample 90.95130.00180.00184.46760.47730.47865.56434.32514.3648
Sample 100.95000.00000.00004.45270.29670.29715.48502.31422.3286
Sample 110.95000.00000.00004.45130.29070.29115.49872.30212.3200
Sample 120.95000.00000.00004.45380.29470.29525.49042.25752.2733
Sample 130.94990.00010.00014.43870.27820.27825.46592.22472.2349
Sample 140.94980.00050.00054.43970.25150.25165.44932.03692.0440
Table 7. Comparison of estimators for different multi-sample settings when X j 1 ~ L N ( δ , ν ) .
Table 7. Comparison of estimators for different multi-sample settings when X j 1 ~ L N ( δ , ν ) .
a   =   0.95 ,   δ   =   0 ,   ν   =   1 ,   μ   =   1.6487 ,   σ 2   =   4.6708
a ^ μ ^ 4 σ ^ 2 2
MeanVarMSEMeanVarMSEMeanVarMSE
Sample 10.95020.00040.00041.69740.46250.46485.2719151.78152.15
Sample 20.95100.00160.00161.68950.42730.42895.047788.49488.636
Sample 30.95240.00370.00371.69300.41510.41715.048491.79691.939
Sample 40.95460.01030.01031.68080.37870.37984.941289.93690.009
Sample 50.95020.00010.00011.68710.26660.26815.083983.06083.231
Sample 60.95030.00040.00041.68510.26930.27075.059367.55167.702
Sample 70.95050.00080.00081.67160.24730.24784.845857.39257.423
Sample 80.95050.00220.00221.66840.24010.24054.797139.46539.481
Sample 90.95200.00430.00431.66730.22810.22854.761937.40937.417
Sample 100.95000.00000.00001.66120.13370.13384.789926.06626.081
Sample 110.95000.00000.00001.66430.13010.13034.801525.65925.676
Sample 120.94990.00010.00011.65680.12440.12444.726221.64421.647
Sample 130.95020.00020.00021.66700.12780.12824.804924.93524.953
Sample 140.95050.00110.00111.66130.11780.11794.701917.48117.482
Table 8. Consecutive operating times of repairable identical systems.
Table 8. Consecutive operating times of repairable identical systems.
X j 1 ~ W 2 , 10 ,   j   =   1 , 2 , , 10 ,   a   =   1.05 ,   μ   =   8.8623 ,   σ 2   =   21.4602
System 18.3216.2216.564.465.81
System 26.3118.099.656.324.692.10
System 33.483.9214.257.2012.83
System 415.3811.593.2411.109.917.574.19
System 53.7910.068.204.348.644.49
System 61.977.0912.585.7111.78
System 76.0010.582.948.957.132.466.76
System 85.603.445.9410.927.433.138.264.03
System 915.415.148.162.577.096.28
System 103.588.469.928.511.91
Table 9. Consecutive repair times of repairable identical systems.
Table 9. Consecutive repair times of repairable identical systems.
X j 1 ~ W 2 , 2 ,   j   =   1 , 2 , , 10 ,   a   =   0.95 ,   μ   =   1.7725 ,   σ 2   =   0.8584
System 11.041.671.151.840.54
System 20.981.832.732.311.842.49
System 31.462.821.464.373.34
System 4 0.813.012.131.062.622.531.94
System 50.721.651.853.360.973.37
System 62.133.330.843.172.09
System 72.550.693.103.860.915.141.43
System 82.305.261.230.641.120.282.013.52
System 91.900.492.971.451.695.26
System 101.453.341.121.170.70
Table 10. Inter-arrival times between consecutive failures of the two SMPs in the Blue Mountain supercomputer.
Table 10. Inter-arrival times between consecutive failures of the two SMPs in the Blue Mountain supercomputer.
j n j X j i
1 314.7415.491.983.819.072.2520.701.67
7.965.221.2711.528.9017.7911.750.71
4.306.845.954.5916.5538.4113.6224.77
4.880.2215.667.7224.557.8679.43
2 235.219.2827.8212.214.577.008.9044.28
8.2019.0126.2917.4413.3119.8529.2712.88
18.5428.629.9911.0314.7862.9123.37
Table 11. Parameter estimates and MSE* values for all models.
Table 11. Parameter estimates and MSE* values for all models.
ModelParameter EstimatesMSE*
RP μ ^ = 15.0917 213.0120
NHPP-PLP β ^ 0 = 0.2496 ,   β ^ 1 = 0.7794 196.1034
GP-NP a ^ = 0.9762 ,   μ ^ 4 = 10.5272 192.9454
GP-EXP a ^ = 0.9654 ,   θ ^ = 9.0295 188.8042
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Altındağ, Ö. Non-Parametric Inference for Multi-Sample of Geometric Processes with Application to Multi-System Repair Process Modeling. Mathematics 2025, 13, 2260. https://doi.org/10.3390/math13142260

AMA Style

Altındağ Ö. Non-Parametric Inference for Multi-Sample of Geometric Processes with Application to Multi-System Repair Process Modeling. Mathematics. 2025; 13(14):2260. https://doi.org/10.3390/math13142260

Chicago/Turabian Style

Altındağ, Ömer. 2025. "Non-Parametric Inference for Multi-Sample of Geometric Processes with Application to Multi-System Repair Process Modeling" Mathematics 13, no. 14: 2260. https://doi.org/10.3390/math13142260

APA Style

Altındağ, Ö. (2025). Non-Parametric Inference for Multi-Sample of Geometric Processes with Application to Multi-System Repair Process Modeling. Mathematics, 13(14), 2260. https://doi.org/10.3390/math13142260

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop