Next Article in Journal
Proposal of a Protocol for Adjusting the Value of the SN-GoGn Angle in Steiner Cephalometry
Previous Article in Journal
Gut Microbiota-Mediated Biotransformation of Medicinal Herb-Derived Natural Products: A Narrative Review of New Frontiers in Drug Discovery
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bias-Reduced Haebara and Stocking–Lord Linking

by
Alexander Robitzsch
1,2
1
IPN—Leibniz Institute for Science and Mathematics Education, Olshausenstraße 62, 24118 Kiel, Germany
2
Centre for International Student Assessment (ZIB), Olshausenstraße 62, 24118 Kiel, Germany
J 2024, 7(3), 373-384; https://doi.org/10.3390/j7030021
Submission received: 10 May 2024 / Revised: 14 August 2024 / Accepted: 21 August 2024 / Published: 4 September 2024
(This article belongs to the Section Computer Science & Mathematics)

Abstract

:
Haebara and Stocking–Lord linking methods are frequently used to compare the distributions of two groups. Previous research has demonstrated that Haebara and Stocking–Lord linking can produce bias in estimated standard deviations and, to a smaller extent, in estimated means in the presence of differential item functioning (DIF). This article determines the asymptotic bias of the two linking methods for the 2PL model. A bias-reduced Haebara and bias-reduced Stocking–Lord linking method is proposed to reduce the bias due to uniform DIF effects. The performance of the new linking method is evaluated in a simulation study. In general, it turned out that Stocking–Lord linking had substantial advantages over Haebara linking in the presence of DIF effects. Moreover, bias-reduced Haebara and Stocking–Lord linking substantially reduced the bias in the estimated standard deviation.

1. Introduction

Item response theory (IRT) models [1,2] are multivariate statistical models for multivariate binary random variables. These kinds of models are frequently used to model cognitive testing data stemming from educational or psychological applications. This article only considers unidimensional IRT models [3]. Let X = ( X 1 , , X I ) be the vector of I dichotomous (i.e., binary) random variables X i { 0 , 1 } that are typically referred to as items in the psychometric literature. A unidimensional IRT model [4] is a statistical model for the probability distribution P ( X = x ) for x = ( x 1 , , x I ) { 0 , 1 } I , where
P ( X = x ; δ , γ ) = i = 1 I P i ( θ ; γ i ) x i 1 P i ( θ ; γ i ) 1 x i ϕ ( θ ; μ , σ ) d θ ,
where ϕ denotes the density of a normal distribution with mean  μ and standard deviation  σ . The parameters of the distribution’s latent variable θ (also referred to as the factor variable, trait, or ability) are contained in δ = ( μ , σ ) . The vector γ = ( γ 1 , , γ I ) contains all estimated item parameters of item response functions (IRFs) P i ( θ ; γ i ) = P ( X i = 1 | θ ) ( i = 1 , , I ). The two-parameter logistic (2PL) model [5] is the most popular IRT model for dichotomous items and possesses the IRF
P i ( θ ; γ i ) = Ψ a i ( θ b i ) ,
where a i and b i are the item discrimination and item difficulty, respectively, and Ψ ( x ) = ( 1 + exp ( x ) ) 1 denotes the logistic distribution function. For independent and identically distributed observations x 1 , , x N of N persons from the distribution of the random variable X , the unknown model parameters in Equation (1) can be estimated by (marginal) maximum likelihood estimation [6,7,8].
In many educational applications, IRT models are employed to compare the distributions of two groups in a test (i.e., on a set of items) regarding the factor variable θ in the IRT model in Equation (1). Linking methods [9,10,11] estimate the 2PL model separately in the two groups in the first step. All group differences are captured in the item parameters. Linking methods transform the estimated item parameters from the first step in a subsequent second step to determine a difference regarding the mean μ and the standard deviation σ between the two groups. The separate application of the 2PL model in each of the two groups has the advantage that it allows items to function differently across groups: a property that is referred to as differential item functioning (DIF) [12,13]. It has been pointed out that the occurrence of DIF causes additional variability in the estimated mean μ and standard deviation σ when applying a linking method [14,15,16,17,18,19].
However, it has also been shown that the presence of DIF effects produces a bias in the parameters that quantify the group difference [20]. The bias occurs for the popular Haebara [21] and Stocking–Lord [22] linking methods. This article proposes bias-reduced variants of these linking methods that rely on a derivation of the asymptotic by using a second-order Taylor expansion of the optimization function involved in the linking method for the 2PL model. A simulation study is carried out to investigate the usefulness of the proposed modified linking estimator.
The rest of the article is organized as follows. The derivation of the bias-reduced variants of Haebara and Stocking–Lord linking is presented in Section 2. Section 3 reports findings from a simulation study that compares the proposed new estimators with the currently used linking methods in the presence of DIF. Finally, the article closes with a discussion in Section 4.

2. Bias Reduction in Haebara and Stocking–Lord Linking

2.1. Group Comparisons in the 2PL Model

In this section, we explain how a group comparison involving two groups is conducted within the 2PL model. Assume that the 2PL model holds for the first group, and the ability distribution is given as a standard normal distribution; that is, θ N ( 0 , 1 ) . The item discriminations and item intercepts in the first group are defined as a i 1 = a i and b i 1 = b i , respectively. In the second group, the ability distribution is given as θ N ( μ , σ 2 ) , where μ and σ are the mean and the standard deviation (SD), respectively, of the second group. By using this parametrization, μ reflects group differences regarding the mean, while σ quantifies group differences in standard deviations. The item discriminations is assumed to be invariant across the two groups such that a i 2 = a i . A uniform DIF [12] effect e i is assumed for item difficulties:
b i 2 = b i + e i , where E ( e i ) = 0 and Var ( e i ) = τ 2 .
The quantity τ 2 is also called the DIF variance [23,24], and τ is denoted as the DIF SD.
An anonymous reviewer wondered whether the assumption of a mean DIF of zero is tenable in applications. We favor the assumption of a zero mean DIF because we consider DIF effects as construct-relevant [25,26,27]. In this case, the estimated group differences (i.e., estimates of μ and σ ) should be based on all items, and no items should be removed from group comparisons [28], as they are done under a partial invariance assumption [29,30].
A linking method proceeds in two steps. In the first step, the 2PL model is separately fitted within the two groups while assuming a standard normal distribution N ( 0 , 1 ) for the ability θ . Due to sampling errors (i.e., sampling of persons), the estimated item parameters a ^ i 1 and b ^ i 1 will slightly differ from the data-generating parameters a i 1 = a i and b i 1 = b i . In the second group, the original item parameters a i 2 and b i 2 are not recovered because the 2PL model is fitted with θ N ( 0 , 1 ) , but the data-generating model imposes θ N ( μ , σ 2 ) . However, it holds that
a i 2 ( θ b i 2 ) = a i 2 ( σ θ * + μ b i 2 ) for θ * N ( 0 , 1 ) ,
and we have that the identified item parameters
a i 2 * = a i σ and b i 2 * = σ 1 ( b i + e i μ )
We can now compute the identified item parameters. By construction, we have a ^ i 1 = a i 1 and b ^ i 1 = b i . Note that
a i 2 ( θ b i 2 ) = a i 2 ( σ θ * + μ b i 2 ) ,
and we have that the identified item parameters
a ^ i 2 = a i σ and b ^ i 2 = σ 1 ( b i + e i μ )
Due to sampling errors, the estimated item parameters a ^ i 2 and b ^ i 2 will slightly differ from a i 2 * and b i 2 * .
If no uniform DIF effects occur (i.e., τ = 0 ) and there is no sampling error, the group mean μ and the group SD σ can be determined from a single item i using the identified item parameters a i 2 * and b i 2 * . In the presence of sampling errors and DIF, a linking function H is chosen that estimates μ and σ based on estimated item parameters a ^ i g and b ^ i g for  i = 1 , , I and g = 1 , 2 . A linking method is formally described in the following Section 2.2.

2.2. Bias Reduction in a Linking Method Due to DIF

In a linking method, the vector δ = ( μ , σ ) is the statistical parameter of interest. The corresponding parameter estimate for δ is denoted as δ ^ = ( μ ^ , σ ^ ) . The linking function H is defined as a function of δ and the estimated item parameters γ ^ = ( γ ^ 1 , , γ ^ I ) (where γ ^ i = ( a ^ i 1 , b ^ i 1 , a ^ i 2 , b ^ i 2 ) ) in the two groups such that
δ ^ = arg min δ H ( δ ; γ ^ ) .
By assuming differentiability of H, the estimate δ ^ fulfills the estimating equation
H δ ( δ ^ ; γ ^ ) = H δ ( δ ^ ; γ ^ ) = 0 .
A minimal requirement of a linking method is that it should recover the true group mean μ 0 and the true group SD σ 0 if there is no sampling error and no DIF (i.e., τ = 0 ). Hence, it is assumed that
H δ ( δ 0 ; γ 0 ) = 0 for δ 0 = ( μ 0 , σ 0 ) ,
where γ 0 denotes joint item parameters a i and b i , and all DIF effects e i are assumed to be zero.
We now derive an expression of the bias of the linking method due to the presence of DIF effects e i . The idea is to utilize a second-order Taylor expansion of H δ and to derive a bias correction term of the parameter estimate δ ^ . Note that the estimated item parameters γ ^ i for item i are a function of a common item parameter γ i = ( a i , b i ) , and the uniform DIF effect e i . The estimating equation in Equation (9) can be formalized as
H δ ( δ ; γ , e ) = 0 ,
where γ collects all item parameters, and e = ( e 1 , , e I ) denotes the vector of DIF effects. Due to Equation (10), it holds that H δ ( δ 0 ; γ , 0 ) = 0 for e = 0 . This property ensures that the application of the linking method provides the true mean μ 0 and the true SD σ 0 .
By assuming independence of DIF effects e i , we have, by using a first-order Taylor expansion with respect to δ and a second-order Taylor expansion with respect to DIF effects  e i ,
0 = H δ ( δ ^ ; γ , e ) = H δ ( δ 0 ; γ , 0 ) + H δ δ ( δ 0 ; γ , 0 ) ( δ ^ δ 0 ) + i = 1 I H δ e i ( δ 0 ; γ , 0 ) e i + 1 2 i = 1 I H δ e i e i ( δ 0 ; γ , 0 ) e i 2
Due to H δ ( δ 0 ; γ , 0 ) = 0 , we obtain from Equation (12)
δ ^ δ 0 = H δ δ ( δ 0 ; γ , 0 ) 1 i = 1 I H δ e i ( δ 0 ; γ , 0 ) e i + 1 2 i = 1 I H δ e i e i ( δ 0 ; γ , 0 ) e i 2
Because the DIF effects e i are random variables with zero expectation E ( e i ) = 0 , we can determine an approximate (i.e., expected) bias in the estimate δ ^ as
Bias ( δ ^ ) = 1 2 H δ δ ( δ 0 ; γ , 0 ) 1 i = 1 I H δ e i e i ( δ 0 ; γ , 0 ) E ( e i 2 )
Note that the bias due to DIF effects does not vanish in large samples of persons or for a large number of items. Let τ 2 = E ( e i 2 ) ; the asymptotic bias of δ ^ can be described as
Bias ( δ ^ ) = 1 2 τ 2 H δ δ ( δ 0 ; γ , 0 ) 1 i = 1 I H δ e i e i ( δ 0 ; γ , 0 )
Equation (14) gives rise to the idea of using an empirical version of it as a bias correction (or maybe only a bias-reducing) term for δ ^ . We compute the estimated DIF effects e ^ i by
e ^ i = μ ^ + σ ^ b ^ i 2 b ^ i 1
to obtain a bias-reduced estimate of δ as
δ ^ br = δ ^ + 1 2 H δ δ ( δ ^ ; γ ^ , 0 ) 1 i = 1 I H δ e i e i ( δ ^ ; γ ^ , 0 ) e ^ i 2
The vector H δ e i e i contains second-order derivatives of H δ with respect to e i and is given by H δ e i e i = ( H μ e i e i , H σ e i e i ) .
The described construction scheme for a bias-reduced linking estimate is applied for Haebara linking in Section 2.3 and Stocking–Lord linking in Section 2.4.

2.3. Bias-Reduced Haebara Linking (brHAE)

The optimization function in Haebara (HAE) [21] linking is defined as
H ( δ ; γ ^ ) = i = 1 I t = 1 T ω t Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) Ψ a ^ i 2 ( θ t b ^ i 2 ) 2 for δ = ( μ , σ ) .
with a priori specification of weights ω t on a grid of θ values θ 1 , , θ T . A convenient choice is to use an equidistant grid of θ values on [ 6 , 6 ] and weights ω t that are proportional to the density function of a normal distribution with a mean of 0 and an SD of 2. The corresponding estimating equations to the minimization problem in Equation (18) are given by
H μ = H μ = 2 i = 1 I t = 1 T ω t Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) Ψ a ^ i 2 ( θ t b ^ i 2 ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 = 0 and
H σ = H σ = 2 i = 1 I t = 1 T ω t Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) Ψ a ^ i 2 ( θ t b ^ i 2 ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 θ t = 0 .
In Equation (20), Ψ denotes the derivative of the logistic function Ψ . The second-order derivative H δ δ (i.e., the partial derivatives of H μ and H σ with respect to μ and σ ) can be simply computed, although the formulas are a bit cumbersome. In a numerical implementation of the bias-reduced linking method, the derivatives can be obtained with numerical approximation.
Using the true parameter vector δ 0 = ( μ 0 , σ 0 ) , we can write
Ψ a ^ i 2 ( θ t b ^ i 2 ) = Ψ a i σ 0 ( θ t σ 0 1 ( b i e i + μ 0 ) ) = Ψ a i ( σ 0 θ t ( b i e i + μ 0 ) )
The unknown DIF effect e i in Equation (21) can be substituted by e ^ i such that we obtain
Ψ a i ( σ 0 θ t ( b i e i + μ 0 ) ) Ψ a ^ i 1 ( σ ^ θ t ( b ^ i 1 e ^ i + μ ^ ) )
We now compute the required derivatives with respect to e i in the bias-reduced estimate derived in Equation (17). The derivatives with respect to μ are given by
H μ e i = 2 t = 1 T ω t Ψ a i ( σ θ t ( b i + e i ) μ ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 2 and
H μ e i e i = 2 t = 1 T ω t Ψ a i ( σ θ t ( b i + e i ) μ ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 3 ,
where Ψ is the second derivative of Ψ . In analogy, we have for the required second derivative of H σ with respect to e i ,
H σ e i e i = 2 t = 1 T ω t Ψ a i ( σ θ t ( b i + e i ) μ ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 3 θ t .
The term a i ( σ θ t ( b i + e i ) μ ) in Equations (24) and (25) can be estimated by a ^ i 2 ( θ t b ^ i 2 ) . Hence, the required terms for the bias-reduced Haebara linking (brHAE) can be written as
H μ e i e i = 2 t = 1 T ω t Ψ a ^ i 2 ( θ t b ^ i 2 ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 3 and
H σ e i e i = 2 t = 1 T ω t Ψ a ^ i 2 ( θ t b ^ i 2 ) Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 3 θ t .

2.4. Bias-Reduced Stocking–Lord Linking (brSL)

The Stocking–Lord (SL) [22] linking method minimizes the weighted squared distance of test characteristic functions:
H ( δ ; γ ^ ) = t = 1 T ω t i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) i = 1 I Ψ a ^ i 2 ( θ t b ^ i 2 ) 2 with δ = ( μ , σ ) .
SL linking offers more flexibility in allowing differences in the IRFs between groups because the alignment occurs at the level of the test characteristic function. In contrast, HAE linking explicitly defines the squared distance between IRFs. The corresponding estimating equations of SL linking defined in Equation (28) for the parameters μ and σ are given as
H μ = 2 t = 1 T ω t i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) i = 1 I Ψ a ^ i 2 ( θ t b ^ i 2 ) i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 = 0 and
H σ = 2 t = 1 T ω t i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) i = 1 I Ψ a ^ i 2 ( θ t b ^ i 2 ) i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 θ t = 0 .
The second-order derivatives H δ δ can be similarly obtained, although they can be numerically obtained in a practical implementation of the bias-reduced linking method.
The second-order derivatives of H μ and H σ with respect to e i can be computed similarly as for Haebara linking. We obtain the following quantities that are required in the formula for the bias reduction:
H μ e i e i = 2 t = 1 T ω t Ψ a ^ i 2 ( θ t b ^ i 2 ) i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 a ^ i 1 2 and
H σ e i e i = 2 t = 1 T ω t Ψ a ^ i 2 ( θ t b ^ i 2 ) i = 1 I Ψ a ^ i 1 ( σ θ t + μ b ^ i 1 ) a ^ i 1 a ^ i 1 2 θ t .
By using Equations (31) and (32) in Formula (17) for the bias-reduced estimate, we obtain the bias-reduced Stocking–Lord (brSL) linking method.

2.5. Mean-Geometric-Mean Linking (MGM)

The mean-geometric-mean (MGM) linking method [10] is a two-step linking method that computes the SD by a geometric mean of item discriminations in the first step and the mean by an ordinary average of transformed item difficulties in the second step. The estimating equations for μ and σ are given by
H μ = i = 1 I σ b ^ i 2 b ^ i 1 + μ = 0 and
H σ = i = 1 I log a ^ i 2 log ( σ ) log a ^ i 1 = 0 .
One can easily show that H μ e i e i = 0 and H σ e i e i = 0 . Hence, no asymptotic bias of the MGM linking method is expected. Therefore, exploring a bias-reducing variant of the MGM linking method is unnecessary.

3. Simulation Study

3.1. Method

Item responses in two groups were simulated according to the 2PL model. For identification reasons, the mean and the standard deviation of the normally distributed ability variable θ in the first group were set to 0 to 1, respectively. In the second group, we chose the mean μ = 0.3 and the SD σ = 1.2 for the normally distributed ability variable. The parameters μ and σ were held constant in the simulation.
In the simulation, the number of items I was chosen as 10, 20, or 40, indicating short, medium, or long tests, respectively. The group-specific item parameters a i g and b i g for i = 1 , , I and g = 1 , 2 relied on base item parameters that were fixed in the simulation and a random DIF effect that was simulated in each replication of the simulation study. The common item discriminations a i in the case of I = 10 items were chosen as 0.83, 1.02, 0.88, 0.80, 1.04, 0.95, 1.00, 1.13, 1.32, and 1.11, resulting in a mean M = 1.01 and an S D = 0.16 . The common item difficulties b i were chosen as −1.74, −1.22, −0.22, 0.54, −0.04, −0.39, −0.73, 0.30, 0.83, and −1.39, resulting in M = 0.41 and S D = 0.86 . For item numbers as multiples of 10, we duplicated the item parameters of the 10 items accordingly. The item parameters in the second group included a uniform DIF effect e i that was added to the common item difficulty b i . As described in Section 2.1, the item difficulty b i 2 in the second group was simulated as
b i 2 = b i + e i .
The DIF effects e i were independently and identically normally distributed with a mean of zero and a DIF SD τ . Note that DIF effects varied across replications within simulation conditions. The DIF SD τ was chosen as 0, 0.25, or 0.5, indicating no DIF, moderate DIF, or large DIF.
Item responses were simulated according to the 2PL model for sample sizes N per group of 500, 1000, 2000, or 4000. We also investigated an infinite sample size (denoted by N = Inf ) in which only item parameters with DIF effects were simulated.
Five different linking methods were applied to estimate the mean μ and the SD σ in the second group: Haebara (HAE) linking, bias-reduced Haebara (brHAE) linking, Stocking–Lord (SL) linking, bias-reduced Stocking–Lord (brSL) linking, and mean-geometric-mean (MGM) linking. In finite sample sizes, all linking methods relied on estimated item discriminations a ^ i g and item difficulties b ^ i g ( i = 1 , , I , g = 1 , 2 ) obtained from separately fitting the 2PL model to the item response datasets in the two groups. In an infinite sample size, the identified item parameters as defined in Equation (7) were used, and no item responses were simulated.
In each of the 5 (sample size N) × 3 (DIF standard deviation τ ) × 3 (number of items I) = 45 cells of the simulation, 3000 replications were conducted. We computed the empirical bias, the empirical SD, and the root mean square error (RMSE) for the estimated mean μ ^ and the estimated standard deviation σ ^ . A relative percentage RMSE was computed as the ratio of the RMSE values of a particular linking method and the RMSE of Haebara linking.
R software (Version 4.3.1, [31]) was used for the entire analysis in this simulation study. The 2PL model was fitted using the TAM::tam.mml.2pl() function in the R package TAM [32]. MGM linking was implemented using the function sirt::linking.haberman() in the R package sirt [33]. Dedicated R functions were written by the author for HAE and SL linking and their bias-reduced variants. These functions and replication material for this Simulation Study can be found at https://osf.io/8mk49 (accessed on 10 May 2024).

3.2. Results

Table 1 presents the bias, the SD, and the relative RMSE for the estimated mean μ ^ . By construction, all linking methods estimated the true population value μ = 0.3 in an infinite sample size in the no-DIF condition. Furthermore, the five linking methods performed similarly regarding the RMSE in the no-DIF condition τ = 0 . However, an efficiency loss due to MGM linking and an efficiency advantage due to SL compared to HAE linking were observed for short tests with a small number of items I = 10 . A small positive bias was induced by brHAE and brSL in the small sample size N = 500 .
Overall, SL outperformed HAE regarding the RMSE, which was a direct consequence of the lower standard deviation of SL linking compared to HAE linking. Notably, the negative bias of HAE and SL was reduced by the bias-reduced variants brHAE and brSL in the large-DIF condition τ = 0.5 . MGM linking was advantageous over the other linking methods in the presence of large DIF and larger sample sizes.
We also computed descriptive statistics for the absolute biases and the relative RMSEs of the different linking methods across all simulation conditions. MGM linking had the lowest average absolute bias with a small SD (MGM: M = 0.002 , S D = 0.002 ). The bias-reduced variants of HAE and SL slightly outperformed HAE and SL with respect to the average absolute bias (HAE: M = 0.006 , S D = 0.006 ; brHAE: M = 0.005 , S D = 0.004 ; SL: M = 0.009 , S D = 0.009 ; brSL: M = 0.003 , S D = 0.003 ). SL can be considered the frontrunner among the linking methods regarding the average relative RMSE for the estimated mean μ ^ (HAE: M = 100 , S D = 0 ; brHAE: M = 102.62 , S D = 2.18 ; SL: M = 96.93 , S D = 1.60 ; brSL: M = 9 6.93 , S D = 1.34 ; MGM: M = 99.48 , S D = 4.59 ).
Table 2 reports the bias, the SD, and the relative RMSE for the estimated SD σ ^ . The MGM linking method exactly recovers the true SD σ = 1.2 in an infinite sample size in all DIF conditions because uniform DIF does not affect estimates of the SD. There was noticeable bias in the large-DIF condition for HAE and SL linking that did not vanish in an infinite sample size. Overall, MGM was the best-performing method regarding bias and relative RMSE for σ ^ in DIF conditions and had only minor efficiency losses in the no-DIF conditions. In general, SL outperformed HAE linking regarding the RMSE, which was a consequence of the lower SD. Note that the advantages of SL linking over HAE linking were more pronounced in tests with fewer items (i.e., I = 10 ). The bias-reduced variants of HAE and SL (i.e., brHAE and brSL) effectively reduced a large portion of the bias in the estimated standard deviation σ ^ . However, positive bias for brHAE and brSL was observed for a sample size N = 500 .
In summary, descriptive statistics across all simulation conditions for the absolute bias resulted in the MGM linking method as the frontrunner, followed by brHAE and brSL (HAE: M = 0.013 , S D = 0.012 ; brHAE: M = 0.006 , S D = 0.005 ; SL: M = 0.016 , S D = 0.016 ; brSL: M = 0.005 , S D = 0.004 ; MGM: M = 0.002 , S D = 0.002 ). The aggregated statistics for the relative RMSEs demonstrated the good performance of MGM linking. SL clearly outperformed the HAE and brHAE methods, but brSL additionally had improvements over SL, in particular for large DIF (HAE: M = 100 , S D = 0 ; brHAE: M = 103.29 , S D = 4.78 ; SL: M = 89.02 , S D = 10.00 ; brSL: M = 8 5.87 , S D = 1 3.50 ; MGM: M = 69.95 , S D = 33.90 ).

4. Discussion

Previous research has highlighted that the HAE and SL linking methods can produce substantial bias for SDs and for the mean (to a smaller extent) in the presence of DIF effects. Importantly, the bias does not vanish for large sample sizes of persons or a large number of items. To this end, bias-reduced variants of HAE and SL linking were proposed in this article to reduce large portions of the bias in the estimated SD. However, using the bias-reduced variants of the HAE and SL linking methods comes with the price of an increased SD. Whether it is beneficial regarding the RMSE to apply the bias-reduced linking method depends on the size of the DIF variance. In general, bias-reducing methods might be preferred in the presence of large DIF effects, but they could potentially hurt in situations with small DIF effects, because in those situations, the biases of the original HAE and SL linking methods are relatively small. Overall, SL linking outperformed HAE linking (see also [34,35,36]).
In future research, the methodology could be further improved by considering alternative bias-reducing linking estimators. Moreover, the methodology could be extended to linking polytomous item responses using the generalized partial credit model [37]. Also, this article only treated the impact of uniform DIF effects (i.e., DIF effects in item difficulties) in HAE and SL linking. It might also be interesting to study bias-reducing linking methods that can also handle nonuniform DIF effects (i.e., DIF effects in item discriminations).
Our simulation study relied on marginal maximum likelihood estimation of item response models. However, the item parameters in separate scalings could also be obtained with limited information estimation [38,39]. Stocking–Lord and Haebara linking remain unchanged when using a different estimation procedure of item parameters. Hence, our proposed methodology also applies to alternative IRT estimation methods.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Replication material for the simulation study can be found at https://osf.io/8mk49.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
2PLtwo-parameter logistic
brHAEbias-reduced Haebara
brSLbias-reduced Stocking–Lord
HAEHaebara
IRFitem response function
IRTitem response theory
MGMmean-geometric-mean
RMSEroot mean square error
SDstandard deviation
SLStocking–Lord

References

  1. Bock, R.D.; Moustaki, I. Item response theory in a general framework. In Handbook of Statistics, Vol. 26: Psychometrics; Rao, C.R., Sinharay, S., Eds.; Elsevier: Amsterdam, The Netherlands, 2007; pp. 469–513. [Google Scholar] [CrossRef]
  2. Chen, Y.; Li, X.; Liu, J.; Ying, Z. Item response theory—A statistical framework for educational and psychological measurement. Stat. Sci. 2024. Epub ahead of print. Available online: https://rb.gy/1yic0e (accessed on 10 May 2024).
  3. van der Linden, W.J. Unidimensional logistic response models. In Handbook of Item Response Theory, Volume 1: Models; van der Linden, W.J., Ed.; CRC Press: Boca Raton, FL, USA, 2016; pp. 11–30. [Google Scholar] [CrossRef]
  4. Yen, W.M.; Fitzpatrick, A.R. Item response theory. In Educational Measurement; Brennan, R.L., Ed.; Praeger Publishers: Westport, CT, USA, 2006; pp. 111–154. [Google Scholar]
  5. Birnbaum, A. Some latent trait models and their use in inferring an examinee’s ability. In Statistical Theories of Mental Test Scores; Lord, F.M., Novick, M.R., Eds.; MIT Press: Reading, MA, USA, 1968; pp. 397–479. [Google Scholar]
  6. Aitkin, M. Expectation maximization algorithm and extensions. In Handbook of Item Response Theory, Vol. 2: Statistical Tools; van der Linden, W.J., Ed.; CRC Press: Boca Raton, FL, USA, 2016; pp. 217–236. [Google Scholar] [CrossRef]
  7. Bock, R.D.; Aitkin, M. Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika 1981, 46, 443–459. [Google Scholar] [CrossRef]
  8. Glas, C.A.W. Maximum-likelihood estimation. In Handbook of Item Response Theory, Vol. 2: Statistical Tools; van der Linden, W.J., Ed.; CRC Press: Boca Raton, FL, USA, 2016; pp. 197–216. [Google Scholar] [CrossRef]
  9. Lee, W.C.; Lee, G. IRT linking and equating. In The Wiley Handbook of Psychometric Testing: A Multidisciplinary Reference on Survey, Scale and Test; Irwing, P., Booth, T., Hughes, D.J., Eds.; Wiley: New York, NY, USA, 2018; pp. 639–673. [Google Scholar] [CrossRef]
  10. Kolen, M.J.; Brennan, R.L. Test Equating, Scaling, and Linking; Springer: New York, NY, USA, 2014. [Google Scholar] [CrossRef]
  11. Sansivieri, V.; Wiberg, M.; Matteucci, M. A review of test equating methods with a special focus on IRT-based approaches. Statistica 2017, 77, 329–352. [Google Scholar] [CrossRef]
  12. Millsap, R.E. Statistical Approaches to Measurement Invariance; Routledge: New York, NY, USA, 2011. [Google Scholar] [CrossRef]
  13. Penfield, R.D.; Camilli, G. Differential item functioning and item bias. In Handbook of Statistics, Vol. 26: Psychometrics; Rao, C.R., Sinharay, S., Eds.; Elsevier: Amsterdam, The Netherlands, 2007; pp. 125–167. [Google Scholar] [CrossRef]
  14. Monseur, C.; Berezner, A. The computation of equating errors in international surveys in education. J. Appl. Meas. 2007, 8, 323–335. Available online: https://bit.ly/2WDPeqD (accessed on 10 May 2024). [PubMed]
  15. Robitzsch, A. Robust and nonrobust linking of two groups for the Rasch model with balanced and unbalanced random DIF: A comparative simulation study and the simultaneous assessment of standard errors and linking errors with resampling techniques. Symmetry 2021, 13, 2198. [Google Scholar] [CrossRef]
  16. Robitzsch, A. Linking error in the 2PL model. J 2023, 6, 58–84. [Google Scholar] [CrossRef]
  17. Sachse, K.A.; Roppelt, A.; Haag, N. A comparison of linking methods for estimating national trends in international comparative large-scale assessments in the presence of cross-national DIF. J. Educ. Meas. 2016, 53, 152–171. [Google Scholar] [CrossRef]
  18. Sachse, K.A.; Haag, N. Standard errors for national trends in international large-scale assessments in the case of cross-national differential item functioning. Appl. Meas. Educ. 2017, 30, 102–116. [Google Scholar] [CrossRef]
  19. Wu, M. Measurement, sampling, and equating errors in large-scale assessments. Educ. Meas. 2010, 29, 15–27. [Google Scholar] [CrossRef]
  20. Robitzsch, A. A comparison of linking methods for two groups for the two-parameter logistic item response model in the presence and absence of random differential item functioning. Foundations 2021, 1, 116–144. [Google Scholar] [CrossRef]
  21. Haebara, T. Equating logistic ability scales by a weighted least squares method. Jpn. Psychol. Res. 1980, 22, 144–149. [Google Scholar] [CrossRef]
  22. Stocking, M.L.; Lord, F.M. Developing a common metric in item response theory. Appl. Psychol. Meas. 1983, 7, 201–210. [Google Scholar] [CrossRef]
  23. Longford, N.T.; Holland, P.W.; Thayer, D.T. Stability of the MH D-DIF statistics across populations. In Differential Item Functioning; Holland, P.W., Wainer, H., Eds.; Routledge: New York, NY, USA, 1993; pp. 171–196. [Google Scholar] [CrossRef]
  24. Penfield, R.D.; Algina, J. A generalized DIF effect variance estimator for measuring unsigned differential test functioning in mixed format tests. J. Educ. Meas. 2006, 43, 295–312. [Google Scholar] [CrossRef]
  25. Camilli, G. The case against item bias detection techniques based on internal criteria: Do item bias procedures obscure test fairness issues. In Differential Item Functioning: Theory and Practice; Holland, P.W., Wainer, H., Eds.; Erlbaum: Hillsdale, NJ, USA, 1993; pp. 397–417. [Google Scholar]
  26. Robitzsch, A.; Lüdtke, O. Some thoughts on analytical choices in the scaling model for test scores in international large-scale assessment studies. Meas. Instrum. Soc. Sci. 2022, 4, 9. [Google Scholar] [CrossRef]
  27. Shealy, R.; Stout, W. A model-based standardization approach that separates true bias/DIF from group ability differences and detects test bias/DTF as well as item bias/DIF. Psychometrika 1993, 58, 159–194. [Google Scholar] [CrossRef]
  28. Robitzsch, A.; Lüdtke, O. Why full, partial, or approximate measurement invariance are not a prerequisite for meaningful and valid group comparisons. Struct. Equ. Model. 2023, 30, 859–870. [Google Scholar] [CrossRef]
  29. Byrne, B.M.; Shavelson, R.J.; Muthén, B. Testing for the equivalence of factor covariance and mean structures: The issue of partial measurement invariance. Psychol. Bull. 1989, 105, 456–466. [Google Scholar] [CrossRef]
  30. von Davier, M.; Yamamoto, K.; Shin, H.J.; Chen, H.; Khorramdel, L.; Weeks, J.; Davis, S.; Kong, N.; Kandathil, M. Evaluating item response theory linking and model fit for data from PISA 2000–2012. Assess. Educ. 2019, 26, 466–488. [Google Scholar] [CrossRef]
  31. R Core Team. R: A Language and Environment for Statistical Computing. 2023. Available online: https://www.R-project.org (accessed on 15 March 2023).
  32. Robitzsch, A.; Kiefer, T.; Wu, M. TAM: Test Analysis Modules, R Package Version 4.2-21. 2024. Available online: https://cran.r-project.org/web/packages/TAM/index.html (accessed on 20 April 2024).
  33. Robitzsch, A. sirt: Supplementary Item Response Theory Models, R Package Version 4.2-57. 2024. Available online: https://github.com/alexanderrobitzsch/sirt (accessed on 20 April 2024).
  34. Kang, T.; Petersen, N.S. Linking item parameters to a base scale. Asia Pacific Educ. Rev. 2012, 13, 311–321. [Google Scholar] [CrossRef]
  35. Kilmen, S.; Demirtasli, N. Comparison of test equating methods based on item response theory according to the sample size and ability distribution. Procedia Soc. Behav. Sci. 2012, 46, 130–134. [Google Scholar] [CrossRef]
  36. Lee, W.C.; Ban, J.C. A comparison of IRT linking procedures. Appl. Meas. Educ. 2009, 23, 23–48. [Google Scholar] [CrossRef]
  37. Muraki, E. A generalized partial credit model: Application of an EM algorithm. Appl. Psychol. Meas. 1992, 16, 159–176. [Google Scholar] [CrossRef]
  38. Forero, C.G.; Maydeu-Olivares, A.; Gallardo-Pujol, D. Factor analysis with ordinal indicators: A Monte Carlo study comparing DWLS and ULS estimation. Struct. Equ. Model. 2009, 16, 625–641. [Google Scholar] [CrossRef]
  39. Robitzsch, A. A comparison of limited information estimation methods for the two-parameter normal-ogive model with locally dependent items. Stats 2024, 7, 576–591. [Google Scholar] [CrossRef]
Table 1. Simulation study: bias, standard deviation (SD), and relative root mean square error (RMSE) for the estimated mean μ ^ as a function of the uniform DIF standard deviation τ , number of items I, and sample size N.
Table 1. Simulation study: bias, standard deviation (SD), and relative root mean square error (RMSE) for the estimated mean μ ^ as a function of the uniform DIF standard deviation τ , number of items I, and sample size N.
Bias SD Relative RMSE
I N HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM
No DIF (DIF SD τ = 0 )
10500 0.008 0.012 0.003 0.010 0.004 0.094 0.095 0.091 0.092 0.103 100 101.796.798.1109.9
1000 0.004 0.006 0.002 0.005 0.002 0.064 0.064 0.063 0.063 0.069 100 100.797.397.9108.1
2000 0.002 0.003 0.001 0.002 0.001 0.046 0.046 0.045 0.045 0.050 100 100.397.097.3107.8
4000 0.000 0.001 0.000 0.000 0.000 0.032 0.032 0.031 0.031 0.035 100 100.197.197.2107.3
Inf00000 00000 100 100.0100.0100.0100.0
20500 0.007 0.011 0.003 0.009 0.003 0.082 0.083 0.081 0.082 0.087 100 101.498.699.7106.1
1000 0.004 0.006 0.002 0.005 0.002 0.058 0.058 0.057 0.057 0.060 100 100.798.398.9104.8
2000 0.002 0.003 0.001 0.002 0.001 0.041 0.041 0.040 0.040 0.042 100 100.399.299.4104.4
4000 0.000 0.000−0.001 0.000−0.001 0.029 0.029 0.028 0.028 0.030 100 100.198.798.8103.4
Inf00000 00000 100 100.0100.0100.0100.0
40500 0.007 0.011 0.003 0.009 0.004 0.077 0.078 0.077 0.077 0.080 100 101.499.4100.6102.9
1000 0.002 0.004 0.000 0.003 0.000 0.055 0.055 0.054 0.054 0.056 100 100.699.399.7102.3
2000 0.002 0.003 0.001 0.002 0.001 0.039 0.039 0.039 0.039 0.040 100 100.398.999.2102.3
4000 0.000 0.000−0.001 0.000 0.000 0.027 0.028 0.027 0.027 0.028 100 100.199.599.6101.8
Inf00000 00000 100 100.0100.0100.0100.0
Bias SD Relative RMSE
I N HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM
Moderate Uniform DIF (DIF SD τ = 0.25 )
10500 0.006 0.015 0.000 0.012 0.005 0.127 0.130 0.121 0.123 0.131 100 103.595.597.8103.2
1000 0.001 0.007−0.003 0.006 0.003 0.107 0.110 0.102 0.104 0.107 100 102.295.496.799.9
2000−0.002 0.003−0.005 0.001 0.000 0.096 0.097 0.091 0.092 0.094 100 101.995.596.597.9
4000−0.003 0.002−0.006 0.000−0.001 0.091 0.092 0.086 0.087 0.087 100 101.795.496.296.0
Inf−0.004 0.000−0.006−0.001 0.000 0.085 0.086 0.080 0.081 0.079 100 101.795.095.793.5
20500 0.003 0.012−0.002 0.010 0.004 0.100 0.103 0.097 0.098 0.102 100 103.496.698.8101.7
1000 0.000 0.007−0.003 0.005 0.003 0.083 0.085 0.080 0.081 0.082 100 102.396.697.899.4
2000−0.001 0.005−0.003 0.004 0.003 0.072 0.073 0.069 0.070 0.070 100 102.096.197.197.6
4000−0.003 0.002−0.005 0.001 0.001 0.066 0.067 0.063 0.064 0.063 100 101.896.096.895.5
Inf−0.005 0.000−0.006 0.000−0.001 0.059 0.060 0.056 0.057 0.056 100 101.595.996.394.3
40500 0.000 0.009−0.005 0.007 0.002 0.087 0.089 0.086 0.087 0.088 100 102.898.399.7101.0
1000−0.001 0.006−0.004 0.004 0.002 0.068 0.070 0.067 0.067 0.068 100 102.298.099.0100.1
2000−0.004 0.002−0.005 0.001 0.001 0.057 0.058 0.055 0.056 0.056 100 101.697.197.697.3
4000−0.003 0.002−0.005 0.001 0.000 0.050 0.051 0.048 0.048 0.048 100 101.796.496.995.6
Inf−0.007−0.002−0.008−0.002−0.002 0.043 0.043 0.041 0.041 0.040 100 100.695.795.293.0
Large Uniform DIF (DIF SD τ = 0.5 )
10500−0.008 0.014−0.019 0.007 0.003 0.188 0.203 0.177 0.186 0.189 100 107.894.598.4100.2
1000−0.012 0.006−0.020 0.001 0.002 0.176 0.187 0.168 0.174 0.174 100 106.095.798.698.6
2000−0.017−0.001−0.024−0.005−0.004 0.172 0.182 0.162 0.168 0.167 100 105.595.097.596.6
4000−0.012 0.003−0.019−0.001 0.001 0.169 0.179 0.160 0.165 0.162 100 105.494.997.395.5
Inf−0.011 0.005−0.018 0.000 0.002 0.165 0.175 0.156 0.161 0.156 100 105.894.697.394.4
20500−0.008 0.014−0.017 0.009 0.007 0.144 0.154 0.137 0.142 0.144 100 107.095.899.0100.2
1000−0.013 0.004−0.020 0.001 0.003 0.130 0.138 0.124 0.128 0.127 100 105.596.098.197.5
2000−0.015 0.001−0.020−0.001 0.003 0.121 0.128 0.116 0.120 0.118 100 105.096.298.097.0
4000−0.019−0.003−0.024−0.005−0.003 0.122 0.129 0.115 0.119 0.116 100 104.895.797.294.3
Inf−0.015 0.000−0.020−0.002 0.002 0.115 0.122 0.110 0.113 0.111 100 104.895.897.595.3
40500−0.011 0.009−0.019 0.006 0.006 0.112 0.118 0.108 0.112 0.113 100 105.997.599.7100.7
1000−0.015 0.002−0.021 0.000 0.003 0.099 0.104 0.095 0.098 0.098 100 104.397.698.598.7
2000−0.018−0.002−0.023−0.003 0.000 0.092 0.097 0.088 0.091 0.090 100 103.896.997.295.9
4000−0.016−0.001−0.021−0.002 0.001 0.087 0.092 0.082 0.085 0.083 100 104.196.296.794.3
Inf−0.019−0.004−0.023−0.005 0.000 0.082 0.087 0.077 0.080 0.078 100 103.496.095.792.8
Note: HAE = Haebara linking; brHAE = bias-reduced Haebara linking; SL = Stocking–Lord linking; brSL = bias-reduced Stocking–Lord linking; MGM = mean-geometric-mean linking; Inf = infinite sample size; Haebara (HAE) linking was the reference method for computing the relative RMSE. Biases with absolute values larger than 0.010 are printed in bold font. Relative RMSE values larger than 10% more than the relative RMSE of the best-performing method in the respective condition are printed in bold font.
Table 2. Simulation study: bias, standard deviation (SD), and relative root mean square error (RMSE) for the estimated standard deviation σ ^ as a function of the uniform DIF standard deviation τ , number of items I, and sample size N.
Table 2. Simulation study: bias, standard deviation (SD), and relative root mean square error (RMSE) for the estimated standard deviation σ ^ as a function of the uniform DIF standard deviation τ , number of items I, and sample size N.
Bias SD Relative RMSE
I N HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM
No DIF (DIF SD τ = 0 )
10500 0.013 0.018 0.008 0.015 0.008 0.099 0.100 0.097 0.098 0.099 100 102.497.398.999.5
1000 0.008 0.011 0.005 0.008 0.005 0.068 0.068 0.067 0.067 0.068 100 101.298.098.9100.4
2000 0.002 0.004 0.001 0.003 0.001 0.047 0.047 0.046 0.047 0.048 100 100.598.799.0101.4
4000 0.001 0.002 0.000 0.001 0.000 0.033 0.033 0.033 0.033 0.034 100 100.299.199.2101.7
Inf00000 00000 100 100.0100.0100.0100.0
20500 0.010 0.015 0.005 0.012 0.005 0.077 0.078 0.076 0.076 0.077 100 102.497.599.098.8
1000 0.005 0.008 0.003 0.006 0.003 0.054 0.054 0.053 0.053 0.054 100 101.298.699.4100.6
2000 0.003 0.004 0.001 0.003 0.002 0.037 0.038 0.037 0.037 0.038 100 100.699.099.4100.5
4000 0.001 0.002 0.000 0.001 0.000 0.027 0.027 0.026 0.026 0.027 100 100.399.099.2100.7
Inf00000 00000 100 100.0100.0100.0100.0
40500 0.007 0.012 0.003 0.009 0.003 0.066 0.067 0.065 0.066 0.066 100 102.498.199.699.0
1000 0.004 0.007 0.002 0.005 0.002 0.046 0.046 0.046 0.046 0.046 100 101.298.699.499.3
2000 0.002 0.003 0.000 0.002 0.000 0.033 0.033 0.032 0.032 0.033 100 100.599.199.499.8
4000 0.000 0.001−0.001 0.000−0.001 0.022 0.022 0.022 0.022 0.022 100 100.299.499.4100.5
Inf00000 00000 100 100.0100.0100.0100.0
Moderate Uniform DIF (DIF SD τ = 0.25 )
10500 0.007 0.020−0.001 0.015 0.009 0.108 0.113 0.099 0.101 0.098 100 105.991.394.490.6
1000 0.000 0.010−0.005 0.007 0.005 0.081 0.084 0.072 0.073 0.069 100 104.588.590.584.8
2000−0.002 0.007−0.007 0.004 0.002 0.065 0.068 0.053 0.055 0.048 100 104.682.784.574.0
4000−0.005 0.004−0.008 0.002 0.001 0.056 0.058 0.042 0.044 0.033 100 104.476.777.859.6
Inf−0.007 0.001−0.010 0.000 0.000 0.044 0.046 0.026 0.028 0.000 100 103.963.162.7 5 0.0
20500 0.002 0.015−0.005 0.012 0.006 0.085 0.088 0.079 0.081 0.079 100 105.693.496.292.9
1000−0.003 0.008−0.008 0.005 0.003 0.062 0.064 0.056 0.057 0.053 100 104.190.992.086.6
2000−0.006 0.004−0.009 0.003 0.002 0.050 0.052 0.042 0.043 0.038 100 103.785.986.576.6
4000−0.009 0.000−0.011 0.000 0.000 0.041 0.043 0.032 0.033 0.026 100 102.080.778.662.8
Inf−0.008 0.001−0.010 0.000 0.000 0.031 0.032 0.018 0.020 0.000 100 102.366.361.9 5 0.0
40500−0.001 0.012−0.007 0.009 0.004 0.070 0.073 0.066 0.068 0.065 100 105.294.997.093.3
1000−0.006 0.005−0.010 0.004 0.001 0.051 0.053 0.047 0.048 0.046 100 103.294.094.090.2
2000−0.008 0.001−0.011 0.001 0.000 0.039 0.041 0.035 0.035 0.032 100 101.490.387.980.3
4000−0.009 0.000−0.012−0.001−0.002 0.032 0.033 0.026 0.027 0.023 100 100.186.480.868.8
Inf−0.010−0.001−0.011−0.001 0.000 0.022 0.023 0.013 0.014 0.000 100 96.872.158.7 5 0.0
Bias SD Relative RMSE
I N HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM HAEbrHAESLbrSLMGM
Large Uniform DIF (DIF SD τ = 0.5 )
10500−0.017 0.014−0.032 0.006 0.006 0.130 0.149 0.106 0.115 0.098 100 114.284.387.674.6
1000−0.022 0.007−0.033 0.001 0.005 0.108 0.124 0.082 0.090 0.069 100 112.580.582.062.9
2000−0.025 0.003−0.036−0.003 0.002 0.097 0.113 0.067 0.076 0.048 100 112.775.976.247.6
4000−0.025 0.002−0.036−0.004 0.000 0.092 0.108 0.060 0.070 0.033 100 112.973.673.235.0
Inf−0.026 0.001−0.036−0.005 0.000 0.085 0.100 0.050 0.060 0.000 100 113.070.068.2 5 0.0
20500−0.023 0.008−0.035 0.003 0.005 0.096 0.109 0.081 0.087 0.077 100 110.989.388.477.9
1000−0.029−0.001−0.039−0.004 0.002 0.080 0.091 0.063 0.068 0.054 100 107.186.380.262.7
2000−0.033−0.005−0.041−0.007 0.000 0.070 0.080 0.051 0.057 0.038 100 104.584.374.248.9
4000−0.030−0.002−0.039−0.005 0.001 0.064 0.075 0.043 0.050 0.027 100 105.782.170.737.9
Inf−0.032−0.005−0.040−0.007 0.000 0.059 0.070 0.036 0.043 0.000 100 104.079.464.6 5 0.0
40500−0.026 0.004−0.037 0.002 0.005 0.080 0.089 0.070 0.075 0.068 100 106.494.589.181.0
1000−0.033−0.005−0.041−0.006 0.001 0.062 0.070 0.051 0.056 0.046 100 100.193.579.265.2
2000−0.034−0.006−0.042−0.007−0.001 0.053 0.060 0.041 0.045 0.033 100 96.392.972.052.3
4000−0.034−0.007−0.041−0.007 0.000 0.048 0.055 0.033 0.038 0.023 100 95.090.966.239.0
Inf−0.035−0.008−0.042−0.008 0.000 0.042 0.050 0.026 0.031 0.000 100 91.989.058.1 5 0.0
Note: HAE = Haebara linking; brHAE = bias-reduced Haebara linking; SL = Stocking–Lord linking; brSL = bias-reduced Stocking–Lord linking; MGM = mean-geometric-mean linking; Inf = infinite sample size; Haebara (HAE) linking was the reference method for computing the relative RMSE. Biases with absolute values larger than 0.010 are printed in bold font. Relative RMSE values larger than 10% more than the relative RMSE of the best-performing method in the respective condition are printed in bold font.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Robitzsch, A. Bias-Reduced Haebara and Stocking–Lord Linking. J 2024, 7, 373-384. https://doi.org/10.3390/j7030021

AMA Style

Robitzsch A. Bias-Reduced Haebara and Stocking–Lord Linking. J. 2024; 7(3):373-384. https://doi.org/10.3390/j7030021

Chicago/Turabian Style

Robitzsch, Alexander. 2024. "Bias-Reduced Haebara and Stocking–Lord Linking" J 7, no. 3: 373-384. https://doi.org/10.3390/j7030021

APA Style

Robitzsch, A. (2024). Bias-Reduced Haebara and Stocking–Lord Linking. J, 7(3), 373-384. https://doi.org/10.3390/j7030021

Article Metrics

Back to TopTop