Abstract
A common and simple estimate of variability is the sample range, which is the difference between the maximum and minimum values in the sample. While other measures of variability are preferred in most instances, process owners and operators regularly use range (R) control charts to monitor process variability. The center line and limits of the R charts use constants that are based on the first two moments (mean and variance) of the distribution of the range of normal random variables. Historically, the computation of moments requires the use of tabulated constants approximated using numerical integration. We provide exact results for the moments for sample sizes 2 through 5. For sample sizes from 6 to 1000, we used the differential correction method to find Chebyshev minimax rational-function approximations of the moments. The rational function we recommend for the mean (R-chart constant d2) has a polynomial of order two in the numerator and six in the denominator and achieves a maximum error of 4.4 × 10−6. The function for the standard deviation (R-chart constant d3) has a polynomial of order two in the numerator and seven in the denominator and achieves a maximum error of 1.5 × 10−5. The exact and approximate expressions eliminate the need for table lookup in the control chart design phase.
Keywords:
control charts; Six Sigma; process variation; order statistics; rational functions; minimax error approximation MSC:
62P30; 41A20; 62G30
1. Introduction
Variation is central to both statistical methods and statistical thinking [,,]. There are multiple measures of variation that differ in their ease of computation and usefulness. Perhaps the simplest to compute, but inferior in some of its statistical properties (e.g., consistency), is the sample range. In most applications, statistical performance outweighs computational ease; however, there are important exceptions. One of these applications is the use of control charts to monitor (and ultimately improve) processes.
Control charts are one of the central tools used in Six Sigma and quality management. When the variable of interest is measured (as opposed to counted), the most common control charts are and R charts. The former is used to monitor the central tendency of the process, while the latter monitors process variation. To implement the charts, “rational” samples (often referred to as subgroups) are collected at regular intervals. The sample mean of each subgroup is plotted on the chart, while the range of each is plotted on the R chart. The charts are time-series plots with superimposed limits. Plotted points falling within the limits indicate a stable (“in-control”) process, whereas points falling outside the limits point to a process change. The limits are typically set at three standard deviations from the mean of the statistic being plotted to reduce the chance of a false out-of-control signal while still providing sufficient power to identify out-of-control conditions when they occur.
While substitutes for the R chart have been proposed (see [], for example), the original R chart is widely used in practice because of its computational simplicity and ease of interpretation. Process operators can easily compute the range and plot it on a control chart. By doing so, they can quickly identify potential changes and increase their involvement in process improvement initiatives.
While the use of R charts is simple, the mathematics behind the design of the chart is not as straightforward. The computation of R chart limits requires the use of tabulated constants. These constants were derived from the moments of the distribution of the range and are denoted d2 (the expected value of the range of standard normal random variables) and d3 (the standard deviation of the range of standard normal random variables). In almost all cases, the tabulated values of d2 and d3 have been approximated by numerical integration because analytical solutions have not been found. While the design procedure is not difficult, it is somewhat of a black box, and typically requires table lookup procedures for implementation. Moreover, the tabulated values are usually of limited precision (three decimal places) and scope, typically providing values only to a subgroup size of 20 or 25, which may be insufficient for some applications (We also note that having a computationally efficient way to estimate the moments could be of use in other statistical methods such as estimation or hypothesis testing. As explained at the beginning of the introduction, however, the range is rarely chosen as a measure of variability in such applications due to its poor statistical properties and hence we limit our attention to the application to control charts where the range is used regularly).
A preferred approach might be one in which data collection and processing are integrated into an online process control system. Ref. [] described how an online system can be used to improve both control and inspection to reduce quality costs. While a lookup procedure could be used in such online systems, the preferred approach would be to automate the control chart computations and data collection.
In this paper (which extends our own work in []), we provide simple expressions that eliminate the need for table lookups. For sample sizes of 2 through 5, the provided expressions are exact and are obtained from a review of the previous literature. For larger sample sizes (up to 1000), the expressions are rational function (of the sample size) approximations. These expressions are simple to implement on a computer and are highly accurate.
In Section 2, we review the general expressions for the distribution, mean, and standard deviation of the range. We also discuss the approximation procedure for larger sample sizes. In Section 3, we review previously reported results for sample sizes of 2 through 5 and then present several possible rational functions that can be used to approximate d2 and d3 for sample sizes larger than 5. In Section 4, we compare the errors of various rational functions and provide recommended functions for general sample sizes. In Section 5, we provide concluding remarks.
2. The Distribution and First Two Moments of the Range
The constants used to design the range control chart are all derived from the distribution of the range of standard normal random variables. The usual assumption in control charting is that the process generates a random output that follows a normal distribution. It is not our purpose to explore violations of this assumption (see [,,] for example discussions of potential problems when normality is not observed). All control chart constants associated with the range chart can be found if the mean and variance of the range are known. The mean of the range of normal random variables is denoted by d2σx, where σx is the standard deviation of the process, and the standard deviation of the range is represented by d3σx. In the remainder of the paper, we assume, without loss of generality, that the normal random output variable has a mean of 0 and a standard deviation of 1. Hence the design of the range control chart (and associated chart) is equivalent to finding the constants d2 and d3.
To find the first two moments of the range, we begin with an expression for the probability density function (pdf) of R, fR(r). The pdf of the range can be obtained by integrating the midrange T from the joint distribution of R and T [] as follows:
where n is the sample size, and and are the distribution function and pdf of a standard normal random variable. From (1) we can find the control chart constants d2 and d3 as
and
For general n, expressions (1)–(3) are difficult to solve analytically. Hence, the control chart constants d2 and d3, which are tabulated in many quality control handbooks and textbooks, were determined using numerical integration. However, it is possible to obtain analytical solutions for sample sizes 2–5. The simplified expressions and results are presented in the Results section.
2.1. Approximations to d2 and d3 for Samples Sizes of at Least 6
As mentioned above, we are not familiar with analytical solutions to expressions (1)–(3) when the sample size exceeds 5. However, numerical values of the control chart constants d2 and d3 are widely available. In this section, we describe how the numerical values were used to develop rational function approximations. Other authors have developed approximations for the moments of order statistics of normal random variables (see [,] for summaries of some of these methods; see [] as well). We believe that these earlier approximations have at least three limitations. First, some approximations are less accurate, especially for extreme order statistics (those used to compute the range). Second, the methods are more complicated than those presented in this study. Third, some methods require the use of tabulated moments of neighboring order statistics, which defeats the purpose of having approximations that do not require table lookup procedures.
Approximation Method
We focus on the method for approximating d2. The method was essentially the same for finding d3. We began by plotting the tabulated values of d2 as a function of the sample size n for values ranging from 2 to 1000, as given in [] (We also used the values from [] for d3. We note that the tabulated values of [] were presented to 5 decimal places and those of [] were to 15 places). Figure 1 shows a plot that suggests that d2 is a function of the logarithm of n. Hence, we sought an approximation relating the control chart constant values to the logarithm of n.
Figure 1.
Plot of the tabulated values of d2 as a function of sample size n. The plot suggests that the relationship between d2 and n is logarithmic.
While numerous methods exist for estimating discrete values, we determined that a rational-function approximation would work well for this application. First, rational functions include polynomials (when the order of the polynomial in the denominator is of order 0). Second, other options such as splines, can result in more complex functions, and we wanted relatively simple expressions. Third, the rational functions have desirable asymptotic characteristics. For these and other advantages, see [].
Instead of using a global measure of approximation error, such as the mean square error (MSE), we used the minimization of the maximum error between the approximations and the tabulated values of the control chart constants as our objective (As a final reason for using rational-function approximations rather than splines, we did look briefly at spline estimates and found that our rational function approximations resulted in smaller maximum errors). While MSE or mean absolute error (MAE) may be more appropriate if the overall accuracy of the approximation is important, we desired to ensure that the estimate of the control chart constants at every value of n was as close as possible to the tabulated values. Using MSE or MAE may result in an aggregate lower error, but it may do so at the expense of a few estimates having larger errors than desired. In other words, minimizing the maximum error is useful for guaranteeing worst-case (lack of) accuracy.
Because d2 is defined on a discrete point set rather than on an interval, we used the differential correction method to determine the Chebyshev minimax approximations []. The algorithm is an iterative one that seeks to minimize the maximum difference between the rational function, Rmk(n) and the tabulated values of d2, where
In (4), m is the order of the polynomial in the numerator of the rational function, k is the order of the denominator, and aj and bj are the coefficients of the numerator and denominator polynomials.
An initial rational function estimate is required to begin the procedure. We used multiple (polynomial) regression to find an initial polynomial solution and then used the minimax algorithm to convert the polynomial expression into a rational expression. The regression solutions were quite accurate, but in all cases, the estimates were improved by using the differential correction method.
The two parameters, m and k, can be adjusted to obtain better approximations. Generally, increasing m + k improves the accuracy of the estimates. Hence, we tried different combinations of parameters to find accurate and parsimonious approximations. For d2, we tried m + k values from 2 to 9, and for d3, we used values from 4 to 10. Combinations with sums greater than 9 (or 10 for d3) did not yield significant improvements in accuracy and were discarded in the interest of parsimony. Combinations with sums less than 2 (4 for d3) yielded unacceptable maximum errors.
We note that not all tabulated values were used for the computations. For d2 we used tabulated values for n = 2 (1) 25, 30 (10) 50, 100 (100) 1000. For d3, we used tabulated values for n = 2 (1) 25, 30 (5) 60, 70 (10) 100, 200, 500, and 1000 (the last three tabulated values were taken from [] and were presented to only three decimal places). Removing some of the intermediate values where the curve in Figure 1 is relatively flat results in a more manageable optimization problem. As a post-hoc check, we evaluated the preferred rational function expressions at sample sizes 2 (1) 500 and 510 (10) 1000 for d2 and 2 (1) 100 for d3 and found only a few cases where the error only slightly exceeded the maximum error calculated during the estimation procedure (See Table 4 in the Results section for the preferred functions and the associated errors).
3. Results
We begin by presenting the analytical results for the first two moments of the range distribution for sample sizes 2 through 5. We then provide the results for the rational function approximations for larger sample sizes.
3.1. Analytical Results for n = 2, 3, 4 and 5
When n = 2, (1) simplifies substantially, and the solutions to (1)–(3) are straightforward. When n = 3, 4, and 5, the analytical solutions are not as simple. Ref. [] showed that the pdf for the range when the sample size is 3 can be expressed as
They also showed that the ith moment μi of the distribution about the origin is
Other researchers have found moments of order statistics for samples of sizes 4 and 5. For a listing of these researchers and a description of the derivation by [], see []. The author’s alternative derivations are provided in Appendix A. Table 1 summarizes the analytical results.
Table 1.
Analytical results for d2 and d3 for samples of sizes 2 through 5.
The analytical results reveal some common elements. For example, the arc tangent function and the term π3/2 appear in most of the d2 results. Despite these commonalities, we were unable to induce expressions for general sample sizes. Instead, in the next sections, we present accurate rational function approximations.
3.2. Approximation Results
Table 2 summarizes the results for the d2 approximations, and Table 3 summarizes the results for the d3 approximations. The tables show the values of m and k, the maximum error for the given rational function, and the coefficient values defined in (4). The tables are sorted first by the number of parameters (i.e., by m + k) and then by the maximum error. For the sake of brevity, the tables only show the case for a given m + k for which the maximum error is the smallest (More complete tables are given in Appendix B). For example, the first entry in Table 2 is for a rational function with m = 7 and k = 2. The maximum error for such a case was 4.3685 × 10−6, and the rational function approximation was
where x = log(n).
Table 2.
Rational function values for approximating the control chart constant d2. These results are for the functions with the lowest errors for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.
Table 3.
Rational function values for approximating the control chart constant d3. These results are for the functions with the lowest error for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.
4. Discussion
Table 2 shows that the smallest maximum error was reported for the m + k = 9 case. We have only reported this case to show that a few of the m + k = 8 cases were very comparable in terms of their maximum error. The table also shows that there was no one systematically preferred form of the rational function approximations. For example, when m + k = 8, the best rational function approximation had more terms in the denominator (m = 2 and k = 6) than in the numerator; however, when m + k = 7, the lowest error was obtained from a rational function with more terms in the numerator (m = 5 and k = 2). Finally, the table shows that the d2 terms can be accurately estimated to at least three decimal places (maximum errors on the order of 10−4) with rational functions having as few as five total terms (m + k = 4). The approximations are very good with only eight total terms, having maximum errors on the order of 10−6. Such accuracy is remarkable given that the tabulated values used for the estimation were only given to five decimal places. Hence, very accurate estimates of d2 can be made for sample sizes from 2 to 1000 using simple algebraic expressions.
Similar results were obtained for d3, although the approximations were not as good. To achieve maximum errors on the order of 10−6, m + k must be as high as 10. With a total of eight terms (m + k = 7), some maximum errors were just over 10−5, which is still very good. We believe that greater accuracy is worth sacrificing a small degree of parsimony in the expression. We suggest using the rational functions shown in the second rows of Table 2 and Table 3, as they provide excellent accuracy with reasonably sized rational functions. Table 4 summarizes our suggestions for determining when finding the values of d2 and d3 for sample sizes of 2 to 1000. Moreover, Figure 2 shows plots of the difference between the actual and estimated values of the control chart constants for the suggested expressions for sample sizes greater than 5. As can be seen in the figure, the error oscillates, and in some cases, it is much smaller than the maximum error.
Table 4.
Suggested algebraic expressions for control chart constants d2 and d3 for sample sizes 2 to 1000. In the table, x = log(n).
Figure 2.
Plots of the approximation errors for (a) d2 and (b) d3 for the rational functions listed in Table 4. The plots show the sample sizes for which the estimation error is the largest.
While we recommend the expressions in Table 4, others may prefer a different combination of parsimony and accuracy. Appendix B shows more complete tables of the estimated expressions and their maximum errors, which can be used to determine the preferred combination. Similarly, Figure 3 shows the contour plots of the negative logarithm of the maximum error for different combinations of the orders of the polynomials in the rational functions. Higher values in the plot indicate lower errors. For example, a value of 5 in the plot is associated with a maximum error of 1 × 10−5.
Figure 3.
Contour plots of the negative logarithm of the maximum error for (a) d2 and (b) d3 for different combinations of the orders of the polynomials in (4). Larger values are associated with smaller errors.
5. Conclusions
We have provided algebraic expressions that can be used to easily compute the control chart constants d2 and d3 for sample sizes from 2 to 1000. For the cases of n = 2 through 5 we have reviewed the exact solutions, whereas for cases of n > 5, we have provided rational function approximations that are very accurate. Other constants that are based on range control charts (e.g., A2, D3, and D4) can easily be computed from the two constants d2 and d3 (see [] for example). Such expressions allow control chart designers and users, including control chart software producers, to compute the control chart constants without using lookup routines, which generally find control chart constants for a limited number of possible sample sizes with limited precision. Hence, the expressions simplify the process of automating the range control chart construction.
Funding
This research received no external funding.
Data Availability Statement
The data used for this project consisted only of the tabulated control chart constants given in [,] as described in Section 2.1.
Conflicts of Interest
The author declare no conflict of interest.
Appendix A
Author’s derivation of the analytical solutions for the first two moments of the range for sample sizes of four and five.
Appendix A.1. Derivation of the Expected Value of the Range for n = 4 and 5
We first note that from the definition of the standard normal pdf, we can rewrite the latter part of the integrand of (1) as
For ease of exposition, we can use (A1) we can rewrite (1) as
where
From the definition of expected value we have
We can integrate (A4) by parts to obtain
where
and
Substituting z = in the first exponent and z = in the second, (A7) becomes
We can evaluate (A8) by taking the derivative of Jn with respect to r, evaluating the resulting integral with respect to z, and then integrating again with respect to r. The result is that
Expression (A9) has several products of the form , which can be recombined to obtain . We can further substitute for z using , which gives
We now consider the special cases of n = 4 and n = 5.
Appendix A.1.1. n = 4 Case
In this case (A10) simplifies to
and so
Using (A12) in (A6) and then (A5) for n = 4 gives
To find (A13) we evaluate the three terms separately, which we will call K1, K2 and K3 (so that ). The integral K3 is straightforward and is . The other two are of the same form K,
We will use the same technique as [], which is to substitute v = into (A14) and change the order of integration. The result is
We can now evaluate (A15) with respect to r, giving
We now let v = and obtain
Comparing this result to the terms in (A13) shows that
and
Hence we have that
which simplifies to
Appendix A.1.2. n = 5 Case
If we substitute n = 5 into (A10) we obtain
We can evaluate the integrals in (A22) by differentiating with respect to r first, then integrating with respect to u, and finally integrating again with respect to r. The first integral then is and the second integral is . Hence we have
If we combine (A5) and (A6) for n = 5 we have
We can integrate (A24) by parts again to obtain
To evaluate E(R) now, we must be able to evaluate integrals of the form L = . We will do so by integrating by parts, letting u = and dv = . We can easily find du, but finding v is much more difficult. I will again use the method of []. First write v as . Now let y = in the second of these integrals, and then reverse the order of integration. The result is . We can now evaluate the inside integral to obtain
Now we have that L = or
For simplicity, let β = and reverse the order of integration to get
The inner definite integral can now be evaluated, so that we have
The last integral can now be evaluated and gives
The other terms in (A25) have the form K = . From the n = 4 case we know that K = . Now we can combine this result with (A30) and substitute them for the appropriate terms in (A25). After (a large amount of) simplification, the result is that E(R) = .
Appendix A.2. Derivation of the Variance of the Range for n = 4 and 5
We again begin with the case for general n and use (A2) and (A3) as a base. To find the variance of the range, we first find E(R2),
First, we can integrate this by parts to obtain
where is defined in (A6). By noting that the integrand of the second term in (A32) is essentially fR(r), and by using (A6) and (A7) we can simplify (A32) to
where Jn(r) is defined in (A8). Integrating by parts once again gives
where is defined in (A12).
For the n = 4 case we can substitute (A11) into (A34) and integrate to find that E(R2) is .
For the case of n = 5, we can use (A23) in (A34) to obtain
We have already seen in Appendix A.1 that integrals like most of those found in (A35) can be found using (A14)–(A17). Hence (A35) simplifies to E(R2) = .
Appendix B. Complete Tables for the Rational Function Approximations
Table A1.
Rational function values for approximating the control chart constant d2 (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator (See expression (4) in Section 2).
Table A1.
Rational function values for approximating the control chart constant d2 (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator (See expression (4) in Section 2).
| a. m + k = 9. | ||||||||
| (m, k) | (7, 2) | |||||||
| Maximum Error | 4.37 × 10−6 | |||||||
| a0 | −4.68 × 10−5 | |||||||
| a1 | 4.1601706 | |||||||
| a2 | −1.072159 | |||||||
| a3 | 0.2862847 | |||||||
| a4 | −0.0791025 | |||||||
| a5 | 0.0128782 | |||||||
| a6 | −0.0011768 | |||||||
| a7 | 4.36 × 10−5 | |||||||
| b1 | 0.1108409 | |||||||
| b2 | −0.036305 | |||||||
| b. m + k = 8. | ||||||||
| (m, k) | (2, 6) | (8, 0) | (6, 2) | (5, 3) | (4, 4) | (7, 1) | (3, 5) | (1, 7) |
| Maximum Error | 4.39 × 10−6 | 4.40 × 10−6 | 4.45 × 10−6 | 4.50 × 10−6 | 4.82 × 10−6 | 5.69 × 10−6 | 6.68 × 10−6 | 1.23 × 10−5 |
| a0 | 0.0002117 | 0.0001602 | −0.0003999 | 0.0003858 | 0.0005454 | 0.0003989 | 0.0017505 | −0.0022132 |
| a1 | 4.157714 | 4.1582259 | 4.1627898 | 4.1561616 | 4.1545675 | 4.1561546 | 4.1424508 | 4.1789954 |
| a2 | 0.3491476 | −1.5259333 | −0.950065 | 1.4537905 | 0.7921954 | −1.5117788 | 4.4979643 | |
| a3 | 0.5924688 | 0.2092784 | 0.2428315 | −0.2792372 | 0.5775411 | −1.3731118 | ||
| a4 | −0.1840047 | −0.0564231 | −0.0142767 | −0.1098239 | −0.1706958 | |||
| a5 | 0.0429772 | 0.0076955 | 0.0007023 | 0.0354162 | ||||
| a6 | −0.0070858 | −0.000464 | −0.0045255 | |||||
| a7 | 0.0007349 | 0.0002648 | ||||||
| a8 | −3.60 × 10−5 | |||||||
| b1 | 0.4502653 | 0.1420291 | 0.71454 | 0.5539226 | 0.0017124 | 1.4332245 | 0.3843422 | |
| b2 | 0.0245085 | −0.0462027 | 0.1823815 | 0.0012477 | 0.0864897 | −0.0338921 | ||
| b3 | −0.0137816 | 0.0005872 | −0.0705542 | −0.1716855 | 0.0137563 | |||
| b4 | 0.004125 | −0.0034663 | 0.0147776 | −0.0105444 | ||||
| b5 | −0.0006383 | −0.0008095 | 0.0043436 | |||||
| b6 | 4.18 × 10−5 | −0.0008831 | ||||||
| b7 | 7.16 × 10−5 | |||||||
| c. m + k = 7. | ||||||||
| (m, k) | (5, 2) | (7, 0) | (3, 4) | (2, 5) | (1, 6) | (6, 1) | (4, 3) | |
| Maximum Error | 4.52 × 10−6 | 5.76 × 10−6 | 1.20 × 10−5 | 1.83 × 10−5 | 2.47 × 10−5 | 3.36 × 10−5 | 8.85 × 10−5 | |
| a0 | 0.0003945 | 0.0004069 | −0.0013429 | 0.0674587 | −0.0023136 | 0.0019498 | −0.0049402 | |
| a1 | 4.1560808 | 4.1560877 | 4.1706731 | 3.5533193 | 4.1779037 | 4.144603 | 4.1994308 | |
| a2 | 1.4690251 | −1.5186854 | −0.6775001 | 133.74141 | −1.4869875 | −3.479488 | ||
| a3 | 0.2391045 | 0.5798019 | 0.3559161 | 0.5353291 | 0.697147 | |||
| a4 | −0.0147213 | −0.1713809 | −0.137747 | 0.0833977 | ||||
| a5 | 0.0007021 | 0.0355484 | 0.0215142 | |||||
| a6 | −0.0045384 | −0.0015042 | ||||||
| a7 | 0.0002651 | |||||||
| b1 | 0.7181305 | 0.2137831 | 32.001851 | 0.3814535 | −0.0001152 | −0.4379041 | ||
| b2 | 0.1829496 | 0.0046636 | 12.604867 | −0.0254915 | −0.1903784 | |||
| b3 | 0.0331532 | −1.0269053 | 0.0018241 | 0.1142485 | ||||
| b4 | −0.0013196 | 0.0535564 | −0.0014589 | |||||
| b5 | 0.0010865 | 0.0005456 | ||||||
| b6 | −6.26 × 10−5 | |||||||
| d. m + k = 6. | ||||||||
| (m, k) | (2, 4) | (6, 0) | (1, 5) | (3, 3) | (4, 2) | (5, 1) | ||
| Maximum Error | 2.18 × 10−5 | 4.01 × 10−5 | 4.13 × 10−5 | 5.93 × 10−5 | 7.91 × 10−5 | 8.66 × 10−5 | ||
| a0 | 0.0740863 | 0.0021644 | −0.0024841 | 0.1112902 | −0.0039991 | −0.004793 | ||
| a1 | 3.5003552 | 4.1431818 | 4.1779643 | 3.2300901 | 4.1888288 | 4.1941431 | ||
| a2 | 133.77185 | −1.4831307 | 112.82908 | −0.5261395 | 0.4951024 | |||
| a3 | 0.5314118 | 13.842787 | −0.0412288 | −0.0515586 | ||||
| a4 | −0.1356001 | −0.0023828 | 0.0141002 | |||||
| a5 | 0.0209355 | −0.0016136 | ||||||
| a6 | −0.0014438 | |||||||
| b1 | 31.970664 | 0.3803881 | 26.769961 | 0.2610645 | 0.5094877 | |||
| b2 | 12.65184 | −0.0223003 | 14.24345 | −0.0863658 | ||||
| b3 | −1.0557861 | −0.0019663 | 0.2333121 | |||||
| b4 | 0.0625968 | 0.0007101 | ||||||
| b5 | −4.89 × 10−5 | |||||||
| e. m + k = 5. | ||||||||
| (m, k) | (1, 4) | (2, 3) | (3, 2) | (4, 1) | (5, 0) | |||
| Maximum Error | 7.04 × 10−5 | 1.93 × 10−4 | 1.98 × 10−4 | 2.34 × 10−4 | 2.81 × 10−4 | |||
| a0 | −0.0021528 | −0.0068195 | −0.0069689 | −0.0079574 | 0.0087059 | |||
| a1 | 4.1778096 | 4.2048559 | 4.2056463 | 4.2118095 | 4.1014476 | |||
| a2 | 0.3389855 | 0.4231483 | 0.4123636 | −1.3881892 | ||||
| a3 | 0.0040529 | −0.0093844 | 0.4305764 | |||||
| a4 | 0.0008702 | −0.0815235 | ||||||
| a5 | 0.0067557 | |||||||
| b1 | 0.3812861 | 0.4760923 | 0.496602 | 0.4973312 | ||||
| b2 | −0.0240701 | −0.0019042 | 0.0066181 | |||||
| b3 | −0.0006661 | 0.0001546 | ||||||
| b4 | 0.0002934 | |||||||
| f. m + k = 4. | ||||||||
| (m, k) | (1, 3) | (2, 2) | (3, 1) | (4, 0) | ||||
| Maximum Error | 1.61 × 10−4 | 2.53 × 10−4 | 5.07 × 10−4 | 2.61 × 10−3 | ||||
| a0 | −0.0058338 | −0.0046286 | −0.0123297 | 0.0390467 | ||||
| a1 | 4.1990741 | 4.1962377 | 4.2343323 | 3.9449081 | ||||
| a2 | 0.3417114 | 0.4466871 | −1.1263646 | |||||
| a3 | −0.0082 | 0.2442248 | ||||||
| a4 | −0.0227788 | |||||||
| b1 | 0.3919365 | 0.4738405 | 0.5145802 | |||||
| b2 | −0.0310113 | −0.0006776 | ||||||
| b3 | 0.0016884 | |||||||
| g. m + k = 3 and m + k = 2. | ||||||||
| m + k | 3 | 3 | 3 | 2 | 2 | |||
| (m, k) | (2, 1) | (1, 2) | (3, 0) | (1, 1) | (2, 0) | |||
| Maximum Error | 5.61 × 10−4 | 2.35 × 10−3 | 1.09 × 10−2 | 2.19 × 10−2 | 5.80 × 10−2 | |||
| a0 | −0.0023213 | 0.0172151 | 0.1008629 | 0.1295186 | 0.2907984 | |||
| a1 | 4.1871142 | 4.0937756 | 3.6822471 | 3.6681333 | 3.0786409 | |||
| a2 | 0.3449063 | −0.8018105 | −0.3446446 | |||||
| a3 | 0.0949052 | |||||||
| b1 | 0.4720972 | 0.3546785 | 0.2410842 | |||||
| b2 | −0.0184772 | |||||||
Table A2.
Rational function values for approximating the control chart constant d3 (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator. (See expression (4) in Section 2).
Table A2.
Rational function values for approximating the control chart constant d3 (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator. (See expression (4) in Section 2).
| a. m + k = 10. | ||||||||
| (m, k) | (9, 1) | |||||||
| Maximum Error | 1.04 × 10−6 | |||||||
| a0 | 0.315104 | |||||||
| a1 | 5.547612 | |||||||
| a2 | −2.84442 | |||||||
| a3 | −0.78335 | |||||||
| a4 | 2.852878 | |||||||
| a5 | −2.51086 | |||||||
| a6 | 1.234777 | |||||||
| a7 | −0.36344 | |||||||
| a8 | 0.059829 | |||||||
| a9 | −0.00424 | |||||||
| b1 | 3.396075 | |||||||
| b. m + k = 9. | ||||||||
| (m, k) | (2, 7) | (7, 2) | (8, 1) | (6, 3) | (9, 0) | (4, 5) | (3, 6) | (5, 4) |
| Maximum Error | 1.49 × 10−5 | 1.67 × 10−5 | 1.83 × 10−5 | 2.13 × 10−5 | 2.85 × 10−5 | 3.40 × 10−5 | 3.46 × 10−5 | 3.48 × 10−5 |
| a0 | 0.222874 | 0.296684 | 0.288269 | 0.282365 | 0.425844 | −0.84597 | 0.415148 | 0.416896 |
| a1 | 7.435075 | 5.72538 | 5.824849 | 5.81292 | 2.891117 | 156.1599 | 4.140362 | 3.804773 |
| a2 | −1.74755 | −2.7717 | −3.84618 | −5.82438 | −7.15265 | 1102.012 | 2.558952 | −0.02747 |
| a3 | 0.802064 | 1.369529 | 3.523224 | 9.656155 | −308.826 | −1.32839 | −0.29831 | |
| a4 | 0.200033 | 0.068175 | −1.19533 | −8.5306 | −24.3344 | −0.03858 | ||
| a5 | −0.2218 | −0.26892 | 0.230026 | 5.088273 | −0.00149 | |||
| a6 | 0.063196 | 0.104807 | −0.01879 | −2.024 | ||||
| a7 | −0.00636 | −0.0174 | 0.512864 | |||||
| a8 | 0.00106 | −0.07467 | ||||||
| a9 | 0.004743 | |||||||
| b1 | 5.437382 | 3.460118 | 3.421473 | 3.145292 | 330.4889 | 2.79654 | 2.043444 | |
| b2 | 0.138415 | 0.668103 | −0.99902 | 514.4913 | 3.534685 | 2.385689 | ||
| b3 | 2.584673 | 0.379228 | 605.6943 | 0.912601 | −0.35895 | |||
| b4 | −2.56493 | −303.519 | −1.08252 | −0.23241 | ||||
| b5 | 1.00801 | 11.36619 | 0.102501 | |||||
| b6 | −0.20567 | −0.0042 | ||||||
| b7 | 0.017412 | |||||||
| c. m + k = 8. | ||||||||
| (m, k) | (1, 7) | (6, 2) | (4, 4) | (3, 5) | (5, 3) | (7, 1) | (2, 6) | (8, 0) |
| Maximum Error | 2.63 × 10−5 | 3.29 × 10−5 | 3.49 × 10−5 | 3.65 × 10−5 | 4.85 × 10−5 | 5.25 × 10−5 | 6.56 × 10−5 | 1.06 × 10−4 |
| a0 | −0.22202 | 0.361297 | 0.417448 | 0.419877 | 0.437844 | 0.19618 | 0.464882 | 0.457687 |
| a1 | 16.809 | 4.604874 | 3.713019 | 3.598293 | 3.391173 | 7.40356 | 2.936455 | 2.580339 |
| a2 | −3.34365 | −0.73609 | −1.34281 | −0.28428 | −3.57152 | 0.112444 | −5.92177 | |
| a3 | 2.01313 | −0.12318 | 0.036971 | −0.2868 | 0.096022 | 7.037428 | ||
| a4 | −0.72414 | −0.0197 | 0.043483 | 1.070521 | −5.20663 | |||
| a5 | 0.145333 | −0.00508 | −0.62695 | 2.458343 | ||||
| a6 | −0.01238 | 0.153482 | −0.71801 | |||||
| a7 | −0.01431 | 0.117911 | ||||||
| a8 | −0.00831 | |||||||
| b1 | 15.37598 | 2.36091 | 1.839365 | 1.629358 | 1.60254 | 4.905237 | 1.221915 | |
| b2 | −2.99882 | 0.458887 | 2.04778 | 1.768778 | 2.332491 | 2.699692 | ||
| b3 | 14.3693 | −0.74976 | −1.10774 | −0.79915 | −0.76911 | |||
| b4 | −10.1252 | −0.05782 | 0.117442 | 0.336715 | ||||
| b5 | 3.946838 | −0.00475 | −0.09027 | |||||
| b6 | −0.83038 | 0.009819 | ||||||
| b7 | 0.072844 | |||||||
| d. m + k = 7. | ||||||||
| (m, k) | (4, 3) | (1, 6) | (5, 2) | (6, 1) | (2, 5) | (3, 4) | (7, 0) | |
| Maximum Error | 3.90 × 10−5 | 6.68 × 10−5 | 7.97 × 10−5 | 8.37 × 10−5 | 8.73 × 10−5 | 9.96 × 10−5 | 4.00 × 10−4 | |
| a0 | 0.414579 | 0.448364 | 0.514699 | 0.267489 | 0.354628 | 0.347222 | 0.506673 | |
| a1 | 3.718122 | 3.171776 | 2.357509 | 5.998534 | 4.667852 | 4.701032 | 2.154437 | |
| a2 | −0.92875 | 3.387868 | −4.38089 | −0.57936 | −1.3774 | −4.46778 | ||
| a3 | −0.07681 | −1.25034 | 2.220661 | 0.284086 | 4.464941 | |||
| a4 | −0.00913 | 0.32038 | −0.68974 | −2.59734 | ||||
| a5 | −0.033 | 0.120869 | 0.887721 | |||||
| a6 | −0.00918 | −0.1648 | ||||||
| a7 | 0.012798 | |||||||
| b1 | 1.802498 | 1.407458 | 1.17619 | 3.425601 | 2.675402 | 2.570444 | ||
| b2 | 1.976001 | 2.652334 | 4.470245 | 2.285831 | 1.913318 | |||
| b3 | −0.86642 | −0.74001 | −0.33457 | −0.66506 | ||||
| b4 | 0.292182 | −0.06511 | 0.143048 | |||||
| b5 | −0.07846 | 0.012773 | ||||||
| b6 | 0.008683 | |||||||
| e. m + k = 6. | ||||||||
| (m, k) | (4, 2) | (1, 5) | (2, 4) | (3, 3) | (5, 1) | (6, 0) | ||
| Maximum Error | 1.10 × 10−4 | 1.28 × 10−4 | 1.36 × 10−4 | 1.91 × 10−4 | 2.26 × 10−4 | 1.34 × 10−3 | ||
| a0 | 0.342748 | 0.427834 | 0.444321 | 0.475274 | 0.217224 | 0.569274 | ||
| a1 | 4.964747 | 3.719638 | 3.378007 | 2.995876 | 6.915992 | 1.677416 | ||
| a2 | −0.67509 | −0.51529 | 0.213881 | −4.33346 | −3.09547 | |||
| a3 | 0.305956 | −0.05469 | 1.749774 | 2.508293 | ||||
| a4 | −0.03384 | −0.37293 | −1.08081 | |||||
| a5 | 0.032428 | 0.239276 | ||||||
| a6 | −0.02135 | |||||||
| b1 | 2.973567 | 2.011761 | 1.613263 | 1.41501 | 4.279257 | |||
| b2 | 2.179108 | 2.353528 | 2.087442 | 2.394381 | ||||
| b3 | −0.12915 | −0.59247 | −0.25408 | |||||
| b4 | −0.04059 | 0.040986 | ||||||
| b5 | 0.007675 | |||||||
| f. m + k = 5. | ||||||||
| (m, k) | (2, 3) | (1, 4) | (3, 2) | (4, 1) | (5, 0) | |||
| Maximum Error | 2.03 × 10−4 | 5.27 × 10−4 | 6.82 × 10−4 | 7.5 × 10−4 | 3.88 × 10−4 | |||
| a0 | 0.457381 | 0.523099 | 0.576235 | −0.05848 | 0.650202 | |||
| a1 | 3.232791 | 2.395738 | 1.709512 | 11.17565 | 1.145273 | |||
| a2 | 0.559042 | 0.826375 | −5.5501 | −1.84151 | ||||
| a3 | −0.05561 | 1.594977 | 1.130005 | |||||
| a4 | −0.17954 | −0.31863 | ||||||
| a5 | 0.034104 | |||||||
| b1 | 1.648605 | 0.940191 | 0.51129 | 7.753653 | ||||
| b2 | 2.65881 | 2.042514 | 2.324483 | |||||
| b3 | 0.024817 | −0.35739 | ||||||
| b4 | 0.036746 | |||||||
| g. m + k = 4. | ||||||||
| (m, k) | (2, 2) | (1, 3) | (3, 1) | (4, 0) | ||||
| Maximum Error | 2.13 × 10−4 | 8.98 × 10−4 | 5.25 × 10−3 | 1.08 × 10−2 | ||||
| a0 | 0.452862 | 0.476886 | −1.42047 | 0.762477 | ||||
| a1 | 3.338977 | 3.230196 | 36.39687 | 0.533722 | ||||
| a2 | 0.491674 | −12.4903 | −0.74534 | |||||
| a3 | 1.832343 | 0.295284 | ||||||
| a4 | −0.03852 | |||||||
| b1 | 1.737353 | 1.694324 | 29.41969 | |||||
| b2 | 2.63829 | 2.130666 | ||||||
| b3 | −0.17919 | |||||||
References
- Snee, R.D. Statistical Thinking and Its Contribution to Total Quality. Am. Stat. 1990, 44, 116–121. [Google Scholar] [CrossRef]
- Hoerl, R.W.; Snee, R.D.; De Veaux, R.D. Applying Statistical Thinking to ‘Big Data’ Problems. WIREs Comput. Stat. 2014, 6, 222–232. [Google Scholar] [CrossRef]
- Hoerl, R.W.; Snee, R.D. Statistical Thinking: Improving Business Performance; Wiley: Hoboken, NJ, USA, 2020. [Google Scholar]
- Khoo, M.B.; Lim, E.G. An Improved R (Range) Control Chart for Monitoring the Process Variance. Qual. Reliab. Eng. Int. 2005, 21, 43–50. [Google Scholar] [CrossRef]
- Chen, W.H.; Tirupati, D. On-line Quality Management: Integration of Product Inspection and Process Control. Prod. Oper. Manag. 1995, 4, 242–262. [Google Scholar] [CrossRef]
- Wardell, D.G. Algebraic Expressions for Range Control Chart Constants. In Proceedings of the Fiftieth Annual Meeting of the Western Decision Sciences Institute, Waikoloa, HI, USA, 5–8 April 2022. [Google Scholar]
- Burr, I.W. The Effect of Non-normality on Constants for and R charts. Ind. Qual. Control 1967, 23, 563–569. [Google Scholar]
- Qiu, P.; Zhang, J. On Phase II SPC in Cases When Normality is Invalid. Qual. Reliab. Eng. Int. 2015, 31, 27–35. [Google Scholar] [CrossRef]
- Khakifirooz, M.; Tercero-Gómeza, V.G.; Woodall, W.H. The Role of the Normal Distribution in Statistical Process Monitoring. Qual. Eng. 2021, 3, 497–510. [Google Scholar] [CrossRef]
- Mood, A.M.; Graybill, F.A.; Boes, D.C. Introduction to the Theory of Statistics; McGraw Hill: New York, NY, USA, 1974. [Google Scholar]
- David, H.A. Order Statistics; John Wiley & Sons, Inc.: New York, NY, USA, 1970. [Google Scholar]
- Arnold, B.C.; Balakrishnan, N. Relations, Bounds and Approximations for Order Statistic; Lecture Notes in Statistics No. 53; Springer: New York, NY, USA, 1989. [Google Scholar]
- Royston, J.P. Algorithm AS 177: Expected Normal Order Statistics (Exact and Approximate). Appl. Stat. 1982, 31, 161–165. [Google Scholar] [CrossRef]
- Pearson, E.S.; Hartley, H.O. (Eds.) Biometrika Tables for Statisticians; Biometrika Trust: London, UK, 1976; Volume 1. [Google Scholar]
- Harter, H.L.; Balakrishnan, N. Tables for the Use of Range and Studentized Range in Tests of Hypotheses; CRC Press: Boca Raton, FL, USA, 1998. [Google Scholar]
- Petrushev, P.P.; Popov, V.A. Rational Approximation of Real Functions; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
- Ralston, A.; Rabinowitz, P. A First Course in Numerical Analysis; Dover Publications, Inc.: New York, NY, USA, 2001. [Google Scholar]
- McKay, A.T.; Pearson, E.S. A Note on the Distribution of Range in Samples of n. Biometrika 1933, 25, 415–420. [Google Scholar] [CrossRef]
- Bose, R.C.; Gupta, S.S. Moments of Order Statistics from a Normal Population. Biometrika 1959, 46, 433–440. [Google Scholar] [CrossRef]
- Balakrishnan, N.; Cohen, A.C. Order Statistics and Inference: Estimation Methods; Academic Press, Inc.: London, UK, 2014. [Google Scholar]
- Ryan, T.P. Statistical Methods for Quality Improvement; John Wiley and Sons: Weinheim, Germany, 2011. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).