Expressions for the First Two Moments of the Range of Normal Random Variables with Applications to the Range Control Chart

Don G. Wardell

doi:10.3390/math13091537

David Eccles School of Business, University of Utah, Salt Lake City, UT 84112, USA

^†

This article is a revised and expanded version of a paper entitled, “Algebraic Expressions for Range Control Chart Constants.” which was presented at the Western Decision Sciences Institute Annual Meeting in Big Island, HI, USA, 5–8 April 2022.

Mathematics2025, 13(9), 1537;https://doi.org/10.3390/math13091537

This article belongs to the Section D: Statistics and Operational Research

Version Notes

Order Reprints

Abstract

A common and simple estimate of variability is the sample range, which is the difference between the maximum and minimum values in the sample. While other measures of variability are preferred in most instances, process owners and operators regularly use range (R) control charts to monitor process variability. The center line and limits of the R charts use constants that are based on the first two moments (mean and variance) of the distribution of the range of normal random variables. Historically, the computation of moments requires the use of tabulated constants approximated using numerical integration. We provide exact results for the moments for sample sizes 2 through 5. For sample sizes from 6 to 1000, we used the differential correction method to find Chebyshev minimax rational-function approximations of the moments. The rational function we recommend for the mean (R-chart constant d₂) has a polynomial of order two in the numerator and six in the denominator and achieves a maximum error of 4.4 × 10⁻⁶. The function for the standard deviation (R-chart constant d₃) has a polynomial of order two in the numerator and seven in the denominator and achieves a maximum error of 1.5 × 10⁻⁵. The exact and approximate expressions eliminate the need for table lookup in the control chart design phase.

Keywords:

control charts; Six Sigma; process variation; order statistics; rational functions; minimax error approximation

MSC:

62P30; 41A20; 62G30

1. Introduction

Variation is central to both statistical methods and statistical thinking [1,2,3]. There are multiple measures of variation that differ in their ease of computation and usefulness. Perhaps the simplest to compute, but inferior in some of its statistical properties (e.g., consistency), is the sample range. In most applications, statistical performance outweighs computational ease; however, there are important exceptions. One of these applications is the use of control charts to monitor (and ultimately improve) processes.

Control charts are one of the central tools used in Six Sigma and quality management. When the variable of interest is measured (as opposed to counted), the most common control charts are

\bar{X}

and R charts. The former is used to monitor the central tendency of the process, while the latter monitors process variation. To implement the charts, “rational” samples (often referred to as subgroups) are collected at regular intervals. The sample mean of each subgroup is plotted on the

\bar{X}

chart, while the range of each is plotted on the R chart. The charts are time-series plots with superimposed limits. Plotted points falling within the limits indicate a stable (“in-control”) process, whereas points falling outside the limits point to a process change. The limits are typically set at three standard deviations from the mean of the statistic being plotted to reduce the chance of a false out-of-control signal while still providing sufficient power to identify out-of-control conditions when they occur.

While substitutes for the R chart have been proposed (see [4], for example), the original R chart is widely used in practice because of its computational simplicity and ease of interpretation. Process operators can easily compute the range and plot it on a control chart. By doing so, they can quickly identify potential changes and increase their involvement in process improvement initiatives.

While the use of R charts is simple, the mathematics behind the design of the chart is not as straightforward. The computation of R chart limits requires the use of tabulated constants. These constants were derived from the moments of the distribution of the range and are denoted d₂ (the expected value of the range of standard normal random variables) and d₃ (the standard deviation of the range of standard normal random variables). In almost all cases, the tabulated values of d₂ and d₃ have been approximated by numerical integration because analytical solutions have not been found. While the design procedure is not difficult, it is somewhat of a black box, and typically requires table lookup procedures for implementation. Moreover, the tabulated values are usually of limited precision (three decimal places) and scope, typically providing values only to a subgroup size of 20 or 25, which may be insufficient for some applications (We also note that having a computationally efficient way to estimate the moments could be of use in other statistical methods such as estimation or hypothesis testing. As explained at the beginning of the introduction, however, the range is rarely chosen as a measure of variability in such applications due to its poor statistical properties and hence we limit our attention to the application to control charts where the range is used regularly).

A preferred approach might be one in which data collection and processing are integrated into an online process control system. Ref. [5] described how an online system can be used to improve both control and inspection to reduce quality costs. While a lookup procedure could be used in such online systems, the preferred approach would be to automate the control chart computations and data collection.

In this paper (which extends our own work in [6]), we provide simple expressions that eliminate the need for table lookups. For sample sizes of 2 through 5, the provided expressions are exact and are obtained from a review of the previous literature. For larger sample sizes (up to 1000), the expressions are rational function (of the sample size) approximations. These expressions are simple to implement on a computer and are highly accurate.

In Section 2, we review the general expressions for the distribution, mean, and standard deviation of the range. We also discuss the approximation procedure for larger sample sizes. In Section 3, we review previously reported results for sample sizes of 2 through 5 and then present several possible rational functions that can be used to approximate d₂ and d₃ for sample sizes larger than 5. In Section 4, we compare the errors of various rational functions and provide recommended functions for general sample sizes. In Section 5, we provide concluding remarks.

2. The Distribution and First Two Moments of the Range

The constants used to design the range control chart are all derived from the distribution of the range of standard normal random variables. The usual assumption in control charting is that the process generates a random output that follows a normal distribution. It is not our purpose to explore violations of this assumption (see [7,8,9] for example discussions of potential problems when normality is not observed). All control chart constants associated with the range chart can be found if the mean and variance of the range are known. The mean of the range of normal random variables is denoted by d₂σ_x, where σ_x is the standard deviation of the process, and the standard deviation of the range is represented by d₃σ_x. In the remainder of the paper, we assume, without loss of generality, that the normal random output variable has a mean of 0 and a standard deviation of 1. Hence the design of the range control chart (and associated

\bar{X}

chart) is equivalent to finding the constants d₂ and d₃.

To find the first two moments of the range, we begin with an expression for the probability density function (pdf) of R, f_R(r). The pdf of the range can be obtained by integrating the midrange T from the joint distribution of R and T [10] as follows:

f_{R} (r) = n (n - 1) \int_{- \infty}^{\infty} {[Φ (t + \frac{r}{2}) - Φ (t - \frac{r}{2})]}^{n - 2} ϕ (t + \frac{r}{2}) ϕ (t - \frac{r}{2}) d t

(1)

where n is the sample size, and

Φ

and

ϕ

are the distribution function and pdf of a standard normal random variable. From (1) we can find the control chart constants d₂ and d₃ as

d_{2} = \int_{0}^{\infty} r f_{R} (r) d r

(2)

and

d_{3} = \sqrt{\int_{0}^{\infty} r^{2} f_{R} (r) d r - d_{2}^{2}} .

(3)

For general n, expressions (1)–(3) are difficult to solve analytically. Hence, the control chart constants d₂ and d₃, which are tabulated in many quality control handbooks and textbooks, were determined using numerical integration. However, it is possible to obtain analytical solutions for sample sizes 2–5. The simplified expressions and results are presented in the Results section.

2.1. Approximations to d₂ and d₃ for Samples Sizes of at Least 6

As mentioned above, we are not familiar with analytical solutions to expressions (1)–(3) when the sample size exceeds 5. However, numerical values of the control chart constants d₂ and d₃ are widely available. In this section, we describe how the numerical values were used to develop rational function approximations. Other authors have developed approximations for the moments of order statistics of normal random variables (see [11,12] for summaries of some of these methods; see [13] as well). We believe that these earlier approximations have at least three limitations. First, some approximations are less accurate, especially for extreme order statistics (those used to compute the range). Second, the methods are more complicated than those presented in this study. Third, some methods require the use of tabulated moments of neighboring order statistics, which defeats the purpose of having approximations that do not require table lookup procedures.

Approximation Method

We focus on the method for approximating d₂. The method was essentially the same for finding d₃. We began by plotting the tabulated values of d₂ as a function of the sample size n for values ranging from 2 to 1000, as given in [14] (We also used the values from [15] for d₃. We note that the tabulated values of [14] were presented to 5 decimal places and those of [15] were to 15 places). Figure 1 shows a plot that suggests that d₂ is a function of the logarithm of n. Hence, we sought an approximation relating the control chart constant values to the logarithm of n.

Figure 1. Plot of the tabulated values of d₂ as a function of sample size n. The plot suggests that the relationship between d₂ and n is logarithmic.

While numerous methods exist for estimating discrete values, we determined that a rational-function approximation would work well for this application. First, rational functions include polynomials (when the order of the polynomial in the denominator is of order 0). Second, other options such as splines, can result in more complex functions, and we wanted relatively simple expressions. Third, the rational functions have desirable asymptotic characteristics. For these and other advantages, see [16].

Instead of using a global measure of approximation error, such as the mean square error (MSE), we used the minimization of the maximum error between the approximations and the tabulated values of the control chart constants as our objective (As a final reason for using rational-function approximations rather than splines, we did look briefly at spline estimates and found that our rational function approximations resulted in smaller maximum errors). While MSE or mean absolute error (MAE) may be more appropriate if the overall accuracy of the approximation is important, we desired to ensure that the estimate of the control chart constants at every value of n was as close as possible to the tabulated values. Using MSE or MAE may result in an aggregate lower error, but it may do so at the expense of a few estimates having larger errors than desired. In other words, minimizing the maximum error is useful for guaranteeing worst-case (lack of) accuracy.

Because d₂ is defined on a discrete point set rather than on an interval, we used the differential correction method to determine the Chebyshev minimax approximations [17]. The algorithm is an iterative one that seeks to minimize the maximum difference between the rational function, R_mk(n) and the tabulated values of d₂, where

R_{m k} (n) = \frac{\sum_{j = 0}^{m} a_{j} {[\log (n)]}^{j}}{\sum_{j = 0}^{k} b_{j} {[\log (n)]}^{j}}, with b_{0} = 1 .

(4)

In (4), m is the order of the polynomial in the numerator of the rational function, k is the order of the denominator, and a_j and b_j are the coefficients of the numerator and denominator polynomials.

An initial rational function estimate is required to begin the procedure. We used multiple (polynomial) regression to find an initial polynomial solution and then used the minimax algorithm to convert the polynomial expression into a rational expression. The regression solutions were quite accurate, but in all cases, the estimates were improved by using the differential correction method.

The two parameters, m and k, can be adjusted to obtain better approximations. Generally, increasing m + k improves the accuracy of the estimates. Hence, we tried different combinations of parameters to find accurate and parsimonious approximations. For d_2, we tried m + k values from 2 to 9, and for d_3, we used values from 4 to 10. Combinations with sums greater than 9 (or 10 for d₃) did not yield significant improvements in accuracy and were discarded in the interest of parsimony. Combinations with sums less than 2 (4 for d₃) yielded unacceptable maximum errors.

We note that not all tabulated values were used for the computations. For d₂ we used tabulated values for n = 2 (1) 25, 30 (10) 50, 100 (100) 1000. For d₃, we used tabulated values for n = 2 (1) 25, 30 (5) 60, 70 (10) 100, 200, 500, and 1000 (the last three tabulated values were taken from [14] and were presented to only three decimal places). Removing some of the intermediate values where the curve in Figure 1 is relatively flat results in a more manageable optimization problem. As a post-hoc check, we evaluated the preferred rational function expressions at sample sizes 2 (1) 500 and 510 (10) 1000 for d₂ and 2 (1) 100 for d₃ and found only a few cases where the error only slightly exceeded the maximum error calculated during the estimation procedure (See Table 4 in the Results section for the preferred functions and the associated errors).

3. Results

We begin by presenting the analytical results for the first two moments of the range distribution for sample sizes 2 through 5. We then provide the results for the rational function approximations for larger sample sizes.

3.1. Analytical Results for n = 2, 3, 4 and 5

When n = 2, (1) simplifies substantially, and the solutions to (1)–(3) are straightforward. When n = 3, 4, and 5, the analytical solutions are not as simple. Ref. [18] showed that the pdf for the range when the sample size is 3 can be expressed as

f_{R} (r) = \frac{6}{π \sqrt{2}} \exp (\frac{- r^{2}}{4}) \int_{0}^{r / \sqrt{6}} \exp (\frac{- u^{2}}{2}) d u = \frac{6}{\sqrt{π}} \exp (\frac{- r^{2}}{4}) [Φ (\frac{r}{\sqrt{6}}) - \frac{1}{2}] .

(5)

They also showed that the ith moment μ_i of the distribution about the origin is

μ_{i} = \frac{3}{π} 2^{i + 1} Γ (\frac{i}{2} + 1) \int_{0}^{π / 6} \cos^{i} θ d θ .

(6)

Other researchers have found moments of order statistics for samples of sizes 4 and 5. For a listing of these researchers and a description of the derivation by [19], see [20]. The author’s alternative derivations are provided in Appendix A. Table 1 summarizes the analytical results.

Table 1. Analytical results for d₂ and d₃ for samples of sizes 2 through 5.

The analytical results reveal some common elements. For example, the arc tangent function and the term π^3/2 appear in most of the d₂ results. Despite these commonalities, we were unable to induce expressions for general sample sizes. Instead, in the next sections, we present accurate rational function approximations.

3.2. Approximation Results

Table 2 summarizes the results for the d₂ approximations, and Table 3 summarizes the results for the d₃ approximations. The tables show the values of m and k, the maximum error for the given rational function, and the coefficient values defined in (4). The tables are sorted first by the number of parameters (i.e., by m + k) and then by the maximum error. For the sake of brevity, the tables only show the case for a given m + k for which the maximum error is the smallest (More complete tables are given in Appendix B). For example, the first entry in Table 2 is for a rational function with m = 7 and k = 2. The maximum error for such a case was 4.3685 × 10⁻⁶, and the rational function approximation was

R_{72} (x) = \frac{- 4.683 \times 10^{- 5} + 4.1602 x - 1.0722 x^{2} + 0.2863 x^{3} - 0.0791 x^{4} + 0.0129 x^{5} - 0.0012 x^{6} + 4.363 \times 10^{- 5} x^{7}}{1 + 0.1108 x - 0.0363 x^{2}},

where x = log(n).

Table 2. Rational function values for approximating the control chart constant d₂. These results are for the functions with the lowest errors for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.

Table 3. Rational function values for approximating the control chart constant d₃. These results are for the functions with the lowest error for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.

4. Discussion

Table 2 shows that the smallest maximum error was reported for the m + k = 9 case. We have only reported this case to show that a few of the m + k = 8 cases were very comparable in terms of their maximum error. The table also shows that there was no one systematically preferred form of the rational function approximations. For example, when m + k = 8, the best rational function approximation had more terms in the denominator (m = 2 and k = 6) than in the numerator; however, when m + k = 7, the lowest error was obtained from a rational function with more terms in the numerator (m = 5 and k = 2). Finally, the table shows that the d₂ terms can be accurately estimated to at least three decimal places (maximum errors on the order of 10⁻⁴) with rational functions having as few as five total terms (m + k = 4). The approximations are very good with only eight total terms, having maximum errors on the order of 10⁻⁶. Such accuracy is remarkable given that the tabulated values used for the estimation were only given to five decimal places. Hence, very accurate estimates of d₂ can be made for sample sizes from 2 to 1000 using simple algebraic expressions.

Similar results were obtained for d₃, although the approximations were not as good. To achieve maximum errors on the order of 10⁻⁶, m + k must be as high as 10. With a total of eight terms (m + k = 7), some maximum errors were just over 10⁻⁵, which is still very good. We believe that greater accuracy is worth sacrificing a small degree of parsimony in the expression. We suggest using the rational functions shown in the second rows of Table 2 and Table 3, as they provide excellent accuracy with reasonably sized rational functions. Table 4 summarizes our suggestions for determining when finding the values of d₂ and d₃ for sample sizes of 2 to 1000. Moreover, Figure 2 shows plots of the difference between the actual and estimated values of the control chart constants for the suggested expressions for sample sizes greater than 5. As can be seen in the figure, the error oscillates, and in some cases, it is much smaller than the maximum error.

Table 4. Suggested algebraic expressions for control chart constants d₂ and d₃ for sample sizes 2 to 1000. In the table, x = log(n).

Figure 2. Plots of the approximation errors for (a) d₂ and (b) d₃ for the rational functions listed in Table 4. The plots show the sample sizes for which the estimation error is the largest.

While we recommend the expressions in Table 4, others may prefer a different combination of parsimony and accuracy. Appendix B shows more complete tables of the estimated expressions and their maximum errors, which can be used to determine the preferred combination. Similarly, Figure 3 shows the contour plots of the negative logarithm of the maximum error for different combinations of the orders of the polynomials in the rational functions. Higher values in the plot indicate lower errors. For example, a value of 5 in the plot is associated with a maximum error of 1 × 10⁻⁵.

Figure 3. Contour plots of the negative logarithm of the maximum error for (a) d₂ and (b) d₃ for different combinations of the orders of the polynomials in (4). Larger values are associated with smaller errors.

5. Conclusions

We have provided algebraic expressions that can be used to easily compute the control chart constants d₂ and d₃ for sample sizes from 2 to 1000. For the cases of n = 2 through 5 we have reviewed the exact solutions, whereas for cases of n > 5, we have provided rational function approximations that are very accurate. Other constants that are based on range control charts (e.g., A₂, D₃, and D₄) can easily be computed from the two constants d₂ and d₃ (see [21] for example). Such expressions allow control chart designers and users, including control chart software producers, to compute the control chart constants without using lookup routines, which generally find control chart constants for a limited number of possible sample sizes with limited precision. Hence, the expressions simplify the process of automating the range control chart construction.

Funding

This research received no external funding.

Data Availability Statement

The data used for this project consisted only of the tabulated control chart constants given in [14,15] as described in Section 2.1.

Conflicts of Interest

The author declare no conflict of interest.

Appendix A

Author’s derivation of the analytical solutions for the first two moments of the range for sample sizes of four and five.

Appendix A.1. Derivation of the Expected Value of the Range for n = 4 and 5

We first note that from the definition of the standard normal pdf, we can rewrite the latter part of the integrand of (1) as

ϕ (t + \frac{r}{2}) ϕ (t - \frac{r}{2}) = \frac{1}{2 π} \exp (- t^{2}) \exp (- \frac{r^{2}}{4}) .

(A1)

For ease of exposition, we can use (A1) we can rewrite (1) as

f_{R} (r) = \frac{n (n - 1) \exp (- \frac{r^{2}}{4})}{2 π} I_{n} (r),

(A2)

where

I_{n} (r) = \int_{- \infty}^{\infty} [{[Φ (t + \frac{r}{2}) - Φ (t - \frac{r}{2})]}^{n - 2}] \exp (- t^{2}) d t .

(A3)

From the definition of expected value we have

E (R) = \frac{n (n - 1)}{2 π} \int_{0}^{\infty} r \exp (\frac{- r^{2}}{4}) I_{n} (r) d r .

(A4)

We can integrate (A4) by parts to obtain

E (R) = \frac{n (n - 1)}{π} \int_{0}^{\infty} \exp (\frac{- r^{2}}{4}) \frac{d I_{n} (r)}{d r} d r

(A5)

where

\frac{d I_{n} (r)}{d r} = \frac{n - 2}{2 \sqrt{2 π}} \exp (\frac{- r^{2}}{12}) J_{n} (r)

(A6)

and

J_{n} (r) = \int_{- \infty}^{\infty} {[Φ (t + \frac{r}{2}) - Φ (t - \frac{r}{2})]}^{n - 3} \{\exp [- \frac{1}{2} {(\sqrt{3} t + \frac{r}{\sqrt{12}})}^{2}] + \exp [- \frac{1}{2} {(\sqrt{3} t - \frac{r}{\sqrt{12}})}^{2}]\} d t

(A7)

Substituting z =

\sqrt{3} t + \frac{r}{\sqrt{12}}

in the first exponent and z =

\sqrt{3} t - \frac{r}{\sqrt{12}}

in the second, (A7) becomes

J_{n} (r) = \frac{1}{\sqrt{3}} \int_{- \infty}^{\infty} e^{\frac{- z^{2}}{2}} \{{[Φ (\frac{z}{\sqrt{3}} + \frac{r}{3}) - Φ (\frac{z}{\sqrt{3}} - \frac{2 r}{3})]}^{n - 3} + {[Φ (\frac{z}{\sqrt{3}} + \frac{2 r}{3}) - Φ (\frac{z}{\sqrt{3}} - \frac{r}{3})]}^{n - 3}\} d z

(A8)

We can evaluate (A8) by taking the derivative of J_n with respect to r, evaluating the resulting integral with respect to z, and then integrating again with respect to r. The result is that

\frac{d J_{n} (r)}{d r} = \frac{n - 3}{3 \sqrt{3}} \{\int_{- \infty}^{\infty} e^{- \frac{z^{2}}{2}} {[Φ (\frac{z}{\sqrt{3}} + \frac{r}{3}) - Φ (\frac{z}{\sqrt{3}} - \frac{2 r}{3})]}^{n - 4} [ϕ (\frac{z}{\sqrt{3}} + \frac{r}{3}) + 2 ϕ (\frac{z}{\sqrt{3}} - \frac{2 r}{3})] d z + \int_{- \infty}^{\infty} e^{- \frac{z^{2}}{2}} {[Φ (\frac{z}{\sqrt{3}} + \frac{2 r}{3}) - Φ (\frac{z}{\sqrt{3}} - \frac{r}{3})]}^{n - 4} [2 ϕ (\frac{z}{\sqrt{3}} + \frac{2 r}{3}) + ϕ (\frac{z}{\sqrt{3}} - \frac{r}{3})] d z\} .

(A9)

Expression (A9) has several products of the form

e^{\frac{- z^{2}}{2}} ϕ (\frac{z}{\sqrt{3}} + \frac{k r}{3})

, which can be recombined to obtain

\frac{1}{\sqrt{2 π}} \exp (\frac{- k^{2} r^{2}}{24}) \exp [- \frac{1}{2} {(\frac{2 z}{\sqrt{3}} + \frac{k r}{6})}^{2}]

. We can further substitute for z using

u = \frac{2 z}{\sqrt{3}} + \frac{k r}{6}

, which gives

\frac{d J n (r)}{d r} = \frac{n - 3}{6 \sqrt{2 π}} (e^{- \frac{r^{2}}{24}} \int_{- \infty}^{\infty} e^{- \frac{u^{2}}{2}} \{{[Φ (\frac{u}{2} + \frac{r}{4}) - Φ (\frac{u}{2} - \frac{3 r}{4})]}^{n - 4} + {[Φ (\frac{u}{2} + \frac{3 r}{4}) - Φ (\frac{u}{2} - \frac{r}{4})]}^{n - 4}\} d u + 4 e^{- \frac{r^{2}}{6}} \int_{- \infty}^{\infty} e^{- \frac{u^{2}}{2}} {[Φ (\frac{u}{2} + \frac{r}{2}) - Φ (\frac{u}{2} - \frac{r}{2})]}^{n - 4} d u) .

(A10)

We now consider the special cases of n = 4 and n = 5.

Appendix A.1.1. n = 4 Case

In this case (A10) simplifies to

\frac{d J_{4} (r)}{d r} = \frac{1}{3} (e^{- \frac{r^{2}}{24}} + 2 e^{- \frac{r^{2}}{6}})

(A11)

and so

J_{4} (r) = \frac{2 \sqrt{6 π}}{3} [Φ (\frac{r}{\sqrt{12}}) + Φ (\frac{r}{\sqrt{3}}) - 1] .

(A12)

Using (A12) in (A6) and then (A5) for n = 4 gives

E (R) = \frac{8 \sqrt{3}}{π} [\int_{0}^{\infty} \exp (\frac{- r^{2}}{3}) [Φ (\frac{r}{\sqrt{12}}) + Φ (\frac{r}{\sqrt{3}}) - 1] d r] .

(A13)

To find (A13) we evaluate the three terms separately, which we will call K₁, K₂ and K₃ (so that

E (R) = \frac{8 \sqrt{3}}{π} [K_{1} + K_{2} - K_{3}]

). The integral K₃ is straightforward and is

\sqrt{3 π} / 2

. The other two are of the same form K,

K = \int_{0}^{\infty} \exp (\frac{- r^{2}}{3}) Φ (\frac{r}{\sqrt{b}}) d r = \frac{1}{\sqrt{2 π}} \int_{0}^{\infty} \exp (\frac{- r^{2}}{3}) d r \int_{- \infty}^{r / \sqrt{b}} \exp (\frac{- u^{2}}{2}) d u

(A14)

We will use the same technique as [18], which is to substitute v =

\frac{u}{r}

into (A14) and change the order of integration. The result is

K = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{1 / \sqrt{b}} \int_{0}^{\infty} r \exp (- \frac{{(v r)}^{2}}{2}) \exp (\frac{- r^{2}}{3}) d r d v = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{1 / \sqrt{b}} \int_{0}^{\infty} r \exp [- \frac{r^{2}}{3} (1 + \frac{v^{2}}{2 / 3})] d r d v .

(A15)

We can now evaluate (A15) with respect to r, giving

K = \frac{3}{2 \sqrt{2 π}} \int_{- \infty}^{1 / \sqrt{b}} {(1 + \frac{3 v^{2}}{2})}^{- 1} d v

(A16)

We now let v =

\sqrt{\frac{2}{3}} \tan θ

and obtain

K = \frac{\sqrt{3}}{2 \sqrt{π}} \int_{- π / 2}^{\tan^{- 1} (\sqrt{\frac{3}{2 b}})} θ d θ = \frac{\sqrt{3}}{2 \sqrt{π}} [\tan^{- 1} (\sqrt{\frac{3}{2 b}}) + π / 2] .

(A17)

Comparing this result to the terms in (A13) shows that

K_{1} = \frac{\sqrt{3}}{2 \sqrt{π}} [\tan^{- 1} (\sqrt{\frac{1}{8}}) + π / 2]

(A18)

and

K_{2} = \frac{\sqrt{3}}{2 \sqrt{π}} [\tan^{- 1} (\sqrt{\frac{1}{2}}) + π / 2] .

(A19)

Hence we have that

E (R) = \frac{8 \sqrt{3}}{π} [\frac{\sqrt{3}}{2 \sqrt{π}} [\tan^{- 1} (\sqrt{\frac{1}{8}}) + \tan^{- 1} (\sqrt{\frac{1}{2}}) + π] - \frac{\sqrt{3 π}}{2}],

(A20)

which simplifies to

E (R) = d_{2} = \frac{12}{π \sqrt{π}} \tan^{- 1} (\sqrt{2}) .

(A21)

Appendix A.1.2. n = 5 Case

If we substitute n = 5 into (A10) we obtain

\frac{d J_{5} (r)}{d r} = \frac{1}{3 \sqrt{2 π}} \{e^{- \frac{r^{2}}{24}} \int_{- \infty}^{\infty} e^{- \frac{u^{2}}{2}} [Φ (\frac{u}{2} + \frac{r}{4}) - Φ (\frac{u}{2} - \frac{3 r}{4}) + Φ (\frac{u}{2} + \frac{3 r}{4}) - Φ (\frac{u}{2} - \frac{r}{4})] d u + 4 e^{- \frac{r^{2}}{6}} \int_{- \infty}^{\infty} e^{- \frac{u^{2}}{2}} [Φ (\frac{u}{2} + \frac{r}{2}) - Φ (\frac{u}{2} - \frac{r}{2})] d u\} .

(A22)

We can evaluate the integrals in (A22) by differentiating with respect to r first, then integrating with respect to u, and finally integrating again with respect to r. The first integral then is

2 \sqrt{2 π} [Φ (\frac{r}{\sqrt{20}}) + Φ (\frac{3 r}{\sqrt{20}}) - 1]

and the second integral is

2 \sqrt{2 π} [Φ (\frac{r}{\sqrt{5}}) - \frac{1}{2}]

. Hence we have

\frac{d J_{5} (r)}{d r} = \frac{2}{3} e^{- \frac{r^{2}}{24}} [Φ (\frac{r}{\sqrt{20}}) + Φ (\frac{3 r}{\sqrt{20}}) - 1] + \frac{8}{3} e^{- \frac{r^{2}}{6}} [Φ (\frac{r}{\sqrt{5}}) - \frac{1}{2}] .

(A23)

If we combine (A5) and (A6) for n = 5 we have

E (R) = \frac{30}{π \sqrt{2 π}} \int_{0}^{\infty} \exp (\frac{- r^{2}}{3}) J_{5} (r) d r .

(A24)

We can integrate (A24) by parts again to obtain

E (R) = \frac{60}{\sqrt{π}} - \frac{30 \sqrt{3}}{π \sqrt{2}} \int_{0}^{\infty} Φ (\sqrt{\frac{2}{3}} r) \frac{d J_{5} (r)}{d r} d r = \frac{60}{\sqrt{π}} - \frac{20 \sqrt{3}}{π \sqrt{2}} \int_{0}^{\infty} Φ (\sqrt{\frac{2}{3}} r) \{e^{- \frac{r^{2}}{24}} [Φ (\frac{r}{\sqrt{20}}) + Φ (\frac{3 r}{\sqrt{20}}) - 1] + 4 e^{- \frac{r^{2}}{6}} [Φ (\frac{r}{\sqrt{5}}) - \frac{1}{2}]\} d r .

(A25)

To evaluate E(R) now, we must be able to evaluate integrals of the form L =

\int_{0}^{\infty} e^{- \frac{r^{2}}{a}} Φ (\sqrt{\frac{2}{3}} r) Φ (\frac{r}{\sqrt{b}}) d r

. We will do so by integrating by parts, letting u =

Φ (\sqrt{\frac{2}{3}} r)

and dv =

e^{- \frac{r^{2}}{a}} Φ (\frac{r}{\sqrt{b}}) d r

. We can easily find du, but finding v is much more difficult. I will again use the method of [18]. First write v as

v = \frac{1}{\sqrt{2 π}} \int \exp (- \frac{x^{2}}{a}) \int_{- \infty}^{\frac{x}{\sqrt{b}}} \exp (- \frac{z^{2}}{2}) d z d x

. Now let y =

\frac{z}{x}

in the second of these integrals, and then reverse the order of integration. The result is

v = \frac{1}{\sqrt{2 π}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \int x \exp (- \frac{x^{2}}{a} - \frac{x^{2} y^{2}}{2}) d x d y

. We can now evaluate the inside integral to obtain

v = \frac{1}{2 \sqrt{2 π}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{- \exp (- \frac{r^{2}}{a} - \frac{r^{2} y^{2}}{2})}{\frac{1}{a} + \frac{y^{2}}{2}} d y = \frac{- 1}{2 \sqrt{2 π}} e^{- \frac{r^{2}}{a}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{e^{- \frac{r^{2} y^{2}}{2}}}{\frac{1}{a} + \frac{y^{2}}{2}} d y .

(A26)

Now we have that L =

\int_{0}^{\infty} u d v = {u v|}_{0}^{\infty} - \int_{0}^{\infty} v d u =

\frac{1}{4 \sqrt{2 π}} \{\sqrt{2 a} \tan^{- 1} (\sqrt{\frac{a}{2 b}}) + π \sqrt{\frac{a}{2}}\} + \frac{1}{2 π \sqrt{6}} \int_{0}^{\infty} e^{- \frac{r^{2}}{a}} e^{- \frac{r^{2}}{3}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{e^{- \frac{r^{2} y^{2}}{2}}}{\frac{1}{a} + \frac{y^{2}}{2}} d y d r

or

L = \frac{1}{4 \sqrt{2 π}} \{\sqrt{2 a} \tan^{- 1} (\sqrt{\frac{a}{2 b}}) + π \sqrt{\frac{a}{2}}\} + \frac{a}{π \sqrt{6}} \int_{0}^{\infty} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{\exp [- \frac{r^{2}}{2} (y^{2} + \frac{2}{a} + \frac{2}{3})]}{2 + a y^{2}} d y d r .

(A27)

For simplicity, let β =

y^{2} + \frac{2}{a} + \frac{2}{3}

and reverse the order of integration to get

L = \frac{1}{4 \sqrt{2 π}} \{\sqrt{2 a} \tan^{- 1} (\sqrt{\frac{a}{2 b}}) + π \sqrt{\frac{a}{2}}\} + \frac{a}{π \sqrt{6}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{1}{2 + a y^{2}} \int_{0}^{\infty} \exp [- \frac{β r^{2}}{2}] d r d y .

(A28)

The inner definite integral can now be evaluated, so that we have

L = \frac{1}{4 \sqrt{2 π}} \{\sqrt{2 a} \tan^{- 1} (\sqrt{\frac{a}{2 b}}) + π \sqrt{\frac{a}{2}}\} + \frac{a}{2} \sqrt{\frac{a}{π}} \int_{- \infty}^{\frac{1}{\sqrt{b}}} \frac{d y}{(2 + a y^{2}) \sqrt{3 a y^{2} + 2 a + 6}} .

(A29)

The last integral can now be evaluated and gives

L = \frac{1}{4 \sqrt{2 π}} \{\sqrt{2 a} \tan^{- 1} (\sqrt{\frac{a}{2 b}}) + π \sqrt{\frac{a}{2}}\} + \frac{1}{4} \sqrt{\frac{a}{π}} [\sin^{- 1} (\frac{a \sqrt{2}}{\sqrt{(a + 2 b) (6 + 2 a)}}) + \sin^{- 1} (\sqrt{\frac{2 a}{6 + 2 a}})]

(A30)

The other terms in (A25) have the form K =

\int_{0}^{\infty} e^{- \frac{r^{2}}{a}} Φ (\sqrt{\frac{2}{3}} r) d r

. From the n = 4 case we know that K =

\frac{\sqrt{a}}{2 \sqrt{π}} [\tan^{- 1} (\sqrt{\frac{a}{3}}) + \frac{π}{2}]

. Now we can combine this result with (A30) and substitute them for the appropriate terms in (A25). After (a large amount of) simplification, the result is that E(R) =

\frac{30}{{(π)}^{3 / 2}} \tan^{- 1} (\sqrt{2}) - \frac{5}{\sqrt{π}}

.

Appendix A.2. Derivation of the Variance of the Range for n = 4 and 5

We again begin with the case for general n and use (A2) and (A3) as a base. To find the variance of the range, we first find E(R²),

E (R^{2}) = \frac{n (n - 1)}{2 π} \int_{0}^{\infty} r^{2} \exp (\frac{- r^{2}}{4}) I_{n} (r) d r .

(A31)

First, we can integrate this by parts to obtain

E (R^{2}) = \frac{n (n - 1)}{π} [\int_{0}^{\infty} r \exp (\frac{- r^{2}}{4}) \frac{d I_{n} (r)}{d r} d r + \int_{0}^{\infty} \exp (\frac{- r^{2}}{4}) I_{n} (r) d r]

(A32)

where

\frac{d I_{n} (r)}{d r}

is defined in (A6). By noting that the integrand of the second term in (A32) is essentially f_R(r), and by using (A6) and (A7) we can simplify (A32) to

E (R^{2}) = 2 + \frac{n (n - 1) (n - 2)}{{(2 π)}^{3 / 2}} \int_{0}^{\infty} r \exp (\frac{- r^{2}}{3}) J_{n} (r) d r .

(A33)

where J_n(r) is defined in (A8). Integrating by parts once again gives

E (R^{2}) = 2 + \frac{3 n (n - 1) (n - 2)}{2 {(2 π)}^{3 / 2}} \int_{0}^{\infty} \exp (\frac{- r^{2}}{3}) \frac{d J_{n} (r)}{d r} d r

(A34)

where

\frac{d J_{n} (r)}{d r}

is defined in (A12).

For the n = 4 case we can substitute (A11) into (A34) and integrate to find that E(R²) is

\frac{2 π + 2 \sqrt{3} + 6}{π}

.

For the case of n = 5, we can use (A23) in (A34) to obtain

E (R^{2}) = 2 + \frac{60}{{(2 π)}^{3 / 2}} \int_{0}^{\infty} \{e^{- \frac{3 r^{2}}{8}} [Φ (\frac{r}{\sqrt{20}}) + Φ (\frac{3 r}{\sqrt{20}}) - 1] + 4 e^{- \frac{r^{2}}{2}} [Φ (\frac{r}{\sqrt{5}}) - \frac{1}{2}]\} d r

(A35)

We have already seen in Appendix A.1 that integrals like most of those found in (A35) can be found using (A14)–(A17). Hence (A35) simplifies to E(R²) =

\frac{2 π^{2} + 10 \sqrt{3} [\tan^{- 1} (\sqrt{\frac{5}{3}}) + 2 \sqrt{3} \tan^{- 1} (\sqrt{\frac{1}{5}})]}{π^{2}}

.

Appendix B. Complete Tables for the Rational Function Approximations

Table A1. Rational function values for approximating the control chart constant d₂ (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator (See expression (4) in Section 2).

a. m + k = 9.
(m, k)					(7, 2)
Maximum Error					4.37 × 10⁻⁶
a₀					−4.68 × 10⁻⁵
a₁					4.1601706
a₂					−1.072159
a₃					0.2862847
a₄					−0.0791025
a₅					0.0128782
a₆					−0.0011768
a₇					4.36 × 10⁻⁵
b₁					0.1108409
b₂					−0.036305
b. m + k = 8.
(m, k)	(2, 6)	(8, 0)	(6, 2)	(5, 3)	(4, 4)	(7, 1)	(3, 5)	(1, 7)
Maximum Error	4.39 × 10⁻⁶	4.40 × 10⁻⁶	4.45 × 10⁻⁶	4.50 × 10⁻⁶	4.82 × 10⁻⁶	5.69 × 10⁻⁶	6.68 × 10⁻⁶	1.23 × 10⁻⁵
a₀	0.0002117	0.0001602	−0.0003999	0.0003858	0.0005454	0.0003989	0.0017505	−0.0022132
a₁	4.157714	4.1582259	4.1627898	4.1561616	4.1545675	4.1561546	4.1424508	4.1789954
a₂	0.3491476	−1.5259333	−0.950065	1.4537905	0.7921954	−1.5117788	4.4979643
a₃		0.5924688	0.2092784	0.2428315	−0.2792372	0.5775411	−1.3731118
a₄		−0.1840047	−0.0564231	−0.0142767	−0.1098239	−0.1706958
a₅		0.0429772	0.0076955	0.0007023		0.0354162
a₆		−0.0070858	−0.000464			−0.0045255
a₇		0.0007349				0.0002648
a₈		−3.60 × 10⁻⁵
b₁	0.4502653		0.1420291	0.71454	0.5539226	0.0017124	1.4332245	0.3843422
b₂	0.0245085		−0.0462027	0.1823815	0.0012477		0.0864897	−0.0338921
b₃	−0.0137816			0.0005872	−0.0705542		−0.1716855	0.0137563
b₄	0.004125				−0.0034663		0.0147776	−0.0105444
b₅	−0.0006383						−0.0008095	0.0043436
b₆	4.18 × 10⁻⁵							−0.0008831
b₇								7.16 × 10⁻⁵
c. m + k = 7.
(m, k)	(5, 2)	(7, 0)	(3, 4)	(2, 5)	(1, 6)	(6, 1)	(4, 3)
Maximum Error	4.52 × 10⁻⁶	5.76 × 10⁻⁶	1.20 × 10⁻⁵	1.83 × 10⁻⁵	2.47 × 10⁻⁵	3.36 × 10⁻⁵	8.85 × 10⁻⁵
a₀	0.0003945	0.0004069	−0.0013429	0.0674587	−0.0023136	0.0019498	−0.0049402
a₁	4.1560808	4.1560877	4.1706731	3.5533193	4.1779037	4.144603	4.1994308
a₂	1.4690251	−1.5186854	−0.6775001	133.74141		−1.4869875	−3.479488
a₃	0.2391045	0.5798019	0.3559161			0.5353291	0.697147
a₄	−0.0147213	−0.1713809				−0.137747	0.0833977
a₅	0.0007021	0.0355484				0.0215142
a₆		−0.0045384				−0.0015042
a₇		0.0002651
b₁	0.7181305		0.2137831	32.001851	0.3814535	−0.0001152	−0.4379041
b₂	0.1829496		0.0046636	12.604867	−0.0254915		−0.1903784
b₃			0.0331532	−1.0269053	0.0018241		0.1142485
b₄			−0.0013196	0.0535564	−0.0014589
b₅				0.0010865	0.0005456
b₆					−6.26 × 10⁻⁵
d. m + k = 6.
(m, k)	(2, 4)	(6, 0)	(1, 5)	(3, 3)		(4, 2)	(5, 1)
Maximum Error	2.18 × 10⁻⁵	4.01 × 10⁻⁵	4.13 × 10⁻⁵	5.93 × 10⁻⁵		7.91 × 10⁻⁵	8.66 × 10⁻⁵
a₀	0.0740863	0.0021644	−0.0024841	0.1112902		−0.0039991	−0.004793
a₁	3.5003552	4.1431818	4.1779643	3.2300901		4.1888288	4.1941431
a₂	133.77185	−1.4831307		112.82908		−0.5261395	0.4951024
a₃		0.5314118		13.842787		−0.0412288	−0.0515586
a₄		−0.1356001				−0.0023828	0.0141002
a₅		0.0209355					−0.0016136
a₆		−0.0014438
b₁	31.970664		0.3803881	26.769961		0.2610645	0.5094877
b₂	12.65184		−0.0223003	14.24345		−0.0863658
b₃	−1.0557861		−0.0019663	0.2333121
b₄	0.0625968		0.0007101
b₅			−4.89 × 10⁻⁵
e. m + k = 5.
(m, k)	(1, 4)		(2, 3)	(3, 2)		(4, 1)	(5, 0)
Maximum Error	7.04 × 10⁻⁵		1.93 × 10⁻⁴	1.98 × 10⁻⁴		2.34 × 10⁻⁴	2.81 × 10⁻⁴
a₀	−0.0021528		−0.0068195	−0.0069689		−0.0079574	0.0087059
a₁	4.1778096		4.2048559	4.2056463		4.2118095	4.1014476
a₂			0.3389855	0.4231483		0.4123636	−1.3881892
a₃				0.0040529		−0.0093844	0.4305764
a₄						0.0008702	−0.0815235
a₅							0.0067557
b₁	0.3812861		0.4760923	0.496602		0.4973312
b₂	−0.0240701		−0.0019042	0.0066181
b₃	−0.0006661		0.0001546
b₄	0.0002934
f. m + k = 4.
(m, k)	(1, 3)		(2, 2)			(3, 1)	(4, 0)
Maximum Error	1.61 × 10⁻⁴		2.53 × 10⁻⁴			5.07 × 10⁻⁴	2.61 × 10⁻³
a₀	−0.0058338		−0.0046286			−0.0123297	0.0390467
a₁	4.1990741		4.1962377			4.2343323	3.9449081
a₂			0.3417114			0.4466871	−1.1263646
a₃						−0.0082	0.2442248
a₄							−0.0227788
b₁	0.3919365		0.4738405			0.5145802
b₂	−0.0310113		−0.0006776
b₃	0.0016884
g. m + k = 3 and m + k = 2.
m + k	3		3	3		2	2
(m, k)	(2, 1)		(1, 2)	(3, 0)		(1, 1)	(2, 0)
Maximum Error	5.61 × 10⁻⁴		2.35 × 10⁻³	1.09 × 10⁻²		2.19 × 10⁻²	5.80 × 10⁻²
a₀	−0.0023213		0.0172151	0.1008629		0.1295186	0.2907984
a₁	4.1871142		4.0937756	3.6822471		3.6681333	3.0786409
a₂	0.3449063			−0.8018105			−0.3446446
a₃				0.0949052
b₁	0.4720972		0.3546785			0.2410842
b₂			−0.0184772

Table A2. Rational function values for approximating the control chart constant d₃ (see (4) for definitions of the terms used in the table). Due to the large number of parameters, the table is divided based on the values of m + k. In the first column, “a” refers to the coefficients in the numerator of the rational function approximation, while “b” refers to the coefficients in the denominator. (See expression (4) in Section 2).

a. m + k = 10.
(m, k)					(9, 1)
Maximum Error					1.04 × 10⁻⁶
a₀					0.315104
a₁					5.547612
a₂					−2.84442
a₃					−0.78335
a₄					2.852878
a₅					−2.51086
a₆					1.234777
a₇					−0.36344
a₈					0.059829
a₉					−0.00424
b₁					3.396075
b. m + k = 9.
(m, k)	(2, 7)	(7, 2)	(8, 1)	(6, 3)	(9, 0)	(4, 5)	(3, 6)	(5, 4)
Maximum Error	1.49 × 10⁻⁵	1.67 × 10⁻⁵	1.83 × 10⁻⁵	2.13 × 10⁻⁵	2.85 × 10⁻⁵	3.40 × 10⁻⁵	3.46 × 10⁻⁵	3.48 × 10⁻⁵
a₀	0.222874	0.296684	0.288269	0.282365	0.425844	−0.84597	0.415148	0.416896
a₁	7.435075	5.72538	5.824849	5.81292	2.891117	156.1599	4.140362	3.804773
a₂	−1.74755	−2.7717	−3.84618	−5.82438	−7.15265	1102.012	2.558952	−0.02747
a₃		0.802064	1.369529	3.523224	9.656155	−308.826	−1.32839	−0.29831
a₄		0.200033	0.068175	−1.19533	−8.5306	−24.3344		−0.03858
a₅		−0.2218	−0.26892	0.230026	5.088273			−0.00149
a₆		0.063196	0.104807	−0.01879	−2.024
a₇		−0.00636	−0.0174		0.512864
a₈			0.00106		−0.07467
a₉					0.004743
b₁	5.437382	3.460118	3.421473	3.145292		330.4889	2.79654	2.043444
b₂	0.138415	0.668103		−0.99902		514.4913	3.534685	2.385689
b₃	2.584673			0.379228		605.6943	0.912601	−0.35895
b₄	−2.56493					−303.519	−1.08252	−0.23241
b₅	1.00801					11.36619	0.102501
b₆	−0.20567						−0.0042
b₇	0.017412
c. m + k = 8.
(m, k)	(1, 7)	(6, 2)	(4, 4)	(3, 5)	(5, 3)	(7, 1)	(2, 6)	(8, 0)
Maximum Error	2.63 × 10⁻⁵	3.29 × 10⁻⁵	3.49 × 10⁻⁵	3.65 × 10⁻⁵	4.85 × 10⁻⁵	5.25 × 10⁻⁵	6.56 × 10⁻⁵	1.06 × 10⁻⁴
a₀	−0.22202	0.361297	0.417448	0.419877	0.437844	0.19618	0.464882	0.457687
a₁	16.809	4.604874	3.713019	3.598293	3.391173	7.40356	2.936455	2.580339
a₂		−3.34365	−0.73609	−1.34281	−0.28428	−3.57152	0.112444	−5.92177
a₃		2.01313	−0.12318	0.036971	−0.2868	0.096022		7.037428
a₄		−0.72414	−0.0197		0.043483	1.070521		−5.20663
a₅		0.145333			−0.00508	−0.62695		2.458343
a₆		−0.01238				0.153482		−0.71801
a₇						−0.01431		0.117911
a₈								−0.00831
b₁	15.37598	2.36091	1.839365	1.629358	1.60254	4.905237	1.221915
b₂	−2.99882	0.458887	2.04778	1.768778	2.332491		2.699692
b₃	14.3693		−0.74976	−1.10774	−0.79915		−0.76911
b₄	−10.1252		−0.05782	0.117442			0.336715
b₅	3.946838			−0.00475			−0.09027
b₆	−0.83038						0.009819
b₇	0.072844
d. m + k = 7.
(m, k)	(4, 3)	(1, 6)	(5, 2)	(6, 1)	(2, 5)	(3, 4)	(7, 0)
Maximum Error	3.90 × 10⁻⁵	6.68 × 10⁻⁵	7.97 × 10⁻⁵	8.37 × 10⁻⁵	8.73 × 10⁻⁵	9.96 × 10⁻⁵	4.00 × 10⁻⁴
a₀	0.414579	0.448364	0.514699	0.267489	0.354628	0.347222	0.506673
a₁	3.718122	3.171776	2.357509	5.998534	4.667852	4.701032	2.154437
a₂	−0.92875		3.387868	−4.38089	−0.57936	−1.3774	−4.46778
a₃	−0.07681		−1.25034	2.220661		0.284086	4.464941
a₄	−0.00913		0.32038	−0.68974			−2.59734
a₅			−0.033	0.120869			0.887721
a₆				−0.00918			−0.1648
a₇							0.012798
b₁	1.802498	1.407458	1.17619	3.425601	2.675402	2.570444
b₂	1.976001	2.652334	4.470245		2.285831	1.913318
b₃	−0.86642	−0.74001			−0.33457	−0.66506
b₄		0.292182			−0.06511	0.143048
b₅		−0.07846			0.012773
b₆		0.008683
e. m + k = 6.
(m, k)	(4, 2)	(1, 5)	(2, 4)	(3, 3)		(5, 1)	(6, 0)
Maximum Error	1.10 × 10⁻⁴	1.28 × 10⁻⁴	1.36 × 10⁻⁴	1.91 × 10⁻⁴		2.26 × 10⁻⁴	1.34 × 10⁻³
a₀	0.342748	0.427834	0.444321	0.475274		0.217224	0.569274
a₁	4.964747	3.719638	3.378007	2.995876		6.915992	1.677416
a₂	−0.67509		−0.51529	0.213881		−4.33346	−3.09547
a₃	0.305956			−0.05469		1.749774	2.508293
a₄	−0.03384					−0.37293	−1.08081
a₅						0.032428	0.239276
a₆							−0.02135
b₁	2.973567	2.011761	1.613263	1.41501		4.279257
b₂	2.179108	2.353528	2.087442	2.394381
b₃		−0.12915	−0.59247	−0.25408
b₄		−0.04059	0.040986
b₅		0.007675
f. m + k = 5.
(m, k)		(2, 3)	(1, 4)	(3, 2)		(4, 1)	(5, 0)
Maximum Error		2.03 × 10⁻⁴	5.27 × 10⁻⁴	6.82 × 10⁻⁴		7.5 × 10⁻⁴	3.88 × 10⁻⁴
a₀		0.457381	0.523099	0.576235		−0.05848	0.650202
a₁		3.232791	2.395738	1.709512		11.17565	1.145273
a₂		0.559042		0.826375		−5.5501	−1.84151
a₃				−0.05561		1.594977	1.130005
a₄						−0.17954	−0.31863
a₅							0.034104
b₁		1.648605	0.940191	0.51129		7.753653
b₂		2.65881	2.042514	2.324483
b₃		0.024817	−0.35739
b₄			0.036746
g. m + k = 4.
(m, k)		(2, 2)		(1, 3)		(3, 1)	(4, 0)
Maximum Error		2.13 × 10⁻⁴		8.98 × 10⁻⁴		5.25 × 10⁻³	1.08 × 10⁻²
a₀		0.452862		0.476886		−1.42047	0.762477
a₁		3.338977		3.230196		36.39687	0.533722
a₂		0.491674				−12.4903	−0.74534
a₃						1.832343	0.295284
a₄							−0.03852
b₁		1.737353		1.694324		29.41969
b₂		2.63829		2.130666
b₃				−0.17919

References

Snee, R.D. Statistical Thinking and Its Contribution to Total Quality. Am. Stat. 1990, 44, 116–121. [Google Scholar] [CrossRef]
Hoerl, R.W.; Snee, R.D.; De Veaux, R.D. Applying Statistical Thinking to ‘Big Data’ Problems. WIREs Comput. Stat. 2014, 6, 222–232. [Google Scholar] [CrossRef]
Hoerl, R.W.; Snee, R.D. Statistical Thinking: Improving Business Performance; Wiley: Hoboken, NJ, USA, 2020. [Google Scholar]
Khoo, M.B.; Lim, E.G. An Improved R (Range) Control Chart for Monitoring the Process Variance. Qual. Reliab. Eng. Int. 2005, 21, 43–50. [Google Scholar] [CrossRef]
Chen, W.H.; Tirupati, D. On-line Quality Management: Integration of Product Inspection and Process Control. Prod. Oper. Manag. 1995, 4, 242–262. [Google Scholar] [CrossRef]
Wardell, D.G. Algebraic Expressions for Range Control Chart Constants. In Proceedings of the Fiftieth Annual Meeting of the Western Decision Sciences Institute, Waikoloa, HI, USA, 5–8 April 2022. [Google Scholar]
Burr, I.W. The Effect of Non-normality on Constants for $\bar{x}$ and R charts. Ind. Qual. Control 1967, 23, 563–569. [Google Scholar]
Qiu, P.; Zhang, J. On Phase II SPC in Cases When Normality is Invalid. Qual. Reliab. Eng. Int. 2015, 31, 27–35. [Google Scholar] [CrossRef]
Khakifirooz, M.; Tercero-Gómeza, V.G.; Woodall, W.H. The Role of the Normal Distribution in Statistical Process Monitoring. Qual. Eng. 2021, 3, 497–510. [Google Scholar] [CrossRef]
Mood, A.M.; Graybill, F.A.; Boes, D.C. Introduction to the Theory of Statistics; McGraw Hill: New York, NY, USA, 1974. [Google Scholar]
David, H.A. Order Statistics; John Wiley & Sons, Inc.: New York, NY, USA, 1970. [Google Scholar]
Arnold, B.C.; Balakrishnan, N. Relations, Bounds and Approximations for Order Statistic; Lecture Notes in Statistics No. 53; Springer: New York, NY, USA, 1989. [Google Scholar]
Royston, J.P. Algorithm AS 177: Expected Normal Order Statistics (Exact and Approximate). Appl. Stat. 1982, 31, 161–165. [Google Scholar] [CrossRef]
Pearson, E.S.; Hartley, H.O. (Eds.) Biometrika Tables for Statisticians; Biometrika Trust: London, UK, 1976; Volume 1. [Google Scholar]
Harter, H.L.; Balakrishnan, N. Tables for the Use of Range and Studentized Range in Tests of Hypotheses; CRC Press: Boca Raton, FL, USA, 1998. [Google Scholar]
Petrushev, P.P.; Popov, V.A. Rational Approximation of Real Functions; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Ralston, A.; Rabinowitz, P. A First Course in Numerical Analysis; Dover Publications, Inc.: New York, NY, USA, 2001. [Google Scholar]
McKay, A.T.; Pearson, E.S. A Note on the Distribution of Range in Samples of n. Biometrika 1933, 25, 415–420. [Google Scholar] [CrossRef]
Bose, R.C.; Gupta, S.S. Moments of Order Statistics from a Normal Population. Biometrika 1959, 46, 433–440. [Google Scholar] [CrossRef]
Balakrishnan, N.; Cohen, A.C. Order Statistics and Inference: Estimation Methods; Academic Press, Inc.: London, UK, 2014. [Google Scholar]
Ryan, T.P. Statistical Methods for Quality Improvement; John Wiley and Sons: Weinheim, Germany, 2011. [Google Scholar]

Figure 1. Plot of the tabulated values of d₂ as a function of sample size n. The plot suggests that the relationship between d₂ and n is logarithmic.

Figure 2. Plots of the approximation errors for (a) d₂ and (b) d₃ for the rational functions listed in Table 4. The plots show the sample sizes for which the estimation error is the largest.

Figure 3. Contour plots of the negative logarithm of the maximum error for (a) d₂ and (b) d₃ for different combinations of the orders of the polynomials in (4). Larger values are associated with smaller errors.

Table 1. Analytical results for d₂ and d₃ for samples of sizes 2 through 5.

Sample Size n	d₂	d₃
2	$2 / \sqrt{π}$	$\sqrt{2 - \frac{4}{π}}$
3	$3 / \sqrt{π}$	$\sqrt{\frac{2 π + 3 \sqrt{3} - 9}{π}}$
4	$\frac{12}{π \sqrt{π}} \tan^{- 1} (\sqrt{2})$	$\sqrt{\frac{2 π + 2 \sqrt{3} + 6}{π} - d_{2}^{2}}$
5	$\frac{30}{{(π)}^{3 / 2}} \tan^{- 1} (\sqrt{2}) - \frac{5}{\sqrt{π}}$	$\sqrt{\frac{2 π^{2} + 10 \sqrt{3} [\tan^{- 1} (\sqrt{\frac{5}{3}}) + 2 \sqrt{3} \tan^{- 1} (\sqrt{\frac{1}{5}})]}{π^{2}} - d_{2}^{2}}$

Table 2. Rational function values for approximating the control chart constant d₂. These results are for the functions with the lowest errors for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.

m + k	9	8	7	6	5	4	3	2
(m, k)	(7, 2)	(2, 6)	(5, 2)	(2, 4)	(1, 4)	(1, 3)	(2, 1)	(1, 1)
Maximum Error	4.3685 × 10⁻⁶	4.3907 × 10⁻⁶	4.5173 × 10⁻⁶	2.1767 × 10⁻⁵	7.0438 × 10⁻⁵	1.6052 × 10⁻⁴	5.6083 × 10⁻⁴	2.1880 × 10⁻²
a₀	−4.6830 × 10⁻⁵	0.00021168	0.00039446	0.0740863	−0.0021528	−0.0058338	−0.0023213	0.12951861
a₁	4.16017058	4.15771399	4.15608081	3.50035523	4.17780958	4.19907408	4.18711419	3.66813329
a₂	−1.072159	0.34914757	1.4690251	133.771848			0.34490634
a₃	0.28628465		0.23910449
a₄	−0.0791025		−0.0147213
a₅	0.01287818		0.00070213
a₆	−0.0011768
a₇	4.36 × 10⁻⁵
b₁	0.11084091	0.45026527	0.71813046	31.970664	0.38128612	0.3919365	0.47209716	0.24108419
b₂	−0.036305	0.02450851	0.18294963	12.6518404	−0.0240701	−0.0310113
b₃		−0.0137816		−1.0557861	−0.0006661	0.00168837
b₄		0.00412501		0.06259679	0.00029336
b₅		−0.0006383
b₆		4.1793 × 10⁻⁵

Table 3. Rational function values for approximating the control chart constant d₃. These results are for the functions with the lowest error for each combination of m + k. More complete tables are found in Appendix B. See (4) for definitions of the terms used in the table.

m + k	10	9	8	7	6	5	4
(m, k)	(9, 1)	(2, 7)	(1, 7)	(4, 3)	(4, 2)	(2, 3)	(2, 2)
Maximum Error	1.0426 × 10⁻⁶	1.4861 × 10⁻⁵	2.6339 × 10⁻⁵	3.9049 × 10⁻⁵	1.1010 × 10⁻⁴	2.0261 × 10⁻⁴	2.1340 × 10⁻⁴
a₀	0.315104	0.222874	−0.22202	0.414579	0.342748	0.457381	0.452862
a₁	5.547612	7.435075	16.809	3.718122	4.964747	3.232791	3.338977
a₂	−2.84442	−1.74755		−0.92875	−0.67509	0.559042	0.491674
a₃	−0.78335			−0.07681	0.305956
a₄	2.852878			−0.00913	−0.03384
a₅	−2.51086
a₆	1.234777
a₇	−0.36344
a₈	0.059829
a₉	−0.00424
b₁	3.396075	5.437382	15.37598	1.802498	2.973567	1.648605	1.737353
b₂		0.138415	−2.99882	1.976001	2.179108	2.65881	2.63829
b₃		2.584673	14.3693	−0.86642		0.024817
b₄		−2.56493	−10.1252
b₅		1.00801	3.946838
b₆		−0.20567	−0.83038
b₇		0.017412	0.072844

Table 4. Suggested algebraic expressions for control chart constants d₂ and d₃ for sample sizes 2 to 1000. In the table, x = log(n).

a. Expressions for d₂
Sample Size n	Expression for d₂	Maximum Error
2	$2 / \sqrt{π}$	0
3	$3 / \sqrt{π}$	0
4	$\frac{12}{π \sqrt{π}} \tan^{- 1} (\sqrt{2})$	0
5	$\frac{30}{{(π)}^{3 / 2}} \tan^{- 1} (\sqrt{2}) - \frac{5}{\sqrt{π}}$	0
n = 6 to 1000	$\frac{2.1168 \times 10^{- 4} + 4.1577 x + 0.3491 x^{2}}{1 + 0.4503 x + 0.0245 x^{2} - 0.0138 x^{3} + 0.4125 \times 10^{- 3} x^{4} - 6.383 \times 10^{- 4} x^{5} + 4.1793 \times 10^{- 5} x^{6}}$	4.3907 × 10⁻⁶
b. Expressions for d₃
Sample Size n	Expression for d₃	Maximum Error
2	$\sqrt{2 - \frac{4}{π}}$	0
3	$\sqrt{\frac{2 π + 3 \sqrt{3} - 9}{π}}$	0
4	$\sqrt{\frac{2 π + 2 \sqrt{3} + 6}{π} - d_{2}^{2}}$	0
5	$\sqrt{\frac{2 π^{2} + 10 \sqrt{3} [\tan^{- 1} (\sqrt{\frac{5}{3}}) + 2 \sqrt{3} \tan^{- 1} (\sqrt{\frac{1}{5}})]}{π^{2}} - d_{2}^{2}}$	0
n = 6 to 1000	$\frac{0.2229 + 7.4351 x - {1.7476 x}^{2}}{1 + 5.4374 x + {0.1384 x}^{2} + {2.5847 x}^{3} - {2.5649 x}^{4} + {1.0080 x}^{5} - {0.2057 x}^{6} + {0.0174 x}^{7}}$	1.4861 × 10⁻⁵

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Expressions for the First Two Moments of the Range of Normal Random Variables with Applications to the Range Control Chart^†

Abstract

1. Introduction

2. The Distribution and First Two Moments of the Range

2.1. Approximations to d₂ and d₃ for Samples Sizes of at Least 6

Approximation Method

3. Results

3.1. Analytical Results for n = 2, 3, 4 and 5

3.2. Approximation Results

4. Discussion

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix A.1. Derivation of the Expected Value of the Range for n = 4 and 5

Appendix A.1.1. n = 4 Case

Appendix A.1.2. n = 5 Case

Appendix A.2. Derivation of the Variance of the Range for n = 4 and 5

Appendix B. Complete Tables for the Rational Function Approximations

References

Article Metrics

Citations

Article Access Statistics

Expressions for the First Two Moments of the Range of Normal Random Variables with Applications to the Range Control Chart †

Abstract

1. Introduction

2. The Distribution and First Two Moments of the Range

2.1. Approximations to d2 and d3 for Samples Sizes of at Least 6

Approximation Method

3. Results

3.1. Analytical Results for n = 2, 3, 4 and 5

3.2. Approximation Results

4. Discussion

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix A.1. Derivation of the Expected Value of the Range for n = 4 and 5

Appendix A.1.1. n = 4 Case

Appendix A.1.2. n = 5 Case

Appendix A.2. Derivation of the Variance of the Range for n = 4 and 5

Appendix B. Complete Tables for the Rational Function Approximations

References

Article Metrics

Citations

Article Access Statistics

Expressions for the First Two Moments of the Range of Normal Random Variables with Applications to the Range Control Chart^†

2.1. Approximations to d₂ and d₃ for Samples Sizes of at Least 6