Abstract
In this work, a control chart with multiple runs rules is proposed and studied in the case of monitoring inflated processes. Usually, Shewhart-type control charts for attributes do not have a lower control limit, especially when the in-control process mean level is very low, such as in the case of processes with a low number of defects per inspected unit. Therefore, it is not possible to detect a decrease in the process mean level. A common solution to this problem is to apply a runs rule on the lower side of the chart. Motivated by this approach, we suggest a Shewhart-type chart, supplemented with two runs rules; one is used for detecting decreases in process mean level, and the other is used for improving the chart’s sensitivity in the detection of small and moderate increasing shifts in the process mean level. Using the Markov chain method, we examine the performance of various schemes in terms of the average run length and the expected average run length. Two illustrative examples for the use of the proposed schemes in practice are also discussed. The numerical results show that the considered schemes can detect efficiently various shifts in process parameters in either direction.
Keywords:
attributes control chart; average run length count data; expected average run length; inflated Poisson distribution; statistical process monitoring MSC:
62P30
1. Introduction
Statistical process monitoring (SPM) is a collection of methods and techniques which focus on the monitoring of a process and the timely detection of changes in it. The most frequently used SPM method is the control chart. The most common chart is the Shewhart chart, suggested by Walter A. Shewhart in the 1920’s, while there are two other main types of control charts, namely the cumulative sum (CUSUM) chart, proposed by Page [1], and the exponentially weighted moving average (EWMA) chart, proposed by Roberts [2]. A Shewhart chart is useful for the detection of sudden and of large magnitude shifts, while the CUSUM and EWMA charts are better than Shewhart charts in the detection of shifts of small magnitude. See [3] for more details on the properties and applications of the main types of control charts.
The superiority of CUSUM and EWMA charts in the detection of shifts of small magnitude is attributed to their inherent memory, i.e., the respective charting statistics consist of information not only from the most recent sample (or recent observation) but also from the past ones. However, even though their practical implementation is nowadays a routine application, there are still some difficulties in their statistical design, mainly in how to choose the most appropriate values for their chart parameters. From this point of view, intermediate solutions, such as supplementing a Shewhart chart with additional stopping rules based on runs (i.e., runs rules), are still popular in practical problems. We refer to [4,5] for thorough reviews of control charts with runs rules.
Control charts for attributes are used when count data are available from the process. This situation occurs when it is not possible to find a critical-to-quality (CTQ) characteristic (random variable, r.v.) that follows a continuous distribution. Usually, in the case of attributes charts, the distribution of can either take values on {0, 1, …} (e.g., in the case of monitoring the number of defects that follow a Poisson distribution) or on {0, 1, …, n} (e.g., in the case of monitoring the number of defective in a sample of size , which follows a binomial distribution). Even though Poisson distribution and the associated Shewhart-type chart, the -chart, are frequently used in the monitoring of count data, there are many cases where both are not appropriate. For example, this is the case of count data that exhibit over-dispersion, i.e., the variance of the distribution of is much larger than its mean. Recall that in Poisson distribution, mean and variance are equal. Therefore, a solution to this problem is to adopt a distributional model for the available count data from the process that can capture this deviation from the ordinary Poisson model.
Inflated probability distributions have been studied by several authors due to their flexibility in modeling over-dispersion in the data. A common sub-class of the family of inflated distributions is that of zero-inflated distributions (see, for example, Chapter 8 in [6]), where the probability for the occurrence of a zero value is much larger than the respective probability under the non-inflated distribution. Even though inflated distributions (not necessarily only at zero) have been studied in the past (Yoneda [7]), in recent years, there has also been interest in extending them to model the inflation of two or more values. See, for example, [8,9,10,11] and references therein for inflated distributions in exactly two values: at 0 and at another one non-zero value. Also, Sun et al. [12] suggested a zero-one-two inflated distribution, a distribution inflated at exactly three values. Begum et al. [13] and Rakitzis et al. [14] proposed a general inflated Poisson model that takes into account the inflation on the first values (i.e., the ) of the distribution. Also, Rakitzis et al. [14] proposed and studied a two-parameter mixture model, namely the -geometrically inflated Poisson () distribution, which can model the inflation not only on the zero-values but also on other values of the Poisson distribution, while it has only two parameters.
Control charts for inflated distributions have been studied quite extensively in the recent literature, especially those related to the monitoring of zero-inflated processes. See the overview provided in [15]. Even though in the literature exists almost every main type of chart for inflated distributions (such as Shewhart, CUSUM, and EWMA), there are very few control charts with supplementary runs rules for monitoring this type of process. Actually, the case of attributes control charts with supplementary runs rules has not been paid much attention compared to their variable counterparts. Supplementing a Shewhart chart with runs rules is an easy-to-apply solution for the problem of their insensitivity in detecting small to moderate shifts in process parameter(s). We should also mention that it is very common for attribute charts to have no lower control limit, especially in the case of high-quality processes, where the fraction of non-conforming items is very low. For example, the lower control limit of a -chart with 3 sigma limits and an IC mean lower than 9 is negative. In this case, it is suggested to use only an upper control limit on the chart (see [16]). Nelson [17], Acosta-Mejia [18], Lucas et al. [19], and Chang and Gan [20] studied attributes control charts with no lower control limit, and instead, they applied a runs rule on the lower side of the chart to detect a decrease in the process mean level.
Usually, high-quality processes can be modeled according to a discrete inflated probability distribution. Motivated by the works in [19,20], we propose and study a control chart with multiple runs rules to monitor a process and detect increases as well as decreases in its mean level. The aim is to suggest a control chart for attributes that retains the simplicity of a Shewhart-type chart with runs rules while it has an improved performance compared to its competitors. To the best of our knowledge, in the literature, there are no two-sided control charts with multiple runs rules for monitoring a processes. Also, it is worth noting that even though the properties of the proposed scheme are investigated in the case of monitoring a process, after some straightforward but necessary modifications, it can be used for the monitoring of any other process which is modeled according to a discrete probability distribution. These are the main scopes and motivations of this work.
The structure of this work is as follows: In Section 2, we present the main properties of the distribution. In Section 3, we introduce the proposed scheme and present the measures for evaluating its performance. In Section 4, we present the findings of an extensive numerical study regarding the statistical design and the performance of the proposed chart. Practical guidelines for applying the chart in practice are given as well. In Section 5, we present two illustrative examples (using real data), which show how to implement the proposed charts in practice. Finally, in Section 6, the conclusions and main findings of this work are summarized, while topics for future research are also given.
2. The r-Geometrically Inflated Poisson Distribution
We start this section by presenting the basic properties of the -geometrically inflated Poisson distribution. Further details can be found in Rakitzis et al. [14]. Let be a discrete random variable with support . If the probability mass function (pmf) of is
then we say that follows the -geometrically inflated Poisson distribution with parameters (the inflation parameter), and (i.e., ). Also, satisfies the inequality for every (so as is a true pmf), while is the usual indicator function. For , the distribution reduces to the ZIP distribution with parameters and .
The cumulative distribution function (cdf) of the distribution is given by
The represents the smallest integer contained in and is the cdf of the Poisson distribution with parameter . In addition, the mean of the distribution equals
where, for , it is .
For a given value, the parameters and can be estimated by using either the maximum likelihood estimation method or the method of moments; see [14] for further details on estimation methods for the parameters of the distribution. In the present work, we assume that process parameters are known, or that they have been estimated from a (sufficiently) large Phase I sample.
3. The Proposed Monitoring Scheme
3.1. Operation of the Proposed Monitoring Scheme
In this section, we present the operation of the proposed monitoring scheme that can be used for the monitoring of a process. Following the setup in [21], we assume that the value of is predetermined and remains unaffected by the presence of assignable causes. Moreover, in this work the aim is to detect either upward or downward shifts in the in-control (IC) process mean level, which is denoted as . The control charts studied by [21,22] can detect only upward shifts in . Let us also assume that at each sampling stage, an individual observation is obtained from the process. We denote as , the IC values of the process parameters. When the process is OOC, we assume that the OOC values of the process parameters are and with and . When the process is IC.
Both and are obtained via Equation (3) for the respective IC and OOC values of and . It is worth mentioning that in practice, it is of great importance to detect an increase in the mean of the process, that is, a change from to , because it is related to process deterioration. For example, control chart operators are interested in detecting an increase in the expected number of nonconformities of the inspected units or an increase in the expected weekly number of confirmed new infections from a specific disease. In addition, in modern statistical process monitoring (see, for example, [23,24,25,26]), the case of decreases is considered very important because it is related to process improvement. In such cases, the interest is in detecting a decrease in the expected number of non-conformities in the inspected units, which can be considered an indication that the attempts for process improvement have been successful. Note also that this improvement can be attributed to the occurrence of assignable causes, such as the recruitment of well-trained personnel or the use of improved raw materials. Consequently, for continuous improvement, it becomes essential to consider the improvement case. In this work, we consider both OOC cases: or .
Let be positive integers with and . Then, the proposed monitoring scheme, to be denoted as , gives an OOC signal when at least one of the following events occurs:
- i.
- A single value is beyond an upper control limit (Region 1);
- ii.
- l-out-of-m successive values are in the interval (Region 2) with the intermediate values being in the interval (Region 3);
- iii.
- k successive values are in the interval (Region 4);
whichever of the (i)–(iii) occurs first.
The suggested scheme can be viewed as an extension of the schemes studied by [19,20]. Specifically, in the above-mentioned works, the rule for detecting a decreasing shift in is of the type -out-of- successive zero values. For the detection of increasing shifts in , Lucas et al. [19] applied only the rule of a single value above an upper control limit while Chang and Gan [20] applied a rule of type -out-of- consecutive points above an .
The regions of the chart are given in Figure 1. To apply the proposed control chart in practice, the values of , , , and must be determined for the given integer values (). This is carried out via an appropriate design procedure; the values are determined so the scheme has the desired IC performance, and it is sensitive enough in the detection of specific shifts in process parameters.
Figure 1.
Regions of the control chart.
3.2. Performance of the Proposed Monitoring Scheme
The performance of a process monitoring scheme is evaluated by the run length () distribution, which is defined as the number of points plotted on the chart until it gives for the first time an OOC signal.
The computation of the properties of the scheme is feasible using the finite Markov chain embedding technique of Fu and Koutras [27] (see also [28,29] and references therein). Further details on the method can be found in the Appendix A. Using this method, it is possible to calculate the entire distribution, as well as its expected value , which is known as the average run length (). The is the most common performance measure of a control chart.
Apart from the there are other performance measures that can be viewed as measures of the overall performance of the chart. Specifically, the evaluates the performance of a chart at a specific shift in process parameters and , respectively. In practice, these shifts are rarely known, or practitioners may want to know how the chart performs for a range of shifts. For this reason, in this work, we use also the expected average run length (), which is defined as (see, for example, [30,31])
where and are the intervals for the shifts and in and , respectively. The chart with the minimum is the one with the best overall performance at the specific range of shifts.
4. Numerical Results
In this section, we present the results of a numerical study regarding the performance of the chart. Specifically, we consider the following specific values for , (see Table 1):
Table 1.
The schemes under study.
These values correspond to very well-known runs rules such as the 2-of-2, the 2-of-3, and the 4-of-5, to mention a few.
Next, in Table 2 we provide six indicative scenarios for different IC processes. Cases 5 and 6 correspond to a zero-inflated Poisson (ZIP) process. The IC process mean level is given in the third row of Table 2 and varies from 0.4 to 2.6250 for the different 6 scenarios. Thus, they can be considered cases of high-quality processes.
Table 2.
Scenarios for the IC Processes.
For each IC scenario in Table 2, we determined the values of the design parameters , , and for the seven schemes in Table 1 to have comparable IC performance. For illustrative purposes, we considered an IC approximately equal to 100. Any other value can be used depending on practitioners’ needs.
The design procedure is based on an extensive grid search on the possible positive integer values for , , and which satisfy the following two constraints: (I) and (II) . Then, any combination of , , and that gives an IC approximately equal to the pre-specified value, i.e., in the interval , is considered accepted.
Once all the possible combinations for that give the desired IC performance have been determined, we evaluated the OOC of the proposed charts for several pairs of shifts . The considered pairs are while for the process is IC.
In Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 we provide the best charts, i.e., the ones with the minimum OOC values, for various shifts in and when the IC is (approximately) equal to 100. The shifts are given in columns ‘’ and ‘’, while the OOC values and of the process parameters are given in the respective columns. The column ‘’ gives the OOC process mean level, while the column ‘’ gives the change in IC process mean level. Note that when , there is a decrease in the process mean level, otherwise, there is an increase. Also, the results in Table 3, Table 4, Table 5, Table 6, Table 7 and Table 8 have been sorted according to the values of this column to discriminate the decreases from the increases.
Table 3.
Best charts in terms of , Case 1, .
Table 4.
Best charts in terms of , Case 2, .
Table 5.
Best charts in terms of , Case 3, .
Table 6.
Best charts in terms of , Case 4, .
Table 7.
Best charts in terms of , Case 5, .
Table 8.
Best charts in terms of , Case 6, .
For comparison purposes, we included the profiles of the following schemes: The upper one-sided Shewhart chart in Mamzeridou and Rakitzis [21] (in column ‘SH’), a lower one-sided scheme, which is the add-on procedure suggested by Lucas et al. [19] and gives an OOC signal when -out-of- () consecutive zero values occur (in column ‘’) and a two-sided scheme (in column ‘Combined’) that combines the previously mentioned one-sided schemes. This combined scheme is the procedure suggested by Lucas et al. [19]. Note also that the Shewhart chart has one upper control limit ; thus, its performance is evaluated only for increasing shifts in . The value of is determined so as its IC performance is as close as possible to 100 in order to have a fair comparison with the other schemes. Given the values of , , and , the is calculated by
Furthermore, the for the scheme is given by (see [19])
where . Also, its performance is evaluated only for decreasing shifts in . The value of is determined so the IC is also as close as possible to 100. Finally, the of the two-sided that combines the two one-sided procedures equals
where . Again, the values of are determined so the IC is as close as possible to 100.
The numerical results for the charts show that there is not a single chart that has the best performance, neither for all shifts nor for all different cases. For the upper side of the chart, we suggest using the rule 2-out-of-, , depending on the IC values of the process parameters as well as on the values of the shifts . For the lower side of the chart, larger values for are needed when than when . For the latter, a value around 11–15 is sufficient in most of the cases, but the exact value depends on the shift we want to detect.
The comparison with two one-sided charts and the combined two-sided chart shows that there is at least one chart that has better performance, in almost all the considered cases. It should be mentioned that due to the discrete nature of the process, the IC of the upper Shewhart chart, the lower-sided scheme, and the combined two-sided chart is not always comparable with that of the . Therefore, conclusions should be made with caution. Another interesting point is that the rule of -out-of- consecutive zero values works well in the case of zero-inflated Poisson processes (Cases 5 and 6) but its performance is weak in the remaining ones. Thus, the add-on procedure suggested by Lucas et al. [19] is still effective when there is an excessive number of zero values in the data but when there is inflation in zero and non-zero values, a modification is needed. This modification is offered by the proposed chart.
Another interesting finding is that there are cases where a change in one of the process parameters results in a . See, for example, Case 2 for and (1.1, 1) or Case 3 for . The charts can only marginally detect this change while they cannot distinguish the type of shift. This is one of the limitations of this study. A solution is to use, if possible, rational samples of size (instead of individual observations), as well as an appropriate statistic (e.g., the MLEs from each sample) to improve chart’s ability to discriminate between the different types of shifts. This topic needs a separate careful investigation, and it is left for a future study.
Next, in Table 9 and Table 10, we provide the values for the seven charts in Table 1 and the combined two-sided chart for each of the six cases in Table 2. For the shifts in process parameters, we considered the following two scenarios:
- ▪
- Scenario 1: and
- ▪
- Scenario 2: and
Table 9.
Best charts based on , , Scenario 1.
Table 9.
Best charts based on , , Scenario 1.
| Case | Combined | |||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 65.31 2, 6, 7, 10 | 64.89 2, 6, 7, 10 | 59.30 2, 5, 8, 9 | 58.76 2, 5, 8, 9 | 61.34 2, 3, 10, 11 | 64.19 2, 3, 12, 8 | 63.94 2, 3, 7, 10 | 142.59 7, 4 |
| 2 | 68.67 1, 4, 5, 8 | 68.16 1, 4, 5, 8 | 64.57 1, 3, 6, 8 | 63.92 1, 3, 6, 8 | 67.95 1, 3, 5, 8 | 69.72 1, 3, 5, 8 | 68.53 1, 2, 5, 8 | 84.88 5, 4 |
| 3 | 134.41 1, 3, 6, 14 | 64.67 1, 3, 9, 9 | 63.47 1, 3, 11, 9 | 87.35 1, 4, 6, 10 | 76.26 1, 3, 7, 8 | 74.22 1, 2, 7, 8 | 90.79 1, 2, 7, 8 | 130.75 6, 5 |
| 4 | 64.39 3, 7, 10, 9 | 50.53 4, 6, 11, 15 | 51.31 4, 6, 12, 15 | 51.17 4, 6, 12, 15 | 51.56 4, 5, 10, 14 | 54.06 4, 5, 9, 15 | 56.68 3, 4, 9, 10 | 144.35 9, 4 |
| 5 | 78.45 1, 2, 6, 25 | 127.77 1, 3, 7, 21 | 94.11 0, 2, 6, 19 | 93.71 0, 2, 6, 19 | 116.05 3, 4, 6, 49 | 116.44 3, 4, 6, 49 | 116.45 3, 4, 6, 49 | 95.68 5, 17 |
| 6 | 42.13 0, 1, 14, 43 | 42.17 0, 1, 13, 45 | 42.16 0, 1, 13, 45 | 42.17 0, 1, 13, 45 | 137.79 7, 8, 13, 50 | 143.94 3, 4, 9, 45 | 144.82 7, 8, 13, 50 | 413.46 10, 27 |
Table 10.
Best charts based on , , Scenario 2.
Table 10.
Best charts based on , , Scenario 2.
| Case | Combined | |||||||
|---|---|---|---|---|---|---|---|---|
| 1 | 46.17 2, 5, 7, 11 | 44.67 2, 6, 7, 10 | 40.29 2, 5, 8, 9 | 39.84 2, 5, 8, 9 | 38.92 2, 3, 10, 11 | 43.07 2, 3, 7, 11 | 43.46 2, 3, 7, 10 | 104.55 7, 4 |
| 2 | 48.90 1, 4, 5, 8 | 48.32 1, 4, 5, 8 | 43.39 1, 3, 6, 8 | 42.86 1, 3, 6, 8 | 47.71 1, 2, 7, 9 | 49.89 1, 3, 5, 8 | 48.76 1, 2, 5, 8 | 60.22 5, 4 |
| 3 | 101.20 1, 3, 6, 14 | 43.21 1, 3, 9, 9 | 42.83 1, 3, 11, 9 | 61, 64 1, 4, 6, 10 | 52.69 1, 3, 7, 8 | 48.28 1, 2, 7, 8 | 131.31 1, 2, 7, 8 | 100.17 6, 5 |
| 4 | 46.58 3, 7, 10, 9 | 34.15 4, 6, 11, 15 | 34.58 4, 6, 12, 15 | 34.47 4, 6, 12, 15 | 34.72 4, 5, 10, 14 | 37.15 4, 5, 9, 15 | 39.54 3, 4, 9, 10 | 141.35 9, 4 |
| 5 | 58.78 1, 2, 6, 25 | 85.73 1, 3, 5, 24 | 85.23 1, 3, 5, 24 | 85.11 1, 3, 5, 24 | 95.02 3, 4, 6, 49 | 94.99 3, 4, 6, 49 | 95.05 3, 4, 6, 49 | 269.10 5, 17 |
| 6 | 27.17 0, 1, 14, 43 | 26.50 0, 1, 13, 45 | 26.49 0, 1, 13, 45 | 26.49 0, 1, 13, 45 | 116.18 7, 8, 13, 50 | 90.86 0, 1, 14, 23 | 118.89 7, 8, 13, 50 | 6091.24 10, 27 |
The first line of each cell consists of the value (in two decimals accuracy), while in the second line we provide the values of the chart’s design parameters in the form . For the combined chart, the parameters are given as .
The results in Table 9 and Table 10 reveal that there is not one scheme among the examined ones that outperforms the others in terms of . However, in none of the six cases do the and charts achieve the minimum value, while the is the best chart in Case 1 and only under Scenario 2. On the other hand, one of the schemes , is the best scheme in almost all Cases, under both Scenarios. We also notice that the combined scheme does not have a competitive performance except for Case 5 under Scenario 1.
5. Examples
In this section, we present two real data examples regarding the application of the proposed charts in practice. First, we discuss an application of the proposed chart in the monitoring of changes in the number of unintentional needle-stick occurrences per day in a hospital. The second example is from the area of public health and focuses on the monitoring of the monthly number of poliomyelitis cases in the USA. Both examples highlight the usefulness of control charts as powerful tools in biosurveillance and the detection of abnormalities. Consequently, authorities (such as hospital’s administration or the center for disease prevention and control) can be warned based on the signals given by the charts and take actions to improve the quality of the services offered by the professionals in health-care units.
5.1. Monitoring the Daily Number of Unintentional Needle-Stick Occurrences
In the first example, we consider a case where we are interested in the quality of the provided healthcare services. An increase on the average daily number of unintentional needle-stick occurrences is an indication of process deterioration, while a decrease shows improved performance by the hospital staff.
We use the dataset provided by Fatahi et al. [32] and analyzed further by Aly et al. [33]. Following [33], we assume that the data are from a ZIP process, and we use the first 40 observations as a Phase I sample to estimate the process parameters. Using the maximum likelihood method, we obtain , (in two decimals accuracy). Then, we apply the Shewhart chart as a Phase I analysis method. For illustrative purposes we assume that the desired IC value is (approximately) equal to 200, yielding a for the Shewhart chart equal to 6. The theoretical IC performance of the chart implies that on average, a false alarm signal would be triggered once in 6.5 months. Note also that the (theoretical) IC equals 204.39.
The respective chart is given in Figure 2 from which we notice that none of the 40 points exceeds the . Therefore, the process was IC, when these values have been collected, and we can proceed to the Phase II analysis by assuming that the IC values of the process parameters equal the respective estimates, that is , .
Figure 2.
The ZIP Shewhart chart for the Phase I data of the number of unintentional needle-stick occurrences per day in a hospital.
Since there are no further details regarding the future behavior of the process and the shifts that are considered important and must be detected quickly, we will use for Phase II monitoring the best chart in terms of the , with a desired . Again, we consider the two scenarios in Section 4. In Table 11, we provide for each chart the minimum value (first line), the IC (second line), and the set of values , in the third line, which gives the minimum . Even though the chart attains the minimum , the IC is not very close to the desired nominal value of 200. Thus, we choose the as the best chart in terms of , for both scenarios, since its IC performance is much closer to the desired one.
Table 11.
Best charts based on , for a ZIP process with and .
In Figure 3, we provide the chart with for the 50 Phase II observations. This chart gives an OOC signal at point 35 since points 33 and 35 are plotted in Region 2, while point 34 is plotted in Region 3. These points are marked with . It is worth noting that the gives this OOC signal at (almost) the same period with the AEWMA chart in [33].
Figure 3.
The chart with .
5.2. Monitoring the Monthly Number of Poliomyelitis Cases
In the second example, we consider the monthly number of polio cases in the USA (see also [21]). Rakitzis et al. [22] used 131 observations (from January 1973 to November 1983) where observations 1–100 are the Phase I dataset, and the remaining ones are the Phase II dataset. According to the results of the Phase I analysis conducted by Rakitzis et al. [22], the is the model with the best fit in the data when the process is IC. We assume that the IC values of process parameters are known and proceed directly to Phase II monitoring using the observations during the period May 1981–November 1983.
For illustrative purposes, we assume that the desired IC value is (approximately) equal to 20. This means that, on average, every 20 months, the chart gives a false alarm signal. Larger IC , such as the well-known textbook value 370.4, might not be very useful because, in this case, the chart would give, on average, a false alarm (i.e., a false indication of a change in the average monthly number of polio cases) every 370 months (30 years). This is a (very) long period for an infectious disease to retain its initial characteristics.
For Phase II monitoring, we will use the best chart in terms of the and IC as close as possible to 20. Again, we consider the two scenarios in Section 4. In Table 12, we provide for each chart the minimum value (first line), the IC (second line), and the set of values , in the third line, which gives the minimum . We choose the as the best chart since it has the minimum and its performance is very close to the desired one.
Table 12.
Best charts based on , for a GIP1 process with and .
In Figure 4, we provide the chart with for the 31 Phase II observations. The occurrence of 8 successive points in Region 4 (points 6–13, marked with *) indicates an OOC situation at point 13, which is related to a possible decrease in the average monthly number of polio cases. It is worth noting that for the same dataset, the charts proposed in [21,22] are not able to detect this OOC situation since they are upper one-sided charts and can only detect increases in the process mean level. Also, another OOC signal is given at point 31 (marked with *) since this point is above the (Region 1).
Figure 4.
The chart with .
6. Conclusions
In this work, we proposed and studied a control chart for attributes with multiple runs rules that can be used for detecting either increasing or decreasing shifts in the process mean level. The suggested scheme is preferable when the lower control limit of the usual Shewhart-type chart is negative and thus is not capable of detecting a decrease in the process mean level. The proposed chart was studied in the case of monitoring zero- or general inflated Poisson processes. However, it can be used (after some necessary but straightforward modifications) for the monitoring of any other process with count data. The Markov chain method was used for studying its in-control and out-of-control performance, and practical guidelines were provided. The out-of-control performance of the chart was evaluated in terms of and . The numerical results showed that there is not a unique scheme among the examined ones, with the best performance for all shifts, under either performance measures. However, for improving the performance in the detection of increasing shifts we suggest the use of a runs rule of type 2-out-of-. For the case of decreasing shifts, larger values are needed in the case of zero-inflated Poisson processes than in the case of general inflated Poisson processes.
It should be also mentioned that in this work, we assumed that the value of is known. Clearly, as well as and must be determined before the application of the charts. Therefore, the suggested approach is to first conduct a detailed model selection procedure among the possible candidate models (e.g., various distributions with different values) and then, by applying model selection criteria and/or goodness-of-fit tests, to choose the model with the best fit in the data. With this procedure, practitioners can determine the appropriate value for .
Topics for future research consist of studying the proposed charts in the case of estimated parameters, as well as the case of a chart with variable sampling intervals and variable sample sizes. For both topics, the research is in progress.
Finally, the programs for reproducing the results provided in this work have been written in R [34], version 4.3.0 and are available from the authors upon request.
Author Contributions
Methodology, E.M. and A.C.R.; Software, E.M. and A.C.R.; writing—original draft preparation, E.M.; writing—review and editing, E.M. and A.C.R. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
The data in Section 5.1 can be found in Reference [33], while the data in Section 5.2 can be found in the R package gamlss.data, version 6.0-2.
Acknowledgments
The authors would like to thank the three anonymous reviewers for their valuable comments and suggestions, which helped us to improve the paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Here, we briefly describe the Markov chain method that we used to study the proposed two-sided control chart . Let us consider a two-sided chart where four regions are defined, as in Figure 1. The probability that a single point falls on Region is denoted as , and for , , equals
where . The c.d.f. is given in Equation (2).
For illustrative purposes, we consider the case of the two-sided chart. Using the finite Markov chain imbedding (FMCI) method, it is possible to study any other chart and derive its entire run length distribution. Let be a sequence of i.i.d. multistate random variables taking values in the set such that , , where are the previously defined probabilities. The chart gives an OOC signal when either a single point is above the or if two-out-of-three successive points are in the interval and at most one intermediate point is in the interval or if consecutive points are in the interval .
Let be the waiting time (random variable) until the first occurrence of the compound pattern:
This means that is the distribution of the number of trials until the occurrence of one of the following simple patterns for the first time:
Then, the distribution of coincides with the run length distribution of the chart and can be obtained as follows. First, the simple patterns are decomposed into the following sub-patterns, defining the states of the embedding Markov chain:
The first states are the transient states of the Markov chain while state is the unique absorbing state. Let also be the imbedded Markov chain with state space . Then, is in transient state () if the maximum ending block of the first trials (counting backward) is identified to be the block corresponding to value .
Therefore, the transition probability matrix of the imbedded Markov chain is
where is the matrix of the transition probabilities between the transient states of the Markov chain, is the identity matrix and is a vector of ones.
From the theory of Markov chains, the average run length of the chart equals
where is the initial probabilities vector, with
References
- Page, E.S. Continuous inspection schemes. Biometrika 1954, 41, 100–115. [Google Scholar] [CrossRef]
- Roberts, S.W. Control chart tests based on geometric moving averages. Technometrics 1959, 1, 239–250. [Google Scholar] [CrossRef]
- Montgomery, D.C. Introduction to Statistical Quality Control, 8th ed.; John Wiley & Sons: New York, NY, USA, 2020. [Google Scholar]
- Bersimis, S.; Koutras, M.V.; Rakitzis, A.C. Run and scan rules in statistical process monitoring. In Handbook of Scan Statistics; Glaz, J., Koutras, M., Eds.; Springer: New York, NY, USA, 2020. [Google Scholar] [CrossRef]
- Jalilibal, Z.; Karavigh, M.H.A.; Amiri, A.; Khoo, M.B.C. Run rules schemes for statistical process monitoring: A literature review. Qual. Technol. Quant. Manag. 2023, 20, 21–52. [Google Scholar] [CrossRef]
- Johnson, N.L.; Kemp, A.W.; Kotz, S. Univariate Discrete Distributions, 3rd ed.; John Wiley & Sons: Hoboken, NJ, USA, 2005. [Google Scholar]
- Yoneda, K. Estimations in some modified Poisson distributions. Yokohama Math. J. 1962, 10, 73–96. [Google Scholar]
- Murat, M.; Snyzal, D. Non-zero inflated modified power series distributions. Commun. Stat.-Theory Methods 1998, 27, 3047–3064. [Google Scholar] [CrossRef]
- Melkersson, M.; Rooth, D.O. Modeling female fertility using inflated count data models. J. Popul. Econ. 2000, 13, 189–203. [Google Scholar] [CrossRef]
- Lin, T.H.; Tsai, M.H. Modeling health survey data with excessive zero and K responses. Stat. Med. 2013, 32, 1572–1583. [Google Scholar] [CrossRef]
- Tang, Y.; Liu, W.; Xu, A. Statistical inference for zero-and-one inflated Poisson models. Stat. Theory Relat. Fields 2017, 1, 216–226. [Google Scholar] [CrossRef]
- Sun, Y.; Zhao, S.; Tian, G.L.; Tang, M.L.; Li, T. Likelihood-based methods for the zero-one-two inflated Poisson model with applications to biomedicine. J. Stat. Comput. Simul. 2023, 93, 956–982. [Google Scholar] [CrossRef]
- Begum, A.; Mallick, A.; Pal, N. A generalized inflated Poisson distribution with application to modeling fertility data. Thail. Stat. 2014, 12, 135–139. [Google Scholar]
- Rakitzis, A.C.; Castagliola, P.; Maravelakis, P.E. A two-parameter general inflated Poisson distribution: Properties and applications. Stat. Methodol. 2016, 29, 32–50. [Google Scholar] [CrossRef]
- Mahmood, T.; Xie, M. Models and monitoring of zero-inflated processes: The past and current trends. Qual. Reliab. Eng. Int. 2019, 35, 2540–2557. [Google Scholar] [CrossRef]
- Paulino, S.; Morais, M.C.; Knoth, S. An ARL-unbiased c-chart. Qual. Reliab. Eng. Int. 2016, 32, 2847–2858. [Google Scholar] [CrossRef]
- Nelson, L.S. Supplementary runs tests for np control charts. J. Qual. Technol. 1997, 29, 225–227. [Google Scholar] [CrossRef]
- Acosta-Mejia, C.A. Improved p charts to monitor process quality. IIE Trans. 1999, 31, 509–516. [Google Scholar] [CrossRef]
- Lucas, J.M.; Davis, D.J.; Saniga, E.M. Detecting improvement using Shewhart attribute control charts when the lower control limit is zero. IIE Trans. 2006, 38, 699–709. [Google Scholar] [CrossRef]
- Chang, T.C.; Gan, F.F. Modified Shewhart charts for high yield processes. J. Appl. Stat. 2007, 34, 857–877. [Google Scholar] [CrossRef]
- Mamzeridou, E.; Rakitzis, A.C. Synthetic-type control charts for monitoring general inflated Poisson processes. Commun. Stat.-Simul. Comput. 2022, in press. [Google Scholar] [CrossRef]
- Rakitzis, A.C.; Castagliola, P.; Maravelakis, P.E. Cumulative sum control charts for monitoring geometrically inflated Poisson processes: An application to infectious disease counts data. Stat. Methods Med. Res. 2018, 27, 622–641. [Google Scholar] [CrossRef] [PubMed]
- Woodall, W.H.; Adams, B.M.; Benneyan, J.C. The use of control charts in healthcare. In Statistical Methods in Healthcare; Faltin, F.W., Kenett, R.S., Ruggeri, F., Eds.; John Wiley & Sons: Oxford, UK, 2012; pp. 251–267. [Google Scholar]
- Reynolds, M.R., Jr. The Bernoulli CUSUM chart for detecting decreases in a proportion. Qual. Reliab. Eng. Int. 2013, 29, 529–534. [Google Scholar] [CrossRef]
- Bourke, P.D. Detecting a downward shift in a proportion using a geometric CUSUM chart. Qual. Eng. 2020, 32, 75–90. [Google Scholar] [CrossRef]
- Chakraborti, S.; Kumar, N.; Rakitzis, A.C.; Sparks, R.S. Time between events monitoring with control charts. In Wiley StatsRef: Statistics Reference Online; Balakrishnan, N., Colton, T., Everitt, B., Piegorsch, W., Ruggeri, F., Teugels, J.L., Eds.; John Wiley & Sons: Hoboken, NJ, USA, 2023; pp. 1–13. [Google Scholar]
- Fu, J.C.; Koutras, M.V. Distribution theory of runs: A Markov chain approach. J. Am. Stat. Assoc. 1994, 89, 1050–1058. [Google Scholar] [CrossRef]
- Fu, J.C.; Lou, W.W. Distribution Theory of Runs and Patterns and Its Applications: A Finite Markov Chain Imbedding Approach, 1st ed.; World Scientific: Singapore, 2003. [Google Scholar]
- Fu, J.C.; Shmueli, G.; Chang, Y.M. A unified Markov chain approach for computing the run length distribution in control charts with simple or compound rules. Stat. Probab. Lett. 2003, 65, 457–466. [Google Scholar] [CrossRef]
- Machado, M.A.G.; Costa, A.F.B. A side-sensitive synthetic chart combined with an X chart. Int. J. Prod. Res. 2014, 52, 3404–3416. [Google Scholar] [CrossRef]
- Mukherjee, A.; Sen, R. Optimal design of Shewhart-Lepage type schemes and its application in monitoring service quality. Eur. J. Oper. Res. 2018, 266, 147–167. [Google Scholar] [CrossRef]
- Fatahi, A.A.; Noorossana, R.; Dokouhaki, P. Zero inflated Poisson EWMA control chart for monitoring rare health-related events. J. Mech. Med. Biol. 2012, 12, 1250065. [Google Scholar] [CrossRef]
- Aly, A.A.; Saleh, N.A.; Mahmoud, M.A. An adaptive EWMA control chart for monitoring zero-inflated Poisson processes. Commun. Stat.-Simul. Comput. 2022, 51, 1564–1577. [Google Scholar] [CrossRef]
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2023; Available online: https://www.r-project.org/ (accessed on 29 September 2023).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).