1. Introduction
As a key statistical technique in SPC, Control Charts (CCs) serve as graphical tools for analyzing process stability. By plotting quality characteristic values against predetermined control limits, CCs can promptly issue alerts upon detecting anomalous information, enabling proactive identification of production issues rather than relying on post-production inspection. This proactive capability has established control charts as essential methods for improving product quality and yield, with widespread applications across industrial and service sectors (see, e.g., [
1,
2,
3]).
Based on the number of monitored variables, CCs can be classified into univariate and multivariate types. Univariate CCs originated in the 1920s, with the first quality chart proposed by W.A. Shewhart in the United States. However, Shewhart CCs rely solely on current sample information and exhibit limited sensitivity in detecting small shifts in process parameters. To address this limitation, the Cumulative Sum (CUSUM) and Exponentially Weighted Moving Average (EWMA) CCs were developed to incorporate historical data (see, e.g., [
4,
5,
6,
7,
8,
9,
10,
11]). On the other hand, as products and manufacturing processes have grown increasingly complex, the need to simultaneously monitor multiple correlated quality characteristics has drawn industrial and academic attention to multivariate CCs. After decades of development, numerous multivariate CCs have emerged, with the three most typical structures being Hotelling’s
, Multivariate CUSUM (MCUSUM), and Multivariate EWMA (MEWMA) (see, e.g., [
12,
13,
14,
15,
16]). Subsequent research on CCs has primarily focused on three aspects: adapting to different data distributions (see [
17,
18,
19,
20]; updating statistical models (see [
21,
22,
23,
24]); and modifying charting structures, (see [
25,
26,
27]).
With the rapid advancement of manufacturing capabilities and inspection technologies, the number of variables requiring monitoring in production processes has increased dramatically. However, as dimensionality grows, traditional multivariate statistical methods become increasingly susceptible to noise accumulation, which severely impairs their ability to detect shifts in high-dimensional data. Through simulation studies, Wang and Jiang [
28] demonstrated that when the data dimension is 20 and only one variable undergoes a mean shift, the Hotelling’s
chart struggles to detect the change due to interference from noise in the remaining dimensions.
In modern quality monitoring scenarios, simultaneous faults across multiple locations are relatively uncommon. Instead, a single fault source is typically detected by a subset of correlated process sensors, meaning that only a few variables experience distributional shifts. Such sparse shifts are often obscured by noise from other dimensions and fail to be prominently reflected in traditional multivariate statistics, leading to monitoring failures. Consequently, an effective strategy for statistical inference in high-dimensional process monitoring is to incorporate variable selection. Wang and Jiang [
28] proposed a variable-selection-based multivariate statistical process control method that uses a forward-selection algorithm to screen faulty variables, achieving simultaneous process monitoring and fault diagnosis. In [
29,
30], variable selection was further applied to the MEWMA and MCUSUM CCs, respectively, improving detection efficiency for shifts of various magnitudes. In [
31], an MEWMA chart was proposed based on the Least Absolute Shrinkage and Selection Operator (LASSO) method, integrating high-dimensional data monitoring with change point detection using goodness-of-fit tests. In [
32], the retrieval speed of Out-Of-Control (OOC) variables was enhanced by adopting the Least Angle Regression (LARS) algorithm instead of stepwise regression.
Parallel to these developments in the variable dimension, significant advances have been made in the temporal dimension through adaptive sampling strategies. While conventional control charts employ Fixed Sampling Intervals (FSI) predetermined by production schedules, research has demonstrated that allowing sampling frequency to vary based on process data can substantially enhance detection efficiency. This concept underpins the Variable Sampling Interval (VSI) scheme, in which the time until the next sample is shortened if the current control statistic suggests a potential OOC condition and lengthened if no process disturbance is indicated. The primary advantage of VSI over FSI lies in its ability to detect process changes more rapidly (see, e.g., [
33,
34,
35,
36,
37]). Previous studies have demonstrated that employing just two sampling intervals is sufficient to achieve effective detection performance across a range of process shifts [
38,
39,
40].
Current research on adaptive EWMA CCs mainly includes two aspects. The first aspect concerns dynamically adjusting the smoothing parameter
to achieve global sensitivity to both small and large shifts. For example, ref. [
41] estimated the mean shift using an EWMA-based estimator, then determined
for the plotting EWMA statistic through a continuous function. The authors of [
42] replaced the conventional score functions with the Hampel function and proposed a new adaptive EWMA chart, while those of [
43] introduced a Support Vector Regression (SVR)-based adaptive EWMA scheme for improved mean monitoring in industrial processes. In [
44], a streamlined and parameter-free adaptive EWMA method was introduced for monitoring Poisson processes. In [
45], the authors constructed an EWMA scheme by introducing an adaptive method, aiming to improve the efficiency of detecting the existing ratio of two normal random variable charts for both small and large shifts. Finally, ref. [
46] proposed an EWMA control chart for inverse Maxwell processes.
The second aspect focuses on adjusting sampling intervals and sample sizes in the temporal dimension. In [
47], the authors evaluated the Average Time to Signal (ATS) properties of two-sided VSI EWMA CCs and presented a practical design procedure. In [
48], an EWMA scheme with VSI was developed to monitor the process coefficient of variation. In [
49,
50], the authors investigated the performance of the VSI EWMA scheme with estimated parameters, while [
37] introduced a Max EWMA scheme with VSI strategy for simultaneously monitoring both the frequency and amplitude of an event. Finally, ref. [
51] developed a VSI MEWMA scheme to monitor the mean vector of the Gumbel’s Bivariate Exponential (GBE) distribution. In addition, several studies have extended the VSI strategy to adaptive EWMA schemes [
52,
53,
54,
55,
56].
The latest developments in intelligent fault diagnosis have highlighted the importance of effective and efficient monitoring structures that can detect complex and compound faults in contemporary industrial facilities, especially when data availability is limited [
57]. The problem of detecting weak and incipient faults in high-dimensional process environments still represents a major issue, as small process variations are often lost in the noise and irrelevant variables; this has led to the creation of methodologies that are more sensitive to small and sparse process variations [
58]. Furthermore, real-life industrial data are normally noisy, variable, and subject to changing operational conditions, requiring the development of robust and adaptive monitoring plans [
59]. All of these issues underscore the relevance of incorporating adaptive control and efficient selection of variables within multivariate statistical process control systems to achieving enhanced speed and trustworthiness when identifying shifts in intricate manufacturing processes.
However, despite the acknowledged relevance of integrating adaptive control with variable selection, existing multivariate SPC methods still exhibit two critical gaps: first, most VSI-based EWMA charts lack variable selection, meaning that sparse shifts are masked by high-dimensional noise; second, existing variable selection SPC methods only use FSI. More importantly, a novel insight is that the choice of the selected variables directly affects the probability of entering the warning region. Selecting too many noisy variables triggers false short intervals, while selecting too few may miss weak signals. This interaction between variable selection and VSI zone division is entirely new and unexplored. To address this, we propose a new adaptive monitoring approach which we term the Variable Sampling Interval–Variable Selection-based Multivariate EWMA (VSI-VSME) chart. The remainder of this paper is structured as follows:
Section 2 provides a detailed exposition of the proposed VSI-VSME framework; in
Section 3, the statistical performance of the proposed chart is evaluated and discussed through comprehensive simulation studies;
Section 4 offers practical applications based on real datasets to demonstrate the methodology’s utility in operational settings; finally,
Section 5 concludes with summary remarks and directions for future research. And the
Appendix A lists all symbols and parameters used in this paper.
3. Performance Analysis
This study simulates a 20-dimensional process with different numbers of shifted variables, specifically:
In this simulation setup,
denotes the non-centrality parameter, representing the magnitude of the mean shift in the variables that undergo a change. Specifically, under the IC state, all variables have a mean of zero. When a shift occurs, the first
variables experience a mean shift of
, while the remaining variables stay at zero. By varying
, the detection performance under different shift sizes can be evaluated. In the implementation, three sets of sampling interval configurations are considered:
and ,
and ,
and .
Table 1 summarizes the
and
values for the proposed VSI-VSME scheme to achieve IC performance levels (
=
=
C) of 200, 370, and 500 when
,
, and
. All reported results are based on
Monte Carlo replications. The standard errors of the
estimates are consistently below
of the estimated value. All simulations were performed on an computer with an Intel Core i5 12600KF CPU with 32 GB RAM, using MATLAB R2024b parallelized over six workers. Moreover, the average computational time per replication for a run is approximately
s until signal.
3.1. VSI-VSME vs. FSI-VSME
This part evaluates the OOC performance of the proposed VSI-VSME scheme against the FSI-VSME scheme. Here, we set the parameter
s equal to
, which is the actual number of variables affected by the shift. Cases where
are discussed later.
Table 2,
Table 3 and
Table 4 provide a comprehensive comparison of the OOC
between the proposed VSI-VSME scheme and the FSI-VSME benchmark across three smoothing parameters
, five sparsity levels
, three IC performance levels
, and three VSI configurations
. We can draw the following conclusions:
First, regardless of
,
C, or
s, every VSI-VSME setting yields a strictly smaller
than the corresponding FSI-VSME benchmark, confirming that the adaptive sampling strategy consistently accelerates the detection of sparse shifts. The improvement is most pronounced for the most aggressive VSI design
. For example, in
Table 2 the FSI-VSME gives
, whereas the VSI-VSME with
reduces it to
, a relative reduction of about
. As
increases and
decreases, the advantage gradually shrinks; yet, even the mildest VSI setting
still outperforms the FSI scheme
for the same case.
Second, the benefit of the VSI mechanism is highly consistent across different
values. When
is small
, the
values are generally lower and the percentage improvement from VSI is larger. For instance, at
,
,
, the
VSI scheme achieves
versus an FSI value of
, a
reduction. At
in
Table 4, the same comparison
yields
versus
, a reduction of only
. These results indicate that the VSI mechanism is particularly effective when
is small.
Third, as the number of truly shifted variables
increases, both the FSI and VSI charts detect the shift faster, as expected, because a larger signal from more variables shortens the
. However, the relative improvement due to VSI tends to be larger for moderate
s (e.g.,
or 3) than for
or
. For example, in
Table 3 the reduction from FSI to VSI
is about
for
and about
for
, but reaches
for
and
for
. This suggests that the VSI strategy is most beneficial when the shift is moderately sparse.
In summary, the evidence from
Table 2,
Table 3 and
Table 4 unequivocally demonstrates that the VSI-VSME scheme provides a clear and robust improvement in detection speed over the FSI-VSME scheme across a wide range of parameters and that the improvement is largest for small
.
3.2. VSI-VSME vs. VSI- and VSI-MEWMA
Table 5,
Table 6 and
Table 7 systematically compare the OOC
of the proposed VSI-VSME scheme with those of the VSI-
and VSI-MEWMA schemes under three smoothing parameters
, various shift magnitudes
, and different choices of the sparsity parameter
s (from 1 to 5) while fixing the true number of shifted variables at
. The results reveal several critical insights.
First, across all settings, the VSI-
scheme performs dramatically worse than both VSI-MEWMA and VSI-VSME schemes, especially for small to moderate shifts
. For example, in
Table 5 the VSI-
scheme yields an
of
, whereas the VSI-MEWMA scehme achieves
and the VSI-VSME scheme (with
) reaches as low as
. This huge gap is expected because the conventional VSI-
statistic accumulates noise from all 20 dimensions, severely diluting the sparse signal.
Second, the comparison between VSI-VSME and VSI-MEWMA schemes is more nuanced and depends critically on
,
, and the choice of
s. When
is small (
,
Table 5), the VSI-VSME scheme consistently outperforms VSI-MEWMA across almost all
and
s values. For example, when
, for
, the VSI-VSME scheme with
gives
vs. the VSI-MEWMA scheme’s
; for
, we have
vs.
. This demonstrates that the variable selection mechanism effectively filters out noise and provides a cleaner signal for the EWMA recursion, which is particularly beneficial when the shift is sparse (only 2 out of 20 variables shift). As
increases beyond
, the differences narrow, and for very large shifts
the VSI-MEWMA scheme occasionally becomes slightly faster. This is because the shift is no longer “sparse” in effect; the mean shift is so large that even a naive multivariate statistic detects it almost instantly.
Third, the role of the sparsity parameter
s is highly informative. In
Table 5, when
is very small
, larger
s (e.g.,
or 5) yields lower
, meaning that allowing more variables into the selected set helps to capture the weak signal scattered across the two truly shifted variables plus noise. For moderate
, the optimal
s shifts to
or 2, i.e., a more parsimonious selection works best. For large
,
almost always gives the smallest
. This pattern is consistent with the intuition: under a very weak shift, the penalized selection needs more freedom to avoid missing the signal; under a strong shift, a tight sparsity constraint prevents noise from entering the monitoring statistic.
Fourth, as
increases to
(
Table 6) and
(
Table 7), the performance gap between the VSI-VSME and VSI-MEWMA schemes gradually diminishes; for
with small
(e.g.,
), the VSI-MEWMA scheme actually outperforms the VSI-VSME scheme for all
s. This occurs because a large
makes the EWMA statistic nearly equivalent to the current observation, reducing the benefit of exponential smoothing. At the same time, the variable selection step becomes more susceptible to sampling variability, slightly delaying detection. Nevertheless, for
, VSI-VSME with
or
still matches or beats the VSI-MEWMA scheme.
In summary, across all three tables, the VSI-VSME scheme exhibits remarkable robustness; even when s is mis-specified (e.g., while true ), the remains far below that of the VSI- scheme and often close to or better than that of the VSI-MEWMA scheme. This indicates that the proposed chart does not require exact knowledge of the number of shifted variables to deliver good performance.
4. Example Analysis
To demonstrate the practical usefulness of the proposed VSI-VSME scheme, we apply it to a real dataset from an automotive sheet metal assembly process. The data, adapted from Table 5.14 in [
60], consist of thickness deviation measurements recorded at six critical locations on automobile body panels. These six variables are correlated and their dimensional accuracy directly affects the vehicle’s structural integrity, safety, corrosion resistance, and final assembly fit. In this case study, the first 30 observation vectors are treated as IC samples and are used to estimate the process parameters and to establish control limits; the remaining 20 observation vectors are monitored sequentially to detect potential mean shifts that may arise from tool wear, material variations, or assembly misalignment. For all competing schemes, we set the same VSI parameter configuration of
to ensure a fair comparison and fix the IC
at 370. The VSI-
, VSI-MEWMA, and VSI-VSME schemes (with
) are then applied to monitor the 20 test observations.
The monitoring results are displayed in
Figure 2. The VSI-
scheme issues an OOC alarm at time
, while the VSI-MEWMA scheme triggers an alarm at
. For the proposed VSI-VSME chart, the detection time depends on the choice of the sparsity parameter
s. Specifically, when
, the chart signals an OOC condition at
. When
or
, detection occurs slightly later at
, which is still faster than the VSI-
scheme and comparable to the VSI-MEWMA scheme. This pattern indicates that the VSI-VSME scheme is not overly sensitive to the exact choice of
s, with even moderately mis-specified
s still yieldng good performance. More importantly, the fact that
achieve the fastest detection suggests that the underlying shift likely affects about two to four of the six monitored dimensions. In summary, the VSI-VSME scheme offers a practical and efficient tool for high-dimensional process monitoring in real manufacturing environments where sparse shifts are common and quick detection is critical for minimizing quality loss.
5. Conclusions
This study is motivated by a fundamental challenge in high-dimensional statistical process monitoring, namely, detecting small sparse shifts in the mean vector when only a small subset of variables is actually OOC. Traditional multivariate control charts suffer from severe noise accumulation in high dimensions, while standard adaptive sampling approaches predominantly focus on temporal adjustments (varying sampling intervals or sample sizes) without addressing the variable selection problem. To fill this gap, we propose a novel scheme which integrates two complementary mechanisms: (i) a forward variable selection algorithm that automatically identifies the most likely shifted variables and filters out irrelevant noise, and (ii) a variable sampling interval strategy that shortens the sampling interval when the process enters a warning region and lengthens it when the process appears stable.
Through extensive Monte Carlo simulations, we demonstrate several key findings. First, the simulation results strongly support that conclusion that the proposed VSI-VSME scheme consistently detects sparse shifts faster than the FSI-VSME scheme. This improvement is most pronounced for small smoothing parameters. Second, the proposed scheme exhibits a dramatic advantage over the VSI- and VSI-MEWMA schemes. This advantage is particularly evident for small and small to moderate shift magnitudes, where the variable selection component is able to successfully isolate the sparse signal from high-dimensional noise. Third, the proposed VSI-VSME scheme is robust to mis-specification of the sparsity parameter s. Even when s deviates from the true number of shifted variables , the performance rarely deteriorates below that of the VSI- and VSI-MEWMA schemes.
A potential direction for future research would be to explore dynamic selection of s during Phase-II monitoring. Moreover, the current variable selection procedure assumes that the IC covariance matrix is known or can be estimated accurately from a large Phase-I dataset. Developing a VSI-VSME chart that accounts for estimation uncertainty (e.g., through adjusted control limits or bootstrap calibration) would enhance its practical reliability. We believe that the VSI-VSME framework provides a solid foundation for these future innovations.