1. Introduction
The Fifth Industrial Revolution (Industry 5.0) recasts industrial value creation around human–machine collaboration, sustainability, and resilience [
1,
2,
3]. Within this paradigm, manufacturing security extends beyond perimeter and network defense to the integrity, reproducibility, and interpretability of the AI-driven decision systems on which human operators rely [
4,
5]. Failures in these properties—silent data corruption, non-reproducible model outputs, or opaque classifications—translate directly into unsafe process adjustments, downstream defects, and erosion of operator trust.
Injection molding, a core process in automotive, electronic, and precision-component manufacturing, illustrates these risks at scale. Modern lines generate large volumes of heterogeneous sensor and PLC data transmitted through industrial APIs to cloud-integrated Manufacturing Execution Systems (MESs) for quality control [
6,
7]. Three structural conditions complicate this pipeline: defect samples are rare and class-imbalanced [
8,
9]; nonlinear process behavior and material variability render fixed-threshold rules brittle [
10,
11]; and cloud transmission exposes the pipeline to integrity threats such as silent corruption, replay, and man-in-the-middle injection [
12,
13]. Adversarial machine learning research further shows that small, semantically plausible perturbations can deflect industrial classifiers without operator awareness [
14,
15].
Recent contributions to the Industry 5.0 security agenda address complementary layers. Nsoh [
16] proposed a Human-Centric Zero-Trust Identity Architecture (HC-ZTIA) securing the identity plane. Jho and Youn [
17] introduced stateful order-preserving encryption (SOPE) for cloud-resident data confidentiality. Jung and Kim [
18] analyzed integrity vulnerabilities in YOLO-based industrial vision pipelines. A complementary gap remains: securing the integrity and reproducibility of unsupervised ML-driven decisions on the shop floor, where labels are scarce and operators act on probabilistic outputs.
1.1. Operational Definition of Human-Centric Manufacturing Security
We operationalize human-centric manufacturing security through three measurable properties of the ML decision pipeline:
- (i)
Data Integrity: every sensor payload is hashed with SHA-256 in the Device Zone and verified in the Database Zone; the target is a 100% verification pass rate under normal operation, with mismatches triggering quarantine and operator alert.
- (ii)
Decision Reproducibility: clustering outputs must be statistically stable across random initializations and covariance configurations; the target is a Silhouette Score standard deviation σ ≤ 0.02 over 30 independent runs.
- (iii)
Interpretable Attribution: defect-related clusters must be associated with human-readable process-variable signatures rather than opaque latent codes.
Throughout this paper, the term “secure” denotes these three measurable properties under the passive-adversary threat model in
Section 3.1 rather than resistance to active attacks on the model; the latter is explicitly out of scope (
Section 5.3).
1.2. Hypotheses and Contributions
This study tests two hypotheses.
H1. Integrity-aware preprocessing (hash verification, IQR-based clipping, normalization) and correlation-based feature selection improve clustering reproducibility under industrial data imbalance.
H2. Probabilistic soft assignments via GMM provide a generative, interpretable representation of operating regimes—operator-readable per-variable signatures with calibrated per-cycle membership—complementing rather than outperforming hard-assignment baselines on internal cluster-validity indices.
The main contributions are: (i) a measurable, three-property operationalization of human-centric manufacturing security for unsupervised industrial ML; (ii) an operating-regime-analysis framework integrating SHA-256 integrity verification, TLS-secured transmission, correlation-based feature selection, and GMM clustering, with an explicit passive-adversary threat model; and (iii) empirical evidence on
n = 6719 real injection molding cycles showing a reproducible Silhouette Score of 0.323 ± 0.014, a statistically significant +18.2% improvement from feature selection over the all-feature baseline (paired t(29) = 3.39,
p = 0.002), and stable cluster topology across four covariance structures, positioned relative to identity-layer [
16], cloud-encryption [
17], and pipeline-vulnerability [
18] approaches.
2. Related Work
2.1. Human-Centric Security in Industry 5.0
The Industry 5.0 agenda originates from the European Commission’s vision of a sustainable, human-centric, resilient European industry [
1] and subsequent surveys characterizing the transition from Industry 4.0 [
2,
3]. Nsoh [
16] proposed HC-ZTIA, situating identity as the control plane and extending NIST SP 800-207 [
19] zero-trust principles. Jho and Youn [
17] proposed SOPE, preserving privacy in cloud databases while supporting range queries. Jung and Kim [
18] analyzed integrity vulnerabilities in OpenCV-YOLO-based vision pipelines. Collectively, these address identity, data-at-rest, and pipeline layers; the integrity and reproducibility of unsupervised ML-driven decisions on imbalanced industrial data remain less developed.
2.2. Unsupervised Anomaly Detection for Manufacturing Quality Control
Unsupervised approaches are attractive because defective samples are rare and labels costly [
8,
20]. GMMs have been widely applied for their probabilistic modeling and interpretable structure [
21]. Oluwasegun and Jung [
21] applied multivariate GMMs to anomaly detection in nuclear control element drive mechanisms; Geng et al. [
22] proposed an unsupervised deep model for online anomaly detection in continuous casting. In injection molding, supervised and ensemble approaches have addressed sink-mark detection [
10], shrinkage surrogate modeling [
11], and quality prediction [
9]. Kim et al. [
23] emphasized that preprocessing quality strongly influences anomaly-detection reliability. Most prior works prioritize predictive accuracy and rarely treat reproducibility, integrity, or interpretability as first-class criteria.
2.3. Data Integrity and Security in Industrial Cyber-Physical Systems
Cloud APIs, MESs, and OT networks expose ML pipelines to integrity threats. NIST SP 800-82 Rev. 3 [
24] and IEC 62443-3-3 [
25] codify integrity, authentication, and access-control requirements. Sebestyen et al. [
12] reviewed IoT security and identified data integrity and reliable decision-making as central challenges; Riaz et al. [
13] proposed a robust anomaly detector for imbalanced Industrial Internet of Things (IIoT) data; Zhang et al. [
26] introduced IDD-Net for data-quality variability. Beyond classical threats, adversarial machine learning shows small perturbations can mislead industrial classifiers [
14,
15], and federated learning [
27] has been proposed as a complementary defense. Recent studies further extend these threats to data poisoning and false-data-injection attacks in IIoT and Industry 5.0 environments [
28,
29], underscoring the need for integrity verification at ingestion.
2.4. Research Gap and Positioning
Identity-layer (HC-ZTIA [
16]) and cryptographic-storage (SOPE [
17]) approaches secure who can act and how data are kept at rest but do not constrain the integrity and reproducibility of unsupervised ML-driven decisions on the shop floor; vulnerability analyses [
18] reveal attack surfaces but stop short of an affirmative defense. The present study addresses this gap by combining integrity-verified ingestion, TLS-secured transmission, and a reproducibility-evaluated GMM.
Method-selection rationale: The three measurable security properties mentioned in
Section 1.1 motivate each methodological choice. Data integrity is enforced by SHA-256 verification and TLS transmission, the minimal mechanisms that detect payload corruption at ingestion under a passive-adversary model. Decision reproducibility and interpretability motivate the choice of a Gaussian Mixture Model over hard-partition or black-box alternatives: its probabilistic soft assignment represents process drift and uncertainty rather than forcing rigid boundaries, and its per-component means yield operator-readable signatures in physical process variables—both achievable without scarce defect labels. Correlation-based feature selection precedes clustering because the load sensors are strongly collinear (|ρ| ≥ 0.80), and removing redundancy stabilizes covariance estimation, which is the primary driver of run-to-run reproducibility. These choices are consequences of the security properties the framework is designed to deliver, not arbitrary algorithm selection.
3. Materials and Methods
3.1. Threat Model
We adopt a passive-adversary (honest-but-curious) model augmented with data-integrity threats during transmission. The adversary may observe data in transit between the Device, Database, and Machine Learning Zones and may attempt unauthorized modification (e.g., man-in-the-middle injection, replay).
Active attacks on the ML model itself (adversarial-example crafting, training-data poisoning) and verbatim replay of valid payloads are out of scope for the current evaluation and are identified in
Section 5.3 as priority extensions.
3.2. System Architecture
Figure 1 illustrates the architecture, organized as three zones (Device, Database, Machine Learning) connected by a secured Cloud Server. All inter-zone traffic uses TLS 1.3 over HTTPS. In the Device Zone, each acquisition cycle produces a payload to which a SHA-256 hash is appended; in the Database Zone, hash verification gates ingestion; in the Machine Learning Zone, the GMM consumes integrity-verified records and emits cluster outputs with reproducibility metadata (seed, covariance structure, run identifier).
3.3. Data Description and Acquisition
Data were collected from an injection molding production line from 25 October 2024 to 26 October 2024, yielding n = 6719 cycles. Seven process variables were retained after data-availability filtering: slide_position, load_160_l_rd, load_160_l_ton, load_160_r_rd, load_160_r_ton, load_160_lr_rd, and load_lr_ton, acquired from PLCs and the monitoring system. All records were anonymized and consolidated as structured tabular data.
Operator visual-inspection notes were available only at an aggregate level and were not retained as per-cycle labels aligned to the 6719 records; consequently, they are used as qualitative context only and not for any quantitative external validation (
Section 5.3). A representative sample of the raw structure is shown in
Table 1.
3.4. Integrity-Aware Preprocessing
3.4.1. SHA-256 Hash-Based Integrity Verification
For each acquisition cycle c, the Device Zone constructs a payload P_c containing timestamp, machine identifier, and the measured-variable vector. A SHA-256 digest h_c = SHA-256(P_c) is appended and transmitted under TLS 1.3. In the Database Zone the digest is recomputed and compared bytewise; mismatched records are logged, excluded, and surfaced to operators. In the present dataset, 100% of records passed verification under nominal conditions.
We note that SHA-256 verification detects payload corruption and modification but does not by itself detect a verbatim replay of a valid payload; replay resistance via timestamp-freshness/nonce checking is identified as a delimited extension (
Section 5.3). Injected-perturbation evaluation is future work.
3.4.2. Missing-Value Handling, Outlier Treatment, and Normalization
Variables with >10% missing values were excluded; the remaining missing entries were imputed with the per-variable median. Outliers were detected per variable using the IQR criterion (beyond Q1 − 1.5 × IQR or Q3 + 1.5 × IQR) and boundary-clipped rather than removed. All features were then scaled to [0, 1] via min-max normalization.
3.5. Mathematical Formulation of the Gaussian Mixture Model
Notation: the symbols used in Equations (1)–(9) are defined in
Table 2.
The GMM models the data density as p(x) = Σk πk N(x|μk, Σk) (1), with the multivariate Gaussian (2) subject to Σk πk = 1, πk ≥ 0 (3). Parameters Θ are estimated by Expectation Maximization. The E-step computes responsibilities rik (4); the M-step updates Nk (5), πk (6), μk (7), and Σk (8). Iterations proceed until the observed-data log-likelihood L(Θ) (9) converges within tolerance ε.
Equations (1)–(9) are the standard GMM/EM formulation; the contribution is not a modification of them but the integrity-aware pipeline that governs which data enter them and how their outputs are audited. Only SHA-256-verified, preprocessed records (
Section 3.4.1 and
Section 3.4.2) form the input set {x
i}; the EM updates (Equations (4)–(8)) run under fixed, logged hyperparameters (K, covariance type, tolerance ε) over 30 seeds; and the soft assignments r
ik (Equation (4)) and log-likelihood L(Θ) (Equation (9)) are emitted with reproducibility metadata. The configuration choices inside these equations—K, covariance structure, and ε—are precisely the knobs whose stability is quantified against the σ ≤ 0.02 target in
Section 1.1.
The number of components K was set to K = 4 on domain grounds—one low-load regime consistent with nominal operation and three recurring elevated-load regimes (left-side, right-side, and bilateral) identified jointly with line engineers—and verified for stability across seeds and covariance structures (
Section 4.6). For transparency, the Bayesian Information Criterion (BIC) for the selected-feature model is reported in
Table 3; it decreases monotonically over K = 2.8 and does not exhibit an interior minimum or elbow, so BIC was not used as the primary selection criterion. The “full” covariance structure is the primary configuration.
3.6. Correlation-Based Feature Selection
Pearson pairwise correlations were computed across the preprocessed variables (
Figure 2). Several load-related variables exhibit strong positive correlations, indicating coordinated mechanical behavior, while load_160_r_rd shows weak or negative correlations with the left-side group.
Applying the |ρ| ≥ 0.80 criterion, five collinear pairs were identified, all within the left/aggregate load group: load_160_l_rd and load_160_lr_rd (ρ = 0.87), load_160_l_ton and load_lr_ton (ρ = 0.88), load_160_l_rd and load_160_l_ton (ρ = 0.81), load_160_l_rd and load_lr_ton (ρ = 0.81), and load_160_lr_rd and load_lr_ton (ρ = 0.81). Because min-max normalization equalizes all post-normalization ranges, the redundant variable cannot be chosen by dynamic range; instead, a single representative of the cluster was retained on the basis of partition quality. Each candidate representative (combined with slide_position, load_160_r_rd, and load_160_r_ton) was evaluated over 30 seeds: load_lr_ton, the aggregate left–right tonnage, yielded the most well-separated and reproducible partition (Silhouette 0.323 ± 0.014), compared with 0.292 (load_160_l_ton), 0.214 (load_160_lr_rd), and 0.126 (load_160_l_rd). Accordingly, load_160_l_rd, load_160_l_ton, and load_160_lr_rd were removed. The four retained, non-redundant variables—slide_position, load_160_r_rd, load_160_r_ton, and load_lr_ton—constitute the clustering feature space; all GMM results in
Section 4.2,
Section 4.3,
Section 4.4,
Section 4.5 and
Section 4.6 are computed on these four variables.
3.7. Implementation, Hyperparameters, Reproducibility, and Complexity
The framework was implemented in Python 3.11 with scikit-learn 1.3.0, NumPy 1.26, and Pandas 2.1. GMM training used n_components = 4, covariance_type = “full” (primary), n_init = 10, init_params = “kmeans”, max_iter = 200, tol = 1 × 10−3, reg_covar = 1 × 10−6, with 30 runs at random_state in {0, …, 29}.
Computational complexity: Each EM iteration costs O(N × K × d^2 + K × d^3) (responsibilities and covariance updates plus per-component inversion), giving a total of O(I × n_init × (N × K × d^2 + K × d^3)) for I iterations and space O(N × d + N × K + K × d^2). With N = 6719, K = 4, d = 4, cost is linear in N; the measured wall clock was 0.62 s per run (n_init = 10) and 0.055 s per single EM run. The GMM scales linearly in N, whereas the One-Class SVM and LOF baselines scale quadratically (~O(N^2)), a practical deployment advantage.
Table 4 summarizes the complexity of all the methods.
4. Results
4.1. Process Data Characteristics
Table 5 reports the descriptive statistics of the seven variables (
n = 6719): slide_position has a narrow range (439–491, SD 5.86), while load-related variables span much wider ranges with right-skewed distributions concentrated at zero (
Figure 3), indicating that high-load events are rare relative to nominal cycles.
4.2. GMM-Based Clustering of Operating Regimes
The GMM partitioned the 6719 cycles into four operating regimes of comparable size rather than a single dominant cluster with rare outliers: a low-load regime consistent with nominal operation (N,
n = 1645), a right-side load regime (R,
n = 2198), a left-side load regime (L,
n = 1763), and a bilateral high-load regime (B,
n = 1113). Because the regimes are comparably represented, they are described as distinct operating regimes rather than a normal/rare-defect dichotomy; the association of the three elevated-load regimes with defect-prone conditions is a domain-based hypothesis not validated against per-cycle operator labels, which were unavailable (
Section 3.3). The soft-assignment property represents each cycle by a membership distribution, allowing gradual transitions to be monitored.
4.3. Regime Interpretation
Regime L combines elevated left-side load (load_160_l_rd ~ 30.7, load_160_lr_rd ~ 30.7) with mild left torque and near-zero right-side load, consistent with left-side mechanical loading such as material-flow imbalance or mold misalignment (
Figure 4). Regime R shows elevated right-side load (load_160_r_rd ~ 8.8) with negligible left activity. Regime B shows simultaneously elevated left and right load with the highest torque and slide-position offset, consistent with bilateral loading during critical molding stages. Regime N shows uniformly low load and the most stable slide_position. These signatures are operator-readable, satisfying the interpretability property from
Section 1.1; they constitute physically motivated hypotheses about defect-prone conditions rather than label-validated defect classes.
Regime occurrence was non-uniform across the two-day window (
n = 930 on 25 October 2024;
n = 5789 on 26 October 2024). The bilateral-load regime B was concentrated in the first day’s run (586 of 1113 cycles, 53%), whereas the second day was dominated by the low-load and single-side regimes (
Figure 5). This non-random temporal concentration supports the interpretation that the regimes reflect genuine process dynamics rather than stochastic noise.
4.4. Ablation Study: Contribution of Correlation-Based Feature Selection
Correlation-based feature selection improved the Silhouette Score from 0.274 ± 0.075 (all seven features) to 0.323 ± 0.014 over 30 matched random seeds—a +18.2% relative improvement (paired t(29) = 3.39,
p = 0.002; Cohen’s d = 0.62; SD of paired differences = 0.080; Wilcoxon W = 123,
p = 0.022). The Davies–Bouldin Index decreased from 1.63 ± 0.24 to 1.17 ± 0.11, and the Silhouette SD contracted from 0.075 to 0.014, indicating substantially more-reproducible partitions (
Table 6).
4.5. Comparison with Baselines
The proposed GMM was compared against five unsupervised baselines under identical preprocessing and the same 30-seed schedule: K-means, Principal Component Analysis (PCA) reconstruction-error thresholding, Isolation Forest, One-Class SVM, and LOF (
Table 7). On the Silhouette criterion, the hard-partition baseline (K-means) and reconstruction baselines score higher than the proposed GMM. The Silhouette rewards compact, well-separated hard partitions and does not measure the properties this framework targets. The GMM is therefore positioned not as the Silhouette-optimal or the most seed-stable method—K-means (0.476 ± 0.001) and Isolation Forest (0.352 ± 0.011) are both more reproducible on this dataset—but as a generative model that yields operator-readable per-variable regime signatures while still meeting the absolute reproducibility target (σ = 0.014 ≤ 0.02). On this near-deterministic data, the soft assignments are sharp (mean maximum posterior responsibility 0.999; only 2 of 6719 cycles below 0.90), so their practical value lies in monitoring drift toward regime boundaries rather than in resolving ambiguous memberships. The full comparison is reported transparently.
4.6. Reproducibility and Stability
Over 30 random seeds, the proposed (full-covariance, four-feature) model attained a Silhouette standard deviation of 0.014, satisfying the σ ≤ 0.02 target. Across covariance structures, the four-regime topology was preserved (
Figure 6), with 30-seed mean Silhouette Scores of 0.352 ± 0.016 (tied), 0.326 ± 0.021 (diagonal), 0.414 ± 0.049 (spherical), and 0.323 ± 0.014 (full). The primary full and tied configurations met the σ ≤ 0.02 target; the spherical structure showed higher run-to-run variability (SD 0.049), reported for completeness.
To assess the choice of K, Silhouette, Davies–Bouldin, and BIC were computed across K = 2–8 on the selected features (
Table 8). No internal index selects K = 4: Silhouette is maximized and Davies–Bouldin minimized at K = 3, while BIC decreases monotonically; K = 4 is therefore a domain-informed choice (
Section 5.3).
5. Discussion
The results support a revised reading of the two hypotheses. Integrity-aware preprocessing with correlation-based feature selection (H1) produced a statistically significant, moderate improvement in clustering quality (Silhouette +18.2%, p = 0.002, Cohen’s d = 0.62) and a marked reduction in run-to-run variance (SD 0.075 -> 0.014), supporting the reproducibility property. The probabilistic soft-assignment property of the GMM (H2) yielded interpretable, operator-readable regime signatures. We note explicitly that the GMM does not achieve the highest Silhouette among the evaluated methods; its contribution lies in a generative, interpretable representation of operating regimes—operator-readable per-variable signatures with calibrated membership—while meeting the absolute σ ≤ 0.02 reproducibility target, rather than in maximizing a single internal index or in being the most seed-stable method.
5.1. Positioning Within the Industry 5.0 Security Stack
The framework complements rather than replaces recent work: HC-ZTIA [
16] secures identity, SOPE [
17] secures data-at-rest confidentiality, and the OpenCV-YOLO analysis [
18] surfaces pipeline vulnerabilities. The present work addresses the integrity and reproducibility of unsupervised ML-driven decisions during process execution, pointing toward a defense-in-depth view of human-centric security in Industry 5.0.
5.2. Practical Implications for Operators
The soft-assignment property lets operators monitor the confidence margin between competing regimes, enabling earlier attention to cycles drifting from the low-load state. Regime signatures are expressed in physical process variables, so corrective actions can be reasoned about without algorithmic expertise.
5.3. Limitations
Several limitations should be acknowledged. First, K = 4 was set on domain grounds (one nominal and three elevated-load regimes identified with line engineers) and reflects the present line and product mix; internal indices over K = 2–8 on the selected features do not exhibit an interior optimum at K = 4 (Silhouette and Davies–Bouldin favor K = 3, and BIC decreases monotonically), so K = 4 is a domain-informed modeling decision rather than one selected by an internal criterion. Second, per-cycle operator labels were unavailable, so the regimes are not validated against named defect categories; a label-based external evaluation is future work. Third, the evaluation uses a single injection molding line over a two-day window with uneven daily volume (
n = 930 on 25 October 2024;
n = 5789 on 26 October 2024); the results should be read as a single-line feasibility demonstration of the integrity-, reproducibility-, and interpretability-oriented pipeline rather than as evidence of cross-line or cross-material generalization. Fourth, the threat model addresses passive adversaries and transmission-integrity threats; active attacks on the model (poisoning, evasion) and verbatim replay are out of scope. Fifth, privacy-preserving techniques such as differential privacy and federated learning [
27] are not yet integrated.
5.4. Future Work
Future work will (i) conduct multi-line, multi-shift, and multi-material validation as the primary objective; (ii) add per-cycle labeled external validation (precision/recall/F1, confusion matrix, Adjusted Rand Index); (iii) extend the threat model and evaluation to adversarial robustness, data-poisoning resistance, and replay resistance [
28,
29]; (iv) incorporate online concept-drift detection so that K and the component parameters adapt as the product mix evolves [
30,
31,
32]; and (v) integrate federated, privacy-preserving learning for multi-factory deployment.
6. Conclusions
This study proposed a secure machine learning framework—operationalizing human-centric manufacturing security as measurable integrity, reproducibility, and interpretability—for operating-regime analysis in injection molding under an explicit passive-adversary threat model. Integrating SHA-256 ingestion verification, TLS-secured transmission, correlation-based feature selection, and GMM clustering, the framework attained a Silhouette Score of 0.323 ± 0.014 over 30 seeds (95% CI [0.318, 0.329]), a statistically significant +18.2% improvement from feature selection over the all-feature baseline (p = 0.002), with the Davies–Bouldin Index reduced from 1.63 to 1.17 and decision reproducibility meeting the σ ≤ 0.02 target. Rather than outperforming hard-partition baselines on internal indices, the framework contributes integrity-verified ingestion, reproducible probabilistic clustering, and interpretable attribution to the emerging Industry 5.0 security stack. Future work will extend validation to multiple lines and materials and to active adversarial threats.