1. Introduction
Industrial mass production processes require the development of protocols to control various quality specifications. Some of these specifications can be visually recognized [1]. In processes with high production volumes or speeds, executing these protocols through human visual inspection is unreliable and often impossible. In these cases, artificial vision systems must be implemented that, along with adequate processing, allow for automatic decision-making on whether a product meets the desired specifications. Automation of these systems is critical to reduce human error, improve consistency, and ensure product quality at scale.
The implementation of real-time fault detection and diagnosis (RT-FDD) is of paramount importance in Industry 4.0 environments, where immediate feedback is critical to sustaining efficient production and avoiding the propagation of failures. Industrial processes are generally classified into two categories: continuous and discrete. Continuous processes, such as production and packaging in the pharmaceutical industry, operate uninterruptedly and maintain a steady output. This mode of operation results in relatively stable and predictable conditions, where deviations typically emerge gradually and are often linked to wear or efficiency losses [2].
In the design of new fault detection methods, it is a priority to ensure effectiveness so that they are accurate in their analysis, as errors can lead to economic losses and even safety failures. Another important factor to consider is the processing speed, as the system’s response to detecting faults must be consistent with production flow speeds. This is particularly important in real-time systems, where delays could cause disruptions in the production line, leading to inefficiencies and increased costs.
Although Support Vector Machines yield a unique, optimal solution and can model any type of training set, they incur high memory and training-time costs [3]. This leads to the need to reduce the dimensionality of the input data as much as possible. PCA is widely used in fault detection methods [
4,
5]. This method allows for patterns in the data population to be revealed with lower dimensionality, generating more efficient processing [
6,
7,
8,
9].
1.1. Related Works
Fault detection systems have been widely studied and applied across a broad range of industries. Ref. [10] classifies fault diagnosis methods for industrial processes, distinguishing those based on pattern recognition from those using AI. In these methods, a reference pattern is available; the data is compared with this pattern, and by means of some measurable magnitude it is decided whether the sample presents anomalies or meets the specifications. However, the complexity and variability of industrial environments often require more sophisticated approaches. Recent advancements have leaned towards AI-based methods for higher accuracy and adaptability.
The adoption of Principal Component Analysis (PCA) in industrial process monitoring can be traced back to the late 1980s. Ref. [
11] demonstrated the applicability of PCA for analyzing process data, using a case study based on a ceramic smelter employed in nuclear waste reprocessing. Their findings highlighted the potential of PCA to improve process insight by examining the contribution of individual variables to the principal components derived from historical operational data [
12].
Several studies have leveraged PCA in innovative ways to enhance fault detection across different contexts. Ref. [
13] combined PCA and LDA using a probabilistic fusion model to achieve robust fault diagnosis in induction motors under noisy conditions. Ref. [
14] proposed PCANet, which integrates PCA with block-wise image histograms in a lightweight neural-style architecture, underscoring the compatibility of PCA and histogram-based representations in image analysis. Ref. [
4] explores the use of PCA in cement rotary kilns for fault detection and diagnosis, demonstrating its ability to reduce the dimensionality of the large sensor dataset while maintaining detection accuracy. In another case, ref. [
5] applies PCA to nuclear power plants, combining it with unsupervised machine learning techniques to monitor system health and detect anomalies. Ref. [
15] proposed a robust fault detection framework that integrates multiscale Principal Component Analysis (MSPCA) with a Kantorovich distance (KD)-based approach. By applying wavelet decomposition, the method extracts multi-resolution features from process signals, enhancing sensitivity to faults under noisy conditions. The KD metric is then used to assess deviations in the projected data space, with non-parametric thresholding enabling flexible and effective anomaly detection. This approach demonstrated improved performance over traditional PCA and MSPCA in scenarios involving drift, bias, and intermittent faults. Ref. [
16] introduced a dynamic fault detection approach based on Dynamic Kernel Principal Component Analysis (DKPCA) combined with a Weighted Structural Difference (WSD) metric. The method captures nonlinear and dynamic behavior in industrial processes by projecting time-series data into a high-dimensional feature space using kernel functions, followed by the extraction of dynamic correlations through DKPCA. The WSD metric, computed over sliding windows, quantifies structural variations in the evolving data distribution by considering both mean and variance changes, thus enhancing sensitivity to process shifts while maintaining robustness against non-Gaussian noise.
Other applications of PCA can be found. For example, ref. [
17] addresses the challenge of non-linear fault detection in chemical processes employing Kernel PCA (KPCA) in combination with the Generalized Likelihood Ratio Test (GLRT). This method capitalizes on the ability of KPCA to project the data into a higher-dimensional space, making it easier to separate faulty and nonfaulty instances. The residual is then computed in the original space, enhancing the detection of anomalies.
However, KPCA is often suboptimal for uncertain or highly variable systems, as its processing requirements grow significantly with larger datasets. This limitation is addressed in [
18], where a nonlinear Fault Detection Method based on Interval Reduced Kernel PCA (IRKPCA) is developed to monitor processes with uncertainty. The technique uses interval-valued Euclidean distances to retain only the most relevant measurements, reducing computational cost while maintaining high detection accuracy.
A similar approach is presented by [
19], which proposes Reduced Kernel PCA (RKPCA) to monitor industrial processes. By decreasing the number of observations in the data matrix based on a dissimilarity metric, the system can reduce redundancy and improve processing times. This method is particularly useful in systems where large volumes of data are generated, such as in the petrochemical industry.
Recent contributions have also explored sparsity-constrained formulations to enhance variable selection and fault isolation. For instance, some works have incorporated sparsity-inducing norm optimization into PCA and CCA frameworks to achieve joint sparsity and reduce variable redundancy, resulting in improved detection speed and accuracy in benchmark industrial processes such as the Tennessee Eastman and cylinder–piston systems, as seen in [
20,
21]. These approaches highlight the growing relevance of sparse optimization in process monitoring and its potential to complement traditional PCA-based methods.
For image-based fault detection, particularly in pharmaceutical production, the challenge often lies in processing high-resolution images of products such as pill blisters efficiently. Traditional methods have struggled to keep up with the demand for real-time analysis without sacrificing accuracy. A deep learning approach using CNNs has been explored in [
22] for the Tennessee Eastman process. The method successfully isolates various faults using sensor data and achieves a fault isolation performance of more than 98%.
Moreover, Variable Moving Window Kernel PCA (VMWKPCA) has been employed to diagnose faults in dynamic processes such as the Continuous Stirred Tank Reactor (CSTR) process [
23]. In this case, a structured partial VMWKPCA is utilized to detect and diagnose faults. The proposed method demonstrates superior efficacy, particularly in complex, non-linear systems where traditional methods struggle to provide timely and accurate diagnostics.
1.2. Main Contribution
Despite the aforementioned advancements, the challenge of reducing processing time remains, particularly when dealing with large datasets and high-resolution images. In this study, we propose a novel method that combines image histograms with PCA to improve both accuracy and processing speed in fault detection systems. Histograms are utilized for feature extraction from images, which helps to reduce the dimensionality of the data before applying PCA. This method is tested on pill blister images to identify missing pills. Owing to the effectiveness of the feature extraction process, the classification task can be accomplished using a single neuron, highlighting the discriminative power of the proposed representation.
By leveraging the strengths of both PCA and image histogram techniques, this method reduces computational complexity while maintaining high detection accuracy, which is critical in real-time production environments. Our method shows promising results in detecting faulty blisters, paving the way for further applications in pharmaceutical quality control.
This paper is organized as follows: in
Section 2, preliminary concepts are presented to introduce the topic of feature classification.
Section 3 elaborates on the new proposed method with an application in the classification and detection of faults in a pill blister.
Section 4 presents the training process of the SVM and the results obtained in fault classification, together with a comparison against other state-of-the-art classifiers. Section 5 discusses the results and limitations of the method. Finally, in Section 6, the conclusions are stated.
2. Preliminaries
This section presents an analysis of cluster classification methods and anomaly detection in objects. It is necessary to first review the fundamental concepts on which the present research is based, particularly those related to Support Vector Machines and Principal Component Analysis.
2.1. Support Vector Machines
Based on the development seen in [24], let us consider a training set $\{(\mathbf{x}_i, d_i)\}_{i=1}^{N}$, where $\mathbf{x}_i \in \mathbb{R}^m$ is the vector corresponding to the i-th input and $d_i$ is its desired value. Assuming that the clusters determined by the input vectors are linearly separable, then the equation that defines the hyperplane of separation between them would be defined by:
$$\mathbf{w}^{T}\mathbf{x} + b = 0, \qquad (1)$$
where $\mathbf{x}$ is the input vector, $\mathbf{w}$ is the weight vector, and $b$ is a bias coefficient.
Let us consider that for one cluster, the desired value is $d_i = +1$, and for the other, it is $d_i = -1$. Then, we would have
$$\mathbf{w}^{T}\mathbf{x}_i + b \geq 0 \quad \text{for } d_i = +1, \qquad (2)$$
$$\mathbf{w}^{T}\mathbf{x}_i + b < 0 \quad \text{for } d_i = -1. \qquad (3)$$
For a given $\mathbf{w}$ and $b$, the distance between the hyperplane defined in (1) and the nearest vector $\mathbf{x}$ (considering the Euclidean distance) is known as the margin and is symbolized by $\rho$.
The goal of Support Vector Machines is to maximize the value of $\rho$ to optimize classification, minimizing decision errors. The key to this type of machine learning is to find the values of the parameters $\mathbf{w}_o$ and $b_o$ that define the optimal hyperplane for the previously defined training set. This pair of parameters must satisfy that
$$\mathbf{w}_o^{T}\mathbf{x}_i + b_o \geq +1 \quad \text{for } d_i = +1, \qquad (4)$$
$$\mathbf{w}_o^{T}\mathbf{x}_i + b_o \leq -1 \quad \text{for } d_i = -1. \qquad (5)$$
The input vectors that, in correspondence with their desired output, satisfy the equality in (4) or (5) are known as support vectors. Being the vectors closest to the separation hyperplane, they are the most sensitive to classification. This is why they play a fundamental role in calculating the hyperplane.
If (4) and (5) are combined into a single condition, we obtain
$$d_i\left(\mathbf{w}^{T}\mathbf{x}_i + b\right) \geq 1, \quad i = 1, \ldots, N. \qquad (6)$$
The training is solved by finding the values of $\mathbf{w}$ and $b$ that minimize the norm function $\Phi(\mathbf{w}) = \tfrac{1}{2}\,\mathbf{w}^{T}\mathbf{w}$ under the condition defined in (6). This training could be performed, for example, using the method of Lagrange multipliers.
2.2. Principal Component Analysis
Suppose we have m-dimensional input vectors that will be used to train and execute a classifier. The classification is based on these m characteristics that define each sample. It is known that increasing the number of inputs to a neural network leads to an increase in its complexity, requiring greater memory and processing costs. The question that arises is whether there are some of these characteristics (or linear combinations of them) that explain most of the information provided by the input sample, without the need to use all of them.
Given the matrix $X = [\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_N] \in \mathbb{R}^{m \times N}$, where $\mathbf{x}_i$ are the input vectors, and assuming that each row of $X$ has zero mean, the variance–covariance matrix is formed as
$$C = \frac{1}{N-1}\, X X^{T}. \qquad (7)$$
We seek to find a vector $\mathbf{v}$ on which to project the inputs such that the variability given by $\mathbf{v}^{T}X$ is maximized. This variability is calculated as
$$\operatorname{var}\left(\mathbf{v}^{T}X\right) = \mathbf{v}^{T} C\, \mathbf{v}. \qquad (8)$$
If we maximize (8) under the condition that $\mathbf{v}$ has norm equal to 1, we find that $\mathbf{v}$ must be an eigenvector of $C$ associated with the largest eigenvalue. The vector $\mathbf{v}_1$ is known as the first principal component. If we want to find the other principal components, we will find them as the eigenvectors associated with the next largest eigenvalues ordered in decreasing order. In this way, it is possible to present the samples in a p-dimensional space, with $p < m$, while maintaining the highest percentage of variability. This percentage that is maintained when projecting onto the first p principal components is given by
$$\frac{\sum_{i=1}^{p} \lambda_i}{\sum_{i=1}^{m} \lambda_i} \times 100\%, \qquad (9)$$
where $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_m$ are the eigenvalues of $C$.
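As a minimal illustration of this computation (a sketch only, not the implementation used in the experiments), the principal components and the retained-variance percentage in (9) can be obtained directly from the eigendecomposition of the covariance matrix; the sketch below assumes, as above, that the data matrix has features in rows and samples in columns.

import numpy as np

def pca_components(X, p):
    """Return the first p principal components of X (features in rows,
    samples in columns) and the percentage of variance they retain."""
    Xc = X - X.mean(axis=1, keepdims=True)                # make each row zero-mean
    C = (Xc @ Xc.T) / (Xc.shape[1] - 1)                   # covariance matrix, Equation (7)
    eigvals, eigvecs = np.linalg.eigh(C)                  # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]                     # reorder in decreasing order
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    retained = 100.0 * eigvals[:p].sum() / eigvals.sum()  # retained variance, Equation (9)
    return eigvecs[:, :p], retained

Projecting the centered samples onto the returned components (e.g., scores = V.T @ Xc) yields the reduced p-dimensional representation used for classification.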
3. Proposed Classification Method Development
In summary, the method is based on taking an RGB image of the product to be classified, generating the concatenated histogram of the 3 channels, conducting PCA with
n principal components, and performing classification with SVM. In
Figure 1, a synthesis of the developed method is presented. To analyze and evaluate the performance of the method, it was applied to the detection of faults in pill blisters, particularly cases in which one or more pills are missing. This type of solution is in high demand in the pharmaceutical industry due to the high production speed of blisters and the strict quality levels required by the marketing standards of these products.
To implement it, it was first necessary to obtain images for classifier training. A blister of 10 circular pills was used. The blister is blue, while the pills are a light pink color. Images were taken with a cell phone at a zenithal angle, at a distance of 20 cm from the blister on a black background, with a resolution of 300 × 300 pixels. A total of 78 images were captured, in which the horizontal orientation of the blister was varied (
Figure 2).
Given the limited number of available images, in which the highest possible variability had already been introduced through orientation changes, a data augmentation process was carried out. This decision was made to improve the quality and reliability of the training and validation of the proposed method.
The data augmentation process consisted of applying five different types of transformations to generate images considered as new information for the classifier's development. The transformations included vertical and horizontal flips, the addition of masks with intensity gradients to simulate lighting changes, and the generation of Gaussian and salt-and-pepper noise. For the illumination gradients, a linear directional mask was generated by projecting normalized image coordinates onto a random direction and applying a multiplicative intensity variation with a randomly sampled strength. Gaussian noise was added with zero mean and a fixed variance, while salt-and-pepper noise was injected at a fixed pixel density. As a result, the total number of available images increased sixfold, allowing the use of 468 images in total.
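A minimal sketch of such transformations is given below; it is illustrative only, and the parameter values (gradient strength, noise variance, and salt-and-pepper density) are placeholders rather than the exact values used in the experiments.

import numpy as np

rng = np.random.default_rng()

def flip(img, horizontal=True):
    # Horizontal or vertical mirror of an H x W x 3 image
    return img[:, ::-1] if horizontal else img[::-1, :]

def illumination_gradient(img, strength=0.3):              # strength: placeholder value
    # Multiplicative linear gradient along a random direction to simulate lighting changes
    h, w = img.shape[:2]
    theta = rng.uniform(0.0, 2.0 * np.pi)
    yy, xx = np.mgrid[0:h, 0:w]
    proj = (xx / w) * np.cos(theta) + (yy / h) * np.sin(theta)
    proj = (proj - proj.min()) / (proj.max() - proj.min() + 1e-9)
    mask = 1.0 + strength * (2.0 * proj - 1.0)
    return np.clip(img * mask[..., None], 0, 255).astype(img.dtype)

def gaussian_noise(img, var=0.01):                          # var: placeholder value
    noise = rng.normal(0.0, np.sqrt(var), img.shape) * 255.0
    return np.clip(img.astype(float) + noise, 0, 255).astype(img.dtype)

def salt_and_pepper(img, density=0.02):                     # density: placeholder value
    out = img.copy()
    u = rng.random(img.shape[:2])
    out[u < density / 2] = 0                                 # pepper pixels
    out[u > 1.0 - density / 2] = 255                         # salt pixels
    return out

Applying each of the five transformations once per original image, together with the original, accounts for the sixfold increase mentioned above.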
A fundamental characteristic that was recognized is that, if the number of pills, the lighting, and the relative position of the camera to the blister are kept constant, both in distance and angle, and only the horizontal orientation of the blister is varied, even considering the presence of different types of noise, the distribution of pixel intensities does not show significant variations, as was observed in the histograms obtained from the images. Additionally, it was noted that the most relevant information, in terms of variability, was found in the upper part of the histogram, since the lower part corresponds to the pixels of the background. For this reason, it was decided that only the top 157 data points of the histogram of each color channel would be used for classification, and its performance was compared against a full-histogram version to justify this decision.
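The feature-extraction step described above can be sketched as follows; the image is assumed to be an 8-bit RGB array, and only the top 157 bins of each 256-bin channel histogram are retained, giving the 471-dimensional vector used in the next step.

import numpy as np

TOP_BINS = 157  # upper portion of each 256-bin channel histogram

def partial_rgb_histogram(img):
    # img: H x W x 3 array with 8-bit intensities (R, G, B)
    features = []
    for c in range(3):
        hist, _ = np.histogram(img[..., c], bins=256, range=(0, 256))
        features.append(hist[-TOP_BINS:])                   # keep the highest intensities only
    return np.concatenate(features).astype(float)            # 3 x 157 = 471 values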
By applying PCA to the histograms, it was detected that, by projecting the 471-dimensional histogram information onto only 2 dimensions, corresponding to the two most representative principal components, 88.76% of the variability could be preserved. By doing this, it was possible to represent each image using only one point in the plane. The method was also tested using 20 principal components, which preserved 98.85% of the variance, showing that the number of components used is a tunable parameter of the method.
With the data now reduced to an n-dimensional representation, the decision threshold between clusters could be implemented through an SVM, using a kernel feature space where necessary to ensure optimal separation.
4. SVM Training and Results
In our implementation, a linear kernel was employed for the SVM, which provided a suitable balance between model complexity and generalization for this dataset. To increase robustness against noisy or atypical measurements, an outlier fraction of 5% was specified, allowing the optimization process to tolerate a small proportion of mislabeled or irregular samples without compromising the separating hyperplane. The optimization was carried out using the Iterative Single Data Algorithm (ISDA), which in this case required 12,068 iterations to reach convergence. The final model relied on 159 support vectors, indicating that a meaningful subset of the training samples contributed directly to defining the optimal separating hyperplane.
A total of 468 images were used for training and validation. The dataset included 198 images of normal blisters, 138 images containing one missing pill, and 132 images with two missing pills. Since the goal of the experiment is to determine whether a blister is defective or not, a binary classification scheme was adopted. Thus, images with one or two missing pills were grouped into a single faulty class, while full blisters were assigned to the normal class. To strengthen the validation, 5-fold cross-validation was implemented: in each fold, 80% of the images were randomly selected for training and 20% for validation, and the results from all folds were then combined to measure accuracy and timings.
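A minimal sketch of this training and validation procedure is shown below, using scikit-learn as an assumed environment; since the ISDA solver and the 5% outlier fraction of the original setup have no direct equivalent there, a standard soft-margin linear SVM is used as an approximation, and the arrays features (the 468 partial-histogram vectors) and labels (0 for normal, 1 for faulty) are assumed to come from the steps described above.

import numpy as np
from sklearn.decomposition import PCA
from sklearn.model_selection import StratifiedKFold
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

def cross_validate(features, labels, n_components=2, n_folds=5):
    # 5-fold cross-validation: 80% of the images for training, 20% for validation per fold
    skf = StratifiedKFold(n_splits=n_folds, shuffle=True, random_state=0)
    accuracies = []
    for train_idx, val_idx in skf.split(features, labels):
        model = make_pipeline(
            PCA(n_components=n_components),                  # histogram -> principal components
            SVC(kernel="linear", C=1.0),                     # soft margin approximates outlier tolerance
        )
        model.fit(features[train_idx], labels[train_idx])
        accuracies.append(model.score(features[val_idx], labels[val_idx]))
    return float(np.mean(accuracies))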
To perform a comparative analysis of the performance of this method, other state-of-the-art classifiers were trained, and performance metrics were obtained for each (
Table 1).
5. Discussion
Although the individual techniques employed in this method, such as histogram analysis, PCA, and linear classification, are well established in the literature, their simple yet effective integration into fault detection in blister images represents a novel contribution. This deliberate simplicity is a key strength of the approach: rather than relying on advanced or computationally intensive architectures, the method combines fundamental techniques in a coherent pipeline that achieves high discriminative performance. The results suggest that, under controlled acquisition conditions, the use of more sophisticated feature extractors or deep models may not necessarily yield a real improvement in classification accuracy, while significantly increasing model complexity and, in some cases, computational cost. The method capitalizes on the simplicity and discriminative capacity of histogram-based features, which, when combined with PCA, result in a highly compact data representation that preserves most of the relevant variance. This allows the classification task to be performed with minimal computational complexity, in this case, using a single neuron modeled as a linear Support Vector Machine.
The effectiveness of the proposed approach is evident in its ability to detect missing pills with good classification accuracy on the validation set, without the need for complex image preprocessing or deep learning architectures. The preservation of over 88% of data variance after PCA underscores the relevance of the selected features. This suggests that even simple global descriptors can be highly informative when the acquisition conditions are adequately controlled.
Table 1 presents a comparative analysis of classifier performance across several state-of-the-art methods. The proposed histogram-based approach shows that using partial histograms consistently outperforms full histograms. For instance, with 2 principal components, the partial histogram achieves 84.17% accuracy compared to 64.32% for the full histogram. With 50 principal components, the partial histogram reaches 97.22%, slightly higher than 96.37% for the full histogram.
This improvement can be attributed to the fact that the main discriminative information comes from the blister regions themselves, while the background remains largely unchanged. Incorporating the full histogram introduces background pixels that do not contribute meaningful variability, slightly reducing classification effectiveness.
Traditional descriptors, such as HOG + PCA + SVM (82.22%) and LBP + PCA + SVM (78.61%), show lower accuracy compared to the proposed histogram-based method with 2 principal components, which achieves 84.17%. Gabor + PCA + SVM, despite using 20 principal components, only improves accuracy slightly to 88.89%.
In terms of computational efficiency, the proposed method also compares favorably. Its training time (120.6 ms) is substantially lower than that of HOG + PCA + SVM (1520.3 ms) and PCA + RF (347.1 ms), while maintaining classification times comparable to the fastest models. The kNN classifier yielded the lowest accuracy (62.50%), highlighting its limited generalization capability in this problem setup.
These results confirm that the proposed method achieves a superior trade-off between accuracy, model complexity, and computational cost. Unlike more intricate feature extraction techniques that require large amounts of data or fine-tuning of multiple hyperparameters, the proposed approach relies on a minimal and interpretable pipeline. Such characteristics make it particularly suitable for industrial contexts where interpretability, low latency, and ease of deployment are essential.
Nevertheless, several aspects warrant further exploration. The current dataset, while sufficient for initial validation, is relatively limited in terms of variation in pill types, blister materials, lighting conditions, and background textures. Although additional variability was introduced through data augmentation techniques—providing a stronger basis for evaluating the robustness and effectiveness of the proposed method—it remains advisable to perform further experiments under conditions more closely aligned with industrial environments. Such tests, conducted using real production infrastructure and acquisition setups, would allow a more comprehensive assessment of the method’s stability and generalization capacity in realistic operational contexts.
It should also be noted that the use of Convolutional Neural Networks (CNNs) was not explored in this work for two main reasons. First, the strength of such architectures is generally directed toward more complex classification problems, where large-scale data and intricate spatial relationships are involved, conditions that do not necessarily apply to the problem studied here. Second, the structural and computational complexity of CNNs greatly exceeds that of the proposed method and of the other state-of-the-art techniques used for comparison, which would have made the evaluation uneven and less representative of a fair performance assessment.
Finally, the method’s potential for transfer learning should also be investigated. For example, it could be adapted to other domains involving repetitive visual structures, such as defect detection in solar cells, food packaging, or electronic component assembly.
6. Conclusions
The proposed method demonstrates a robust and efficient approach for fault detection in blister images, combining histogram analysis, PCA, and a simple linear classifier. The results confirm that the image histograms exhibit clear and consistent alterations in the presence of missing pills, which provide sufficient information for accurate classification. By applying PCA, the dimensionality of the data was significantly reduced while preserving the most relevant variance, allowing the classification task to be performed using an extremely simple model.
Compared to other state-of-the-art techniques, the proposed approach achieves a superior balance between classification accuracy, computational efficiency, and model simplicity. Its simple integration of well-established techniques proves sufficient for the studied problem, and the introduction of additional variability through data augmentation further validates its effectiveness.
The use of more complex architectures, such as Convolutional Neural Networks, was not explored because their strength is more relevant to highly complex classification tasks, and their structural complexity greatly exceeds that of the proposed method and the other benchmarked approaches. Therefore, the proposed method represents a practical and well-justified solution under controlled acquisition conditions.
Overall, the method provides a reliable and straightforward solution suitable for real-time, in-line industrial applications. Its simplicity, combined with high classification performance, makes it particularly appealing for scenarios with limited computational resources or strict deployment constraints, while laying the groundwork for future evaluations under more diverse and realistic industrial conditions.