Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint

Oleka, Chijioke Jerry; Aikhuele, Daniel Osezua; Omorogiuwa, Eseosa

doi:10.3390/pr10101923

Open AccessArticle

Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint

by

Chijioke Jerry Oleka

¹,

Daniel Osezua Aikhuele

^1,2,*

and

Eseosa Omorogiuwa

¹

Centre for Engineering and Technology Management, Institute of Engineering Technology and Innovation, University of Port Harcourt, Choba 500272, Nigeria

²

Faculty of Engineering and the Built Environment, University of Johannesburg, Auckland Park, Johannesburg 2092, South Africa

^*

Author to whom correspondence should be addressed.

Processes 2022, 10(10), 1923; https://doi.org/10.3390/pr10101923

Submission received: 25 August 2022 / Revised: 6 September 2022 / Accepted: 16 September 2022 / Published: 22 September 2022

(This article belongs to the Section Manufacturing Processes and Systems)

Download

Browse Figures

Review Reports Versions Notes

Abstract

In this paper, a data-driven approach that is based on the k-mean clustering and local outlier factor (LOF) algorithm has been proposed and deployed for the management of non-destructive evaluation (NDE) in a welded joint. The k-mean clustering and LOF model algorithm, which was implemented for the classification, identification, and determination of data clusters and defect location in the welded joint datasets, were trained and validated such that three (3) different clusters and noise points were obtained. The noise points, which are regarded as the welded joint defects/flaws, allow for the determination of the cluster size, heterogeneity, and silhouette score of the welded joint data. Similarly, the LOF model algorithm was implemented for the detection, visualization, and management of flaws due to internal cracks, porosity, fusion, and penetration in the welded joint. It is believed that the management of welded joint flaws would aid the actualization of the Industry 4.0 concept in the development of lightweight products for manufacturing.

Keywords:

k-mean clustering; LOF model algorithm; welded joint; flaws/defects; Industry 4.0

1. Introduction

The global demand for the reduction of carbon (CO₂) emissions, and thus for the design, development, and implementation of a lightweight product concept in the manufacturing industry, has resulted in the call for innovation in the management of welding techniques and procedures [1,2]. Welding technology, which is not only a critical factor to consider in this regard, could pose a major challenge to the actualization of this drive [3,4]. Although, it is believed that the introduction of innovative welding techniques and procedures could address these issues holistically and save up to about 50% of the CO₂ emissions [3,5]. However, the nature and types of welded joints produced from these welding techniques when trying to achieve the lightweight product concept are not free from their problems. It has been suggested that welding techniques are critical to the management of manufacturing energy consumption, the cost of achieving the lightweight product concept, and the quality and reliability of the welds produced [6]. Hence, it has been recommended that welded joints under the lightweight product concepts should be monitored continuously, such as to ensure balance and stability in the micro-structural and mechanical properties of the welded joints [7].

In monitoring welded joints, non-destructive evaluation (NDE) techniques, which can also be referred to as non-destructive testing (NDT) methods, are often used, where the generation, propagation, and response signals from the joints are modeled and simulated. It allows for a complete check of the welded joints for cracks/flaws and for managing the negative impact of the CO₂ emissions balance. The high cost and time required for using destructive test methods has made the NDE a sought-after technique for many industries [8,9]. Furthermore, since some parts of the Industrial Revolution 4.0 are gaining traction both in the manufacturing and the oil and gas industry lately [10,11]. It is believed, therefore, that innovation in the management and implementation of the NDE techniques, especially for welding technology, will greatly impact the concept [12]. In addition, the combination of data from welding techniques and NDT methods, historical data from the production steps of the lightweight product construction, or their manufacturing environment will greatly benefit the Industrial Revolution 4.0, as well as the actualization of reduced CO₂ emissions and the lightweight product concept [13].

Non-destructive evaluation is a group of methods/tools that have found application in several industries for determining the properties and integrity of components, parts, or structures without physically altering their shape or causing damage to them. Generally, the NDE methods/tools rely heavily on experimental methods for their measurements. These experimental methods, however, are costly and time-consuming. Hence, they have become very unpopular, lately, and so many of the industries that normally use them are now calling for an autonomous deep-computing approach for the management and implementation of the NDE techniques.

A number of deep-computing approaches has been proposed for the management and implementation of the NDE techniques. Among them include Liu et al. [14], who investigate the use of a laser sensor as a non-destructive technique for detecting welding defects in seam contours. The method incorporates image coding into the laser sensors before applying deep-learning algorithms for the classification and detection of weld defect images. Zeng et al. [15], proposed a visual sensor that is based on image features and support vector machines (SVM) for the automatic identification of weld joint type before welding. The model, which is aimed at the improvement of the weld efficiency and automation of the welding system, is applied in the field of welding robotics.

As part of the Industry 4.0 paradigm that has resulted in the introduction of modern technologies, Tripicchio & D’Avella [16] proposed the deep neural network within the context of welding defect detection by analyzing the common problems in industrial applications of such modern technologies and discussing potential solutions in the specific case of quality checks in fuel injector welding during the manufacturing stage. Similarly, weld bead identification, which is critical for providing data for automatic welding process control, has been faced with the complex characteristics of the industrial environment, such as weak texture, low contrast, and rust. Yang et al. [17], propose a deep neural network-based detection and identification method for weld beads. To begin, high-quality training samples were generated with a small number of samples, combined with image processing and a generative adversarial network (GAN). Secondly, a mechanism for updating training samples was established to ensure the deep neural network model could cover all samples, and finally, the deep neural network was used to detect and identify weld beads by avoiding the handcrafted features of traditional machine learning methods.

Provencal & Laperrière [18] applied the deep learning approach for the identification of welding defects in a weld geometry by using NDT (ultrasound scan) data. The method, which allows a more accurate automated assessment of the ultrasound data, provides a deeper insight into the management of the welding and the improvement of the quality and reliability of the weld defect analysis. Unlike traditional NDT methods that rely solely on certified analysts to assess weld quality, the proposed deep learning framework presented, here, expands the NDT industry’s understanding of ultrasonic scan analysis.

Among the prominent traditional available NDE methods, the ultrasonic inspection testing method stands out as one which uses a high-frequency sound approach to detect flaws in components, parts, or structures by visualizing the different sections (the deep and shallow parts). Studies, however, noted that it is difficult to use the ultrasonic inspection testing (UT) method on components and structures with intricate shapes and high curvatures [19]. Hence, a manual inspection approach is always recommended and used. This approach, however, is costly, time-consuming, and produces inconsistent results most of the time, mainly due to human factors. To address this issue, a data-driven approach for analyzing the inspection data has been proposed in the paper. The data-driven intelligent approach that is based on the k-mean clustering and local outlier factor (LOF) model is aimed at supporting the NDE tool for intricate shape analysis and for structures that cannot be easily accessed using the traditional UT scan method. Additionally, it is used for addressing inconsistency in the results obtained using the UT scan method and for managing the amount of time spent when a manual approach is used by providing a novel approach for internal flaw/defect location and detection in the welded joint.

The primary contributions of the data-driven intelligent approach to NDE and welding technology literature can be summed as follows:

(1): The k-mean clustering method, which was implemented for data analysis of welded joints with intricate shapes, provides statistically relevant features from NDE data by classifying the dataset into clusters and noise points. To the best of my knowledge, this is the first study to apply the k-mean clustering method for the classification of the dataset from a welded joint with intricate shapes and structures.
(2): The k-mean clustering method provides a visualization schema of the NDE data features of the welded joint considered, showing the clustering means, clustering coefficient, cluster heterogeneity, silhouette score, and the size and measurement in the form of clusters and noise point information.
(3): Finally, the LOF model algorithm is implemented for the detection of flaws due to internal cracks, internal porosity, internal fusion, and internal penetration in the welded joint. To the best of my knowledge, this is the first study to apply the LOF model algorithm for the detection of flaws in welded joints, especially for welded joints with intricate shapes and structures.

The practical advantages and significance of the data-driven intelligent approach as part of the Industry 4.0 paradigm are listed as follows:

(a): Less Waste: The use of modeling and simulation in NDE does not change or alter the structure or composition of a component or structure; therefore, their usage is not restricted and results in no samples wasted, unlike the traditional NDE where samples may be wasted.
(b): Reduced downtime: There is no need to halt operations when using the modeling and simulation approach for the NDE of components and structures because the procedures allow testing to take place while the materials are still in use.
(c): Prevention of accidents: Accidents can be avoided with the aid of modeling and simulation of the NDE process, which also lowers the price of maintenance, replacement, and equipment loss, as well as the need to close down a firm.
(d): We see the NDE and process (and environmental) monitoring being applied seamlessly as Industry 4.0 envisions cyber–physical systems, where they talk with each other in terms of processes, quality, and logistical aspects.
(e): Data collection with NDE at various stages of the value chain can be merged into a “digital twin” of a component or structure, which can be used as a reference for the condition or structural health monitoring later on. For predictive analytics to compute preventative maintenance or a remaining lifetime, machine learning algorithms must be used.

The rest of the paper is organized as follows. In the next section, the data-driven intelligent model for the welded joint is presented, followed by the implementation of the model in Section 3. In Section 4, the results from the analysis is discussed and then a concluding remark is presented in Section 5.

2. The Data-Driven Intelligent Model for Welded Joints

The clustering method, which has been adopted in this study for flaw/defect detection in welded joints, is an unsupervised learning method that takes input features and data such that it does not require proper labels to predict and evaluate them. It is a data analysis technique for identifying intriguing patterns in data, such as fault patterns and groupings. It provides a quick summary of the data that could be utilized to make inferences. Since the purpose of a clustering task is to find data structures, the clustering method must, therefore, be able to determine the number of structures/groups in the data and how the features are distributed within each group.

Clustering, for example, can be used to detect defects, faults, and anomalies in a system by using the system’s database or historical data or the locations of the faults or defects in the system. Additionally, the area where errors occur more frequently can also be determined using the clustering method. Several clustering methods can be used for this task, including k-mean clustering [20], mini-batch k-means clustering [21], spectral clustering, gaussian mixture clustering [20], birch clustering [22], density-based clustering [23,24], hierarchical clustering, and random forest clustering [25]. All of these methods can be successfully implemented to address the two objectives (flaw/defect detection and their internal damage location). In this paper, however, the study will be focused only on the k-mean clustering and the local outlier factor model algorithm.

2.1. K-Mean Clustering and the Local Outlier Factor (LOF) Model

K-means clustering is a vector quantization approach that seeks to partition n observations into k clusters, with each observation belonging to the cluster with the closest mean (cluster centers or cluster centroid), which serves as the cluster’s prototype such that the data space is divided into Voronoi cells as a result of this [26]. Within-cluster variances (squared Euclidean distances) are minimized by k-means clustering, but not the regular Euclidean distances. The mean optimizes squared errors, while only the geometric median minimizes the Euclidean distances [27]. The use of k-medians and k-medoids, for example, can lead to better Euclidean solutions.

There are three main characteristics of k-means that make it very efficient for solving engineering problems; however, these same characteristics are also frequently seen as its most significant drawbacks. These characteristics include [28]:

The Euclidean distance is used as both the metric and variance, and for measuring the cluster scatter.
The number of clusters k, when used as an input parameter; selecting an incorrect value for k, may result in bad results. It is important, therefore, to check the number of clusters in the data set when performing a diagnostic check with the k-mean clustering method.
Finally, the convergence to a local minimum can have unexpected (“wrong”) results.

Although the problem is computationally challenging, effective heuristic techniques quickly converge to a local optimum. Both k-means and gaussian mixture modeling use an iterative refining method that is comparable to the expectation–maximization algorithm for mixtures of gaussian distributions. They both use cluster centers to represent the data; however, k-means clustering finds clusters with similar spatial extents, whereas the gaussian mixture model enables clusters to have diverse shapes.

Definition 1.

If a set of observations is given by

(x_{1}, x_{2}, x_{3}, \dots, x_{n})

where each of the observations is a d-dimensional real vector, the k-means clustering, therefore, aims to partition the n observations into

k (k \leq n)

sets

S = (S_{1}, S_{2}, S_{3}, \dots, S_{k})

, such that the within-cluster sum of squares (WCSS) is minimized as much as possible (i.e., variance). The objective, therefore, is given as:

a r g_{S} m i n \sum_{i = 1}^{k} \sum_{X \in S_{i}} ‖ x - μ_{i} ‖^{2} = a r g_{S} m i n \sum_{i = 1}^{k} |S_{i}| V a r S_{i}

(1)

where

μ_{i}

is the mean of points in

S_{i}

and it is the equivalent to the minimization of the pairwise squared deviations of the different points within the same clusters.

a r g_{S} m i n \sum_{i = 1}^{k} \frac{1}{|S_{i}|} \sum_{x, y \in S_{i}} ‖ x - μ_{i} ‖^{2}

(2)

The overall variance is constant, and this is equivalent to maximizing the sum of squared deviations between points in various clusters, and it is equal to the between-cluster sum of squares (BCSS). The algorithm for the implementation of the k-mean clustering method that has been proposed for supporting the physics-based analysis used in the detection of flaws/defects and for the determination of the internal damage location in welded joints has been developed using the Python 3 programming language. The flowchart of the algorithm have been given in Figure 1.

The task of detecting observations that do not conform to typical, expected behavior is referred to as anomaly identification. In various application domains, these observations are sometimes referred to as anomalies, defects, outliers, flaws, novelties, exceptions, or surprises. Anomalies, flaws, and outliers are three of the most commonly used phrases in literature. The most recognized and commonly utilized local anomaly and flaws detection algorithm is the local outlier factor (LOF) model. It employs the concept of k nearest neighbors to calculate and identify anomaly or outlier scores in a dataset.

The LOF model is an unsupervised anomaly/flaw detection method that computes a particular data point’s local density deviation to its neighbors. Outliers are samples that have a significantly lower density than their neighbors. A point’s LOF is determined by the ratios of the local density of the area surrounding the point and the local densities of its neighbors. It takes into account the relative density of data points. When employing a LOF model, the following steps can be taken:

(a): Using a distance function such as Euclidean or Manhattan, calculate the distance between P and all of the specified points.
(b): Locate the nearest k (k-nearest neighbor) point. For example, if k = 3, find the distance to the third nearest neighbor.
(c): Locate the k nearest points.
(d): Using the following equation, calculate the local reachability density (lrd), $l r d_{k} = \frac{‖ N_{k} (0) ‖}{\sum_{0^{'} ϵ N_{k} (0)} r e a c h d i s t_{k} (0^{'} \leftarrow 0)}$ , where reachable distance can be calculated as $r e a c h d i s t_{k} (0^{'} \leftarrow 0) = m a x \{d i s t_{k} (0), d i s t (0, 0^{'})\}$ , $N_{k} (0)$ is the number of neighbors.
(e): The final step is to compute the local outlier factor, which is as follows, $L O F = \frac{\sum_{0^{'} ϵ N_{k} (0)} \frac{l r d_{k} (0^{'})}{l r d_{k} (0)}}{‖ N_{k} (0) ‖}$ .

2.2. Data Collection for the Implementation of a Data-Driven Intelligent Model

The welded joint dataset used in this paper was obtained from published studies and from the quality control unit from a leading oil and gas company around Port Harcourt, Nigeria, which unfortunately doesn’t want to be mentioned. The dataset, which comprises quality control records of welded joints, includes labels such as temperature, hardness of the welded joint (HB), base material nominal yield strength (MPa), base material nominal ultimate tensile strength (MPa), weld length (mm), porosity area and ratio, roughness, bead width and area, internal crack length, fusion data, groove angle (°) data, and spatters data. About 25 datasets were collected from published works for each of the labels presented above, while 100 were obtained from quality control units. Some of the experimental data used for the implementation of the intelligent model are given in Table 1 below. The datasets’ training/test ratio is set to 8:2. That is, 80% of the training data is drawn at random from the label database, while the remaining 20% serves as the test dataset. It should be noted that the samples in the training dataset are completely different from the samples in the test dataset.

3. Implementation of the Data-Driven Intelligent Model

In this section, the results of the analysis for the classification of the welded joint data collected are presented. First, they are classified into clusters and then flaws/defects in a welded joint are identified and detected, finally, the internal flaw/defect locations in the welded joint are visualized. In implementing the intelligent data-driven model for the dataset which has been collected from the application of the NDE techniques, first, the data are observed and then analyzed using the k-mean clustering method and finally, with the LOF model algorithm. The k-mean clustering method is implemented for the determination of data cluster and the computation of the cluster means which depict how organized the data are and the quality of the clusters, while the LOF model algorithm help in the identification and visualization of the internal flaws/defects and their locations in the welded joint.

3.1. The K-Mean Clustering Algorithm for Defect Classification for Welded Joint Data

In the classification and identification of potential defects and hidden patterns in the welded joint data presented above, the k-mean clustering model algorithm has been applied. The dataset used for this purpose includes temperature data, hardness of the welded joint (HB), base material nominal yield strength (MPa), base material nominal ultimate tensile strength (MPa), pin feature aggressiveness, roughness data, and the groove angle. The simulated results from the algorithm have been presented in the t-SNE cluster plot as shown in Figure 2. The t-distributed stochastic neighbor embedding (t-SNE) is a non-linear plot that maps multi-dimensional data into two or more clusters. It enjoys the advantage of being able to present data explicitly as compared to other features.

From the t-SNE cluster plot result, it is not hard to see that there are some noise points (defects) in the obtained dataset. Additionally, from the dataset, we can easily see that the trained and validated data in the k-mean clustering algorithm have been classified automatically into three (3) different clusters, which also show their size, heterogeneity, and silhouette score. The silhouette score is used to determine the quality, performance, and similarity of the features in the clusters. In Table 2 and Table 3 below, the cluster information for the dataset and the cluster mean values, respectively, have been presented.

Similarly, from Table 2 and Table 3 above, the k-means algorithm is applied to determine the range of the features in the cluster and their importance. The results, which have been presented in Figure 3, show the clustering coefficient of the features as well as how they influence the overall defect in the welded joint dataset. The clustering coefficient measures the degree to which the nodes in the dataset tend to cluster together as well as the noise points. It also shows the tightly knit group created from the dataset, which is characterized by a relatively high density of ties.

3.2. Application of LOF Model Algorithm for Flaw Detection in Welded Joints

The LOF model algorithm has been applied for welded joint sample data from a field in the oil and gas industry. This is mainly to address the gaps in the NDE literature, where it has been found that there are difficulties in the use of the NDE method on welded joint structures with intricate shapes and high curvatures, as well as the challenges in the interpretation of NDE data due to human factors (subjectiveness and uncertainty). The LOF model algorithm considers the density of data instances surrounding a given instance A to the density of data instances surrounding A’s neighbors, such that if the former is lower than the latter, it indicates that A is substantially isolated. This isolation (anomaly) is deemed as the flaws or defects of the welded joint.

Using the dataset in Table 1 above, flaws due to internal cracks, internal porosity of the welded joint, internal fusion, and internal penetration are identified in the welded joint, which has intricate shapes and high curvature. The following results from the analysis of the dataset using the LOF model algorithm have been presented in Figure 4.

It is not difficult to see from the results presented in Figure 4a that there are several flaws, which are represented as abnormal data behavior scattered in and around the welded joints, such that this can be interpreted as a gradual propagation of the internal crack in and around the welded joint and to other parts of the material. Hence, if not repaired immediately, the joint could become weak and collapse. In Figure 4b, there are very few flaws around the welded joints; hence, it can be concluded and interpreted that the internal porosity in the welded joint has no significant effect on the entire welded joint or the materials in general. With this result, it is recommended therefore, that the welded joint should be monitored regularly to prevent total failure. Similar to the result in Figure 4b above, Figure 4c,d shows some few anomalies and flaws, which is as a result of a very significant defect in the welded joint due to internal fusion and internal weld penetration, respectively. Hence, it has been recommended that the welded joint should be monitored regularly.

4. Discussion of the Results

Welded joint data collected and analyzed in this thesis has proven to be very critical in the understanding of welded joint defect identification, classification, location, and propagation. The study has presented two novel computational model algorithms for the management of flaws and defects in welded joints with intricate shapes and curvatures, which are very common in the oil and gas industry.

In the implementation of the model algorithms, a dataset that comprises the hardness of the welded joint (HB), base material nominal yield strength (MPa), base material nominal ultimate tensile strength (MPa), pin feature aggressiveness, roughness data, temperature, and the groove angle of the joints were analyzed to classify and identify defects in the welded joint. Similarly, a dataset that comprises the internal crack length check, internal porosity of the welded joint, internal fusion assessment, internal slag inclusion assessment, and lack of penetration assessment and spatters data of the welded joint were simulated to detect flaws and defects in the welded joint considered. From the results obtained from the analysis, the study can conclude, therefore, that the proposed model algorithms are feasible and can address and manage flaws and defects in the welded joint, which in extension has addressed the gap in the literature.

5. Conclusions

Given the diversity of NDE applications and the stringent quality control standards which have been encouraged for the actualization of Industry 4.0, researchers and practitioners are now frequently tasked to provide low-cost and time-efficient solutions to NDE-related problems. Lately, data engineering has proven to be an effective numerical tool for NDE. In this paper, a data-driven approach that is based on the k-mean clustering and LOF algorithm has been proposed and deployed as a special tool for the management of the NDE of welded joints. The k-mean clustering and LOF algorithm was implemented for the classification, identification, and determination of data clusters and defects in the welded joints dataset.

In the application of the model algorithm, flaws due to internal cracks, porosity, fusion, and penetration in the welded joint were analyzed and visualized. In the future, the proposed data-driven intelligent approach will be applied in other domains for the improvement of welding technology and the actualization of the Industry 4.0 concept for the development of lightweight products for manufacturing.

Author Contributions

Conceptualization, D.O.A. and C.J.O.; methodology, D.O.A. and C.J.O.; software, E.O.; validation, D.O.A., C.J.O. and E.O.; formal analysis, D.O.A.; investigation, C.J.O.; resources, D.O.A.; data curation, E.O.; writing—original draft preparation, C.J.O.; writing—review and editing, D.O.A.; visualization, E.O.; supervision, D.O.A.; project administration, D.O.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to companies privacy.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ahmed, F.; Kim, K.Y. Data-driven Weld Nugget Width Prediction with Decision Tree Algorithm. Procedia Manuf. 2017, 10, 1009–1019. [Google Scholar] [CrossRef]
Aikhuele, D.O. A hybrid-fuzzy model with reliability-based criteria for selecting consumables used in welding dissimilar aluminum alloys joint. Eng. Appl. Sci. Res. 2019, 46, 79–85. [Google Scholar] [CrossRef]
Albers, A.; Holoch, J.; Revfi, S.; Spadinger, M. Lightweight design in product development: A conceptual framework for continuous support in the development process. Procedia CIRP 2021, 100, 494–499. [Google Scholar] [CrossRef]
Ali, K.A.; Ahmad, M.I.; Yusup, Y. Issues, impacts, and mitigations of carbon dioxide emissions in the building sector. Sustainability 2020, 12, 7427. [Google Scholar] [CrossRef]
Xiao, B.; Wang, Z.; Liu, Q.; Liu, X. Smk-Means-an-Improved-Mini-Batch-K-Means-Algorithm. Tech. Sci. Press (TSP) 2018, 56, 365–379. [Google Scholar]
De Souza, G.C.; Pardal, J.M.; Tavares, S.S.M.; da Cindra Fonseca, M.P.; Ferreira Martins, J.L.; de Moura, E.P.; Filho, I.C. Evaluation of proportion of phases in joints welded from duplex stainless steel pipes by means of non-destructive testing. Weld. Int. 2015, 29, 762–770. [Google Scholar] [CrossRef]
Eshtayeh, M.; Hijazi, A.; Hrairi, M. Nondestructive Evaluation of Welded Joints Using Digital Image Correlation. J. Nondestruct. Eval. 2015, 34, 37. [Google Scholar] [CrossRef]
Et-Taleby, A.; Boussetta, M.; Benslimane, M. Faults detection for photovoltaic field based on k-means, elbow, and average silhouette techniques through the segmentation of a thermal image. Int. J. Photoenergy 2020, 2020, 6617597. [Google Scholar] [CrossRef]
Ikumapayi, O.M.; Akinlabi, E.T. Experimental data on surface roughness and force feedback analysis in friction stir processed AA7075—T651 aluminium metal composites. Data Brief 2019, 23, 103710. [Google Scholar] [CrossRef] [PubMed]
Islam, M.R.; Kim, Y.H.; Kim, J.Y.; Kim, J.M. Detecting and learning unknown fault states by automatically finding the optimal number of clusters for online bearing fault diagnosis. Appl. Sci. 2019, 9, 2326. [Google Scholar] [CrossRef]
Kalpana, J.; Rao, P.S.; Rao, P.G. Regression Analysis for Estimating Hardness and Tensile Strength of Vibratory Dissimilar Welded Joint. Int. J. Manag. Technol. Eng. 2018, 8, 3560–3573. [Google Scholar]
Khan, M.M.R.; Siddique, M.A.B.; Arif, R.B.; Oishe, M.R. ADBSCAN: Adaptive density-based spatial clustering of applications with noise for identifying clusters with varying densities. In Proceedings of the 4th International Conference on Electrical Engineering and Information and Communication Technology, iCEEiCT 2018, Dhaka, Bangladesh, 13–15 September 2018; pp. 107–111. [Google Scholar] [CrossRef]
Kulis, B.; Jordan, M.I. Revisiting k-means: New algorithms via Bayesian nonparametrics. In Proceedings of the 29th International Conference on Machine Learning, ICML 2012, Madison, WI, USA, 26 June–1 July 2012; Volume 1, pp. 513–520. [Google Scholar]
Liu, Y.; Yuan, K.; Li, T.; Li, S.; Ren, Y. NDT Method for Line Laser Welding Based on Deep Learning and One-Dimensional Time-Series Data. Appl. Sci. 2022, 12, 7837. [Google Scholar] [CrossRef]
Lorbeer, B.; Kosareva, A.; Deva, B.; Softić, D.; Ruppel, P.; Küpper, A. Variations on the Clustering Algorithm BIRCH. Big Data Res. 2018, 11, 44–53. [Google Scholar] [CrossRef]
Lu, Q.Y.; Wong, C.H. Additive manufacturing process monitoring and control by non-destructive testing techniques: Challenges and in-process monitoring. Virtual Phys. Prototyp. 2018, 13, 39–48. [Google Scholar] [CrossRef]
Nakhla, H.; Shen, J.Y.; Bethea, M. Environmental impacts of using welding gas. J. Technol. Manag. Appl. Eng. 2012, 28, 2–11. [Google Scholar]
Nidhi Patel, K.A. An Efficient and Scalable Density-based Clustering Algorithm for Normalize Data. Procedia Comput. Sci. 2016, 92, 136–141. [Google Scholar] [CrossRef]
Othman, Z.; Zamli, I.; Rahaizat, R.A.; Sorooshian, S. Role of industry 4.0 in process strategy. J. Manag. Sci. 2018, 8, 192–198. [Google Scholar] [CrossRef]
Patel, E.; Kushwaha, D.S. Clustering Cloud Workloads: K-Means vs. Gaussian Mixture Model. Procedia Comput. Sci. 2020, 171, 158–167. [Google Scholar] [CrossRef]
Pires, I.; Quintino, L.; Amaral, V.; Rosado, T. Reduction of fume and gas emissions using innovative gas metal arc welding variants. Int. J. Adv. Manuf. Technol. 2010, 50, 557–567. [Google Scholar] [CrossRef]
Posilović, L.; Medak, D.; Milković, F.; Subašić, M.; Budimir, M.; Lončarić, S. Deep learning-based anomaly detection from ultrasonic images. Ultrasonics 2022, 124, 106737. [Google Scholar] [CrossRef]
Pradhan, R.; Joshi, A.P.; Sunny, M.R.; Sarkar, A. Machine learning models for determination of weldbead shape parameters for gas metal arc welded T-joints—A comparative study. arXiv 2022, arXiv:2206.02794. [Google Scholar]
Provencal, E.; Laperrière, L. Identification of weld geometry from ultrasound scan data using deep learning. Procedia CIRP 2021, 104, 122–127. [Google Scholar] [CrossRef]
Rajendran, C.; Srinivasan, K.; Balasubramanian, V.; Balaji, H.; Selvaraj, P. Data set on prediction of friction stir welding parameters to achieve maximum strength of AA2014-T6 aluminium alloy joints. Data Brief 2019, 23, 103735. [Google Scholar] [CrossRef]
Ranganayakulu, S.V.; Burra, S.G.; Ravi, S. Characterization of Weldments Defects through Non Destructive Evaluation Techniques. Indian J. Sci. Technol. 2017, 10, 1–9. [Google Scholar] [CrossRef][Green Version]
Rosenthal, S.; Maaß, F.; Kamaliev, M.; Hahn, M.; Gies, S.; Tekkaya, A.E. Lightweight in Automotive Components by Forming Technology. Automot. Innov. 2020, 3, 195–209. [Google Scholar] [CrossRef]
Sarkar, S.S.; Das, A.; Paul, S.; Mali, K.; Ghosh, A.; Sarkar, R.; Kumar, A. Machine learning method to predict and analyse transient temperature in submerged arc welding. Meas. J. Int. Meas. Confed. 2021, 170, 108713. [Google Scholar] [CrossRef]
Sorooshian, S.; Panigrahi, S. Impacts of the 4th industrial revolution on industries. Walailak J. Sci. Technol. 2020, 17, 903–915. [Google Scholar] [CrossRef]
Tripicchio, P.; D’Avella, S. Welding Defect Detection with Deep Learning Architectures. In Engineering Principles—Welding and Residual Stresses; Cozza, K.O.C., Câmara, R., Eds.; IntechOpen Limited: London, UK, 2022. [Google Scholar] [CrossRef]
Verma, S.; Misra, J.P.; Singh, J.; Batra, U.; Kumar, Y. Prediction of tensile behavior of FS welded AA7039 using machine learning. Mater. Today Commun. 2021, 26, 101933. [Google Scholar] [CrossRef]
Yang, L.; Liu, Y.; Peng, J. An Automatic Detection and Identification Method of Welded Joints Based on Deep Neural Network. IEEE Access 2019, 7, 164952–164961. [Google Scholar] [CrossRef]
Yee, L.W.; Teck, T.S.; Sorooshian, S. Impacts of industry 4.0 on Malaysian manufacturing industries. WSEAS Trans. Bus. Econ. 2019, 16, 355–359. [Google Scholar]
Yu, J.; Zhu, L.; Qin, R.; Zhang, Z.; Li, L.; Huang, T. Combining k-means clustering and random forest to evaluate the gas content of coalbed bed methane reservoirs. Geofluids 2021, 2021, 9321565. [Google Scholar] [CrossRef]
Zeng, J.; Cao, G.Z.; Peng, Y.P.; Huang, S.D. A weld joint type identification method for visual sensor based on image features and SVM. Sensors 2020, 20, 471. [Google Scholar] [CrossRef] [PubMed]
Zhou, B.; Pychynski, T.; Reischl, M.; Kharlamov, E.; Mikut, R. Machine learning with domain knowledge for predictive quality monitoring in resistance spot welding. J. Intell. Manuf. 2022, 33, 1139–1163. [Google Scholar] [CrossRef]

Figure 1. Flowchart of the k-mean clustering method algorithm.

Figure 2. The simulated t-SNE cluster plot results of the dataset.

Figure 3. The clustering coefficient and features of the welded joint data.

Figure 4. Detection of flaws due to internal crack, internal porosity, internal fusion, and internal penetration in the welded joint.

Table 1. Experimental data for the intelligent model.

S/N	Predictive Parameters	Data Range	Source
1	The temperature in the middle of the weld	18–20	[29] and from quality control units
2	Hardness (HB)	Max. 290	[30] and from quality control units
3	Nominal yield strength (MPa)	420–610	[30] and from quality control units
4	Nominal ultimate tensile strength (MPa)	640–790	[31] and from quality control units
5	Impact strength (J) at 20 °C	30–100	[29] and from quality control units
6	Weld length (mm)	8.53–9.0	[32] and quality control units
7	Roughness (μm)	0.24–1.35	[33] and quality control units
8	Porosity area (mm²)	2.95–7.0	[34] and quality control units
9	Bead width (mm)	4.5–5.2	[34] and quality control units
10	Bead area (mm²)	40.5–46.9	[35] and quality control units
11	Porosity ratio (%)	6.7–15.2	[35] and quality control units
12	Crack initiation from weld (mm)	6.6–14.2	[34] and quality control units
13	Groove angle (°)	45 and 60	[34] and quality control units
14	Welding speed (mm/s)	3–4	[34] and quality control units
15	Welding current (A)	80–140	[36] and quality control units

Table 2. Cluster information for the welded joint data.

Cluster Information
Cluster	Noise Points	1	2	3
Size	9	5	15	15
Explained proportion within-cluster heterogeneity	0.000	0.073	0.506	0.421
Within the sum of squares	0.000	8.569	59.157	49.263
Silhouette score	0.000	0.455	0.194	0.390

Notes. The between sum of squares of the 3 cluster model is 142.09. The total sum of squares of the 3 cluster model is 259.08.

Table 3. The clustering means of the welded joint data.

Cluster Means
	The Temperature in the Middle of the Weld	Hardness (HB)	Nominal Yield Strength (MPa)	Nominal Ultimate Tensile Strength (MPa)	Pin Feature Aggressiveness	Shoulder Diameter (mm)	Roughness (μm)	Groove Angle
Cluster 0	−0.217	0.540	−0.176	0.201	0.529	−0.113	−0.190	−0.152
Cluster 1	0.092	−0.927	−1.537	−1.775	−0.517	−0.740	−1.336	−0.823
Cluster 2	−0.232	0.667	−0.444	1.465 × 10⁻⁸	−0.256	0.439	−0.500	−0.823
Cluster 3	0.332	−0.682	1.063	0.471	0.111	−0.125	1.060	1.188

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Oleka, C.J.; Aikhuele, D.O.; Omorogiuwa, E. Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint. Processes 2022, 10, 1923. https://doi.org/10.3390/pr10101923

AMA Style

Oleka CJ, Aikhuele DO, Omorogiuwa E. Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint. Processes. 2022; 10(10):1923. https://doi.org/10.3390/pr10101923

Chicago/Turabian Style

Oleka, Chijioke Jerry, Daniel Osezua Aikhuele, and Eseosa Omorogiuwa. 2022. "Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint" Processes 10, no. 10: 1923. https://doi.org/10.3390/pr10101923

APA Style

Oleka, C. J., Aikhuele, D. O., & Omorogiuwa, E. (2022). Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint. Processes, 10(10), 1923. https://doi.org/10.3390/pr10101923

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Data-Driven Intelligent Model for the Classification, Identification, and Determination of Data Clusters and Defect Location in a Welded Joint

Abstract

1. Introduction

2. The Data-Driven Intelligent Model for Welded Joints

2.1. K-Mean Clustering and the Local Outlier Factor (LOF) Model

2.2. Data Collection for the Implementation of a Data-Driven Intelligent Model

3. Implementation of the Data-Driven Intelligent Model

3.1. The K-Mean Clustering Algorithm for Defect Classification for Welded Joint Data

3.2. Application of LOF Model Algorithm for Flaw Detection in Welded Joints

4. Discussion of the Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI