1. Introduction
Austenitic stainless steels (ASSs) are widely known to exist in stable states depending on their chemical and mechanical conditions [
1,
2]. UNS S34700 steel, a metastable austenitic stainless steel, is widely utilized in the food, chemical, petrochemical, and medical industries [
3]. Because austenitic stainless steels like type AISI 347 have good resistance to sensitization and creep deformation, they are commonly used in components designed for high-temperature applications such as nuclear reactors, boilers, superheaters, and chemical reactors [
4]. One of the most important criteria in these areas is fine surface finish, which AISI 347 steels can achieve through the grinding process. The ultimate surface quality of a processed workpiece significantly influences strength, friction, heat generation, wear, and fatigue resistance. Therefore, the specified grade of surface finish is usually indicated based on its applicability [
5].
Grinding turns out to be the most practical and cost-efficient machining operation when it comes to achieving a fine high-quality surface. As the most common precision machining technique in the industry, it is important to monitor the efficiency of grinding operations. The choice of parameters, such as wheel speed, table speed, cross feed, depth of cut, dressing depth, and coolant supply has a significant impact on the efficiency of the grinding process [
6,
7,
8].
Machining factors such as surface roughness and cutting force directly impact the surface quality of workpieces. Additionally, forecasting these parameters is facilitated using models constructed in previous publications [
9,
10,
11]. Agarwal et al. [
12] studied the effects of different cutting parameters on surface finish, such as material removal rate (MRR), surface roughness, and surface burn. The results revealed that table speed, depth of cut, grit size, and abrasive grain density are the most important elements influencing surface integrity when using a silicon carbide grinding wheel. A decrease in the depth of the cut and the table speed also results in less subsurface damage and surface roughness.
Machine learning (ML) techniques are used to forecast ideal machining parameters and fault detection as a predictive maintenance solution. ML approaches are highly beneficial in enhancing the prediction of grinding factors including cutting forces, tool wear, tool life [
13], remaining useful life prediction, wheel wear detection [
14], surface roughness prediction, grinding burn, estimation of grinding specific energy [
15], wheel loading, and crack detection [
16]. Because of its capability to identify extremely sophisticated and nonlinear patterns in data, a data-driven approach based on machine learning is a researcher’s main focus [
17].
Gopan et al. [
18] used a hybrid ANN-PSO strategy to multi-objective optimize the surface grinding process on AISI D2 steel. Table speed, cross feed, and depth of cut were the machining parameters chosen for optimization, with the goals of minimizing surface roughness, F
Y, and F
X.
The confirmatory experiments for the ANN-PSO approach yielded error percentages of 9.8% for surface roughness, 3.75% for FX, and 1.35% for FY. For multi-objective optimization in complex and nonlinear scenarios, the ANN-PSO method outperforms the traditional RSM-MOGA method.
Kant et al. [
19] developed an artificial neural network model to estimate the cutting energy of carbon steel during grinding based on data from 27 trials by adjusting process parameters (spindle speed, depth, width of cut, and feed rate). The cutting energy was predicted by the model used with a 98.5% accuracy rate.
The experimental data collected over five processes and optimization approaches employing neural model-based control strategies for the industrial grinding process were reported in the study by [
20]. Another result of the study was the optimization of a technique affecting the machining of steel-based tools through the use of neural networks.
Prashanth et al. [
9] used machine learning to estimate the grinding variables for Inconel 751 under various environmental conditions. The aim of their study was to use machine learning methods to link the input and output characteristics of grinding Inconel 751. The methods for predicting the variables during Inconel 751 grinding were three different types of ML algorithms: support vector machine (SVM), boosted tree ensemble approaches, and Gaussian process regression (GPR). The GPR predicted grinding forces and temperature with a relatively better R
2 score and RMSE value at wheel work contact. It is clear that the GPR Matern 5/2 model produces an accurate forecast, especially when predicting F
t, with constant R
2 values ranging from 0.85 to 0.9.
In order to predict the material removal rate for Inconel-718, Zhang et al. [
21] used electrical discharge diamond grinding (EDDG) and developed the GPR model, whose input variables were wheel speed, current, pulse-on-time, and duty factors. The performance of the model was shown to be accurate and stable. In particular, the GPR model reaches a correlation coefficient (CC), mean percentage error (MPE), root mean square error (RMSE), and mean absolute error (MAE) of 99.17%, −0.55%, 0.6787, and 0.3789, respectively.
Jian-Lin Liu et al. [
22] investigated the intricate relationships between the geometric and energy variables that influence the transition between the Wenzel and Cassie–Baxter wetting states for sinusoidal substrates, providing an essential framework for the development of materials with improved hydrophobicity.
Prabhu et al. [
23] attempted to disperse multiwall carbon nanotubes (MWCNT) into cutting fluids of SAE 20W40 oil. Using the L8 orthogonal Taguchi design for experimental procedures in CNC grinding, AISI D3 tool steel was ground. CNC grinding was applied to AISI D3 tool steel using the L8 orthogonal Taguchi design for experimental methodologies. The experimental data were trained with the feed-forward ANN method using the Levenberg–Marquardt algorithm, and the significant variables were identified. With the ANN model, the maximum test errors for surface roughness were 10.31% and 11.3% for the cases with and without carbon nanotubes.
A unique method for predicting the surface roughness of the Al-based MMC in cylindrical grinding was presented by Ucar et al. [
24], and it has been effectively implemented. An optimized GPR model was created, using a real-life dataset collected experimentally from other research. The suggested machine learning-based GPR model proved to be a promising way to enhance the machining quality of the Al MMC materials when analyzing the surface roughness.
Finding methods to establish the relationship between the input parameters obtained from experimental processes and the output variables has been challenging research for engineers and authors in this field until now. Researchers also continue to use techniques to predict output variables accurately through efficient quantitative models [
25,
26,
27].
Abou-El-Hossein et al. [
25] published a paper about optimizing cutting force parameters using the response surface method on modified AISI P20 tool steel during the end-milling process. Predictive models were developed for cutting conditions, including feed rate, cutting speed, grinding depth of cut, and radial depth of cut. Based on variance analysis, the most effective input parameters and the interactions of parameters were calculated. The parameters in this research were optimized and results were predicted with a 95% confidence interval.
Noordin et al. [
28] used a model to establish a relationship between parameters to predict the tangential cutting force in the turning of AISI 1045. They concluded that feed rate and side cutting edge angle are the main factors affecting the output variable.
Artificial neural networks (ANNs) have been used in many publications as one of the most accurate prediction methods in recent years. The versatility of an ANN is due to its ability to handle nonlinear systems, particularly those systems whose definition is dynamic with various levels and a large number of parameters. The capability of ANN methods mostly comes from their versatility for training from past data and predicting based on errors. An ANN can monitor surface roughness and grinding forces in industrial grinding processes for prediction, preventing faults in subsequent data in the process [
29].
The authors [
29] proposed dynamic models for the case of the turning of AISI 4140 steel. They developed their study using ANN and support vector regression (SVR) methods in order to predict the cutting force and surface roughness. The results were validated, and they confirmed the accuracy of these techniques. The performance of the methods showed a fairly high detection accuracy in this research.
Gupta [
30] conducted a study to predict surface roughness based on cutting speed, feed rate, and cutting time. The methods that were used to calculate surface roughness from output factors were response surface methodology, ANN, and support vector regression (SVR). The results showed that the ANN and SVR models achieved higher accuracy than the response surface methodology model. The ANN model achieved an MAE of 0.0631 and an RMSE of 0.0501. In comparison, the SVR model recorded an MAE of 0.0898 and an RMSE of 0.0795. The R
2 was 0.9013 for the ANN and 0.9273 for SVR.
Table 1 depicts a comprehensive assessment of available research on the industrial grinding and machining process that uses artificial neural networks as a ground truth approach to anticipate and determine a relationship between input parameters.
The literature review clearly shows that many studies consider a wide range of grinding parameters. However, implementing all parameters is not always cost-optimized, and some grinding processes do not require specific parameters. Notably, only one study considered coolant as a changing parameter, while other studies fixed it, usually as MQL. None of the studies considered different grinding wheels, which are typically set to one model. The effects of different coolants and grinding wheels remain largely unexplored.
Previous studies [
32,
33] included a grinding test dataset with three different grinding wheels, four different coolants, and seven depths of cut. This approach is promising for considering these two often-overlooked parameters in assessing surface roughness quality. Most studies enhance their models using ANN algorithms as the main method, with some also employing support vector machines (SVMs). These two methods are well-known and widely adopted due to their stability, as reported in the literature.
2. Materials and Methods
2.1. Experimental Setup
The experimental data were sourced from previous studies [
32,
33]. Surface grinding tests were conducted on hardened stainless steel (UNS S34700) using a 20 mm width M7135A-NANTONG SHUANGZANG-NC, “NANTONG SHUANGZANG-NC, Jiangsu, China” surface grinding machine in down-grinding (plunge) mode with three grinding wheels which are produced by “TYROLIT Co., Schwaz, Austria”. Several parameters were varied during the experiments to evaluate their effects on surface roughness and grinding force, while the remaining parameters were kept constant.
Table 2 provides details of the experimental setup and the constant parameters used in the current grinding trials.
This study examined how changes in grinding input parameters affect output conditions. Surface roughness and grinding forces (tangential and normal) were measured using seven different depths of cut, three types of grinding wheels with different abrasive types, grain sizes, and hardness degrees, and four cooling methods. The type of coolant and grinding wheel, which are qualitative factors, along with the depth of cut, a quantitative factor, directly impact surface roughness and grinding forces. Aluminum oxide type 89A is ideal for high-strength materials like alloy steels, while 88A is used for general purposes. Using 89A180K6V111 grinding wheel produced a harder grade (K) and a finer grit. Another type of wheel was made out of silicon carbide (C), which is sharper and more brittle than aluminum oxide. Details of these variable parameters and their levels are provided in
Table 3.
After the tenth pass, surface roughness and grinding forces were measured. In accordance with DIN EN ISO 3274:1998, the workpiece roughness was assessed using a mobile roughness measurement tool (DIAVITE AG) with a cut-off length of 0.8 mm. Surface roughness (R
z) across the grinding direction was measured at five different locations on the ground surface at the end of each test. Unlike average roughness parameters (R
a and R
q), R
z (the average distance between the highest peaks and lowest valleys) provides accurate insights into the peaks and valleys of the surface profile, which would be valuable in various areas for investigating its sensitivity in predictive capability. A four-component dynamometer (Kistler type 9272) was used to measure the grinding force components while the workpiece was clamped with specific equipment (see
Figure 1).
2.2. Dataset Evaluation
In a typical grinding process, numerous factors come into play. However, assessing all these variables at different levels necessitates a complex testing procedure, leading to costly and often impractical results in each trial. Considering the main factors that influence grinding quality and the machine’s longevity, we concentrated on key parameters such as the material removal rate, type of coolant, and the grinding wheel.
As previously mentioned, the testing process included seven different removal rates: 5, 15, 25, 35, 45, 55, and 65 μm. Four types of coolant were utilized: dry compressed air, MQL1, and MQL2 (which vary in oil types). Additionally, three types of grinding wheels were used as input parameters. All other significant factors were kept constant, as detailed earlier. The outputs measured were surface roughness and grinding force.
Table 4 presents a subset of the 84 datasets designed using a full factorial experimental design, divided into input and output factors, referred to as predictors and responses. The design was chosen to encompass all experimental conditions, ensuring accurate predictive modeling [
34].
2.3. ANOVA
ANOVA (Analysis of Variance) is a statistical method used to analyze differences among group means within a sample. In mechanical engineering, it is particularly useful for identifying key factors and interactions in experimental designs [
35]. This includes optimizing material properties, improving manufacturing processes, and assessing performance characteristics [
36]. ANOVA is often paired with Robust Design to pinpoint the parameters that most significantly affect the outputs during development [
37].
The ANOVA process starts with a comprehensive model that includes all main effects and interactions. To streamline the model without compromising its quality, factors with the highest
p-values are gradually removed. This ensures that the factors with the lowest
p-values, which have the most significant impact, are retained [
38,
39].
In this study, which involves multiple factors, N-Way ANOVA is used. Here, “n” represents the number of independent variables in the analysis. This method provides a thorough examination of the interactions among these factors and their overall effect on the dependent variable, offering deeper insights into the underlying dynamics [
40,
41].
2.4. Genetic Algorithm Optimization
The genetic algorithm (GA) is a widely used heuristic search optimization technique. It is a population-based algorithm that mimics the natural reproduction system, following Darwin’s principle of survival of the fittest. The process begins with the creation of an initial population of alternatives, known as chromosomes. These chromosomes are then randomly chosen from the population and undergo mutation (fitness function evaluation), crossover (gene swapping), and selection repeatedly until the termination criteria are satisfied [
42].
The GA focuses on the local search for the best solution based on fitness evaluation. Researchers have applied GAs in various fields [
43,
44,
45,
46]. In the current study, the GA was implemented using the MATLAB programming platform.
In the realm of grinding process models and computer-aided manufacturing, the GA serves as a foundation for optimizing grinding factors. Various factors influence the cost-effectiveness of machining operations, such as machine tool capacity, the geometry of the workpiece, and cutting conditions like speed, feed rate, and depth of cut [
47,
48]. Genetic algorithms provide a powerful method for optimizing the relationship between various factors in processes like grinding. For instance, they can identify the optimal production cost by considering machining conditions and achieve high-quality results while ensuring surface finish and minimizing surface damage. According to recent academic research, this approach is highly effective in balancing these critical parameters [
49,
50,
51,
52,
53].
2.5. Formula Evaluation
Given the numerous factors influencing grinding and their substantial impact on the final performance of workpieces, optimizing each variable is both challenging and complex. Fortunately, some settings are chosen by operators, and certain parameters are more critical than others. This study focuses on the effects of removal rate, coolant, and grinding wheel types to develop an analytical formulation linking surface roughness and grinding forces.
This overview addresses two main concerns. First, the type of coolant and grinding parameters, along with their levels, are qualitative and need to be converted into appropriate quantitative parameters for analytical methods. The results and discussion section details the suitable conversion method used in this study. Second, the structure of the evaluation formula is crucial. While the correlations between input factors and coefficients are infinite, the relationship between single and dual parameters with appropriate coefficients is sufficient.
Another important issue is that the number of variables and coefficients, treated as unknowns to be solved by the genetic algorithm (GA), should be fewer than the number of data points in the dataset. The GA relies on the dataset to solve the optimization problem, and adhering to this criterion is essential for achieving a unique solution. Consequently, the formulas in Equations (1)–(3) are used to evaluate the relationships between inputs and outputs for surface roughness (R
z), grinding force tangential direction (F
t), and grinding force in normal direction (F
n), respectively.
In the equations mentioned above, (xr) stands for the removal rate, (xc) represents the coolant type (each type is assigned a specific number), and (xw) signifies the grinding wheel type.
The coefficients (ai), (bi), and (ci) are associated with the formulas for surface roughness, and grinding force tangential and normal directions, respectively. These coefficients were determined using the GA.
2.6. Machine Learning Implementation
Predictive models for grinding outputs were developed using various grinding input parameters. A comparative study was conducted employing three algorithms: support vector machine (SVM), Gaussian process regression (GPR), and an artificial neural network (ANN). The performance of these algorithms was evaluated based on parameters such as material removal rate (MRR), grinding forces (Ft and Fn), and surface roughness (Rz).
2.6.1. Supported Vector Regression (SVR)
Support vector regression (SVR) is a technique derived from support vector machines (SVMs) that is used for analyzing continuous data. In 1996, this approach was first presented. Although SVMs were initially developed for discrete category classification problems, SVR extends this methodology to continuous data, enabling it to be used for numerical value prediction [
54].
Figure 2 also shows the SVR method implementation procedure.
SVR is really just a modified form of linear regression. But a lot of issues in the actual world are not just linear. As a result, SVR was adapted for use with nonlinear situations. By converting the data into a higher-dimensional feature space, kernel functions allow for linear separation. These are referred to kernel functions [
55]. In this case, the SVR method implementation procedure is shown in
Figure 2, and the regression function is described using an equation where the optimization goal is to find the flattest function in the feature space instead of the input space [
56].
Here,
and
represent Lagrange multipliers. A linear dot, product of the nonlinear mapping, defines the kernel function
. By minimising the following regularised risk functional, the coefficients
and
in Equation (4) are produced.
The trade-off is determined by the constant (
), and the term ω denotes the model complexity. The following is the definition of the ε-insensitive loss function, or
.
In the SVR approach, data is transformed using kernel functions from the input space—where linear separation is not feasible—to a higher-dimensional space, or kernel space, where a linear hyperplane may separate the data [
57]. Selecting the appropriate kernel function is essential. The most often-utilized kernel functions for SVR are the linear, polynomial, and Gaussian (RBF) kernels, as given in
Table 5, despite the fact that some of the literature offers up to 12 kernel functions [
58].
2.6.2. Gaussian Process Regression (GPR)
Gaussian process regression (GPR), in contrast to the SVR approach, is characterized by a distribution function that is determined by the mean and covariance functions. Using a specified covariance function and a distribution estimated function, the GPR model is defined on a given vector-valued data set to predict the outputs [
59,
60]. Because of this,
f(x) is defined as the weighted sum of the basic functions.
where
is the weight matrix related to the output,
is the collection of values for the N basis functions at
, and
is the white noise model with correlation
over noise models.
The kernel function, often referred to as the covariance function, is a technique for interpreting the mapping of input data—whether linear or nonlinear—to a feature space. The function values for any two inputs are displayed below, and
Table 6 displays two popular kernel functions.
Ultimately, the probability of a given function (f) is specified, and its solution yields the ideal weight value taking uncertainties and noise in the system into account. Unlike the support vector regression (SVR) model, this model also includes a robust solution.
Therefore, another well-known and effective machine learning example is the Gaussian process regression (GPR) model, which is improved using a Bayesian technique to offer an uncertainty evaluation on the predicted values.
Figure 3 also depicts the GPR models implementation approach.
2.6.3. Artificial Neural Network (ANN)
Artificial neural networks (ANNs) have been effectively used for many application problems, such as function approximation and classification; as universal approximators, ANNs are especially helpful as function approximators since they do not require prior knowledge of the input data distribution [
61].
Figure 4 depicts a schematic of an artificial neural network (ANN).
A Multi-Layer Perceptron (MLP), a feed-forward neural network that associates sets of input data with suitable outputs, is the foundation of the implemented ANN structure [
62]. As seen in
Figure 4, this structure consists of an input layer, an output layer with a single node, and a single hidden layer with variable nodes. Activation functions are essential components of artificial neural networks (ANNs) because they facilitate the learning and comprehension of intricate and nonlinear mappings between inputs and outputs. While there are more than ten activation functions available in the literature [
63,
64], the ReLU activation function has been proven to perform adequately in this study.
Similar to other machine learning techniques, artificial neural networks (ANNs) offer superior performance in predicting intricate relationships between related factors. ANNs can be applied to a wide range of issues due to their scalability and flexibility, particularly in mechanical engineering [
65,
66,
67].
2.7. Accuracy Metrics
Several metrics are available for the regression model, which is a nonlinear structure that has a complex relation between input values and output values. In some of the literature, up to 29 metrics are discussed [
68,
69] for just regression; however, it is not necessary to work with all of them in order to select the best model. Three primary measures are introduced here, and the current study employed them to provide a comprehensive understanding of performance by comparing metrics that improve model stability and accuracy in the actual world.
A popular statistic for assessing models is the root mean square error (RMSE), especially for normal (Gaussian) errors [
70].
The relative ranks of the models are unaffected by taking the root; however, it produces a metric with the same units as the input data, which can be used to conveniently describe the typical or “standard” error for normally distributed errors (Equation (9)).
Another metric that is frequently used as a loss function in some of the literature to indicate the mean error in the prediction process is the mean absolute percentage error (MAPE), which is a measure of the extent of absolute error in percentage terms [
69].
When used for nonlinear regression models or other nonlinear techniques, the Coefficient of Determination (R2) becomes highly complicated, as the result does not necessarily fall between 0 and 1 and can even be negative.
This problem affects the dependability of (R
2) in nonlinear regression, a point raised by some scholars [
71]. Eight distinct formulas for (R
2) that are present in the literature are discussed by Kvålseth [
72], who also points out how they differ, what confusion they produce, and frequent mistakes that may be made while using them. Nevertheless, the common formula (Equation (11)) measures the “goodness of fit” and, when combined with other metrics, offers a clear picture of the precision and stability of the applied model [
73].
In Equations (9)–(11), the values denoted by , , , and are the actual and expected values, and the actual and predicted mean values, respectively.
The evaluation was based on their R
2 and root mean square error (RMSE) score values and mean absolute percentage error (MAPE) [
9]. The models were trained using machine learning algorithms using the MATLAB regression learner tool.
2.8. Structure of Study
The current study is a multi-disciplinary research project that consists of two distinct fields: manufacturing grinding and machine learning approaches. This integration aligns with the realm of digitalization and sustainability solutions.
To carry out the research, the current study consists of five distinct parts, which are illustrated in
Figure 5. The first part (data preprocessing) involves reading data, concatenating (decoding) qualitative inputs to quantitative (coolant type and grinding wheel types), and implementing primary statistical analysis and ANOVA to identify the influence of each parameter on the final outputs (surface roughness and grinding force). The second part (formula evaluation) includes a novel formula generalization with genetic algorithm optimization to provide the direct and analytical solution between input and output parameters available in the full dataset, also providing accuracy analysis to identify the performance of the proposed formula. The third part (ML implementation) includes the development of the ML model, the optimization algorithm, and the implementation of all three mentioned methods (SVR, GPR, and ANN) to the proposed dataset after splitting it into train and test portions. Then, accuracy metrics calculate the performance of each model and are reported separately, especially based on different coolant solutions as promising study parameters. The fourth step is implementing global sensitivity analysis, which uses a model-dependent approach to identify the most and least influential parameters on the final solution (both for surface roughness and grinding forces). Finally, in the last step (the fifth one), all the implementations are compared together to provide better insight into the flow of current digitalization implementation.
3. Results and Discussion
3.1. Statistical Analysis and ANOVA
At first glance, statistical analysis is crucial for the current investigation. The dataset comprises 84 sets of grinding process data, which include only one numerical value: the removal rate. Additionally, there are four types of coolant and three different grinding wheels, which are qualitative data associated with the grinding process. During the process, the overall force in the tangential and normal directions was measured and, at the end, the surface roughness (R
z) was recorded.
Table 7 provides an overview of all the data collected during the process.
However, implementing statistical or analytical procedures like ANOVA on qualitative data is impossible. Therefore, encoding qualitative data to quantitative representation is necessary. In this study, the four different coolants (dry, MQL model 1, MQL model 2, and fluid) are assigned integer values of 1, 2, 3, and 4, respectively. Similarly, the three types of grinding wheels (Al2O3 180K6, Al2O3 80L6, and 120I8) are assigned integer values of 1, 2, and 3, respectively.
Table 8 presents a primary statistical analysis of the entire dataset. It is crucial to note that, for any future investigations, such as those using machine learning algorithms, the input data (predictors) should be within the same range of minimum and maximum values. The predicted output accuracy and error are measured with the mean value of real predictors like R
z.
In many applications of ANOVA, the sum of squares and
p-values are the most relevant outputs, while other values are intermediate steps leading to the
p-values, which indicate the results of significance tests. Therefore, the
p-value is the key result to consider. However, it is customary to present all computed data in a table to support claims of statistical significance in ANOVA. Each hypothesis is tested using an F-test, which compares the mean square for the source of variation (large when the source significantly affects the measurement) to the error mean square. To determine if a source of variation has a significant effect, examine its
p-value. If the
p-value is sufficiently small, such as
p < 0.05, the effect is considered significant [
74].
The ANOVA results for different response variables, shown in
Table 9,
Table 10, and
Table 11 for surface roughness (R
z) and grinding force in the tangential direction (F
t) and in the normal direction (F
n), respectively, clearly indicate that all the predictors (input values) significantly affect the responses, as evidenced by
p-values below 5%. This suggests that new parameters, such as types of coolant and grinding wheel, will significantly impact the output variables in each test. However, ANOVA does not quantify the extent of each variable’s effect. Therefore, additional methods, such as formula implementation and machine learning, are necessary to systematically measure the impact of each parameter on the output values and to create a platform for predicting new responses based on new predictors.
3.2. GA and Formula Generation
The genetic algorithm (GA) is an iterative process that starts with an initial value and aims to find an optimized solution by minimizing a predefined cost function. In this implementation, the cost function is the squared mean value error, as defined in Equation (9). The GA iterates until it reaches the minimum threshold for the cost function value, which in this case is below 0.001 after 20 iterations. The final simplified formulas for each response—surface roughness, grinding force in the tangential direction (F
t), and grinding force in the normal direction (F
n)—are provided in Equations (12), (13), and (14), respectively. The accuracy metrics are summarized in
Table 12.
The obtained equations demonstrate that all input variables significantly affect the output responses, as none were removed from the equations. The coefficients are substantial, indicating that even the interactions between variables are important and cannot be ignored.
From
Table 12, although the R
2 value for surface roughness is around 0.55 and that for grinding force in both directions (tangential and normal) is more than 0.6, the MAPE is quite high, exceeding 28%. This indicates that using these equations requires more careful consideration. However, the equations do provide a clear analytical approach for defining the relationship between removal rate and other variables. By implementing further optimization processes with different methods, it is possible to achieve better solutions. Furthermore, although this sophisticated implementation establishes a pioneering relationship between feature parameters (such as depth of cut, types of coolant, and types of grinding wheel) and response variables like surface roughness, it is highly dependent on the provided dataset.
3.3. Surface Roughness
The results of implementing machine learning algorithms on the dataset with respect to surface roughness are illustrated in
Figure 6,
Figure 7 and
Figure 8 for the SVR, GPR, and ANN methods, respectively. Before implementing the ML models, the dataset was split homogeneously into training and test sets with equal distribution based on coolant type. The main focus of this study, besides implementing ML models, is to investigate the effects of coolant type on the accuracy and stability of each model. Therefore, each part of the dataset includes 21 sets of data for each coolant. Randomly, three data points were selected, resulting in the training portion containing 72 sets of data and the test portion containing 12 sets of data. All 72 data points were used for training each model with hyperparameter optimization based on the Bayesian algorithm. The final trained version of each model was then implemented on the test data, and all results were shown based on coolant type in
Figure 6,
Figure 7 and
Figure 8.
From
Figure 7a,b, the correlation between real and predicted data in training is lower than in other methods, visually showing that both MQL coolants perform better than other coolant types, with a clearly lower distribution. Furthermore, the accuracy metrics for the GPR method clearly indicate the best solution compared to other ML implementations, as this provided the highest R
2 value. For the training process, the dry and MQL1 coolants yielded the best R
2 value of 0.93, while the mean absolute percentage error (MAPE) for the dry coolant is lower than for MQL (4.58% and 5.12%, respectively). In the second stage, the fluid coolant also provided acceptable values, with an R
2 of 0.87 and a MAPE of 4.64%. However, for MQL2, the R
2 is 0.76 and the MAPE is 6.91%, which is somewhat reasonable for the current study.
The SVR method, which has a negative R2 in the training process, is not acceptable, although it has a lower MAPE value, under 7% for MQL2 and fluid coolants. In the testing process, it provided reasonable solutions.
The ANN implementation in a single-layer configuration for the training process for dry and MQL1 coolants also provided acceptable solutions (with R2 values of 0.89 and 0.77 and MAPE values of 9.2% and 12.8%, respectively). However, the values for MQL2 and fluid coolants are far from the acceptable range, with R2 values of 0.54 and 0.46 and MAPE values of 7.9% and 7.6%, respectively.
The results from the testing process indicate how well the implemented methods can predict unseen data within an acceptable range. From
Figure 6b,
Figure 7b and
Figure 8b for the testing process of SVR, GPR, and the ANN, respectively, it is challenging to determine which implementation provided better results without considering the accuracy metrics for the testing data.
Table 13 and
Table 14 clearly show that the GPR method provided the best testing solution without any hesitation. For dry, MQL1, and fluid coolants, the R
2 values are 1.0, 0.95, and 0.99, respectively, which are the best results. However, MQL2, with an R
2 of around 0.57, introduces some variability in the final result. Furthermore, the average accuracy for dry, MQL1, and fluid coolants in the testing process is more than 97%, indicating that the GPR method can predict surface roughness with reasonable accuracy and stability.
In contrast, the ANN method only provided an acceptable solution for fluid coolant, with an R2 of 0.98 and a MAPE of around 3.6%. The SVR method showed R2 values of 0.87 and 0.82 for fluid and MQL1 coolants, respectively. However, both methods had negative R2 values for MQL2 and dry coolants, indicating that these two methods, with the selected optimized hyperparameters, are not suitable for any prediction procedure for the current dataset and require further optimization.
3.4. Grinding Force Tangential Direction (Ft)
The provided dataset also includes grinding force in the tangential and normal directions, which specifically influences surface roughness and surface temperature; however, the temperature data were not provided. In this section, the considered machine learning methods were retrained to predict grinding force, first in the tangential direction (discussed in this section) and then in the normal direction (discussed in the next section). Like the previous section, the dataset was split homogeneously into training and test sets, with the training portion containing 72 sets of data and the test portion containing 12 sets of data. All 72 data points were used for training each model with hyperparameter optimization based on the Bayesian algorithm.
The results of implementing machine learning algorithms on the dataset with respect to grinding force in the tangential direction are illustrated in
Figure 9,
Figure 10 and
Figure 11 for the SVR, GPR, and ANN methods, respectively. Before implementing the ML models, the dataset was split homogeneously into training and test sets with equal distribution based on coolant type. This illustration provides better insight into the influence of coolant type specifically on grinding force.
From
Figure 9a,
Figure 10a and
Figure 11a, it is clear that both the GPR and ANN methods have lower data correlation with the perfect prediction line, with MQL2 in the GPR method (illustrated with a black triangle mark) having the lowest correlation in the training process. However, making judgments from a visual graph without clear numerical data seems odd, so
Table 15 and
Table 16 tabulate the accuracy metrics of the training and testing solutions.
It is clearly shown that both the GPR and ANN implementations have reasonably robust solutions based on R2 values, which are higher than 0.97 for all coolant types in GPR and higher than 0.96 for the ANN. However, the MAPE value is a bit challenging in this scenario. In the training procedure for grinding force in the tangential direction, the MAPE value for the GPR method is 9.9% and 7.8% for dry and MQL2 coolants, respectively, while this value for MQL1 is about 26% and 65% for MQL1 and fluid, respectively. More consideration is needed for using these methods for these coolant types. In contrast, the MAPE value for the ANN is reasonably high, reaching around 20.5% and 45% for dry and MQL1, respectively, which shows that although the ANN method is a stable machine learning solution, the accuracy results for all coolant types did not provide a valuable solution for any future consideration and digital predictor platform.
Additionally, the SVR method has R2 training values ranging from 0.3 to 0.93 for different coolants indicating that, overall, the SVR does not provide a robust network configuration.
Figure 9b,
Figure 10b and
Figure 11b illustrate the testing procedure implemented on a part of the dataset that was not used for training. This action measures the ability of the trained network to predict unseen data. The results in
Figure 9b,
Figure 10b and
Figure 11b are quite similar, making it difficult to judge the accuracy of these methods solely from the figures. However, it is clearly evident that the SVR method did not provide an acceptable solution, as its data correlation is higher than that of the other methods.
Table 15 and
Table 16 in the test section show that both the GPR and ANN methods, with R
2 values greater than 0.97 for all coolant types, provided a sophisticated and stable solution. The MAPE values are lower than 5.2% for dry, MQL1, and fluid coolants (with the fluid coolant achieving a MAPE of about 1.3%), which is acceptable overall. However, the MAPE value for MQL2 is somewhat high (about 24%) and should be considered when using this method for MQL2 data.
The ANN method also shows MAPE results under 7.1% for dry, MQL1, and fluid coolants, indicating that this method is also suitable for any further data prediction. It is worth noting that, like the GPR method, the ANN method reaches a high MAPE value for MQL2, around 15.5%.
3.5. Grinding Force Normal Direction (Fn)
The same data for grinding force in the normal direction is also available, and the same implementation conducted for the tangential direction was applied to the normal direction. The final results are illustrated in
Figure 12,
Figure 13 and
Figure 14 for the SVR, GPR, and ANN methods, respectively. The portion size for training and testing implementation is similar to the previous section (72 sets of data for training and 12 sets of data for testing).
Clearly, from the training results (
Figure 12a,
Figure 13a and
Figure 14a), the data variation for the normal direction is more than for the tangential direction. Because of this, the correlation between real data and predicted data is high, and the data distribution around perfect prediction seems unsatisfactory.
Table 17 and
Table 18 provide accuracy metric results to give better insight into the models’ implementation.
For the training process, it is clearly shown that the GPR method, with an R2 of more than 0.90 compared to other methods, provided robust machine learning implementations. Furthermore, the R2 for MQL2 and fluid coolant is 0.96 and 0.98, respectively, while the MAPE results are 11.3% and 14.6%, respectively, which is reasonably acceptable in the engineering field. However, for dry and MQL1 coolant, although the R2 is high and acceptable, the MAPE is absolutely high (54% and 89%) indicating that, for this type of coolant, the learning process was not conducted successfully and needs more training or other optimization processes.
For the ANN method, the R2 results vary from 0.8 to 0.92, which is acceptable for this type of jittering data. While MQL2 has an R2 of around 0.89, its MAPE is 21%. The R2 value for fluid coolant is 0.92, while the MAPE is 53%, making it hard to make a robust decision for selecting an appropriate method. For dry coolant, the R2 is 0.8 and the MAPE is about 18%.
The SVR method, like other implementations in this study, could not provide a reasonable solution. The R2 varies from 0.43 to 0.89, clearly showing that the SVR’s robustness for some coolant methods is absolutely problematic. Although the MAPE results for the SVR method in the training process are lower than for other implementations, the method’s robustness confirms that the training process could not provide a stable solution.
From the training results, it is clear that the test results illustrated in
Figure 12b,
Figure 13b and
Figure 14b for the SVR, GPR, and ANN methods, respectively, are not reasonably perfect. Therefore, the accuracy metrics in
Table 17 and
Table 18 in the test section provide all the necessary results for the final judgment. Notably, the GPR results are the best among the others. The R
2 for GPR in the test section is about 0.98 for dry, MQL1, and MQL2, while the R
2 for fluid coolant is 0.87, which is acceptable. The MAPE results are also around 9% for dry, MQL1, and fluid, while the MAPE for MQL2 is slightly higher at around 25%.
For the ANN method in the testing procedure, the R2 is also high, exceeding 0.94 for dry, MQL1, and MQL2, while the R2 for fluid is about 0.86, which is acceptable. The MAPE results do not exceed 11.6% for dry, MQL1, and fluid, while the MAPE for MQL2 is 54%, indicating that using the ANN method for MQL2 requires some consideration and attention.
The test results for SVR in this situation are not acceptable, with R2 values ranging from 0.64 to 0.99, and the MAPE for MQL2 is extremely high at around 42%. Therefore, this method will not be accepted for any further consideration.
3.6. Sensitivity Analysis
The sensitivity analysis results for surface roughness and grinding force in the tangential and normal directions are illustrated in
Figure 15,
Figure 16,
Figure 17,
Figure 18,
Figure 19 and
Figure 20. The effects of input parameters on surface roughness are investigated in
Figure 15 and
Figure 16.
Figure 15 illustrates the effects of depth of cut (5, 35, and 65 µm) for each coolant solution on surface roughness, considering the Al
2O
3 80L6 grinding wheel. The figure clearly shows that surface roughness values increase as the depth of cut increases. According to the current trend, the GPR and ANN methods predict the real values accurately with low absolute error, while the SVR method does not provide sufficient results.
Another interesting result from
Figure 15 is that the MQL1 coolant provides the lowest surface roughness value, as proven by real values and the GPR and ANN implementations. In contrast, the dry method results in the highest surface roughness, which is expected from industrial results, because the possibility of friction and heat in this method is more than other ones, causing thermal damage and abrasive wear on the workpiece surface [
75].
Figure 16 specifically studies the effects of different grinding wheels at different depths of cut (5, 35, and 65 µm), with the cooling method fixed at MQL1 (the best coolant in this study). It clearly shows that surface roughness increases with the depth of cut for all grinding wheel types. Overall, the GPR and ANN prediction methods can reasonably track the real values with the lowest error. Furthermore, the Al
2O
3 80L6(2) grinding wheel achieves the lowest surface roughness among the others at a 5 µm depth of cut. At a 65 µm depth of cut, the surface roughness values for all grinding wheels are similar, with negligible differences. Ultimately, due to its fine grit size, this type of wheel is capable of grinding finer finishes that are important for reducing surface roughness. Aluminum oxide, the abrasive used for wheel components, is great at removing harder surfaces such as stainless steel, and this depicts it correctly [
76].
For grinding force in the tangential and normal directions,
Figure 17 and
Figure 18, respectively, illustrate the trend changes with respect to different cooling methods while the grinding wheel remains Al
2O
3 80L6. It clearly shows that the force increases dramatically in both directions as the depth of cut increases from 5 to 35 µm, and this increasing trend in grinding force continues as the depth of cut increases to 65 µm.
From the mentioned figures, the value of the force in the tangential and normal directions does not change significantly at each depth of cut for different cooling methods. However, with a more specific comparison, the value of the grinding force for MQL1 is lower than the others, although identifying the difference is somewhat complicated. In contrast, the dry cooling method results in a higher grinding force, which is expected from industrial implementations.
A noteworthy finding from
Figure 17 and
Figure 18 is the notable consistency observed between the predicted values produced by the ML models (GPR and ANN). In other words, this harmonious alignment in grinding force prediction in both directions, achieved through the GPR, ANN models, highlights the models’ robustness, precision, and dependability in reflecting real-world scenarios within the specified range.
For the final consideration in this section, comparing the results illustrated in
Figure 17 and
Figure 18 clearly shows that the overall value of the grinding force in the normal direction is significantly higher than the grinding force in the tangential direction at all depths of cut and for all coolant methods.
Figure 19 and
Figure 20 present the final results of the grinding forces in the tangential and normal directions with respect to different grinding wheels, while the coolant method is set to MQL1. The trend shows an increase in grinding force values as the depth of cut increases. Although the absolute value of each grinding force at each depth of cut appears to be similar, the 120I8(8) grinding wheel achieves slightly lower grinding force values. Additionally, the grinding force in the normal direction is significantly higher than in the tangential direction due to the nature of the grinding process.
In addition to these results, which have been extracted from a regression methodology via machine learning technologies, to model the surface roughness on a workpiece after grinding, a continuum mechanics approach can be applied that integrates the principles of material deformation and tool–workpiece interaction. Using finite element methods, the interaction of abrasive grains and process dynamics is simulated to predict surface topography, which can then be validated through comparison with experimental results. This approach may be regarded as a comprehensive subject for future research, particularly in relation to a comparison of this methodology with those based on machine learning regression models.
4. Conclusions
The modeling of the relationship between parameters that affect the grinding surface roughness and force does not usually take into consideration cooling type and grinding wheel. This paper presents a new approach by incorporating qualitative factors with quantitative parameters, such as depth of cut, as seen in previous work. Model-bound sensitivity analysis, such as ANOVA, for example, which is frequently illustrated in the literature, must be carried out. Results show that, for all parameters and especially for the encoded qualitative ones, the p-value is below 5%; therefore, these factors strongly influence surface roughness and grinding force. ANOVA, on the other hand, cannot comprehensively explain the relationship between inputs and outputs.
A subsequent analytical expression was then used to define the relationship between the input parameters and outputs, including interactions that have a direct impact on the outcome.
The result of this methodology was 18 unknown coefficients solved with genetic algorithm optimization using the dataset and mean square error as the objective function. The results of this gave three complex mathematical formulations, proving all input parameters and their interactions have an influence on the output because none of the 18 coefficients were zero.
The measurement of the accuracy for these formulas was approximately 0.55 with an average correctness of about 70%. While this technique is not very accurate, it does provide a reverse on the suitability of each parameter. This model fails terribly in scalability and flexibility since this technique strongly depends on the dataset. Changes in data size or type result in changes within the validity of the formula. This calls for the need to develop flexible and affordable techniques. The literature review was carried out, and machine learning algorithms of universal techniques were selected and optimized for regression analysis using the Bayesian optimization of hyperparameters. This state-of-the-art methodology describes the relations between predictors and responses involving depth of cut, cooling technique, and grinding wheel classification with respect to surface roughness and grinding force in tangential and normal orientations.
Among these, the GPR technique had the best performance both during training and testing, with R-squared values ranging from 0.75 to 1 for testing and from 0.57 to 0.99 for training during the surface roughness prediction. The effect of cooling type on the predictions was studied. In this respect, the MQL1 technique had very consistent and reliable results, reflected in the high R-squared values for both the training and testing stages. The grinding force is investigated in both the tangential and normal directions, with good results from both the GPR and ANN models. The GPR model achieved a R2 of 0.99 with an MAPE of about 10% for the training of the tangential direction, while a value of 0.98 with an MAPE of 14.8% was received for the normal direction.
The R2 for the ANN model is 0.97 and MAPE 8% in the tangential direction while, in the normal direction, this model reached a value of 0.92 and MAPE of 14.2%. Even though GPR has a slight advantage over the ANN, both models were found to be adequate for further deployment based on the digitalization approach. A sensitivity study has validated that all specified input parameters, such as surface roughness and grinding forces in both orientations, directly influence the final result. The present investigation has confirmed that converting qualitative factors into quantitative data is cardinal because these directly influence response values. Furthermore, the chosen machine learning techniques monitor trends effectively, and this has been validated through the correspondence of actual data.