Machine Learning of Surface Layer Property Prediction for Milling Operations

: Tool wear and cutting parameters have a signiﬁcant effect on the surface layer properties in milling. Since the relation between tool wear, cutting parameters, and surface layer properties is mostly unknown, the latter cannot be controlled during production and may vary from part to part as tool wear progresses. To account for this uncertainty and to prevent premature failure, components often need to be oversized or surface layer properties need to be adjusted in subsequent manufacturing processes. Several approaches have been made to obtain models that predict the surface layer properties induced by manufacturing processes. However, those approaches need to be calibrated with a considerable number of experimental trials. As trials are time-consuming and surface layer measurements are laborious, no industrial applications have been realized. Complex models have one major drawback. They have to be re-parameterized as soon as process characteristics change. Therefore, manual experimental parameterization does not appear to be a feasible approach for industrial application. A highly automated approach for the machine learning of the relation between tool wear, cutting parameters and surface layer properties is presented in this paper. The amount of obtained measurement data allows a fundamental analysis of the approach, which paves the way for further developments. model performance. Deep NN, random forests, and gradient boosting machine learning


Introduction
Surface layer properties (SLP) of a workpiece, such as microhardness, martensite content, and compressive or tensile residual stresses, are influenced by temperatures and temperature gradients, which occur in the cutting zone during milling [1]. Mechanical tension and high strain rates might lead to severe plastic deformation and thus to the formation of nanocrystalline structures, resulting in an increase in tensile strength [2]. Accordingly, SLP have a significant impact on component life and premature failure, e.g., compressive residual stresses in the surface layer that prevent crack propagation [3][4][5]. Since plastic deformation and cutting temperatures are the result of forming, shear, friction, material separation, and material redirection, which are subjected to disturbances such as increasing tool wear and rising tool temperature, the manufactured SLP vary considerably over tool life.
The SLP are in most cases not monitored by quality control in the manufacturing industry or used as a feedback value for further production. To prevent premature failure of a critical part caused by lack of process knowledge in manufacturing, it is often necessary to design parts with safety factors [6]. This results in heavy and oversized components. Downstream processes such as heat treatments [7], plasma nitriding [8], deep rolling [9], or shot peening [10,11] offer an industrially applicable possibility for the defined configuration of SLP. Major disadvantages of these approaches are the significant time and cost expenditures, which are caused by additional processes and necessary component inspection. Therefore, it would be desirable to at least keep SLP within defined limits during the milling process with cutting parameter adaption, which compensates the influence of increasing tool wear on SLP.
Since modern surface layer measurement hardware, e.g., Barkhausen noise or eddy current sensors, are susceptible to vibrations, electromagnetic fields, temperature deviations in the workpiece, and cooling lubricant, direct measurement of SLP parallel to the milling process remains challenging [12][13][14][15]. Several approaches have been made to overcome this challenge by teaching soft sensors or creating analytical or FE-models [5]. Due to the complexity of the milling process as well as the high experimental effort required to calibrate the models for the specific milling setup, no industrial application with direct SLP measurement parallel to the milling process has been realized to this date.
However, the interactions between manipulated variables of the cutting process, tool wear state, thermal and mechanical load in the surface layer and the manufactured SLP are subject of several preliminary works. Investigations on cutting parameters and disturbances on target variables such as hardness, residual stresses, and workpiece microstructure are presented in the following section.
Influence of cutting parameters on SLP are mentioned in standard reference works for different machining processes [1,16,17]. Several researchers report relations between cutting speed v c , tool feed f z , cutting width a e and the resulting SLP [3,[18][19][20][21][22]. Microhardness and residual stresses are influenced differently depending on the feed direction [23,24]. The influence of the cooling strategy for milling [25] and hard machining [26,27] was shown as well. Other preliminary works report the impact of ultrasonic assistance on SLP in milling [28,29]. Furthermore, it was shown that tool wear state has a significant effect on SLP [3,[30][31][32][33]. Other works showed the consequences of tool substrate [3,31], tool coating [34], cutting edge chamfer [3,35,36] and rake angle variation [37]. Different workpiece materials and initial material properties influence manufactured SLP as well [38][39][40]. Zahoor et al. [41,42] report on the influence of tool vibrations, which are heavily dependent on the specific spindle and tool holder, on resulting SLP. Ulutan and Özel [43] published a comprehensive review on machining-induced SLP changes. A more recent review was given by Sales et al. [5] in 2020.
SLP prediction and control during machining was subject to various research activities over the past 20 years. The proposed systems can be subdivided into numerical, analytical, empirical, and soft-sensing approaches. The different academic developments will be presented in the following section.
El-Axir [44] calculates residual stress profiles based on the assumption that they are only dependent on initial tensile strength and the cutting parameters. Wang et al. [45] developed an FE-based model for the simulation of residual stress profiles in high-speed machining, without experimental validation. Zhang et al. [46] trained a neural network (NN) using experimental data. The NN uses cutting parameters, coolant type, tool wear, workpiece material, and tool geometries as the input and outputs the longitudinal and circumferential residual stress profiles. The presented results show good agreement between predictions and experimental data. Umbrello et al. [47,48] taught an NN to predict subsurface residual stresses when inputting cutting parameters and initial workpiece hardness for hard turning. They consider purely empirical models to be inapplicable due to their excessive experimental expenditure, an FE-model to be neither suitable nor sufficiently robust, and therefore use a hybrid FE-NN approach. Saini et al. [49] developed linear regression models on residual stress for turning while neglecting tool wear. They observed depth of cut a p and tool feed f z as being relevant. However, the influence of the tool nose radius r β and cutting speed v c is seen as less relevant for modeling. Ulutan et al. [50] also judge analytical and FE-based approaches for the prediction of SLP as making too many assumptions and idealizations. Therefore, they proposed the fitting of sinusoidal decay functions on the residual stress profile. Wang et al. [51] also consider FE-simulations to be inaccurate and time consuming and fitted a decaying cosine function for the residual stress profile and calculated its parameters with a regression on cutting parameters. Schulze et al. [52] evaluated different approaches for modeling the phase transformations in machining. Huang et al. [53] developed an efficient analytical model for the prediction of the residual stress profile. However, validation with experimental data remains vague. Yang et al. [54] proposed an analytical model for residual stress calculation for the orthogonal cutting process which also took heat generation into account. Uhlmann et al. [55] proposed a FEM approach for hard turning with consideration of phase transformations. Stampfer et al. [56] considered cutting force measurement insufficient for the development of a soft sensor and propose further process measurands such as acoustic emission. Meurer et al. [57] proposed a surface integrity sensor based on cutting force to predict white layer thickness. The predicted white layer thickness values showed deviations of more than 100% to experimental data. Uebel et al. [40] proposed a soft sensor based on force, temperature, pneumatic, and angle resolved scattered light sensor measurement. Fricke et al. [58][59][60] developed an eddy current sensor measurement in combination with a cutting simulation for the online prediction of martensite content during turning.
For the realization in industrial applications, manufacturers must be enabled to parameterize SLP models for the specific process characteristics, workpiece, and machine setup on hand to make reliable predictions and adapt cutting parameter accordingly. One feasible approach is the automation of the experimental step to gain sufficient process data quickly and reliably. A demonstrator of such a setup has been realized by the authors and is discussed in this paper.

Materials and Methods
For the development of the presented approach, the UltraSonic C260 Composites machine tool by Sauer GMBH, Stipshausen, Germany, with a Sinumerik 840d SL CNCcontrol and an OPC UA SW 3.0 SP1-server of the company Siemens AG, Munich, Germany, was used. The installed milling spindle was of type MFW-1412/40 HSK-E50 CSC by Fischer AG Präzisionsspindeln, Herzogenbuchsee, Switzerland. The uncoated solid carbide end mill F4114 with a diameter of d = 16 mm, a helix angle of h a = 30 • , a rake angle of γ = 8 • , a clearance angle of α = 9 • , and four cutting edges by Schwegler Werkzeugfabrik GMBH & Co. KG, Vöhringen, Germany, was clamped in a tool holder powRgrip HSK F63/PG25x100H by REGO-Fix AG, Tenniken, Switzerland. The milling tool consists of K10 microsubstrate. As a workpiece, a rolled and sawed plate 42CrMo4 + QT (1.7225) with a tensile strength of R m = 1042 N/mm 2 by AG Der Dillinger Hüttenwerke, Dillingen, Germany, was used. The initial state of the workpiece was measured to maximum Barkhausen noise amplitude M Max,init = 0.28 V and coercive force H cm,init = 22 A/m. To avoid damage to the surface layer measurement hardware, only dry cutting experiments with air cooling were conducted. To keep the cutting width constant, cutter radius compensation G41 was turned on. Each cut was conducted as a straight line with the cutting width of a e = 0.5 mm and an axial depth a p = 30 mm at a cutting length of c l = 100 mm.
To obtain micromagnetic measurands, which can be employed to indirectly quantify the mechanical SLP, a 3MA-II-sensor by Fraunhofer IZFP, Saarbrücken, Germany, was used. The tool wear was quantified by measuring the tool radius reduction ∆r using the laser measurement system LaserControl NT by Blum-Novotest GMBH, Grünkraut, Germany. The micromagnetic measurands and tool radius reduction ∆r were measured after each cut. To gain statistical security, for each run nine surface layer measurements were performed, with the sensor being moved away and back again three times in between. For each milling process, power consumptions, electric currents and voltages of the Sinamics motor modules were recorded with a sampling rate of f s = 1 kHz using core current transformers KUW1/30-100 by Janitza Electronic GMBH, Lahnau, Germany.
After initialization, all cutting processes, micromagnetic measurements (MM) and tool wear measurements are conducted autonomously and one after the other. All actions of the machine tool and measurement equipment were controlled by a Python 3.9.5 script by Python Software Foundation, DE, USA. To trigger the 3MA-II-sensor to start measuring and to receive and send machine tool data, the Python script communicated with TwinCAT3 software by Beckhoff GMBH & Co. KG, Verl, Germany, running on the same computer via ADS. Measurement data of the 3MA-II-sensor were written to a txt-file and then fetched and parsed by the Python script. The communication between machine tool, laser measurement system, and PC was realized by an OPC-client in TwinCAT and the OPC-server running on the CNC-control. One experiment lasted around two minutes, depending on the cutting parameters. The setup and dataflow are visualized in Figure 1.
After initialization, all cutting processes, micromagnetic measurements (MM) and tool wear measurements are conducted autonomously and one after the other. All actions of the machine tool and measurement equipment were controlled by a Python 3.9.5 script by Python Software Foundation, DE, USA. To trigger the 3MA-II-sensor to start measuring and to receive and send machine tool data, the Python script communicated with TwinCAT3 software by Beckhoff GMBH & Co. KG, Verl, Germany, running on the same computer via ADS. Measurement data of the 3MA-II-sensor were written to a txtfile and then fetched and parsed by the Python script. The communication between machine tool, laser measurement system, and PC was realized by an OPC-client in TwinCAT and the OPC-server running on the CNC-control. One experiment lasted around two minutes, depending on the cutting parameters. The setup and dataflow are visualized in Figure 1. To identify cutting parameters, which result in desired MM, the system performed a non-elitist evolution strategy with nine offspring. Initial cutting parameters are set according to tool manufacturers' recommendations. Offspring i is then defined by its cutting velocity vc,Offspring,i and feed per tooth fz,Offspring,i using Equations (1) and (2): vc,Offspring,i = vc,Parent + signvc,i × mvc,i fz,Offspring,i = fz,Parent + signfz,i × mfz,i where signi is randomly set to either '+' or '−' and m is a normally distributed random value with a mean of µ = 0 and standard deviations σvc = 20 and σfz = 0.02, respectively. Standard deviations are adapted according to the 1/5 success rule which states that the number of successful mutations to all mutations is best set to srg = 1/5. If srg is higher, step size of the next generation g + 1 is increased and decreased when smaller than 1/5. Following, Equation (3) is used: ↔ sr g > 1/5 As soon as one offspring achieves acceptable MM, the evolution strategy is aborted and a design of experiments is conducted where this offspring serves as center point. To identify cutting parameters, which result in desired MM, the system performed a non-elitist evolution strategy with nine offspring. Initial cutting parameters are set according to tool manufacturers' recommendations. Offspring i is then defined by its cutting velocity v c,Offspring,i and feed per tooth f z,Offspring,i using Equations (1) and (2): where sign i is randomly set to either '+' or '−' and m is a normally distributed random value with a mean of µ = 0 and standard deviations σ vc = 20 and σ fz = 0.02, respectively. Standard deviations are adapted according to the 1/5 success rule which states that the number of successful mutations to all mutations is best set to sr g = 1/5. If sr g is higher, step size of the next generation g + 1 is increased and decreased when smaller than 1/5. Following, Equation (3) is used: As soon as one offspring achieves acceptable MM, the evolution strategy is aborted and a design of experiments is conducted where this offspring serves as center point. under investigation. Machine learning and model investigation was executed with GMDH Shell 3.8.9, GMDH Inc., New York, NY, USA, as well as RapidMiner Studio Free 9.9.002, Rapidminer GMBH, Dortmund, Germany, and the Python library scikit-learn 0.24.2.

Results
During the experiments with the presented setup, four milling tools reached the wear criterium. An overall of 826 experiments were conducted, which could be subdivided into 522 data points for evolution strategy and 304 for the design of the experiment. Analysis and processing of the measurement data obtained is presented in the following sections.

3MA-II-Sensor Measurements
The 3MA-II-sensor shows constant reproducibility for all measurement values. No heteroskedasticity was observed as the standard deviation showed a constant average of 0.007 V for all measured averaged values of maximum Barkhausen noise amplitude M Max in the range of 0.169 V to 0.332 V. The coercive force shows a constant reproducibility with a standard deviation of 0.32 A/m. As 3MA-II measurements were executed 3 × 3 times after each cut, a shift in maximum Barkhausen noise amplitude values could be observed ranging from 0 V to 0.0014 V on average between first and ninth measurement which is attributed to the workpiece cooling down and the sensor heating up. For three measurements, during which the sensor is not removed from the milled surface, absolute reproducibility of the measurand ranges between ±0.2 mV.

Cable Core Converters Measurements
Power consumption P M,Mean of the motor module was measured via the sum of the three phase currents of its power supply. Measurements showed high correlation with the tool radius reduction ∆r of r PM,Mean,∆r = 0.95. As tool radius reduction progressed from ∆r = 20 to ∆r = 180 µm, motor module power P M,Mean almost doubled from around P M,Mean = 2000 to more than P M,Mean = 4000 W. Deviations, which are also observable in Figure 2, mainly derived from varying the cutting parameters feed per tooth f z and cutting speed v c . Since power consumption was not constant during one experiment, the power measurement time series data was averaged for each cutting experiment. fz,low = fz,center − (fz,UL − fz,LL) × 0.05 Experiments were performed until a tool radius reduction of Δr = 175 µm was exceeded, which corresponded with a wear mark width of VB = 600 µm. Measurement data of the cable core converters, all 3MA-II-sensor outputs, and the tool radius measurement were logged to a SQL-database. These data were then analyzed and further processed to gain knowledge on MM and tool wear behavior of the milling process configuration under investigation. Machine learning and model investigation was executed with GMDH Shell 3.8.9, GMDH Inc., New York, NY, USA, as well as RapidMiner Studio Free 9.9.002, Rapidminer GMBH, Dortmund, Germany, and the Python library scikit-learn 0.24.2.

Results
During the experiments with the presented setup, four milling tools reached the wear criterium. An overall of 826 experiments were conducted, which could be subdivided into 522 data points for evolution strategy and 304 for the design of the experiment. Analysis and processing of the measurement data obtained is presented in the following sections.

3MA-II-Sensor Measurements
The 3MA-II-sensor shows constant reproducibility for all measurement values. No heteroskedasticity was observed as the standard deviation showed a constant average of 0.007 V for all measured averaged values of maximum Barkhausen noise amplitude MMax in the range of 0.169 V to 0.332 V. The coercive force shows a constant reproducibility with a standard deviation of 0.32 A/m. As 3MA-II measurements were executed 3 × 3 times after each cut, a shift in maximum Barkhausen noise amplitude values could be observed ranging from 0 V to 0.0014 V on average between first and ninth measurement which is attributed to the workpiece cooling down and the sensor heating up. For three measurements, during which the sensor is not removed from the milled surface, absolute reproducibility of the measurand ranges between ±0.2 mV.

Cable Core Converters Measurements
Power consumption PM,Mean of the motor module was measured via the sum of the three phase currents of its power supply. Measurements showed high correlation with the tool radius reduction Δr of rPM,Mean,Δr = 0.95. As tool radius reduction progressed from Δr = 20 to Δr = 180 µ m, motor module power PM,Mean almost doubled from around PM,Mean = 2000 to more than PM,Mean = 4000 W. Deviations, which are also observable in Figure 2, mainly derived from varying the cutting parameters feed per tooth fz and cutting speed vc. Since power consumption was not constant during one experiment, the power measurement time series data was averaged for each cutting experiment.  When teaching a combinatorial model in GMDH Shell, with ∆r as target value and f z , P M,Mean , v c and their cube roots as input variables, a coefficient of determination R 2 > 0.96 with mean absolute error MAE ∆r < 6 µm was reached on the testing data set. Further inputs to improve model performance could not be identified. Even when 80% of the dataset were held out for the testing data set, and only the remaining 20% were used for the training data set, a coefficient of determination R 2 > 0.95 was reached. For samples with a low radius reduction ∆r < 50 µm, model performance was declining. Since the tool wear rate was so high for the unused tool, only a few samples could be obtained in in the range 0 < ∆r < 50 µm. Therefore, the mean absolute error rises to MAE ∆r,∆r < 50 < 73 µm.
Still, online power measurement shows great potential for obtaining information about the state of flank wear conditions in the cutting zone and is further utilized in this paper for modeling and predicting the MM.

Relationship of Cutting Parameters, Tool Radius Reduction, and Micromagnetic Measurements
When cutting with constant cutting parameters, a significant influence of tool radius reduction ∆r on maximum Barkhausen noise amplitude M Max and further measurands of the 3MA-II-sensor was observed. When teaching a combinatorial model in GMDH Shell, with Δr as target valu fz, PM,Mean, vc and their cube roots as input variables, a coefficient of determination R 2 with mean absolute error MAEΔr < 6 µm was reached on the testing data set. Further i to improve model performance could not be identified. Even when 80% of the d were held out for the testing data set, and only the remaining 20% were used fo training data set, a coefficient of determination R 2 > 0.95 was reached. For samples w low radius reduction Δr < 50 µ m, model performance was declining. Since the tool rate was so high for the unused tool, only a few samples could be obtained in in the 0 < Δr < 50 µ m. Therefore, the mean absolute error rises to MAEΔr,Δr < 50 < 73 µ m online power measurement shows great potential for obtaining information abou state of flank wear conditions in the cutting zone and is further utilized in this pap modeling and predicting the MM.

Relationship of Cutting Parameters, Tool Radius Reduction, and Micromagnetic Measurements
When cutting with constant cutting parameters, a significant influence of tool r reduction Δr on maximum Barkhausen noise amplitude MMax and further measuran the 3MA-II-sensor was observed. For this study, constant parameters were ch according to the manufacturer's recommendations for the dry, down milling proces  When cutting with an unused tool, increasing feed per tooth in a range of fz = 0 0.04 mm also increases MMax, whereas an increase in cutting speed vc does not res significant change of the MMax. With increasing tool wear, the sensitivity to feed per changes diminishes at Δr > 160 µm. An increase in feed per tooth fz and cutting spe results in a decrease of MMax. The coercive force HCM shows a strong positive corre with feed per tooth fz and cutting speed vc for the entire tool life. The magnitude and When cutting with an unused tool, increasing feed per tooth in a range of f z = 0.02 to 0.04 mm also increases M Max , whereas an increase in cutting speed v c does not result in significant change of the M Max . With increasing tool wear, the sensitivity to feed per tooth changes diminishes at ∆r > 160 µm. An increase in feed per tooth f z and cutting speed v c results in a decrease of M Max . The coercive force H CM shows a strong positive correlation with feed per tooth f z and cutting speed v c for the entire tool life. The magnitude and sign of the influence of cutting parameter adaptations on MM can therefore assumed to be highly depended on tool wear state. Still, if only M Max was considered in the MM measurements, workpiece zones with higher thermal damage cannot be distinguished from surface layers with less thermal damage. Therefore, further measurands of the 3MA-II-sensor need to be taken into account, primarily the coercive force H CM [61]. The tendencial increase of coercive force H CM for a tool radius reduction of ∆r > 90 µm, as shown in Figure 3, indicates a re-hardening zone. When the coercive force limit is exceeded, re-hardening zones in the workpiece are supposed.

Adaptive Cutting Processes
Nevertheless, adapting the cutting parameters to compensate for the influence of tool wear on the MM is feasible. In order to keep the maximum Barkhausen noise amplitude below a limit of M Max < 0.25 V and the coercive force H CM below the limit of 28 A/m, a high feed per tooth of f z = 0.06 mm, and a high cutting speed of v c = 85 m/min can be chosen for the unused tool. Feed per tooth and cutting speed must be reduced with increasing tool radius reduction ∆r to about f z = 0.01 mm and v c = 45 m/min. Since further processing supposedly results in re-hardening zones, additional machining was aborted. Measurement data portraying this circumstance and trend lines are visualized in Figure 4.
of the influence of cutting parameter adaptations on MM can therefore assumed to be highly depended on tool wear state. Still, if only MMax was considered in the MM measurements, workpiece zones with higher thermal damage cannot be distinguished from surface layers with less thermal damage. Therefore, further measurands of the 3MA-II-sensor need to be taken into account, primarily the coercive force HCM [61]. The tendencial increase of coercive force HCM for a tool radius reduction of Δr > 90 µm, as shown in Figure 3, indicates a re-hardening zone. When the coercive force limit is exceeded, re-hardening zones in the workpiece are supposed.

Adaptive Cutting Processes
Nevertheless, adapting the cutting parameters to compensate for the influence of tool wear on the MM is feasible. In order to keep the maximum Barkhausen noise amplitude below a limit of MMax < 0.25 V and the coercive force HCM below the limit of 28 A/m, a high feed per tooth of fz = 0.06 mm, and a high cutting speed of vc = 85 m/min can be chosen for the unused tool. Feed per tooth and cutting speed must be reduced with increasing tool radius reduction Δr to about fz = 0.01 mm and vc = 45 m/min. Since further processing supposedly results in re-hardening zones, additional machining was aborted. Measurement data portraying this circumstance and trend lines are visualized in Figure  4.

Data-Driven Prediction of Micromagnetic Measurements
In order to investigate the performance of purely data-driven predictions of the MM different machine learning methods have been tested. As model input values feed per tooth f z , tool radius reduction ∆r, cutting speed v c , average power consumption P M,Mean , and the standard deviation of power measurement P M,Std were chosen. The model performance has been improved by adding the cube roots of all the mentioned features as inputs. Further input or adding further mathematical functions on the used input did not improve the model performance. Deep NN, random forests, and gradient boosting machine learning methods have been tested on the dataset as well. No significant improvement to the performance results presented below could be determined. For the presented methods, the coefficient of determination R 2 on the testing set decreases drastically if fewer than 70% of all samples are used for training. The testing set was fixed at 27 samples for all presented training set magnitudes. It can be observed that purely data-driven approaches to predict MM show a decrease in model performance when the training set is too small, as shown in Figure 5. With sufficient datapoints however, model performance with a coefficient of determination higher than R 2 = 0.94 was achievable.

Data-Driven Prediction of Micromagnetic Measurements
In order to investigate the performance of purely data-driven predictions of the MM different machine learning methods have been tested. As model input values feed per tooth fz, tool radius reduction Δr, cutting speed vc, average power consumption PM,Mean, and the standard deviation of power measurement PM,Std were chosen. The model performance has been improved by adding the cube roots of all the mentioned features as inputs. Further input or adding further mathematical functions on the used input did not improve the model performance. Deep NN, random forests, and gradient boosting machine learning methods have been tested on the dataset as well. No significant improvement to the performance results presented below could be determined. For the presented methods, the coefficient of determination R 2 on the testing set decreases drastically if fewer than 70% of all samples are used for training. The testing set was fixed at 27 samples for all presented training set magnitudes. It can be observed that purely data-driven approaches to predict MM show a decrease in model performance when the training set is too small, as shown in Figure 5. With sufficient datapoints however, model performance with a coefficient of determination higher than R 2 = 0.94 was achievable. For the GMDH NN and the combinatorial approach, it is shown that inputting the features aggregated from the power measurement time series leads to a performance increase. The GMDH neural network with the best performance on the testing set of R 2 = 0.94 showed a standard deviation of the residuals on the whole testing set of ΔMMax = 0.014 V. Equivalent results were obtained when training GMDH neural networks for other measurands of the 3MA-II-sensor. For example, when choosing coercive force HCM as target value with the same inputs as chosen for MMax, a coefficient of determination on the testing data set of R 2 > 0.95 can be achieved with a standard deviation of the residuals of ΔHCM < 0.3 A/m. For both critical measurands, the standard deviation of the residuals was within the magnitude of the reproducibility of the 3MA-II-sensor measurements investigated in Section 3.1.

Discussion
The demonstrator setup and the presented approach enables great possibilities for further research and the development of industrially relevant machine learning solutions. It can be concluded that machine learning approaches on modeling MM in combination with a highly automated execution of cutting experiments shows great potential for industrial application. Training data sets for teaching machine learning techniques need For the GMDH NN and the combinatorial approach, it is shown that inputting the features aggregated from the power measurement time series leads to a performance increase. The GMDH neural network with the best performance on the testing set of R 2 = 0.94 showed a standard deviation of the residuals on the whole testing set of ∆ MMax = 0.014 V. Equivalent results were obtained when training GMDH neural networks for other measurands of the 3MA-II-sensor. For example, when choosing coercive force H CM as target value with the same inputs as chosen for M Max , a coefficient of determination on the testing data set of R 2 > 0.95 can be achieved with a standard deviation of the residuals of ∆ HCM < 0.3 A/m. For both critical measurands, the standard deviation of the residuals was within the magnitude of the reproducibility of the 3MA-II-sensor measurements investigated in Section 3.1.

Discussion
The demonstrator setup and the presented approach enables great possibilities for further research and the development of industrially relevant machine learning solutions. It can be concluded that machine learning approaches on modeling MM in combination with a highly automated execution of cutting experiments shows great potential for industrial application. Training data sets for teaching machine learning techniques need to contain a sufficient number of samples. In addition, it was clearly shown that inputting online measurements to the MM prediction model can improve model performance considerably. The approach allows automated and comprehensive parameterization of models for the predictions of MM and, if calibrated, SLP for the specific milling configuration. Therefore, cutting parameters can be adapted in order to keep the MM within defined tolerance bands and avoid re-hardening zones. The introduced level of automation is not limited to the milling process or the prediction of MM. Since the approach is straight-forward, does not rely on complex simulation, and can be exercised for the specific milling process configuration at hand, further automation and development of this demonstrator appears worthwhile. The following steps will be taken in further research: • 3MA-II-sensor needs to be calibrated on SLP such as hardness and residual stresses in the workpiece, considering several MM; • Conducting experiments with more industrially relevant, coated mills; • Enabling of wet and ultrasonic assisted cutting processes, using the cooling capacity and ultrasonic amplitude as manipulative values; • Realization of a prediction of a SLP depth gradient; • Using white or gray box approaches to improve the model performance; • Investigations on surface layer emergence during the cutting operation with suitable in situ measurement setup.