Next Article in Journal
Advances in Foot Biomechanics and Gait Analysis
Next Article in Special Issue
Size Effect on the Strength Behavior of Cohesionless Soil Under Triaxial Stress State
Previous Article in Journal
The Development and Validation of an Artificial Intelligence Model for Estimating Thumb Range of Motion Using Angle Sensors and Machine Learning: Targeting Radial Abduction, Palmar Abduction, and Pronation Angles
Previous Article in Special Issue
Groundwater Discharge Limits of Mountain Tunnels Based on the Normal Growth of Typical Herbaceous Plants
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction Model for Cutterhead Rotation Speed Based on Dimensional Analysis and Elastic Net Regression

1
Xinjiang Shuifa Construction Group Co., Ltd., Urumqi 830000, China
2
Xinjiang Water Resources and Hydropower Survey and Design Research Institute Co., Ltd., Urumqi 830000, China
3
Faculty of Engineering, China University of Geosciences, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(3), 1298; https://doi.org/10.3390/app15031298
Submission received: 19 December 2024 / Revised: 21 January 2025 / Accepted: 24 January 2025 / Published: 27 January 2025
(This article belongs to the Special Issue Tunnel and Underground Engineering: Recent Advances and Challenges)

Abstract

:
The development and maturation of TBM (tunnel boring machine) technology have significantly improved the accuracy and richness of excavation data, driving advancements in intelligent tunneling research. However, challenges remain in managing data noise and parameter coupling, limiting the interpretability of traditional machine learning models regarding TBM parameter relationships. This study proposes a cutterhead rotation speed prediction model based on dimensional analysis. By utilizing boxplot methods and low-pass filtering techniques, excavation data were preprocessed to select appropriate operational and mechanical parameters. A dimensionless model was established and integrated with elastic net regression to quantify parameters. Using TBM cluster data from a water diversion tunnel project in Xinjiang, the accuracy and generalizability of the model were validated. Results indicate that the proposed model achieves high prediction accuracy, effectively capturing trends in cutterhead rotation speed while demonstrating strong generalizability.

1. Introduction

As the development of TBM (tunnel boring machine) technology and the continuous maturation of its core techniques, the application of TBMs in tunnel construction has become increasingly widespread. This has led to a substantial increase in the amount of excavation data collected, with a marked improvement in data accuracy. Effectively leveraging this data to propel TBM technology towards greater automation and intelligence has become a frontier topic in tunneling technology. However, the raw data generated during TBM operation is often highly complex and coupled, posing significant challenges in extracting valuable information while minimizing interference—key hurdles in achieving intelligent tunneling [1,2,3].
To address these challenges, researchers globally have extensively explored TBM data processing and model analysis. Data preprocessing is critical in TBM data analysis because raw data often contains noise and irrelevant information. For instance, Zhang et al. [4] employed the BIRCH (balanced iterative reducing and clustering using hierarchies) algorithm to efficiently compress massive TBM operation data, removing data points where torque, thrust, cutterhead rotation, or penetration equaled zero. Zhao et al. [5] applied K-nearest neighbors (KNN) for outlier detection and combined it with principal component analysis (PCA) for dimensionality reduction, thereby mitigating multicollinearity among features. Rispoli et al. [6] proposed guidelines for efficient data filtering tailored to TBM operations in hard rock, exploring various analytical and modeling techniques, including descriptive statistics, regression analysis, and neural network fitting. Erharter and Marcher [7] utilized methods including standstill data removal, data grouping with mean calculations, outlier elimination, and uniform data point spacing. Li et al. [8] used PCA to identify the parameters most correlated with redundant data, while Wang et al. [9] employed wavelet packet transform to denoise signals, retaining their temporal characteristics. Xu et al. [10] utilized smoothing methods and outlier detection to stabilize four key TBM indicators during the ramp-up phase of complete operational segments. Xiao et al. [11] introduced an automated data processing approach that accurately segmented excavation cycles by analyzing parameters such as spindle rotation speed, advance rate, and downtime. Shan et al. [12] effectively removed noise in time-series data through smoothing techniques, improving data clarity and predictability. Their study indicated that achieving a balance between noise reduction and preserving authentic data characteristics is critical, as excessive smoothing may obscure significant details, while insufficient smoothing could retain excessive noise, reducing model performance. Because label noise in geological data obtained from field investigations presents challenges for accurate predictions. Zhang et al. [13] proposed a confidence learning-based support vector machine (CL-SVM) model to eliminate label noise, enhancing the reliability of geological classification. Despite significant progress in preprocessing methods to improve model performance, limitations persist when addressing complex datasets.
In the realm of model analysis, machine learning, empirical models, and traditional theoretical models are prominent research directions [14,15,16]. In machine learning, Xu et al. [17] applied supervised methods such as KNN, chi-squared automatic interaction detection (CHAID), support vector machines (SVM), classification and regression trees (CART), and neural networks (NN) to predict TBM penetration rate (PR). Their findings identified uniaxial compressive strength as the most critical factor and cutterhead rotation speed as the least significant. Feng et al. [18] employed deep belief networks (DBN) to predict PR, cutterhead speed, torque, and thrust, introducing the field penetration index (FPI) to quantify TBM performance. Xu et al. [10] evaluated statistical and ensemble machine learning methods, including Bayesian ridge regression (BR), nearest neighbor regression, random forests, gradient tree boosting (GTB), and deep neural networks like convolutional neural networks (CNN) and long short-term memory (LSTM). GTB demonstrated the highest prediction accuracy, while BR achieved the shortest computation time. Studies revealed that ensemble methods excel in small datasets, whereas deep learning models show potential when ample data are available. Mahmoodzadeh et al. [19] explored Gaussian process regression (GPR), SVR, decision trees (DT), and KNN for predicting TBM disc cutter life. Yang et al. [20] used evolutionary polynomial regression (EPR) and random forests (RF) to develop models for predicting PR and FPI. Kwon et al. [21] proposed an interpretable machine learning model with data augmentation to predict abnormal disc cutter wear, considering mixed ground conditions. Shin et al. [22] developed a model for predicting cutter wear based on cutter travel distance and CHI intervals. Despite the advancements, machine learning’s “black box” nature limits interpretability. In contrast, empirical models and traditional theoretical models primarily rely on existing empirical data and fundamental theoretical frameworks to explore the dynamic relationships among excavation parameters through data mining and multivariate regression analysis. For instance, Kim et al. [23] developed an excavation efficiency model based on linear cutting tests, unveiling physical relationships among certain parameters. However, these models exhibit limitations when dealing with small datasets and addressing nonlinear relationships, which hinders their ability to accurately describe and predict real-world problems.
To address these challenges, recent studies have sought to integrate physical knowledge with machine learning, among which dimensional analysis has emerged as a widely used approach [24,25]. In the field of TBM tunneling, dimensional analysis has mainly been applied for data preprocessing, while its direct application in model development remains relatively unexplored. Effectively integrating dimensional analysis with TBM excavation data is thus a key research challenge.
TBM excavation involves numerous parameters, but for hard rock TBMs, cutterhead speed is one of the key operational parameters directly specified by operators. It determines the rotational speed of the cutterhead during rock excavation and significantly impacts tunneling costs, construction planning, and risk assessment [26]. Since TBM performance is highly sensitive to geological conditions and rock properties [27], matching the cutterhead speed to the geological conditions is critical. Mismatched speeds can cause severe wear on disc cutters during hard rock excavation, significantly increasing construction costs and reducing efficiency. For example, studies have shown that excessively high cutterhead speeds dramatically accelerate cutter wear, while appropriately increasing the penetration rate can effectively reduce wear rates [28,29].
Therefore, investigating the factors influencing cutterhead speed based on TBM field data and developing reliable predictive models are crucial for optimizing TBM performance, reducing construction risks, and controlling costs.
This study focuses on predicting TBM cutterhead rotation speed using dimensional analysis. By selecting appropriate operational and mechanical parameters, dimensionless variable relationships were constructed and combined with elastic net regression to develop a linear prediction model. Validation on an independent test set demonstrates that the proposed model achieves high prediction accuracy and strong generalizability. These results provide theoretical support for uncovering intrinsic relationships among TBM physical parameters.

2. Dimensional Analysis of Cutterhead Rotation Speed

Cutterhead rotation speed is influenced by multiple factors, including structural and operational parameters, making it a typical multi-variable coupling problem. Developing a prior knowledge model with physical significance is challenging to achieve through theoretical derivation alone. Dimensional analysis offers a robust approach by analyzing numerous physical parameters from the perspective of fundamental dimensions that reflect their intrinsic physical meanings. This method enables the construction of dimensionally homogeneous parameter models that incorporate certain physical mechanisms, thereby facilitating the development of interpretable and generalizable models for cutterhead rotation speed. To enhance the model’s practical applicability and explicit interpretability, this study employs the elastic net regression algorithm for model training.
The modeling process comprises three main steps:
(1)
Parameter Identification. Key parameters significantly influencing cutterhead rotation speed, derived from operational and structural factors, are identified through analysis.
(2)
Model Framework Construction. Based on the principles of the Π-theorem of dimensional analysis and physical constraints, a rational framework for the cutterhead rotation speed prediction model is established.
(3)
Model Training and Prediction. Using specific project data, the constructed framework is employed as input for training the elastic net regression algorithm, which is well-suited for analyzing the statistical characteristics of tunneling data. The algorithm identifies individual working condition features and enables quantitative predictions of cutterhead rotation speed.
By integrating mechanics, dimensional analysis, and engineering data mining, this approach balances model interpretability, generalizability, and accuracy. The workflow is depicted in Figure 1. This workflow combines classic dimensional analysis methods (such as variable selection and π theorem) with modern data-driven techniques (such as elastic network regression analysis and engineering data verification). Through this fusion, not only the scientific nature of the theoretical model is guaranteed, but also the applicability of the model in engineering practice is enhanced.

2.1. Variable Selection

In TBM excavation data, key operational and mechanical factors influencing cutterhead rotation speed (ω) include the cutterhead diameter (D), conveyor speed (Vc), cutterhead torque (T), advance speed (v), total thrust (F), gripper force (Fs), and penetration depth (P). The dimensional expressions of these parameters are summarized in Table 1.
The cutterhead rotation speed can be expressed as follows:
ω = f ( D , V c , T , v , F , F s , p )
Organize the parameters into a column vector q, where:
q = [ ω , D , V c , T , v , F , F s , P ] T
This can be reformulated into a vector form as:
q = f ( ω , D , V c , T , v , F , F s , p )
Equation (3) can be expressed through the dimensional matrix Ω as follows:
Ω = 1 0 2 1 1 1 1 1 0 0 1 0 1 1 0 0 1 1 2 1 2 2 0 0
In this paper, Ω represents the parameter dimension matrix, which is used to describe the dimensional decomposition of each parameter in terms of the basic physical quantities (length [L], mass [M], time [T]). Each column of the matrix corresponds to a parameter, such as ω, D, Vc, T, etc., while each row represents the exponent of the respective parameter in the length, mass, and time dimensions. Since the rank of dimensional matrix Ω is 3, three independent variables must be selected as base variables to perform dimensionless analysis on the remaining parameters [30]. In this study, the selection of base variables considers their critical influence on the target variable, dimensional simplicity, and mutual independence [31]. Advance thrust (F), cutterhead diameter (D), and advance speed (v) were selected as the base variables. These variables not only directly impact TBM performance but also minimize potential blind spots in data analysis. The cutterhead diameter (D) represents a fundamental structural parameter with a simple dimensional property, facilitating the extraction of relevant features for rotation speed prediction through dimensional analysis. The advance speed (v), as a key operational parameter in tunneling, is directly correlated with cutterhead rotation speed, making it essential for constructing models that reflect actual operational performance. Selecting F, D, and v adheres to the principles of dimensional analysis while enhancing the interpretability and predictive capability of the model. This selection ensures the development of a robust and physically meaningful model for cutterhead rotation speed prediction.

2.2. π-Theorem Modeling

Using the selected base variables, the remaining variables are combined into dimensionless groups as follows:
π 1 = ω D v , π 2 = V c v , π 3 = P D , π 4 = T F D , π 5 = F s F
Based on Equation (4), the cutterhead rotation speed can be expressed as: π 1 = f ( π 2 , π 3 , π 4 , π 5 ) .
By integrating the dimensionless groups mentioned above:
π 1 = ω D v = f ( V c v , P D , T F D , F s F )
π4, π5 primarily represent the influence of load factors on cutterhead rotation speed. These groups are combined through a product transformation into a single dimensionless group, denoted as T F s D F 2 , which captures the impact of load-related operational factors on rotation speed. The other two dimensionless groups, π 2 = V c v , π 3 = P D , represent the effects of speed and length factors, respectively, on the cutterhead rotation speed.
From a mechanical and operational perspective, the interactions among the three dimensionless groups are minimal and can be approximated as linear relationships. Therefore, the model can be expressed as:
ω D v = k 1 V c v + k 2 F s T D F 2 + k 3 P D
Equation (7) presents a linear model derived from dimensional analysis to describe the relationship between the cutterhead rotation speed ω and key operational and mechanical parameters. Each dimensionless term in the model represents specific contributions: the first term, k 1 V c v , indicates the influence of the conveyor speed to advance speed ratio; the second term, k 2 F s T D F 2 , captures the combined effects of thrust, torque, and cutterhead diameter as load-related factors; and the third term, k 3 P D reflects the impact of the penetration depth to cutterhead diameter ratio. This linear model not only simplifies the analysis of complex mechanical systems but also provides a robust theoretical framework for predicting cutterhead performance under various operational conditions.

3. TBM (Tunnel Boring Machine) Data Processing

3.1. Study Area

The data analyzed in this study originates from a water supply project in Xinjiang, with a total length of approximately 285 km and tunnel diameters ranging from 6.8 to 8.9 m. The construction employed a combination of drill-and-blast methods and open TBM excavation. The TBM used in the project had a cutterhead diameter of 7.8 m. The excavated section primarily consisted of Variscan metamorphic granite, metamorphic biotite granite, and biotite quartz schist, characterized by high integrity, significant rock strength, and minimal joint and fracture development.
The overall tunnel section was intact, with some localized fragmentation. The predominant rock mass classifications were Class II and Class IIIb, with Class IIIa and IIIb accounting for 98.97% of the total rock mass. A 2.5 km long faulted and fractured zone, constituting 0.93% of the total length, was present along the tunnel route. Groundwater in the excavated section was mainly bedrock fissure water, and the tunnel area was located in a region with low groundwater recharge, resulting in minimal seepage. The data acquisition segment spanned terrain with mild undulations, characterized by exposed bedrock and residual aeolian hills.

3.2. Data Preprocessing

This study utilized excavation data from TBM No. 2 as the input dataset. During TBM operation, sensors recorded data at a frequency of seconds, generating massive datasets. These datasets included construction parameters such as total thrust, cutterhead torque, and cutterhead rotation speed, as well as auxiliary system data, such as hydraulic oil tank temperature and main drive cooling water temperature.
Because the sensors operated continuously—even during non-excavation periods such as downtime and maintenance—these datasets contained a significant proportion of non-operational data. These included periods dedicated to surrounding rock support, equipment maintenance, and machine downtime, which were not relevant to the study. Additionally, external disturbances and sensor malfunctions introduced anomalous data points, resulting in datasets that were vast yet sparse in meaningful information.
To improve the accuracy and reliability of the sensor data, reasonable preprocessing techniques were used, including numerical filtering, boxplot analysis, and low-pass filtering. Therefore, data preprocessing was essential to ensure the quality of the input data and the reliability of the analysis results.

3.2.1. Removal of Invalid Data

Due to the operational characteristics of TBMs, the actual excavation work time constituted only a small portion of the total construction time. As a result, the stored data contained large volumes of non-operational data, such as those related to surrounding rock support, equipment maintenance, and downtime. Since these periods were outside the scope of this study, such data were removed. Moreover, the complete excavation cycle was divided into four phases based on TBM excavation data characteristics: idle pushing, ramp-up, steady-state, and ramp-down. Data from the idle pushing and ramp-down phases were classified as invalid rock-breaking data and subsequently removed. As illustrated in Figure 2, gaps between dataset intervals correspond to non-operational segments, which were excluded during preprocessing to focus solely on effective excavation data.
When any operational parameter, such as advance speed, cutterhead rotation speed, or total load, is zero or near zero, it is typically considered as “idle pushing” data or non-operational data. Such data points are removed through direct numerical filtering in this study. The following table shows the statistical values of the processed data.
Table 2 shows the statistical summary of the dataset, partitioned by disc rotational speed. It includes the frequency, maximum, mean, and standard deviation of both rotational speed and torque. The disc speed is mostly concentrated in the 6–8 range, representing the stable operation zone. In contrast, the torque shows greater fluctuations, better reflecting the data variability. Therefore, torque is selected as the key metric for evaluating processing effectiveness.

3.2.2. Outlier Processing

Statistical analysis revealed the presence of outliers in TBM operational data. These anomalies may arise from sensor malfunctions, operator errors, or system glitches, all of which hinder subsequent analysis. Therefore, these outliers must also be removed. As illustrated in Figure 3, the blue boxes highlight the detected outliers.
The boxplot is a simple, efficient, and intuitive data visualization tool commonly used to describe data distributions and identify outliers. It is constructed from several key quartile values, including the first quartile (Q1), median (Q2), and third quartile (Q3), as well as the upper and lower whiskers, which represent the range of typical data values. These values effectively display the central tendency, dispersion, and shape of the data distribution.
A boxplot divides the data into quartiles, where the data inside the box is considered normal. The whiskers reflect the approximate range of the data distribution. The boxplot’s shape indicates whether the data distribution is symmetric and identifies outliers falling outside the whiskers.
The boxplot is particularly advantageous because it is robust to variations in data distribution, making it ideal for summarizing the distribution characteristics of data, comparing differences between datasets, and filtering outliers. In this study, the boxplot method was employed to eliminate outliers. The results of the outlier removal are shown in Figure 4.

3.2.3. Data Denoising

During TBM operations, machine vibrations consistently generate noise in the collected data. Noise in unprocessed data can significantly affect the prediction accuracy of elastic net regression models. Through proper preprocessing, the noise data is removed, which enhances the stability of the model, improves the extraction of the relationship between the cutterhead speed and other parameters, and reduces the overfitting phenomenon. To minimize the impact of noise on the model’s solution speed, this study utilized a low-pass filter for denoising. Low-pass filters smooth the data by restricting high-frequency components while effectively preserving signal variation. Figure 5 compares torque data before and after filtering. The blue box indicates a magnified section, clearly showing that the filtered data is smoother and better suited for subsequent analysis.

4. Elastic Net Regression Analysis

4.1. Elastic Net Regression Setup

When training and regressing the dimensionless model based on engineering data, selecting a suitable method is crucial. Due to challenges such as multicollinearity caused by issues like idle pushing, cutterhead jamming, and parameter regulation delays, along with the presence of anomalous data points, a regression method robust to these issues is required. Elastic net regression excels in handling multicollinearity while balancing variable selection and regularization. By combining L1 and L2 regularization, this method not only selects features but also retains some correlated yet less critical variables, ensuring that significant data is not overlooked. Its applicability and stability make it particularly suitable for dimensionless models predicting TBM cutterhead rotation speed, significantly enhancing the model’s generalizability and predictive accuracy.
To divide the dataset, a 0.2–0.8 split was applied, where 20% of the data was reserved for testing and 80% was used for training. The values of the L1 and L2 regularization parameters were determined using 10-fold cross-validation. This process involves randomly dividing the dataset into 10 subsets. In each iteration, one subset is used as the validation set, while the remaining nine subsets are used as the training set for model training and evaluation. This process is repeated 10 times, with each subset serving as the validation set in turn. The average of the evaluation metrics across all iterations is used to assess model performance for the current hyperparameter combination. This approach identifies the optimal L1 and L2 parameters, enhancing the model’s stability and predictive capability on unseen data.
In this study, the combined parameter L1_ratio was employed to automatically balance the influence of L1 and L2 regularization. The optimal L1_ratio value was determined through cross-validation, ensuring that both regularization effects are effectively considered to achieve the best model performance.

4.2. Regression Analysis and Validation

Through 10-fold cross-validation, the optimal L1_ratio was determined to be 0.1. Using this ratio, the parameters of the dimensionless rotation speed model were calculated as follows: k1 = 6.25 × 10−2, k2 = −2.71 × 10−3, k3 = 1.91 × 10−1.
The resulting dimensionless rotation speed model is expressed as:
ω D v = 6.25 × 10 2 V c V 2.71 × 10 3 F s T D F 2 + 1.91 × 10 1 P D
To evaluate the predictive ability and effectiveness of the model, it was applied to the test set for cutterhead rotation speed prediction and compared to the actual values. The results demonstrated a coefficient of determination (R2) of 0.954, indicating a high goodness-of-fit and the model’s ability to effectively capture the relationship between input and output variables.
The mean squared error (MSE), a key metric for assessing model error, was calculated as 0.00286. This low MSE value confirms the minimal discrepancy between the predicted and actual values, further validating the model’s accuracy.
Figure 6 shows the residual plot of the model, which is based on the results of the test set. The residuals predominantly range between −0.1 and 0.1, indicating small prediction errors and excellent model fit. The residuals exhibit uniform fluctuations around zero, suggesting no significant systematic bias in the model. Furthermore, the concentration of residuals within a narrow range reflects a low noise level in the data, validating the effectiveness of the preprocessing steps.
To further validate the model’s reliability, the predicted dimensionless rotation speed values were compared with the actual values, and an error percentage plot was generated. As shown in Figure 7 and Figure 8, the predicted values closely align with the observed values, demonstrating the model’s ability to accurately predict cutterhead rotation speed. Most prediction errors fall within 15%, with 90% of the data showing errors below 10%. This strongly indicates that the constructed rotation speed prediction model effectively captures variations in cutterhead rotation speed observed in real-world engineering scenarios, highlighting its practicality and reliability.

5. Discussion

This study, based on dimensional analysis and elastic net regression, developed a prediction model for the cutterhead rotation speed of tunnel boring machines (TBMs). Using real-world engineering data and excavation parameters, the model aims to enhance construction efficiency and safety. The following points are discussed:
  • In this study, we explored the factors influencing the cutterhead rotation speed of tunnel boring machines (TBMs). By identifying the key parameters affecting cutterhead speed, operators are empowered to quickly diagnose issues and adjust parameters accordingly when problems arise. This ability to quickly pinpoint and address the factors impacting cutterhead performance not only enhances operational safety but also improves overall construction efficiency. For example, when encountering unexpected ground conditions or equipment malfunctions, operators can refer to the model’s predictions to identify whether the issue stems from inappropriate rotation speed or other operational parameters. By adjusting the cutterhead speed based on real-time feedback, operators can prevent potential cutter fatigue, minimize rock disturbances, and avoid machine downtime, ultimately optimizing the tunneling process and ensuring timely project completion.
  • High-quality data are essential for accurate predictions. This study utilized numerical filtering, boxplot analysis, and low-pass filtering to effectively reduce noise and outliers, improving data reliability. Future work could further enhance data quality by improving sensor precision and integrating anomaly detection and data imputation techniques.
  • Despite the promising results, the study has the following limitations: (1) The dataset was sourced from a single project, requiring validation under diverse geological conditions. (2) The model’s ability to dynamically adjust parameters in real time has not been tested.
  • Despite the encouraging results of this study, future research should focus on the following aspects: (1) Dynamic real-time prediction: Develop a model that can dynamically adjust the cutterhead speed in real time according to tunnel conditions, combined with a real-time monitoring system to cope with unforeseen operational challenges. (2) Predictive maintenance: Integrate machine learning models to predict TBM downtime and maintenance needs, proactively schedule maintenance and reduce downtime by identifying abnormal patterns in the data. (3) Cross-project and geological validation: Apply the model to different projects and geological conditions for validation to enhance the versatility and accuracy of the model.

6. Conclusions

This study developed a TBM cutterhead rotation speed prediction model based on dimensional analysis, integrating physical knowledge with data analysis to address the traditional limitations in interpreting parameter relationships. The key findings are as follows:
(1)
The TBM cutterhead rotation speed prediction model, established using dimensional analysis and elastic net regression, achieved accurate predictions after comprehensive data preprocessing. Validation on an independent test set demonstrated the proposed model’s strong predictive ability and broad applicability.
(2)
By employing numerical filtering, boxplot analysis, and low-pass filtering techniques, the noise and interference from outliers in the excavation data were significantly reduced. This improved the model’s stability and prediction accuracy, providing crucial theoretical support for advancing intelligent tunneling with shield TBMs.
This study developed a tunnel boring machine (TBM) cutterhead rotation speed prediction model based on dimensional analysis and elastic net regression. By integrating physical knowledge with data analytics, it addresses the limitations of traditional methods in explaining parameter relationships. The main conclusions are as follows:
(1)
This study successfully developed a tunnel boring machine (TBM) cutterhead rotational speed prediction model based on dimensional analysis and elastic net regression. The validation results demonstrated a high degree of consistency between the predicted and actual values, with the majority of prediction errors within 15%, and 90% of the errors below 10%. These results indicate that the model can effectively and accurately predict the cutterhead rotational speed.
(2)
Through data preprocessing techniques such as numerical filtering, boxplot analysis, and low-pass filtering, significant reductions in noise and outliers in the excavation data were achieved. This further enhanced the model’s stability and prediction accuracy, providing reliable support for data-driven modeling in complex operating conditions.
(3)
By integrating dimensional analysis with elastic net regression optimization, the study effectively balanced the L1 and L2 regularization terms, ensuring both the physical interpretability of the model and its adaptability. This provides a theoretical foundation for capturing the complex relationships between the TBM cutterhead rotational speed and input parameters.
(4)
This research offers theoretical support for the intelligent tunneling of TBMs. Future work could focus on improving data quality, expanding the range of model parameters, and combining machine learning techniques for predictive maintenance, further enhancing the operational efficiency and stability of TBM operations.
In summary, this study successfully constructed a TBM cutterhead rotational speed prediction model based on dimensional analysis and elastic net regression, validating its high accuracy and broad applicability under various working conditions. By optimizing data processing methods and parameter selection, the model not only improved the stability and prediction accuracy of TBM operations but also laid the foundation for the further development of intelligent tunneling technology. Future research should aim to further enhance data quality, expand the model’s applicability, and incorporate intelligent maintenance technologies to improve the efficiency and safety of TBM operations.

Author Contributions

J.L.: writing—original draft preparation, data curation; F.L.: investigation, data curation; K.W.: writing—original draft preparation, methodology; C.Z.: writing—review and editing, conceptualization, project administration. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Some or all data, models, or code that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

Author Junsheng Liu was employed by the company Xinjiang Shuifa Construction Group Co., Ltd. And author Feng Liang was employed by the company Xinjiang Water Resources and Hydropower Survey and Design Research Institute Co., Ltd. The remaining authors declare that the re-search was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Zhang, Y.; Zhou, J.; Qiu, Y.; Armaghani, D.J.; Xie, Q.; Yang, P.; Xu, C. A visual survey of tunnel boring machine (TBM) performance in tunneling excavation: Mainstream direction, brief review and future prospects. Appl. Sci. 2024, 14, 4512. [Google Scholar] [CrossRef]
  2. Li, J.B.; Chen, Z.Y.; Li, X.; Jing, L.J.; Zhang, Y.P.; Xiao, H.H.; Wang, S.J.; Yang, W.K.; Wu, L.J.; Li, P.Y.; et al. Feedback on a shared big dataset for intelligent TBM Part I: Feature extraction and machine learning methods. Undergr. Sp. 2023, 11, 1–25. [Google Scholar] [CrossRef]
  3. Li, J.B.; Chen, Z.Y.; Li, X.; Jing, L.J.; Zhang, Y.P.; Xiao, H.H.; Wang, S.J.; Yang, W.K.; Wu, L.J.; Li, P.Y.; et al. Feedback on a shared big dataset for intelligent TBM Part II: Application and forward look. Undergr. Sp. 2023, 11, 26–45. [Google Scholar] [CrossRef]
  4. Zhang, Q.; Liu, Z.; Tan, J. Prediction of geological conditions for a tunnel boring machine using big operational data. Automa. Constr. 2019, 100, 73–83. [Google Scholar] [CrossRef]
  5. Zhao, J.; Shi, M.; Hu, G.; Song, X.; Zhang, C.; Tao, D.; Wu, W. A data-driven framework for tunnel geological-type prediction based on TBM operating data. IEEE Access 2019, 7, 66703–66713. [Google Scholar] [CrossRef]
  6. Rispoli, A.; Ferrero, A.M.; Cardu, M. TBM data processing for performance assessment and prediction in hard rock. In Tunnels and Underground Cities: Engineering and Innovation meet Archaeology, Architecture and Art; CRC Press: Boca Raton, FL, USA, 2020; pp. 2940–2949. [Google Scholar]
  7. Erharter, G.H.; Marcher, T. MSAC: Towards data driven system behavior classification for TBM tunneling. Tunn. Undergr. Sp. Technol. 2020, 103, 103466. [Google Scholar] [CrossRef]
  8. Li, J.; Li, P.; Guo, D.; Li, X.; Chen, Z. Advanced prediction of tunnel boring machine performance based on big data. Geosci. Front. 2021, 12, 331–338. [Google Scholar] [CrossRef]
  9. Wang, R.; Li, D.; Chen, E.J.; Liu, Y. Dynamic prediction of mechanized shield tunneling performance. Automa. Constr. 2021, 132, 103958. [Google Scholar] [CrossRef]
  10. Xu, C.; Liu, X.; Wang, E.; Wang, S. Prediction of tunnel boring machine operating parameters using various machine learning algorithms. Tunn. Undergr. Sp. Technol. 2021, 109, 103699. [Google Scholar] [CrossRef]
  11. Xiao, H.H.; Yang, W.K.; Hu, J.; Zhang, Y.P.; Jing, L.J.; Chen, Z.Y. Significance and methodology: Preprocessing the big data for machine learning on TBM performance. Undergr. Sp. 2022, 7, 680–701. [Google Scholar] [CrossRef]
  12. Shan, F.; He, X.; Armaghani, D.J.; Sheng, D. Effects of data smoothing and recurrent neural network (RNN) algorithms for real-time forecasting of tunnel boring machine (TBM) performance. J. Rock Mech. Geotech. Eng. 2024, 16, 1538–1551. [Google Scholar] [CrossRef]
  13. Zhang, X.; Zhang, X.; Liu, Q.; Xie, W.; Tang, S.; Wang, Z. TBM big data preprocessing method in machine learning and its application to tunneling. J. Rock Mech. Geotech. Eng. 2024. [Google Scholar] [CrossRef]
  14. Ayawah, P.E.; Sebbeh-Newton, S.; Azure, J.W.; Kaba, A.G.; Anani, A.; Bansah, S.; Zabidi, H. A review and case study of Artificial intelligence and Machine learning methods used for ground condition prediction ahead of tunnel boring Machines. Tunn. Undergr. Sp. Technol. 2022, 125, 104497. [Google Scholar] [CrossRef]
  15. Samantaray, S.; Sahoo, A.; Baliarsingh, F. Groundwater level prediction using an improved SVR model integrated with hybrid particle swarm optimization and firefly algorithm. Clean. Water 2024, 1, 100003. [Google Scholar] [CrossRef]
  16. Mostafa, S.; Sousa, R.L.; Einstein, H.H. Toward the automation of mechanized tunneling “exploring the use of big data analytics for ground forecast in TBM tunnels”. Tunn. Undergr. Sp. Technol. 2024, 146, 105643. [Google Scholar] [CrossRef]
  17. Xu, H.; Zhou, J.; Asteris, P.G.; Jahed Armaghani, D.; Tahir, M.M. Supervised machine learning techniques to the prediction of tunnel boring machine penetration rate. Appl. Sci. 2019, 9, 3715. [Google Scholar] [CrossRef]
  18. Feng, S.; Chen, Z.; Luo, H.; Wang, S.; Zhao, Y.; Liu, L.; Ling, D.; Jing, L. Tunnel boring machines (TBM) performance prediction: A case study using big data and deep learning. Tunn. Undergr. Sp. Technol. 2021, 110, 103636. [Google Scholar] [CrossRef]
  19. Mahmoodzadeh, A.; Mohammadi, M.; Ibrahim, H.H.; Abdulhamid, S.N.; Ali, H.F.H.; Hasan, A.M.; Khishe, M.; Mahmud, H. Machine learning forecasting models of disc cutters life of tunnel boring machine. Automa. Constr. 2021, 128, 103779. [Google Scholar] [CrossRef]
  20. Yang, J.; Yagiz, S.; Liu, Y.J.; Laouafa, F. Comprehensive evaluation of machine learning algorithms applied to TBM performance prediction. Undergr. Sp. 2022, 7, 37–49. [Google Scholar] [CrossRef]
  21. Yang, W.; Zhao, J.; Li, J.; Chen, Z. Probabilistic machine learning approach to predict incompetent rock masses in TBM construction. Acta Geotech. 2023, 18, 4973–4991. [Google Scholar] [CrossRef]
  22. Kwon, K.; Choi, H.; Jung, J.; Kim, D.; Shin, Y.J. Prediction of abnormal TBM disc cutter wear in mixed ground condition using interpretable machine learning with data augmentation. J. Rock Mech. Geotech. Eng. 2024. [Google Scholar] [CrossRef]
  23. Shin, Y.J.; Kwon, K.; Bae, A.; Choi, H.; Kim, D. Machine learning-based prediction model for disc cutter life in TBM excavation through hard rock formations. Tunn. Undergr. Sp. Technol. 2024, 150, 105826. [Google Scholar] [CrossRef]
  24. Kim, H.E.; Nam, K.M.; Rehman, H.; Kyeon, T.S.; Yoo, H.K. Comparison Study on Coarseness Index and Maximum Diameter of Rock Fragments by Linear Cutting Tests. Appl. Sci. 2022, 12, 8793. [Google Scholar] [CrossRef]
  25. Wu, X.Q.; Wang, X.; Wei, Y.P.; Song, H.W.; Huang, C.G. Parametric study on single shot peening by dimensional analysis method incorporated with finite element method. Acta Mech. Sin. 2012, 28, 825–837. [Google Scholar] [CrossRef]
  26. Jalali, S.M.E.; Naghadehi, M.Z. Development of a new laboratory apparatus for the examination of the rotary-percussive penetration in tunnel boring machines. Tunn. Undergr. Space Technol. 2013, 33, 88–97. [Google Scholar] [CrossRef]
  27. Liu, B.; Zhang, F.; Li, S.; Li, Y.; Xu, S.; Nie, L.; Zhang, C.; Zhang, Q. Forward modelling and imaging of ground-penetrating radar in tunnel ahead geological prospecting. Geophys. Prospect. 2018, 66, 784–797. [Google Scholar] [CrossRef]
  28. Mousapour, H.; Chakeri, H.; Darbor, M.; Hekmatnejad, A. Evaluating the wear of cutting tools using a tunnel boring machine laboratory simulator. Min. Miner. Depos. 2023, 17, 28–34. [Google Scholar] [CrossRef]
  29. Kaceнoв, A.; Maздyбaй, A.; Aбишeв, K.; Myкaнoв, P.; Tacкapинa, A. Calculation of prismatic forming cutters. Eng. J. Satbayev Univ. 2021, 143, 149–156. [Google Scholar] [CrossRef]
  30. Zhou, S.; Kang, Y.; Xie, H.; Wang, L.; Zhang, Q. An approach integrating dimensional analysis and field data for predicting the load on tunneling machine. KSCE J. Civ. Eng. 2019, 23, 3180–3187. [Google Scholar] [CrossRef]
  31. Ciulla, G.; D’Amico, A.; Brano, V.L. Evaluation of building heating loads with dimensional analysis: Application of the Buckingham π theorem. Energy Build. 2017, 154, 479–490. [Google Scholar] [CrossRef]
Figure 1. Dimensional analysis modeling workflow.
Figure 1. Dimensional analysis modeling workflow.
Applsci 15 01298 g001
Figure 2. Raw torque line chart.
Figure 2. Raw torque line chart.
Applsci 15 01298 g002
Figure 3. Line chart highlighting data outliers.
Figure 3. Line chart highlighting data outliers.
Applsci 15 01298 g003
Figure 4. Comparison of data before and after boxplot outlier removal.
Figure 4. Comparison of data before and after boxplot outlier removal.
Applsci 15 01298 g004
Figure 5. Comparison of torque data before and after low-pass filtering.
Figure 5. Comparison of torque data before and after low-pass filtering.
Applsci 15 01298 g005
Figure 6. Residual plot of the elastic net regression model.
Figure 6. Residual plot of the elastic net regression model.
Applsci 15 01298 g006
Figure 7. Comparison of predicted and actual values.
Figure 7. Comparison of predicted and actual values.
Applsci 15 01298 g007
Figure 8. Error percentage plot of the elastic net regression model.
Figure 8. Error percentage plot of the elastic net regression model.
Applsci 15 01298 g008
Table 1. Dimensional Expressions of Parameters.
Table 1. Dimensional Expressions of Parameters.
ParameterωDVcTvFFsP
Dimension[T−1][L][LT−1][ML2T−2][LT−1][MLT−2][MT−2][L]
Table 2. Statistical Distribution of Disc Speed and Torque.
Table 2. Statistical Distribution of Disc Speed and Torque.
Cutterhead Speed RangeSample SizeSample Cutterhead SpeedSample Cutterhead Torque
MaxMeanStdMaxMeanStd
0–11180.9950.5060.248452.73995.29053.241
1–21311.9981.5630.279268.378124.27437.192
2–31852.9952.5110.289329.793136.09040.722
3–44433.9993.6140.282861.905174.69797.073
4–525244.9994.4810.25849.932309.051172.227
5–637385.9995.5170.291018.987447.003249.512
6–777766.9996.5920.2771344.626736.204320.109
7–880197.797.290.1981400.759932.366295.201
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, J.; Liang, F.; Wei, K.; Zuo, C. Prediction Model for Cutterhead Rotation Speed Based on Dimensional Analysis and Elastic Net Regression. Appl. Sci. 2025, 15, 1298. https://doi.org/10.3390/app15031298

AMA Style

Liu J, Liang F, Wei K, Zuo C. Prediction Model for Cutterhead Rotation Speed Based on Dimensional Analysis and Elastic Net Regression. Applied Sciences. 2025; 15(3):1298. https://doi.org/10.3390/app15031298

Chicago/Turabian Style

Liu, Junsheng, Feng Liang, Kai Wei, and Changqun Zuo. 2025. "Prediction Model for Cutterhead Rotation Speed Based on Dimensional Analysis and Elastic Net Regression" Applied Sciences 15, no. 3: 1298. https://doi.org/10.3390/app15031298

APA Style

Liu, J., Liang, F., Wei, K., & Zuo, C. (2025). Prediction Model for Cutterhead Rotation Speed Based on Dimensional Analysis and Elastic Net Regression. Applied Sciences, 15(3), 1298. https://doi.org/10.3390/app15031298

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop