Comparative Analysis of Machine Learning and Multi-View Learning for Predicting Peak Penetration Resistance of Spudcans: A Study Using Centrifuge Test Data
Abstract
1. Introduction
2. Methods
2.1. Machine Learning Method
- (1)
- Random Forest: An ensemble method that builds upon the decision tree algorithm. It constructs a multitude of decision trees, each trained on a distinct bootstrap sample of the data and a random subset of features. The final prediction is obtained by aggregating the outputs of all individual trees, enhancing predictive accuracy and generalization capability [32,33].
- (2)
- Support Vector Machine (SVM): A supervised learning model applicable for both classification and regression tasks. Its core principle is to identify the optimal hyperplane that maximizes the margin between classes in a high-dimensional space. This is achieved using kernel functions (e.g., linear, polynomial, or radial basis function), enabling the model to handle complex, nonlinearly separable data effectively [34,35].
- (3)
- Multiple Linear Regression: A statistical method that models the linear relationship between a single dependent variable (the target) and multiple independent variables (features). The model is fitted by determining the coefficients that minimize the sum of squared differences between the observed and predicted values [36].
- (4)
- Neural Network: A computational model composed of interconnected processing units (neurons), typically arranged in an input layer, one or more hidden layers, and an output layer. Each connection between neurons has an adjustable weight. Through an iterative training process, these weights are optimized, allowing the network to approximate complex nonlinear functions by applying nonlinear activation functions at each neuron [37].
2.2. Principle of Multi-View Learning
2.3. Model Construction and Accuracy Evaluation
3. Machine Learning Prediction of Qp
3.1. Prediction in Sand-over-Clay Strata
3.2. Prediction in Three-Layer Clay Strata
4. Qp Prediction via Multi-View Learning
5. Discussion
5.1. Influence of the Number of Features
5.2. Influence of the Number of Training Sets on the Prediction
5.2.1. Training Combination 1
5.2.2. Training Combination 2
5.2.3. Training Combination 3
5.3. Prediction Feasibility for Three-Layer Clay
5.3.1. Machine Learning Prediction
5.3.2. Multi-View Learning Prediction
5.4. Summary of Feature and Training Set Influences
6. Conclusions and Limitations
- (1)
- The trained ML and MVL models generate accurate Qp predictions for the studied strata, with performance comparable to established deterministic models. Neural networks and MVL showed superior accuracy, confirming the feasibility of data-driven approaches for this geotechnical problem. This performance is comparable to that of established deterministic models, indicating their potential as complementary predictive tools.
- (2)
- For sand-over-clay strata, both the simplified three-feature and comprehensive six-feature yield satisfactory predictions. However, the model using the six-feature set delivers superior accuracy. Therefore, utilizing the full parameter set is recommended in practice to maximize model performance.
- (3)
- With sufficient training data, all models achieve an MRE below 20%. Under data-scarce conditions, a training set ratio of 1:2 for MVL to ML can maintain satisfactory accuracy. This offers clear guidance on minimum data requirements.
- (4)
- This study demonstrates that characterizing soft-stiff-soft and stiff-soft-stiff clay strata as a single stratigraphic class can achieve predictive accuracy for Qp equivalent to modeling them separately. This finding offers a practical reference for predicting capacity in multi-layered strata where detailed soil parameters are limited.
- (5)
- A key limitation is the constrained size of the dataset, which is based on the limited pool of reliable, published centrifuge tests. This restricted both model complexity and validation rigor, necessitating a hold-out validation approach rather than more robust cross-validation. Future work should focus on expanding the experimental database to enable more advanced modeling and comprehensive statistical validation.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Teh, K.L.; Leung, C.F.; Chow, Y.K.; Cassidy, M.J. Centrifuge Model Study of Spudcan Penetration in Sand Overlying Clay. Geotechnique 2010, 60, 825–842. [Google Scholar] [CrossRef]
- Jun, M.J.; Kim, Y.H.; Hossain, M.S.; Cassidy, M.J.; Hu, Y.; Park, S.G. Geotechnical Centrifuge Investigation of the Effectiveness of a Novel Spudcan in Easing Spudcan–Footprint Interactions. J. Geotech. Geoenviron. Eng. 2020, 146, 04020071. [Google Scholar] [CrossRef]
- Van Dijk, B.F.J.; Yetginer, A.G. Findings of the ISSMGE Jack-up Leg Penetration Prediction Event. In Proceedings of the 3rd International Symposium on Frontiers in Offshore Geotechnics (ISFOG 2015), Oslo, Norway, 10–12 June 2015; Taylor & Francis: Leiden, The Netherlands, 2015; pp. 1267–1274. [Google Scholar]
- Craig, W.H.; Chua, K. Deep Penetration of Spudcan Foundations on Sand and Clay. Geotechnique 1990, 40, 541–556. [Google Scholar] [CrossRef]
- Hossain, M.S.; Cassidy, M.J.; Baker, R.; Randolph, M.F. Optimization of Perforation Drilling for Mitigating Punch-through in Multi-Layered Clays. Can. Geotech. J. 2011, 48, 1658–1673. [Google Scholar] [CrossRef]
- Zheng, J.; Wang, D. Numerical Investigation of Spudcan-Footprint Interaction in Non-Uniform Clays. Ocean. Eng. 2019, 188, 106295. [Google Scholar] [CrossRef]
- Ullah, S.N.; Hu, Y. Peak Punch-through Capacity of Spudcan in Sand with Interbedded Clay: Numerical and Analytical Modelling. Can. Geotech. J. 2017, 54, 1071–1088. [Google Scholar] [CrossRef]
- Hu, P.; Stanier, S.A.; Cassidy, M.J.; Wang, D. Predicting Peak Resistance of Spudcan Penetrating Sand Overlying Clay. J. Geotech. Geoenviron. Eng. 2014, 140, 04013009. [Google Scholar] [CrossRef]
- Qiu, G.; Grabe, J. Numerical Investigation of Bearing Capacity Due to Spudcan Penetration in Sand Overlying Clay. Can. Geotech. J. 2012, 49, 1393–1407. [Google Scholar] [CrossRef]
- Lee, K.K.; Randolph, M.F.; Cassidy, M.J. Bearing Capacity on Sand Overlying Clay Soils: A Simplified Conceptual Model. Geotechnique 2013, 63, 1285–1297. [Google Scholar] [CrossRef]
- Young, A.G.; Focht, J.A. Subsurface Hazards Affect Mobile Jack-up Rig Operations. Soundings 1981, 3, 4–9. [Google Scholar]
- Hanna, A.M.; Meyerhof, G.G. Design Charts for Ultimate Bearing Capacity of Foundations on Sand Overlying Soft Clay. Can. Geotech. J. 1980, 17, 300–303. [Google Scholar] [CrossRef]
- Hu, P.; Dong, W.; Sam, S.; Mark, C. Assessing the Punch-through Hazard of a Spudcan on Sand Overlying Clay. Géotechnique 2015, 65, 883–896. [Google Scholar] [CrossRef]
- Zheng, J.; Hossain, M.S.; Wang, D. New Design Approach for Spudcan Penetration in Nonuniform Clay with an Interbedded Stiff Layer. J. Geotech. Geoenviron. Eng. 2015, 141, 04015003. [Google Scholar] [CrossRef]
- Zheng, J.; Hossain, M.S.; Wang, D. Estimating Spudcan Penetration Resistance in Stiff-Soft-Stiff Clay. J. Geotech. Geoenviron. Eng. 2018, 144, 04018001. [Google Scholar] [CrossRef]
- ISO 19905-1:2016; Petroleum and Natural Gas Industries: Site-Specific Assessment of Mobile Offshore Units. International Organization for Standardization: Geneva, Switzerland, 2016.
- SNAME. T&R Bulletin 5-05 A: Guidelines for Site Specific Assessment of Mobile Jack-Up Units; SNAME: Alexandria, VA, USA, 2008. [Google Scholar]
- Cassidy, M.; Li, L.; Hu, P.; Uzielli, M.; Lacasse, S. Deterministic and Probabilistic Advances in the Analysis of Spudcan Behaviour. In Proceedings of the Frontiers in Offshore Geotechnics III, Oslo, Norway, 10–12 June 2015; CRC Press: London, UK, 2015; pp. 183–212. [Google Scholar]
- Houlsby, G.T. A Probabilistic Approach to the Prediction of Spudcan Penetration of Jack-up Units. In Proceedings of the 2nd International Symposium on Frontiers in Offshore Geotechnics, Perth, Australia, 8–10 November 2010; CRC Press: Perth, Australia, 2010; pp. 8–10. [Google Scholar]
- Li, J.; Hu, P.; Uzielli, M.; Cassidy, M.J. Bayesian Prediction of Peak Resistance of a Spudcan Penetrating Sand-over-Clay. Geotechnique 2018, 68, 905–917. [Google Scholar] [CrossRef]
- Sheil, B.; Suryasentana, S.; Templeman, J.; Phillips, B.; Cheng, W.; Zhang, L. Prediction of Pipe-Jacking Forces Using a Bayesian Updating Approach. J. Geotech. Geoenviron. Eng. 2022, 148, 04021173. [Google Scholar] [CrossRef]
- Jiang, J.; Wang, D.; Zhang, S. Improved Prediction of Spudcan Penetration Resistance by an Observation-Optimized Model. J. Geotech. Geoenviron. Eng. 2020, 146, 06020014. [Google Scholar] [CrossRef]
- Zheng, J.; Zhang, S.; Wang, D.; Jiang, J. Optimization for the Assessment of Spudcan Peak Resistance in Clay–Sand–Clay Deposits. J. Mar. Sci. Eng. 2021, 9, 689. [Google Scholar] [CrossRef]
- Yang, X.; Wang, D.; Zhang, S. Probabilistic Prediction of Spudcan Peak Penetration Resistance Based on Parameter Estimation and Sectionalized Adaptive Linear Simplification. Ocean. Eng. 2024, 298, 117228. [Google Scholar] [CrossRef]
- Zhang, W.; Gu, X.; Hong, L.; Han, L.; Wang, L. Comprehensive Review of Machine Learning in Geotechnical Reliability Analysis: Algorithms, Applications and Further Challenges. Appl. Soft Comput. 2023, 136, 110066. [Google Scholar] [CrossRef]
- Li, B.; You, Z.; Ni, K.; Wang, Y. Prediction of Soil Compaction Parameters Using Machine Learning Models. Appl. Sci. 2024, 14, 2716. [Google Scholar] [CrossRef]
- Xu, Y.; Zhao, Y.; Jiang, Q.; Sun, J.; Tian, C.; Jiang, W. Machine-Learning-Based Deformation Prediction Method for Deep Foundation-Pit Enclosure Structure. Appl. Sci. 2024, 14, 1273. [Google Scholar] [CrossRef]
- Yin, X.; Sun, Y.; Xu, W.; Gao, W.; Wang, H.; Ruan, S.; Shao, Y. Seafloor Stability Assessment of Jiaxie Seamount Group Using the “Weight-of-Evidence” (WoE) Method, Western Pacific Ocean. J. Mar. Sci. Eng. 2025, 13, 1001. [Google Scholar] [CrossRef]
- Zhao, X.; Dong, P.; Li, Y.; Zhou, Y.; Zhao, X.; Wang, Q.; Zhan, C. Artificial Neural Network Model for Predicting Local Equilibrium Scour Depth at Pile Groups in Steady Currents. J. Mar. Sci. Eng. 2025, 13, 1742. [Google Scholar] [CrossRef]
- Xie, J.; Fu, J.; Wang, H.; Yang, J. Automation in Construction Intelligent Shield Machine Selection for Subway Tunnel Using Machine Learning. Autom. Constr. 2025, 180, 106492. [Google Scholar] [CrossRef]
- Liu, W.; Chen, Y.; Liu, T.; Liu, W.; Li, J.; Chen, Y. ScienceDirect Shield Tunneling Efficiency and Stability Enhancement Based on Interpretable Machine Learning and Multi-Objective Optimization. Undergr. Space 2025, 22, 320–336. [Google Scholar] [CrossRef]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Ma, M.; Chen, G.; Xu, S.; Tan, W.; Yin, K. Machine Learning-Based Short-Term Forecasting of Significant Wave Height During Typhoons Using SWAN Data: A Case Study in the Pearl River Estuary. J. Mar. Sci. Eng. 2025, 13, 1612. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Arnaldo, C.G.; Jurado, R.D.; Moreno, F.P.; Suárez, M.Z. Enhancing Security in Airline Ticket Transactions: A Comparative Study of SVM and LightGBM. Appl. Sci. 2025, 15, 9581. [Google Scholar] [CrossRef]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: New York, NY, USA, 2021; ISBN 9781071614174. [Google Scholar]
- Lecun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Yan, X.; Hu, S.; Mao, Y.; Ye, Y.; Yu, H. Deep Multi-View Learning Methods: A Review. Neurocomputing 2021, 448, 106–129. [Google Scholar] [CrossRef]
- Lee, K.K.; Cassidy, M.J.; Randolph, M.F. Bearing Capacity on Sand Overlying Clay Soils: Experimental and Finite-Element Investigation of Potential Punch-through Failure. Geotechnique 2013, 63, 1271–1284. [Google Scholar] [CrossRef]
- Hu, P.; Stanier, S.A.; Wang, D.; Cassidy, M.J. Effect of Footing Shape on Penetration in Sand Overlying Clay. Int. J. Phys. Model. Geotech. 2016, 16, 119–133. [Google Scholar] [CrossRef]
- Hossain, M.S.; Randolph, M.F.; Saunier, Y.N. Spudcan Deep Penetration in Multi-Layered Fine-Grained Soils. Int. J. Phys. Model. Geotech. 2011, 11, 100–115. [Google Scholar] [CrossRef]










| Methods | Random Forest | Gauss SVM | SVM-2 | SVM-3 | Multiple-Linear Regression | Neural Network | |
|---|---|---|---|---|---|---|---|
| First training | MRE | 18.9% | 18.2% | 13.6% | 13.3% | 19.8% | 11.1% |
| RMSE | 99.19 | 123.35 | 74.87 | 73.21 | 103.37 | 69.34 | |
| Second training | MRE | 13.5% | 18.3% | 11.7% | 10.6% | 18.9% | 12.4% |
| RMSE | 88.6 | 123.37 | 74.09 | 63.33 | 103.34 | 72.34 | |
| Group | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
| Training sets (Samples) | 1, 2, 3 | 1, 2, 4 | 1, 3, 4 | 2, 3, 4 | 1, 2 | 1, 2 | 1, 3 | 1, 3 |
| Test sets (Sample) | 4 | 3 | 2 | 1 | 3 | 4 | 2 | 4 |
| Group | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
| Training sets (Samples) | 1, 4 | 1, 4 | 2, 3 | 2, 3 | 2, 4 | 2, 4 | 3, 4 | 3, 4 |
| Test sets (Sample) | 2 | 3 | 1 | 4 | 1 | 3 | 1 | 2 |
| Methods | Random Forest | Gauss SVM | SVM-2 | SVM-3 | Multiple-Linear Regression | Neural Network | |
|---|---|---|---|---|---|---|---|
| Soft-stiff-soft clay strata | MRE | 55.4% | 18.6% | 7.8% | 15.9% | 24.8% | 7.3% |
| RMSE | 182.08 | 107.69 | 40.46 | 103.35 | 96.30 | 23.52 | |
| Stiff-soft-stiff clay strata | MRE | 44.5% | 17.7% | 9.9% | 12.2% | 17.9% | 7.7% |
| RMSE | 147.42 | 117.6 | 71.38 | 72.28 | 107.43 | 37.11 | |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 11.4% | 9.5% | 8.2% | 6.2% |
| RMSE | 54.33 | 42.87 | 47.18 | 28.59 |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 14% | 11% | 11% | 9% |
| RMSE | 99.19 | 123.35 | 74.87 | 73.21 |
| No. | Machine Learning Training Set | Multi-View Learning Training Set | Testing Set |
|---|---|---|---|
| Training Combination 1 | 12 | 6 | 38 |
| Training Combination 2 | 6 | 12 | 38 |
| Training Combination 3 | 12 | 12 | 32 |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 13.7% | 14.3% | 10.2% | 17.5% |
| RMSE | 46.27 | 51.02 | 51.41 | 90.85 |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 138.9% | 142.5% | 24.7% | 16.9% |
| RMSE | 543.71 | 562.04 | 83.75 | 68.05 |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 12.5% | 28.2% | 14.5% | 10.1% |
| RMSE | 46.43 | 124.40 | 66.27 | 40.22 |
| Groups | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
| Training sets (Samples) | 1, 2, 3, 4 | 5, 6,7, 8 | 1, 3, 5, 7 | |||||||||
| Test sets (Sample) | 5 | 6 | 7 | 8 | 1 | 2 | 3 | 4 | 2 | 4 | 6 | 8 |
| Groups | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 |
| Training sets (Samples) | 2, 4, 6, 8 | 1, 2, 5, 6 | 3, 4, 7, 8 | |||||||||
| Test sets (Sample) | 1 | 3 | 5 | 7 | 3 | 4 | 7 | 8 | 1 | 2 | 5 | 6 |
| Methods | Random Forest | Gauss SVM | SVM-2 | SVM-3 | Multiple Linear Regression | Neural Network |
|---|---|---|---|---|---|---|
| MRE | 58.4% | 22.9% | 18.7% | 39.4% | 24.7% | 13.5% |
| RMSE | 165.36 | 93.27 | 86.70 | 237.70 | 77.21 | 42.71 |
| Groups | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
| Training sets (Samples) for machine learning | 5, 6,7, 8 | |||||||||||
| Test sets (Sample) for multi-view learning | 1, 2 | 1, 2 | 1, 3 | 1, 3 | 1, 4 | 1, 4 | 2, 3 | 2, 3 | 2, 4 | 2, 4 | 3, 4 | 3, 4 |
| Test sets (Sample) | 3 | 4 | 2 | 4 | 2 | 3 | 1 | 4 | 1 | 3 | 1 | 2 |
| Groups | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 |
| Training sets (Samples) for machine learning | 2, 4, 6, 8 | |||||||||||
| Test sets (Sample) for multi-view learning | 1, 3 | 1, 3 | 1, 5 | 1, 5 | 1, 7 | 1, 7 | 3, 5 | 3, 5 | 3, 7 | 3, 7 | 5, 7 | 5, 7 |
| Test sets (Sample) | 5 | 7 | 3 | 7 | 3 | 5 | 1 | 7 | 1 | 5 | 1 | 3 |
| Methods | SVM-2 | SVM-3 | Neural Network | Multi-View Learning |
|---|---|---|---|---|
| MRE | 27.1% | 29.2% | 30.4% | 10.6% |
| RMSE | 129.33 | 159.14 | 79.65 | 43.66 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, M.; Yang, X.; Yang, X.; Wang, D.; Sun, W.; Sun, H. Comparative Analysis of Machine Learning and Multi-View Learning for Predicting Peak Penetration Resistance of Spudcans: A Study Using Centrifuge Test Data. J. Mar. Sci. Eng. 2026, 14, 62. https://doi.org/10.3390/jmse14010062
Wang M, Yang X, Yang X, Wang D, Sun W, Sun H. Comparative Analysis of Machine Learning and Multi-View Learning for Predicting Peak Penetration Resistance of Spudcans: A Study Using Centrifuge Test Data. Journal of Marine Science and Engineering. 2026; 14(1):62. https://doi.org/10.3390/jmse14010062
Chicago/Turabian StyleWang, Mingyuan, Xiuqing Yang, Xing Yang, Dong Wang, Wenjing Sun, and Huimin Sun. 2026. "Comparative Analysis of Machine Learning and Multi-View Learning for Predicting Peak Penetration Resistance of Spudcans: A Study Using Centrifuge Test Data" Journal of Marine Science and Engineering 14, no. 1: 62. https://doi.org/10.3390/jmse14010062
APA StyleWang, M., Yang, X., Yang, X., Wang, D., Sun, W., & Sun, H. (2026). Comparative Analysis of Machine Learning and Multi-View Learning for Predicting Peak Penetration Resistance of Spudcans: A Study Using Centrifuge Test Data. Journal of Marine Science and Engineering, 14(1), 62. https://doi.org/10.3390/jmse14010062

