Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transformer Architectures
Abstract
1. Introduction
2. Related Work
3. Data Preparation
3.1. Statistical Analysis
3.2. Preprocessing and Feature Selection
4. Modeling and Analysis Pipeline
4.1. Balanced Ensemble Model
4.1.1. Binary Classification Under Business Constraints
4.1.2. Balanced Bagging Ensemble Approach
| Algorithm 1 Balanced Bagging for Heavy-Tailed Loss Distribution |
|
4.2. Transformer-Based Model
Methodology Overview
5. Results
5.1. Results for the Balanced Ensemble Model
5.1.1. Sampling Strategy Optimization Results
5.1.2. Model Performance on the Test Set
5.1.3. Economic Impact for the Ensemble Model
5.2. Results for the Transformer-Based Model
5.2.1. Augmentation Strategy Optimization Results
5.2.2. Model Performance on Test Set
5.2.3. Economic Impact for the Transformer Model
6. Discussion
6.1. Comparative Analysis Using a Real Dataset
6.2. Comparative Analysis with Baseline Methodology Using a Synthetic Dataset
7. Conclusions and Future Work
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| AUC | Area Under the Curve |
| CI | Confidence Interval |
| ML | Machine Learning |
| MSE | Mean Squared Error |
| ROC AUC | Receiver Operating Characteristic—Area Under the Curve |
| SINCO | Information System of the Insurance Compensation Consortium in Spain |
| SMOTE | Synthetic Minority Oversampling Technique |
References
- Anderson, Theodore W., and Donald A. Darling. 1952. Asymptotic theory of certain “goodness of fit” criteria based on stochastic processes. The Annals of Mathematical Statistics 23: 193–212. [Google Scholar] [CrossRef]
- Baran, Sebastian, and Przemysław Rola. 2022. Prediction of motor insurance claims occurrence as an imbalanced machine learning problem. arXiv arXiv:2204.06109. [Google Scholar] [CrossRef]
- Black, Julia, and Andrew Douglas Murray. 2019. Regulating ai and machine learning: Setting the regulatory agenda. European Journal of Law and Technology 10: 1–21. [Google Scholar]
- Brati, Esmeralda, Alma Braimllari, and Ardit Gjeçi. 2025. Machine learning applications for predicting high-cost claims using insurance data. Data 10: 90. [Google Scholar] [CrossRef]
- Bruun, Simone Borg, Christina Lioma, and Maria Maistro. 2024. Recommending target actions outside sessions in the data-poor insurance domain. ACM Transactions on Recommender Systems 3: 1–24. [Google Scholar] [CrossRef]
- Cambria, Erik, and Bebo White. 2014. Jumping nlp curves: A review of natural language processing research. IEEE Computational Intelligence Magazine 9: 48–57. [Google Scholar] [CrossRef]
- Chawla, Nitesh V., Aleksandar Lazarevic, Lawrence O. Hall, and Kevin W. Bowyer. 2003. Smoteboost: Improving prediction of the minority class in boosting. In European Conference on Principles of Data Mining and Knowledge Discovery. Berlin/Heidelberg: Springer, pp. 107–19. [Google Scholar]
- Chen, Tianqi, and Carlos Guestrin. 2016. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. New York: Association for Computing Machinery, pp. 785–94. [Google Scholar]
- Clauset, Aaron, Cosma Rohilla Shalizi, and Mark E. J. Newman. 2009. Power-law distributions in empirical data. SIAM Review 51: 661–703. [Google Scholar] [CrossRef]
- De Meulemeester, Hannes, and Bart De Moor. 2020. Unsupervised embeddings for categorical variables. In 2020 International Joint Conference on Neural Networks (IJCNN). Piscataway: IEEE, pp. 1–8. [Google Scholar]
- Dietterich, Thomas G. 2000. Ensemble methods in machine learning. In International Workshop on Multiple Classifier Systems. Berlin/Heidelberg: Springer, pp. 1–15. [Google Scholar]
- Doshi-Velez, Finale, and Been Kim. 2017. Towards a rigorous science of interpretable machine learning. arXiv arXiv:1702.08608. [Google Scholar] [CrossRef]
- Elbhrawy, Ahmed Shawky, Mohamed AbdelFattah Belal, and Mohamed Sameh Hassanein. 2024. Ces: Cost estimation system for enhancing the processing of car insurance claims. Journal of Computing and Communication 3: 55–69. [Google Scholar] [CrossRef]
- Finger, Dina, Hansjoerg Albrecher, and Lutz Wilhelmy. 2024. On the cost of risk misspecification in insurance pricing. Japanese Journal of Statistics and Data Science 7: 1111–53. [Google Scholar] [CrossRef]
- Fitriani, Maulida Ayu, and Dany Candra Febrianto. 2021. Data mining for potential customer segmentation in the marketing bank dataset. JUITA: Jurnal Informatika 9: 25–32. [Google Scholar] [CrossRef]
- Galar, Mikel, Alberto Fernandez, Edurne Barrenechea, Humberto Bustince, and Francisco Herrera. 2012. A review on ensembles for the class imbalance problem: Bagging-, boosting-, and hybrid-based approaches. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews) 42: 463–84. [Google Scholar] [CrossRef]
- Ghorbani, Elnaz, and Raquel Soriano-Gonzalez. 2024. Enhancing predictive models in insurance: A feature selection analysis. In Decision Science Alliance International Summer Conference. Berlin/Heidelberg: Springer, pp. 256–67. [Google Scholar]
- Giudici, Paolo, and Laura Parisi. 2018. Corisk: Credit risk contagion with correlation network models. Risks 6: 95. [Google Scholar] [CrossRef]
- Gorishniy, Yury, Ivan Rubachev, Valentin Khrulkov, and Artem Babenko. 2021. Revisiting deep learning models for tabular data. In Advances in Neural Information Processing Systems. Cambridge: The MIT Press, vol. 34, pp. 18932–43. [Google Scholar]
- Gupta, Vibhuti, Julian Broughton, Ange Rukundo, and Lubna Pinky. 2025. Learning unbiased risk prediction based algorithms in healthcare: A case study with primary care patients. Informatics in Medicine Unlocked 54: 101627. [Google Scholar] [CrossRef]
- Hanafy, Mohamed, and Ruixing Ming. 2021. Machine learning approaches for auto insurance big data. Risks 9: 42. [Google Scholar] [CrossRef]
- Henckaerts, Roel, Marie-Pier Côté, Katrien Antonio, and Roel Verbelen. 2021. Boosting insights in insurance tariff plans with tree-based machine learning methods. North American Actuarial Journal 25: 255–85. [Google Scholar] [CrossRef]
- Hu, Changyue, Zhiyu Quan, and Wing Fung Chong. 2022. Imbalanced learning for insurance using modified loss functions in tree-based models. Insurance: Mathematics and Economics 106: 13–32. [Google Scholar] [CrossRef]
- Hutagaol, B. Junedi, and Tuga Mauritsius. 2020. Risk level prediction of life insurance applicant using machine learning. International Journal of Advanced Trends in Computer Science and Engineering 9: 2213–20. [Google Scholar] [CrossRef]
- Jarque, Carlos M., and Anil K. Bera. 1987. A test for normality of observations and regression residuals. International Statistical Review 55: 163–72. [Google Scholar] [CrossRef]
- Khamesian, Farzan, Maryam Esna-Ashari, Eric Dei Ofosu-Hene, and Farbod Khanizadeh. 2022. Risk classification of imbalanced data for car insurance companies: Machine learning approaches. International Journal of Mathematical Modelling & Computations 12: 153–62. [Google Scholar]
- Kinga, Diederik, and Jimmy Ba Adam. 2015. A method for stochastic optimization. Paper presented at the International Conference on Learning Representations (ICLR), San Diego, CA, USA, May 7–9, vol. 5. [Google Scholar]
- Kolambe, Sapana, and Parminder Kaur. 2023. Survey on insurance claim analysis using natural language processing and machine learning. International Journal on Recent and Innovation Trends in Computing and Communication 11: 30–38. [Google Scholar] [CrossRef]
- Krupova, Marketa, Nabil Rachdi, and Quentin Guibert. 2025. Explainable boosting machine for predicting claim severity and frequency in car insurance. arXiv arXiv:2503.21321. [Google Scholar] [CrossRef]
- Le, Thi-Thu-Huong, Aji Teguh Prihatno, Yustus Eko Oktian, Hyoeun Kang, and Howon Kim. 2023. Exploring local explanation of practical industrial ai applications: A systematic literature review. Applied Sciences 13: 5809. [Google Scholar] [CrossRef]
- LeCun, Yann, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. Nature 521: 436–44. [Google Scholar] [CrossRef]
- Leo, Martin, Suneel Sharma, and Koilakuntla Maddulety. 2019. Machine learning in banking risk management: A literature review. Risks 7: 29. [Google Scholar] [CrossRef]
- Li, Yu, Yi Zhang, Lu Gan, Gengwei Hong, Zimu Zhou, and Qiang Li. 2021. Revman: Revenue-aware multi-task online insurance recommendation. In Proceedings of the AAAI Conference on Artificial Intelligence. Washington: AAAI Press, vol. 35, pp. 303–10. [Google Scholar]
- Lim, Sehyun, Taeyeon Oh, and Guy Ngayo. 2023. Analyzing factors affecting risk aversion: Case of life insurance data in korea. Heliyon 9: e20697. [Google Scholar] [CrossRef]
- Liu, Xu-Ying, Jianxin Wu, and Zhi-Hua Zhou. 2008. Exploratory undersampling for class-imbalance learning. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 39: 539–50. [Google Scholar] [CrossRef]
- Loisel, Stéphane, Pierrick Piette, and Cheng-Hsien Jason Tsai. 2021. Applying economic measures to lapse risk management with machine learning approaches. ASTIN Bulletin: The Journal of the IAA 51: 839–71. [Google Scholar] [CrossRef]
- Lundberg, Scott M., and Su-In Lee. 2017. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems. Cambridge: The MIT Press, vol. 30. [Google Scholar]
- Marhavilas, Panagiotis K., and Dimitrios E. Koulouriotis. 2021. Risk-acceptance criteria in occupational health and safety risk-assessment—The state-of-the-art through a systematic literature review. Safety 7: 77. [Google Scholar] [CrossRef]
- Masello, Leandro, German Castignani, Barry Sheehan, Montserrat Guillen, and Finbarr Murphy. 2023. Using contextual data to predict risky driving events: A novel methodology from explainable artificial intelligence. Accident Analysis & Prevention 184: 106997. [Google Scholar] [CrossRef]
- Mienye, Ibomoiye Domor, and Theo G. Swart. 2024. A hybrid deep learning approach with generative adversarial network for credit card fraud detection. Technologies 12: 186. [Google Scholar] [CrossRef]
- Ngwenduna, Kwanda Sydwell, and Rendani Mbuvha. 2021. Alleviating class imbalance in actuarial applications using cost-sensitive learning. Risks 9: 49. [Google Scholar] [CrossRef]
- Olasehinde, Olayemi, Boniface Kayode Alese, and Ojonukpe Eqwuche. 2025. Privacy-preserving artificial intelligence: Principles, methods, applications, and challenges. Journal of Applied Artificial Intelligence 6: 60–70. [Google Scholar] [CrossRef]
- Orji, Ugochukwu, and Elochukwu Ukwandu. 2024. Machine learning for an explainable cost prediction of medical insurance. Machine Learning with Applications 15: 100516. [Google Scholar] [CrossRef]
- Pes, Barbara. 2021. Learning from high-dimensional and class-imbalanced datasets using random forests. Information 12: 286. [Google Scholar] [CrossRef]
- Sadreddini, Zhaleh, Ilknur Donmez, and Halim Yanikomeroglu. 2021. Cancel-for-any-reason insurance recommendation using customer transaction-based clustering. IEEE Access 9: 39363–74. [Google Scholar] [CrossRef]
- Sahai, Rahul, Ali Al-Ataby, Sulaf Assi, Manoj Jayabalan, Panagiotis Liatsis, Chong Kim Loy, Abdullah Al-Hamid, Sahar Al-Sudani, Maitham Alamran, and Hoshang Kolivand. 2022. Insurance risk prediction using machine learning. In The International Conference on Data Science and Emerging Technologies. Berlin/Heidelberg: Springer, pp. 419–33. [Google Scholar]
- Sari, Puspita Kencana, and Adelia Purwadinata. 2019. Analysis characteristics of car sales in e-commerce data using clustering model. Journal of Data Science and Its Applications 2: 19–28. [Google Scholar] [CrossRef]
- Seiffert, Chris, Taghi M Khoshgoftaar, Jason Van Hulse, and Amri Napolitano. 2009. Rusboost: A hybrid approach to alleviating class imbalance. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 40: 185–97. [Google Scholar] [CrossRef]
- Seyam, Eslam Abdelhakim. 2025. Predicting high-cost healthcare utilization using machine learning: A multi-service risk stratification analysis in eu-based private group health insurance. Risks 13: 133. [Google Scholar] [CrossRef]
- Shah, Chiranjibi, Qian Du, and Yan Xu. 2022. Enhanced tabnet: Attentive interpretable tabular learning for hyperspectral image classification. Remote Sensing 14: 716. [Google Scholar] [CrossRef]
- Simester, Duncan, Artem Timoshenko, and Spyros I. Zoumpoulis. 2020. Targeting prospective customers: Robustness of machine-learning methods to typical data challenges. Management Science 66: 2495–522. [Google Scholar] [CrossRef]
- Singla, Jimmy, Ali Kashif Bashir, Yunyoung Nam, Najam UI Hasan, and Usman Tariq. 2021. Handling class imbalance in online transaction fraud detection. Computers, Materials and Continua 70: 2861–77. [Google Scholar]
- Soriano-Gonzalez, Raquel, Veronika Tsertsvadze, Celia Osorio, Noelia Fuster, Angel A. Juan, and Elena Perez-Bernabeu. 2024. Balancing risk and profit: Predicting the performance of potential new customers in the insurance industry. Information 15: 546. [Google Scholar] [CrossRef]
- Tian, Xiaoguang, Jun Todorovic, and Zelimir Todorovic. 2023. A machine-learning-based business analytical system for insurance customer relationship management and cross-selling. Journal of Applied Business & Economics 25: 273–89. [Google Scholar]
- Tiwari, Anoop Kumar, Rajat Saini, Abhigyan Nath, Phool Singh, and Mohd Asif Shah. 2024. Hybrid similarity relation based mutual information for feature selection in intuitionistic fuzzy rough framework and its applications. Scientific Reports 14: 5958. [Google Scholar] [CrossRef]
- Uddin, Moin, Mohd Faizan Ansari, Mohd Adil, Ripon K. Chakrabortty, and Michael J. Ryan. 2023. Modeling vehicle insurance adoption by automobile owners: A hybrid random forest classifier approach. Processes 11: 629. [Google Scholar] [CrossRef]
- Voigt, Paul, and Axel Von dem Bussche. 2017. The EU general data protection regulation (gdpr). A Practical Guide, 1st ed. Cham: Springer International Publishing, vol. 10, pp. 10–5555. [Google Scholar] [CrossRef]
- Young, Tom, Devamanyu Hazarika, Soujanya Poria, and Erik Cambria. 2018. Recent trends in deep learning based natural language processing. IEEE Computational IntelligenCe Magazine 13: 55–75. [Google Scholar] [CrossRef]
- Zhou, Zhi-Hua. 2012. Ensemble Methods: Foundations and Algorithms. Boca Raton: CRC Press. [Google Scholar]






| Method | Strengths | Limitations | Difference vs. Our Approach |
|---|---|---|---|
| SMOTE/GAN oversampling | Increases minority samples via synthesis (Chawla et al. 2003; Mienye and Swart 2024). | May create unrealistic cases; poor fit for heavy-tailed data. | Keeps only real cases, no synthetic artifacts. |
| Cost-sensitive/Focal Loss | Penalizes costly misclassifications; improves recall (Sari and Purwadinata 2019; Tian et al. 2023). | Ignores business limits (e.g., ≤8% omission). | Improves recall but does not explicitly enforce operational constraints or optimize portfolio-level profit. |
| TabNet (deep learning) | Captures feature interactions; strong benchmarks (Shah et al. 2022). | High cost; low interpretability; limited regulatory use. | Interpretable tree ensembles, easy deployment. |
| Transformer models | Models complex, high-order dependencies (Gorishniy et al. 2021). | Requires large numbers of data; expensive; low transparency. | Provides high representational capacity but does not explicitly incorporate profit-driven objectives or operational constraints as in our approach. |
| SHAP-based selection | Non-linear feature importance; interpretability (Le et al. 2023; Lundberg and Lee 2017). | Heavy computation; post hoc only. | Mutual information used within training pipeline. |
| SMOTEBoost/RUSBoost/Balanced RF/EasyEnsemble2 | Combines boosting/bagging with resampling; widely used (Chawla et al. 2003; Liu et al. 2008; Pes 2021; Seiffert et al. 2009). | Does not preserve real minority; lacks profit/business focus. | Preserves genuine minority, profit-optimized, business limits enforced. |
| Proposed ensemble | Preserves minority; profit-driven; interpretable. | Slightly lower precision; higher cost than single models. | Directly aligned with insurance business and regulation. |
| Cumulative Loss | Customer Percentage | 95% CI |
|---|---|---|
| 70% | 13.5% | (11.8%, 15.1%) |
| 75% | 17.4% | (15.6%, 19.1%) |
| 80% | 22.5% | (20.7%, 24.4%) |
| 85% | 29.4% | (27.5%, 31.2%) |
| 90% | 38.5% | (36.7%, 40.2%) |
| Sampling Strategy | Mean Profit (Euros) | Bootstrap 95% CI (Euros) | Omission Rate (%) | F1-Score (Mean) | Constraint Compliant |
|---|---|---|---|---|---|
| 1.17 | 727,833 | (687,621, 766,880) | 27.6 | 0.797 | No |
| 1.47 | 713,699 | (674,201, 754,649) | 20.5 | 0.836 | No |
| 2.12 | 616,093 | (561,111, 671,588) | 12.4 | 0.874 | No |
| 2.80 | 536,471 | (472,676, 595,574) | 8.2 | 0.890 | No |
| 3.12 | 505,111 | (436,844, 566,107) | 7.0 | 0.894 | Yes |
| 3.20 | 489,752 | (422,913, 553,440) | 6.8 | 0.894 | Yes |
| 3.60 | 460,843 | (399,643, 526,616) | 5.7 | 0.897 | Yes |
| 3.85 | 435,281 | (382,279, 491,054) | 5.1 | 0.898 | Yes |
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| False | 0.94 | 0.94 | 0.94 | 14,250 |
| True | 0.32 | 0.30 | 0.31 | 1236 |
| Accuracy | 0.89 | 15,486 | ||
| Macro avg | 0.63 | 0.62 | 0.63 | 15,486 |
| Weighted avg | 0.89 | 0.89 | 0.89 | 15,486 |
| Category | Count | Mean Profit (eu) | Total (eu) | Business Implication |
|---|---|---|---|---|
| True Negatives | 13,463 | 245.40 | +3,303,887 | Correctly selected |
| True Positives | 376 | −3007.55 | −1,130,838 | Correctly omitted |
| False Negatives | 860 | −2144.74 | −1,844,476 | Missed high-risk |
| False Positives | 787 | 98.39 | +77,434 | Foregone profit |
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| False | 0.94 | 0.94 | 0.94 | 14,250 |
| True | 0.27 | 0.26 | 0.27 | 1236 |
| Accuracy | 0.88 | 15,486 | ||
| Macro avg | 0.60 | 0.60 | 0.60 | 15,486 |
| Weighted avg | 0.88 | 0.88 | 0.88 | 15,486 |
| Category | Count | Mean Profit (eu) | Total (eu) | Business Implication |
|---|---|---|---|---|
| True Negatives | 13,463 | 243.41 | +3,247,314 | Correctly selected |
| True Positives | 376 | −3362.44 | −1,106,242 | Correctly omitted |
| False Negatives | 860 | −2060.72 | −1,869,073 | Missed high-risk |
| False Positives | 787 | 147.42 | +134,007 | Foregone profit |
| Metric | Soriano-Gonzalez | Balanced Ensemble | Change |
|---|---|---|---|
| ROC-AUC (Test) | 0.72 | 0.90 | |
| Precision (High-Risk) | 0.67 | 0.32 | −52.2% |
| Recall (High-Risk) | 0.23 | 0.30 | |
| F1-Score (Weighted) | 0.79 | 0.89 | |
| Customer Omission Rate | 6.0% | 7.5% | |
| Business Metrics | |||
| Test Set Profit | 1,232,663 euros | 1,459,411 euros | |
| Avg. Profit per Customer | 85 euros | 102 euros | |
| Profit as % of Maximum | 33% | 40% | |
| Metric | Soriano-González | Transformer | Change |
|---|---|---|---|
| ROC-AUC (Test) | 0.72 | 0.71 | −1.4% |
| Precision (High-Risk) | 0.67 | 0.27 | −61.1% |
| Recall (High-Risk) | 0.23 | 0.26 | |
| F1-Score (Weighted) | 0.79 | 0.88 | |
| Customer Omission Rate | 6.0% | 8.0% | |
| Business Metrics | |||
| Test Set Profit | 1,232,663 euros | 1,378,241 euros | |
| Avg. Profit per Customer | 85 euros | 96 euros | |
| Profit as % of Maximum | 33% | 38% | |
| Dataset | Transformer | Soriano-Gonzalez et al. (2024) | Balanced Ensemble | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Profit (eu) | % | Training Time (s) | Profit (eu) | % | Training Time (s) | Profit (eu) | % | Training Time (s) | |
| Syntdetic000 | 1,378,241 | 37 | 250.14 | 1,232,663 | 33 | 0.11 | 1,459,411 | 40 | 40.59 |
| Synthetic001 | 1,418,905 | 39 | 235.84 | 940,327 | 26 | 0.11 | 1,214,087 | 33 | 38.09 |
| Synthetic003 | 1,363,697 | 37 | 245.82 | 1,160,155 | 32 | 0.21 | 1,477,696 | 40 | 44.77 |
| Synthetic005 | 1,206,787 | 33 | 237.68 | 1,111,400 | 30 | 0.19 | 1,250,344 | 34 | 46.41 |
| Synthetic008 | 1,274,579 | 35 | 240.92 | 482,108 | 13 | 0.11 | 1,202,612 | 33 | 43.86 |
| Synthetic010 | 1,268,976 | 35 | 239.32 | 938,804 | 26 | 0.21 | 1,109,951 | 30 | 46.51 |
| Synthetic030 | 1,133,961 | 31 | 259.79 | 639,825 | 17 | 0.16 | 827,443 | 23 | 52.99 |
| Synthetic050 | 1,111,445 | 30 | 243.68 | 406,006 | 11 | 0.09 | 890,799 | 24 | 37.06 |
| Synthetic070 | 1,081,369 | 29 | 239.89 | 508,981 | 14 | 0.14 | 782,906 | 21 | 37.48 |
| Synthetic090 | 1,112,750 | 30 | 269.36 | 482,103 | 13 | 0.10 | 857,873 | 23 | 35.19 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Solly, F.L.; Soriano-Gonzalez, R.; Juan, A.A.; Guerrero, A. Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transformer Architectures. Risks 2026, 14, 91. https://doi.org/10.3390/risks14040091
Solly FL, Soriano-Gonzalez R, Juan AA, Guerrero A. Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transformer Architectures. Risks. 2026; 14(4):91. https://doi.org/10.3390/risks14040091
Chicago/Turabian StyleSolly, Finn L., Raquel Soriano-Gonzalez, Angel A. Juan, and Antoni Guerrero. 2026. "Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transformer Architectures" Risks 14, no. 4: 91. https://doi.org/10.3390/risks14040091
APA StyleSolly, F. L., Soriano-Gonzalez, R., Juan, A. A., & Guerrero, A. (2026). Advanced Insurance Risk Modeling for Pseudo-New Customers Using Balanced Ensembles and Transformer Architectures. Risks, 14(4), 91. https://doi.org/10.3390/risks14040091

