Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears
Abstract
1. Introduction
1.1. Problem Statement
1.2. Scoring Models for Credit Collection
- 0 No arrears segment
- 1–30 segment
- 31–60 segment
- 61–90 segment
- 91–120 segment
- More than 120 segment
1.3. Literature Review
1.3.1. Credit Scoring with Logistic Regression
1.3.2. Credit Scoring with XGBoost
1.3.3. Credit Scoring with Artificial Neural Networks
1.3.4. Performance Evaluation
1.3.5. Comparison of Models
2. Methodology
2.1. Kolmogorov–Smirnov Test
2.2. Data Sample Selection
2.3. Dependent Variable Setting
3. Train and Test Models
3.1. Logistic Regression Training
Unbalanced Problem
3.2. Extreme Gradient Boosting Training
3.2.1. Hyperparameter Selected and Tuning
3.2.2. XGBoost Hyperparameter Nrounds
3.2.3. XGBoost Hyperparameter Eta
3.2.4. XGBoost Hyperparameter Max_DEPTH
3.2.5. XGBoost Hyperparameter Min_Child_Weight
3.2.6. XGBoost Hyperparameter Subsample
3.2.7. XGBoost Hyperparameter Colsample_Bytree
3.2.8. XGBoost Hyperparameter Lambda
3.3. Artificial Neural Networks Training
- Layers, which are combined into a network (or model)
- The input data and corresponding targets
- The loss function, which defines the feedback signal used for learning
- The optimizer, which determines how learning proceeds
3.3.1. Building the Neural Networks
- How many layers to use
- How many hidden units to choose for each layer
3.3.2. Adding Dropout
4. Interpretability
4.1. Interpretation of Logistic Regression Coefficients
4.2. Interpretation of XGBoost Models Results
4.3. Interpretation of Neural Networks
- Credit_installment_balance (standardized value: −1.1, weight: 0.0000237): This variable, which represents the normalized credit installment balance, shows a positive contribution to the prediction of , suggesting that lower installment balance values are associated with a customer’s failure to make payments.
- cp_pl (value: 1.82, weight: 0.0000232): This feature also contributes positively to the prediction, although with a slightly lower weight.
- Cadena.ARTEFACTA (value: −2.8, weight: −0.000023): This categorical variable, indicating membership to a specific retail chain, shows a negative contribution, suggesting that being a customer of this retail chain reduces the probability of being a bad customer.
5. Results and Discussion
- In No Arrears Segment models, XGBoost (63.36%) and ANN (61.84%) outperform LR (56.42%).
- In 1–30 Segment models, XGBoost (51.38%) and ANN (50.35%) also outperform LR (47.32%).
- In the 31–90 Segment models, however, ANN (38.77%) outperforms LR (36.62%), but XGBoost (34.47%) does not.
- Finally, in the All Segments models, both XGBoost (74.05%) and ANN (73.59%) outperform LR (71.01%).
6. Conclusions
- This paper compares three supervised learning models—logistic regression, XGBoost, and artificial neural networks—using the Kolmogorov–Smirnov (KS) statistic as a performance metric. The XGBoost algorithm consistently demonstrated superior performance across various segments, achieving accuracy rates of 63.36% for segments with no arrears, 51.38% for segments with 1–30 days of arrears, and 74.05% when all segments were analyzed together. These results indicate that XGBoost is more effective for binary classification compared to both logistic regression and neural networks.
- Although logistic regression requires more time for training, its performance in terms of KS did not outperform XGBoost and neural networks in any of the arrears segments.
- In the 31–90 Days of Arrears Segment, neural networks outperformed XGBoost with a 38.77% KS value, indicating that the complexity and adaptability of neural networks can be advantageous in certain scenarios, despite the longer training time required.
- The interpretation of XGBoost results relies on how much each variable contributes to the splits in the random trees. In contrast, neural networks are still being studied to achieve satisfactory interpretability. This indicates that when developing a scoring model, one must choose between interpretability and predictability. If the goal is to enhance predictive or discriminative power, XGBoost or neural networks are preferred options. These algorithms can be effectively utilized during the placement, servicing, and collection phases of the credit cycle, as there is a larger volume of data to analyze. This is particularly relevant in collections, where results can change rapidly.
- Future work involves exploring combinations of models to leverage the individual strengths of algorithms like XGBoost and neural networks, aiming to improve prediction accuracy across different segments or scenarios.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Set of Variables for Training Models
| Alias | Variable | Description |
|---|---|---|
| M1 | Atraso_Max_3meses | Maximum arrears in the last three months |
| M2 | Atraso_Max_6meses | Maximum arrears in the last six months |
| M3 | Atraso_Max_9meses | Maximum arrears in the last nine months |
| M4 | Atraso_Max_Credito | Maximum arrears since the start of the loan |
| M5 | Atraso_Prom_12meses | Average arrears over the last 12 months |
| M6 | Atraso_Prom_3meses | Average arrears over the last three months |
| M7 | Atraso_Prom_Credito | Average arrears since the start of the loan |
| M8 | Cadena | Distribution chain to which the purchased product belongs |
| M9 | canal_vta | Sales channel through which the loan was acquired |
| M10 | cant_con_efe_dom_12meses | Number of direct contacts at the customer’s home in the last twelve months |
| M11 | cant_con_efe_dom_3meses | Number of direct contacts at the customer’s home in the last three months |
| M12 | cant_con_efe_dom_6meses | Number of direct contacts at the customer’s home in the last six months |
| M13 | cant_con_efe_dom_9mese | Number of direct contacts at the customer’s home in the last nine months |
| M14 | cant_con_efe_tel_12meses | Number of phone contacts with the customer in the last twelve months |
| M15 | cant_con_efe_tel_6meses | Number of phone contacts with the customer in the last six months |
| M16 | cant_con_efe_tel_9meses | Number of phone contacts with the customer in the last nine months |
| M17 | cant_con_efe_tel_mesant | Number of phone contacts with the customer in the previous month |
| M18 | cant_ges_dom_12meses | Number of visits to the customer’s home in the last twelve months |
| M19 | cant_ges_dom_3meses | Number of visits to the customer’s home in the last three months |
| M20 | cant_ges_dom_9meses | Number of visits to the customer’s home in the last nine months |
| M21 | cant_ges_efe_dom_12meses | Number of unsuccessful home visits in the last twelve months |
| M22 | cant_ges_efe_dom_6meses | Number of unsuccessful home visits in the last six months |
| M23 | cant_ges_efe_dom_9meses | Number of unsuccessful home visits in the last nine months |
| M24 | cant_ges_efe_dom_mesant | Number of unsuccessful home visits in the previous month |
| M25 | cant_ges_efe_tel_9meses | Number of unsuccessful phone calls in the last nine months |
| M26 | cant_ges_efe_tel_mesant | Number of unsuccessful phone calls in the previous month |
| M27 | cant_ges_tel_12meses | Number of phone calls in the last 12 months |
| M28 | cant_ges_tel_6meses | Number of phone calls in the last 6 months |
| M29 | cant_ges_tel_9meses | Number of phone calls in the last 9 months |
| M30 | Cant_Num_Telef_Referen | Number of reference phone numbers held by the customer |
| M31 | Cant_Productos | Number of billed products |
| M32 | CapitalInteres | Capital with interest |
| M33 | cp_ctr | Ratio of paid installments to installments due |
| M34 | cp_pl | Ratio of paid installments to loan term |
| M35 | ctr_pl | Ratio of remaining installments to loan term |
| M36 | Cuotas_pagad_credito | Number of installments paid on the loan |
| M37 | Cuotas_pendt_credito | Number of installments due |
| M38 | CuotasGratis | Indicator of whether the customer has a free installment promotion |
| M39 | desc_mejor_resp_dom_3meses | Best response obtained from home visits in the last three months |
| M40 | desc_mejor_resp_dom_6meses | Best response obtained from home visits in the last six months |
| M41 | desc_mejor_resp_dom_9meses | Best response obtained from home visits in the last nine months |
| M42 | desc_mejor_resp_tel_6meses | Best response obtained from phone calls in the last six months |
| M43 | dif_mes | Number of months between the loan disbursement and the reporting date |
| M44 | Edad | Customer’s age in years at the time of data extraction |
| M45 | ID_Num_Telef_Particular1 | Indicator of whether the customer has a landline at home |
| M46 | ind_ges_preventiva | Indicator of whether preventive management was performed |
| M47 | IngresosPropios | Estimated value of customer’s own income |
| M48 | Inicial | Total initial payment at the time the loan was taken |
| M49 | Inicialbono | Amount less than an installment paid at the start of the loan |
| M50 | linea | Product line |
| M51 | MesesGracia | Number of grace months before the first installment due date |
| M52 | Num_atra_may_60dias_anio | Number of delinquencies over 60 days in the past year |
| M53 | Num_atra_may30dias_anio | Number of delinquencies over 30 days in the past year |
| M54 | Num_pag_12meses | Number of payments made in the last twelve months |
| M55 | Num_pag_3meses | Number of payments made in the last three months |
| M56 | Num_pag_6meses | Number of payments made in the last six months |
| M57 | Num_pag_9meses | Number of payments made in the last nine months |
| M58 | Pago_efec_1mes | Payment made for the installment due on the reporting date |
| M59 | Plazo | Total number of loan installments |
| M60 | RalacionTrabajo | Indicator of the customer’s current employment status |
| M61 | Rango_mora_max_mesant | Maximum arrear range in the month prior to the reporting date |
| M62 | Rango_mora_mesact | Current delinquency range as of the reporting month |
| M63 | region | Geographic region of the customer’s home |
| M64 | Saldo_cuota_credito | Outstanding installment balance |
| M65 | Saldo_vencido_Credito | Overdue loan balance |
| M66 | Sexo | Gender as self-identified by the customer |
| M67 | TasaCredito | Effective interest rate of the loan |
| M68 | tipoinicialbono | Type of initial bonus received when the loan was taken |
| M69 | TotFacturaInicial | Total billed amount excluding interest |
| M70 | Val_pag_1meses | Amount paid the month before the reporting date |
| M71 | Val_pag_2meses | Amount paid two months before the reporting date |
| M72 | Val_pag_3meses | Amount paid three months before the reporting date |
| M73 | ValorCuota | Total installment amount including interest |
Appendix B. Logistic Regression and XG Boost Results
| Alias | Beta Estimated | Description |
|---|---|---|
| V1 | −1.74701 | M34 ≤ 0.56 & M58(0; 77.62] & M55(2;3] |
| V2 | −1.74726 | M34 ≤ 0.56 & M58(77.62; 102.88] & M72(159.2;293.94] |
| V3 | −0.40177 | M34(0.56;0.77] |
| V4 | 1.61231 | M34 > 0.77 |
| V5 | 0.87094 | M64 ≤ 197.79 |
| V6 | −0.85414 | M64 > 1121.02 & M33(0.8;1] & M70 > 24.61 |
| V7 | −0.27386 | M35 ≤ 6.5 & M56(5;6] & M36 ≤ 11 |
| V8 | 0.10523 | (M38 == BONO INICIAL + N CUOTAS GRATIS | M38 == CUOTAS GRATIS) & M49 <= 0 & (M63 == QUITO | M63 == GUAYAQUIL) |
| V9 | −0.46984 | M38 == NULL |
| V10 | 0.08418 | M67(0;15] & M48 > 59 |
| V11 | −0.15439 | (M68 == NULL & M71(102.58;136.09] & M73(47.18;77.8]) | (M68 == NULL & M71(153.54;236.16] & M73 > 77.8) |
| V12 | −0.11464 | M31 > 2 & (M50 == Tienda | M50 == Recojo) & M60 == NO |
| V13 | −0.67996 | M35 ≤ 0.75 |
| Intercept | −0.80574 |
| Alias | Beta Estimated | Description |
|---|---|---|
| V1 | 1.56644 | M64 ≤ 123.25 |
| V2 | −0.00106 | M64(123.25;311.02] (cont) |
| V3 | −0.81316 | M64 > 922.58 & M37 ≤ 0 & M58(58.87;117.52] |
| V4 | −0.23837 | M34(0.28;0.65] & M55(2;3] & M4 ≤ 5 |
| V5 | 0.70477 | M34 > 0.83 (cont) |
| V6 | −0.17784 | M55(2;3] & M4 ≤ 5 |
| V7 | −0.54773 | M35 ≤ 3.2 & M65 ≤ 0 |
| V8 | 0.64187 | M35 > 0.87 (cont) |
| V9 | −0.01297 | M36(3;14] (cont) |
| Intercept | −0.39452 |
| Alias | Beta Estimated | Variable |
|---|---|---|
| V1 | −0.52883 | M1 ≤ 11 |
| V2 | −0.58201 | M1(11;27] & M64 > 297.19 |
| V3 | 0.19268 | M1 > 42 |
| V4 | −0.30622 | M3(29;45] |
| V5 | 0.51679 | M3(45;55] & M58 ≤ 0 |
| V6 | 0.77430 | M6 > 49.67 & M53 ≤ 2 |
| V7 | 0.12466 | M52 ≤ 0 & M65 > 108.23 |
| V8 | 0.16562 | (M50 == VIDEO M50 == AUDIO M50 == CONSTRUCCION) & M15 ≤ 0 & M69 ≤ 2046.82 |
| V9 | 0.16522 | (M39 == MENSAJE A TERCEROS M39 == CONTACTO SIN COMPROMISO) & M28 > 10 |
| V10 | 0.18916 | M12 ≤ 0 & M26 ≤ 0 & M47 ≤ 353 |
| Intercept | 0.69485 |



References
- Arnold, T. B., & Emerson, J. W. (2011). Nonparametric goodness-of-fit tests for discrete null distributions. R Journal, 3(2), 34–39. [Google Scholar] [CrossRef]
- Bartz, E., Bartz-Beielstein, T., Zaefferer, M., & Mersmann, O. (2023). Hyperparameter tuning for machine and deep learning with r: A practical guide. Springer Nature. [Google Scholar] [CrossRef]
- Chen, T., & Guestrin, C. (2016, August 13–17). Xgboost: A scalable tree boosting system. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794), San Francisco, CA, USA. [Google Scholar]
- Chen, Y., Wang, H., Han, Y., Feng, Y., & Lu, H. (2024). Comparison of machine learning models in credit risk assessment. Applied and Computational Engineering, 74, 278–288. [Google Scholar] [CrossRef]
- Chollet, F. (2018). Deep learning with R/François Chollet; with JJ Allaire. Maning, EEUU. [Google Scholar]
- Cifuentes Baquero, N., & Gutiérrez Murcia, L. (2022). Modelo predictivo de la probabilidad de aumento de los días de mora para usuarios de tarjeta de crédito [Masters’s thesis, Universidad de los Andes]. [Google Scholar]
- Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., & Pedreschi, D. (2019). A survey of methods for explaining black box models. ACM Computing Surveys, 51(5), 93. [Google Scholar] [CrossRef]
- Iñiguez, C., & Morales, M. (2009). Selección de perfiles de clientes mediante regresión logística para muestras desproporcionadas, validación, monitoreo y aplicación en la proyección de provisiones. Escuela Politécnica Nacional. [Google Scholar]
- Li, Z. (2022). Extracting spatial effects from machine learning model using local interpretation method: An example of SHAP and XGBoost. Computers, Environment and Urban Systems, 96, 101845. [Google Scholar] [CrossRef]
- Maddala, G. S., Contreras García, J., Lozano López, V., & García Ferrer, A. (1985). Econometría. McGraw-Hill. [Google Scholar]
- Massey, F. J., Jr. (1951). The Kolmogorov-Smirnov test for goodness of fit. Journal of the American Statistical Association, 46(253), 68–78. [Google Scholar] [CrossRef]
- Molnar, C. (2021). Interpretable machine learning. Available online: https://fedefliguer.github.io/AAI/redes-neuronales.html (accessed on 22 August 2025).
- Pérez Tatamués, A. E. (2014). Modelo de activación de tarjetas de crédito en el mercado crediticio ecuatoriano a través de una metodología analítica y automatizada en r [Bachelor’s thesis, Escuela Politécnica Nacional]. [Google Scholar]
- Reche, J. L. C. (2013). Regresión logística. Tratamiento computacional con R. Universidad de Granada. [Google Scholar]
- Ribeiro, M. T., Singh, S., & Guestrin, C. (2016, August 13–17). “Why should I trust you?”: Explaining the predictions of any classifier. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 135–144), San Francisco, CA, USA. [Google Scholar] [CrossRef]
- Sánchez Farfán, Y. S. (2023). Aplicación del modelo credit scoring y regresión logística en la predicción del crédito, en una entidad financiera de la ciudad del Cusco. Universidad de los Andes. [Google Scholar]
- Suquillo Llumiquinga, J. A. (2021). Credit scoring: Aplicando técnicas de regresión logística y modelos aditivos generalizados para una cartera de crédito en una entidad financiera [Bachelor’s thesis, Escuela Politécnica Nacional]. [Google Scholar]
- Thomas, L., Crook, J., & Edelman, D. (2017). Credit scoring and its applications. SIAM. [Google Scholar]
- Vargas Lara, D. O. (2015). Metodología para la obtención de un modelo de cobranza de créditos masivos. desarrollo y obtención de un modelo de score [Unpublished master’s thesis, Escuela Politécnica Nacional]. [Google Scholar]
- Yeh, I.-C., & Lien, C.-H. (2023). Credit scoring using machine learning techniques: A review and open research issues. Mathematics, 11(4), 839. [Google Scholar] [CrossRef]







| Train | Test | Validation |
|---|---|---|
| 87,821 | 37,638 | 18,819 |
| 61% | 26% | 13% |
| Dep.Var. | Arrears | Train | Test | Validation | |||
|---|---|---|---|---|---|---|---|
| G | B | G | B | G | B | ||
| 0-no arrears | 75% | 25% | 75% | 25% | 75% | 25% | |
| 1–30 | 68% | 32% | 68% | 32% | 67% | 33% | |
| 31–90 | 32% | 68% | 32% | 68% | 33% | 67% | |
| All | 63% | 37% | 63% | 37% | 63% | 37% | |
| Layer | (Type) | Activation Function | Output Shape | Param # |
|---|---|---|---|---|
| dense_2 | (Dense) | relu | (None, 128) | 15,616 |
| dropout_1 | (Dropout) | (None, 128) | 0 | |
| dense_1 | (Dense) | sigmoid | (None, 64) | 8256 |
| dropout | (Dropout) | (None, 128) | 0 | |
| dense | (Dense) | sigmoid | (None, 1) | 65 |
| Loss | binary_crossentropy | |||
| Optimizer | adam | |||
| Total params | 23,937 | (93.50 KB) | ||
| Trainable params | 23,937 | (93.50 KB) | ||
| Layer | (Type) | Activation Function | Output Shape | Param # |
|---|---|---|---|---|
| dense_8 | (Dense) | relu | (None, 64) | 7936 |
| dropout_5 | (Dropout) | (None, 64) | 0 | |
| dense_7 | (Dense) | sigmoid | (None, 32) | 2080 |
| dropout_4 | (Dropout) | (None, 32) | 0 | |
| dense_6 | (Dense) | sigmoid | (None, 1) | 33 |
| Loss | binary_crossentropy | |||
| Optimizer | adam | |||
| Total params | 10,049 | (39.25 KB) | ||
| Trainable params | 10,049 | (39.25 KB) | ||
| Layer | (Type) | Activation Function | Output Shape | Param # |
|---|---|---|---|---|
| dense_17 | (Dense) | relu | (None, 16) | 1984 |
| dropout_11 | (Dropout) | (None, 16) | 0 | |
| dense_16 | (Dense) | relu | (None, 16) | 272 |
| dropout_10 | (Dropout) | (None, 16) | 0 | |
| dense_15 | (Dense) | sigmoid | (None, 1) | 17 |
| Loss | binary_crossentropy | |||
| Optimizer | adam | |||
| Total params | 2273 | (8.88 KB) | ||
| Trainable params | 2273 | (8.88 KB) |
| Layer | (Type) | Activation | Output Shape | Param # |
|---|---|---|---|---|
| dense_20 | Dense | relu | (None, 16) | 1984 |
| dropout_13 | Dropout | – | (None, 16) | 0 |
| dense_19 | Dense | relu | (None, 16) | 272 |
| dropout_12 | Dropout | – | (None, 16) | 0 |
| dense_18 | Dense | sigmoid | (None, 1) | 17 |
| Loss | binary_crossentropy | |||
| Optimizer | adam | |||
| Total params | 2273 | (39.25 KB) | ||
| Trainable params | 2273 | (39.25 KB) | ||
| Model Type | Y | Y Prob | Feature | Feature Value | Feature Weight |
|---|---|---|---|---|---|
| 1 | 0.998 | cp_pl | 1.82 | 0.0000232 | |
| Classification | 1 | 0.998 | Saldo_cuota_credito | −1.1 | 0.0000237 |
| 1 | 0.998 | Cadena.ARTEFACTA | −2.8 | −0.000023 |
| 0 Segment Models | 1–30 Segment Model | ||||||
|---|---|---|---|---|---|---|---|
| LR | XGB | ANN | LR | XGB | ANN | ||
| KS | 56.42% | 63.36% | 61.84% | 47.32% | 51.38% | 50.35% | |
| Time | 83 | 42 | 65 | 410 | 23 | 90 | |
| 31–90 Segment Model | All Arrears Model | ||||||
| LR | XGB | ANN | LR | XGB | ANN | ||
| KS | 36.62% | 34.47% | 38.77% | 71.01% | 74.05% | 73.59% | |
| Time | 1441 | 10 | 163 | 1706 | 33 | 519 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Carrera, A.; Benalcázar, M.E. Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears. J. Risk Financial Manag. 2025, 18, 630. https://doi.org/10.3390/jrfm18110630
Carrera A, Benalcázar ME. Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears. Journal of Risk and Financial Management. 2025; 18(11):630. https://doi.org/10.3390/jrfm18110630
Chicago/Turabian StyleCarrera, Andrés, and Marco E. Benalcázar. 2025. "Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears" Journal of Risk and Financial Management 18, no. 11: 630. https://doi.org/10.3390/jrfm18110630
APA StyleCarrera, A., & Benalcázar, M. E. (2025). Optimizing the Collection Process in Credit Risk Management: A Comparison of Machine Learning Techniques for Predicting Payment Probability at Different Stages of Arrears. Journal of Risk and Financial Management, 18(11), 630. https://doi.org/10.3390/jrfm18110630

