Next Article in Journal
Forward-Looking Volatility Estimation for Risk-Managed Investment Strategies during the COVID-19 Crisis
Previous Article in Journal
Efficiency Testing of Prediction Markets: Martingale Approach, Likelihood Ratio and Bayes Factor Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Estimating the BIS Capital Adequacy Ratio for Korean Banks Using Machine Learning: Predicting by Variable Selection Using Random Forest Algorithms

1
Division of Management Information System, College of Business Administration, Hanyang University, Seoul 04763, Korea
2
Department of Consumer Sciences, College of Education and Human Sciences, South Dakota State University, Brookings, SD 57007, USA
*
Author to whom correspondence should be addressed.
Risks 2021, 9(2), 32; https://doi.org/10.3390/risks9020032
Submission received: 7 December 2020 / Revised: 3 January 2021 / Accepted: 14 January 2021 / Published: 1 February 2021

Abstract

:
The purpose of this study is to find the most important variables that represent the future projections of the Bank of International Settlements’ (BIS) capital adequacy ratio, which is the index of financial soundness in a bank as a comprehensive and important measure of capital adequacy. This study analyzed the past 12 years of data from all domestic banks in South Korea. The research data include all financial information, such as key operating indicators, major business activities, and general information of the financial supervisory service of South Korea from 2008 to 2019. In this study, machine learning techniques, Random Forest Boruta algorithms, Random Forest Recursive Feature Elimination, and Bayesian Regularization Neural Networks (BRNN) were utilized. Among 1929 variables, this study found 38 most important variables for representing the BIS capital adequacy ratio. An additional comparison was executed to confirm the statistical validity of future prediction performance between BRNN and ordinary least squares (OLS) models. BRNN predicted the BIS capital adequacy ratio more robustly and accurately than the OLS models. We believe our findings would appeal to the readership of your journal such as the policymakers, managers and practitioners in the bank-related fields because this study highlights the key findings from the data-driven approaches using machine learning techniques.

1. Background

The Bank for International Settlements’ (BIS) capital adequacy ratio is an internationally accepted key indicator for measuring a bank’s capital adequacy, which is defined and suggested by the BIS regardless of local financial, regulatory systems, policies, and laws (BIS 2020). BIS is an international financial institution that is owned by 63 central banks over countries (BIS 2020). BIS has been established in 1930 and have these missions (BIS 2020): (a) collaborating with participating central banks; (b) promoting financial stability; (c) supporting research about financial stability; (d) being prime counterparty for financial transactions; and (e) connecting international financial operations. As the missions imply, BIS is made for stabilizing the international finances as international efforts by participating central banks.
The BIS capital adequacy ratio represents prudent finance regulation indexes for monitoring domestic banks’ capital levels and raised capital (NLIC 2020a). It is also an important macro index against systemic risks because banks have the most important position in the financial system, and it control the highest percentage of payment systems.
The BIS capital adequacy ratio in South Korea is calculated with the formula of the equity capital ratio, which is the percentage of equity capital divided by the risk-weighted assets (RWA). Under Korean financial supervisory institutions’ regulations for calculating RWA, domestic banks can select one of the standard methods or an internal ratings-based (IRB) approach for credit RWA and market RWA. The calculation of operation RWA can be selected among the basic indicator approach, standardized approach, and advanced measurement approach (NLIC 2020b). Credit risk is the largest and most significant risk that banks face as a current or a potential risk. Credit risk directly represents a loss to a bank’s net income and capital when the borrower does not fulfill the terms of the contract or does not pay the debt to the bank. Regarding of credit risk, most of the domestic banks also use the IRB approach, such as Shinhan Bank, KB Kookmin Bank, Nonghyup Bank, KEB Hana Bank, and Woori Bank (Financial Supervisory Commission [FSC] 2020).
Total RWA designates credit risk, market risk, and operation risk as the three main categorized risks. Credit risk means the risk of a potential economic loss that is likely to be incurred by a debtor’s default or an obligator’s failure to fulfill a contract or an obligation. Operational risk means the risk of loss that is likely to be incurred as a result of inappropriate internal processes, human resources, or an external event, which could include legal risk (NLIC 2020e). Market risk means the risk of losses incurred in a bank’s on and off balance-sheet positions from movements of market prices. Market risk is further divided into general market risk and specific risk-weighted assets (NLIC 2020c).
However, the current methodologies need some improvements in relation to the denominator of the BIS ratio. There have been many questions and recommendations about credibility and comparability because the standard approach did not reflect well on high-risk discrimination of RWA, and the IRB has shown considerable gaps for each bank (FSS 2020). The IRB approach with high autonomy has a problem when calculating risk assets that are relatively small compared to the standard approach. This can cause the banks to be more likely to comply with regulatory ratios by reducing the size of RWA with the IRB approach if capital costs rise while banks’ profitability deteriorates (Korea Institute of Finance (KIF 2016)). For this reason, regulatory reform has required improvements of both the standard approach and IRB. This reform focuses on sensitivity, simplicity, and comparability (KIF 2016). Correspondingly, Korea’s FSC has been reviewing regulatory improvements according to Basel III Finalizing Post-Crisis Reforms, issued in December 2017, as proposed by The Basel Committee on Banking Supervision ([BCBS]; FSS 2020).
Under these circumstances, this study investigated data from the past 12 years, including all the information of domestic banks in South Korea. As a final point, this study shows the most important indicators representing the BIS capital adequacy ratio as a supplement index. Considering the large number of variables (1929) and strong prediction performance, machine learning techniques were used. Specifically, the dynamic ecological systemic framework suggested by Heo (2020) was used as the basic assumption of utilizing all 1929 variables as predictors for the BIS capital adequacy ratio. Heo explained that large numbers of predictors from multiple systems around an organization are interacting and influencing the managerial performance of the organization. Machine learning techniques, such as artificial neural networks (ANN), are suggested as the main methodology for the dynamic ecological systemic framework due to the large number of predictors used as dynamic ecological factors (Heo 2020).
Regarding these points, the main purpose of this study was to predict the BIS capital adequacy ratio more accurately. Accordingly, this study has two sub-purposes. First, this study determined which variables are the most important predictors for the BIS capital adequacy ratio. Second, all the variables were analyzed for better prediction by machine learning since using traditional statistical approaches for such a large number of variables was not possible. In short, the study confirmed the prediction performance of selected variables to determine whether the machine learning technique has a better prediction performance than the classical linear prediction method: Bayesian Generalized Linear Regression Model (BGLM).

2. Literature Review

2.1. Financial Stability and BIS Capital Adequacy Ratios of Korean Banks

The BIS capital adequacy ratio regulation was implemented by BCBS under the BIS in July 1988. To secure the banks’ capital adequacy, the banks must hold at least 8% of RWA as equity capital. Korean banks introduced BIS I in July 1992. In January 2008, BIS II was introduced to clarify the characteristics of the risk-oriented equity capital regulation. On 1 December 2013, BIS III was introduced to enhance the quality and quantity of the capital and introduced the capital buffer concept and liquidity expansion. In the beginning, the BIS capital adequacy ratio was introduced for the concept of risk and for the regulation of equity capital, starting with credit risk. It continues to expand to include market risk, operation risk and elaborate requirements for accreditation of the IRB approach. Equity capital regulation is playing a role not only as a supervisory tool to improve the financial soundness of companies, but also as an administrative regulation tool (FSS 2020).

2.2. Korean Laws and Regulations about BIS Capital Adequacy Ratio

Korean laws and regulations stipulate that all banks must follow the Bank Act and enforce its decrees, including the Regulations on Supervision of Banking Business and the Detailed Regulations on Supervision of Banking Business. The laws and regulations support the BIS capital adequacy ratio with management guidance standards and with the principle of bank soundness. All banks must comply with the management guidelines set by the Financial Services Commission (FSC) as prescribed by presidential decrees in relation to capital adequacy to maintain sound management. While the FSC establishes management guidelines, the principles of bank soundness supervision recommended by the BIS must be fully included (NLIC 2020a).
Enforcement of the Banking Act stipulates that the standards for equity capital ratio corresponding to a bank’s credit risk should be included in the standards for management guidelines. The FSC can ask for necessary measures for management improvement when there is a risk of seriously harming the soundness of management, such as failing to meet the management guidance standards according to the BIS capital adequacy ratio or if it is deemed inevitable for maintaining the soundness of management (NLIC 2020g).
Regarding the BIS management guidance ratio, it is stipulated in the regulations on the supervision of the banking business. All banks must maintain a management guidance ratio corresponding to the minimum for the capital ratio; a common equity capital ratio of 4.5/100; a core capital ratio of 6/100; and a total capital ratio of 8/100. Each calculation methodology is determined by the governor of the Financial Supervisory Services (FSS), and it needs to follow the standard guidelines recommended by the BIS (NLIC 2020h). Accordingly, when the equity capital ratio falls under the BIS management guidance ratio as a result of evaluation and analysis of management status, timely corrective actions are triggered as one of the three stages: management improvement recommendations, management improvement requirements, and management improvement orders. Timely corrective measures are mandatory regulations. These must be taken by the governor of the FSS or the FSC when the requirements for initiation are met (NLIC 2020i).
The first stage of timely corrective measures is management improvement recommendations by the FSC. This action is required that the bank has a total capital ratio of less than 8/100 or a core capital ratio of less than 6.0/100 or a common equity capital ratio of less than 4.5/100 (NLIC 2020j). The second stage of timely corrective measure is management improvement requirements by FSC. This action requires that a bank has a total capital ratio of less than 6/100 or a core capital ratio of less than 4.5/100 or a common equity capital ratio of less than 3.5/100 (NLIC 2020k). The final stage of timely corrective measures is Management Improvement Order by the FSC. These actions require that a bank has a total capital ratio of less than 2/100 or a core capital ratio of less than 1.5/100 or a common equity capital ratio of less than 1.2/100. The FSC can order the bank to take necessary actions and the governor of the FSS also has to monitor execution after receiving necessary actions within two months as determined by the FSC (NLIC 2020l).

2.3. Predicting Financial Ratio with Machine Learning Techniques

As such, the credit risk of bank such as BIS is better to be expected by some tools such as prediction modeling. Recently, machine learning techniques were utilized in predicting credit risk and BIS (e.g., Gambacorta et al. 2019; Petropoulos et al. 2019). Gambacorta et al. (2019) utilized Chinese fintech company data and found the effectiveness of using machine learning technique (i.e., fintech credit scoring models that was combined with big data) for predicting the loss in banks. Petropoulos et al. (2019) used data from Greek banking system and forecasted the credit quality. Specifically, considering that credit risk is the foundational concept of BIS, this literature implies that machine learning techniques are useful for prediction in credit issues in banks. Beyond the credit issues of banks industry, it is valuable to utilize the machine learning technique in larger realm such as assessing the various risks (i.e., credit risk, market risk, and operation risk) of the banks industry (i.e., BIS).
In addition, Bazarbash (2019) introduced the strength and the weakness of using machine learning in predicting the credit risks. Even though there are many strengths of using machine learning (e.g., better in classification, good for forecasting, and predicting the risks), the machine learning technique is still having weakness so called as black box. It meant that, by using machine learning, it is not possible to know which factors are precisely predicting the outcome. Therefore, in this study, three steps of machine learning were utilized to check which factors performed to predict BIS. Specifically, the first two steps are expected to sort out the important variables from the total predictors. As a result, the three steps of sorting the variable lists make the important variables to predict the BIS of banks.
To sum up, the uniqueness of this study that is different with the previous literature are three: (a) assessing the risks in the macro-size level with banks’ data, (b) utilizing multiple steps of machine learning to find the important orders of predictors (i.e., overcoming black box issue), and (c) using South Korea banks data that were not predicted yet.

3. Theoretical Background: BIS Capital Adequacy Ratio

Standards for the Calculation of Management Guidance Ratios are stipulated in the Detailed Regulations on the Supervision of Banking Business, which follows the BIS Rule of the Bank for International Settlement (NLIC 2020b). According to The Principles for the Calculation of Equity Capital Ratio Based on BIS I, the ratio of capital to risk-weighted assets must be calculated based on the capital and gross assets on the consolidated balance sheet. When calculating this ratio, a bank must classify the capital ratio based on the credit risk and the combined risks according to the gravity of the risks. The capital ratio based on the market risk must be calculated, where a bank whose ratio of the sum of the trading book to the total assets on the consolidated balance sheet is at least a 5% maximum per day, or where a bank whose sum of trading book is 100 billion Won maximum per day. As a result, the formula for the calculation of equity capital ratio based on credit risk, BIS I is calculated by Function (1) (NLIC 2020e).
E q u i t y   c a p i t a l r a t i o   b a s e d   o n   c r e d i t   R W A   = E q u i t y   c a p i t a l C r e d i t   r i s k w e i g h t e d   a s s e t s   × 100 8 %  
The principles for the calculation of the equity capital ratio based on BIS II are as follows: banks calculate the capital ratio based on credit and operational risks, and separately calculate the capital ratio based on credit, operation and market risks. The equity capital ratio based on market risk is calculated according to the same criteria as the market risk based on BIS I. The capital ratio based on credit and operation risks must be calculated by dividing the equity capital by the sum of the risk-weighted assets aggregated credit RWA, operational RWA, and the risk evaluation adjustment. Equity capital must be calculated by aggregating core capital and supplementary capital then subtracting deductions as 50% from both of the core capital and supplementary capital, except as otherwise prescribed expressly. As a result, the BIS II is calculated by Function (2) (NLIC 2020f).
E q u i t y   c a p i t a l   r a t i o   b a s e d   o n   c r e d i t   a n d   o p e r a t i o n   r i s k s = E q u i t y   c a p i t a l R i s k w e i g h t e d   a s s e t s   +   R i s k   e v a l u a t i o n   a d j u s t m e n t × 100   = C o r e   c a p i t a l   +   S u p p l e m e n t a r y   c a p i t a l     D e d u c t i o n s C r e d i t   r i s k w e i g h t e d   a s s e t s   +   O p e r a t i o n a l   r i s k w e i g h t e d   a s s e t s +   R i s k   e v a l u a t i o n   a d j u s t m e n t × 100
Principles for the calculation of the equity capital ratio based on BIS III are as follows. The capital ratio based on credit and operational risks should be calculated by dividing capital calculated by the formula. Total capital should be calculated by aggregating Tier 1 capital (the aggregate of Common Equity Tier 1 and Additional Tier 1 capital) and Tier 2 capital then dividing by the sum of the risk-weighted assets aggregated credit risk-weighted assets, operational risk-weighted assets, and the risk evaluation adjustment. BIS III is calculated by the ratio of Common Equity Tier 1, the ratio of Tier 1 capital, and the ratio of total capital, depending upon the scope of components of each capital. As a result, BIS III is calculated by Functions (3)–(5) (NLIC 2020d).
R a t i o   o f   C o m m o n   E q u i t y   T i e r   1   b a s e d   o n   c r e d i t   a n d   o p e r a t i o n a l   r i s k s = C o m m o n   E q u i t y   T i e r   1 C r e d i t   r i s k w e i g h t e d   a s s e t s + O p e r a t i o n a l   r i s k w e i g h t e d   a s s e t s + A d j u s t m e n t   o f   r i s k   a s s e s s m e n t × 100
R a t i o   o f   T i e r   1   c a p i t a l   b a s e d   o n   c r e d i t   a n d   o p e r a t i o n a l   r i s k s = T i e r   1   c a p i t a l R i s k w e i g h t e d   a s s e t s + A d j u s t m e n t   o f   r i s k   a s s e s s m e n t × 100   = C o m m o n   E q u i t y   T i e r   1   +   A d d i t i o n a l   T i e r   1   c a p i t a l C r e d i t   r i s k w e i g h t e d   a s s e t s + O p e r a t i o n a l   r i s k w e i g h t e d   a s s e t s + A d j u s t m e n t   o f   r i s k   a s s e s s m e n t × 100
R a t i o   o f   t o t a l   c a p i t a l   b a s e d   o n   c r e d i t   a n d   o p e r a t i o n a l   r i s k s = T o t a l   c a p i t a l R i s k w e i g h t e d   a s s e t s   + A d j u s t m e n t   o f   r i s k   a s s e s s m e n t   × 100   = C o m m o n   E q u i t y   T i e r   1   +   A d d i t i o n a l   T i e r   2   c a p i t a l C r e d i t   r i s k w e i g h t e d   a s s e t s + O p e r a t i o n a l   r i s k w e i g h t e d   a s s e t s + A d j u s t m e n t   o f   r i s k   a s s e s s m e n t × 100

4. Statistical Background: Machine Learning Algorithms

This study sought to find the key variables from 1929 variables that represent the banks’ BIS forecast. Machine learning algorithms in this study consisted of three steps: Random Forest Boruta, Random Forest Feature Elimination, and the Bayesian Regularized Neural Network (BRNN), See Figure 1. Regarding feature selection, it is known that many machine learning algorithms reduce prediction accuracy when using large numbers of variables compared to the optimal one (Kohavi and John 1997). It is also advisable to choose a small feature set that yields the best possible classification result according to minimal-optimal problem (Nilsson et al. 2007). In some situations, it is also important to find all the relevant variables to the classification by all-relevant problem (Nilsson et al. 2007).
Considering this practical point of view, Random Forest was used in this study for precise predictions in cases of large numbers of variables (Genuer et al. 2010). The Random Forest technique represents an early type of machine learning data classification modeling process (Breiman 2001). This technique is based on ensemble learning, which is a procedure that attempts to merge two or more algorithms into one algorithm. In the case of a random forest, the decision tree, average prediction, and bagging sample are all merged into a single algorithm (Ho 1998; Liaw and Wiener 2002).
Kursa and Rudnicki (2010) improved the variable selection performance of the Random Forest algorithm, which is now called the Boruta algorithm. In addition, it is known for the importance of ranking the predictor variables can be found by utilizing a combination of the same family of algorithms (e.g., Random Forest), if the predictor variables are numerous (1939 predictors) (Zhou et al. 2014). Thus, Random Forest RFE and a random forest-based recursive variable removal method, is used as an algorithm that predicts weights to rank predictors in ascending order (Guyon et al. 2002).
In the first stage, the Random Boruta algorithm analyzes 1929 variables from the financial and non-financial information of domestic banks. As a result of the analysis, 489 variables were identified. In the second stage, the root mean of squared error (RMSE) and mean of absolute error (MAE) were analyzed by Random Forest RFE. As a result, 58 and 38 variable sets were defined by importance rank. In the final stage, BRNN algorithms confirmed the statistical validity of the predictive power of the variables found in Random Forest RFE.
BRNN is a type of artificial neural network (ANN) (Sariev and Germano 2020), which estimates the efficient output of nonlinear data without overfitting and overtraining (Burden and Winkler 2008). BRNN is an advanced algorithm based on ANN, a type of feed forward neural network (Sariev and Germano 2020). The feed forward neural network assumes that the gradient process is expected only through an iterative process. The BRNN is also based on the Gauss-Newton approximation using Bayesian estimation and normalization approaches (MacKay 1992). Considering this normalization process, BRNN is superior to existing prediction models for predicting dependent variables composed of continuous variables (Sariev and Germano 2020).

5. Methods

The data for this study are the statistical information from the FSS, which is being updated quarterly by the Korean FSS. This is 12-year time series data from 2008 to 2019 covering all domestic banks. The data include all financial and non-financial variables of the bank, including general information, financial information, key management indices, major business activities, and metadata. Specifically, all information was downloaded from FSS website that were transformed to numeric variables. Total number of downloadable information were 1933. Among those 1,933 variables, company name was used as identification of analytic unit; year and quarter were used as time indicator; and BIS was used as outcome variable. Therefore, the total number of predictors at the initial procedure was 1929.
Domestic banks are classified into general banks established under the Banking Act and special banks established under the individual special Bank Act. The total number of banks in the FSS dataset were eighteen banks. Among those eighteen banks, sixteen banks were utilized in this study and two banks were excluded like following explanation.
First, twelve banks established in accordance with the Banking Act include commercial banks, local banks, foreign banks, domestic branches, and Internet banks (i.e., KEB Hana Bank, Shinhan Bank, Kookmin Bank, Woori Bank, Korean Bity Bank, SC Bank, Busan Bank, Kyungnam Bank, Daegu Bank, Gwangju Bank, Citi bank, and Jeonbuk Bank). Second, five banks as local banks have their own businesses of deposits, loans, and payment settlements (i.e., IBK, Korea Development Bank, Korea Export-Import Bank, Nonghyup, and Suhyup). These five banks were established in accordance with individual special laws to supply funds to certain sectors where it is difficult to supply sufficient funds due to financial constraints and difficulties in securing profitability. Therefore, there were seventeen banks were analyzed in this study.
However, this study excluded Internet banks such as Kakao Korea Bank and K-Bank that started operating in 2016 and 2017 after the approval. As the first step of data analysis, the Boruta algorithm analyzed 1929 predictors representing the financial and non-financial indicators of banks, and 489 variables were identified as significant variables, excluding the output value of avg BIS and the BIS capital ratio, which is the default value. As the second step, the Random Forest Feature Elimination Method was used to analyze these variables and find the optimal selection by order of importance for the volatility of root mean of squared error (RMSE) and mean absolute error (MAE) after inputting the selected 478 variables through the Boruta algorithm. As the third step, the Bayesian Regularized Neural Network (BRNN) showed the comparison results against traditional prediction models to examine the statistical validity. Specially, the results of this study compared the deep learning algorithm to the classical linear prediction by the Bayesian Generalized Linear Regression Model (BGLM) by RMSE and MAE results, since it cannot be directly compared due to the difference of model results. RMSE and MAE are commonly used by researchers as criteria for evaluating predictive models (Hyndman and Koehler 2006; Wooldridge 2016). In general, it means that the higher the RMSE, the lower the model’s accuracy, and the higher the MAE, the lower the accuracy.

6. Results

6.1. Stage 1. Feature Selection Using Random Forest Boruta

A total of 1929 variables were analyzed using the Random Forest Boruta technique after being classified as important, tentative, or unimportant in the primary classification criteria. As shown in Table 1 and Figure 2, the results of the analysis classified 299 as important, 465 as tentative, and 1165 as unimportant. The results by the second classification criteria showed 478 as important and 1451 as unimportant, excluding the 299 important variables. Finally, 478 variables were selected as significant predictors of the BIS capital adequacy ratio.

6.2. Stage 2. Feature Selection by Importance Rank Using Random Forest Feature Elimination

The Random Forest Feature Elimination Algorithm was used for marginal predictions of 478 significant variables selected through Random Forest Boruta. The variables were sequentially introduced in Random Forest Feature Elimination Algorithm to analyze the volatility of RMSE and MAE. As a result, 58 important variables and 38 most important variables were selected as determinants representing the BIS prediction as shown in Figure 3. The list of 38 variables and 58 variables were discussed in the discussion.

6.3. Stage 3. BIS Prediction Using Bayesian Regularized Neural Network Model

To examine the statistical feasibility with the accuracy, this research compared the performance of the deep learning modeling technique and the Bayesian Regularized Neural Network model to the comparison model, which was BGLM, based on the results error of RMSE and MAE.
All models were resampled 50 times and iterated nine times. Specifically, 50 folds resampling procedure per each iteration were utilized as cross validation in the study. The multiple times of folds for resampling are expected to eliminate the overfitting issue of the combination between Random Forest Feature Elimination and BRNN. Specifically, the comparison of indicators between training model and testing model is expected to indicate whether the overfitting issue occurred in this study. As shown in Table 2, the differences of indicators (i.e., MAE and RMSE) between training model and testing model were not big across nine iterations. Therefore, the combination of Random Forest and BRNN did not have overfitting issue.
The performance of nine times iteration was shown in Appendix A. As shown in Table 3, BRNN with 38 variables showed slightly better MAE and RMSE, but statistically, they were not significantly different from the t-test of MAE (See Table 4). BRNN with 38 variables showed significantly better prediction compared to the other two kinds of BGLMs from t-test of MAE. BRNN with 58 variables showed significantly better prediction compared to the other two kinds of BGLMs from t-test of MAE. BGLM with 58 variables showed slightly better MAE and RMSE, but statistically, they are not significantly different from the 70% of RMSE (See Table 5). Both BRNNs with 38 variables and 58 variables showed lower than 70% of BGLM with 38 variables from 70% of RMSE. Both BRNNs with 38 variables and 58 variables showed very close to 70% of BGLM with 58 variables.
In addition, seeing the standard deviation of MAE and RMSE, the prediction from BGLMs is more fluctuated than the prediction from BRNNs regardless of the number of variables. With the standard deviation of MAE and RMSE, the prediction from 38 variables with BRNN showed lower fluctuation than the prediction from 58 variables with BRNN.

7. Discussion

This study analyzed 1929 variables out of all the financial and non-financial information that consisted of general information, financial information, key management indices, and major business activities of all domestic banks in South Korea. First, this study confirmed that 478 variables out of 1929 variables are the significant indicators affecting the total BIS ratio when using the Random Forest Boruta method. Second, 58 out of 478 variables were selected as important variables for representing BIS ratio analyzed by the Random Forest recursive feature elimination technique. The 38 variables in Table 6 were confirmed as the most important determinants for the prediction of the total BIS ratio. The 58 variables were introduced in Appendix B.
The following determinants were selected as the Top 10 key indicators of the 38 variables through importance ranking: Tier 1 Capital Ratio; Borrowings_Bonds Payable_(Discount Present Value):Percentage; Borrowings:Percentage; Acceptances and Guarantees Others; Acceptances and Guarantees; Borrowings_Bonds Payable: Percentage; Borrowings_Borrowings:Percentage; Receivable Charge-Offs; Other Liabilities_(Transfer from National Pension):Amount; and Fixed Asset_Tangible Assets Used for Business Purpose_(Accumulated Depreciation):Amount.
The construction of Loans Receivable (Industries) asset can be selected as a Top 10 variable by excluding a Tier 1 capital ratio representing default values. Loans Receivable (Industries) asset is composed of Agricultural, Forestry, Fishery; Manufacturing; Construction; Wholesale and Retail; Transportation; Hotels and Restaurants; Telecommunications; Real Estate, Leasing and Others for a total of eight items.
Additionally, Top 10 variables were analyzed by classifying them into Assets, Stockholders’ Equity, and Liabilities, which are factors related to direct measurement of the financial position based on the Balance Sheet. First, the Total Assets element consists of Cash and Due from Banks, Securities, Loans, Tangible Assets, and Other Assets on the Balance Sheet. As a result of the analysis, only Tangible Assets used for Business (Accumulated Depreciation (-)) was included, as it is an account item of the Tangible Assets category.
Second, the Total Liabilities element consists of Depository Liabilities, Borrowings & Bonds Payable, Other Liabilities, Share Capital Repayable on Demand category of the Balance Sheet. As a result of the analysis, four variables are confirmed for each, Borrowings (Discount Present Value) (-); Borrowings & Bonds Payable (Percentage); Bonds Payable (Percentage); and Borrowings (Percentage) in the Borrowings & Bonds Payable element. Allowance Accounts (Transfer from National Pension (-)) was selected in the Other Liabilities element. Accordingly, a total of five account items were confirmed in the Liabilities element.
Third, the Total Stockholders’ Equity element consist of Capital Stock, Other Equity Instruments, Capital Surplus, Retained Earnings, Capital Adjustment, and Accumulated Other Inclusive Gain(loss) category on the Balance Sheet. The results of the analysis confirmed the Tier 1 capital ratio, which is the key indicator corresponding those total stockholders’ equity elements.
Finally, Footnotes and Off-balance Accounts consist of Acceptances and Guarantees Outstanding, Acceptances and Guarantees, Commitments, Bills Endorsed, Derivative Contracts, Contracts on Credit Derivatives Purchased, Checks & Bills on Clearing, Receivable Charge-Offs, OTC Bonds Sold and Loans Sold under Repurchase Agreements category on the Balance Sheet. Among them, three variables, Acceptances and Guarantees On L/C and On Others, Acceptances and Guarantees, and Receivable Charge-Offs, were confirmed as account items representing the Footnotes.
As shown in the above results, five account items of Total Liabilities, three account items of Footnotes and one account item of Total Assets were selected with the Tier 1 capital ratio in top 10 variables. The Tier 1 capital ratio was an expected result because it showed banks’ core equity capital against their total risk weighted assets. In the asset item, only Tangible Assets Used for Business (Accumulated Depreciation (-)) was selected. Accumulated depreciation means all recorded depreciation to calculate the present value of Fixed Assets on the asset line in the balance sheet. In the Footnotes, Acceptances and Guarantees On L/C and On Others consist of Payment guarantees confirmed by a bank, but the main obligation has not yet been determined. The Receivable Charge-Offs Accounts was selected for the account for the process of the amortized receivable balance.
In the Liabilities, five of the top 10 variables were covered for classifying into Assets, Stockholders’ Equity, and Liabilities based on the Balance Sheet. Especially, Borrowings & Bonds Payable, which are borrowed after the contract of the return of the principal and interest, showed most of the account items with four variables, except for the Transfer from the National Pension amount for Other Liabilities. Interestingly, Deposits in Won (Mutual Installment Deposits: Percentage) was ranked at 106th as account item of Depository Liabilities, thus it was excluded from the final result even though it represents the structure of sources of funds for bank with Equity Capital, Borrowings, Bonds Payable, and Other Liabilities. In addition, Liabilities cannot be found by BIS formula since BIS capital adequacy ratio is not considering the liability in the formula.
As such, the results by the machine learning technique confirmed that the total liabilities elements are important predictors, which represent the BIS capital adequacy ratio.
This result shows various factors affected both directly and indirectly, which needs to be considered for BIS prediction since various factors are influenced by several factors with either direct or indirect correlation.
In this study, there was found a direct or indirect correlation between Assets, Stockholders’ Equity and Liabilities. According to Heo (2020), the various factors from multiple systems are interacting and influencing the outcome on the dynamics of ecological and systemic framework. Therefore, large numbers of predictors from multiple systems around an organization need to be considered carefully for granular control of the financial soundness managerial performance. In addition, the total BIS ratio is calculated by the standard approach or the IRB approach, based on the exposure with risk. Therefore, it is not possible to simply predict total risk for BIS Capital Adequacy Ratio based on a preset of the determinants from previous literature. However, since this study highlighted the key findings (i.e., various determinants from diverse systems) by using machine learning approach, it has contributed to the policymakers as well as the managers and practitioners in the bank-related fields.
To support the discussion above, the statistical validity for the future predictions was compared and showed that the machine learning BRNN method was significantly better at predicting the total BIS ratio, which showed superior accuracy and performance compared to the BGLM. This result supports the previous literature (Bosarge 1993; Heo 2020; Heo et al. 2020; Linoff and Berry 2011; Thompson 2014; Ye 2013).

Author Contributions

J.P.—Conceptualization, data collection, original draft, investigation; M.S.—Project administration, writing revision, supervision; W.H.—Methodology, writing and review, data management. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Resampling Performance Over Subset Size

VariablesRMSERsquaredMAERMSESDRsquaredsdMAESD
11.146456240.521613920.915970480.369449150.250136720.30473594
20.834889150.695432090.658718930.285408530.243949320.23308458
30.678938780.793623390.540147380.240174450.202807340.20077234
40.626928110.834685610.500011680.217326370.170585450.17891839
50.574979850.863232630.453969740.20053590.122181310.16157981
60.496432680.893508690.385847880.191087130.109804850.14230643
70.464777580.910609330.361861550.18497040.092822030.14001393
80.444322760.918518110.344258330.168591130.085824310.12287298
90.426946480.926336850.330152630.167045460.075083130.11987712
100.410429640.931483020.316526930.15918510.07786890.1117455
110.400454950.932709290.309610610.15779010.076365890.10924144
120.391096510.935006840.301247060.156658370.075706520.11049778
130.379493070.939991630.293057740.152460070.072135410.10936046
140.37476620.942749160.288567820.146818980.063548680.10608978
150.364604010.945570230.281590990.146784790.058992020.10561506
160.358019080.948143480.277765220.140226830.05909240.10467481
170.351614320.95037220.272643810.140296140.055759430.10401329
180.343063660.952699710.264578820.137490410.055337910.09966472
190.343466630.954158720.264877920.131320920.048719660.09553778
200.338672160.954262850.260674390.131421730.050401020.09473011
210.341254510.952266570.260753250.138095730.05719210.0977487
220.337409050.953618330.258601150.139570960.060416240.09890045
230.334518220.953098570.255961870.142028120.061273140.09942867
240.335854020.953504440.255924160.142706430.062588730.09930184
250.334404360.953716270.256834290.144079210.06431460.10102568
260.333095690.954978050.255309760.140017550.058741750.09814218
270.330599040.954653010.252945320.137929760.059353990.09573769
280.33262360.954064080.255041380.140398860.064599860.09684133
290.330705120.954236250.254717990.139884590.061439980.09745162
300.330261330.954378320.253947030.142708390.064131260.09832754
310.33143820.954832920.254525920.136971780.059757620.09509734
320.329504840.955353790.254115150.139074560.058056110.09703359
330.329513930.955531310.253861250.139397430.058724020.09830013
340.330217710.955480040.254306480.14024020.056737060.09915603
350.327091890.956001840.251011740.136782940.057952150.09605582
360.32872370.955558930.252984140.139409360.055962660.09943597
370.326905240.956509360.250452440.137060890.052928010.09645311
380.329038350.955333340.250323010.139272930.054543180.09675861
390.325957640.956165630.249304080.136983860.055530990.09553185
400.323340270.957529410.247373940.135809490.05496320.09521656
410.322984610.957563630.248159070.132966810.051773160.09404707
420.321190020.957891690.245449740.137526530.055109580.09449879
430.323298420.957524970.247764680.136570250.054867170.09473953
440.324467050.957397040.248951910.136725970.056071820.09412781
450.320054370.95906590.244308840.137591810.050032510.09617369
460.323010060.958437160.247604510.13727330.050212540.09559924
470.320806980.958624850.245437910.134730290.048565810.09411708
480.32098670.958942090.244649880.137122680.051115230.09657376
490.319265990.959009480.243969940.137604970.051169150.09720571
500.317708390.959868920.241679170.136854670.047820060.09547324
510.319755910.959168420.242991390.137727050.053354690.09567098
520.317593630.960036220.24216580.136341340.048860020.09466409
530.318244830.959702730.242869840.131927110.04977330.0929538
540.318592780.959670060.242380030.136491420.052696580.09526267
550.317166270.960800820.242189710.133966210.048014560.09443763
560.31649210.960849160.240449040.131535270.047825320.09130198
570.317235760.960129410.240970470.136780160.051945510.09499573
580.314430860.961331160.239653060.134138160.046424940.09368547

Appendix B. Resampling Performance Over Subset Size: Important Predictors by RFE

NumOverallVariance (English)
122.151384Tier 1 Capital Ratio
211.111107Borrowings_Bonds Payable_(Discount Present Value):Percentage
39.3877522Borrowings:Percentage
48.9370711Acceptances and guarantees others
58.6561452Acceptances and Guarantees
68.2861521Borrowings_Bonds Payable:Percentage
77.9801142Borrowings_Borrowings:Percentage
87.8963313Receivable Charge-Offs
97.7832859Other Liabilities_(Transfer from National Pension):Amount
107.7440828Fixed Asset_Tangible Assets Used for Business Purpose_((Accumulated Depreciation)):Amount
117.6228497Construction
127.5957596Financing Without Cost_Other Non-cost Bearing Financing:(Average)
137.2821885Financing Without Cost_Provision for Other Allowances:Percentage
147.24095Performing Asset Management_Due From Banks in Won:(Average)
156.6978429Other Liabilities_Account for Agency Business_Grio Account:Amount
166.5836249Financing Without Cost_Demand Deposits:Percentage
176.4640151Loans in won _ Average Interest rate
186.4579122Financing With Cost_Borrowings in Won:Percentage
196.4524425Non-Performing Asset Management_Others:Average
206.4328985Performing Asset Management_Due From Banks in Won:Percentage
216.4170021General and Administrative Expenses_Amortization of Intangible assets:Current Quarter
226.3741191Financing Without Cost_Other Non-cost Bearing Financing:Percentage
236.3035924Operation_Loans & Discounts:Percentage
246.2622486Non-Performing Asset Management_Cash & Checks and Foreign Currency:Percentage
256.1880531Derivative Contracts
266.1864739Other Liabilities_Allowance Accounts_Allowance for Severance and Retirement Benefits_(Plan Assets)_(Due from Pension Plan):Percentage
276.0419083Loans_Loans & Discounts in Won Loans to Enterprise:Percentage
286.0393983Fixed Asset_Tangible Assets_Buildings Used for Business Purpose:Amount
296.0326002Asset Management for Benefit_Other Won-Denomiated Currency Asset Management:Average
306.0161628Securities_Banking Accounts (Available-for-Sales Securities)_ Available-for-Sales Securities in Won_Others: Amount
315.9820987Bond Accounts:Amount
325.9736885Personal Pension Trust:Percentage
335.9424913Deposits in Won _ Average Interest rate
345.9243753Interest_Interest and Dividends on Securities_Interest on Trading Securities:Current Quarter
355.8140847Loans_Credit Card Accounts_Cash Service:Percentage
365.7222446Securities-Banking Accounts (Subsidiaries)_Equity Investment (Won)_Consolidated Subsidiary Stock: Amount
375.7078509Loans & Discounts_Loans on Real Estate Collateral:Amount
385.7019038Non-Performing Asset Management_Fixed Assets Used for Business Purposes:Percentage
395.6784716Other Liabilities_Accrued Expenses Payable:Percentage
405.6422311Interest_Available-for-Sales Securities Interest:Current Quarter
415.6110259Deposits in Won_Mutual Installment Deposits
425.6005534Loans_Allowance for Credit Losses on Other Loans_Credit Card Accounts:Percentage
435.5961849(Allowance for Credit Losses) Amount
445.5531814Collateral_Others
455.4702801Nonoperating Income_Rental income:Current Quarter
465.4684209Securities-Banking Accounts (Subsidiaries)_Equity Investment (Won)_Consolidated Subsidiary Stock:Percentage
475.4506053Allowance for Credit Losses on Other Loans:Amount
485.4476118Loansoff-Shore Loans in Foreign Currency:Percentage
495.4371039Loans_Loans & Discounts in Won Interbank Loans:Amount
505.4077926Total Financing & Operation:Average
515.4046407Securities-Banking Accounts (Securities-Held-to-Maturity Securities):Securities (Foreign Currencies):Percentage
525.380378Financing_Trust Accounts:Average
535.3780784Depository Liabilities_Deposits (Won)_Trust Account:Percentage
545.3771052Performing Asset Management_Securities (Foreign Currencies):Percentage
555.3696979Securities-Banking Accounts (Securities-Held-to-Maturity Securities):Securities (Foreign Currencies)_Debentures:Amount
565.3564302Performing Asset Management_Loans in Foreign Currency:Percentage
575.3204636Loans_Loans on Trust Benefit Collateral:Percentage
585.3196173Consolidated Capital Surplus:Amount

References

  1. Bazarbash, Majid. 2019. FinTech in Financial Inclusion Machine Learning Applications in Assessing Credit Risk. IMP Working Paper No. WP/19/109. Basel: International Monetary Fund. [Google Scholar]
  2. Bank for International Settlements. 2020. About BIS—Overview. Bank for International Settlement. Available online: https://www.bis.org/about/index.htm?m=1%7C1 (accessed on 25 December 2020).
  3. Bosarge, W. E. 1993. Adaptive processes to exploit the nonlinear structure of financial markets. In Neural Networks in Finance and Investing. Chicago: Probus Publishing, pp. 371–402. [Google Scholar]
  4. Breiman, Leo. 2001. Random forests. Machine Learning 45: 5–32. [Google Scholar] [CrossRef] [Green Version]
  5. Burden, Frank, and Dave Winkler. 2008. Bayesian regularization of neural networks. In Artificial Neural Networks. Tolowa: Humana Press, pp. 23–42. [Google Scholar]
  6. Fsc.go.kr. 2020. Financial Services Commission, Early Implementation of Basel III Credit Risk Framework. Available online: https://www.fsc.go.kr/eng/pr010101/22439 (accessed on 20 December 2020).
  7. Gambacorta, Leonardo, Yiping Huang, Han Qiu, and Jingyi Wang. 2019. How do Machine Learning and Non-Traditional Data Affect Credit Scoring? New Evidence from a Chinese Fintech Firm (BIS Working Paper No. 834). Bank of International Settlements. Available online: https://www.bis.org/publ/work834.htm (accessed on 20 December 2020).
  8. Genuer, Robin, Jean-Michel Poggi, and Christine Tuleau-Malot. 2010. Variable selection using random forests. Pattern Recognition Letters 31: 2225–36. [Google Scholar] [CrossRef] [Green Version]
  9. Guyon, Isabelle, Jason Weston, Stephen Barnhill, and Vladimir Vapnik. 2002. Gene selection for cancer classification using support vector machines. Machine Learning 46: 389–422. [Google Scholar] [CrossRef]
  10. Heo, Wookjae. 2020. Theoretical background: A new theoretical framework for financial planning with the case of life insurance demand—Dynamic ecological systemic framework. In The Demand for Life Insurance. Cham: Palgrave Pivot, pp. 19–46. [Google Scholar]
  11. Heo, Wookjae, Jae Min Lee, Narang Park, and John E. Grable. 2020. Using artificial neural network techniques to improve the description and prediction of household financial ratios. Journal of Behavioral and Experimental Finance 25: 100273. [Google Scholar] [CrossRef]
  12. Ho, Tin Kam. 1998. The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence 20: 832–44. [Google Scholar]
  13. Hyndman, Rob J., and Anne B. Koehler. 2006. Another look at measures of forecast accuracy. International Journal of Forecasting 22: 679–88. [Google Scholar] [CrossRef] [Green Version]
  14. Korea Institute of Financia. 2016. The improvement and implications for risk-weighted asset calculation method of Basel Bank Supervisory Commission. International Financial Issues 25: 14–15. [Google Scholar]
  15. Kohavi, Ron, and George H. John. 1997. Wrappers for feature subset selection. Artificial Intelligence 97: 273–324. [Google Scholar] [CrossRef] [Green Version]
  16. Kursa, Miron B., and Witold R. Rudnicki. 2010. Feature selection with the Boruta package. Journal of Statistical Software 36: 1–13. [Google Scholar] [CrossRef] [Green Version]
  17. Liaw, Andy, and Matthew Wiener. 2002. Classification and regression by random forest. R News 2: 18–22. [Google Scholar]
  18. Linoff, Gordon S., and Michael J. A. Berry. 2011. Data Mining Techniques: For Marketing, Sales, and Customer Relationship Management. Hoboken: John Wiley & Sons. [Google Scholar]
  19. MacKay, David J. C. 1992. Bayesian Interpolation. Neural Computation 4: 415–47. [Google Scholar] [CrossRef]
  20. Nilsson, Roland, José M. Peña, Johan Björkegren, and Jesper Tegnér. 2007. Consistent feature selection for pattern recognition in polynomial time. Journal of Machine Learning Research 8: 589–612. [Google Scholar]
  21. Petropoulos, Anastasios, Vasilis Siakoulis, Evaggelos Stavroulakis, and Aristotelis Klamargias. 2019. A robust machine learning approach for credit risk analysis of large loan level datasets using deep learning and extreme gradient boosting. In Bank for International Settlements. IFC Bulletins. Basel: Bank for International Settlements, vol. 49. [Google Scholar]
  22. Sariev, Eduard, and Guido Germano. 2020. Bayesian regularized artificial neural networks for the estimation of the probability of default. Quantitative Finance 20: 311–28. [Google Scholar] [CrossRef]
  23. Thompson, Wayne. 2014. Data mining methods and the rise of big data. In Big Data, Mining, and Analytics. New York: Auerbach, p. 71. [Google Scholar]
  24. Wooldridge, Jeffrey M. 2016. Introductory Econometrics: A Modern Approach. Mason: Nelson Education. [Google Scholar]
  25. Ye, Nong. 2013. Data Mining: Theories, Algorithms, and Examples. Boca Raton: CRC Press. [Google Scholar]
  26. Zhou, Qifeng, Hao Zhou, Qingqing Zhou, Fan Yang, and Linkai Luo. 2014. Structure damage detection based on random forest recursive feature elimination. Mechanical Systems and Signal Processing 46: 82–90. [Google Scholar] [CrossRef]
  27. Fss.or.kr. 2020. Financial Supervisory Service, Basel Regulations Related Report. Available online: http://www.fss.or.kr/fss/kr/bbs/list.jsp?bbsid= 1207396739395&url=/fss/kr/1207396739395 (accessed on 25 August 2020).
  28. National Law Information Center. 2020a. Banking Act §34. Available online: http://www.law.go.kr/%eb%b2% 95%eb%a0%b9/%ec%9d%80%ed%96%89%eb%b2%95 (accessed on 24 August 2020).
  29. National Law Information Center. 2020b. Detailed Regulations on Supervision of Banking Business. Appendix §17. Available online: http://www.law.go.kr/%ed%96%89%ec% a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ec%97%85%eb%ac%b4%ec%8b%9c%ed%96%89%ec%84%b8%ec%b9%99 (accessed on 25 August 2020).
  30. National Law Information Center. 2020c. Detailed Regulations on Supervision of Banking Business. Appendix §3-2-2. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95% ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ec%97%85%eb%ac%b4%ec%8b%9c%ed%96%89%ec%84%b8%ec%b9%99 (accessed on 25 August 2020).
  31. National Law Information Center. 2020d. Detailed Regulations on Supervision of Banking Business. Appendix §3. Available online: http://www.law.go.kr/%ed%96%89% ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ec%97%85%eb%ac%b4%ec%8b%9c%ed%96%89%ec%84%b8%ec%b9%99 (accessed on 25 August 2020).
  32. National Law Information Center. 2020e. Detailed Regulations on Supervision of Banking Business. Appendix §3-3. Available online: http://www.law.go.kr/%ed%96%89%ec% a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ec%97%85%eb%ac%b4%ec%8b%9c%ed%96%89%ec%84%b8%ec%b9%99 (accessed on 25 August 2020).
  33. National Law Information Center. 2020f. Detailed Regulations on Supervision of Banking Business. Appendix §3-4. Available online: http://www.law.go.kr/%ed%96%89%ec% a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ec%97%85%eb%ac%b4%ec%8b%9c%ed%96%89%ec%84%b8%ec%b9%99> (accessed on 25 August 2020).
  34. National Law Information Center. 2020g. Enforcement Decree of The Banking Act §20. Available online: http://www.law.go.kr/%eb%b2%95%eb%a0%b9/%ec%9d%80%ed%96%89%eb%b2%95%ec%8b%9c%ed%96%89%eb%a0%b9 (accessed on 25 August 2020).
  35. National Law Information Center. 2020h. Regulations on Supervision of Banking Business §26. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ea%b7%9c%ec%a0%95 (accessed on 25 August 2020).
  36. National Law Information Center. 2020i. Regulations on Supervision of Banking Business §33. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ea%b7%9c%ec%a0%95 (accessed on 25 August 2020).
  37. National Law Information Center. 2020j. Regulations on Supervision of Banking Business §34. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ea%b7%9c%ec%a0%95 (accessed on 25 August 2020).
  38. National Law Information Center. 2020k. Regulations on Supervision of Banking Business §35. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ea%b7%9c%ec%a0%95 (accessed on 25 August 2020).
  39. National Law Information Center. 2020l. Regulations on Supervision of Banking Business §36. Available online: http://www.law.go.kr/%ed%96%89%ec%a0%95%ea%b7%9c%ec%b9%99/%ec%9d%80%ed%96%89%ec%97%85%ea%b0%90%eb%8f%85%ea%b7%9c%ec%a0%95 (accessed on 25 August 2020).
Figure 1. Analytic Algorithm Methodology: Deep Learning Prediction and Regression Prediction Algorithms.
Figure 1. Analytic Algorithm Methodology: Deep Learning Prediction and Regression Prediction Algorithms.
Risks 09 00032 g001
Figure 2. Significant Variables Using the Random Forest Boruta Method.
Figure 2. Significant Variables Using the Random Forest Boruta Method.
Risks 09 00032 g002
Figure 3. Importance of Variables Using Random Forest Recursive Feature Elimination Method.
Figure 3. Importance of Variables Using Random Forest Recursive Feature Elimination Method.
Risks 09 00032 g003
Table 1. Variables Predicting Bank for International SettlementValues Using the Random Forest Boruta Method.
Table 1. Variables Predicting Bank for International SettlementValues Using the Random Forest Boruta Method.
  • The 1299 variables were selected as 299 important, 1165 unimportant, and 465 tentative. Boruta performed 99 iterations in 23.7025 min.
    (1)
    299 attributes confirmed as important: avg BIS, Tier 1 Capital Ratio, Household Money-In-Trust (amount), Household Loans Total, Households and 294 more.
    (2)
    1165 attributes confirmed as unimportant: Retained earnings; loan loss reserve; expected transfer amount—Reserve for credit loss; Unaccumulated carryover amount; Additional transfer amount, Automated Teller Machine, Certificate of Deposit, Household Loans, Household Loans Depreciation and 1160 more variables.
    (3)
    465 tentative attributes left: Accumulated Other Comprehensive Income\\Allowance, Certificate of Depsit, Automated Teller Machine, Household Money Trust Percentage, Household Loans Allowance, Household Loans for Repayment, and 460 more variables.
  • Machine learning analysis was performed by reclassifying the secondary variable selection criteria into important and unimportant. Using 1165 unimportant and 465 tentative, Boruta performed 99 iterations in 23.7025 min. Tentative roughness fixed over the last 99 iterations.
    (1)
    478 attributes confirmed as important: avgBIS, Tier 1 Capital Ratio, Household Money-In-Trust (amount), Household Loans Allowance, Household Loans Total, and 486 more.
    (2)
    1451 attributes confirmed unimportant: Accumulated Other Comprehensive Income Allowance, Retained Earnings, Loan Loss Reserve, Expected Transfer Amount - Reserve for Credit Loss, Unaccumulated Carryover Amount, Additional Transfer Amount, ATM, CD, CDATM, and 1445 more variables.
Table 2. Difference of Indicators between Training Model and Testing Model.
Table 2. Difference of Indicators between Training Model and Testing Model.
IterationIndicatorBRNN
With 38 vars
BRNN
With 58 vars
Training ModelPrediction ModelTraining ModelPrediction Model
1Neuron #20201616
MAE0.40080.40590.38070.4072
RMSE0.52650.57370.52880.5775
R20.8901 0.8937
2Neuron #4433
MAE0.44750.38580.44610.3849
RMSE0.62140.55070.58010.5497
R20.8577 0.8727
3Neuron #5533
MAE0.47420.44290.48150.4430
RMSE0.65510.80900.63600.8091
R20.8270 0.8505
4Neuron #441010
MAE0.43300.40400.38780.4065
RMSE0.56170.62540.54450.6274
R20.8808 0.8996
5Neuron #181833
MAE0.41770.38510.39610.3854
RMSE0.57880.54690.53830.5471
R20.8731 0.8690
6Neuron #16162030
MAE0.44370.44550.40820.3493
RMSE0.59750.67180.55750.5102
R20.8374 0.8768
7Neuron #331010
MAE0.43940.39540.36880.4604
RMSE0.57010.58430.80420.6297
R20.8694 0.8950
8Neuron #442020
MAE0.41140.35850.37520.3667
RMSE0.54130.55860.49880.5977
R20.8893 0.9086
9Neuron #331010
MAE0.45390.39710.35020.4984
RMSE0.57710.57570.47640.7602
R20.8752 0.9176
Note. MAE denotes mean absolute error; RMSE is root mean of squared error; BRNN means Bayesian Regularization Neural Networks. Neuron # indicates the optimal number of neurons that was selected by machine learning. 50-folds resampling per each iteration was used as cross validation to calculate MAE, RMSE, and R2.
Table 3. MAE and RMSE from 9 iterations of 340 companies.
Table 3. MAE and RMSE from 9 iterations of 340 companies.
BRNN
With 38 vars
BRNN
With 58 vars
Bayesian GLM
With 38 vars
Bayesian GLM
With 58 vars
MAE (S.D.)0.4022 (0.4663)0.4113 (0.4776)0.5477 (1.0433)0.5119 (0.7689)
RMSE (S.D.)0.6107 (0.0796)0.6232 (0.0943)1.000 (0.6229)0.8583 (0.3412)
Note. GLM means generalized linear modeling.
Table 4. MAE Comparison: Mean Comparison (t-test, n = 340 companies * 9 iterations).
Table 4. MAE Comparison: Mean Comparison (t-test, n = 340 companies * 9 iterations).
BRNN
(38 var)
BRNN
(58 var)
BGLM
(38 var)
BGLM
(58 var)
BRNN
(38 var)
BRNN
(58 var)
−0.7542
BGLM
(38 var)
−7.0432 ***−6.5759 ***
BGLM
(58 var)
−6.7482 ***−5.6656 ***1.5280
* p < 0.05; *** p < 0.001.
Table 5. RMSE comparison: 70% of compared model.
Table 5. RMSE comparison: 70% of compared model.
BRNN
(38 var)
BRNN
(58 var)
BGLM
(38 var)
BGLM
(58 var)
RMSE0.61070.62321.0000.8583
70% of RMSE 0.7000.6008
Table 6. 38 Important Predictors According to Random Forest RFE by Machine Learning Techniques.
Table 6. 38 Important Predictors According to Random Forest RFE by Machine Learning Techniques.
Report
(Financial Statics Information System)
(Importance Rank) by Variance (Item) of Domestic Bank
Capital Adequacy(1) Tier 1 Capital Ratio
Consolidated Balance Sheet (Liabilities & Shareholders’ Equity-Banking Account)(2) Borrowings_Bonds Payable_(Discount Present Value):Percentage
(3) Borrowings:Percentage
(6) Borrowings_Bonds Payable:Percentage
(7) Borrowings_Borrowings:Percentage
Loans Receivable (Industries)(11) Construction
Off-balance Accounts (Bank Accounts)(4) Acceptances and Guarantees Others
(5) Acceptances and Guarantees
(8) Receivable Charge-Offs
(25) Derivative Contracts
Principal Sources of Cash Flows in Bank Accounts(12) Financing Without Cost_Other Non-cost Bearing Financing:(Average)
(13) Financing Without Cost_Provision for Other Allowances:Percentage
(14) Performing Asset Management_Due From Banks in Won:(Average)
(16) Financing Without Cost_Demand Deposits:Percentage
(18) Financing With Cost_Borrowings in Won:Percentage
(19) Non-Performing Asset Management_Others:Average
(20) Performing Asset Management_Due From Banks in Won:Percentage
(22) Financing Without Cost_Other Non-cost Bearing Financing:Percentage
(24) Non-Performing Asset Management_Cash & Checks and Foreign Currency:Percentage
(29) Asset Management for Benefit_Other Won-Denomiated Currency Asset Management:Average
(38) Non-Performing Asset Management_Fixed Assets Used for Business Purposes:Percentage
Principal Sources of Cash Flows in Trust Accounts(23) Operation_Loans & Discounts:Percentage
Profitability(17) Loans in won _ Average Interest rate
(33) Deposits in Won _ Average Interest rate
Summarized Balance Statement (Assets-Banking Account)(10) Fixed Asset_Tangible Assets Used for Business Purpose_(Accumulated Depreciation):Amount
(27) Loans_Loans & Discounts in Won Loans to Enterprise:Percentage
(28) Fixed Asset_Tangible Assets_Buildings Used for Business Purpose:Amount
(30) Securities_Banking Accounts (Available-for-Sales Securities) Available-for-Sales Securities in Won_Others: Amount
(35) Loans_Credit Card Accounts_Cash Service:Percentage
(36) Securities-Banking Accounts(Subsidiaries)_Equity Investment (Won) Consolidated Subsidiary Stock: Amount
Summarized Balance Statement (Assets-Trust Account)(31) Bond Accounts:Amount
(37) Loans & Discounts_Loans on Real Estate Collateral:Amount
Summarized Balance Statement (Liabilities & Trust Account) (32) Personal Pension Trust:Percentage
Summarized Balance Statement (Liabilities &Shareholders’ Equity-Banking Account) (9) Other Liabilities_(Transfer from National Pension):Amount
(15) Other Liabilities_Account for Agency Business_Grio Account:Amount
(26) Other Liabilities_Allowance Accounts_Allowance for Severance and Retirement Benefits_(Plan Assets)_(Due from Pension Plan):Percentage
Summarized Income Statement (Banking Account)(21) General and Administrative Expenses_Amortization of Intangible assets:Current Quarter
(34) Interest_Interest and Dividends on Securities_Interest on Trading Securities:Current Quarter
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Park, J.; Shin, M.; Heo, W. Estimating the BIS Capital Adequacy Ratio for Korean Banks Using Machine Learning: Predicting by Variable Selection Using Random Forest Algorithms. Risks 2021, 9, 32. https://doi.org/10.3390/risks9020032

AMA Style

Park J, Shin M, Heo W. Estimating the BIS Capital Adequacy Ratio for Korean Banks Using Machine Learning: Predicting by Variable Selection Using Random Forest Algorithms. Risks. 2021; 9(2):32. https://doi.org/10.3390/risks9020032

Chicago/Turabian Style

Park, Jaewon, Minsoo Shin, and Wookjae Heo. 2021. "Estimating the BIS Capital Adequacy Ratio for Korean Banks Using Machine Learning: Predicting by Variable Selection Using Random Forest Algorithms" Risks 9, no. 2: 32. https://doi.org/10.3390/risks9020032

APA Style

Park, J., Shin, M., & Heo, W. (2021). Estimating the BIS Capital Adequacy Ratio for Korean Banks Using Machine Learning: Predicting by Variable Selection Using Random Forest Algorithms. Risks, 9(2), 32. https://doi.org/10.3390/risks9020032

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop