Next Article in Journal
What Future for the Green Bond Market? How Can Policymakers, Companies, and Investors Unlock the Potential of the Green Bond Market?
Next Article in Special Issue
The Application of Graphic Methods and the DEA in Predicting the Risk of Bankruptcy
Previous Article in Journal
CEO Diversity, Political Influences, and CEO Turnover in Unstable Environments: The Romanian Case
Previous Article in Special Issue
Assessment of Bankruptcy Risk of Large Companies: European Countries Evolution Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Support Vector Machine Methods and Artificial Neural Networks Used for the Development of Bankruptcy Prediction Models and their Comparison

School of Expertness and Valuation, Institute of Technology and Business in Ceske Budejovice, Okruzni 517/10, 37001 Ceske Budejovice, Czech Republic
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2020, 13(3), 60; https://doi.org/10.3390/jrfm13030060
Submission received: 29 January 2020 / Revised: 19 March 2020 / Accepted: 23 March 2020 / Published: 24 March 2020
(This article belongs to the Special Issue Modern Methods of Bankruptcy Prediction)

Abstract

:
Bankruptcy prediction is always a topical issue. The activities of all business entities are directly or indirectly affected by various external and internal factors that may influence a company in insolvency and lead to bankruptcy. It is important to find a suitable tool to assess the future development of any company in the market. The objective of this paper is to create a model for predicting potential bankruptcy of companies using suitable classification methods, namely Support Vector Machine and artificial neural networks, and to evaluate the results of the methods used. The data (balance sheets and profit and loss accounts) of industrial companies operating in the Czech Republic for the last 5 marketing years were used. For the application of classification methods, TIBCO’s Statistica software, version 13, is used. In total, 6 models were created and subsequently compared with each other, while the most successful one applicable in practice is the model determined by the neural structure 2.MLP 22-9-2. The model of Support Vector Machine shows a relatively high accuracy, but it is not applicable in the structure of correct classifications.

1. Introduction

In financial bankruptcy analysis, the diagnosis of companies at risk for bankruptcy is crucial in preparing to hedge against any financial damage the at-risk firms stand to inflict (Kim et al. 2018). According to Rybárová et al. (2016), bankruptcy models are early warning systems based on the analysis of selected indicators able to identify a thread for financial health of a company. Kiaupaite-Grushniene (2016) states that creation of reliable models of bankruptcy prediction is essential for various decision-making processes. According to Mousavi et al. (2015), frequently used models are mainly Altman Z-Score, Taffler Z-Score, and Index IN95.
A wide number of academic researchers from all over the world have been developing corporate bankruptcy prediction models, based on various modelling techniques. Numerous statistical methods have been developed (Balcaen and Ooghe 2004). Despite the popularity of the classic statistical methods, significant problems relating to the application of these methods to corporate bankruptcy prediction remain. Problems related to statistical methods according to Balcaen and Ooghe (2004, p. 1):
  • The dichotomous dependent variable,
  • The sampling method,
  • Nonstationarity and data instability,
  • The use of annual account information,
  • The selection of the independent variables,
  • The time dimension.
For the purpose of this article, Support Vector Machines (SVM) and artificial neural networks are used. These two methods have been used by many authors to predict corporate bankruptcy, and their results suggest that these two methods are more appropriate than traditional statistical methods (Shin et al. 2005; Xu et al. 2006; Kim et al. 2018; Vochozka and Machová 2018; Machová and Vochozka 2019; Krulický 2019). SVM is sensitive to model form, parameter setting and features selection. SVM, firstly developed by Vapnik in 1995 (Vapnik 1995), is a supervised learning model with associated learning algorithms that analyze data and recognize patterns, used for classification and regression analysis (Burges 1998). According to Lu et al. (2015), compared with other algorithms, SVM has many unique advantages when applied in solving small sample, nonlinear, and high-dimensional pattern recognition problem. The concept of a neural network has been developed in biology and psychology, but its use goes to other areas, such as business and economics (Vochozka 2017). They are especially valuable where inputs are highly correlated, missing, or there are nonlinear systems and they can capture relatively complex phenomena (Enke and Thawornwong 2005). Like any method, SVM or artificial neural networks have disadvantages. Although SVM or artificial neural networks have a good performance on classification accuracy, one main disadvantage of these methods is the difficulty in interpreting the results (Härdle et al. 2009).
The aim of the paper is to develop bankruptcy prediction models and compare results of different methods using classification methods, namely Support Vector Machines and artificial neural networks (multilayer perceptron artificial neural networks—MLP and radial basis function artificial neural networks—RBF). Further to the defined goal, we will ask a research question: “Are artificial neural networks (also NN) more accurate in predicting bankruptcy than SVM?”
The article meets the formal criteria of a scientific text. In the part of literature review there are described methods for evaluation of corporate bankruptcy, attention is paid to artificial neural networks and SVM methods. The methodological part describes used data for the calculation, specifies the particular variables used and presents two above mentioned methods. In the results part there are presented the results achieved by SVM method, then the results obtained by artificial neural networks and the results of both methods are compared. The results are also compared with the results of other authors and the added value of the article is defined. The final part summarizes the results, presents the variables that have the greatest predictive power and suggests further research in this area.

2. Literature Review

Company activities are directly or indirectly influenced by various external and internal factors (Boguslauskas and Adlyte 2010). Purvinis et al. (2005) argue that unfavourable business environment, risky decisions of business managers, and unexpected and disadvantageous events may influence a company in insolvency and lead to bankruptcy. Hafiz et al. (2015) state that bankruptcy models are mainly needed by financial entities, e.g., banks. Their advantage consists especially in their ability to provide clear information about potential risks and eliminate such problems in a timely manner. They are important for current and future decision-making (López Iturriaga and Sanz 2015). Predictive models of financial bankruptcy enable to take timely strategic measures in order to avoid financial distress (Baran 2007). For other stakeholders, such as banks, effective and automated rating tools will enable to identify possible financial distress of potential clients (Gestel et al. 2006). The ability to accurately predict business failure is a very important topic in financial decision-making (Mulačová 2012).
A very useful tool to predict the development of companies going to bankrupt is by using artificial neural networks (ANNs) or Support Vector Machine (SVM). Currently, neural networks are applicable in various areas. ANNS are used for solving possible future difficulties, e.g., for predicting company bankruptcy (Pao 2008; Klieštik 2013). Sayadi et al. (2014) state that their main advantages are the ability to generalize and to learn. According to Machová and Vochozka (2019), the disadvantages of ANNs include possible illogical behaviour of networks and required high quality data. Vochozka and Machová (2018) state that ANNs are currently one of the most popular prediction methods.
The SVM method has become a powerful tool for solving problems in machine learning. Many SVM algorithms include solving of convex problems, such as linear programming, quadratic programming, as well as nonconvex and more general problems with optimization, such as integer programming, bilevel programming, etc. However, there are also certain disadvantages of SVM. An important issue that has not been solved fully is choosing the parameters of the core functions. In practical terms, the crucial problem of SVM is its high algorithmic complexity and extensive requirements for the memory of required quadratic programming in complex tasks (Tian et al. 2012).
The aim of Erdogan (2013) was to apply the SVM method in analysing bank bankruptcy. In this work, the SVM method was applied for the analysis of financial indicators. The author states that SVB is able to extract useful information from financial data and can thus be used as a part of early warning system. Chen and Chen (2011) state that the prediction of financial crisis of a company is an important and widely discussed topic. They used particle swarm optimization (PSO) to obtain optimized parameter settings for the SVM method. Moreover, they used the PSO’s integrated commitment with the SVM approach to create a model of predicting financial crisis. Experimental results have shown that the approach is efficient in finding better parameter settings and significantly improves the success rate in predicting company financial crisis. Since financial indicators are independent variables, Park and Hancer (2012) applied ANNs on bankruptcy of a company operating in catering and compared the results with the results of logit model. On the basis of empirical results of these two methodologies, ANNs showed higher accuracy than logit model in sample testing. Dorneanu et al. (2011) use ANNs for predicting company bankruptcy. According to the authors, the use of ANNs for the prediction is extremely effective, since the percentage of prediction accuracy is higher than in the case of using conventional methods. The objective of Kim (2011) is to provide an optimal approach to company bankruptcy predicting and to explore functional characteristics of multivariate discriminant analysis, ANNs and the SVM method in predicting the bankruptcy of a specific company. The results have shown that ANNs and SVM are models applicable for predicting company bankruptcy and show promising results. On the basis of the information obtained, the objective of this paper can be considered relevant.

3. Materials and Methods

The Albertina database will be the source of data concerning industrial companies operating in the Czech Republic. In terms of sufficient amount of data and in particular the number of companies in liquidation and thus the relevance of the results, more fields within section C—Manufacturing of the CZ-NACE (comes from French – Czech Nomenclature statistique des Activités économiques dans la Communauté Européenne) = Classification of Economic Activities, will be used, namely in the groups 10–33:
  • 10: Manufacture of food products.
  • 11: Manufacture of beverages.
  • 12: Manufacture of tobacco products.
  • 13: Manufacture of textiles.
  • 14: Manufacture of wearing apparel.
  • 15: Manufacture of leather and related products.
  • 16: Manufacture of wood and products of wood and cork, except furniture.
  • 17: Manufacture of paper and paper products.
  • 18: Printing and reproduction of recorded media.
  • 19: Manufacture of coke and refined petroleum products.
  • 20: Manufacture of chemicals and chemical products.
  • 21: Manufacture of basic pharmaceutical products and pharmaceutical preparations.
  • 22: Manufacture of rubber and plastic products.
  • 23: Manufacture of other non-metallic mineral products.
  • 24: Manufacture of basic metals; foundry.
  • 25: Manufacture of fabricated metal products, except machinery and equipment.
  • 26: Manufacture of computer, electronic and optical products.
  • 27: Manufacture of electrical equipment.
  • 28: Manufacture of machinery and equipment.
  • 29: Manufacture of motor vehicles (except motorcycles), trailers and semi-trailers.
  • 30: Manufacture of other transport equipment.
  • 31: Manufacture of furniture.
  • 32: Other manufacturing.
  • 33: Repairs and installation of machinery and equipment.
For the same reasons, the selection of data will not be limited by the size of companies and the number of employees. The output will thus be applicable not only in specific companies, but basically in the whole economic sector.
The data series will consist of five consecutive fiscal years—for each year all the companies in liquidation will be selected and similarly, randomly selected three times the number of active enterprises. The numbers of companies for individual years are then as follows:
  • Year 2013: 488 in liquidation, 1464 active,
  • Year 2014: 416 in liquidation, 1248 active,
  • Year 2015: 354 in liquidation, 1062 active,
  • Year 2016: 287 in liquidation, 862 active,
  • Year 2017: 163 in liquidation, 489 active.
The same companies will be selected for each year. Different numbers are due to the fact that some companies went bankrupt during the monitored period, ceased to be active and went into liquidation, etc. The sample starts in 2013, that is, in the period of constant economic growth following the period of economic crisis. The authors tried to avoid the results of the models to be affected by economic crisis.
Financial statements, specifically balance sheets and profit and loss statements of all the above mentioned companies will be analysed. Table 1 shows selected financial data and their averages per individual years.
The data will be checked. Only the data that, at first sight, is not defective or intentionally distorted will be kept on the file for further analysis. This will eliminate record lines (a line represents financial statements per company and year) including:
  • Different assets and liabilities balance,
  • Negative assets,
  • Negative fixed assets,
  • Negative tangible fixed assets,
  • Negative current assets,
  • Negative financial assets,
  • Negative inventories.
The input continuous variables will be:
  • AKTIVACELK—Total assets resulting from past economic operations. Thus it means the future economic benefit of the company.
  • STALAA—Fixed assets are long-term, fixed and noncurrent. This item includes asset components used for the company business in a long term (more than 1 year) and consumed over time.
  • HIM—Intangible fixed assets will depreciate, expressed by the level of depreciation. Intangible fixed assets have a significant impact on the value of the enterprise, they maintain their value for a longer time and are not exposed to the fast operating cycle.
  • OBEZNAA—Current assets characterize the operating cycle. They continuously circulate and change their form. They include cash, material, semi-finished products, work in progress, products, or receivables from customers.
  • Z—Inventories are current (short-term) assets of the company. They are consumed during operation. In general, inventories include material, inventories for production of its own products and goods
  • KP—Short-term receivables are payable in less than 1 year from the date when their arise and represent the creditor’s right to seek fulfilment of a certain obligation from the other party, the receivable is extinguished when the obligation is paid.
  • FM—Financial assets including long-term and short-term financial assets. Long-term financial assets hold their value for a longer period of time, they do not change into cash quickly. They include securities, bonds, certificates of deposit, obligations, term deposits or loans granted to companies. Short-term financial assets are used for operation, especially for payment of liabilities. Short-term assets represent high liquidity; the expected holding is less than one year. They mainly include money in bank accounts, treasury, checks, clearing notes, valuables or short-term securities and shares.
  • PASIVACELK—Total liabilities—information concerning the source to cover the company’s assets.
  • VLASTNIJM—Equity is the internal source of finance for business assets and capital formation. It includes, in particular, contributions of the founders (owners or partners) to the capital stock and components arising from the business management.
  • FTZZ—Reserve funds, undistributable reserves and other funds from profit represent the company’s internal sources of finance increasing the company’s equity without changing its capital stock. Reserve funds are used as internal resources to cover future losses of the company. Undistributable reserves are created by cooperatives also to cover the loss.
  • HVML—Profit/loss brought forward is part of liabilities, an item of equity. These are resources created after tax in previous years. These are funds which are not transferred to funds or distributed and paid. It consists of three parts - retained earnings, loss carried forward and other profit/loss brought forward.
  • HVUO—Profit and loss of the current financial period is the sum of profit and loss from operations and financial activities in the financial period and the profit before tax. For calculation, the income tax for ordinary activities is deducted.
  • CIZIZDROJE—External resources are the company’s debts which must be paid within a certain period of time. These are the company’s payables to other entities.
  • KZ—Current liabilities are payable within 1 year and used for financing (together with equity) of the normal operation of the company. In particular, they include short-term bank loans, payables to employees and institutions, debts to suppliers or delinquent tax.
  • V—Production is goods and services that are used to meet the needs. They result from business activities of the company and characterize the main business activities—production.
  • VS—Production consumption mainly includes the costs of consumed material, energy, travel expenses, maintenance and repairs, or low-value assets. It is a sum item which correlates with consumption of materials, services and energy.
  • SPMAAEN—Material and energy consumption is an item accounting for inventories - current assets. Energy consumption rises proportionally and positively correlates with the production volume. However, material costs may decrease as the production volume increases. Material consumption is directly dependent on consumption standards and purchase prices.
  • SLUZBY—Services are systematic external activities that satisfy human needs, or the business needs in their own course.
  • PRIDHODN—Value added represents the sales margin, sales, stock level changes of internally produced inventories, or capitalization less production consumption. It includes the sales margin as well as production.
  • MZDN—Payroll costs generally comprise of the employee’s gross wages and premiums paid by the employer for each employee’s social security and health insurance.
  • NNSOCZAB—Employee’s social security and health insurance costs.
  • OHANIM—Depreciation of intangible and tangible fixed assets provides a tool for gradually assigning the value of fixed assets to expenses. Therefore, it means a gradual assignment of the fixed asset cost value to expenses. It represents depreciation of fixed assets.
The categorical output variable will be considered as:
  • STAV—Identifies the situation of the company whether active or in liquidation. There will only be two possible outcomes.
The variables were chosen so that it was possible to express the main features of the company´s capital structure, sources of assets financing, corporate payment history, customers´ payment history, cost structure, and the ability to generate outcomes (sales) and realized added value. The selection of indicators is based on the analysis of the existing Altman Z-Score (Altman 1968, 2000, 2003; Altman and Hotchkiss 2006), IN (Neumaierová and Neumaier 2005, 2008), Taffler index (Taffler and Tisshaw 1977; Taffler 1983), Kralicek Quick Test (Kralicek 1993), Harry Pollak´s method (Pollak 2003), and Vochozka´s method (Vochozka 2010; Vochozka and Sheng 2016; Vochozka et al. 2017). The conditions of external environment are not considered, as all companies in the dataset operate in one market, and therefore they are all influenced equally. The output is thus analogy to certain extent. If patterns are identified (although given by a large number of input variables combinations), it is possible to observe a similar development of two companies showing just about the same combination of input variables on the basis of similarity.
The Statistica software, version 13 of TIBCO will be used to apply the classification methods.

3.1. Support Vector Machines

Machine Learning option in the Data Mining module will be used to apply SVM. The file will be divided into a train (75%) and a test (25%) data subset. Then SVM type 2 will be specified where the error function is identified as:
1 2 w T w C [ v ε + 1 N i = 1 N ( ζ i + ζ i ˙ ) ] ,
which minimizes the entity to:
[ w T ( x 1 ) + b ] y i ε + ζ i y i [ w T ( x 1 ) + b i ] ε + ζ ˙ i ζ i ,   ζ ˙ i 0 ,   i = 1 ,   ,   N ,   ε 0 .
Then the SVM (kernel function) will be selected. In this case, it will be Sigmoid that should be able to identify the extreme values:
K ( X i , X j ) = tan h ( γ X i · X j + C ) ,
where K ( X i , X j ) = φ ( X i ) · φ ( X j ) , which means that SVM function represents an output value of input variables projected in multidimensional space using transformation φ.
The results (value 10, seed 1000) will then be cross-validated. A maximum of 10,000 iterations will be performed with a possible ending in case of the error 0.000001.

3.2. Artificial Neural Networks

Classification analysis based on multilayer perceptron neural networks and radial basis function neural networks. ANS (automatic neural network) mode will be used. In case of unsatisfactory results, the result may be corrected using the custom network designer.
The set will be divided by random into three groups of enterprises—i.e., a train file (where neural networks are trained to achieve the best results)—70% of the data, a test file (identify if the classification of trained neural structures is successful)—15% data and a validation file (used for additional verification of the result)—15% of data. Only MLP and RBF will be used in the calculation. For MLP networks, the minimum number of hidden neurons will be set to 8 and the maximum number to 25 while for RBF, the minimum will be 21 and the maximum will be 30 hidden neurons. The number of networks for training will be 10,000 whereas 5 networks with the best results will be retained. The error function will be the sum of squares:
E S O S = 1 2 N i = 1 N ( y i t i ) 2   ,
where N is number of training cases, yi is predicted target variable ti, ti is target variable of a i-th case.
The BFGS algorithm (Broyden–Fletcher–Goldfrarb–Shanno) will be used for calculation, for more details see Bishop (1995).
Another error function will be entropy (or, cross entropy error function):
E C E = i = 1 N t i ln ( y i t i )   ,
The activation functions shown in Table 2 will be considered for NN.
Neural networks work as follows: the data of a specific company are entered and subsequently, as an independent variable, the data are converted using the activation function and weights into the values of hidden neurons, which are the input variables for the second round of calculation. Here, the activation function and trained weights as used as well. The result obtained is subsequently compared at a given interval, and it is determined whether or not the company is able to survive possible financial distress.
Other settings will remain default. The result will be a bankruptcy model (the development of the company will be evaluated using two variables—survival of the company or a bankruptcy tendency—thus, the dependent variable will only take two values 0 or 1). The model development will be an iterative and recurrent process with actions to improve. The data to be analysed does not have to follow the normal distribution, the dependent variable is binary. The resulting model will have generalized characteristics—it will be applicable for prediction and the efficiency of classification into groups should be better than by chance, i.e., the efficiency of classification should be higher than 50%.

4. Results

4.1. Support Vector Machines

The defined inputs were used for calculation of a SVM model in C ++ code. The basic parameters are: 22 input continuous variables, 1 output categorical variable, classification type 2, Sigmoid function. 1162 vectors were created for active companies and 1161 vectors for companies in liquidation. The relevance of the model is examined in more detail in Table 3.
The accuracy of classifications, or predictions is more than 76%. This is certainly positive in terms of the model success. However, remember that this percentage consists of more than 99% of correct predictions of active companies and only above 8% of predictions of the companies in liquidation. Therefore, the model is not fully applicable in practice.

4.2. Artificial Neural Networks

10,000 artificial neural structures were calculated of which 5 with the best characteristics were retained (see Table 4).
The best characteristics of generated neural structures are exclusively shown by MLP networks. NNs have 22 neurons in the input layer (based on 22 input continuous variables), 6 to 12 neurons in the hidden layer and 2 neurons in the output layer (based on one output categorical variable that can take two values). Entropy was the error function in four cases, the sum of squares in one. The identity, logistic and hyperbolic tangent functions were used to activate the hidden layer of neurons. The logistic and Softmax functions were used to activate the output layer of neurons. The performance of individual networks is always above 81% in the train data set and above 80% in the test data set and above 81% in the validation set. Thus, the performance seems very high. Table 5 shows the performance decomposition.
Ideally, we are looking for a neural structure which shows the highest number of correctly classified cases. However, it is very important for NN to be able to predict (classify) both active companies (i.e., businesses capable of surviving a potential crunch) and companies in liquidation (i.e., businesses in bankruptcy). In this respect, 2.MLP 22-9-2 and 5.MLP 22-12-2 networks appear to be the most successful. There is a minimum difference between them. But a higher number of correct predictions of bankruptcy for 2.MLP 22-9-2 network is more advantageous. The dominance of both networks is illustrated by the chart in Figure 1.
Ideally, the characteristics are close to (0,1). The 2.MLP 22-9-2 and 5.MLP 22-12-2 networks are closest to this point.

4.3. SVM/NN Comparison

It is obvious from the results that the SVM model has a quite high level of reliability. However, the structure of correct classifications, i.e., 99% of correct predictions of active companies and only above 8% of predictions of companies in liquidation, makes this model inapplicable.
On the contrary, five NN models were retained by applying the methodology for creating NN. In all cases, those are MLPs that are applicable in practice. There are minimum differences between networks. Still we can identify the best neural network which is NN 2.MLP 22-9-2 without any doubt: very closely followed by NN 5.MLP 22-12-2. There is just a minimum difference between them.
This answers our research question. In this case, the answer is very simple. Artificial neural networks are much more accurate than SVM in predicting possible bankruptcy. Unlike SVM all retained NNs are well applicable in practice.
It is a bankruptcy model. We thus define a tool to identify the companies unlikely to survive a possible financial distress. In particular, we examine the ability of the tool to identify a company that can be expected to face financial distress in the future. The SVM model showed a great ability to predict the second opposite situation at first glance, that is, the ability of the company to survive a possible financial distress. In this case, the prediction of the model is correct in 99.39% of cases. However, the ability to predict bankruptcy is at the 8.22% level. In general, the SVM model predicts the future development of the company with 76.08% accuracy, which could be considered a good result. However, the problem is that the model would achieve the same or almost the same predictive power even if it did not predict any company that is going to bankrupt. In fact, the SVM method did not meet the requirements, although it shows a rather interesting result. The SVM model is thus nonapplicable.
As the confusion matrix in Table 5 indicates, artificial neural networks show higher prediction power—nearly up to 83%, but what is even more important, they have greater ability to predict companies that are going to bankrupt. Taking into account the most successful neural structure, 2.MLP 22-9-2, its accuracy is 82.79%. It is able to predict correctly 91.92% of companies that are able to survive a potential financial distress, and 56.2% of companies that are going to bankrupt. The prediction is thus applicable in practice.
Now the task is to find a generally acceptable model able to predict a potential financial distress. The Altman Z-Score (Altman 1968, 2000, 2003; Altman and Hotchkiss 2006) and many other models (Neumaierová and Neumaier 2005, 2008; Taffler and Tisshaw 1977; Taffler 1983; Kralicek 1993; Pollak 2003) were based on the data that are not relevant for the current corporate environment (small data volume, data more than 50 years old, etc.). Although the Altman Z-Score is still being used, corporate practice is well aware of their weaknesses. The paper aimed to find an alternative that respect the time lag and which would be easily applicable and showing an appropriate level of accuracy. Very often, it is about being able to detect a potential risk associated with a particular company. Subsequently, we would be able to analyse such a company in more detail, assessing whether the risk is real or not.
This requirement is definitely met by the generated neural networks, in particular 2.MLP 22-9-2. It is based on the current data in the environment where the resulting model of neural networks will be applied. As stated above, it is the first indication of possible problems used as an impulse for a more detailed analysis. The resulting model is interesting from another aspect. Despite its easy applicability, the artificial neural network assesses the future development on the basis of 22 variables characterizing the amount of company assets, structure of its financing, payment history of the company and the customers, cost structure, and the ability to generate sales (as a quantified output of core business). The individual indicators are described in Data and Methods.
Since 2000, many authors have tried to predict company bankruptcy using the models of neural networks. As an example, we can mention Becerra et al. (2002), who analysed the use of linear models and the models of neural networks for the classification of financial distress. Their calculation included 60 British companies from the period between 1997 and 2000. Zheng and Jiang (2007) used the data of Chinese listed companies between 2003 and 2005. All similarly created models are rather outdated, as they use the data that were up to date before the world financial crisis. This paper shows an up-to-date and simple model (most existing studies create relatively complex hybrid models—e.g., Xu et al. 2019), which can be gradually updated using new data, and thus even become more accurate (due to neural networks learning).

5. Discussion and Conclusions

Bankruptcy prediction is always a topical issue. This is due to very complicated business relationships between entrepreneurs and competition in the current business environment. It is characterized by instability, perhaps even turbulence. All the more important is to find a low-input tool that can evaluate future development of any company in the market.
The aim of this paper was to develop bankruptcy prediction models and evaluate the results obtained from classification methods, namely Support Vector Machines and artificial neural networks (multilayer perceptron artificial neural networks—MLP and radial basis function artificial neural networks—RBF).
In total, six models were created: 1 SVM, 5 NN. Consequently, a comparison was made between them. NN 2.MLP 22-9-2 appears to be the most successful model that is applicable in practice (NN code C++ forms). The financial variables with the highest bankruptcy predictive power are presented in Table 6.
The highest bankruptcy predictive power have “Depreciation of intangible and tangible fixed assets”, “Value added” and “Production consumption”. All three items are logical for the manufacturing industry.
The existing models (Altman index, Neumaier index and many others) are based on the standard statistical methods. Their deficiencies were identified by Balcaen and Ooghe (2004):
  • Dependent variable dichotomy,
  • Sampling method,
  • Stationarity and data instability,
  • Selection of variables,
  • Using information from financial statements, and
  • Time dimension.
Neural networks can resolve some of the defined problems. It is primarily the time dimension. For all the existing models, the previous development of the company, consequently evaluated as Active or in Liquidation, cannot be taken into account. Neural networks are able to handle large data volumes. Therefore, the values of variables of selection do not need to be restricted. It may appear that the dataset will be a limit when application for another period and different market (especially when used abroad). However, it is not the case, as we identified a structure with a relatively strong prediction power. Although it was trained and subsequently validated twice on a selected sample, the neural network can be quickly adapted to the specificities of a different market. Artificial neural network can adapt to a new environment by retraining it on a dataset sample of a given market. Due its ability to meet the requirement for changing the setting of its internal parameters, neural network can thus be considered flexible and widely applicable.
The future focus should to collect data other than information from financial statements. It will also be necessary to define the company status other than just Active or in Liquidation. However, the data problem may not be resolved.

Author Contributions

Conceptualization, J.H. and J.V.; methodology, J.H. and J.V.; software, P.S.; validation, J.H., J.V. and P.S.; formal analysis, J.H.; investigation, J.V. and P.S.; resources, J.H.; data curation, J.H.; writing—original draft preparation, J.H. and J.V.; writing—review and editing, P.S.; visualization, J.H. and J.V.; supervision, P.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Altman, Edward I. 1968. Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. The Journal of Finance 23: 589–609. [Google Scholar] [CrossRef]
  2. Altman, Edward I. 2000. Predicting Financial Distress of Companies: Revisiting the Z-Score and Zeta Models. Working Paper. New York, NY, USA: New York University. [Google Scholar]
  3. Altman, Edward I. 2003. The Use of Credit Scoring Models and the Importance of a Credit Culture; Stern School of Business, New York University. Available online: http://pages.stern.nyu.edu/~ealtman/3-%20CopCrScoringModels.pdf (accessed on 25 January 2020).
  4. Altman, Edward I., and Edith Hotchkiss. 2006. Corporate Financial Distress and Bankruptcy: Predict and Avoid Bankruptcy, Analyze and Invest in Distressed Debt. Hoboken: John Wiley & Sons, 368p. [Google Scholar]
  5. Balcaen, Sofie, and Hubert Ooghe. 2004. 35 Years of Studies on Business Failure: An Overview of the Classical Statistical Methodologies and their Related Problems. Working paper. Ghent, Belgium: Universiteit Gent, 56p. [Google Scholar]
  6. Baran, Dušan. 2007. System approach to the stated policy of controlling the company. Ekonomicko-Manažerské Spektrum 1: 2–9. [Google Scholar]
  7. Becerra, Victor Manuel, Roberto Kawakami Harrop Galvao, and Magda Abou-Seada. 2002. On the utility of input selection and pruning for financial distress prediction models. Paper presented at the 2002 International Joint Conference on Neural Networks, Honolulu, HI, USA, May 12–17; pp. 1328–33. [Google Scholar] [CrossRef]
  8. Bishop, Christopher M. 1995. Neural Networks for Pattern Recognition. New York: Oxford University Press. [Google Scholar]
  9. Boguslauskas, Vytautas, and Ruta Adlyte. 2010. Evaluation of criteria for the classification of enterprises. Inzinerine Ekonomika-Engineering Economics 21: 119–27. [Google Scholar]
  10. Burges, Christopher J. C. 1998. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2: 121–67. [Google Scholar] [CrossRef]
  11. Chen, Bo-Tsuen, and Mu-Yen Chen. 2011. Applying particles swarm optimization for Support Vector Machines on predicting company financial crisis. Paper presented at International Conference on Business and Economics Research, Kuala Lumpur, Malaysia, November 26–28; pp. 301–5. [Google Scholar]
  12. Dorneanu, Liliana, Mircea Untaru, Doina Darvasi, Vasile Rotarescu, and Cernescu Lavinia. 2011. Using artificial neural networks in financial optimization. Paper presented at International Conference on Business Administration, Puerto Morelos, Mexico, January 29–30; pp. 93–96. [Google Scholar]
  13. Enke, David, and Suraphan Thawornwong. 2005. The use of data mining and neural networks for forecasting stock market returns. Expert Systems with Applications 29: 927–40. [Google Scholar] [CrossRef]
  14. Erdogan, Birsen Eygi. 2013. Prediction of bankruptcy using Support Vector Machines: An application to bank bankruptcy. Journal of Statistical Computation and Simulation 83: 1543–55. [Google Scholar] [CrossRef]
  15. Gestel, Tony Van, Bart Baesens, Johan A. K. Suykens, Dirk Van den Poel, Dirk Emma Baestaens, and Marleen Willekens. 2006. Bayesian Kernel based classification for financial distress detection. European Journal of Operational Research 172: 979–1003. [Google Scholar] [CrossRef]
  16. Hafiz, Alaka, Oyedele Lukumon, Bilal Muhammad, Akinade Olugbenga, Owolabi Hakeem, and Ajayi Saheed. 2015. Bankruptcy prediction of construction businesses: Towards a big data analytics approach. Paper presented at 2015 IEEE 1st International Conference on Big Data Computing Service and Applications, BigDataService 2015, San Francisco, CA, USA, March 30–April 3; pp. 347–52. [Google Scholar] [CrossRef]
  17. Härdle, Wolfgang, Yuh-Jye Lee, Dorothea Schäfer, and Yi-Ren Yeh. 2009. Variable selection and oversampling in the use of smooth Support Vector Machines for predicting the default risk of companies. Journal of Forecasting 25: 512–34. [Google Scholar] [CrossRef]
  18. Kiaupaite-Grushniene, Vaiva. 2016. Altman Z-Score model for bankruptcy forecasting of the listed Lithuanian agricultural companies. Paper presented at 5th International Conference on Accounting, Auditing, and Taxation, Tallinn, Estonia, December 8–9; pp. 222–16. [Google Scholar] [CrossRef] [Green Version]
  19. Kim, Soo Y. 2011. Prediction of hotel bankruptcy using Support Vector Machine, artificial neural network, logistic regression, and multivariate discriminant analysis. Service Industries Journal 31: 441–68. [Google Scholar] [CrossRef]
  20. Kim, Sungdo, Byeong Min Mun, and Suk Joo Bae. 2018. Data depth based support vector machines for predicting corporate bankruptcy. Applied Intelligence 48: 791–804. [Google Scholar] [CrossRef]
  21. Klieštik, Tomáš. 2013. Models of Autoregression Conditional Heteroskedasticity Garch and Arch as a tool for modeling the volatility of financial time series. Ekonomicko-Manažerské Spektrum 7: 2–10. [Google Scholar]
  22. Kralicek, Peter. 1993. Základy Finančního Hospodaření [Basics of Financial Management]. Prague: Linde, 110p. [Google Scholar]
  23. Krulický, Tomáš. 2019. Using Kohonen networks in the analysis of transport companies in the Czech Republic. Paper presented at SHS Web of Conferences: Innovative Economic Symposium 2018—Milestones and Trends of World Economy, Beijing, China, November 8–9. article number 01010. [Google Scholar]
  24. López Iturriaga, Félix J., and Iván Pastor Sanz. 2015. Bankruptcy visualization and prediction using neural networks: A study of U.S. commercial banks. Expert Systems with Applications 42: 2857–69. [Google Scholar] [CrossRef]
  25. Lu, Yang, Nianyin Zeng, Xiaohui Liu, and Shujuan Yi. 2015. A new hybrid algorithm for bankruptcy prediction using switching particle swarm optimization and Support Vector Machines. Discrete Dynamics in Nature and Society 2015: 1–7. [Google Scholar] [CrossRef] [Green Version]
  26. Machová, Veronika, and Marek Vochozka. 2019. Analysis of business companies based on artificial neural networks. Paper presented at SHS Web of Conferences: Innovative Economic Symposium 2018—Milestones and Trends of World Economy, Beijing, China, November 8–9. article number 01013. [Google Scholar]
  27. Mousavi, Mohammad M., Jamal Ouenniche, and Bing Xu. 2015. Performance evaluation of bankruptcy prediction models: An orientation-free super-efficiency DEA-based framework. International Review of Financial Analysis 42: 64–75. [Google Scholar] [CrossRef] [Green Version]
  28. Mulačová, Věra. 2012. The financial and economic crisis and SMEs. Littera Scripta 5: 95–103. [Google Scholar]
  29. Neumaierová, Inka, and Ivan Neumaier. 2005. Index IN05. In Evropské Finanční Systémy [European Financial Systems]. Edited by Petr Červinek. Brno: Masaryk University, pp. 143–48. [Google Scholar]
  30. Neumaierová, Inka, and Ivan Neumaier. 2008. Proč se ujal index IN a nikoli pyramidový systém ukazatelů INFA [Why took the IN index and not the pyramid system of INFA indicators]. Ekonomika a management 2: 1–10. [Google Scholar]
  31. Pao, Hsiao Tien. 2008. A comparison of neural network and multiple regression analysis in modeling capital structure. Expert Systems with Applications 35: 720–27. [Google Scholar] [CrossRef]
  32. Park, Soo Seon, and Murat Hancer. 2012. A comparative study of logit and artificial neural networks in predicting bankruptcy in the hospitality industry. Tourism Economics 18: 311–38. [Google Scholar] [CrossRef] [Green Version]
  33. Pollak, Harry. 2003. Jak Obnovit Životaschopnost Upadajících Podniků [How to Restore the Viability of Failure Businesses]. Prague: C. H. Beck, 122p. [Google Scholar]
  34. Purvinis, Ojaras, Povilas Šukys, and Ruta Virbickaité. 2005. Research of possibility of bankruptcy diagnostics applying neural network. Inzinerine Ekonomika-Engineering Economics 41: 16–22. [Google Scholar]
  35. Rybárová, Daniela, Mária Braunová, and Lucia Jantošová. 2016. Analysis of the construction industry in the Slovak Republic by bankruptcy model. Procedia—Social and Behavioral Sciences 230: 298–306. [Google Scholar] [CrossRef] [Green Version]
  36. Sayadi, Ahmad Reza, Seyyed Mohammad Tavassoli, Masoud Monjezi, and Mohammad Rezaei. 2014. Application of neural networks to predict net present value in mining projects. Arabian Journal of Geosciences 7: 1067–72. [Google Scholar] [CrossRef]
  37. Shin, Kyung-Shik, Taik Soo Lee, and Hyun-Jung Kim. 2005. An application of Support Vector Machines in bankruptcy prediction model. Expert Systems with Applications 28: 127–35. [Google Scholar] [CrossRef]
  38. Taffler, Richard J. 1983. The assessment of company solvency and performance using a statistical model—A comparative UK-based study. Accounting and Business Research 13: 295–308. [Google Scholar] [CrossRef]
  39. Taffler, Richard J., and Howard Tisshaw. 1977. Going, going, gone—Four factors which predict. Accountancy 88: 50–54. [Google Scholar]
  40. Tian, Yingjie, Yong Shi, and Xiaohui Liu. 2012. Recent advances on support vector machines research. Technological and Economic Development of Economy 18: 5–33. [Google Scholar] [CrossRef] [Green Version]
  41. Vapnik, Vladimir N. 1995. The Nature of Statistical Learning Theory. New York: Springer. [Google Scholar]
  42. Vochozka, Marek. 2010. Development of methods for comprehensive evaluation of business performance. Politická Ekonomie 58: 675–88. [Google Scholar] [CrossRef] [Green Version]
  43. Vochozka, Marek. 2017. Effect of the economic outturn on the cost of debt of an industrial enterprise. Paper presented at SHS Web of Conferences: Innovative Economic Symposium 2017—Strategic Partnership in International Trade, České Budějovice, Czech Republic, October 19. article number 01028. [Google Scholar]
  44. Vochozka, Marek, and Veronika Machová. 2018. Determination of value drivers for transport companies in the Czech Republic. Nase More 65: 197–201. [Google Scholar] [CrossRef] [Green Version]
  45. Vochozka, Marek, and Penfei Sheng. 2016. The application of artificial neural networks on the prediction of the future financial development of transport companies. Communications: Scientific Letters of the University of Žilina 18: 62–67. [Google Scholar]
  46. Vochozka, Marek, Jan Jelínek, Jan Váchal, Jarmila Straková, and Vojtěch Stehel. 2017. Využití Neuronových sítí při Komplexním Hodnocení Podniků [Use of Neural Networks in Complex Business Evaluation]. Prague: C. H. Beck, 234p. [Google Scholar]
  47. Xu, Xiao-Si, Ying Chen, and Ruo-En Ren. 2006. Studying on forecasting the enterprise bankruptcy based on SVM. Paper presented at the 2006 International Conference on Management Science & Engineering, Lille, France, October 5–7; pp. 1041–45. [Google Scholar]
  48. Xu, Wei, Hongyong Fu, and Yuchen Pan. 2019. A novel soft ensemble model for financial distress prediction with different sample sized. Mathematical Problems in Engineering 2019: 1–12. [Google Scholar] [CrossRef] [Green Version]
  49. Zheng, Qin, and Yanhui Jiang. 2007. Financial distress prediction based on decision tree models. Paper presented at the 2007 International Conference on Service Operations and Logistics, and Informatics, Philadelphia, PA, USA, August 27–29; pp. 426–31. [Google Scholar]
Figure 1. Threshold operating characteristics of neural network classification. Source: own construction.
Figure 1. Threshold operating characteristics of neural network classification. Source: own construction.
Jrfm 13 00060 g001
Table 1. Selected financial data of data sample.
Table 1. Selected financial data of data sample.
Active Companies
Financial Data20132014201520162017Total
Total assets113,590.43112,398.8972,359.0692,463.05102,843.1491,228.91
Fixed assets51,794.6448,418.1632,899.6541,244.6049,662.1140,808.34
Current assets61,093.9963,352.8638,750.2650,275.4252,550.7349,762.88
Liabilities in total113,590.43112,398.8972,359.0692,358.40102,843.1491,228.91
Equity51,663.0553,660.9539,599.7442,971.9959,471.7244,077.03
Borrowed capital61,076.0058,079.5832,437.8346,616.2242,632.0146,275.94
Operating result1574.1014,159.144604.237104.3310,576.956263.25
Economic result for accounting period1282.1111,387.143168.165231.829325.244916.57
Companies in Liquidation
Financial Data20132014201520162017Total
Total assets22,033.5921,401.3320,401.5314,201.2010,273.0977,297.73
Fixed assets6307.666768.545231.995439.591481.1233,904.17
Current assets15,615.9414,447.0315,116.568639.678670.9042,801.79
Liabilities in total22,033.2321,390.7720,400.4214,201.2010,273.0977,254.27
Equity5454.035998.587499.031768.262140.7637,917.64
Borrowed capital16,453.5615,338.0212,813.4712,382.058064.8738,566.51
Operating result−1791.99−166.49−1219.85214.53284.297107.70
Economic result for accounting period−1910.42−151.22−1492.21116.79141.065516.93
Note: all data in the Table are given in thousands of CZK. Source: own construction.
Table 2. Activation functions of MLP and RBF hidden and output layer.
Table 2. Activation functions of MLP and RBF hidden and output layer.
FunctionDefinitionRange
Identitya ( ; + )
Logistic sigmoid 1 1 + e a (0;1)
Hyperbolic tangent e a e a e a + e a (−1;+1)
Exponential e a ( 0 ; + )
Sine sin ( a ) [ 0 ; 1 ]
Softmax exp ( a i ) exp ( a i ) [ 0 ; 1 ]
Gaussian 1 2 π σ exp [ ( x μ ) 2 2 σ 2 ]
Source: own construction.
Table 3. SVM model prediction status.
Table 3. SVM model prediction status.
Status—Active CompanyStatus—In liquidationStatus—All
Total460615826188
Correct45781304708
Incorrect2814521480
Correct (%)99.398.2276.08
Incorrect (%)0.6191.7823.92
Source: own construction.
Table 4. Retained neural networks.
Table 4. Retained neural networks.
Statistics12345
Network nameMLP 22-6-2MLP 22-9-2MLP 22-12-2MLP 22-8-2MLP 22-12-2
Training performance81.4635383.0101682.225382.4099783.05633
Testing performance80.3879381.8965581.0344881.2581.14224
Validation performance81.3577682.6508683.4051782.6508683.40517
Training algorithmBFGS 170BFGS 332BFGS 56BFGS 110BFGS 220
Error functionEntropyEntropySOSEntropyEntropy
Hidden activation func.TanhTanhIdentityLogisticTanh
Output activation func.SoftmaxSoftmaxLogisticSoftmaxSoftmax
Source: own construction.
Table 5. Predictions of artificial neural networks.
Table 5. Predictions of artificial neural networks.
NetworkStatisticsStatus—Active CompanyStatus—In liquidationStatus—All
1.MLP 22-6-2Total460615826188
Correct42268045030
Incorrect3807781158
Correct (%)91.7550.8281.29
Incorrect (%)8.2549.1818.71
2.MLP 22-9-2Total460615826188
Correct42348895123
Incorrect3726931065
Correct (%)91.9256.2082.79
Incorrect (%)8.0843.8117.21
3.MLP 22-12-2Total460615826188
Correct43157735088
Incorrect2918091100
Correct (%)93.6848.8682.22
Incorrect (%)6.3251.1417.78
4.MLP 22-8-2Total460615826188
Correct43207715091
Incorrect2868111097
Correct (%)93.7948.7482.27
Incorrect (%)6.2151.2617.73
5.MLP 22-12-2Total460615826188
Correct42528735125
Incorrect3547091063
Correct (%)92.3155.1882.82
Incorrect (%)7.6944.8217.18
Source: own construction.
Table 6. Sensitivity analysis.
Table 6. Sensitivity analysis.
Variables1.MLP 22-6-22.MLP 22-9-23.MLP 22-12-24.MLP 22-8-25.MLP 22-12-2Average
OHANIM1.3077368.2988301.6237721.1975491.1432862.714235
PRIDHODN1.3022444.3954801.5846672.3391572.7485092.474011
VS1.3190402.6633961.6023473.7425762.8871392.442900
HVML1.2692372.1252921.5200031.5171793.1735111.921044
MZDN1.2947992.5632371.5619021.4942942.2316951.829185
OBEZNAA1.2744242.7371141.6278301.2094182.1233381.794425
SPMAAEN1.3249182.1467511.2957591.2665272.5990451.726600
STALAA1.1739152.1537401.2311611.0384802.5726541.633990
Z1.2894842.0955271.4940671.1156241.5855071.516042
V1.3159652.0924711.6083081.1132331.1466861.455333
FTZZ1.2787201.3791551.5396601.6687091.3895351.451156
CIZIZDROJE1.5273381.2694881.8535731.0730451.4872081.442131
SLUZBY1.4226731.3352202.0024301.1276641.2042121.418440
FM1.0762571.4189781.6017511.1857851.5250211.361559
HVUO1.2981081.4597861.4540341.2191131.3505171.356312
KZ1.2582291.4419231.2049711.3263701.3348371.313266
HIM1.0957011.9047641.0045511.3280131.2286241.312330
VLASTNIJM1.2888971.3381261.5266781.2256781.1609831.308072
KP1.2804381.5811551.3378261.0343921.1961511.285992
NNSOCZAB1.0161641.9912511.0586401.0601631.2843831.282120
AKTIVACELK1.2743101.4523141.1545541.0145831.3884331.256839
PASIVACELK1.2743341.4464611.1543201.0146631.3682531.251606
Source: own construction.

Share and Cite

MDPI and ACS Style

Horak, J.; Vrbka, J.; Suler, P. Support Vector Machine Methods and Artificial Neural Networks Used for the Development of Bankruptcy Prediction Models and their Comparison. J. Risk Financial Manag. 2020, 13, 60. https://doi.org/10.3390/jrfm13030060

AMA Style

Horak J, Vrbka J, Suler P. Support Vector Machine Methods and Artificial Neural Networks Used for the Development of Bankruptcy Prediction Models and their Comparison. Journal of Risk and Financial Management. 2020; 13(3):60. https://doi.org/10.3390/jrfm13030060

Chicago/Turabian Style

Horak, Jakub, Jaromir Vrbka, and Petr Suler. 2020. "Support Vector Machine Methods and Artificial Neural Networks Used for the Development of Bankruptcy Prediction Models and their Comparison" Journal of Risk and Financial Management 13, no. 3: 60. https://doi.org/10.3390/jrfm13030060

Article Metrics

Back to TopTop