Next Article in Journal
The Binomial Distribution: Historical Origin and Evolution of Its Problem Situations
Next Article in Special Issue
Simulating Portfolio Decisions under Uncertainty When the Risky Asset and Short Rate Are Modulated by an Inhomogeneous and Asset-Dependent Markov Chain
Previous Article in Journal
Investigating the Effectiveness of Government Public Health Systems against COVID-19 by Hybrid MCDM Approaches
Previous Article in Special Issue
Bonferroni Weighted Logarithmic Averaging Distance Operator Applied to Investment Selection Decision Making
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel IBA-DE Hybrid Approach for Modeling Sovereign Credit Ratings

by
Srđan Jelinek
1,
Pavle Milošević
2,*,
Aleksandar Rakićević
2,
Ana Poledica
2 and
Bratislav Petrović
2
1
FIS, Bulevar Milutina Milankovića 136b, 11000 Belgrade, Serbia
2
Faculty of Organizational Sciences, University of Belgrade, Jove Ilića 154, 11000 Belgrade, Serbia
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(15), 2679; https://doi.org/10.3390/math10152679
Submission received: 23 June 2022 / Revised: 18 July 2022 / Accepted: 27 July 2022 / Published: 29 July 2022

Abstract

:
Nowadays, the sovereign credit rating is not only an index of a country’s economic performance and political stability but also an overall indicator of development and growth, as well as the trust factor that is associated with the country. Due to its importance, the vast amount of available information, and the lack of a closed-form solution, prediction models based on machine learning (ML) and computation intelligence (CI) techniques are being increasingly used to complement traditional financial approaches. In this paper, we aim to introduce a novel ML-CI approach for sovereign credit rating prediction based on a differential evolution (DE) algorithm and interpolative Boolean algebra (IBA). In fact, the proposed approach is based on a pseudo-logical function in the IBA framework derived from the historical data of publicly available indicators using the DE algorithm. Such functions are easily interpreted and enable a subtle gradation among countries. It is shown that the IBA-DE approach outperforms back-propagation neural networks on the observed problem while also providing a deeper insight into each of the indicators used for prediction and its respective influence on the prediction rating on the other.

1. Introduction

Credit risk is a risk that rises from the possibility of a borrower failing to repay a loan or required payments, or meeting its contractual obligations, and is determined by a borrower’s ability to repay a loan according to its original terms [1]. Credit rating is a quantified assessment of credit risk and is therefore used to evaluate the credit risk of an individual, corporation, or state [2]. It can be determined and calculated by any entity using the internal and publicly available data and models, or, more commonly, by a credit rating agency [3]. The credit rating agency specializes in calculating credit ratings using its internal models and both publicly available and non-public information. A sovereign credit rating is a credit rating assigned to a country. Due to a large amount of available information and the importance and implications that a sovereign credit rating has on a country due to quantifying its financial, industrial, social, and political stability [4], a sovereign credit rating is usually determined by the trustworthy credit rating agencies which are overseen and regulated [5]. The three biggest agencies that control nearly 95% of the credit rating business are Moody’s Investors Service (Moody’s), Standard and Poor’s (S&P), and Fitch Ratings (Fitch).
One of the major concerns about credit ratings is the lack of transparent models used in order to determine the ratings. In today’s financial world where most of the basic instruments and derivatives such as futures, forwards, swaps, and options have explicit and verified pricing models, it is a considerable drawback not to have a similar publicly available transparent and closed-form solution for credit ratings. Therefore, this can lead to a violation of a non-arbitrage principle and cause serious economic issues for a country and its debtors and creditors if its rating is incorrectly determined. The non-transparency of the models used by credit rating agencies has resulted in many internally developed models for credit rating prediction by both other participants in the market as well as researchers for academic purposes.
Custom credit rating approaches and models can be utilized for the validation and deeper understanding of the models implemented by credit rating agencies [3,6]. Even though these agencies are supervised [5,7], it is inadvisable that the only publicly recognized credit rating models are essentially black boxes. Having several independently developed models reduces the potential information incompleteness and ensures a no-arbitrage principle in the credit risk market. Further, the benchmarks used for the training of the custom models are mostly taken from the published reports of the credit rating agencies [8]. Therefore, the obtained results emulate ratings of the credit rating agencies and usually cannot be as precise as theirs. On the other hand, the trained models can be used for forecasting ratings of the companies and entities not listed on the stock exchange [9].
In the 1990s, the majority of custom-made credit rating approaches were based on traditional financial modeling and econometrics. The first methods presented in the literature utilized either linear [10] or multiple regression [11], followed by models based on ordered logistics and probit [12,13,14] regression, as well as their generalized version G-logit [15]. These models were also compared against each other [16], as well as against other methods, such as case-based reasoning [17] and partially ordered set [18], where no best model could be clearly determined. Even though these models were easy to implement and understand, some of the most common drawbacks of these models are underfitting, inability to fully capture the nonlinear nature of the problem, and the assumptions necessary for methods to apply to the desired problems, which then led to the more frequent application of the machine learning (ML) and computation intelligence (CI) methods. Given the complexity of the problem [10,13,19], the fact that models used by credit rating agencies are non-transparent, and the rising popularity of the ML and CI over the last decade, these methodologies were a natural choice for deriving custom forecasting models that use publicly available microeconomic and macroeconomic indicators [11,20,21,22]. Nowadays, the majority of the custom-made models for credit rating forecasting are based on ML and CI techniques or as a hybridization of several methods. Most of the models built this way are based on an input-output logic, meaning that the model is provided with the chosen inputs and desired outputs, and it is expected to form a relation between the two in the training process in order to later apply that relation on newly provided inputs to predict the desired outcome. The model construction can therefore be split into several steps, for example, choosing the financial or economic dataset and time frame, choosing the appropriate forecasting technique, model training, and model testing. The main idea of using the machine learning techniques for credit rating analysis and forecasting, such as neural networks (NN), is used both solely [8,23,24] and in hybrid models with a genetic algorithm [25] and fuzzy logic [26,27]. There were also attempts on using unconventional NN models for credit rating forecasting such as emotional neural networks [28] and networks with unusual learning schemes [29]. Other ML techniques for credit rating forecasting include support vector machines (SVM) [30,31,32], genetic algorithms [33,34,35,36], and fuzzy logic [37,38,39]; all of these methods tend to improve the forecasting obtained by classic econometric methods. NNs are not only used for credit rating forecasting, but also in close fields, such as forecasting credit risk [40] and credit rating for bonds [41]. Each of the techniques mentioned has some disadvantages, e.g., the inability to interpret results for NNs, computational efficiency for genetic algorithms, the need for vast knowledge and expertise in order to use fuzzy logic to deal with a certain problem, etc. Therefore, ML techniques that can be used solely as predictors, such as NNs and support vector machines, which are later used in this paper as benchmark models, are combined with fuzzy logic and genetic algorithms in order to eliminate downsides and create superior hybrid models. Further, the drawback of several models proposed in the literature is that there are no clear conclusions whether they are better than their comparison models, as well as using too short data time frames with a time span of only several years, which makes the predicted results questionable in any other time period. Therefore, when the new model is introduced, it is vital that it clearly outperforms the target models in a longer time frame in order to claim its efficiency and performance improvement compared to the benchmark models.
In this paper, we propose the usage of Differential Evolution (DE) and interpolative Boolean algebra (IBA) to extract an aggregation function, logical by its nature, for forecasting sovereign credit ratings. DE is a biological-inspired optimization heuristic from a family of evolutionary computational algorithms [42]. In our approach, DE is used to detect patterns, interactions, and co-dependencies in historical data as a search algorithm. On the other hand, IBA is a consistent-valued realization of Boolean algebra [43] which preserves all the laws of the BA. Therefore, IBA provides a real-valued logic-based framework that is consistent with Boolean axioms. In the IBA-DE approach, logical and pseudo-logical IBA functions are employed as an aggregation function and further used for prediction. These functions have clear-cut meaning and they are easy to understand and analyze. Thus, the proposed approach is transparent and allows deeper insight into the observed problem. This approach can be divided into three main steps: selecting appropriate indicators and their transformation using IBA for inputs suitable for the DE algorithm; using the DE algorithm on the input data in order to obtain optimal IBA structure vectors; and applying IBA logical aggregation on the inputs and structure vectors to obtain the forecasted credit ratings.
This paper extends and continues the research started in [44], where the idea of using the DE algorithm for the optimization of a prediction model structure in the IBA framework is presented for the first time. In this paper, the IBA-DE algorithm is elaborated on in detail and used for sovereign credit rating prediction. The model is a pseudo-logical function that uses publicly available macroeconomic indicators to predict sovereign credit ratings. Compared to the research presented in [44], where only the financial inputs have been used, the inputs used in this paper are extended to encompass the industrial and social country aspects. The function which is obtained as a model output is easily analyzed and interpretable due to the fuzzy gradation of its elements. Additionally, the performance of the proposed approach exceeds back-propagation NNs.
The paper is organized in the following manner. In Section 2, we give a brief overview of the theoretical background, i.e., the two main methodologies used in the hybrid model, the DE algorithm and the IBA logical framework. The basic idea behind the IBA-DE approach and the main steps of the algorithms are presented in Section 3. Section 4 is devoted to the application of the IBA-DE approach to sovereign credit rating prediction. We present the two model realizations, along with their inputs, parameters, benefits, and limitations. The testing results are presented in Section 5, together with a parameter optimization process and a detailed interpretation of the results. The models’ performances are further compared with the neural networks. Finally, in Section 6, we outline the main conclusions and give ideas regarding model improvements and future work.

2. Theoretical Background

This section provides a summary of Differential Evolution and interpolative Boolean algebra as the two main components of a hybrid IBA-DE model. DE is an evolutionary optimization algorithm proposed in [42] and is used for finding the optimal vector by minimizing the objective function. On the other hand, IBA [43] is a real-valued [0, 1] realization of Boolean algebra which, unlike any other multivalued logic, satisfies all the laws of Boolean algebra (including two laws of complementarity).

2.1. Differential Evolution

DE, proposed by Storn and Price [42], is a well-known metaheuristic approach for solving nonlinear continuous optimization problems. It is flexible and easy to implement, and suitable for solving various problems that conventional optimization methods are not suitable for. The main steps of the DE algorithm are explained in the following.
DE algorithm starts with initialization in the first iteration when a set of m initial d-dimensional solutions (target vectors) are randomly or pseudo-randomly generated. The set size m, commonly referred to as population size, is usually predetermined and kept constant over time. After the initial generation is created, the iterative process begins so that each iteration consists of three steps: mutation, crossover, and selection.
Mutation is a process where every target vector from the population is altered in order to create the respective mutation vector. There are a vast number of mutation algorithms in the literature that differ by the crossover scheme, i.e., the number of difference vectors used to create a mutation vector, etc. One of the most used algorithms is DE/rand/1/bin, where the mutation vectors are generated as shown in the Equation (1):
M j , k + 1 = X r 1 , k + F ( X r 2 , k X r 3 , k )
where r 1 , r 2 and, r 3 are random indexes between 1 and d , with the exception of the index of the target vector that is being mutated. The indexes are also mutually exclusive to ensure the diversity of the structure vectors used in the mutation process, resulting in the m 4 condition for the population size. The parameter F is a mutation factor, which is a predefined real number that satisfies the 0 F 2 condition and is used to control the amplification of the vector difference.
The crossover phase takes place after the mutation and is used to increase the diversity between vectors in the generation. In this phase, a trial vector is generated for every target vector and its respective mutation vector. Binomial and exponential crossover are the two most commonly used strategies in practice. For every pair of target and mutation vector elements, a trial vector element is generated as shown in Equation (2).
c l j = { u l j ,   rand j , l ( 0 , 1 ) C r   or   l = rand ( 1 , m ) x l j ,   otherwise
where j = 1 , , m . The parameter C r is a crossover rate, which is a predefined real number that satisfies the condition: 0 C r 1 . The low value C r ensures a smaller diversity in the new generation (and vice versa) as the likelihood of choosing the element from the target vector is higher than from the mutant vector. The DE algorithm can be modified to allow variations of the C r value as the number of iterations increases. However, a constant value C r during the whole optimization process is appropriate for many optimization problems.
Selection is the last phase in the iterative process. For every target vector in the current generation, a decision is made whether to keep it in the next generation or to replace it with the respective trial vector. The decision is based on comparing fitness function values for the vectors, as shown in the Equation (3):
S k + 1 j = { C k j , f d e ( C k j ) f d e ( S k j ) S k j , f d e ( S k j ) > f d e ( C k j )
In case the structure and crossover vectors yield the same objective function values, a trial vector is chosen to ensure the higher diversity among the suboptimal solutions.
There are many variations and modifications of the classic DE algorithm applied to the different optimization problems, e.g., [45]. So far, there is very little research on the application of DE to credit rating modeling and forecasting [46,47].

2.2. Interpolative Boolean Algebra

Fuzzy logic is a mathematical framework for modeling uncertainty, inaccuracy, or missing information, which is often the case when dealing with real-world problems. The main advantage of fuzzy logic is being able to accurately model and explain problems with inaccurate or missing data, while one of the main downsides is not being in the Boolean frame, i.e., not obeying the laws of excluded middle and contradiction.
This led to the creation of interpolative Boolean algebra (IBA), defined by Radojević [43]. IBA is a consistent real-valued [0, 1] realization of the Boolean algebra which satisfies all the Boolean axioms and theorems, which can be applied to both logic and sets [48], as well as to relations [49]. IBA is a finite algebra consisting of two levels: symbolic and value. These levels are introduced via the principle of structural functionality, as opposed to the truth functionality principle, which states that the structure of every element in IBA is directly determined by the structure of its components [50]. In fact, this principle separates the symbolic from the value level and insists on structural transformation (calculating a structure) before the values are introduced.
On a symbolic level, an atom is defined as the simplest element in an IBA, and by definition, it cannot contain any element other than itself and 0 [48]. On the other hand, IBA attributes represent any real values such as height, color, inflation, and any other numerical or categorical variable. From n attributes, there can be formed 2 2 n elements (logical functions), out of which 2 n are atoms. The structure of any element in IBA is determined by a structure vector, S , of size n, whose elements can take values of 0 and 1 , depending on the presence of a certain atom in the element [51]. In fact, every element of Boolean algebra can be represented as a product of a corresponding structure vector and transposed vector of atoms (atomic vector). Therefore, in the realization of IBA, the main focus is on atoms rather than on attributes.
On the value level, values are introduced to the respective structures by mapping each element of the Boolean algebra into the corresponding generalized Boolean polynomial (GBP) [48]. The GBP is a polynomial whose elements are values of the attributes from the unit range and operators of standard + , , and generalized product , where the generalized product is defined as a function with a [ 0 , 1 ] × [ 0 , 1 ] [ 0 , 1 ] mapping [51] and is a t-norm subclass with the additional non-negativity condition. Analogous to the representation of any IBA element via the product of the corresponding structure vector and atomic vector, GBP can be represented as a product of the corresponding structure vector and transposed atomic vector values. This is one of the main postulates which will be used later in the paper during the credit rating modeling via the IBA approach.
Honoring all the Boolean axioms and theorems has resulted in several methodologies based on IBA. The most prominent one is a logical aggregation (LA), an aggregation procedure based on IBA [51]. The main steps of LA are data normalization followed by aggregation of the normalized values via logical function using GBP, or via a pseudo-logical function using pseudo-GBP. With the LA being implemented as the weighted sum of structure vectors in the case of mutually dependent attributes, it will be used later in the paper when credit ratings are derived from the predetermined attributes. LA has been used multiple times as a suitable tool for financial problems such as modeling [52], financial analysis [53,54], and credit score forecasting [55].

3. IBA-DE Approach

In this section, we introduce the IBA-DE hybrid approach. It is a logic-based ML approach that utilizes the extensive modeling benefits of the IBA framework and its presentation potential.
There have been a few attempts to hybridize the IBA framework with some optimization heuristic in order to learn a LA function from the data. In fact, a variable neighborhood search [56] and genetic algorithm [57] were used to optimize binary values of the IBA structure vector on different datasets, showing promising results. However, the full potential of the IBA framework is not utilized in these cases, i.e., the final model is a simple LA and not a pseudo-LA function.
In an IBA-DE approach, the aim is to obtain a structured vector of the pseudo-LA function as an output aggregation function that is more general compared to LA. IBA-DE is the first attempt to perform optimization of the structure vectors in the continuous space. The main idea is to use IBA to transform raw data points into suitable inputs for the DE algorithm, and then to use DE to find an optimal solution for the observed problem. Since the final model is a pseudo-LA function, IBA-DE may be considered a white-box model, i.e., it can be easily analyzed and interpreted.
Herein, we present the step-by-step implementation details and give pseudo code for non-vanilla parts of the hybrid algorithm.

3.1. The Main Steps of the IBA-DE Algorithm

The IBA-DE approach consists of four main steps:
  • Input preparation;
  • Model training;
  • Model testing;
  • Model interpretation.
A more detailed presentation of the algorithm is given in Figure 1.
Input preparation: The first step in the IBA-DE approach refers to the input preparation, and it consists of the normalization of training inputs and generating IBA atomic vectors. Since we are modeling and inferring in the IBA framework, all elements of input vectors must be valued on an [ 0 , 1 ] interval. Any normalization function can be used for this purpose. Still, classical min-max normalization along with some methods for handling outliers is a common choice. Data preparation for optimization is performed by generating and valuing IBA atomic vectors. Let V be the g-dimensional input vector and let A be the d-dimensional atomic vector. Each element of the input vector is an IBA attribute, while each element of the atomic vector is an IBA atom. Every element of the atomic vector represents one distinctive combination of attributes. It is represented as a product, i = 1 g y i , where y i { v i , 1 v i } and v i is a normalized value of an attribute. In Section 2.1, it was stated that the relationship between the dimensions of input and the atomic vector is d = 2 g , i.e., the number of atomic elements will exponentially increase with the increasing number of inputs. For instance, an atomic vector created from the two-dimensional input vector consists of four elements and it is shown in Equation (4).
[ v 1 , v 2 ] [ v 1 v 2 , v 1 ( 1 v 2 ) , ( 1 v 1 ) v 2 , ( 1 v 1 ) ( 1 v 2 ) ]
Therefore, for the dataset of m g-dimensional input vectors, the result of IBA transformation is a dataset of m d-dimensional atomic vectors. This is the input in the DE algorithm where an optimization process will take place. The pseudo-code for generating atomic vectors from the input vectors is given in Appendix A.
Model training: After the inputs are prepared in a manner suitable for DE optimization, the model training step may begin. The main goal of this phase is to obtain the optimal structure vector which will minimize the objective function, e.g., mean squared error (MSE) of prediction. In our case, elements of a structured vector are in the unit interval. Therefore, the pseudo-logical aggregation function [51] is obtained as a final output. This function is easy to interpret since it is basically a weighted sum of several logical functions. The DE algorithm used for optimization is the basic one with the steps described in Section 2.1. In addition to a typical stopping condition defined as a maximal number of iterations, we have included an early stopping, a standard machine learning mechanism to avoid model overfitting. Three DE control parameters may be considered as hyper-parameters important for model training: F , C r , and the population size. Their values should be assessed during the training phase in order to maximize the potential of the IBA-DE approach.
Model testing: The trained model is assessed and utilized in the model testing step of IBA-DE. A product of the atomic vector of an observed instance and the transposed optimal structure vector are the numerical values that are being forecasted, as explained in Section 3.2. All attributes should be scaled to a unit interval using the same normalization function as in the model training step. The final prediction may be validated by comparing the benchmarks, or re-evaluated if some input data changes in time.
Model interpretation: The final phase of the IBA-DE approach is the model interpretation step. The output of the DE algorithm is an optimal structure vector whose atoms are used to create the appropriate weight factors for each of the inputs. The weights show the impact of every input, compared to all other inputs, on the forecasted credit ratings. With this information, all inputs can be ranked and the ones with the lowest impact discarded or replaced with new ones. The model is therefore not only predicting the credit ratings but also giving the information necessary for further decision-making, and, therefore, cannot be considered a black box but rather an instrument that can be used not only for forecasting but also for the interpretation of the obtained results.

3.2. Benefits and Limitations

The IBA-DE hybrid algorithm is part of a general approach to utilize the IBA framework in a machine learning context. The final model is easy-to-understand, transparent, and universal since it utilizes a pseudo-LA function. Thus, the range of possible applications is very wide, i.e., any small-scale forecasting problem.
In our approach, the basic DE algorithm [42] is used, rather than some of its modifications. Many proposed hybrid DE algorithms are not easily reproduced due to various reasons. Even though the pseudo-code is provided for the majority of hybrid algorithms, the source code is not always available. Some algorithms are also created using overly complicated premises, where hybrid models are enhanced with additional techniques such as Adaptive Pursuit and Probability Matching [58]. This makes these models a bit difficult to understand and implement. Finally, many of the mentioned algorithms have not been tested on real-world problems but were rather made in order to improve general DE premises and/or to test them against other hybrid models in various yearly CEC DE competitions where different problems and hypothetical objective functions are defined [59,60]. These are all the reasons for not using any hybrid DE models which update control parameters and strategies during the iteration process.
The main drawback of the IBA approach is that it is not well suited for high-dimensional inputs due to the exponential growth of atomic vector size and therefore makes DE optimization slower. However, the trade-off for using IBA and the LA in the process is the interpretability of the forecasted results. Many machine learning techniques, such as NNs, are black-boxes and their results are not easily explainable. Thus, the interpretation of their results is a serious problem in many practical situations. This is evident in areas such as finance where a decision must be understood and explained before any action is taken. Therefore, the exponential increase in complexity with the linear increase in the inputs is a cost willingly paid for the interpretability of the end results and seeing how much each of the inputs influences the output. In addition, with the many dimension reduction methodologies, such as principal component analysis or different kinds of regression, it is possible to reduce the number of inputs and allow the IBA-DE model to provide results without losing valuable information in the process.

4. Forecasting Sovereign Credit Ratings with the IBA-DE Approach

In this section, we present the application of the IBA-DE approach for sovereign credit rating prediction. The approach was first used for credit rating prediction in [44]. However, the inputs used were from a very narrow domain and many aspects of the economy were not considered. Further, the parameters of the DE algorithm were obtained empirically, without sensitivity analysis or testing different combinations of parameters. In this paper, we extended the input space to include a broader domain of economic factors and also to take into consideration historical credit ratings. The latter is especially important for improving forecasting performance since the changes in credit ratings are considered rare events.
In the first part of this section, we discuss the input space and give an overview of the input variables. Input space was divided into two categories: historical credit ratings and macroeconomic factors. Three groups of macroeconomic factors were used in this study to take into account different aspects of a country’s economic strength and to discover those with the highest influence on credit ratings.
In the second part, we present two different forecasting approaches:
  • A single-aspect approach where different groups of macroeconomic factors (stability, activity, and social) are used separately to forecast sovereign credit ratings;
  • A multi-aspect approach that uses all groups of input variables to include a broader perspective in the prediction model;
These forecasting approaches were used to run two types of forecasts: with and without historical credit ratings. First, our goal was to run our models only with macroeconomic inputs to understand the influence of different factors on sovereign credit ratings. Second, we replaced the worst performing factor within our model with the historical credit ratings and ran the model again (train and test).

4.1. Inputs

Most studies that dealt with the credit rating determinants show that financial and economic indicators have the highest impact on the credit ratings [13,61]. Some of the factors which are highly correlated with the sovereign credit ratings are GNP, GDP, GNI, inflation, government income and debt, unemployment, and default history [13,62,63,64]. Indicators are usually selected based on their correlation to credit ratings data, or derived using some method such as a principal component analysis [65]. Some studies show that financial and economic factors are not the only ones to influence credit ratings [13,66]. Consequentially, unorthodox factors such as monetary policy credibility, government debt structure, financial sector depth, foreign currency dependence, and others are included in the recent studies [19]. In most papers, authors investigate how individual factors influence sovereign credit ratings, but do not consider the influence of groups of factors.
In this study, we used two types of inputs for our IBA-DE model: macroeconomic factors and historical credit rating. Macroeconomic factors are grouped into three groups: stability, activity, and social. Our idea was to enable our model’s broader perspective so that no individual group of factors would have a dominant influence on a credit rating. Each group consists of four indicators (Figure 2). Indicators were carefully selected based on two criteria: availability of data and their ability to outline a broader view of the aspect that characterizes the group. We have a limited number of inputs per group of four to keep the model as simple as possible and to avoid exponential growth in the size of IBA’s atomic vector size (explained in Section 3.2).
The first group of indicators represent economic stability determinants. These factors are frequently used by researchers and are proven to be the ones to impact a country’s credit ratings the most. In this paper, we have used the following indicators: GDP per capita, as one of the best indicators of a country’s economic strength and development; inflation of consumer prices, to include a measure of the quality of life of an average household; total reserves of a country, including gold, to represent the country’s ability to face unpredicted events and crises; and gross savings, as an indicator of country’s investment strength.
The second group of indicators represents economic activity determinants. When choosing these variables, our idea was to take into consideration the export, import, and credit activity of a country. The following indicators were used: ore and metal exports and fuel exports, as an indicator of a country’s strategic independence on energy and raw materials, due to increasing prices caused by supply chain bottlenecks, high inflation, and the latest geopolitical changes; domestic credit given to the private sector as an indicator of the country’s willingness to support its private sector; total merchandise imports, as an indicator of country’s dependence on import of goods.
The third group of factors provides an overview of the social aspects of a country’s economy. The following four indicators were selected as measures of a country’s social policies and overall quality of life: total unemployment, which shows how effectively a country deals with unemployment; life expectancy at birth, as one of the best indicators of the quality of life; population growth, which is an indicator of the potential for future economic growth; and expense, measuring the government’s spending.
All the above-mentioned input data were obtained from the World Bank (data.worldbank.org (accessed on 30 September 2020)). Our dataset covers 83 countries (listed in Appendix B) and 19 years of data, from 2000 to 2018. The countries were selected based on the availability of indicator data, to achieve as much variety as possible regarding the location and population size. Even so, there were some gaps (missing data) in the dataset due to such a long time range. These missing points are filled using linear interpolation and flat extrapolation.

4.2. Output

Sovereign credit ratings data were obtained from the Fitch Ratings (https://countryeconomy.com/ratings (accessed on 30 September 2020)) because they are publicly available for the countries used in this study and for a time span of 19 years. Given the similarity of the published ratings between the three top credit rating agencies, no information is lost by choosing any one of them. Fitch’s ratings are divided into categories (Table 1). We transferred credit rating categories into representative numerical values from the [0, 100] interval. The reason for using a numerical rather than a categorical variable was to increase precision and enable fuzzification of credit ratings. Using numerical values will cause this classification problem to transform into a forecasting problem. The representative values are assigned in a way to maximize the distance between the ratings within a category, and therefore to maximize the forecasting accuracy. Further, the fuzzification of credit ratings is done by adding representative intervals around the representative values (Table 1).
In the IBA-DE approach, credit ratings are forecasted as numerical values which are then sorted into numerical intervals and finally converted back to the appropriate credit rating categories. In addition, numerical outputs could be used for confidence interval extraction, based on the absolute difference between the forecasted values and the center of the credit rating’s representative intervals.

4.3. Forecasting Models

Using the hybrid IBA-DE approach, we built two types of forecasting models. First, we used three groups of input data separately, each group representing a different aspect of a country’s economic strength, to make separate (specialized) predictions of sovereign credit ratings. Second, we used all three groups of input data together to make broader-perspective predictions of sovereign credit ratings that take into account multiple aspects of the country’s economic strength.

4.3.1. Single-Aspect Model

Single-aspect models use only one group of input factors to predict sovereign credit ratings. They are meant for two reasons:
  • To investigate whether specific aspects of a country’s economic strength could produce satisfactory predictions and therefore reduce input space or substitute some group(s) of factors instead;
  • To provide the aggregated (multi-aspect) model with optimal inputs.
Input data were used to create IBA atomic vectors which are further used as input variables for the DE algorithm. Then, the DE algorithm optimizes the structure vector to minimize the objective function which is the mean square difference (MSE) between credit rating forecasts and targeted (real) numerical values. The initial population of structure vectors was generated in a random way. Credit rating forecasts, R i ( fcst ) , were obtained from quadruplets of input data by multiplying the corresponding atomic vector, A i , with the transposed optimal structure vector, S T , as shown in the Equation (5):
R i ( fcst ) = A i × S T

4.3.2. Multi-Aspect Model

The multi-aspect model represents an aggregated approach for credit rating prediction which considers different aspects of a country’s economic strength. This model uses the previously explained single-aspect models as components, more precisely, their optimal structure vectors. Therefore, to run this model, one must first run three separated (single-aspect) models, one for each group of input factors. The obtained vectors of the structure are linearly aggregated in a weighted sum to obtain multi-aspect predictions. The credit ratings are forecasted as shown in Equation (6):
R i ( fcst ) = A i ( sta ) × ( p ( sta ) S ( sta ) T ) + A i ( act ) × ( p ( act ) S ( act ) T ) + A i ( soc ) × ( p ( soc ) S ( soc ) T )
where p ( sta ) , p ( act ) and p ( soc ) represent weightings for single-aspect structure vectors. The objective function is the same as for the single-aspect models, while the optimization is now done to obtain the optimal vector of weights, rather than the optimal structure vector.

5. Experiment and Results

In this section, we test the two models proposed in Section 4.3. The models are tested against the benchmark ratings obtained from the Fitch credit rating agency. First, we describe the dataset used for this study. Further, we deal with parameter optimization and model training: analyzing the impact of the DE control parameters on the training process and obtaining optimal structure vectors on the training set. After model training, we aim to investigate the resulting IBA structure vectors to provide a deeper insight into the significance of inputs as well as their co-dependencies. Finally, the forecasting performance of IBA-DE models is compared with NNs, given that NNs are one the most used machine learning techniques for credit rating prediction.
As explained earlier, our models use three groups of macroeconomic inputs to take into account three different aspects of a country’s economic strength: stability, activity, and social aspect. Indicator values were calculated on an annual basis and normalized on the unit interval.

5.1. Data

Our dataset consists of 19 years of macroeconomic data for 83 world countries. There are three groups of input data with each group consisting of four macroeconomic indicators–in total, 4731 quadruplets of input data or 18,924 input observations. On the other side, there is only 1 output variable with 1577 observations.
We split the dataset into two subsets: training and testing. We used 17 years of data (2000–2016) to train our models, which counts as 16,932 input and 1411 output observations. The last two years (2017–2018) were used for testing purposes, yielding 1992 input and 166 output observations.
Therefore, we split our dataset on training and test sets using a higher than usual 90–10 split ratio. This could create a risk of overfitting for NNs, which are used as benchmark techniques for the proposed IBA-DE models. However, the risk of overfitting is relatively small due to slow changes in credit ratings over the years.

5.2. Hyper-Parameter Optimization and Training

As usual in machine learning, the model selection is performed in two phases. Bearing in mind that the IBA part of the IBA-DE approach is non-parametric, only the DE control parameter: F , C r , and the population size are considered as the model’s hyper-parameters. In the first phase, we aimed to assess the training convergence speed depending on the values of hyper-parameters and, subsequently, to determine their impact on the model output. The second phase was devoted to an appropriate training procedure in order to obtain the optimal structure vectors, avoiding model overfitting and biased predictions.
In the first phase of model selection, we employed a grid search by varying values of the control parameters F and C r between 0.1 and 0.9 with a 0.1 step. In addition, three values of the population size are considered: 10, 100, and 1000. Thusly, we performed 3 × 9 × 9 = 273 optimizations, i.e., model training processes, for each of three input sets: stability, activity, and social. This procedure was conducted on the training set, while the stopping criterion was defined as the fixed number of iterations, i.e., 300, in order to observe the model convergence over a longer period of time. Additional termination conditions are if the absolute difference between 100 consecutive objective functions is less than 10 4 , or if the objective function reaches 0. Due to the stochastic nature of DE, five simulations were conducted per combination of the control parameters with random starting points.
The DE algorithm has reached the same optimal solution for each parameter set in 97.5% of cases. In addition, the convergence paths with respect to the MSE presented in Figure 3 and Figure 4 are reasonably similar. However, in the other 2.5% cases, the algorithm became stuck in local minima that are rather close to the best result. Still, these results emphasize that the values of hyper-parameters F and C r are not crucial for the final results of the optimization. This can be seen in Figure 3, where no significant co-dependency can be seen between parameters’ values and model behavior. There is also no significant difference between model results for the population sizes of 100 and 1000, even though the latter converges a bit faster on average but not fast enough to justify a 10 times larger population size (see Figure 4). The model with a population size of 10 has slightly worse results, although for some control parameter combinations the model performs as well as the ones with higher population sizes. However, when inputs are amended with the historical credit ratings, the model with a population size of 10 achieves significantly worse results compared to the models using population sizes of 100 and 1000. Therefore, in our case, the recommendation is to use a medium population size when one of the inputs has a strong co-dependency with the output. Otherwise, the model can be defined as non-parametric. Finally, the results of the chaining forward cross-validation have supported this conclusion.
Therefore, we performed a model training with the following hyper-parameters values: F = 0.5 , C r = 0.5 , and the population size equal to 100. We employed an early stop stopping criterion in order to avoid model overfitting. The best results on the training data set, including the objective function values and the number of iterations necessary to achieve them, are presented in Table 2.
As can be seen from the table, the multi-aspect model performs slightly better than the single-aspect models, which is to be expected. The model using solely the activity inputs performs better than the models using stability and social inputs, while when the historical credit ratings are added, the differences between the models are negligible due to the high impact of the historical time series.
Finally, the main output of the optimization process is the IBA structure vector which is then used in generating forecasted credit ratings.

5.3. Test Results and Interpretation

In financial decision making, the transparency of models and interpretation of the data often play a crucial role. Sometimes, the most accurate models are not applied if they are not transparent or too complex for understanding. When introducing the IBA-DE approach we addressed this issue. Namely, IBA-DE is considered a white-box model, since its structure vector can be quickly analyzed to determine the significance of individual input factors and their co-dependencies [67]. The IBA structure vector consists of weights representing atoms’ importance in the model–the higher the weights, the higher the influence on predictions. In this section, we presented and discussed structure vectors of the two most accurate models (model based on macroeconomic stability inputs with and without the addition of historical credit ratings).
The four macroeconomic inputs used were GDP per capita (in1), consumer price inflation (in2), country’s total reserves (in3), and gross savings (in4). As a result of DE optimization, we obtained a 16-dimensional optimal structure vector of weights. Table 3 presents structure vectors for the two most accurate models achieved with and without using historical credit ratings as inputs.
Using these optimal structure vectors, we were able to derive weights for each input variable (Table 4). The weight equations were designed to take into consideration how atomic vectors were generated (in Appendix A). The resulting weights are similar in value when historical credit ratings were not used as input variables. However, when historical credit ratings were included as inputs instead of the first macroeconomic indicator, the resulting weights showed historical credit ratings have dominantly influenced model predictions. This is easy to explain since there is a very low probability the credit ratings will change during a year. Table 5 presents the test results obtained after parameter optimization was performed and optimal structure vectors were generated. These results were obtained with the best performing set of parameters.
Results showed clearly, as expected, that using historical credit ratings improves forecasting performance significantly. The best results when not using historical credit ratings were obtained for higher values of hyper-parameters, indicating the higher diversification of the population matters. Among single-aspect models, the best performing was the one that used macroeconomic stability indicators. Even though the IBA-DE multi-aspect approach performed the best on the training dataset, it yielded the worst results over the testing dataset. One possible explanation could be that introducing the multi-aspect model caused overfitting of IBA-DE since the multi-aspect model aggregates optimal structures of single-aspect models without optimizing aggregate structure. This is because it would be extremely time and resource consuming to optimize the structural vector for 12 inputs. The size of such vector would be 2 12 .
Finally, Table 6 presents the credit rating forecasts for each country for the best performing model, the IBA-DE single-aspect model with historical credit ratings, along with the real Fitch ratings. The overall hit rate was satisfactory with 79.52% of ratings correctly predicted.

5.4. Comparison with Neural Networks and Support Vector Machine Algorithms

A neural network is a well-known unsupervised machine learning algorithm often used for forecasting in finance and the economy. We have chosen NNs as a benchmark to compare against IBA-DE performance for several reasons: (a) NNs are one of the most popular and most used machine learning algorithms for credit rating prediction and bankruptcy prediction; (b) NNs can model both linear and nonlinear relations between the input data and do not require expert knowledge nor deeper understanding of the inputs; (c) NNs has many variations, which allow using different architectures to increase the confidence of the results; and (d) finally, the premise of updating the weights and biases of the inner neuron layers is simple and can be looked at as a basic optimization problem, similarly to the DE premise.
On the other hand, NNs can be prone to overfitting, which can be problematic when forecasting is done on a time series that has a high autocorrelation, such as sovereign credit ratings. Therefore, a support vector machine (SVM) algorithm is used as a second benchmark model to ensure no bias is present in the forecasted time series and to increase the validity of the IBA-DE forecasting results.
The neural network architecture used in this study is a feed-forward backpropagation NN due to its simplicity both for understanding and implementation. We employed seven different gradient descent algorithms for NN training: Levenberg–Marquardt (LM), Bayesian regularization (BR), resilient backpropagation algorithm (RB), Broyden–Fletcher–Goldfarb–Shanno (BFGS), gradient descent with momentum backpropagation (GDM), one step secant (OSS), and scaled conjugate gradient backpropagation (SCG). The NN contains one hidden layer with two neurons as there is no reason for implementing more since there are four macroeconomic inputs within each group and there is only one output. To avoid overfitting, we employed an early stopping criterion. To compare the IBA-DE with NN models, we tested both approaches on the same two datasets: first, we used only macroeconomic indicators as input variables; second, we used historical credit ratings together with macroeconomic indicators. The best training performance was achieved with BFG, LM, and RB algorithms (Table 7).
The three SVM kernel functions used in this paper are linear, polynomial, and Gaussian radial basis function (RBF). The training was also performed for the various values of the penalty parameter; the best performing models are shown in Table 8.
In Table 7 and Table 8, we present the best performing NN and SVM learning algorithms. It is important to notice that these results represent average training performances, making it possible some of the other algorithms could perform better on a specific time series. The best results per input groups are marked in bold.
Finally, in Table 9, we compare the test results of the IBA-DE model and the best performing NN and SVM models. In general, the single-aspect IBA-DE model based on macroeconomic stability indicators outperforms other models, including multi-aspect IBA-DE, NN, and SVM. These results have also indirectly confirmed that macroeconomic stability determinants have the highest impact on credit ratings, followed by activity determinants. It is important to emphasize that overfitting did not occur, since there were no significant deviations of MSEs on training and test sets for all algorithms, respectively. The best results per input groups are marked in bold.
Bearing in mind that NNs are the black-box models, these results additionally speak in favor of the IBA-DE model. Namely, the transparency of the solution obtained by the IBA-DE approach is a vast advantage compared to NN and SVM from the application aspect and decision making.

6. Conclusions

In this paper, we proposed a hybrid IBA-DE model for forecasting sovereign credit ratings. Since credit rating agencies do not disclose their models, this topic attracts machine learning researchers to extract data-driven models to help them better understand the main sources of credit risk. The proposed model is based on two approaches: interpolative Boolean algebra, used for input processing and interpretation of results, and differential evolution, used for optimization and forecasting. This is the very first attempt to utilize the full potential of IBA by learning pseudo-LA functions from the data. The idea of this paper was to use machine learning and computational intelligence to increase the transparency of credit ratings modeling as well as to improve forecasting performance.
Data used for this study was obtained from the publicly available domain and consisted of four groups of macroeconomic indicators: economic stability, economic activity, the social aspect of the economy, and historical credit ratings. The dataset consisted of 19 years of data, from 2000 to 2018. The IBA was used to model macroeconomic inputs and translate them into atomic vectors which were further used as inputs for the DE algorithm. The output of the DE algorithm was a structure vector, while the final output of the model was an easily interpretable pseudo-logical function in the form of a weighted sum of several logical functions. The structure vector was further used to determine the impact of each input and improve the model by excluding those factors with the smallest impact on the final output. The proposed IBA-DE approach was used to build two types of models, single-aspect and multi-aspect models. The single-aspect models are specialized, i.e., they use one group of macroeconomic indicators with the goal to optimize structure vectors. In contrast, the multi-aspect models use these optimal structure vectors as inputs to aggregate different aspects into one variable.
The main results emphasize the importance of economic stability indicators since the single-aspect model performed better than both the multi-aspect model and neural networks in almost every case. Additionally, unlike the neural networks, IBA-DE offers a possibility to analyze the impact of each indicator and interpret the obtained solution. Therefore, IBA-DE is particularly useful for financial experts and other decision makers.
Finally, combining IBA with DE was part of a general approach to utilize the IBA framework in a machine learning context. The idea was to expand the application of the IBA framework to other machine learning techniques for problems that require optimization over continuous space, as well as to try to identify the tools and methodologies used by credit rating agencies by comparing other machine learning techniques against the IBA-DE model and identifying the best performing ones. The DE algorithm can be further improved by applying some modifications to its hyper-parameters during the optimization process, as well as applying different strategies for the mutation, crossover, and selection processes. Finally, improvement can be made by addressing the high autocorrelation in the credit ratings time series by including the credit rating transition matrices modeled using the DE algorithm which were first presented in a recent study [68].

Author Contributions

Conceptualization, S.J., P.M. and B.P.; methodology, S.J. and P.M.; software, S.J.; validation, P.M., A.R. and A.P.; data curation, S.J.; writing—original draft preparation, S.J. and P.M.; writing—review and editing, A.R., A.P. and B.P.; visualization, S.J.; supervision, P.M. and B.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

All the data used in this paper are publicly available and obtainable from the following addresses: (1) input factors were obtained from the World Bank site: data.worldbank.org (accessed on 30 September 2020); (2) historical credit ratings were obtained from the https://countryeconomy.com/ratings (accessed on 30 September 2020) site.

Acknowledgments

The authors are thankful to the anonymous reviewers for their valuable remarks that improved the readability of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Pseudo code for generating atomic vector from a regular vector
GenerateAtomicVector(iVector)
{
           n = size(iVector); // number of input vector elements [a b ... z]
           cVector = zeros(2*n); // define complete vector
           // complete vector is of form [a 1-a b 1-b ... z 1-z]
           for (i=0,i<n,i++)
           {
                  cVector[2*i]=iVector[i];
                  cVector[2*i+1]=1-iVector[i];
           }
           cMatrix = zeros(2^n,n); // coefficient matrix
           for (i=1,i<n+1,i++)
           {
                  for (j=0,j<2^n,j++)
                    {
                      cMatrix[j,i-1]=mod(fix(j/(2^(n-i))), 2) + 2*i-1;
                      }
           }
           oVector = zeros(2^n); // define output vector
           hVector = zeros(n); // define helper vector
           for (i=0,i<2^n,i++)
           {
                    for (j=0,j<n,j++)
                      {
                      hVector[j] = cVector[cMatrix[i, j]];
                      }
           oVector[i] = prod(hVector); // multiply hVector elements
           }
           return oVector;
}

Appendix B

The total number of countries used in this paper is 83. The countries are grouped into four regions: Europe, North and South America, Asia, and Africa. There are 37 countries from Europe (Austria, Belgium, Bulgaria, Croatia, Cyprus, Czech Republic, Denmark, Estonia, Finland, France, Georgia, Germany, Greece, Hungary, Iceland, Ireland, Italy, Latvia, Lithuania, Luxemburg, Malta, Netherlands, North Macedonia, Norway, Poland, Portugal, Romania, Russian Federation, Serbia, Slovak Republic, Slovenia, Spain, Sweden, Switzerland, Turkey, Ukraine, and the United Kingdom), 15 from North and South America (Argentina, Bolivia, Brazil, Canada, Chile, Colombia, Costa Rica, Dominican Republic, El Salvador, Mexico, Panama, Paraguay, Peru, United States, and Uruguay), 16 from Asia (Australia, China, Hong Kong SAR, China, India, Indonesia, Israel, Japan, Kazakhstan, Korea, Rep., Malaysia, Mongolia, New Zealand, Philippines, Singapore, Sri Lanka, and Thailand), and 15 from Africa (Cameroon, Congo, Dem. Rep., Egypt, Arab Rep., Jamaica, Kenya, Morocco, Namibia, Nigeria, Rwanda, Saudi Arabia, Seychelles, South Africa, Tunisia, Uganda, and Zambia).

References

  1. Basel Committee on Banking Supervision. Principles for the Management of Credit Risk; Bank for International Settlements: Basel, Switzerland, 2020; Available online: https://www.bis.org/publ/bcbs75.pdf (accessed on 5 April 2022).
  2. Weber, R.H.; Darbellay, A. The regulatory use of credit ratings in bank capital requirement regulations. J. Bank. Regul. 2008, 10, 1–16. [Google Scholar] [CrossRef] [Green Version]
  3. Hill, P.; Brooks, R.; Faff, R.W. Variations in sovereign credit quality assessments across rating agencies. J. Bank. Financ. 2010, 34, 1327–1343. [Google Scholar] [CrossRef]
  4. Brooks, R.; Faff, R.W.; Hillier, D.; Hillier, J. The national market impact of sovereign rating changes. J. Bank. Financ. 2001, 28, 233–250. [Google Scholar] [CrossRef]
  5. Mathis, J.; McAndrews, J.; Rochet, J.-C. Rating the raters: Are reputation concerns powerful enough to discipline rating agencies? J. Monet. Econ. 2009, 56, 657–674. [Google Scholar] [CrossRef]
  6. Alsakka, R.; Ap Gwilym, O. Leads and lags in sovereign credit ratings. J. Bank. Financ. 2010, 34, 2614–2626. [Google Scholar] [CrossRef]
  7. Blaurock, U. Control and Responsibility of Credit Rating Agencies. J. Comp. Law 2007, 11, 341–356. [Google Scholar]
  8. Kumar, K.; Bhattacharya, S. Artificial neural network vs linear discriminant analysis in credit ratings forecast: A comparative study of prediction performances. Rev. Account. Financ. 2006, 5, 216–227. [Google Scholar] [CrossRef]
  9. Jiao, Y.; Syau, Y.-R.; Lee, E.S. Modelling credit rating by fuzzy adaptive network. Math. Comput. Model. 2007, 45, 717–731. [Google Scholar] [CrossRef]
  10. Afonso, A.; Gomes, P.; Rother, P. Short- and Long-run Determinants of Sovereign Debt Credit Ratings. Int. J. Financ. Econ. 2011, 16, 1–15. [Google Scholar] [CrossRef] [Green Version]
  11. Alexe, S.; Hammer, P.L.; Kogan, A.; Lejeune, M.A. A non-recursive regression model for country risk rating. RUTCOR-Rutgers Univ. Res. Rep. RRR 2003, 9, 1–40. [Google Scholar]
  12. Teker, D.; Pala, A.; Kent, O. Determination of Sovereign Rating: Factor Based Ordered Probit Models for Panel Data Analysis Modelling Framework. Int. J. Econ. Financ. Issues 2013, 3, 122–132. [Google Scholar]
  13. Bissoondoyal-Bheenick, E. An analysis of the determinants of sovereign ratings. Glob. Financ. J. 2005, 15, 251–280. [Google Scholar] [CrossRef]
  14. Afonso, A.; Gomes, P.; Rother, P. Ordered response models for sovereign debt ratings. Appl. Econ. Lett. 2009, 16, 769–773. [Google Scholar] [CrossRef] [Green Version]
  15. Oral, M.; Kettani, O.; Cosset, J.-C.; Daouas, M. An Estimation Model for Country Risk Rating. Int. J. Forecast. 1992, 8, 583–593. [Google Scholar] [CrossRef]
  16. Afonso, A. Understanding the Determinants of Sovereign Debt Ratings: Evidence of the Two Leading Agencies. J. Econ. Financ. 2003, 27, 56–74. [Google Scholar] [CrossRef]
  17. Bissoondoyal-Bheenick, E.; Brooks, R.; Yip, A.Y.N. Determinants of sovereign ratings: A comparison of case-based reasoning and ordered probit approaches. Glob. Financ. J. 2006, 17, 136–154. [Google Scholar] [CrossRef] [Green Version]
  18. Hammer, P.L.; Kogan, A.; Lejeune, M.A. Modelling country risk ratings using partial orders. Eur. J. Oper. Res. 2006, 175, 836–859. [Google Scholar] [CrossRef] [Green Version]
  19. Choy, S.Y.; Chit, M.M.; Teo, W.L. Sovereign credit ratings: Discovering unorthodox factors and variables. Glob. Financ. J. 2021, 48, 100548. [Google Scholar] [CrossRef]
  20. Yim, J.; Mitchell, H. Comparison of country risk models: Hybrid neural networks, logit models, discriminant analysis and cluster techniques. Expert Syst. Appl. 2005, 28, 137–148. [Google Scholar] [CrossRef]
  21. Tsai, C.-F.; Chen, M.-L. Credit rating by hybrid machine learning techniques. Appl. Soft Comput. 2010, 10, 374–380. [Google Scholar] [CrossRef]
  22. Wallis, M.; Kumar, K.; Gepp, A. Credit Rating Forecasting Using Machine Learning Techniques. In Managerial Perspectives on Intelligent Big Data Analytics; IGI Global: Hershey, PA, USA, 2019; pp. 180–198. [Google Scholar]
  23. Peng, H. Research on Credit Evaluation of Financial Enterprises Based on the Genetic Backpropagation Neural Network. Sci. Program. 2021, 2021, 7745920. [Google Scholar] [CrossRef]
  24. Li, X.; Sun, Y. Application of RBF neural network optimal segmentation algorithm in credit rating. Neural Comput. Appl. 2021, 33, 8227–8235. [Google Scholar] [CrossRef]
  25. Oreski, S.; Oreski, D.; Oreski, G. Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment. Expert Syst. Appl. 2012, 39, 12605–12617. [Google Scholar] [CrossRef]
  26. Abiyev, R.H. Credit Rating Using Type-2 Fuzzy Neural Networks. Math. Probl. Eng. 2014, 2014, 460916. [Google Scholar] [CrossRef] [Green Version]
  27. Piramuthu, S. Financial credit-risk evaluation with neural and neurofuzzy systems. Eur. J. Oper. Res. 1999, 112, 310–321. [Google Scholar] [CrossRef]
  28. Khashman, A. Credit risk evaluation using neural networks: Emotional versus conventional models. Appl. Soft Comput. 2011, 11, 5477–5484. [Google Scholar] [CrossRef]
  29. Khashman, A. Neural networks for credit risk evaluation: Investigation of different neural models and learning schemes. Expert Syst. Appl. 2010, 37, 6233–6239. [Google Scholar] [CrossRef]
  30. Lee, Y.-C. Application of support vector machines to corporate credit rating prediction. Expert Syst. Appl. 2007, 33, 67–74. [Google Scholar] [CrossRef]
  31. Luo, S.-T.; Cheng, B.-W.; Hsieh, C.-H. Prediction model building with clustering-launched classification and support vector machines in credit scoring. Expert Syst. Appl. 2009, 36, 7562–7566. [Google Scholar] [CrossRef]
  32. Xie, R.; Liu, R.; Liu, X.B.; Zhu, J.M. Evaluation of SMEs’ credit decision based on support vector machine-logistics regression. J. Math. 2021, 2021, 5541436. [Google Scholar] [CrossRef]
  33. Shin, K.; Han, I. Case-based reasoning supported by genetic algorithms for corporate bond rating. Expert Syst. Appl. 1999, 16, 85–95. [Google Scholar] [CrossRef]
  34. Ong, C.-S.; Huang, J.-J.; Tzeng, G.-H. Building credit scoring models using genetic programming. Expert Syst. Appl. 2005, 29, 41–47. [Google Scholar] [CrossRef]
  35. Chen, M.-C.; Huang, S.-H. Credit scoring and rejected instances reassigning through evolutionary computation techniques. Expert Syst. Appl. 2003, 24, 433–441. [Google Scholar] [CrossRef]
  36. Chi, B.-W.; Hsu, C.-C. A hybrid approach to integrate genetic algorithm into dual scoring model in enhancing the performance of credit scoring model. Expert Syst. Appl. 2012, 39, 2650–2661. [Google Scholar] [CrossRef]
  37. Syau, Y.-R.; Hsieh, H.-T.; Lee, E.S. Fuzzy Numbers in the Credit Rating of Enterprise Financial Condition. Rev. Quant. Financ. Account. 2001, 17, 351–360. [Google Scholar] [CrossRef]
  38. Liu, X.; Liu, W. Credit Rating Analysis with AFS Fuzzy Logic. In Proceedings of the International Conference on Natural Computation, Advances in Natural Computation, ICNC 2005, Changsha, China, 27–29 August 2005; pp. 1198–1204. [Google Scholar]
  39. Hájek, P. Credit rating analysis using adaptive fuzzy rule-based systems: An industry-specific approach. Cent. Eur. J. Oper. Res. 2012, 20, 421–434. [Google Scholar] [CrossRef]
  40. Yu, L.; Wang, S.; Lai, K.K. Credit risk assessment with a multistage neural network ensemble learning approach. Expert Syst. Appl. 2008, 34, 1434–1444. [Google Scholar] [CrossRef]
  41. Hájek, P. Municipal credit rating modelling by neural networks. Decis. Support Syst. 2011, 51, 108–118. [Google Scholar] [CrossRef]
  42. Storn, R.; Price, K. Differential Evolution—A Simple and Efficient Heuristic for global Optimization over Continuous Spaces. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  43. Radojević, D. (0, 1)—Valued logic: A natural generalization of Boolean logic. Yugosl. J. Oper. Res. 2000, 10, 185–216. [Google Scholar]
  44. Jelinek, S.; Milošević, P.; Rakićević, A.; Petrović, B. Forecasting Sovereign Credit Ratings Using Differential Evolution and Logic Aggregation in IBA Framework. In Proceedings of the International Conference on Intelligent and Fuzzy Systems, Istanbul, Turkey, 24–26 August 2021; pp. 506–513. [Google Scholar]
  45. Das, S.; Mullick, S.S.; Suganthan, P.N. Recent advances in differential evolution—An updated survey. Swarm Evol. Comput. 2016, 27, 1–30. [Google Scholar] [CrossRef]
  46. Krink, T.; Paterlini, S.; Resti, A. Using differential evolution to improve the accuracy of bank rating systems. Comput. Stat. Data Anal. 2007, 52, 68–87. [Google Scholar] [CrossRef]
  47. Zhang, J.; Sanderson, A.C. Adaptive Differential Evolution—A Robust Approach to Multimodal Problem Optimization; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  48. Radojević, D. Fuzzy Set Theory in Boolean Frame. Int. J. Comput. Commun. Control 2008, 3, 121–131. [Google Scholar]
  49. Radojević, D. Interpolative Relations and Interpolative Preference Structures. Yugosl. J. Oper. Res. 2005, 15, 171–189. [Google Scholar] [CrossRef]
  50. Radojević, D. Real-valued implication function based on real-valued realization of Boolean algebra. In Proceedings of the 11th Serbian-Hungarian Joint Symposium on Intelligent Systems, Subotica, Serbia, 26–28 September 2013; pp. 45–50. [Google Scholar]
  51. Radojević, D. Logical aggregation based on interpolative Boolean algebra. Mathw. Soft Comput. 2008, 15, 125–141. [Google Scholar]
  52. Rakićević, A.; Simeunović, V.; Petrović, B.; Milić, S. An Automated System for Stock Market Trading Based on Logical Clustering. Teh. Vjesn. 2018, 25, 970–978. [Google Scholar]
  53. Rakićević, A.; Milošević, P.; Petrović, B.; Radojević, D. DuPont Financial Ratio Analysis Using Logical Aggregation. In Soft Computing Applications; Springer: Berlin/Heidelberg, Germany, 2016; pp. 727–739. [Google Scholar]
  54. Milošević, P.; Petrović, B.; Dragović, I. A Novel Approach to Generalized Intuitionistic Fuzzy Sets Based on Interpolative Boolean Algebra. Mathematics 2021, 9, 2115. [Google Scholar] [CrossRef]
  55. Latinović, M.; Dragović, I.; Bogojević-Arsić, V.; Petrović, B. A Fuzzy Inference System for Credit Scoring using Boolean Consistent Fuzzy Logic. Int. J. Comput. Intell. Syst. 2018, 11, 414–427. [Google Scholar] [CrossRef] [Green Version]
  56. Milošević, P.; Poledica, A.; Dragović, I.; Rakićević, A.; Petrović, B. VNS for optimizing the structure of a logical function in IBA framework. In Proceedings of the Variable Neighborhood Search: 6th International Conference, ICVNS 2018, Sithonia, Greece, 4–7 October 2018. [Google Scholar]
  57. Rakićević, A. Adaptivni Fazi Sistem za Algoritamsko Trgovanje: Interpolativni Bulov Pristup. In Fakultet Organizacionih Nauka; Univerzitet u Beogradu: Beograd, Serbia, 2020. [Google Scholar]
  58. Gong, W.; Fialho, Á.; Cai, Z.; Li, H. Adaptive strategy selection in differential evolution for numerical optimization: An empirical study. Inf. Sci. 2011, 181, 5364–5386. [Google Scholar] [CrossRef]
  59. Liang, J.; Qu, B.; Suganthan, P. Problem Definitions and Evaluation Criteria for The Cec 2014 Special Session and Competition on Single Objective Real-Parameter Numerical Optimization; Zhengzhou University: Zhengzhou, China; Nanyang Technological Univiversity: Singapore, 2013. [Google Scholar]
  60. Liang, J.; Qu, B.; Suganthan, P.; Hernández-Díaz, A. Problem Definitions and Evaluation Criteria for the CEC 2013 Special Session on Real-Parameter Optimization; Computational Intelligence Laboratory, Zhengzhou University: Zhengzhou, China, 2013. [Google Scholar]
  61. Bozic, V.; Magazzino, C. Credit Rating Agencies: The Importance of Fundamentals in the Assessment of Sovereign Ratings. Econ. Anal. Policy 2013, 43, 157–176. [Google Scholar] [CrossRef]
  62. Erdem, O.; Varli, Y. Understanding the sovereign credit ratings of emerging markets. Emerg. Mark. Rev. 2014, 20, 42–57. [Google Scholar] [CrossRef]
  63. Afonso, A.; Gomes, P.; Rother, P. What ‘Hides’ Behind Sovereign Debt Ratings. Work. Pap. Ser. Eur. Cent. Bank. 2007, 711. [Google Scholar] [CrossRef]
  64. Cantor, R.; Packer, F. Determinants and Inputs of Sovereign Credit Ratings. FRBNY Econ. Policy Rev. 1996, 2, 37–53. [Google Scholar]
  65. Mellios, C.; Paget-Blanc, E. Which factors determine sovereign credit ratings? Eur. J. Financ. 2006, 12, 361–377. [Google Scholar] [CrossRef]
  66. Grunert, J.; Norden, L.; Weber, M. The role of non-financial factors in internal credit ratings. J. Bank. Financ. 2005, 29, 509–531. [Google Scholar] [CrossRef]
  67. Rakićević, A.; Milošević, P.; Poledica, A.; Dragović, I.; Petrović, B. Interpolative Boolean approach for fuzzy portfolio selection. In Applying Fuzzy Logic for the Digital Economy and Society; Meier, A., Portmann, E., Terán, L., Eds.; Springer: Cham, Switzerland, 2019; pp. 23–46. [Google Scholar]
  68. Jelinek, S.; Makajić-Nikolić, D.; Milošević, P. Sovereign Credit Rating Transition Matrix Estimation using Differential Evolution. In Proceedings of the XLVIII Simpozijum o operacionim istraživanjima, SYM-OP-IS 2021, Banja Koviljača, Srbija, 20–23 September 2021. [Google Scholar]
Figure 1. Main steps of the IBA-DE approach.
Figure 1. Main steps of the IBA-DE approach.
Mathematics 10 02679 g001
Figure 2. Model inputs.
Figure 2. Model inputs.
Mathematics 10 02679 g002
Figure 3. Convergence graph.
Figure 3. Convergence graph.
Mathematics 10 02679 g003
Figure 4. Convergence graph with population sizes.
Figure 4. Convergence graph with population sizes.
Mathematics 10 02679 g004
Table 1. Credit Rating Numerical Values.
Table 1. Credit Rating Numerical Values.
RatingRep. ValueRep. IntervalRatingRep. ValueRep. IntervalRatingRep. ValueRep. Interval
AAA AA+88(86.66, 90)A+78(76.66, 80)
100(91, 100)AA85(83.33, 86.66)A75(73.33, 76.66)
AA−82(80, 83.33)A−72(70, 73.33)
BBB+68(66.66, 70)BB+58(56.66, 60)B+48(46.66, 50)
BBB65(63.33, 66.66)BB55(53.33, 56.66)B45(43.33, 46.66)
BBB−62(60, 63.33)BB−52(50, 53.33)B−42(40, 43.33)
CCC+38(36.66, 40)CC28(25, 30)RD15(10, 20)
CCC35(33.33, 36.66)C22(20, 25)
CCC−32(30, 33.33)
D8(6.66, 10)
DD5(3.33, 6.66)
DDD2(0, 3.33)
Table 2. The best IBA-DE performance on the training set.
Table 2. The best IBA-DE performance on the training set.
SeriesNon-HistoricalHistorical
InputsStabilityActivitySocialMulti-AspectStabilityActivitySocialMulti-Aspect
MSE333.03320.63339.0431112.0111.9911.9211.82
Iterations18918919761671751745
Table 3. Optimal structure vectors.
Table 3. Optimal structure vectors.
Structure Vectors
Inputss1s2s3s4s5s6s7s8s9s10s11s12s13s14s15s16
Macro0.710.880.620.941.000.680.810.650.640.670.510.790.910.580.590.63
Macro + Hist. ratings1.000.991.000.961.000.960.991.000.010.000.030.050.030.010.000.02
Table 4. Weights/importance of specific input group.
Table 4. Weights/importance of specific input group.
Weightw1 (in1)w2 (in2)w3 (in3)w4 (in4)
Equation i = 1 8 s i / i = 1 16 s i i = 1 , 9 , j = i i + 3 s 1 / i = 1 16 s i i = 1 , 5 , 9 , 13 j = i i + 1 s 1 / i = 1 16 s i i = 1 i mod 2 = 1 16 s i / i = 1 16 s i
Macro0.5410.4950.5230.499
Macro + Hist. ratings0.9830.5020.4960.505
Table 5. Best IBA-DE forecasting performance.
Table 5. Best IBA-DE forecasting performance.
InputMacroMacro + Hist. Ratings
ModelStabilityActivitySocialMulti-AspectStabilityActivitySocialMulti-Aspect
MSE303.15317.79319.21324.376.526.537.347.22
Table 6. Forecasting results for the single-aspect stability model using historical credit ratings.
Table 6. Forecasting results for the single-aspect stability model using historical credit ratings.
CountryYearCredit RatingCountryYearCredit RatingCountryYearCredit Rating
RealForecastRealForecastRealForecast
Argentina2017BBHungary2017BBB−BBB−Peru2017A−A−
2018BB2018BBB−BBB−2018A−A−
Australia2017AAAAAAIceland2017AA−Philippines2017BBBBBB−
2018AAAAAA2018AA2018BBBBBB
Austria2017AA+AA+India2017BBB−BBB−Poland2017A−A
2018AA+AA+2018BBB−BBB−2018A−A−
Belgium2017AA−AAIndonesia2017BBB−BBB−Portugal2017BB+BB+
2018AA−AA−2018BBBBBB−2018BBBBB+
Bolivia2017BB−BB−Ireland2017AARomania2017BBB−BBB−
2018BB−BB−2018A+A2018BBB−BBB−
Brazil2017BB−BB−Israel2017A+A+Russia2017BBB−BBB−
2018BB−BB−2018A+A+2018BBB−BBB−
Bulgaria2017BBB−BBB−Italy2017BBBBBB+Rwanda2017B+B+
2018BBBBBB−2018BBBBBB2018B+B+
Cameroon2017BBJamaica2017BBSaudi Arabia2017A+AA−
2018BB2018BB2018A+A+
Canada2017AAAAAAJapan2017AASerbia2017BBBB−
2018AAAAAA2018AA2018BBBB
Chile2017A+AA−Kazakhstan2017BBBBBBSeychelles2017BB−BB−
2018A+A+2018BBBBBB2018BBBB−
China2017A+A+Kenya2017B+B+Singapore2017AAAAAA
2018A+A+2018B+B+2018AAAAAA
Colombia2017BBBBBB+Korea, Rep.2017AA−AA−Slovakia2017A+A+
2018BBBBBB2018AA−AA−2018A+A+
Congo, D. p.2017CCCCCCLatvia2017A−A−Slovenia2017A−BBB+
2018CCCCCC2018A−A−2018A−A−
Costa Rica2017BBBB+Lithuania2017A−A−South Africa2017BB+BBB−
2018BBBB2018A−A−2018BB+BB+
Croatia2017BBBBLuxemburg2017AAAAAASpain2017BBB+BBB+
2018BB+BB2018AAAAAA2018BBB+BBB+
Cyprus2017BB−BB−Malaysia2017A−A−Sri Lanka2017B+B+
2018BBB−BB−2018A−A−2018B+B+
Czech Rep.2017A+A+Malta2017A+ASweden2017AAAAAA
2018AA−A+2018A+A+2018AAAAAA
Denmark2017AAAAAAMexico2017BBB+BBB+Switzerland2017AAAAAA
2018AAAAAA2018BBB+BBB+2018AAAAAA
Dominican Rep.2017BB−BB−Mongolia2017B−BThailand2017BBB+BBB+
2018BB−BB−2018BB−2018BBB+BBB+
Egypt2017BBMorocco2017BBB−BBB−Tunisia2017B+BB
2018BB2018BBB−BBB−2018B+BB−
El Salvador2017CCCB+Namibia2017BBB−BBB−Turkey2017BBB−BBB−
2018B−CCC2018BB+BBB−2018BB+BBB−
Estonia2017A+A+Netherlands2017AAAAAAUganda2017B+B+
2018A+A+2018AAAAAA2018B+B+
Finland2017AA+AANew Zealand2017AA+AA+Ukraine2017B−CCC+
2018AA+AA+2018AA+AA+2018B−B
France2017AAAANigeria2017B+B+United Kingdom2017AAAA
2018AAAA2018B+B+2018AAAA
Georgia2017BB−BB−N. Macedonia2017BBBB+United States2017AAAAAA
2018BB−BB−2018BBBB2018AAAAAA
Germany2017AAAAAANorway2017AAAAAAUruguay2017BBB−BBB−
2018AAAAAA2018AAAAAA2018BBB−BBB−
Greece2017CCCPanama2017BBBBBBZambia2017BB
2018BCC2018BBBBBB2018BB
Hong Kong2017AA+AA+Paraguay2017BB+BB+
2018AA+AA+2018BB+BB+
Table 7. Neural network training results summary.
Table 7. Neural network training results summary.
NN Learning
Algorithm
SeriesNon-HistoricalHistorical
InputsStabilityActivitySocialMulti-AspectStabilityActivitySocialMulti-Aspect
BFGMSE328.49329.92359.07316.4216.6118.6117.9911.82
Iterations2726173759454238
LMMSE329.16317.93351.10314.3514.3214.8817.9213.39
Iterations70302227138534037
RBMSE343.75331.57347.42327.2611.9715.3715.2114.87
Iterations248013839914234
Table 8. Support vector machine training results summary.
Table 8. Support vector machine training results summary.
SVM Kernel FunctionSeriesNon-HistoricalHistorical
InputsStabilityActivitySocialMulti-AspectStabilityActivitySocialMulti-Aspect
LinearMSE344.99362.83363.82322.0814.2714.2714.1911.32
Iterations647701701582311363363295
Gaussian RBFMSE323.47308.61325.13295.4612.2712.4712.219.08
Iterations755809755643415519467386
PolynomialMSE334.07317.74337.22304.2513.1613.5913.5310.29
Iterations755755701629571519519489
Table 9. IBA-DE vs. neural network and support vector machines test results summary.
Table 9. IBA-DE vs. neural network and support vector machines test results summary.
ModelSeriesNon-HistoricalHistorical
InputsStabilityActivitySocialMulti-AspectStabilityActivitySocialMulti-Aspect
IBA-DEMSE303.15317.79319.21324.376.526.537.347.22
NN311.90314.73343.56307.266.616.6414.596.57
SVM305.46330.12320.93309.087.247.837.606.86
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jelinek, S.; Milošević, P.; Rakićević, A.; Poledica, A.; Petrović, B. A Novel IBA-DE Hybrid Approach for Modeling Sovereign Credit Ratings. Mathematics 2022, 10, 2679. https://doi.org/10.3390/math10152679

AMA Style

Jelinek S, Milošević P, Rakićević A, Poledica A, Petrović B. A Novel IBA-DE Hybrid Approach for Modeling Sovereign Credit Ratings. Mathematics. 2022; 10(15):2679. https://doi.org/10.3390/math10152679

Chicago/Turabian Style

Jelinek, Srđan, Pavle Milošević, Aleksandar Rakićević, Ana Poledica, and Bratislav Petrović. 2022. "A Novel IBA-DE Hybrid Approach for Modeling Sovereign Credit Ratings" Mathematics 10, no. 15: 2679. https://doi.org/10.3390/math10152679

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop