Abstract
Recently, the total net assets of mutual funds have increased considerably and turned them into one of the main investment instruments. Despite this increment, every year a considerable number of funds disappear. The main purpose of this paper is to determine if the neural networks can be a valid instrument to detect the survival capacity of a fund, using the traditional variables linked to the literature of disappearance funds: age, size, performance and volatility. This paper also incorporates annualized variation in return and the Sharpe ratio as variables. The data used is a sample of Spanish mutual funds during 2018 and 2019. The results show that the network correctly classifies funds into surviving and non-surviving with a total error of 13%. Moreover, it shows that not all variables are significant to determine the survival capacity of a fund. The results indicate that surviving and non-surviving funds differ in variables related to performance and its variation, volatility and the Sharpe ratio. However, age and size are not significant variables. As a conclusion, the neural network correctly predicts the 87% of survival capacity of mutual funds. Therefore, this methodology can be used to classify this financial instrument according to its survival or disappearance.
1. Introduction
In recent years, the total net assets of Spanish mutual funds have experienced a significant increase and became one of the main investment instruments used by investors. By the end of 2019, Spanish mutual funds had reached €276,866 millions of total net assets, outstripping the historical maximum reached in 2006. Furthermore, on analyzing the evolution of the risk profile of Spanish investors, it can be observed that they have channeled their investments towards more dynamic positions with a greater amount of equities component in their profiles. Among the reasons for this behavior, the most outstanding is the negative evolution of interest rates, which has forced participants to look for an extra return on their investments. The other main reason is structural and consists in the increasing financial culture among the average Spanish savers.
Despite this increment in the assets of mutual funds, every year a considerable number of funds disappear. In particular, there was a total of 273 disappearances between 2018 and 2019. Therefore, the main purpose of this paper is to determine if the neural networks can be a valid instrument to detect the survival capacity of mutual funds, as an alternative technique to existing ones. It will also allow us to contrast which variables are significant in the survival of the funds. If the error produced by the network is small, it can be analyzed which factors describe the disappearance of mutual funds and thus provide additional information about the risk of this product for both investors and managers.
In [1], a systematic review of the literature is carried out, finding that certain variables affect the survival capacity of mutual funds.
The two most analyzed variables in the literature are the size of the fund and the return of the fund. Numerous studies have stated that the most profitable funds and those with a greater asset volume have a greater probability of survival [2,3,4,5,6,7].
There is a lack of agreement as to over how many years prior to disappearance the return of a fund must be analyzed. To this effect, refs. [8,9,10] analyze the minimum period with negative returns before the fund disappears. While [8] finds that a fund with negative returns for three consecutive years is more likely to disappear, ref. [9] observes that funds that present bad results are more likely to continue behaving this way and do not survive if they exhibit negative returns for up to five years before their disappearance. Consequently, 1- and 3-year returns, and annualized variations in returns 1, 2 and 3 years before the disappearance, have been included in the present analysis.
Other studies also analyze what happens with investment flows prior to disappearance. In [3] mutual fund mergers and liquidations are analyzed, finding that smaller funds with lower inflows are more likely to disappear to the market. This finding is in line with [2,11,12].
Recently, refs. [6,13] analyze the characteristics of the target and acquiring funds. The study in [6] is one of the few that analyzes the Sharpe ratio as a determinant of mortality, concluding that target portfolios have lower Sharpe ratios in the pre-merger period than their acquiring portfolios. On the other hand, ref. [13] shows that the likelihood of disappearance is not significantly related to past performance because the worsening performance is a temporary phenomenon. Moreover, this author exposes that the risk measured as volatility is not significantly related to the probability of disappearance. Consequently, this study incorporates the traditional variables reported in the literature—age, size, investment flows, return, and risk—while also adding others considered as suitable to complement the study: annualized variation in return and the Sharpe ratio.
The use of an artificial neural network is proposed in this paper to analyze the incidence of these variables in the survival capacity of mutual funds. The applied network is the Self Organizing Maps (SOM) introduced by T. Kohonen.
This kind of artificial neural network has been applied in numerous studies in the field of finance, marketing and management, among others [14]. Regarding the application of this methodology to mutual funds, ref. [15] used SOM to evaluate the official classification of Spanish mutual funds by the CNMV (Spanish National Securities Market Commission) and Inverco (Spanish Association of Investment and Pension Funds), aiming to improve this classification with nonlinear techniques. Ref. [16] used SOM to determine if the characteristics which define the non-surviving funds are different according to their investment objectives. Recently, ref. [7] tested the suitability of SOM to predict the survival of mutual funds. Other authors have focused on forecasting the net asset value of mutual funds [17] and mutual fund performance [18,19], using other types of artificial neural networks such as back-propagation neural networks.
This paper aims to contribute towards the financial literature on mutual funds from three points of view. Firstly, using SOM to define the survival capacity of mutual funds as a complementary method to econometric models. Secondly, analyzing in a different market to the US, since this is the focus of most previous studies [1] and, thirdly, using other variables to those traditionally used. This paper is a totally complementary study to the previous ones because [16] used SOM to cluster mutual funds that disappear during 2013–2015 and analyzed if the variables, which define the survival capacity, take similar values for all of them or were different depending on the funds’ investment objectives. On the other hand, ref. [7] used the Cox model to define the survival model of Spanish mutual funds.
2. Self Organizing Maps
SOM are a particular kind of artificial neural network. As part of them, this tool, developed by Teuvo Kohonen [20], is inspired by human neural functioning. SOM are based on the human brain’s ability to store similar information in a nearby area. For this reason, SOM are unsupervised neural networks, that is, they do not use external information for their learning algorithm, but rather use the similarity between the input information to create a features map where the input patterns are located according to the similarity between all their characteristics. SOM are formed by two layers with input and output neurons completely connected. A weight is associated with each connection or synapsis. In addition, to accomplish the training process, the output neurons (which form a bidimensional map) are connected among them and with themselves (lateral and auto-recurrent connections, respectively).
The implementation of this methodology has been carried out using the Toolbox for Matlab developed by the Laboratory of Computer and Information Science at the Helsinki University of Technology. Briefly, we can explain how SOM work in the following steps:
- Each pattern from the input information is represented by a vector, in which each component collects the value of a variable that defines the pattern. In this paper, the patterns are mutual funds, and the components that form the vectors are the variables that influence their survival capacity. Thus, an input pattern is represented as where p refers to the pattern and i to the variable, having a total of n variables that will coincide with the number of input neurons in the SOM. To homogenize the data, all the variables are normalized, so the variance of all of them is equal to one.
- As SOM use a competitive learning process, neurons in the output layer compete to become the winning neuron or the Best Matching Unit (BMU). For a pattern p, its BMU is the output neuron that accomplishes , where symbolizes a measure of distance, is the vector of weights formed by the weights that connect each input neuron i with an output neuron k, and k* refers to the BMU. When using the Euclidean distance, the criterion for determining the BMU for a pattern p is . Initially, we consider all the weights as random values.
- Once the BMU for a pattern has been determined, the weights associated with this neuron, as well as its neighbor neurons, are modified. The objective of this process is that patterns with similar characteristics also have the same BMU or another located close to it. The way to define the neighborhood area is by using a function that decreases as the distance between the output neurons increases. The function used in this case is the Gaussian function, , where indicates the distance between an output neuron and the BMU, and σ is the neighborhood radius that decreases when the number of iterations increases. The new weights, then, are calculated as follows: , where is the learning rate. This rate, for convergence reasons, must decrease, using in our case , where is the initial learning rate (by default, 0.5) and T is the total number of iterations.
- All the patterns are introduced into the network until obtaining the location of all patterns on the map (their position is determined by the corresponding BMU). In this way, the n-dimensional patterns are placed on a bidimensional map, with the most similar patterns being close and those that are different being further away.
It is important to note that SOM allows us to analyze non-linear relationships between variables without previously defining a specific relationship between them.
SOM can be applied to solve different problems; however, in the vast majority of studies, they are used for interpreting data, identifying objects, clustering and even reducing the dimension of the problem [21,22].
In this article, SOM is applied to cluster funds according to the variables analyzed in the financial literature on the survival capacity of mutual funds.
3. Data
3.1. Context of Spanish Mutual Funds Industry
Our study is placed in the context of the Spanish mutual fund industry. This section highlights some key features of this industry, and the data and variables used in this analysis are presented.
Figure 1 shows the total number of non-surviving funds each year and the annual mortality rate. This rate relates the number of disappeared funds in a period with the total number of live funds in the previous period.
Figure 1.
Evolution of the number of non-surviving funds in the Spanish market, 1985–2019.
Figure 1 clearly shows two time periods with high fund mortality, 2001–2003 and, even more obviously, 2008–2015. The first period coincides with the bursting of the technological bubble, causing huge losses among mutual fund holders, especially if they were particularly linked with the technological sector.
The second period coincides with the financial crisis which began in 2007, with a significant increase in the number of funds disappearing in 2009, reaching a total of 440 funds and a mortality rate of approximately 15%, representing the worst year ever for these funds in the Spanish market. There was also high mortality in the period 2013–2015, with a total of 309 (14% of the mortality rate) and 289 (14.8%) funds disappearing in the years 2013 and 2015, respectively. Notably, many fund disappearances in this period came about because of the major financial restructuring that occurred in Spain in this period, causing a process of mergers forced by the absorption or acquisition of fund companies.
As of 2016, the number of disappearances reduced, although more than 100 were still reported per year.
This study focuses on the years 2018 and 2019 because it is a period without external effects (for example, the financial restructuring that took place in Spain in the previous years) that may condition the results. Furthermore, this period is not directly affected by any of the financial crises that occurred in the early years of this century.
3.2. Sample
The data have been obtained from the Morningstar Direct database, which contains information on mutual funds around the world. In the database, there are a total of 1778 Spanish surviving funds at the end of 2019 and 291 corresponds to non-surviving funds during the last two years (2018 and 2019).
The funds that disappeared in the years covered in this study, 2018 and 2019, and the variables included in the analysis, were extracted from this database. Notably, only those funds for which data were available for up to 4 years prior to their disappearance, or those alive as of 2019, were used, because the analysis of certain variables, three-year annualized return and three-year standard deviation, limits the sample. Moreover, it is worth mentioning that the sample excludes all the guarantee funds in the Spanish market. These funds, by definition, have inception and obsolete dates a priori, so their inclusion could distort the results.
Once the funds that do not fulfill these characteristics have been removed, a sample of 142 non-surviving funds is obtained and 142 surviving funds are randomly added from the total available sample that met the requirements. It is important to point out that surviving funds and non-surviving funds have the same weight in the sample because the aim of our study is to understand which variables have a greater impact on survival capacity.
The variables were selected considering the theoretical framework of survival capacity of mutual funds. Table 1 details the variables selected for the analysis and their description and Table 2 presents the descriptive statistics of the sample divided into surviving and non-surviving funds.
Table 1.
List and definition of variables.
Table 2.
Descriptive statistics.
Table 3 presents the correlation matrix. A high correlation between the variables related to performance (variables 5 and 9) and fund risk (variables 10 and 11) is found. In order not to overrepresent these variables in the network, one-year return (variable 5) and one-year standard deviation (variable 10) are excluded from the analysis, maintaining the three-year annualized return (variable 5) and the three-year deviation (variable 11).
Table 3.
Correlation matrix.
All variables are normalized. This process is necessary because the input variables are measured on different scales.
4. Results
When the network is implemented, it generates an output map of dimension 12 × 7 (12 rows × 7 columns). The dimension of the map depends on the number of units in it. When no number of units is specified, the default value assigned by the Toolbox is 5 times the square root of the number of patterns. Since we work with 284 mutual funds, the map will have 5 ∗ sqrt (284) = 84.26 units, which is rounded to 84 units. The number of rows and columns are then determined by calculating the two biggest eigenvalues of the input vectors. The ratio between side lengths is established to be the closest possible to 84, the value that in our case coincides with the product of 12 × 7.
Since the aim of this paper is to determine if the neural networks can be a valid instrument to detect the survival capacity of mutual funds in the Spanish market, it is established to force the output into two groups.
Figure 2 shows this map, where the corresponding patterns (funds) have been numbered and “YES” or “NO” indicate whether the fund disappeared or not at the end of 2019. If the fund disappears, the year of disappearance is also detailed (“18” or “19”).
Figure 2.
Self Organizing Maps (SOM) for Spanish mutual funds.
Group 1, located at the bottom, is formed by 133 funds of which 119 are surviving funds and 14 are non-surviving funds. Group 2, located at the top, contains 151 funds, specifically 23 surviving funds and 128 non-surviving funds.
Table 4 shows the accuracy percentage of the model, the error percentages of each type, and the total percentage error. As can be seen, the SOM correctly predicts 86.97% of the survival capacity of mutual funds in the Spanish market. Therefore, neural networks can be used to classify this financial instrument. If the variables analyzed in this work for a fund are included, this methodology makes it possible to predict with high accuracy the possibility that the fund will disappear or not.
Table 4.
Error and accuracy percentages.
Once the methodology is validated, the second step is to analyze which variables define the survival capacity of mutual funds. To carry out this analysis, it is necessary to interpret the map of features (Figure 3). The map of features shows the value taken for each variable in the corresponding cell of the SOM (Figure 2). This value is represented by a color scale, where the highest values correspond to the red color, while the minimum values of each variable are represented in blue (Figure 3). The best expected value for some variables (volatility, for example) is represented by a low value (blue color), whereas for other variables (return, Sharpe ratio) a high level (red color) is desirable.
Figure 3.
Map of features (The scale of values next to each map shows the rank of values which are taken by the representative patterns of all the mutual funds located in one cell).
Figure 3 shows that variables 6–9 and 12 present a location of high and low values similar to the SOM groups. This distribution is not observed in variables 1–5 and 11. For example, for group 1 of SOM (surviving funds), variable 1 (age) includes the full scale of values (from blue to red), so this group includes young and old funds, while the variable 6 (VarReturn1y) clearly shows that there are only high values of this variable in the group 1 (surviving funds) and there are medium and low values for group 2 (non-surviving funds).
Table 5 summarizes each group’s characteristics with their corresponding mean and standard deviation of each variable.
Table 5.
Mean and standard deviation of variables in each group.
5. Discussion
The map of SOM (Figure 2) and features (Figure 3), and Table 5 allow us to observe the similarities between the funds included in each group.
If each variable is analyzed separately, it can affirm that age (variable 1) and size (variable 2) do not determine the survival capacity of Spanish mutual funds during 2018 and 2019. This result is different from findings by [3,4,5,6,7,13,23], among others.
Regarding the variation of size (variables 3 and 4), they do not define the survival of funds, however, group 2 shows investment outflows as average in 1 (mean: −0.21) and 2 (mean: −0.21) years prior to the study in comparison to group 1 which presents investment inflows (mean: 0.33). This finding is in line with [2,3,7,11,12,16].
On the other hand, it can be observed that the annualized variation of return (variables 6, 7 and 8), three-year return (variable 9) and the Sharpe ratio (variable 12) define the mortality of Spanish funds because they are clearly different in each group. The average performance is clearly higher in surviving than in non-surviving funds over the long term, in line with other studies [2,4,7,12].
The Sharpe ratio, which is considered as a variable different from traditional, is higher as average in surviving funds (mean: 1.58) in comparison to non-surviving funds (mean: −8.11).
In conclusion, group 1, which includes the surviving funds, is characterized by high performance in three years, high volatility, and a high value of the Sharpe ratio, while group 2 (non-surviving funds) shows the following characteristics: low performance, low volatility and, moreover, funds of this group show investment outflows.
Author Contributions
Methodology, L.F.-A., M.-T.S.-F., G.B.-M., and A.T.; software, L.F.-A., M.-T.S.-F., G.B.-M., and A.T.; writing—original draft preparation, L.F.-A., M.-T.S.-F., G.B.-M., and A.T.; writing—review and editing, L.F.-A., M.-T.S.-F., G.B.-M., and A.T. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this study are available on request from the corresponding author.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Fabregat-Aibar, L.; Terceño, A.; Barberà-Mariné, M.G. Analysis of the survival capacity of mutual funds: A systematic review of the literature. Int. J. Manag. Financ. 2017, 13, 440–474. [Google Scholar]
- Jayaraman, N.; Khorana, A.; Nelling, E. An analysis of the determinants and shareholder wealth effects of mutual fund mergers. J. Financ. 2002, 57, 1521–1551. [Google Scholar]
- Zhao, X. Exit decisions in the U.S. mutual fund industry. J. Bus. 2005, 78, 1365–1401. [Google Scholar]
- Rohleder, M.; Scholz, H.; Wilkens, M. Survivorship bias and mutual fund performance: Relevance, significance, and methodical differences. Rev. Financ. 2011, 15, 441–474. [Google Scholar]
- Sherrill, D.E.; Stark, J.R. ETF liquidation determinants. J. Empir. Financ. 2018, 48, 357–373. [Google Scholar]
- Lapatto, A.; Puttonen, V. Life after death: Acquired fund performance. Manag. Financ. 2018, 44, 389–402. [Google Scholar]
- Barberà-Mariné, M.G.; Fabregat-Aibar, L.; Terceño, A. Investment objectives and factors that influence the disappearance of spanish mutual funds. J. Bus. Econ. Manag. 2020, 21, 255–276. [Google Scholar]
- Brown, S.J.; Goetzmann, W.N. Performance Persistence. J. Financ. 1995, 50, 679–698. [Google Scholar]
- Carhart, M.M. On persistence in mutual fund performance. J. Financ. 1997, 52, 57–82. [Google Scholar]
- Cogneau, P.; Hübner, G. The prediction of fund failure through performance diagnostics. J. Bank. Financ. 2015, 50, 224–241. [Google Scholar]
- Allen, D.; Parwada, J. Investor’s response to mutual fund Company mergers. Int. J. Manag. Financ. 2006, 2, 121–135. [Google Scholar]
- Andreu, L.; Sarto, J.L. Financial consequences of mutual fund mergers. Eur. J. Financ. 2016, 22, 529–550. [Google Scholar] [CrossRef]
- McLemore, P. Do Mutual Funds have decreasing returns to scale? Evidence from Fund Mergers. J. Financ. Quant. Anal. 2019, 54, 1683–1711. [Google Scholar] [CrossRef]
- Tkác, M.; Verner, R. Artificial neural networks in business: Two decades of research. Appl. Soft Comput. 2015, 38, 788–804. [Google Scholar] [CrossRef]
- Moreno, D.; Marco, P.; Olmeda, I. Self-organizing maps could improve the classification of Spanish mutual funds. Eur. J. Oper. Res. 2006, 174, 1039–1054. [Google Scholar] [CrossRef]
- Terceño, A.; Gloria Barberà-Mariné, M.; Fabregat-Aibar, L.; Sorrosal-Forradellas, M.T. The Behaviour of Non-Surviving Spanish Funds According to Their Investment Objectives; Springer: Cham, Switzerland, 2018; Volume 125. [Google Scholar] [CrossRef]
- Chiang, W.C.; Urban, T.L.; Baldridge, G.W. A neural network approach to mutual fund net asset value forecasting. Omega 1996, 24, 205–215. [Google Scholar] [CrossRef]
- Indro, D.C.; Jiang, C.X.; Patuwo, B.E.; Zhang, G.P. Predicting mutual fund performance using artificial neural networks. Omega 1999, 27, 373–380. [Google Scholar] [CrossRef]
- Wang, K.; Huang, S. Using fast adaptive neural network classifier for mutual fund performance evaluation. Expert Syst. Appl. 2010, 37, 6007–6011. [Google Scholar] [CrossRef]
- Kohonen, T. Self-Organized Formation of Topologically Correct Forms Features Maps. Biol. Cybern. 1982, 43, 59–69. [Google Scholar] [CrossRef]
- Chen, N.; Ribeiro, B.; Vieira, A.; Chen, A. Clustering and visualization of bankruptcy trajectory using self-organizing map. Expert Syst. Appl. 2013, 40, 385–393. [Google Scholar] [CrossRef]
- Du Jardin, P.; Séverin, E. Predicting corporate bankruptcy using a self-organizing map: An empirical study to improve the forecasting horizon of a financial failure model. Decis. Support Syst. 2011, 51, 701–711. [Google Scholar] [CrossRef]
- Bu, Q.; Lacey, N. On understanding mutual fund terminations. J. Econ. Financ. 2008, 33, 80–99. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).