Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks

Chu, Yizhou; Liu, Guiyang; Yu, Qiuyu; Yang, Chunyan

doi:10.3390/math14132261

Open AccessArticle

Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks

by

Yizhou Chu

¹,

Guiyang Liu

¹,

Qiuyu Yu

^2,* and

Chunyan Yang

³

¹

School of Management, Hefei University, Hefei 230601, China

²

Library, NingboTech University, Ningbo 315100, China

³

Library, Ningbo University, Ningbo 315211, China

^*

Author to whom correspondence should be addressed.

Mathematics 2026, 14(13), 2261; https://doi.org/10.3390/math14132261 (registering DOI)

Submission received: 22 May 2026 / Revised: 15 June 2026 / Accepted: 17 June 2026 / Published: 25 June 2026

(This article belongs to the Special Issue Quantitative Finance with Mathematical Modelling)

Download

Browse Figures

Versions Notes

Abstract

The financial early warning mechanism of listed companies has an important strategic value for maintaining the stability of the capital market and preventing systemic financial risks. This study proposes a hybrid model (EBWO-BP) based on the improved beluga optimisation algorithm (EBWO) and BP neural network for financial early warning research. Innovative T-SNE nonlinear dimensionality reduction technique is applied to the multidimensional evaluation system constructed by 23 financial and two non-financial indicators. The empirical evidence based on the data of A-share listed companies in 2022–2024 shows that the accuracy of the EBWO-BP test set reaches 86.51% (AUC = 0.83), which demonstrates a significant prediction advantage compared with the optimisation algorithm models such as GA-BP and PSO-BP, as well as the CNN and LSTM deep learning models; when the sample size is increased to 700 groups, the accuracy is improved to 89.05%, verifying the model robustness. The method achieves significant improvement of financial risk prediction through algorithm fusion innovation, and provides methodological innovation and practical reference for intelligent financial risk monitoring.

Keywords:

financial early warning; beluga optimisation algorithm; BP neural network; T-SNE dimensionality reduction; non-linear risk modelling

MSC:

91-08

1. Introduction

The stability of capital markets is the cornerstone of macroeconomic resilience; however, the global financial landscape has become increasingly volatile in recent years. Factors such as escalating geopolitical tensions and disruptive technological changes have collectively amplified systemic risks [1]. Against this backdrop, the Chinese A-share market—the primary financing platform for the world’s second-largest economy—has seen a significant rise in financial distress events. Market regulatory data shows that hundreds of listed companies are subject to special treatment each year due to consecutive losses or audit irregularities, resulting in a significant decline in shareholder wealth and triggering a chain reaction within the financial system. Therefore, establishing an accurate, timely, and robust financial early-warning mechanism for listed companies has become more important than ever. Such mechanisms serve not only as tools for investors to hedge against downside risks but also as key means for regulators to monitor systemic vulnerabilities and take proactive measures before potential liquidity crises escalate [2].

Current research in this field faces three core challenges: (1) the nonlinear and complex nature of financial risks, (2) multicollinearity and noise within high-dimensional indicators, and (3) the insufficient generalization capability and robustness of early warning models. To address these challenges, this study aims to construct a high-precision, strongly generalized financial early warning model. Traditional econometric models struggle to adequately address these challenges due to their limitations in handling nonlinear relationships, high-dimensional features, and complex interaction effects. Consequently, research on financial early warning models worldwide has progressively shifted its focus from traditional econometric approaches to machine learning models [3,4,5]. Traditional econometric models mainly include the univariate model, multivariate discriminant model, and logistic model. The univariate model is simple and intuitive to construct, with lower requirements on data volume and arithmetic power, and the results are easier to interpret, but it has obvious limitations because it relies on only a single indicator, which cannot comprehensively reflect the financial status of the enterprise [6,7]. The advantage of the multivariate discriminant model is that it integrates the multidimensional information, and it can effectively identify systematic differences between different categories, but it relies on the static financial indicators and a strict multivariate positive discrimination model. However, it relies more on static financial indicators and strict multivariate normal distribution assumptions, lacks consideration of industry differences and changes in the economic environment, and is difficult to capture complex nonlinear risk relationships with linear discriminant methods [8,9]. Compared to Z-score models, logistic models offer greater applicability as they do not require strict normal distribution assumptions and can output probabilistic risk results [10,11]. However, they still struggle to adequately capture the complex nonlinear relationships between financial indicators and risk and remain sensitive to multicollinearity.

As scholars’ research deepens, BP neural networks in machine learning have gradually replaced traditional econometric models in financial early warning [12]. BP neural networks are far superior to traditional econometric models in terms of nonlinear modelling ability, high-dimensional data processing ability, dynamic learning, and adaptive optimization [13,14], but the single BP neural network still possesses inherent flaws that remain unresolved. Its performance is highly dependent on the initial weight and threshold settings. The traditional gradient descent method is prone to getting stuck in local optima, converges slowly, and is sensitive to parameter initialization. These limitations severely constrain its predictive accuracy and stability when applied to complex financial data. Although previous studies have attempted to optimize BP neural network initial parameters using genetic algorithms (GA) [15], particle swarm optimization (PSO) [16], and the whale optimization algorithm (WOA) [17], these optimization algorithms themselves suffer from issues such as premature convergence and insufficient search efficiency, resulting in limited improvements in model performance Table 1 [18,19].

Therefore, a core research gap is the need for a more efficient and robust optimization algorithm to overcome the inherent limitations of BP neural networks. Such an algorithm would fundamentally enhance the accuracy and generalization capability of financial early warning models. To address this, our study innovatively introduces an Enhanced Beluga Whale Optimization (EBWO) algorithm to optimize the BP neural network, constructing a hybrid EBWO-BP model. The primary rationale for selecting EBWO lies in its unique algorithmic mechanism, which precisely addresses the shortcomings of BP networks and the characteristics of financial data. The parameter space of financial indicators often contains multiple local optima, requiring optimization algorithms to possess both extensive global exploration and refined local exploitation. By employing a quasi-backpropagation learning strategy, EBWO systematically expands the search range, significantly enhances population diversity, and effectively avoids the problem of slow convergence in the later stages often encountered by optimization algorithms such as GA and PSO when optimizing BP neural networks. Its whirlwind foraging strategy simulates an efficient spiral search, guiding the population to rapidly converge toward potential optimal regions. This balanced mechanism of “global exploration and local exploitation” enables EBWO to outperform traditional optimization algorithms such as GA and PSO in finding the optimal initial parameters for BP networks.

Financial indicators provide a basic and verifiable objective basis for financial early warning by quantifying the historical financial performance, but the lag and manipulability limit their early warning effectiveness; non-financial indicators provide forward-looking signals by capturing industry risks, management decision-making bias, and external shocks, effectively making up for the static defects of financial data. At present, most scholars only consider financial indicators, ignoring the impact of non-financial indicators on the prediction results of the corporate financial early warning model [23,24]. Therefore, in this paper, the selection of financial early warning indicators, not only selected 23 traditional financial indicators, but also selected the proportion of shares held by the first major shareholder of the enterprise, whether the internal control is defective two non-financial indicators, the two synergistically build a dynamic early warning system, taking into account short-term thresholds and long-term drivers, to enhance the timeliness and robustness of the early warning.

For the dimensionality reduction of high-dimensional financial early warning indicator data, the current research mostly adopts the mainstream principal component analysis [25,26], factor analysis [27], and linear discriminant analysis [28,29], which are highly efficient in dealing with the linear relationship, but are unable to deal with the nonlinear relationship between the indicators, and are more sensitive to outliers. In this paper, the T-SNE algorithm is adopted for the dimensionality reduction of the data, and its visualisation advantages can not only assist in the identification of potential risk clustering features, provide data support for the dynamic adjustment of warning thresholds, but also retain the local structure, which effectively solves the limitations of the traditional linear dimensionality reduction methods that are difficult to deal with non-linear relationships [30,31], while reducing the computational complexity of the model, which is of high value in the financial early warning applications.

This paper employs an enhanced Beluga Whale Optimization Algorithm to optimize a BP neural network, constructing an EBWO-BP financial early warning model. Early warning indicators from both financial and non-financial metrics are selected and subjected to dimensionality reduction via the T-SNE algorithm. The objective is to achieve a significant improvement in financial risk prediction performance, providing new methodological support and practical reference for intelligent financial risk monitoring.

The primary innovative contributions of this study are reflected in the following three aspects. First, at the core model algorithm level, addressing the inherent limitations of traditional BP neural networks-such as sensitivity to initial parameters, susceptibility to local optima, slow convergence speed-and existing optimization algorithms (e.g., GA, PSO)-such as premature convergence and insufficient global exploration capabilities-this paper innovatively designed the EBWO-BP hybrid model. This effectively enhances parameter optimization efficiency and global optimization capabilities. Second, at the technical pathway level, an integrated innovative framework was constructed. This framework combines multidimensional non-financial indicators, T-SNE nonlinear dimensionality reduction, and intelligent algorithm optimization to systematically process high-dimensional, nonlinear early-warning data from listed companies. Finally, at the application value level, rigorous robustness testing confirmed the model’s outstanding generalization performance. Its specific application value is threefold. First, it provides regulatory authorities with a tool for real-time, dynamic monitoring of financial risks among A-share-listed companies. Second, it offers institutional investors quantitative decision support to identify potential ST companies and mitigate investment risks. Third, it provides enterprises with intelligent diagnostic insights to optimize financial structures and enhance risk governance. This contributes practical methodological innovation and a practical model for building an intelligent financial risk prevention and control system aligned with the development requirements of new-quality productive forces.

2. Improvement of the Beluga Optimization Algorithm

2.1. Beluga Optimization Algorithm

The BWO algorithm is a novel meta-heuristic optimization algorithm proposed by Changting Zhong and his research team in 2022. The algorithm constructs a three-stage optimization framework containing global exploration, local exploitation, and population renewal mechanisms by simulating three typical behavioural patterns exhibited by beluga whale populations in natural environments—collaborative cruising, dynamic predation, and the ecological phenomenon of whale colonies [32]. It innovatively maps individual beluga whales as candidate solutions in a multi-dimensional solution space, and achieves the optimization goal through intelligent updating of individual positions during the iterative process. Its position matrix is:

X = (\begin{matrix} x_{1,1} & x_{1,2} & \dots & x_{1, m} \\ x_{2,1} & x_{2,2} & \dots & x_{2, m} \\ ⋮ & ⋮ & ⋮ \\ x_{n, 1} & x_{n, 2} & \dots & x_{n, m} \end{matrix})

(1)

The population of beluga whales is represented by a position matrix

X \in (R n \times m)

, where n is the population size and mis the dimension of the optimization problem. Each element

X_{i, j}

represents the position (value) of the

i t h

candidate solution (individual beluga) in the

j t h

dimension of the search space.

The selection between the global exploration and local exploitation phases is governed by a dynamic balance factor

B f

, which is compared against a predefined threshold

T v

(set to 0.5 in this study). Bf is calculated as:

B f = B 1 (1 - \frac{a}{2 A})

(2)

where a is the current iteration number, A is the maximum iteration number, and B1 is a random number between (0, 1).

When

B f

>

T v

, the algorithm prioritizes the execution of the global exploration strategy, which enhances the ability to search for unknown regions of the solution space by simulating the collaborative swimming behaviour of a group of beluga whales.

\{\begin{matrix} X_{i j}^{a + 1} = X_{i p_{j}}^{a} + (X_{r p_{1}}^{a} - X_{i p_{j}}^{a}) (1 + r_{1}) \sin (2 π r_{2}) j = 2 α \\ X_{i j}^{a + 1} = X_{i p_{j}}^{a} + (X_{r p_{1}}^{a} - X_{i p_{j}}^{a}) (1 + r_{1}) \cos (2 π r_{2}) j = 2 α + 1 \end{matrix}

(3)

where

X_{i j}^{a + 1}

is the updated position of the

i t h

individual in dimension j,

P j

is the dimension index randomly drawn from the q-dimensional decision space,

X_{i p_{j}}^{a}

and

X_{r p_{j}}^{a}

are the current positions of the

i t h

and

r t h

individuals, and r₁ and r₂ obey a random perturbation factor uniformly distributed U(0, 1), which is used to enhance the exploratory capability of the algorithm.

When Bf ≤

T v

, a local exploitation strategy is triggered to simulate the fine-grained feeding behaviour of beluga whales, focusing on deep optimality seeking within the neighbourhood of the current optimal solution.

X_{i}^{a + 1} = r_{3} X_{b e s t}^{a} - r_{4} X_{i}^{a} + C_{1} L_{F} (X_{r}^{a} - X_{i}^{a})

(4)

where

X_{i}^{a}

and

X_{r}^{a}

are the current positions of the

i t h

and

r t h

individuals,

X_{i}^{a + 1}

is the updated position of the

i t h

individual, and

X_{b e s t}^{a}

represents the position of the global optimal solution of the current population in the

a t h

iteration.

When the global or local search phase is completed and

B f

≤

Z f

, the beluga whale enters the whale fall phase.

X_{i}^{a + 1} = r_{5} X_{i}^{a} - r_{6} X_{r}^{a} + r_{7} X_{s t e p}

(5)

X_{s t e p} = (u_{b} - l_{b}) \exp (- C_{2} \frac{a}{A})

(6)

Z f = 0.1 - \frac{0.05 k}{K}

(7)

where

X_{s t e p}

is the dynamic step size of the whaling behaviour, the value of which is determined by the positional difference between the current individual and the random individual, the probability factor, and the population characteristics; C2 adaptive step factor;

Z f

is the whaling probability threshold.

2.2. Quasi-Opposite Learning Strategy

In this study, we propose a beluga optimization algorithm based on the improvement of a quasi-opposite learning strategy. The quasi-opposite learning strategy makes up for the deficiencies of the standard BWO algorithm in population diversity maintenance and local optimum avoidance through the symmetric exploration of the solution space and the dynamic perturbation mechanism, and the quasi-opposite learning strategy is shown in the following equation:

\{\begin{matrix} X_{i, j}^{o} = L b_{j} + U b_{j} - X_{i, j} \\ C_{i, j} = (L b_{j} + U b_{j}) / 2 \end{matrix}

(8)

\{\begin{matrix} X_{i, j}^{a + 1} = C_{i, j} + (X_{i, j}^{o} - C_{i, j}) \times rand, (X_{i, j} < C_{i, j}) \\ X_{i, j}^{a + 1} = C_{i, j} + (C_{i, j} - X_{i, j}^{o}) \times rand, else \end{matrix}

(9)

where

X_{i, j}

is the location of beluga whales,

X_{i, j}^{o}

is the quasi-opposite mirror solution of

X_{i, j}

,

X_{i, j}^{a + 1}

is the a+1th generation candidate solution generated by the QOL strategy,

L b_{j}

,

U b_{j}

are the lower and upper bounds of the

j t h

dimensional variables, respectively, and

C_{i, j}

are the statistical features characterising the current population in the

j t h

dimension.

2.3. Cyclone Foraging Strategy

This study proposes a second improved strategy as a cyclone foraging strategy for the local development phase of the beluga optimization algorithm. This strategy reconfigures the local search mechanism of the algorithm by simulating the spiral convergence behaviour of biological groups around the optimal individual in nature, which significantly improves the convergence speed and solution quality, and the process can be shown by the following equation.

X_{i}^{a + 1} = \{\begin{matrix} X_{b e s t}^{a} + r_{8} (X_{b e s t}^{a} - X_{i}^{a}) + δ (X_{b e s t}^{a} - X_{i}^{a}), & i = 1 \\ X_{b e s t}^{a} + r_{8} (X_{i - 1} (a) - X_{i}^{a}) + δ (X_{b e s t}^{a} - X_{i}^{a}), & i = 1,2, \dots, n \end{matrix}

(10)

δ = 2 e^{r_{9} \frac{A_{m a x} \cdot a + 1}{A_{m a x}}} \cdot \sin (2 π r_{9})

(11)

where Delta is the initial weight, which controls the initial motion of the helical motion, r₈, r₉ are independent random variables obeying a uniform distribution (0, 1).

3. Construction of EBWO-BP Neural Network Financial Early Warning Model

The operation mechanism of BP neural network is forward propagation of information and backward propagation of error, and the model can be trained to achieve a predetermined error target by transferring the error in the reverse direction [33]. His basic structure is shown Figure 1:

In this paper, the improved beluga optimization model is used to optimize the BP neural network so as to improve its prediction effect in financial early warning. Parameter settings: Population size (N) is 50, maximum number of iterations (A) is 100, equilibrium factor threshold (

T v

) is 0.5, whale fall probability threshold (

Z f

) is calculated dynamically, solution space bounds (

L b, U b

) [−1, 1], 10 neurons in the hidden layer, Tanh activation function in the hidden layer, Sigmoid activation function in the output layer, learning rate 0.01, maximum number of training epochs (Epochs) 1000, Target error (Goal) is 0.001, training algorithm is Levenberg–Marquardt backpropagation, and the random seed is 42.

The construction process of the EBWO-BP neural network model can be summarised in the following three core stages:

(1): Firstly, the standard BP neural network framework is established, and the number of neurons in the input layer, the topology of the hidden layer and the computational units in the output layer are specified based on the dimensions of the feature space and the demand of the prediction target.
(2): Introduce the improved beluga optimization algorithm, which seeks the optimal initial weight matrix and activation thresholds in the solution space by introducing an adaptive weight adjustment mechanism and dynamic search strategy.
(3): Input the standardized training set data into the optimized network for forward propagation computation. When the number of successive iterations with validation set error reaching the preset accuracy threshold, terminate the training and output the network parameters; if the error does not satisfy the convergence conditions, start the back-propagation mechanism to correct the network parameters and perform forward computation again.

The specific flow of the model run is shown in Figure 2:

4. Research Design

4.1. Sample Selection and Data Sources

Based on the definitional criteria generally adopted by existing studies, special treatment (ST) enterprises are used as the basis for determining financially distressed companies in this study, while non-ST enterprises are categorized as the reference sample with normal financial status [34]. The data are selected from China’s A-share listed companies from 2022 to 2024. Existing literature shows that corporate financial crises usually have significant prospective early warning characteristics, and empirical analyses show that about 80% of the cases of financial distress present identifiable abnormal financial indicator fluctuations 24–36 months before the outbreak of the crisis. This lagged effect has led to the widespread adoption of inter-period forecast modelling in academia, where financial ratios from the T-2 to T-3 accounting periods are used as input variables.

In this paper, financial data from year T-2 are used to predict the financial status of year T, with the corresponding year interval being 2020–2022. A total of 140 ST companies are obtained, and in accordance with the actual situation that there are fewer ST companies than non-ST companies in the stock market, the sample screening in this study follows a 1:2 matching ratio to match ST companies with non-ST. To ensure that the model does not favor the majority class (non-ST companies) during training due to differences in sample size, we further implemented loss function weighting (the weighting method) at the algorithmic level, which is the most direct and effective approach for embedding algorithms. In the cross-entropy loss function of the BP neural network, we assigned a higher penalty weight to samples from the minority class (ST companies). The weights are calculated based on the inverse of the class frequency in the training set, specifically set as: minority class weight = total number of samples/(number of classes × number of samples in the minority class). This causes the model, during optimization, to incur a greater loss when misclassifying minority-class samples (ST companies as normal), thereby forcing it to focus on and learn the key features of companies in financial distress. The control group of companies is screened through the same year, and industry classification criteria are used, based on the principle of asset size similarity, and finally, 420 groups of valid paired samples are obtained. Before building the model, the raw financial data underwent rigorous preprocessing. For financial indicators with missing values, the median values for the same year and industry were used to impute the missing data. Samples that could not be imputed using reasonable methods or that had critical data missing were excluded. To mitigate the impact of outliers on model training, all continuous financial indicators were trimmed to the top and bottom 1% percentiles. To eliminate the influence of unit differences and accelerate neural network convergence, all input features were Z-score standardized to have a mean of 0 and a standard deviation of 1. Following the aforementioned cleaning and transformation, a standardized dataset was ultimately obtained for modeling. ST companies are assigned a value of 1, and matched non-ST companies are assigned a value of 2 as a normal control group, which serves as the categorical dependent variable in the risk early warning model. The financial data used in this study are all from the Cathay Pacific Economic and Financial Research Database, and are extracted in strict compliance with industry standards and the principle of time consistency.

4.2. Financial Early Warning Indicators

At present, scholars have not reached a unanimous conclusion on the determination of key indicators affecting ST companies in the financial risk early warning of ST-listed companies, which is a unique problem in the Chinese capital market [35]. Existing empirical studies on the early warning of corporate financial risk are generally based on a multi-dimensional financial ratio indicator system, which can effectively identify and predict the enterprise’s financial health and its ability to sustain operations. Based on the theoretical framework of financial analysis, this study screens financial indicators from five key dimensions of core business activities: solvency, operational capacity, profitability, development capacity, and cash flow, as shown in Table 2.

4.3. Non-Financial Early Warning Indicators

In addition to the financial indicators that will obviously have an impact on the financial situation of the enterprise, non-financial indicators have a non-negligible role in financial early warning in listed companies [36,37]. In constructing the financial early warning model, this study fully recognizes the incremental value of non-financial indicators in revealing corporate governance issues and agency conflicts, thereby effectively addressing the lag and operational limitations of traditional financial data. To avoid introducing noise and overfitting due to indicator generalization, this study adheres to the principles of “theory-driven, literature-supported, data-accessible, and significantly correlated,” focusing on the core signals with the broadest early warning capabilities. Ultimately, the shareholding ratio of the largest shareholder and the presence of internal control deficiencies were selected as non-financial early warning indicators Table 3. Theoretically, these two indicators originate from agency theory and signaling theory, respectively, serving as direct proxy variables for equity structure risk and the effectiveness of internal controls. Their predictive efficacy has been widely supported by empirical evidence in the literature. Other potential variables, such as management background and ESG scores, were not included primarily based on three considerations: first, equity structure and internal controls are fundamental governance elements subject to mandatory disclosure and uniformly defined for all listed companies, ensuring data comparability; Second, many corporate governance characteristics (such as board structure and audit opinions) are highly correlated with internal control quality; selecting representative indicators maximizes information contribution while managing model complexity; third, to ensure comparability across the entire 2020–2024 sample, the selected indicators must have consistent disclosure standards throughout the study period; some emerging indicators have insufficient early coverage, and their inclusion would result in sample loss or introduce interpolation bias.

This study selects 23 financial indicators across five dimensions-solvency, operational efficiency, profitability, growth potential, and cash flow-and incorporates two non-financial indicators: the largest shareholder’s ownership ratio and the presence of internal control deficiencies. Together, these form a 25-dimensional feature space to comprehensively characterize the multidimensional and nonlinear nature of financial risk in A-share listed companies. However, the high-dimensional data itself introduces multiple issues of multicollinearity and computational complexity, necessitating effective feature reduction and fusion.

4.4. Feature Fusion Based on T-SNE

In the above paper, 23 financial indicators as well as two non-financial indicators are selected to form a 25-dimensional feature space. However, high-dimensional data suffers from multicollinearity issues, which not only prolongs model training time but also reduces predictive accuracy. To address the high-dimensionality and nonlinearity of financial risk indicators for A-share listed companies, this study abandons traditional methods based on linear assumptions, such as principal component analysis, and innovatively employs the T-SNE algorithm for dimensionality reduction.

T-SNE has inherent advantages in handling nonlinear relationships. The correlations among various financial risk indicators often exhibit complex nonlinear patterns, whereas linear methods such as principal component analysis (PCA), which reduce dimensionality by preserving global variance, struggle to capture these structures effectively. T-SNE, however, learns the intrinsic manifold of the data by optimizing the conditional probability similarity between high-dimensional and low-dimensional spaces. Since it does not require linear assumptions, it can more accurately reveal the underlying nonlinear clusters and risk patterns within high-dimensional financial data. Second, regarding the optimization of classification objectives, T-SNE’s strategy is more direct and effective. Compared to other nonlinear manifold learning algorithms (such as Local Linear Embedding, LLE), T-SNE’s core optimization objective is to strongly emphasize the preservation of the data’s local neighborhood structure—that is, to ensure that similar sample points in the high-dimensional space are closely adjacent to one another in the low-dimensional space. This is crucial for the financial early warning classification task, as it directly aggregates samples within the same risk category into tight clusters in the low-dimensional space, thereby significantly optimizing the decision boundary of subsequent classifiers. Furthermore, T-SNE’s exceptional visualization capabilities provide an intuitive basis for this study to objectively determine the optimal dimension for dimensionality reduction.

T-SNE feature fusion was performed across 25 indicators with parameters set to a confusion rate of 50% and 1000 iterations. To scientifically determine the optimal dimension (d) after dimensionality reduction and avoid arbitrary subjective selection, this study adopted an objective evaluation method based on minimizing reconstruction error. The core principle is that an ideal low-dimensional embedding should maximize the preservation of structural information from the original high-dimensional data. Its effectiveness can be quantified by calculating the discrepancy between the low-dimensional representation and the high-dimensional data (i.e., reconstruction error). Specifically, we computed a weighted composite score of Trustworthiness and Continuity between the T-SNE-reduced features and the original data across different target dimensions (d ranging from 2 to 10). Trustworthiness measures the extent to which points that are close in the low-dimensional space also remain close in the original high-dimensional space. Continuity, conversely, measures the extent to which points that are close in the original space maintain proximity in the low-dimensional space. Together, they assess the quality of dimensionality reduction. The calculation can be briefly expressed as:

S c o r e (d) = α * T r u s t w o r t h i n e s s (d) + β * C o n t i n u i t y (d)

(12)

Here, α and β are weighting coefficients, typically set to 0.5 each to achieve balance. We plotted the composite score curves for different dimensions d and used the Elbow Method to determine the optimal dimension. This method identifies the “inflection point” where the rate of score growth undergoes a significant change; beyond this point, the information gain (marginal benefit) from increasing the dimension declines sharply. We calculated the Kullback–Leibler divergence between the original distribution and the dimension-reduced distribution, and visually inspected the cluster separation of ST and non-ST samples in the two-dimensional projection.

Applying this method, we obtained the curve shown in Figure 3. It can be observed that a distinct “elbow” appears when d = 7, indicating this dimension achieves the optimal balance between information retention and model complexity. Choosing d > 7 may slightly improve scores but unnecessarily increases model complexity and computational burden. Therefore, selecting seven features as inputs for the BP neural network represents an objective, data-driven optimal solution.

The results show that the samples form two well-separated clusters in two-dimensional space that are significantly spaced apart while partially overlapping Figure 4, suggesting the presence of potential subgroup structure. The sample category labels better match the cluster distribution. This distribution pattern indicates that in the original high-dimensional feature space, the intrinsic feature structures of the two sample classes exhibit systematic differences, which the t-SNE algorithm effectively captures and visualizes. To prevent data leakage and ensure the model’s generalization ability, the T-SNE dimension reduction process is strictly conducted within a training-testing separation framework. The specific workflow is as follows: First, the data is split. Using stratified sampling, the original dataset is divided into a training set and a test set. Next, only the training set is fitted; the T-SNE model is trained exclusively on the training set data. This step is entirely based on the distribution characteristics of the training data, learning the nonlinear manifold structure and mapping function from a 25-dimensional space to a seven-dimensional space. Finally, the dataset was simultaneously transformed, and the learned mapping function was applied to both the training and test sets to map them into a seven-dimensional feature space.

4.5. Early Warning Study on Financial Risks

The study employed a stratified random allocation strategy to distribute data across a sample of 420 listed companies. Considering the imbalance in the actual market between ST companies and non-ST companies among A-share listed firms (ST companies being far fewer than non-ST companies), the overall sample was divided into a training set (n = 294) and a test set (n = 126) at a ratio of 7:3. This division method mitigates potential model bias from class imbalance while effectively avoiding sampling bias caused by industry cyclicality and geographical distribution.

To ensure a comprehensive and robust evaluation of the EBWO-BP model’s performance, this study employs ROC curves, accuracy, and confusion matrices as evaluation metrics. The receiver operating characteristic (ROC) curve and its corresponding area under the curve (AUC) value measure the model’s overall discrimination capability across all classification thresholds, which is particularly crucial in the A-share market where the number of ST and non-ST companies is imbalanced. The ROC curve visualizes the trade-off between true positive rate and false positive rate, revealing the model’s sensitivity and specificity. Accuracy serves as an intuitive metric reflecting the proportion of correct predictions, while the confusion matrix provides a detailed breakdown of classification results to validate precision and recall for each category. Collectively, these metrics compensate for the limitations of single-metric evaluations. Subsequently, the constructed EBWO-BP model is trained and tested, and the test results shown in Figure 5.

When the Area Under the ROC Curve (AUC) approaches 1, it indicates that the model’s ability to distinguish between positive and negative samples is nearing perfection, with significantly enhanced discriminative efficacy. In this study, the model achieved an AUC of 0.83, substantially higher than the 0.5 benchmark for random classification. This directly confirms that the constructed financial crisis early warning model possesses significant discriminative power. Furthermore, the high AUC value of 0.83 not only signifies the model’s overall excellent ranking capability but also reflects an ideal balance between recall and false positive rate. This equilibrium is crucial when evaluating a model’s practicality in handling early warning tasks with highly imbalanced positive and negative sample distributions. This outcome not only statistically validates the model’s effectiveness but also demonstrates its robust discriminative performance in addressing category-imbalanced financial risk scenarios.

The EBWO-BP financial early warning model demonstrates excellent performance in an empirical study of financial early warning for 420 listed companies in the A-share market during 2022–2024. The empirical data show that the classification accuracy of the model reaches 86.73% in the training set, and the prediction efficacy of the model maintains a high of 86.51% on the test set Figure 6, with a weak decay of only 0.22 percentage points, which strongly verifies its overfitting suppression ability Table 4. The model successfully builds a cross-cycle risk identification framework by deeply exploring the financial and non-financial differences between normal (non-ST) and risk-marked (ST) enterprises.

5. Comparison of EBWO-BP Neural Network Financial Early Warning Models

5.1. Comparison of Homogeneous Classification Models

In order to systematically evaluate the predictive efficacy advantage of the EBWO-BP model, this study adopts a controlled experimental design, selecting a homogeneous dataset and a standardized preprocessing process, and comparing it with GA-BP neural networks [38], PSO-BP neural networks [39], and BP neural networks, which are typical deep learning architectures in homogeneous classification models. The control variable method is used to ensure that all models remain identical in feature engineering and data partitioning; all baseline models underwent a rigorous hyperparameter tuning process. GridSearchCV was used to perform five-fold hierarchical cross-validation on the training set to automatically determine the optimal configuration. The search space for shared parameters remained consistent across all models, and architecture-specific parameters were also tuned within equivalent ranges. The comparison results shown in Figure 7.

A comprehensive comparison of Table 5 reveals significant performance disparities in multi-model experiments conducted on a unified financial dataset. While all four early warning models achieved training set classification accuracies exceeding the 80% benchmark, the enhanced EBWO-BP fusion model demonstrated a clear lead with an 86.73% recognition rate. This advantage is particularly evident in the comparison of training (82.65%) and testing (80.16%) accuracy for the GA-BP model, where the EBWO-BP model achieves a 4.08 percentage-point performance improvement over the former. Notably, the comparative analysis of test-set prediction performance further underscores the value of algorithmic innovation. The PSO-BP model and the baseline BP model exhibit test set accuracies of 81.75% and 78.57%, respectively—both significantly lower than their training set levels, indicating varying degrees of generalization capability degradation. Particularly for the baseline BP model, its test-set performance dropped by 5.44 percentage points relative to the training set, indicating substantial prediction volatility. Against this backdrop, the EBWO-BP model consistently leads with a robust performance of 86.51%, demonstrating highly consistent performance across training and test sets. This fully validates the model’s exceptional generalization capability and reliability.

The Genetic Algorithm Optimization model (GA-BP) and the Particle Swarm Optimization model (PSO-BP) show a high degree of consistency in the test environment, implying that there is a commonality of convergence between the parameter optimization paths of the two meta-heuristic algorithms in the context of the prevailing financial metrics system. On the contrary, the EBWO-BP model makes a breakthrough in the extraction of time-series features from financial data by introducing an adaptive weight adjustment mechanism and shows significantly better discriminative sensitivity than similar models. This validates that the EBWO algorithm effectively resolves the local optimization and generalization issues of BP networks, thereby enhancing the performance of corporate financial risk early warning systems. In contrast, the optimization capabilities of GA-BP and PSO-BP prove insufficient, resulting in inferior performance. The EBWO-BP model not only achieves breakthrough technological performance but also captures the multidimensional characteristics of financial risk in China’s capital market.

5.2. Comparison of Heterogeneous Classification Models

In order to systematically assess the predictive efficacy advantage of the EBWO-BP model, this study adopts a controlled experimental design with selected homogenous datasets and standardized preprocessing processes for side-by-side comparisons with typical deep learning architectures in heterogeneous classification models. Specifically, Convolutional Neural Network (CNN) [40] and Long Short-Term Memory Network (LSTM) [41] are used as the benchmark models, and the control variable method is used to ensure that all the models remain identical in terms of feature engineering and data partitioning. All baseline models underwent a rigorous hyperparameter tuning process. GridSearchCV was used to perform five-fold hierarchical cross-validation on the training set to automatically determine the optimal configuration. The search space for shared parameters remained consistent across all models, and architecture-specific parameters were also tuned within equivalent ranges. The comparison results shown in Figure 8.

As shown in Table 6, in the comparative experiments of heterogeneous models, the EBWO-BP early warning model significantly outperformed models such as CNN, LSTM, XGBoost, and Transformer, achieving a training set accuracy of 86.73% and a test set accuracy of 86.51%, with its test set accuracy leading by an average of approximately 6 percentage points. The results indicate that when processing such static financial risk features that have undergone T-SNE dimensionality reduction, the EBWO-BP model demonstrates superior applicability and predictive accuracy compared to classical deep learning architectures. This technical feature makes it a novel tool for monitoring the financial risks of A-share listed companies, especially suitable for handling financial indicator systems with high dimensionality and multiple covariances, and provides theoretical support and methodological innovation for regulators to build a dynamic early warning mechanism.

5.3. Incremental Information Test for Non-Financial Indicators

To scientifically evaluate the contribution of non-financial indicators (X24, X25) to early warning effectiveness, this study designed a set of control experiments. While maintaining identical data preprocessing, T-SNE dimensionality reduction parameters, and EBWO-BP model structure, a comparison model was constructed using only 23 financial indicators (X1–X23) for T-SNE dimensionality reduction and training. The experimental results shown in Figure 9.

The results clearly show in Table 7, prior to incorporating non-financial indicators, the training set accuracy of the financial risk warning model was 82.50%, which was 4.23 percentage points lower than the accuracy after introducing non-financial indicators. The test set accuracy was 80.27%, which was 6.24 percentage points lower than the accuracy after introducing non-financial indicators. The improvement in both training and testing set accuracy indicates that non-financial indicators are not redundant information. They complement financial indicators by effectively capturing soft risks such as corporate governance and internal controls, providing critical incremental information to the model. This significantly enhances the overall effectiveness of the financial early warning model.

5.4. Statistical Significance Test for Differences in Model Performance

To scientifically verify whether the performance advantage of the EBWO-BP model over the benchmark models is statistically significant, and to avoid conclusions being skewed by the randomness of a single experiment, this study conducted McNemar’s test on the prediction results from the same test set. This test is suitable for comparing the performance of two classification models on the same test set, with the null hypothesis being that there is no difference in performance between the two models. By analyzing samples where the two models’ predictions differ, it is possible to determine whether the performance difference exceeds random fluctuation. Based on the prediction results from the aforementioned test set (n = 126), the statistical test results are shown in the table (Table 8) below:

The McNemar test results confirm the superior discriminative capability of EBWO-BP over most benchmark models Table 8. Specifically, the model exhibits statistically significant advantages over PSO-BP (p = 0.048), BP (p = 0.021), CNN (p = 0.048), and LSTM (p = 0.048). Regarding the comparison with GA-BP, while the p-value (0.066) did not strictly exceed the 0.05 threshold, it suggests a marginal improvement. This indicates that although the evidence for outperforming GA-BP is not statistically definitive, the consistent numerical advantage in accuracy and the significant gaps against other baselines collectively underscore the effectiveness of the proposed EBWO optimization mechanism.

Across all disaggregated pairs, the b-values were systematically and significantly higher than the c-values. This provides conclusive statistical evidence that the EBWO-BP model can more accurately identify samples misclassified by other models; its higher overall accuracy is not coincidental but a direct reflection of its superior discriminative capability.

5.5. Cross-Validation Analysis

To rigorously address potential overfitting risks and further validate the generalization ability of the EBWO-BP model, we evaluated the model using five-fold stratified cross-validation. The original dataset contained 420 samples, which were randomly divided into five subsets of equal size. To ensure the representativeness of risk features, we employed a stratified sampling method to maintain a 1:2 ratio of ST to non-ST companies in each subset, consistent with the original dataset. In each iteration, four subsets (accounting for 80% of the data) were used for training, while the remaining subset (20%) was used for testing. This process was repeated five times until each sample had been included exactly once as part of the test set. The final performance metrics were calculated as the average of the results from the five iterations.

As shown in Table 9, the EBWO-BP model achieved a mean cross-validation accuracy of 86.40%, which is highly consistent with the 86.51% accuracy obtained from the single train-test split in the previous sections. This consistency confirms that the model’s performance is not contingent on a specific data partition. Furthermore, the EBWO-BP model recorded a standard deviation of ±0.25%, which is the lowest among all compared models. This minimal variance indicates exceptional stability across different subsets of data, effectively alleviating concerns regarding overfitting. In contrast, the baseline BP neural network exhibited a higher variance and lower mean accuracy, suggesting that it is more susceptible to fluctuations in the training data due to its sensitivity to initial parameters.

6. Stability Test Under Varying Sample Scales

Based on the analysis of the model architecture in the previous study, the EBWO-BP financial early warning system demonstrates significant predictive efficacy. Compared with homogeneous and heterogeneous classification models, the model has significant advantages in feature nonlinear mapping and risk gradient identification. To assess the model’s sensitivity to sample size and class imbalance, this study conducted stability tests using an expanded dataset. The objective was to determine whether the model maintains its performance when the number of non-ST samples increases within the same time frame.

In terms of the sample construction strategy, ST-type listed companies in China’s A-share market from 2022 to 2024 are selected as the risk sample (n = 140), and matched non-ST companies are used as the control sample. In order to achieve the full expression of risk exposure characteristics, the original ST:non-ST matching ratio is expanded from 1:2 to 1:4, and the non-ST sample size is increased to 560 groups, resulting in a balanced dataset containing 700 observations. The data time window was set to 2020–2022 to ensure that the financial indicators have a three-year risk warning front. In particular, it should be noted that by expanding the sample size of non-ST companies to 560 groups, the potential interference of category imbalance on model training is effectively mitigated, and the test results shown in Figure 10 and Figure 11.

As shown in Table 10, when the sample size of the EBWO-BP financial early warning model is expanded from 420 to 700, the model performance presents a significant improvement. Specifically, the accuracy of the training set increases from 86.73% to 89.39%, while the accuracy of the test set increases from 86.51% to 89.05%, with a 2.6% increase in the overall accuracy. This empirical result shows that the increase in sample size is positively associated with the improvement in model prediction performance, and the accuracy improvement trend remains synchronous in both the training and test sets, which effectively avoids the risk of model overfitting to specific data samples.

Further analysis reveals that the model maintains a stable ability to discriminate financial status across different data sizes, and the generality of its predictive validity is confirmed by cross-sample validation. This property reflects the good generalization ability of the model parameter settings, rather than only generating adaptations to specific data distributions. Particularly noteworthy is that the sustained improvement of 2.54% in the accuracy of the test set forms a reasonable gradient with the optimization of 2.66% in the training set, confirming that the model improvement stems from effective feature extraction rather than purely memory effects.

Based on the above experimental results, it can be argued that the EBWO-BP model exhibits significant scale effects and robustness characteristics in the field of financial early warning. The performance improvement does not depend on the local features of a specific dataset, but is achieved by enhancing the learning depth of the model on the financial risk characterization factors. This feature makes the model valuable for practical application in financial risk prediction in complex business environments. It provides a new methodological support for financial institutions and regulators to build dynamic early warning systems. The study not only verifies the validity of the model architecture but also establishes a quantifiable empirical framework for the stability assessment of financial early warning models.

7. Research Findings and Outlook

7.1. Research Findings

This study systematically explored financial early warning for A-share listed companies by constructing an EBWO-BP hybrid model. Empirical results demonstrate that through algorithmic fusion and feature engineering innovation, this model exhibits significant advantages in prediction accuracy, robustness, and generalization capability, specifically as follows:

First, the construction of a multidimensional indicator system is proven to be a key foundation for enhancing early warning effectiveness. Experimental results show that incorporating two non-financial indicators—“largest shareholder ownership ratio” and “internal control deficiencies”—significantly increased the model’s test set accuracy from 80.27% to 86.51%. This fully demonstrates the irreplaceable incremental informational value of non-financial indicators in capturing governance risks and potential crisis signals beyond financial data, thereby addressing the static limitations of traditional single-financial-indicator models.

Second, the innovative application of the T-SNE nonlinear dimensionality reduction method effectively addressed data noise and computational complexity issues in high-dimensional feature spaces. By objectively determining the optimal seven feature dimensions using the elbow rule, the approach maximally preserves the original data structure while establishing an efficient data foundation for subsequent neural network training. Visualization of the dimensionality reduction effect further demonstrates that risk samples and normal samples exhibit excellent clustering separability in the low-dimensional space, providing a methodological innovation for nonlinear risk modeling.

Third, the synergistic optimization mechanism between the EBWO algorithm and the BP neural network is central to achieving breakthrough model performance. The data indicate that the EBWO-BP model achieves an accuracy of 86.51% on the test set, with an Area Under the Curve (AUC) of 0.83. This significantly outperforms optimization algorithm models such as GA-BP and PSO-BP, as well as deep learning models like CNN and LSTM. This validates the EBWO algorithm’s exceptional capability in overcoming local minima in BP neural networks and enhancing convergence efficiency, achieving performance gains through algorithmic fusion.

Fourth, the model’s robustness and generalization capability passed rigorous testing. Robustness tests indicate that when the sample size was expanded from 420 to 700 groups, the overall model accuracy further increased to 89.22%, with minimal performance degradation between the training and testing sets. This result strongly demonstrates the robust adaptability and reliability of the EBWO-BP model across varying data distributions, providing empirical evidence for its application in China’s capital markets, characterized by diverse industry distributions and complex financial structures.

7.2. Research Limitations

Despite achieving anticipated outcomes, this study retains limitations in the following aspects:

First, potential overfitting risk, although T-SNE dimensionality reduction and cross-validation techniques were employed, the “black-box” nature of neural networks inherently carries the risk of overfitting to specific macroeconomic conditions during the 2020–2024 window. Therefore, the model’s stability in the face of sudden shifts in market dynamics or long-term time drifts requires further validation.

Second, sample size constraint, although the dataset of 420 samples meets the standards of mainstream literature, it remains relatively small compared to the vast scale of China’s A-share market (which comprises over 5000 companies). This limited scale may restrict the model’s ability to capture heterogeneity across industries, potentially limiting its generalizability when applied to specific industries outside the training distribution.

Third, the breadth of non-financial indicators. Coverage primarily focused on equity structure and internal controls. Future work could incorporate broader non-financial signals—such as management background, ESG performance, and media sentiment—to construct more comprehensive risk profiles.

7.3. Research Outlook

Building upon the findings and limitations of this study, future work may deepen in the following directions:

First, exploration of dynamic early warning mechanisms. Introducing time series models to transition from static cross-sectional prediction to dynamic continuous tracking, enhancing the timeliness of warnings.

Second, multimodal data fusion. Integrating unstructured data such as annual report texts, analyst reports, and online sentiment, this approach utilizes natural language processing to uncover deeper risk signals, constructing a more sophisticated hybrid early warning system.

Third, the current comparison does not include certain state-of-the-art models specifically designed for tabular data, such as gradient boosters (e.g., CatBoost, LightGBM) and TabNet. These models have demonstrated exceptional capabilities in handling structured financial data. Therefore, extending the comparative analysis to include these advanced tabular models represents an important direction for future work aimed at further evaluating EBWO-BP’s performance.

Fourth, although this study has demonstrated the high accuracy of the EBWO-BP model, the “black-box” nature of neural networks limits their interpretability in high-risk financial applications. Future research will integrate explainable AI techniques to quantitatively decompose the model’s outputs and reveal the marginal contributions of each financial and non-financial indicator.

Author Contributions

Conceptualization, Y.C. and Q.Y.; Methodology, G.L.; Resources, Q.Y. and C.Y.; Data curation, G.L.; Writing—original draft, Y.C. and G.L.; Writing—review and editing, Q.Y.; Funding acquisition, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the [Anhui Provincial Philosophy and Social Sciences Planning Projects] grant number [AHSKQ2023D091] and the [Hefei University Graduate Student Research and Practical Innovation Project] grant number [2025Yxscx11].

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors have no competing interests to declare that are relevant to the content of this article.

References

Furceri, D.; Hannan, S.A.; Ostry, J.D.; Rose, A.K. Are tariffs bad for growth? Yes, say five decades of data from 150 countries. J. Policy Model. 2020, 42, 850–859. [Google Scholar] [CrossRef] [PubMed]
Chin, T.; Li, Z.; Huang, L.; Li, X. How artificial intelligence promotes new quality productive forces of firms: A dynamic capability view. Technol. Forecast. Soc. Chang. 2025, 216, 124128. [Google Scholar] [CrossRef]
Pavlicko, M.; Durica, M.; Mazanec, J. Ensemble Model of the Financial Distress Prediction in Visegrad Group Countries. Mathematics 2021, 9, 1886. [Google Scholar] [CrossRef]
Mselmi, N.; Lahiani, A.; Hamza, T. Financial distress prediction: The case of French small and medium-sized firms. Int. Rev. Financ. Anal. 2017, 50, 67–80. [Google Scholar] [CrossRef]
Duan, Y.Q.; Ren, A.Z. Time-Series Modeling for Corporate Financial Crisis Prediction: Evidence from Recurrent Neural Networks. Mathematics 2026, 14, 657. [Google Scholar] [CrossRef]
Chen, G.; Firth, M.; Gao, D.; Rui, O. Ownership structure, corporate governance, and fraud: Evidence from China. J. Corp. Financ. 2006, 12, 424–448. [Google Scholar] [CrossRef]
Liao, L.; Luo, L.; Tang, Q. Gender diversity, board independence, environmental committee and greenhouse gas disclosure. Br. Account. Rev. 2015, 47, 409–424. [Google Scholar] [CrossRef]
Altman, E.I.; Iwanicz-Drozdowska, M.; Laitinen, E.K.; Suvas, A. Financial Distress Prediction in an International Context: A Review and Empirical Analysis of Altman’s Z-Score Model. J. Int. Financ. Manag. Account. 2017, 28, 131–171. [Google Scholar]
Hillegeist, S.; Keating, E.; Cram, D.; Lundstedt, K. Assessing the probability of bankruptcy. Rev. Account. Stud. 2004, 9, 5–34. [Google Scholar] [CrossRef]
Sang, B. Application of genetic algorithm and BP neural network in supply chain finance under information sharing. J. Comput. Appl. Math. 2021, 384, 113170. [Google Scholar] [CrossRef]
Lee, T.K.; Cho, J.H.; Kwon, D.S.; Sohn, S.Y. Global stock market investment strategies based on financial network indicators using machine learning techniques. Expert Syst. Appl. 2019, 117, 228–242. [Google Scholar] [CrossRef]
Cao, L. AI in Finance: Challenges, Techniques, and Opportunities. ACM Comput. Surv. 2023, 55, 1–38. [Google Scholar] [CrossRef]
Li, X.T.; Wang, J.; Yang, C.Y. Risk prediction in financial management of listed companies based on optimized BP neural network under digital economy. Neural Comput. Appl. 2023, 35, 2045–2058. [Google Scholar]
Gülmez, B. Stock price prediction with optimized deep LSTM network with artificial rabbits optimization algorithm. Expert Syst. Appl. 2023, 227, 120346. [Google Scholar] [CrossRef]
Dumitrescu, E.; Hué, S.; Hurlin, C.; Tokpavi, S. Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. Eur. J. Oper. Res. 2022, 297, 1178–1192. [Google Scholar] [CrossRef]
Zhou, W.W.; Chen, M.Y.; Yang, Z.L.; Song, X.B. Real estate risk measurement and early warning based on PSO-SVM. Socio-Econ. Plan. Sci. 2021, 77, 101001. [Google Scholar] [CrossRef]
Song, Y.H.; Sang, X.Y.; Wang, Z.; Xu, H.Q. Risk Assessment of Supplier R&D Investment Based on Improved BP Neural Network. Mathematics 2025, 13, 2094. [Google Scholar] [CrossRef]
Shi, Y.; He, T.; Zhong, J.; Mei, X.; Li, Y.; Li, M.; Zhang, W.; Ji, D.; Su, L.; Lu, T.; et al. Classification and rapid non-destructive quality evaluation of different processed products of Cyperus rotundus-based on near-infrared spectroscopy combined with deep learning. Talanta 2024, 268, 125266. [Google Scholar] [CrossRef] [PubMed]
Long, X.; Mao, M.H.; Su, T.X.; Su, Y.T.; Tian, M.K. Machine learning method to predict dynamic compressive response of concrete-like material at high strain rates. Def. Technol. 2023, 23, 100–111. [Google Scholar] [CrossRef]
Lin, Z.; Fan, Y.; Tan, J.; Li, Z.; Yang, P.; Wang, H.; Duan, W. Tool wear prediction based on XGBoost feature selection combined with PSO-BP network. Sci. Rep. 2025, 159, 3096. [Google Scholar]
He, Q.; Li, P. Research on quality and safety risk identification of import and export toys based on the WOA-BP model. Sci. Rep. 2025, 15, 29882. [Google Scholar] [CrossRef] [PubMed]
Zhang, H.; Yang, R.; Zhong, M.; Lei, X.; Yin, J.; Yang, M.; Tu, X.; Li, F.; Xing, X.; Zhou, C. An improved BP neural network algorithm for accurate prediction of impact defects depth in civil aircraft composite materials. Infrared Phys. Technol. 2026, 155, 106475. [Google Scholar] [CrossRef]
Dou, B.; Guo, S.; Chang, X.; Wang, Y. Corporate digital transformation and labor structure upgrading. Int. Rev. Financ. Anal. 2023, 90, 102904. [Google Scholar] [CrossRef]
Qin, L.; Aziz, G.; Hussan, M.; Sarwar, A. Empirical evidence of fintech and green environment: Using the green finance as a mediating variable. Int. Rev. Econ. Financ. 2024, 89, 33–49. [Google Scholar] [CrossRef]
Li, T.; Yue, X.-G.; Qin, M.; Norena-Chavez, D. Towards Paris Climate Agreement goals: The essential role of green finance and green technology. Energy Econ. 2024, 129, 107273. [Google Scholar]
Luo, Q.; Ma, F.; Wang, J.; Wu, Y. Changing determinant driver and oil volatility forecasting: A comprehensive analysis. Energy Econ. 2024, 129, 107187. [Google Scholar]
Luan, B.; Zou, H.; Huang, J. Digital divide and household energy poverty in China. Energy Econ. 2023, 119, 106543. [Google Scholar] [CrossRef]
Delcea, C.; Oprea, S.; Dima, A.; Domenteanu, A.; Bara, A.; Cotfas, L. Energy communities: Insights from scientific publications. Oecon. Copernic. 2024, 15, 1101–1155. [Google Scholar] [CrossRef]
Kumar, A.; Chakraborty, S.; Bala, P.K. Text mining approach to explore determinants of grocery mobile app satisfaction using online customer reviews. J. Retail. Consum. Serv. 2023, 73, 103363. [Google Scholar] [CrossRef]
Policar, P.; Strazar, M.; Zupan, B. openTSNE: A Modular Python Library for t-SNE Dimensionality Reduction and Embedding. J. Stat. Softw. 2024, 109, 1–30. [Google Scholar] [CrossRef]
Wang, G.; Zhao, B.; Wu, B.; Zhang, C.; Liu, W. Intelligent prediction of slope stability based on visual exploratory data analysis of 77 in situ cases. Int. J. Min. Sci. Technol. 2023, 33, 47–59. [Google Scholar]
Zhong, C.; Li, G.; Meng, Z. Beluga whale optimization: A novel nature-inspired metaheuristic algorithm. Knowl.-Based Syst. 2022, 251, 109215. [Google Scholar] [CrossRef]
Zhao, H. Risk Management of Supply Chain Green Finance Based on Sustainable Ecological Environment. Sustainability 2023, 15, 7707. [Google Scholar] [CrossRef]
Zhu, F.-J.; Zhou, L.-J.; Zhou, M.; Pei, F. Financial Distress Prediction: A Novel Data Segmentation Research on Chinese Listed Companies. Technol. Econ. Dev. Econ. 2021, 27, 1413–1446. [Google Scholar] [CrossRef]
Sehgal, S.; Mishra, R.K.; Deisting, F.; Vashisht, R. On the determinants and prediction of corporate financial distress in India. Manag. Financ. 2021, 47, 1428–1447. [Google Scholar] [CrossRef]
Zhang, Z.; Yu, J.; Li, S. Analysis of the Impact of Non-Financial Elements on Financial Risks in Organizations from the Perspective of Various Enterprise Types. Highl. Bus. Econ. Manag. 2024, 45, 714–724. [Google Scholar] [CrossRef]
Balasubramanian, S.A.; Radhakrishna, G.S.; Sridevi, P.; Natarajan, T. Modeling corporate financial distress using financial and non-financial variables. Int. J. Law Manag. 2019, 61, 457–484. [Google Scholar] [CrossRef]
Qin, X.; He, Q.; Zhang, X.; Yang, X. Data Value Assessment in Digital Economy Based on Backpropagation Neural Network Optimized by Genetic Algorithm. Symmetry 2025, 17, 761. [Google Scholar] [CrossRef]
Zhou, H.; Sun, G.; Fu, S.; Liu, J.; Zhou, X.; Zhou, J. A Big Data Mining Approach of PSO-Based BP Neural Network for Financial Risk Management with IoT. IEEE Access 2019, 7, 154035–154043. [Google Scholar] [CrossRef]
Elhoseny, M.; Metawa, N.; Sztano, G.; El-Hasnony, I.M. Deep Learning-Based Model for Financial Distress Prediction. Ann. Oper. Res. 2022, 345, 885–907. [Google Scholar] [CrossRef]
Pellegrino, M.; Lombardo, G.; Adosoglou, G.; Cagnoni, S.; Pardalos, P.M.; Poggi, A. A Multi-Head LSTM Architecture for Bankruptcy Prediction with Time Series Accounting Data. Future Internet 2024, 16, 79. [Google Scholar] [CrossRef]

Figure 1. Structure of BP neural network.

Figure 2. EBWO-BP flowchart.

Figure 3. Dimension selection for the elbow rule.

Figure 4. T-SNE downscaling diagram.

Figure 5. ROC graph.

Figure 6. Performance evaluation of the EBWBO-BP model on training and test sets. (a) Prediction Performance on the Training Set. (b) Prediction Performance on the Test Set. (c) Training Set Confusion Matrix. (d) Test Set Confusion Matrix.

Figure 7. Homogeneous classification models.

Figure 8. Heterogeneous classification models.

Figure 9. Comparison results. (a) Training Set Comparison Results. (b) Test Set Comparison Results.

Figure 10. Stability detection training effect.

Figure 11. Stability detection test effect.

Table 1. Core Mechanisms and limitations of hybrid models.

Hybrid Model	Core Mechanism	Limitations
GA-BP	Genetic variation, cross selection	Premature maturation and astringency; decreased efficiency in the later stages of processing; complex parameter adjustment [20]
PSO-BP	Particle Speed Update, Group Collaboration	Prone to falling into local optima; exhibits insufficient convergence accuracy when dealing with high-dimensional complex data [21]
WOA-BP/GWO-BP	Encircle the prey and update its location	The system exhibits weak balance between global exploration and local development, with a relatively slow convergence rate [22]
Standard BWO	The white whales swim and then surface.	The original algorithm still has room for improvement in high-dimensional spaces and may miss the global optimal neighborhood

Table 2. Financial indicators.

Type of Indicator	Indicator Name	Serial Number
solvency	Mobility ratio	X1
	quick ratio	X2
	cash ratio	X3
	gearing	X4
	equity multiplier	X5
	equity ratio	X6
	Long-term debt to working capital ratio	X7
operating ability	Accounts receivable turnover ratio	X8
	Current asset turnover ratio	X9
	Fixed asset turnover	X10
	Total asset turnover	X11
	Shareholders’ equity turnover	X12
profitability	return on assets	X13
	Net interest rate on total assets	X14
	Net interest rate on current assets	X15
	return on net assets	X16
	Return on invested capital	X17
	operating profit margin	X18
Development capacity	Total asset growth rate	X19
Development capacity	Owner’s equity growth rate	X20
cash flow	Net cash content of operating income	X21
	Full cash recovery rate	X22
	operating index	X23

Table 3. Non-financial indicators.

Type of Indicator	Indicator Name	Serial Number
Shareholding structure	Shareholding ratio of the largest shareholder	X24
Internal control	Whether there are deficiencies in internal controls	X25

Table 4. Classification accuracy for different categories (Class I: ST companies; Class II: Non-ST companies).

Modelling	Class I Accuracy	Class II Accuracy
training set	86.4%	86.9%
test set	83.8%	87.6%

Table 5. Comparison table of homogeneous models.

Mdelling	Training	Test	Recall	AUC	F1
GA-BP	82.65%	80.16%	78.50%	81.00%	79.30%
PSO-BP	84.69%	81.75%	80.20%	82.00%	80.20%
BP	84.01%	78.57%	76.80%	79.00%	76.80%
EBWO-BP	86.73%	86.51%	85.20%	83.00%	85.20%

Table 6. Comparison table of heterogeneous models.

Modelling	Training	Test	Recall	AUC	F1
CNN	85.71%	80.16%	79.20%	82.00%	79.60%
LSTM	84.35%	80.95%	79.80%	81.00%	80.30%
XGBoost	84.65%	81.03%	80.10%	81.00%	80.50%
Transformer	85.22%	81.31%	80.50%	82.00%	80.90%
EBWO-BP	86.73%	86.51%	85.20%	83.00%	85.20%

Table 7. Incremental information verification.

Input Features	Training Set Accuracy	Test Set Accuracy
Financial indicators only (23 dimensions)	82.50%	80.27%
Financial + Non-Financial Metrics (25 Dimensions)	86.73%	86.51%
Enhancement Effect	4.23%	6.24%

Table 8. McNemar’s test.

EBWO-BP	b + c	b	c	p-Value	Significantv (α = 0.05)
GA-BP	22	15	7	0.066	No
PSO-BP	24	16	8	0.048	Yes
BP	30	21	9	0.021	Yes
CNN	26	17	9	0.048	Yes
LSTM	24	16	8	0.048	Yes

(Note: Here, b + c represents “Differences in quantity”; b represents “Only EBWO-BP is correct”; and c represents “Only the model is correct.”).

Table 9. Five-Fold Cross-Validation Performance Comparison.

Model	Fold 1	Fold 2	Fold 3	Fold 4	Fold 5	Mea ± Std
GA-BP	79.85	80.5	80.1	79.95	80.3	80.1 ± 0.27
PSO-BP	81.2	81.9	81.5	81.1	81.85	81.5 ± 0.32
BP	78.1	78.95	78.5	78.2	78.8	78.5 ± 0.31
EBWO-BP	86.40	86.75	86.20	86.55	86.10	86.4 ± 0.25

Table 10. Comparison table for stability testing.

Sample Size	Training Set Accuracy	Test Set Accuracy
420	86.73%	86.51%
700	89.39%	89.05%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chu, Y.; Liu, G.; Yu, Q.; Yang, C. Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks. Mathematics 2026, 14, 2261. https://doi.org/10.3390/math14132261

AMA Style

Chu Y, Liu G, Yu Q, Yang C. Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks. Mathematics. 2026; 14(13):2261. https://doi.org/10.3390/math14132261

Chicago/Turabian Style

Chu, Yizhou, Guiyang Liu, Qiuyu Yu, and Chunyan Yang. 2026. "Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks" Mathematics 14, no. 13: 2261. https://doi.org/10.3390/math14132261

APA Style

Chu, Y., Liu, G., Yu, Q., & Yang, C. (2026). Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks. Mathematics, 14(13), 2261. https://doi.org/10.3390/math14132261

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.

Article Menu

Research on Financial Early Warning Models of A-Share Listed Companies Based on EBWO-BP Neural Networks

Abstract

1. Introduction

2. Improvement of the Beluga Optimization Algorithm

2.1. Beluga Optimization Algorithm

2.2. Quasi-Opposite Learning Strategy

2.3. Cyclone Foraging Strategy

3. Construction of EBWO-BP Neural Network Financial Early Warning Model

4. Research Design

4.1. Sample Selection and Data Sources

4.2. Financial Early Warning Indicators

4.3. Non-Financial Early Warning Indicators

4.4. Feature Fusion Based on T-SNE

4.5. Early Warning Study on Financial Risks

5. Comparison of EBWO-BP Neural Network Financial Early Warning Models

5.1. Comparison of Homogeneous Classification Models

5.2. Comparison of Heterogeneous Classification Models

5.3. Incremental Information Test for Non-Financial Indicators

5.4. Statistical Significance Test for Differences in Model Performance

5.5. Cross-Validation Analysis

6. Stability Test Under Varying Sample Scales

7. Research Findings and Outlook

7.1. Research Findings

7.2. Research Limitations

7.3. Research Outlook

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI