1. Introduction
Catastrophic phenomena refer to the sudden shifts in a system’s state that occur in various fields due to strong nonlinearities in the system. Since the occurrence of catastrophe phenomena has a great influence on the development process of items, research on catastrophe phenomena, especially the control and utilization of catastrophe phenomena, is of great significance in controlling the development direction and the final results of items [
1,
2,
3].
Establishing a relatively precise mathematical model is the basis and key to the control and utilization of catastrophe phenomena. Catastrophe theory [
4] is a tool to establish the mathematical model of catastrophe phenomena. In the more than 50 years since the establishment of catastrophe theory, people [
5,
6,
7,
8,
9] have used the modeling method provided by it to establish various mathematical models for a large number of catastrophe phenomena, and the mathematical models have been used to control and utilize the catastrophe phenomena, achieving a lot of results. In the existing research, some inherent defects and congenital problems of the modeling method for catastrophe theory have also attracted people’s attention. One of the problems that has the greatest impact on the application development of the precise modeling of catastrophe phenomena and the modeling methods of catastrophe theory, and that people are most interested in solving, is the following: except for a few special cases [
10,
11,
12], the actual (experimental) catastrophe mathematical models established by using experimental observation data of the catastrophe phenomena are qualitative (analog) models that do not have the ability to perform quantitative prediction, and it is difficult for these models to meet the requirements of precise control and rational utilization of catastrophe phenomena. In fact, this is the “practical efficiency problem” [
13] of catastrophe theory, which greatly limits its application scope and quality. Because of the existence of this problem, the academic community participated in a far-reaching debate on supporting and opposing the catastrophe theory at the beginning of the creation of the catastrophe theory [
14,
15]. In the face of doubt, Thom [
16] openly admitted that “catastrophe theory is fundamentally a qualitative explanatory theory, which itself has no predictive ability” and “in a sense, it can be called analogy theory”. Unfortunately, to this day, this problem has not been resolved.
The root cause of the problem is as follows: when using the modeling method of catastrophe theory, especially the “experimental modeling method” [
17], which is considered by the founder of the catastrophe theory to be the most promising method to establish a catastrophe mathematical model, the corresponding relationship and mapping form between the actual parameters (the actual control and state parameters used to represent the actual catastrophe mathematical model) and the canonical parameters (the canonical control and state parameters used to represent the canonical catastrophe mathematical model) are very complex and have strong nonlinear characteristics. However, the founder of the catastrophe theory did not inform the corresponding relationship and mapping form in advance and did not provide an effective method to determine them. It can only be subjectively set by the modeler based on experience. To deal with the problem conveniently, almost all researchers set the corresponding relationship between the actual parameters and the canonical parameters as a one-to-one correspondence, and they set the mapping form between them as a simple proportional function, linear function [
18], or polynomial function [
19]. For example, the manifold of the cusp-type catastrophe mathematical model is directly written as “
” [
20] or “
” [
21] (
is the canonical state parameter;
and
are the canonical control parameters; and
and
are the proportional coefficients between the actual parameters and the canonical parameters) and so on. The fact that it is difficult to make the artificially determined corresponding relationship and mapping form be completely consistent with the real situation results in unforeseen and, in many cases, large prediction errors in the catastrophe mathematical models that have been established. When the prediction error exceeds a certain range, the ability of the models to make quantitative predictions is lost.
So, how can the complex correspondence and mapping relationship (nonlinear relationship) between the actual parameters and the canonical parameters be expressed and obtained accurately?
As we all know, the machine learning model [
22] established by experimental observation data has a strong ability to characterize and simulate the complex nonlinear relationship between data points. Therefore, it is a natural choice for people to use machine learning to solve the above problem. Zheng et al. [
23] established a machine learning model by using the random forest algorithm and the experimental data of chip bifurcation catastrophe and realized the accurate prediction of chip bifurcation phenomenon. The prediction accuracy, precision, and recall rate of whether the chip bifurcation phenomenon occurs under the condition of given control parameters reached 100%. Li et al. [
24] used an RBF neural network trained with simulation data to simulate the mapping between the canonical control parameters (
,
) and the actual control parameters (Reynolds number
and angle of attack
) to modify the experimental modeling method of catastrophe theory. Using the modified experimental modeling method, the prediction error (compared to the simulation value) of the bifurcation set of the cusp-type catastrophe mathematical model for the airfoil-stall boundary at a low Reynolds number was reduced to less than 1%. Unfortunately, the modified experimental modeling method and related techniques are only used to improve the prediction accuracy of the bifurcation set in the specific catastrophe mathematical model, but not to improve the prediction accuracy of the potential function and the manifold of the catastrophe mathematical model. Daw et al. [
25] used a mixture density neural network to simulate the bimodal (upper and lower leaves of the manifold) of the statistical mathematical model of a cusp-type catastrophe based on statistical catastrophe theory and achieved a successful result. When the machine learning model was experimentally verified using three previously established datasets of the Zeeman catastrophe machine [
26] (a mechanism, as shown in
Figure 1, hereinafter referred to as “ZCM”), it was found that the root mean square errors (loss function values) of the two-component mixture density neural network were 0.10, 0.20, and 0.79, respectively.
So far, there has been no report on the use of machine learning technology to completely solve the above problem in the experimental modeling method.
In this paper, we propose a method of precise experimental modeling of catastrophe phenomena using experimental datasets of a singularity set and manifold, which establishes the quantitative relationship between the experimental catastrophe mathematical model and the canonical catastrophe mathematical model by assuming that the experimental potential function is equal to the canonical potential function, and then we determine the diffeomorphism [
27] represented by a machine learning model that can achieve error-free transformations of the two models. Taking the precise experimental modeling of a cusp-type catastrophe for ZCM as an example, the key points and implementation results of this method are explained. Through programming calculation, it is found that the prediction errors of the potential function, manifold, and bifurcation set of the experimental mathematical model of a cusp-type catastrophe for ZCM are 0.0309%, 0.0650%, and 0.1252%, respectively. This indicates that the established model has the ability to quantitatively predict the catastrophe phenomenon.
2. Establishment of Experimental Mathematical Model of Cusp-Type Catastrophe for ZCM
2.1. Introduction to ZCM
The ZCM (
Figure 1) is an instructional device invented by the British mathematician Zeeman to verify the validity of conclusions related to catastrophe theory [
26]. This device played an irreplaceable role in the development of catastrophe theory and holds significant academic and historical value.
Anyone familiar with the history of catastrophe theory knows that the widespread acceptance, dissemination, and application of the theory owe much to the ZCM. Years ago, Zeeman spent several weeks personally constructing this simple mechanical demonstration model, which allowed him to clearly illustrate the process of catastrophe phenomena, their characteristic signatures, and their objective existence. This demonstration convincingly validated the correctness of catastrophe theory, earning it broad recognition.
Since then, many researchers working on catastrophe theory and its applications have used the ZCM as a primary physical model to evaluate their proposed methods or validate their research findings. It is generally believed that if a method can effectively analyze and handle the ZCM, it largely meets the requirements of catastrophe theory.
Therefore, in this study, we take the establishment of an experimental mathematical model of a cusp-type catastrophe for the ZCM as an example to illustrate the key aspects of the proposed method and its implementation results.
2.2. Establishment of Experimental Dataset
As mentioned above, in order to use machine learning technology to establish the experimental mathematical model of a cusp-type catastrophe for the ZCM, it is necessary to obtain the experimental dataset of the singularity set and the manifold of ZCM.
The accuracy of the ZCM, which can only be manufactured by hand, is hard to ensure, leading to a larger error in the experimental dataset obtained through real test experiments. To enhance modeling accuracy, the experimental dataset required for modeling is obtained by discretizing the singularity set and manifold of the theoretical mathematical model of a cusp-type catastrophe for the ZCM established by previous researchers.
2.2.1. Theoretical Mathematical Model of Cusp-Type Catastrophe for ZCM Established by Previous Researchers
The theoretical mathematical model of a cusp-type catastrophe for the ZCM established by Poston et al. [
17] through theoretical analysis is
where
is the theoretical potential function of the theoretical mathematical model of a cusp-type catastrophe for the ZCM;
is the actual state parameter;
and
are the actual control parameters;
,
, and
are the performance and structural parameters of ZCM;
is the theoretical manifold;
is the theoretical singularity set; and
is the theoretical bifurcation set.
2.2.2. Establishment of Discrete Dataset S1 of Theoretical Singularity Set ST
The value interval () of the actual state parameter is discretized in an equal interval manner. If the discrete interval is , then 361 values are obtained, denoted by ().
By substituting into in Equation (1), the binary system of equations about is obtained. By solving the binary system of equations numerically, the coordinates of discrete control points corresponding to on the theoretical bifurcation set are obtained.
is the discrete dataset
of the theoretical singularity set
, as shown in
Table 1.
2.2.3. Establishment of Discrete Dataset M1 of Theoretical Manifold MT
The control plane is discretized in a given region () with rectangular grids with side lengths of and , respectively. The coordinates of the discrete grid nodes are substituted into the theoretical manifold to obtain a univariate equation about . In this way, the phase point coordinates corresponding to can be obtained by solving the univariate equation.
is the discrete dataset
of the theoretical manifold
, as shown in
Table 2.
2.3. Modeling Assumptions
According to the catastrophe theory, the canonical mathematical model of a cusp-type catastrophe is
where
is the canonical potential function of the canonical mathematical model of a cusp-type catastrophe,
is the canonical state parameter,
and
are the canonical control parameters,
is an integral constant term independent of the canonical state parameter
,
is the canonical manifold,
is the canonical singularity set, and
is the canonical bifurcation set.
It is important to note the following: (1) The canonical mathematical model is the fundamental mathematical model in catastrophe theory. It has a standardized form and comprehensive content, serving as a crucial foundation for constructing experimental catastrophe mathematical models for specific phenomena. (2) Compared to the canonical catastrophe model, an experimental mathematical model of catastrophe is derived from experimental data and describes the catastrophic behavior of a specific phenomenon. Any experimental mathematical model of catastrophe expressed in terms of actual parameters can be transformed into a canonical catastrophe model through a diffeomorphism between actual and theoretical parameters.
As mentioned above, in order to use the above experimental dataset and the canonical mathematical model of a cusp-type catastrophe to establish the experimental mathematical model of a cusp-type catastrophe for the ZCM, it is necessary to first establish the quantitative correspondence between the experimental mathematical model of a cusp-type catastrophe for the ZCM and the canonical mathematical model of a cusp-type catastrophe.
Considering that the experimental mathematical model of a cusp-type catastrophe for the ZCM is obtained by the error-free diffeomorphism transformation of the canonical mathematical model of a cusp-type catastrophe, the potential function values at the corresponding phase points before and after the transformation should be equal. The following assumption is made:
Assumption 1. The potential function of the experimental mathematical model of a cusp-type catastrophe for the ZCM is equal to the canonical potential function .
Taking into account that the canonical potential function is the universal unfolding 2 of the singularity, which is the simplest form of the versal unfolding 2 of the singularity, and in order to simplify the subsequent calculation process, the following assumption is made:
Assumption 2. The origin of the singularity set of the experimental mathematical model of a cusp-type catastrophe for the ZCM coincides with the origin of the singularity set of the canonical mathematical model of a cusp-type catastrophe, so the potential function values corresponding to these two origins are equal.
2.4. Establishment of Experimental Bifurcation Set
The specific process and steps of establishing the experimental bifurcation set of the experimental mathematical model of a cusp-type catastrophe for the ZCM are as follows:
- (1)
Calculate the discrete dataset S2 of the canonical singularity set SC corresponding to the discrete dataset S1.
It can be seen from Assumption 1 that the potential function values at the matching points on the actual singularity set
and the canonical singularity set
are equal, that is,
From Assumption 2, it is evident that the origin of the actual singularity set
coincides with the origin of the canonical singularity set
, so the potential function values corresponding to these two origins are equal, that is,
It can be seen from the catastrophe theory that the coordinates
of any point (such as point
) on the canonical singularity set
satisfy the canonical manifold
, that is,
The points on the bifurcation set are the projection of the singularity set on the control plane, so their coordinates
should also satisfy the canonical bifurcation set
, that is,
By combining Equations (3)–(6), a quaternary system of equations for , , , and is obtained. Note that , , and are known in the system of equations, and the remaining four unknowns are , , , and . By solving the system of equations, the coordinates of the points of the canonical singularity set that match the coordinates of the points of the theoretical singularity set can be obtained. Equation (4) indicates that the integral constant term is a constant value.
is the discrete dataset
of the canonical singularity set
, as shown in
Table 3.
- (2)
Determine the form of functions between the actual control parameters and the canonical control parameters .
To determine the functions between the actual control parameters
and the canonical control parameters
, we must first determine its form. It is well known that any function can be approximated by a polynomial, and the higher the degree of the polynomial, the better the approximation effect. After many attempts, it is found that setting the functions as a cubic polynomial and a higher polynomial does not significantly increase the accuracy of the experimental mathematical model of a cusp-type catastrophe for the ZCM to be established, that is, setting the functions as a quadratic polynomial can meet the requirements, namely,
where
and
are the functions between the actual control parameters
and the canonical control parameters
expressed in polynomial form;
,
,
, and
are the exponents of polynomials, and
; and
and
are 12 undetermined coefficients, and their values are generally determined by experimental dataset and optimization techniques.
- (3)
Determine the undetermined coefficients of the functions f1 and f2.
Construct the following function:
where
is the function coefficient vector,
.
By substituting the coordinate
of the actual control points corresponding to the discrete dataset
and the coordinate
of the canonical control points corresponding to the discrete dataset
into Equation (8), 361 functions about the coefficient vector
are obtained, that is,
Construct the unconstrained optimization objective function as follows:
The 1stOpt mathematical optimization toolkit is used to obtain all the undetermined coefficients
when
is minimized, as shown in
Table 4.
As shown in
Table 4, the dependent variable
in
is not correlated with the independent variables
and
, while the dependent variable
in
is only correlated with
and
. After removing the irrelevant terms, the final forms of
and
are determined as follows:
The regression coefficients corresponding to the solution results are R = 0.9966 and R = 0.9966, respectively.
Substituting the coefficients obtained from
Table 4 into Equation (11) yields the desired functions
and
.
According to the canonical bifurcation set
, the experimental bifurcation set
(
Figure 3) is
2.5. Establishment of Experimental Manifold
The specific process and steps of establishing the experimental manifold of the experimental mathematical model of a cusp-type catastrophe for the ZCM are as follows:
- (1)
Calculate the discrete dataset M2 of the canonical manifold MC that matches the discrete dataset M1.
By substituting the actual control parameters and into the functions and , the corresponding canonical control parameters and are obtained. By substituting and into Equation (5), a cubic equation about is obtained. By solving it, 1~3 real roots are obtained, that is, 1~3 canonical state parameters are obtained. According to the catastrophe theory, if three roots are solved, one of them corresponds to the maximum value of the potential function, which belongs to the unstable equilibrium point, and the corresponding system state cannot be obtained. The other two roots correspond to the two minimum values of the potential function, that is, the two stable equilibrium points of the system, and their symbols are opposite. One of these two roots with the same symbol as is taken as the canonical state parameter corresponding to .
is the discrete dataset
of the canonical manifold
, as shown in
Table 5.
- (2)
Determine the diffeomorphism between the actual parameters and the canonical parameters .
As is well known, neural networks can approximate functions of arbitrary complexity with any desired precision [
28]. Therefore, in this study, a fully connected neural network is employed to model the diffeomorphic mapping between the actual parameters
and the canonical parameters
. The mathematical proof demonstrating that a fully connected neural network can realize diffeomorphic transformations is provided in
Appendix A.
On the MATLAB R2023a platform, a fully connected neural network
is constructed to approximate diffeomorphic mapping, with the actual parameters
as inputs and the canonical parameters
as outputs, as shown in
Figure 4. In a neural network
, both the hidden layers and the output layer utilize the sigmoid activation function, while the mean squared error (MSE) is adopted as the loss function. The training dataset
is derived from the computational results in
Table 3 and
Table 5.
The Bayesian optimization algorithm [
29] is employed to optimize the hyperparameters of the neural network model, including the number of hidden layers, the number of neurons per hidden layer, the learning rate, and the early stopping patience [
30]. During optimization, the five-fold cross-validation error is used as the objective function for Bayesian optimization to prevent overfitting. The optimization results of the model are presented in
Table 6.
By retraining the neural network
with the optimized hyperparameters, it is observed that the regression coefficients for the training set, validation set, test set, and overall dataset all exceed 0.9998, indicating an excellent regression performance (
Figure 5). The trained neural network is then saved and denoted as
.
According to the canonical manifold
, the experimental manifold
is
where
.
2.6. Establishment of Experimental Potential Function PE
The specific process and steps of establishing the experimental potential function of the experimental mathematical model of a cusp-type catastrophe for the ZCM are as follows:
- (1)
Calculate the value of the integral constant term of the canonical potential function.
Substitute the discrete dataset
and its matching
into the theoretical potential function
and the canonical potential function
with the integral constant term removed, respectively, and calculate the value of the integral constant term
corresponding to each discrete point:
The calculation results are shown in
Table 6.
- (2)
Determine the neural network of the integral constant term of the canonical potential function .
On the MATLAB software platform, a neural network
with
as the input and
as the output is built (
Figure 6). Here,
is the parameter that distinguishes the phase points
at the upper and lower leaves of the manifold. When the phase point
is located on the upper leaf of the manifold,
,
; when the phase point
is located on the lower leaf of the manifold,
,
.
In a neural network
, both the hidden layers and the output layer use the sigmoid activation function, with the mean squared error (MSE) being used as the loss function. The training dataset
is derived from the computational results in
Table 5.
The Bayesian optimization algorithm [
29] is used to optimize the hyperparameters of the neural network model. During optimization, the five-fold cross-validation error is employed as the objective function for Bayesian optimization. The optimization results of the model are shown in
Table 7.
By retraining the neural network
with the optimized hyperparameters, it is found that the regression coefficients for the training set, validation set, test set, and overall dataset all exceed 0.9998, demonstrating excellent regression performance (
Figure 7). The trained neural network is then saved and denoted as
.
According to the canonical potential function
, the experimental potential function
is
where
;
.
Equations (12), (13) and (15) are combined to form the precise experimental mathematical model of a cusp-type catastrophe for the ZCM.