1. Introduction
In nature, a large number of nonlinear chirp (NC) signals are generated every day, such as human communication sounds and the ultrasonic waves emitted by bats, dolphins, and whales [1,2,3,4]. In radar systems [5,6], if a sinusoidal signal is transmitted and the target is in uniformly accelerated motion, the echo can be approximately regarded as an NC signal. In communication systems [7,8,9], some clutter and interference can also be regarded as NC signals. As NC signals and their parameters have practical physical meaning in many fields, research on NC signal detection and parameter estimation is of great significance.
Most decomposition methods extract each signal mode directly in the time domain, of which empirical mode decomposition (EMD) [10,11,12,13] is the best known. EMD is a completely data-driven method that can adaptively decompose complex NC signals into simple modes. Although EMD is sensitive to noise and lacks a mathematical model, it has been widely used in various fields. On the basis of EMD, improved variants such as ensemble empirical mode decomposition (EEMD) have been proposed; however, these approaches address only part of the problem, and some challenging issues remain. For instance, these EMD-based methods have poor resolution for split-off modes.
To further understand the characteristics of NC signals, it is necessary to extract each nonlinear chirp mode from the signal data. In the past few decades, many methods have been developed to decompose NC signals. These methods implement signal decomposition in the time, frequency, and time–frequency (TF) domains. In the TF plane, time-varying features can be observed and used for signal decomposition. However, because of the Heisenberg uncertainty principle, the TF representation generated by conventional methods is usually blurred and cannot represent time-varying signals accurately. This disadvantage severely limits the application of traditional TF methods in practical signal processing. In recent decades, many advanced TF methods have been developed, such as the reassignment method (RM) [14,15], the synchrosqueezing transform (SST) [16,17], the demodulated SST (DSST) [18,19], and the high-order SST [20,21]. RM post-processes the output of traditional TF methods and is an effective way to obtain sharper TF results. However, the RM framework is based on the spectrogram, which means that RM results cannot be used to reconstruct the signal. The SST method not only enhances the TF resolution but also allows the signal to be reconstructed. However, when processing signals whose frequency changes over time (such as linear or nonlinear frequency-modulated (FM) signals), SST cannot generate a concentrated TF result.
The time–frequency transform maps the chirp signal to the time–frequency domain, where the variation of frequency with time is easy to observe, which is conducive to signal detection and parameter estimation. By constructing a joint function of time and frequency, the transform describes the energy density of the signal at different times and frequencies. Commonly used time–frequency transforms include the short-time Fourier transform (STFT) [22,23,24,25], the Wigner–Ville distribution (WVD) [26,27], the fractional Fourier transform [28,29], the cubic phase function [30], and the Zak transform (ZT) [31,32]. In addition, as a chirp signal usually appears in the time–frequency plane as an impulse-like ridge whose frequency varies linearly with time, line detection and extraction methods from image processing can be combined with these transforms to estimate the initial frequency and the frequency modulation slope. Chen [33] applied the idea of variational mode decomposition (VMD) to nonlinear chirp signals, giving variational nonlinear chirp mode decomposition (VNCMD), and achieved good results. However, as this method is based on VMD, it still requires the number of signal decomposition layers to be determined in advance. On the basis of Reference [33], Chen [34] uses matching pursuit, a greedy algorithm, to estimate each signal adaptively, which overcomes this defect of VNCMD. In Reference [34], two important parameters control the components of the output signal and the instantaneous frequency (IF), respectively. However, as these two parameters are set manually, errors arise in practical applications.
To address these problems, this paper proposes using the state transition algorithm (STA) [35] to determine these two parameters adaptively and thereby improve the resolution of the output signal. As the number of individuals in the STA species is itself set manually, this paper uses the Rastrigin test function to compare different species sizes and records the time of each optimization (the species size ranges from 1 to 100, while the rotation, translation, expansion, and axesion operators are held fixed at preset values). To determine the species size accurately, the optimization is run 10,000 times for each species size, the run times are averaged, and the species size with the minimum average optimization time is selected. Finally, the optimized adaptive chirp mode decomposition (ACMD) method is applied to chirp signals from marine animals, which verifies the feasibility of the proposed method.
2. Basic Theory
2.1. ACMD
ACMD was proposed by Chen [34] on the basis of VNCMD; it uses matching pursuit, a greedy algorithm, to estimate each signal mode adaptively. The i-th signal mode is defined as follows:
        where 
 is the residual signal from which the current component signal is removed, and 
 is the weight of the coefficient. Like matching pursuit, ACMD greedily looks for signal components that extract the most energy from the input signal. Assuming that the signal is discrete in time 
 series, the discrete expression of Equation (1) is as follows:
        where 
 and 
 denotes the second-order difference matrix; 
, 
, 
, and 
.
        
        where 
.
Equation (2) gives a matrix 
. The vector 
 can be estimated by solving the 
 normalized least-squares problem. On the other hand, the phase function of the demodulation function contains information on the frequency function. The optimization problem of Equation (2) can be solved by an iterative algorithm, which can update the demodulated signal and frequency function in turn. For the j-th iteration, the update of the vector 
 is as follows:
The frequency function 
 constructs 
, and the superscript 
 indicates the number of iterations. In addition, the parameter 
 controls the smoothness of the output signal. At this time, the method is a TF band-pass filter, and the bandwidth of the filter is determined by 
. Too small an 
 will cause the filter to have a narrow bandwidth (i.e., a smooth output signal). The signal components can be estimated as follows:
Using the demodulated signal obtained in Equation (6), the frequency increment can be estimated as follows:
The IF is a smooth function, so the frequency increment should also be sufficiently smooth. In practice, therefore, a low-pass filter is applied to the frequency increment to reduce the influence of noise. The IF is finally updated as follows:
        where 
, 
 is the identity matrix, 
 is the second-order difference matrix, and 
 denotes a low-pass filter. If too small a 
 is used, the output IF will be very smooth. However, rapidly changing IFs require a relatively large 
. Vector 
 can be updated by employing 
 (Equations (3)–(5)), and Equations (6)–(9) can be repeated in the next iteration.
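The alternating update described above can be illustrated with a minimal Python sketch. It assumes the regularized least-squares form commonly given for ACMD, in which the demodulated envelopes solve (ΘᵀΘ/α + KᵀK)u = Kᵀs and the frequency increment is smoothed by (ΩᵀΩ/β + I)⁻¹; these matrix forms, the parameter names `alpha` and `beta`, and the helper names `second_diff`, `estimate_mode`, and `smooth_increment` are assumptions made for illustration, not a reproduction of Equations (6)–(9).

```python
import numpy as np

def second_diff(n):
    """Second-order difference matrix of shape (n - 2, n)."""
    D = np.zeros((n - 2, n))
    for r in range(n - 2):
        D[r, r:r + 3] = [1.0, -2.0, 1.0]
    return D

def estimate_mode(s, f, fs, alpha):
    """Fit smooth quadrature envelopes a, b along the IF f and rebuild the mode.

    Assumed form: u = (Theta^T Theta / alpha + K^T K)^{-1} K^T s.
    """
    n = len(s)
    phase = 2.0 * np.pi * np.cumsum(f) / fs      # discrete integral of the IF
    K = np.hstack([np.diag(np.cos(phase)), np.diag(np.sin(phase))])
    D = second_diff(n)
    Z = np.zeros_like(D)
    Theta = np.block([[D, Z], [Z, D]])           # penalizes curvature of a and b
    u = np.linalg.solve(Theta.T @ Theta / alpha + K.T @ K, K.T @ s)
    return K @ u, u[:n], u[n:]

def smooth_increment(df, beta):
    """Low-pass filter a frequency increment (assumed form (D^T D / beta + I)^{-1})."""
    n = len(df)
    D = second_diff(n)
    return np.linalg.solve(D.T @ D / beta + np.eye(n), df)

# Usage on a synthetic 50 Hz tone whose IF is known exactly.
fs, n = 1000.0, 200
t = np.arange(n) / fs
s = np.cos(2.0 * np.pi * 50.0 * t)
mode, a, b = estimate_mode(s, np.full(n, 50.0), fs, alpha=1e-3)
```

In this sketch a smaller `alpha` weights the curvature penalty more heavily and so yields smoother envelopes, and a smaller `beta` yields a smoother frequency increment, which matches the qualitative behavior described in the text.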
As mentioned before, two parameters control the components of the output signal and the IF, respectively. In ACMD, relatively large values of both parameters are recommended for vibration signals with rapidly fluctuating IFs, and the termination threshold is set according to the required accuracy. However, these parameters are chosen manually, which introduces errors in practical applications. Therefore, this paper employs the state transition algorithm (STA) to optimize them adaptively and thereby improve the signal-to-noise ratio (SNR) and resolution of the signal.
2.2. State Transition Algorithm
The STA [35] is an efficient stochastic optimization algorithm that can optimize several parameters at the same time. It treats a solution of the optimization problem as a state, and regards the search through the solution space as a process of state transition. The algorithm uses four state transformation operators (rotation, translation, expansion, and axesion) to solve continuous optimization problems. It has strong global search ability, fast convergence, and high optimization precision.
The STA can be written as:
$$x_{k+1} = A_k x_k + B_k u_k, \qquad y_{k+1} = f(x_{k+1}) \quad (10)$$
where $x_k$ and $x_{k+1}$ represent the current state and the transition state, respectively; $A_k$ and $B_k$ are state transition matrices and can be considered as the operators of the optimization algorithm; $u_k$ is a function related to the current state $x_k$ and the historical states; and $f$ denotes the objective function.
To solve the optimization problem of Equation (10), four core operators are used:
Rotation operator:
$$x_{k+1} = x_k + \alpha \frac{1}{n \lVert x_k \rVert_2} R_r x_k \quad (11)$$
where $x_k \in \mathbb{R}^n$, $\alpha$ is a positive number called the rotation factor, $n$ denotes the number of variables, and $R_r \in \mathbb{R}^{n \times n}$ is a random matrix whose entries are uniformly distributed in $[-1, 1]$. The operator searches within a hypersphere with $x_k$ as the center and $\alpha$ as the radius, that is, $\lVert x_{k+1} - x_k \rVert_2 \le \alpha$.
Translation operator:
$$x_{k+1} = x_k + \beta R_t \frac{x_k - x_{k-1}}{\lVert x_k - x_{k-1} \rVert_2} \quad (12)$$
where $\beta$ is a positive number called the translation factor, and $R_t$ is a random number uniformly distributed in $[0, 1]$. The operator searches along the direction from $x_{k-1}$ to $x_k$, starting at $x_k$, with a maximum search step of $\beta$.
Expansion operator:
$$x_{k+1} = x_k + \gamma R_e x_k \quad (13)$$
where $\gamma$ is a positive number called the expansion factor, and $R_e$ is a random diagonal matrix whose entries follow a Gaussian distribution. The operator can search the whole search space.
Axesion operator:
$$x_{k+1} = x_k + \delta R_a x_k \quad (14)$$
where $\delta$ is a positive number called the axesion factor, and $R_a$ is a random diagonal matrix with only one nonzero entry, at a random position, which follows a Gaussian distribution.
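The four operators can be sketched in Python as follows. This is a minimal sketch that assumes the operator forms commonly given in the STA literature; the function names, the factor names `alpha`, `beta`, `gamma`, and `delta`, and the shared random generator `rng` are illustrative choices rather than the authors' implementation.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the sketch is repeatable

def rotation(x, alpha):
    """Search inside a hypersphere of radius alpha centred on x (x must be nonzero)."""
    n = x.size
    R = rng.uniform(-1.0, 1.0, size=(n, n))
    return x + alpha * (R @ x) / (n * np.linalg.norm(x))

def translation(x, x_prev, beta):
    """Move from x along the direction x_prev -> x by at most beta (x != x_prev)."""
    d = x - x_prev
    return x + beta * rng.uniform(0.0, 1.0) * d / np.linalg.norm(d)

def expansion(x, gamma):
    """Scale every coordinate by an independent Gaussian factor."""
    return x + gamma * rng.standard_normal(x.size) * x

def axesion(x, delta):
    """Perturb a single, randomly chosen coordinate."""
    r = np.zeros(x.size)
    r[rng.integers(x.size)] = rng.standard_normal()
    return x + delta * r * x

x = np.array([1.0, -2.0, 3.0])
candidates = [rotation(x, 1.0), expansion(x, 1.0), axesion(x, 1.0)]
```

Because the entries of the rotation matrix lie in [-1, 1], the rotation step can never exceed `alpha`, while the expansion step is unbounded and therefore suited to global search.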
The basic flowchart of the STA is shown in 
Figure 1. In 
Figure 1, 
 is the optimal individual in the species, 
 is the number of individuals in the search species, and 
 are the rotation operator, translation operator, expansion operator, and axesion operator, respectively. In 
Figure 1, three steps of expansion, rotation, and axesion are performed, and the translation operation is embedded in these three steps.
Specific steps are as follows:
		
- Step 1: Determine the objective and the variables of the problem. Initialize the number of individuals in the search species and the value of each operator factor.
- Step 2: Calculate the fitness value of each individual in the species.
- Step 3: Take the current best individual as the input of the expansion operation and record its fitness value.
- Step 4: Copy the best individual into a species of identical individuals and apply the expansion operator to obtain a new species. Calculate the fitness value of each individual in the new species, and record its best individual and the corresponding fitness value.
- Step 5: If the best fitness value of the new species is lower than the current best fitness value (for a minimization problem), update the current best individual and execute Step 6; otherwise, execute Step 7.
- Step 6: Copy the previous and the updated best individuals the same number of times and pair them. Apply the translation operation according to Equation (12) to obtain a new species, calculate the fitness value of each individual, and record its best individual and the corresponding fitness value.
- Step 7: If the best fitness value obtained after translation is lower than the current best fitness value, update the current best individual.
- Step 8: Output the current best individual as the result of the expansion operation.
- Step 9: The rotation and axesion operations proceed in the same way as Steps 3–8.
- Step 10: Check whether the result satisfies the termination condition. If it does, output the optimal result; otherwise, repeat Steps 2–9.
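The steps above can be sketched as a compact Python loop. This is a simplified sketch under stated assumptions: the operator forms follow the common STA literature, all factors are kept constant, a fixed iteration count stands in for the termination condition, and a small constant is added to each norm to avoid division by zero. The name `sta_minimize` and its arguments are hypothetical.

```python
import numpy as np

def sta_minimize(f, x0, se=20, factors=(1.0, 1.0, 1.0, 1.0), iters=100, seed=0):
    """Greedy STA-style search; se is the number of individuals per species."""
    rng = np.random.default_rng(seed)
    alpha, beta, gamma, delta = factors
    best = np.asarray(x0, dtype=float)
    n = best.size
    f_best = f(best)
    history = [f_best]

    def expand(x):
        return x + gamma * rng.standard_normal(n) * x

    def rotate(x):
        R = rng.uniform(-1.0, 1.0, (n, n))
        return x + alpha * (R @ x) / (n * np.linalg.norm(x) + 1e-12)

    def axes(x):
        r = np.zeros(n)
        r[rng.integers(n)] = rng.standard_normal()
        return x + delta * r * x

    def translate(x, x_old):
        d = x - x_old
        return x + beta * rng.uniform() * d / (np.linalg.norm(d) + 1e-12)

    def best_of(cands):
        vals = [f(c) for c in cands]
        k = int(np.argmin(vals))
        return cands[k], vals[k]

    for _ in range(iters):
        for op in (expand, rotate, axes):
            # Steps 3-4: copy the best individual se times and transform each copy.
            cand, f_cand = best_of([op(best) for _ in range(se)])
            if f_cand < f_best:                      # Step 5
                old, best, f_best = best, cand, f_cand
                # Step 6: translate along the direction old -> best.
                cand, f_cand = best_of([translate(best, old) for _ in range(se)])
                if f_cand < f_best:                  # Step 7
                    best, f_best = cand, f_cand
        history.append(f_best)
    return best, f_best, history
```

Since a candidate is accepted only when it lowers the objective, the best fitness value in `history` never increases from one iteration to the next.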
2.3. Optimized ACMD Method
Like other algorithms, the STA is based on an iterative search by individuals within a neighborhood. The difference is that every individual in the search species starts from the group optimum, that is, the elite individual. The search process depends only on the group optimum of the previous generation, and the search is carried out within the neighborhood of that optimum. For example, when performing the expansion operation, the best individual of the group is first taken as the input and copied as many times as there are individuals in the search species, and the expansion operation is then applied to these copies. The best individual among the results is selected and used as the input of the next operation.
The state transition algorithm has five important parameters: the number of individuals in the search species and the rotation, translation, expansion, and axesion operators. When the number of individuals is constant, a larger operator value gives the corresponding operator a larger search range, which is beneficial to the global search; a smaller value gives a higher search accuracy, which is advantageous for the local search. In References [36,37], the number of individuals in the species is set manually, and this choice affects the accuracy of the search. This paper therefore uses the Rastrigin test function to compare different species sizes and records the time of each optimization (the species size ranges from 1 to 100, while the rotation, translation, expansion, and axesion operators are held fixed at preset values). After repeated tests, the species size with the shortest average optimization time is selected for the state transition algorithm.
Rastrigin function:
$$f(x) = \sum_{i=1}^{n} \left[ x_i^2 - 10\cos(2\pi x_i) + 10 \right]$$
where $x_i$ is the $i$-th element of a point $x$ in $n$-dimensional Euclidean space.
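The species-size selection can be illustrated with a short Python sketch. It uses the standard form of the Rastrigin function, f(x) = Σ[xᵢ² − 10cos(2πxᵢ) + 10], and averages the wall-clock time of repeated runs for each candidate size. The helper names, the small repeat count, and `toy_optimizer` (a simple stand-in for the STA, used here only to make the example runnable) are assumptions for illustration.

```python
import time
import numpy as np

def rastrigin(x):
    """Standard Rastrigin test function; global minimum 0 at the origin."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(x**2 - 10.0 * np.cos(2.0 * np.pi * x) + 10.0))

def toy_optimizer(f, species_size, iters=50, seed=0):
    """Stand-in optimizer (NOT the STA): greedy Gaussian search around the best point."""
    rng = np.random.default_rng(seed)
    best = rng.uniform(-5.12, 5.12, 2)
    for _ in range(iters):
        cands = best + 0.5 * rng.standard_normal((species_size, 2))
        vals = [f(c) for c in cands]
        k = int(np.argmin(vals))
        if vals[k] < f(best):
            best = cands[k]
    return best

def mean_runtime(optimize, species_size, repeats=5):
    """Average wall-clock time of `repeats` optimization runs."""
    start = time.perf_counter()
    for _ in range(repeats):
        optimize(rastrigin, species_size)
    return (time.perf_counter() - start) / repeats

def fastest_species_size(optimize, sizes, repeats=5):
    """Return the species size with the smallest mean run time, plus all timings."""
    times = {se: mean_runtime(optimize, se, repeats) for se in sizes}
    return min(times, key=times.get), times

best_size, timings = fastest_species_size(toy_optimizer, [2, 8, 32], repeats=3)
```

The paper averages over 10,000 runs per species size; the repeat count here is kept small only so that the sketch runs quickly.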
This paper uses the improved STA to optimize the parameters of the ACMD method. The optimization process is shown in Figure 2.
- Step 1: Determine, through repeated tests, the species size of the STA that gives the shortest average optimization time.
- Step 2: Use this species size in the STA.
- Step 3: Employ the resulting STA to find the optimal parameter.
- Step 4: Input the optimal parameter into ACMD, and use the optimized ACMD to process the chirp signal.
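Steps 3 and 4 amount to choosing the parameter that maximizes the SNR of the processed signal. The sketch below illustrates this selection under loud assumptions: the SNR is taken as 10·log₁₀ of signal power over error power against a known clean reference, an exhaustive scan over a few candidates replaces the STA search, and `process` is a hypothetical placeholder for ACMD. None of these choices is taken from the paper.

```python
import numpy as np

def snr_db(clean, estimate):
    """SNR in dB of an estimate against a clean reference (assumed definition)."""
    noise = clean - estimate
    return 10.0 * np.log10(np.sum(clean**2) / np.sum(noise**2))

def select_parameter(process, noisy, clean, candidates):
    """Scan candidates (a stand-in for the STA search) and keep the best SNR."""
    scores = [snr_db(clean, process(noisy, c)) for c in candidates]
    k = int(np.argmax(scores))
    return candidates[k], scores[k]

# Toy usage: the "processing" is a plain gain, standing in for ACMD.
clean = np.sin(np.linspace(0.0, 2.0 * np.pi, 64))
noisy = 2.0 * clean
best_c, best_snr = select_parameter(lambda x, c: c * x, noisy, clean, [0.3, 0.45, 0.9])
```

In practice the scan would be replaced by the STA and the gain by an ACMD call, with the SNR definition taken from the paper's own evaluation.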
4. Experimental Verification
In the experimental part, the proposed method is applied to sound signals from four marine animals: the False Killer Whale, Baird’s Beaked Whale, the Weddell Seal, and the Bottlenose Dolphin [38]. As marine animals use sound for echolocation and communication, analyzing these sound signals is very important for studying their behavior. Figure 8 shows the time-domain waveforms of the animal sound signals and the time–frequency diagrams obtained with the ACMD method before parameter optimization. It can be observed from the time–frequency diagrams that the signals contain noise interference.
The proposed method is employed to find the optimal parameter value, and the optimization result is shown in Figure 9. Because of the large amount of data, the plot is enlarged locally around the point of maximum SNR. The parameter values corresponding to the maximum SNR of the four species in Figure 10 are: False Killer Whale,
; Baird’s Beaked Whale, 
; Weddell Seal, 
; Bottlenose Dolphin, 
. The most suitable parameter value is substituted into ACMD, and the marine animal signals are processed to obtain the results shown in Figure 10. Comparing the time–frequency diagrams in Figure 8 and Figure 10, it can be observed that the signals in Figure 10 contain less noise, which demonstrates the effectiveness of the proposed method.
To describe the change in the SNR before and after the improvement, this paper uses Equation (17). According to Equation (17), the SNR changes for the False Killer Whale, Baird’s Beaked Whale, the Weddell Seal, and the Bottlenose Dolphin are 672.018%, −185.595%, 7.777%, and 32.759%, respectively. Table 4 lists the SNR before and after ACMD optimization. Although the computed change for Baird’s Beaked Whale is −185.595%, its SNR in fact increases from −2.443 to −1.434. Table 5, Table 6, Table 7 and Table 8 list 100 SNR values around the maximum SNR for each of the four marine animals.