Deep Learning-Based Cyclic Shift Keying Spread Spectrum Underwater Acoustic Communication

A deep learning-based cyclic shift keying spread spectrum (CSK-SS) underwater acoustic (UWA) communication system is proposed for improving the performance of the conventional system in low signal-to-noise ratio and multipath effects. The proposed deep learning-based system involves the long- and short-term memory (LSTM) architecture-based neural network model as the receiving module of the system. The neural network is fed with the communication signals passing through known channel impulse responses in the offline stage, and then directly used to demodulate the received signal in the online stage to reduce the influence of the above factors. Numerical simulation and actual data results suggest that the deep learning-based CSK-SS UWA communication system is more reliable communication than a conventional system. In particular, the collected experimental data show that after preprocessing, when the communication rate is less than 180 bps, a bit error rate of less than 10−3 can be obtained at a signal-to-noise ratio of −8 dB.


Introduction
The underwater acoustic (UWA) channel is a dual-selective fading channel and has time-varying and space-varying characteristics, which brings difficulties to underwater information interaction based on the acoustic wave [1]. Direct-sequence spread spectrum (DSSS) communication technology has low spectrum density and strong resistance to multipath fading and the Doppler effect. Thus, it is widely used in UWA communication scenarios [2,3]. Compared with the conventional DSSS system, the cyclic shift keying spread spectrum (CSK-SS) modulation can provide a higher data rate. He et al. proposed a passive time reversal with CSK-SS using hyperbolic frequency-modulated (HFM) waveform for a reliable point-to-point UWA communication system [4]. Jing et al. proposed a novel interleave-division multiple access system based on CSK-SS modulation for multiuser UWA communications [5]. The communication rate of the above two systems is higher than that of the conventional DSSS communication system.
Recently, deep learning (DL) has received more and more attention as it can transform the original data features through multi-step feature conversion to obtain a higher and more slightly abstract feature representation, and further input to the prediction function to obtain the final result [6,7]. Due to the neural network models being able to easily solve credit assignment problems [8], the neural network model has become the main model used in DL. Many neural network models such as long-and short-term memory (LSTM) For the spread spectrum communication system, the spreading gain will severely affect the communication rate of the DSSS UWA communication system. However, the CSK-SS UWA communication system uses a circular cyclic shift of the spreading sequence to carry the information, which breaks the limitation of spreading gain on data rate [4]. In the process of demodulation, the conventional CSK-SS system performs correlation processing on the received baseband signal and the local cyclic shifted spreading sequence during the demodulation process and selects the maximum correlation value for decision. The position of the peak value is the information modulated on the code phase to realize the recovery of the source information. However, it needs to be pointed out that the improvement of data rate comes at the expense of destroying the partial autocorrelation function (PACF) characteristics of the sequence in the DSSS system [20]. Due to the influence of the noise and UWA multipath fading channel, the size of the correlation peak in the demodulation process of the CSK-SS UWA communication system changes, resulting in inaccurate decision results, making demodulation results wrong and affecting the performance of the CSK-SS system.
Inspired by the application of DL-based methods in the field of acoustic signal processing [16,17], in this paper, a DL-based CSK-SS UWA communication system is proposed, which innovatively applies DL to complete the demodulation of CSK-SS UWA communication signals and obtain several times higher communication rates than DSSS. More importantly, it avoids the performance limitations of conventional CSK-SS systems due to UWA multipath channels and noise environments. Furthermore, compared with the conventional system, this system can directly demodulate the received signal at the receiving end without completing the de-carrier and despreading operation of the received signal, which simplifies the processing flow of the received signal to a certain extent. By sufficiently training the LSTM architecture-based neural network model in the offline stage, a large number of random data samples modulated by CSK-SS and a large number of channel impulse responses (CIRs) generated under the ray acoustics model are used as the training datasets of the network, which gives the trained model the ability to remember and analyze CSK-SS time domain signals affected by noise and multipath fading. This paper is aimed at the application scenario of signal transmission and command control between Each cyclic shift of the spreading sequence can be obtained by multiplying the matrix H by the spreading sequence once. Each cyclic shift of the spreading sequence can be obtained by multiplying the ma H by the spreading sequence once.
At the transmitter of the system, the spreading sequence is generated by the c sequence generator. Carry out the serial-parallel conversion on the transmission seque ( ) x n , convert the binary bitstream information into a decimal data stream with m and shift the symbols of the spreading sequence by using the decimal data stream in mation. The transmitted signal reaches the receiving end of the system after passing thro the underwater acoustic channel. In the propagation process, this paper focuses on impact of multipath fading and noise on the received signal. The received signal can represented as

=H ,
τ is the propagation delay of the main path signal and 0 A is the amplitude a At the transmitter of the system, the spreading sequence is generated by the code sequence generator. Carry out the serial-parallel conversion on the transmission sequence x(n), convert the binary bitstream information into a decimal data stream with m bits, and shift the symbols of the spreading sequence by using the decimal data stream information.
where c represents the vector form of the spreading sequence, c j represents the spreading sequence obtained after j cyclic shifts of the spreading sequence c, 1 ≤ j ≤ N. After carrier modulation, the transmitted signal can be expressed as where A is the amplitude of the transmitted signal and c j (t) is the spreading code with a code length of N and chip duration of T c . Assuming that the duration of each symbol is T s , T s = NT c , f c and ϕ 0 are the carrier frequency and initial phase, respectively. The transmitted signal reaches the receiving end of the system after passing through the underwater acoustic channel. In the propagation process, this paper focuses on the impact of multipath fading and noise on the received signal. The received signal can be represented as where τ 0 is the propagation delay of the main path signal and A 0 is the amplitude after attenuation. τ l is the propagation delay of multipath signal, 1 ≤ l ≤ L, L is the number of multipath signals, A l is the amplitude of multipath signal reaching the receiving end, ϕ l is the phase of the multipath signal, ϕ l = 2π f c τ l + ϕ 0 , c j (t − τ l ) is the shift spreading sequence generating delay, and the local carrier is cos(2π f c t + ϕ ). When the system carrier is synchronized, f c = f c , ϕ = ϕ 0 , and n(t) are additive Gaussian white noise. c k (t) can be obtained from the locally generated spreading sequence through the sequence selector and cyclic shifter. k is the symbol shift information, 1 ≤ k ≤ N. After the system completes symbol synchronization, c k (t − τ 0 ) can be obtained.
After passing through the low-pass filter, only the integral output within the duration of one symbol is considered. Within the duration τ 0 ≤ t ≤ T s + τ 0 , the output of the integrator can be expressed as The correlation functions R k (0) and R k (τ l − τ 0 ) in Equation (5) can be expressed in the form of spread spectrum code Z(t) changes with the change of k, and the position of its maximum value is the information modulated on the code phase.
In the multipath fading channel, the influence of multipath fading on the performance of the CSK-SS system can be explored through the correlation function of the spread spectrum code. As shown in Figure 2, When SNR = 10 dB, the correlation peak of the spreading sequence decreases by nearly 15% after the CSK-SS signal passes through the multipath fading channel and decreases by nearly 28% when SNR = 0 dB. Figure 3 shows the correlation peak of the spreading sequence under the influence of multipath fading of the DSSS signal. When SNR = 10 dB, its peak value decreases by about 10% and decreases by nearly 18% when SNR = 0 dB. The above comparison shows that compared with the DSSS system, the CSK-SS system breaks the limitation of spreading gain on communication rate, and the improvement of communication rate is at the cost of destroying the excellent PACF characteristics of sequences in the DSSS system. In order to reduce the impact of multipath fading and noise on the technology of the CSK-SS system, this paper proposes a DL-based CSK-SS communication system. At the receiving end of the system, the CSK-SS signal is demodulated through the neural network model, which replaces the de-carrier, despreading, and related decision operations in the conventional system. The trained neural network establishes the mapping relationship between the CSK-SS signal and its carrying bits and solves inaccurate decision results caused by multipath fading in a conventional CSK-SS system.

LSTM Neural Network Model
The hidden layer of the LSTM neural network is composed of LSTM cells. A unique gating mechanism is introduced to control the accumulation speed of information. The standard unidirectional LSTM network transmits information in the positive order of time. In addition, to enhance the performance of the network, the BiLSTM network is proposed [23]. Compared with the standard unidirectional LSTM network, it adds a network layer that transmits information in the reverse order of time and connects the two hidden layers to the same output layer. In phoneme classification [24] and speech recognition [11], the performance of the bidirectional network is better than that of the unidirectional network. In order to explore the DL-based CSK-SS UWA communication system, this paper analyzes the application effects of the unidirectional LSTM and BiLSTM network models in the system, respectively.
As a fundamental component of the one-way LSTM and BiLSTM network hidden layer, the LSTM cell is introduced below. The structure of the LSTM cell [25] is shown in Figure 4.

LSTM Neural Network Model
The hidden layer of the LSTM neural network is composed of LSTM cells. A unique gating mechanism is introduced to control the accumulation speed of information. The standard unidirectional LSTM network transmits information in the positive order of time. In addition, to enhance the performance of the network, the BiLSTM network is proposed [23]. Compared with the standard unidirectional LSTM network, it adds a network layer that transmits information in the reverse order of time and connects the two hidden layers to the same output layer. In phoneme classification [24] and speech recognition [11], the performance of the bidirectional network is better than that of the unidirectional network. In order to explore the DL-based CSK-SS UWA communication system, this paper analyzes the application effects of the unidirectional LSTM and BiLSTM network models in the system, respectively.
As a fundamental component of the one-way LSTM and BiLSTM network hidden layer, the LSTM cell is introduced below. The structure of the LSTM cell [25] is shown in  For a given input sequence X 1:T = (x 1 , x 2 , . . . , x t , . . . , x T ), in each time step, t, x t ∈ R d is used as the input vector feed to the LSTM cell. The LSTM cell outputs a cell state vector c t ∈ R m for the transmission of cyclic information, and a hidden state h t ∈ R m is an output as the output vector of the LSTM cell, which can be expressed as where f t ∈ (0, 1) m , i t ∈ (0, 1) m , and o t ∈ (0, 1) m are forget gate, input gate, and output gate, respectively. They are used to control the path of information transmission, c t−1 is the cell state at the previous moment, and c t ∈ R m is the activation state vector of the cell. They can be expressed as where σ(•) is the Logistic function, W ξ ∈ R d×m , U ξ ∈ R m×m , and U ξ ∈ R m×m are the weight matrix and bias vector parameters that the network needs to learn during the training process, ξ ∈ { f , i, o, c}, h t−1 is the hidden state of the LSTM cell at the last moment.

DL-Based CSK-SS UWA Communication System Structure
The structure of the DL-based CSK-SS UWA communication system is shown in Figure 5. Compared with the conventional system, in the receiving part of the DL-based system, the neural network model is used to replace the receiver module of the conventional system. The DL-based signal demodulation will be divided into the offline training stage and the online test stage. In the offline stage, the transmitter completes the modulation of the known source information, reaches the receiver of the system through the UWA channel, and completes the training of the neural network model by receiving the signal data samples. In the test stage, the synchronized unknown received signal is input to the trained neural network model to complete the signal demodulation.
This paper selects unidirectional standard LSTM and BiLSTM as the neural network model in the system. In addition, two different schemes will be proposed for whether there is channel equalization preprocessing. In scheme A, the CSK-SS signal through the UWA channel is directly input to the DL-based system after signal synchronization. The neural network model training and the demodulation of the received signal are completed in two stages. In scheme B, the preprocessing operation of the received signal is added. Firstly, the channel estimation is completed by the orthogonal matching pursuit (OMP) algorithm [26], then the channel equalization is realized by the virtual time-reversal mirror (VTRM) technology [27], and finally, the preprocessed received signal is processed. In the two stages, the training of the neural network model and the demodulation of the received signal are completed.
In the offline training stage of scheme A, the known information sequence generates signal samples after CSK-SS modulation. Multiple UWA channel samples for training are generated by using BELLHOP. The training dataset consists of the CSK-SS signal passing through the UWA channel, which can be expressed as where s(t) is the CSK-SS signal transmitted by the transducer, h(t) is the CIR of the UWA channel, n(t) is the noise interference, and ⊗ is the convolution operation.
system. The DL-based signal demodulation will be divided into the offline training stage and the online test stage. In the offline stage, the transmitter completes the modulation of the known source information, reaches the receiver of the system through the UWA channel, and completes the training of the neural network model by receiving the signal data samples. In the test stage, the synchronized unknown received signal is input to the trained neural network model to complete the signal demodulation. This paper selects unidirectional standard LSTM and BiLSTM as the neural network model in the system. In addition, two different schemes will be proposed for whether there is channel equalization preprocessing. In scheme A, the CSK-SS signal through the UWA channel is directly input to the DL-based system after signal synchronization. The neural network model training and the demodulation of the received signal are completed in two stages. In scheme B, the preprocessing operation of the received signal is added. Firstly, the channel estimation is completed by the orthogonal matching pursuit (OMP) algorithm [26], then the channel equalization is realized by the virtual time-reversal mirror (VTRM) technology [27], and finally, the preprocessed received signal is processed. In the two stages, the training of the neural network model and the demodulation of the received signal are completed. In the offline training stage of scheme B, in order to suppress the ISI caused by multipath expansion of the UWA channel and reduce the impact of channel fading, channel equalization preprocessing can be carried out before the received signal is fed to the neural network model. Based on the reciprocity theorem [28], the time-reversal mirror technology matches the UWA channel of acoustic transmission and guides spatial focusing and time compression [29,30]. VTRM technology can make the multipath signals generated by the acoustic channel superimpose in phase simultaneously, compress the signal in the time domain, suppress the ISI caused by multipath spread, obtain the focusing gain, and improve the SNR. Especially in constructing a UWA communication network, when the nodes are fixedly arranged under the complex shallow water acoustic channel conditions, a better communication performance will be obtained by using VTRM technology. In addition, since the focusing effect of VTRM is related to the accuracy of UWA channel estimation, the 9 of 22 OMP algorithm is used to estimate the channel. The received signal processed by VTRM will be used as the training dataset of the neural network, which can be expressed as where h (−t) is the CIR estimation result after inversion,ĥ(t) is the cross-correlation function between the CIR h(t) and its estimated value h (−t), also known as the virtual time-reversal channel.
In the training process, quantifying the difference between the expected probability distribution of the output result and the predicted probability distribution, iterate continuously on the weight parameters and bias parameters in the model to gradually reduce the difference. Through the softmax activation function that maps the eigenvector to the effective real space of [0, 1] to represent the probability of the category, the cross-entropy loss function is selected to calculate the loss value to quantify the difference. The cross-entropy loss function [31] is as where p(Y i (n)) is the true probability distribution of the i-th sample and q( f (x i (n))) is the predicted probability distribution of the i-th sample.
In the online test stage of scheme A and scheme B, the unknown source data is modulated into the CSK-SS signal, and then the UWA channel sample space for the test is established. It should be noted that the above test channel samples are different from the training channel samples. In scheme A, the modulated CSK-SS test signal is directly received by the receiving end after testing the channel. In scheme B, it is received by the receiving end after preprocessing. The sufficiently trained neural network models will realize the direct demodulation of the received signal according to the mapping relationship between the waveform in the time domain and the information bits carried by each symbol.

Environment Configuration and Parameter Settings
In the simulation, two neural network models are discussed. Similarly, both models comprise an input layer, hidden layer, fully connected layer, softmax layer, and output layer. Different from each other, the hidden layer is LSTM and BiLSTM, which respectively complete unidirectional propagation and bidirectional propagation of information according to the input time series. The size of the input layer of the two neural network models is 744, the number of hidden units in the LSTM cells is 30, and the output size of the fully connected layer and the number of neural units in the output layer are determined by the number of types of tags. The training dataset is generated by using the m-sequence with a spreading gain of 31 as a spreading sequence and generated under SNR = 5 dB. The training dataset and the test dataset are divided according to a ratio of 3:1.
In the offline training stage, in order to sufficiently train the neural network models and make them have the ability to remember and analyze complex UWA channels, based on the SSP actually collected in a specific sea area, the spatial positions of the transmitting transducer and the receiving hydrophone are continuously adjusted according to a specific step. In addition, the BELLHOP model is used to obtain multiple groups of CIRs according to the combination of different positions of the transmitting transducer and the receiving hydrophone. In addition, in the online test stage, the spatial positions of the transmitting transducer and the receiving hydrophone will be further adjusted to obtain a variety of position combinations to generate multiple groups of CIRs used in the test stage. Figure 6 shows an environmental configuration for generating CIRs. Figure 7 shows the SSP obtained through the actual collection in the Yellow Sea of China in May 2020. cific step. In addition, the BELLHOP model is used to obtain multiple groups of CIRs according to the combination of different positions of the transmitting transducer and the receiving hydrophone. In addition, in the online test stage, the spatial positions of the transmitting transducer and the receiving hydrophone will be further adjusted to obtain a variety of position combinations to generate multiple groups of CIRs used in the test stage. Figure 6 shows an environmental configuration for generating CIRs. Figure 7 shows the SSP obtained through the actual collection in the Yellow Sea of China in May 2020.  The parameter settings of the UWA channel simulation are given in Table 1. In the simulation, the ocean bottom parameters [32] are set. In this paper, the flat seabed with very fine sand is considered, in which the density ratio is 1.268, the sound The parameter settings of the UWA channel simulation are given in Table 1. In the simulation, the ocean bottom parameters [32] are set. In this paper, the flat seabed with very fine sand is considered, in which the density ratio is 1.268, the sound velocity ratio is 1.0568, and the attention coefficient is 0.01875dB/m. Figure 8 shows the shallow water CIR obtained through SSP through BELLHOP software. Figure 9a-c shows CIRs obtained by controlling the change of the position of the transmitting transducer and the change of the horizontal distance between the transmitting transducer and the receiving hydrophone. The shallow water environment generally has the characteristics of high temporal and spatial variability. The propagation of acoustics in shallow water is mainly the repeated interaction with the sea surface and seabed. Figure 10a-f shows the difference in acoustic transmission loss in shallow water when the transmitting transducer is located at different depths.

Performance Analysis of the System without Preprocessing in Shallow Water Channels
In the test stage, the input size of the information sequence in each time step of the neural network model is determined by the symbol length of the signal. By comparing the output of the neural network model with the source information, the BER curve is obtained and the performance of the DL-based CSK-SS UWA communication system is evaluated.
In CSK-SS modulation, the number of bits carried by each symbol determines the symbol rate of the system. When each symbol carries 4 bit, 3 bit, and 2 bit information, the symbol rates are 275.86 bps, 206.89 bps, and 137.9 bps, respectively. The simulation results of the DL-based CSK-SS communication system in shallow water acoustic channels are shown in Figure 11. Due to the random selection of several UWA channels which are not in the range of training samples in the test stage (like the green circle in Figure 6), the BER curves of the conventional system and the DL-based system represent the mean BER of various channels under different SNR.

Performance Analysis of the System without Preprocessing in Shallow Water Channels
In the test stage, the input size of the information sequence in each time step of the neural network model is determined by the symbol length of the signal. By comparing the output of the neural network model with the source information, the BER curve is obtained and the performance of the DL-based CSK-SS UWA communication system is evaluated.
In CSK-SS modulation, the number of bits carried by each symbol determines the symbol rate of the system. When each symbol carries 4 bit, 3 bit, and 2 bit information, the symbol rates are 275.86 bps, 206.89 bps, and 137.9 bps, respectively. The simulation results of the DL-based CSK-SS communication system in shallow water acoustic channels are shown in Figure 11. Due to the random selection of several UWA channels which are not in the range of training samples in the test stage (like the green circle in Figure 6), the BER curves of the conventional system and the DL-based system represent the mean BER of various channels under different SNR. Without channel equalization preprocessing, the performance of DL-based systems on two different neural network models is better than that of the conventional system in the SNR range of −14 dB to 0 dB. Meanwhile, the anti-noise ability of the LSTM neural network model is improved by 4.5 dB, 8 dB, and 9 dB when the symbol rate is 275.86 bps, Figure 11. BER curve of DL-based system and conventional system (Con-S) without preprocessing.
Without channel equalization preprocessing, the performance of DL-based systems on two different neural network models is better than that of the conventional system in the SNR range of −14 dB to 0 dB. Meanwhile, the anti-noise ability of the LSTM neural network model is improved by 4.5 dB, 8 dB, and 9 dB when the symbol rate is 275.86 bps, 206.89 bps, and 137.9 bps, and the magnitude of the BER is 10 −2 . Compared with the conventional system, the anti-noise ability of the BiLSTM neural network model is improved by 7 dB, 10 dB, and 10 dB. It can be seen that the BiLSTM neural network model has better performance than the LSTM neural network model.
From one perspective, the BiLSTM network increases the vertical depth of the network compared with the unidirectional LSTM network by adding a network layer that transmits information in reverse time to enhance the capability of the network. From another perspective, under the condition of a complex shallow water channel with a serious multipath effect, the BiLSTM network can use the internal relationship of ISI in sequential data to reduce the impact of ISI on performance.

Performance Analysis of the System after Preprocessing in Shallow Water Channels
In the preprocessing stage, firstly, the CIR is reconstructed by the OMP algorithm, then the channel equalization is realized by VTRM technology, and finally, the processed signal data are input into the neural network model to complete the signal demodulation. Through simulation analysis, it was found that the performance of the preprocessed DLbased system is better than that of the preprocessed conventional system.
The simulation results of the preprocessed DL-based CSK-SS communication system in shallow water channels are shown in Figure 12. When each symbol carries 4 bit, 3 bit, and 2 bit information, and the magnitude of the BER is 10 −3 , the SNR of the LSTM neural network model is about 9.5 dB, 10 dB, and 10.5 dB lower than that of the conventional system, respectively. In addition, when each symbol carries 4 bit and 3 bit information, the SNR required by the BiLSTM neural network model to achieve the same system performance is reduced by about 1 dB and 0.5 dB, respectively, compared with the LSTM neural network model. When each symbol carries 2 bit information, the performance of the two mod similar, but the BiLSTM neural network model still has a slight advantage at −8 dB whys and wherefores are that after preprocessing, the performance of the DL-base tem is better than that of the conventional system and the BiLSTM neural network m has more advantages in performance.
As shown in Figure 13, compared with the DL-based CSK-SS UWA communic When each symbol carries 2 bit information, the performance of the two models is similar, but the BiLSTM neural network model still has a slight advantage at −8 dB. The whys and wherefores are that after preprocessing, the performance of the DL-based system is better than that of the conventional system and the BiLSTM neural network model has more advantages in performance.
As shown in Figure 13, compared with the DL-based CSK-SS UWA communication system without VTRM technology, for the BiLSTM neural network model, the required SNR is reduced by about 7 dB, 3.5 dB, and 2 dB, respectively. VTRM technology suppresses the ISI caused by multipath expansion of the UWA channel so that the multipath signal energy is superimposed to obtain the focusing gain. In addition, VTRM can make the signal components coherently superimposed and the noise components incoherently superimposed to increase the SNR of the signal. Therefore, after the preprocessing of the communication signal is completed by using this technology, the performance of the system will be further improved when demodulated by the neural network model.
When each symbol carries 2 bit information, the performance of the two m similar, but the BiLSTM neural network model still has a slight advantage at −8 whys and wherefores are that after preprocessing, the performance of the DL-b tem is better than that of the conventional system and the BiLSTM neural netwo has more advantages in performance.
As shown in Figure 13, compared with the DL-based CSK-SS UWA comm system without VTRM technology, for the BiLSTM neural network model, the SNR is reduced by about 7 dB, 3.5 dB, and 2 dB, respectively. VTRM technol presses the ISI caused by multipath expansion of the UWA channel so that the m signal energy is superimposed to obtain the focusing gain. In addition, VTRM c the signal components coherently superimposed and the noise components inco superimposed to increase the SNR of the signal. Therefore, after the preprocessi communication signal is completed by using this technology, the performance o tem will be further improved when demodulated by the neural network model.  Figure 13. BER curve of DL-based system with or without preprocessing.

System Robustness Analysis in Specific Application Scenarios
For the specific application scenarios of fixed UWA communication nodes, the robustness of the system needs to be analyzed. In the previous section, the test channel samples were taken outside the range of the training samples. This is because the underwater nodes will be affected by ocean currents and tides, which will lead to the change of node position. In order to analyze the impact of the sample mismatch between the offline training stage and the online test stage on the system performance, this section will first select the training channel sample space and use it as the test channel to obtain the system performance when the two-stage samples match each other. Secondly, by changing the transmitting transducer depth (d t ), receiving hydrophone depth (d r ), and the distance (d l ), the CIR outside the training sample space is obtained to analyze the system performance when the two-stage samples mismatch. Only the robustness of the BiLSTM model is analyzed in this section. The simulation results are shown in Figure 14. By comparing the BER curves, it can be seen that based on the simulation results in this application scenario, due to the change of underwater node position, the UWA channel sample mismatch between the training stage and the test stage does not have significant damage to the performance of the DL-based CSK-SS UWA communication system. lyzed in this section. The simulation results are shown in Figure 14. By comparing the BER curves, it can be seen that based on the simulation results in this application scenario, du to the change of underwater node position, the UWA channel sample mismatch betwee the training stage and the test stage does not have significant damage to the performanc of the DL-based CSK-SS UWA communication system.

Experimental Scene Construction and Parameter Setting
In this section, to verify the performance of the DL-based CSK-SS communicatio system, the demodulation of the actual signal is tested in the water tank experiment. Th experiment also consists of two stages: offline training and online test. The size of the wa ter tank is 45 m (length) * 6 m (width) * 5 m (depth). There are anechoic tiles installed o the walls on both sides of the water tank, and the bottom of the water tank is covered wit smooth tiles, which will make the sound waves reach the receiving end of the system afte multiple reflections from the bottom and the water surface, thereby simulating the prop agation process of the signal in the shallow water channel. Figure 15 shows the scene con struction of the experiment, in which A represents the position of the transmitting trans ducer, B1, B2, B3, and B4 represent the position of the receiving hydrophone in the offlin training stage, and C represents the position of the receiving hydrophone in the onlin test stage. In the experimental equipment, the power amplifier (PA) is B&K2713 [33], th operating frequency band of the transmitting transducer is 8-16 kHz, the attenuator i BEHRINGER DI-100 [34], and the receiving hydrophone is ST300HF [35].

Experimental Scene Construction and Parameter Setting
In this section, to verify the performance of the DL-based CSK-SS communication system, the demodulation of the actual signal is tested in the water tank experiment. The experiment also consists of two stages: offline training and online test. The size of the water tank is 45 m (length) * 6 m (width) * 5 m (depth). There are anechoic tiles installed on the walls on both sides of the water tank, and the bottom of the water tank is covered with smooth tiles, which will make the sound waves reach the receiving end of the system after multiple reflections from the bottom and the water surface, thereby simulating the propagation process of the signal in the shallow water channel. Figure 15  The transmitted signal used in the offline training stage is composed of several data packets. The signal in each data packet is composed of an HFM signal and communication signal, in which the communication signal is the known source information modulated by CSK-SS. After being transmitted by the transducer, the hydrophone receives the signal The transmitted signal used in the offline training stage is composed of several data packets. The signal in each data packet is composed of an HFM signal and communication signal, in which the communication signal is the known source information modulated by CSK-SS. After being transmitted by the transducer, the hydrophone receives the signal after passing through the water tank acoustic channel. Finally, the received signal is processed as the training dataset of the neural network model to complete the training of the neural network model.
In the online test stage, the hydrophone receives the transmitted signal and is directly input into the trained neural network model after resampling to complete the signal demodulation.
The analysis of the simulation results shows that in the DL-based CSK-SS UWA communication system, the BiLSTM network has better performance than the LSTM network. Therefore, the neural network model used in the water tank experiment is BiLSTM and the parameter settings in the neural network are consistent with those in the simulation.
The primary parameter setting and data structure of the transmitted signal are shown in Tables 2 and 3.

Analysis of Experimental Results
In the water tank experiment, the performance of the conventional system and the DL-based system are compared. Moreover, in CSK-SS modulation, three schemes with communication rates of 180.45 bps (each symbol carries 4-bit), 146.34 bps (each symbol carries 3-bit), and 106.19 bps (each symbol carries 2-bit) are analyzed. In addition, in experimental data processing, noise interference is added artificially to obtain the condition of a low SNR.
In the experiment, the training data samples of the neural network are received by hydrophones at positions B1, B2, B3, and B4, respectively. The data used to test the performance of the DL-based system are received by the hydrophone at position C. Due to the different locations of the hydrophone, the UWA channel structures in the training and test stage are also different. Due to the change of the position of the receiving hydrophone, the different CIRs is shown in Figure 16.
hydrophones at positions B1, B2, B3, and B4, respectively. The data used to test th formance of the DL-based system are received by the hydrophone at position C. D the different locations of the hydrophone, the UWA channel structures in the trainin test stage are also different. Due to the change of the position of the receiving hydrop the different CIRs is shown in Figure 16. The performance of the conventional and the DL-based systems without p cessing are compared experimentally. The scatter diagram of BER under three C modulation schemes under different SNRs is given in Figure 17. The BER curve is by calculating the mean of scatter points under each SNR. A more intuitive represen is given in Figure 18. Through the analysis of the data, it can be found that under th CSK-SS modulation schemes, the performance of the DL-based system is better tha of the conventional system, and the anti-noise performance is improved by 1 dB to The performance of the conventional and the DL-based systems without preprocessing are compared experimentally. The scatter diagram of BER under three CSK-SS modulation schemes under different SNRs is given in Figure 17. The BER curve is given by calculating the mean of scatter points under each SNR. A more intuitive representation is given in Figure 18. Through the analysis of the data, it can be found that under the three CSK-SS modulation schemes, the performance of the DL-based system is better than that of the conventional system, and the anti-noise performance is improved by 1 dB to 3 dB.    The experiment also compares the performance of the conventional and the based systems after preprocessing. Figure 19 shows the scatter diagram of BER un three CSK-SS modulation schemes under different SNRs. Similarly, the BER curve is g through Figure 20. The data analysis shows that the preprocessing improves the per mance of the DL-based CSK-SS UWA communication system. Under the three CSK modulation schemes, compared with the preprocessed conventional system, the pre cessed DL-based system improves the anti-noise ability by 9 dB to 10 dB. When the S is greater than −8 dB, each symbol in CSK-SS modulation carries 4 bits, 3 bits, and 2 the BER of the DL-based system is less than 2.5×10 −3 , 1.7×10 −3 , and 0.8×10 −3 , respectiv Under the condition of SNR = −14 dB, when each symbol in CSK-SS modulation carri bit, 3 bit, and 2 bit information, the BER is 8.6×10 −2 , 7.6×10 −2 , and 4.5×10 −2 , respectively The experiment also compares the performance of the conventional and the DL-based systems after preprocessing. Figure 19 shows the scatter diagram of BER under three CSK-SS modulation schemes under different SNRs. Similarly, the BER curve is given through Figure 20. The data analysis shows that the preprocessing improves the performance of the DL-based CSK-SS UWA communication system. Under the three CSK-SS modulation schemes, compared with the preprocessed conventional system, the preprocessed DL-based system improves the anti-noise ability by 9 dB to 10 dB. When the SNR is greater than −8 dB, each symbol in CSK-SS modulation carries 4 bits, 3 bits, and 2 bits, the BER of the DL-based system is less than 2.5 × 10 −3 , 1.7 × 10 −3 , and 0.8 × 10 −3 , respectively. Under the condition of SNR = −14 dB, when each symbol in CSK-SS modulation carries 4 bit, 3 bit, and 2 bit information, the BER is 8.6 × 10 −2 , 7.6 × 10 −2 , and 4.5 × 10 −2 , respectively.     Tables 4 and 5 respectively shows the BER of the conventional and DL-based systems with or without preprocessing operations under different SNRs.   The experimental results show that the performance of the DL-based CSK-SS UWA communication system is better than that of the conventional CSK-SS UWA communication system. In addition, for the DL-based system, when the UWA channel structure in the offline training stage is inconsistent with that in the online test stage, the BiLSTM network still has a certain generalization ability. It is worth noting that under the condition of a low SNR, the channel equalization preprocessing method can significantly improve the performance of the DL-based system. Therefore, the DL-based CSK-SS UWA communication system can realize reliable signal transmission in complex shallow water acoustic channels under the condition of a low SNR. The memory and analysis of the received signal are completed through the BiLSTM network model, which solves the problems in the signal demodulation process of the conventional CSK-SS UWA communication system affected by multipath fading and noise.

Suggestions for Future Experiments
By processing and analyzing the actual data in the water tank experiment, we have thought about the problems that may be encountered in future experiments. In order to deal with these problems, we will provide suggestions and references for all scholars to pay attention to the process of experimental verification in the future.

− Influence of experimental equipment
It is necessary to consider the negative impact of the PA on the transmitted signal in the experiment. When the PA amplifies the signal, it usually introduces the corresponding nonlinear distortion [36]. The additional high-order components obtained after the PA will cause certain distortion to the amplitude and phase of the signal. This influence will produce phase modulation components and cause clutter interference. Digital predistortion technology [37] can be used to compensate for the nonlinear distortion caused by the PA by distorting the signal before passing through the PA in the digital domain to reduce the impact on the performance of the neural network. In addition, the frequency response of the preamplifier, filter, and transducer will also affect the signal waveform. It is necessary to adopt more strict standards to screen the instruments and equipment that may be used in the test process.

− Variation of UWA channel and coping strategies
The environment of the water tank is relatively stable and the channel conditions will not change much. However, the marine environment is more complex and changeable, and the channel will change due to marine dynamic factors such as sea surface floating. It is necessary to "saturation train" the network model. For the specific application scenario of underwater fixed communication nodes, it can be considered to wake up the underwater nodes regularly in non-working hours, send training signals to the receiver to obtain more diverse UWA channel information, and periodically strengthen the neural network model to improve the performance of the system.

Conclusions and Prospect
In this paper, the neural network model is used as the receiver structure of the DLbased CSK-SS UWA communication system to demodulate the signal. The neural network model is trained based on the training samples with distortion caused by the influence of UWA channels and noise in the offline stage.
The simulation results show that the DL-based CSK-SS UWA communication system has better reliability than the conventional system in the complex shallow water acoustic channels with low SNR. The neural network model-based receiver module will reduce the impact of the multipath effect on the performance of the conventional CSK-SS system in fading channel. Furthermore, the neural network model has good generalization ability. When the online deployment conditions do not precisely agree with the offline training conditions, the neural network model can still work effectively to a certain extent. In other words, the model can analyze and memorize the complex characteristics of the UWA channel. In addition, the focusing gain brought by VTRM will introduce new beneficial features to the neural network, which will bring more reliable performance to this system.
The performance of the DL-based CSK-SS UWA communication system is verified by a water tank experiment. The experimental results show that the performance of the DL-based system is improved compared with the conventional system. Especially for the DL-based system after channel equalization preprocessing, when SNR is greater than −8 dB, the BER is less than 2.5 × 10 −3 . It should be pointed out that the BER of the communication system can be further reduced with appropriate coding under the condition of low BER. In addition, some suggestions are put forward for the problems that scholars may encounter in future experiments.
In the future, we can also consider using convolutional neural network, combined with transfer learning, few-shot learning, and other technologies to expand the application scope of the system further and reduce the cost and complexity of network training. Moreover, further exploration of the application of the system in practical engineering will be considered.