Neural-Network-Based Equalization and Detection for Underwater Acoustic Orthogonal Frequency Division Multiplexing Communications: A Low-Complexity Approach

: The performance of the underwater acoustic (UWA) orthogonal frequency division multiplexing (OFDM) system is often restrained by time-varying channels with large delays. The existing frequency domain equalizers do not work well because of the high complexity and difﬁculty of ﬁnding the real-time signal-to-noise ratio. To solve these problems, we propose a low-complexity neural network (NN)-based scheme for joint equalization and detection. A simple NN structure is built to yield the detected symbols with the joint input of the segmented channel response and received symbol. The coherence bandwidth is investigated to ﬁnd the optimal hyperparameters. By being completely trained ofﬂine with real channels, the proposed detector is applied independently in both simulations and sea trials. The results show that the proposed detector outperforms the ZF and MMSE equalizers and extreme learning machine (ELM)-based detectors in both the strongly reﬂected channels of the pool and time-variant channels of the shallow sea. The complexity of the proposed network is lower than the MMSE and ELM-based receiver


Introduction
With the increasing requirements for an Internet of Things in the oceans, efficient data processing and transmission become critical for ensuring the instantaneity for the underwater environment monitoring [1,2] and emergency rescue [3]. Orthogonal frequency division multiplexing (OFDM) has been a viable method in bandwidth-constrained underwater acoustic communications [4,5], as a result of its high spectral efficiency and ability to resist frequency selective fading. Nonetheless, the selective channels decided by variant parameters, such as distribution of sound speed [6], bottom reflection coefficient and surface waves [7], limit the performance improvement of the underwater acoustic (UWA) OFDM system [8,9].
To better detect the OFDM signals from UWA channels, variable equalizers have been applied, e.g., linear equalizers including zero-forcing (ZF) and minimum mean square error (MMSE) equalizers [10], and decision feedback equalizer (DFE) such as the Turbo equalizer [11,12]. The linear equalizers with simple structures are widely used in terrestrial communication links [13,14], whose performances rely on accurate channel estimations. Altough the DFEs show satisfactory performance without a channel estimator, it is at the expense of the requirement for higher computational complexity and extra channel coding [15,16]. In practice, real-time underwater acoustic communications (UAC) do not allow for a large number of online iterations [17][18][19], which still require linear equalizers. However, the noise amplification problem occurs when applying the ZF equalizer. Although MMSE equalizer overcomes this problem by considering the signal-to-noise ratio (SNR), it is hard to estimate the statistical values of the noise in real underwater environments including non-Gaussian and colored noise [20,21]. To further optimize the equalizer, deep learning (DL) and neural network (NN) have been developed [22][23][24][25][26]. With enough samples, a DL-based receiver can statistically learn to detect the symbols from the channel and other interference.
H. Ye et al. proposed a DL-based OFDM receiver [22], which used three fully connected layers to deal with a 64-subcarrier OFDM symbol with a block-type pilot. The bit error rate (BER) of the NN-based receiver was lower than the least square (LS) and MMSE estimation and detector in the simulation. To further obtain higher detecting accuracy, researchers have tried to substitute the whole communication system for the end-to-end networks [22,24,25,27]. The traditional digital modulation and subcarrier mapping has been replaced by the autoencoder (AE) [24]. A blind receiver without the pilot has been built with a convolutional neural network (CNN), which showed better performance compared with the traditional baselines. Similarly, an AE has been designed [25] to provide a modulation scheme for the multicarrier system. This work fed the decoder with both the channel state information (CSI) and received symbol, constructing a data-driven model for symbol detection. The simulations showed significant BER performance in additive white Gaussian noise (AWGN) channels. In B. Lin's work [28], a super-resolution channel reconstruction network was combined with AE for the marine communication system, proving its effectiveness in slow fading channels. H. Zhao et al. [29], J. Liu et al. [30], and Y. Zhang et al. [31] proposed different network structures for the UWA OFDM receiver, and trained them with the WATERMARK dataset. These studies focused on designing specific network structures to improve the performance of the OFDM receiever. Nevertheless, the theoretical explanation of the networks remains limited. There is no quantitative analysis for the hyperparameters.
Although the above NN-based communication systems show good performances in simulations, it is difficult to practically implement them, particularly in underwater acoustic channels because of the heavy computations and complex structures. For instance, Refs. [29][30][31] did not conduct sea trials. Another option is to build simple networks for the module optimization. M. Turhan et al. proposed an NN-based generalized frequency division multiplex with index modulation (GFDM-IM) detector to detect the symbols after a coarse detector [32]. With perfect CSI in the receiver, the simulation results showed lower BERs of this network than the ZF detector. T. Wang et al. have built a CNN for index modulated OFDM (IM-OFDM) detection, whose performance approximates the maximum likelihood (ML) detector [33]. A further option for NN is the extreme learning machine (ELM)-based receiver. This kind of receiver integrates the channel estimator and equalizer with a single layer NN, which is trained online for each time [34,35]. In L. Yang's work, with enough block pilots for training, a long frame with a large quantity of OFDM symbols was simulated [35], showing better performance than MMSE equalizer and NNbased detector proposed by H. Ye et al. [22]. Since the UWA channels were time-variant, the ELM detector was unable to show good performance, because the transmitted frames had to be kept short to reduce the influence of the time-variant channel. H. Zhao et al. [36] proposed a transfer strategy for the DNN-based OFDM receiver and tested it with the WATERMARK dataset and real experimental data. This study focused on the network retraining, and did not discuss the design of the applied NN structure. Y. Zhang et al. [37] focused on solving the channel sample augmentation problem for the NN-based channel estimator. Both [36,37] proposed an innovative strategy to solve the application problem of DNN-based receievers.
Despite the good simulation results produced by the above structures, problems still exist when the system is implemented in UWA channels.

•
Firstly, the real dataset is difficult to obtain because the UAC links are usually one-way with no feedback. Hence, the uncertain time-variant channel states do not allow the system to obtain the samples in a short period. • Secondly, it is not realistic for the NNs to be retrained in a high rate link because the computation loading is still heavy for real-time applications. Consequently, the datadriven works mentioned above barely discuss the performance of the system with real experiments.
In this paper, an attempt at the design and derivation of an NN-based receiver is made for the UWA OFDM system. A simple NN is proposed to integrate the equalization and symbol detection, containing only one fully-connected layer. Firstly, taking both channel frequency response and received symbol as input, the network learns a robust structure to output the symbol directly. To minimize the complexity of the network, the channel and received symbol are divided into blocks of the same size, matched with small-size networks. For attribution to the simple structure, the hyperparameters (mainly the hidden layer size and input dimension) are inferred according to the delay and coherence bandwidth of the channel. Thus, the channel-driven networks are constructed. After being trained with mixed channels and noise samples, the networks show robustness in both simulations and sea trials, performing better than the ZF and MMSE equalizers as well as the ELM-based detector in [35]. The contributions of this paper are listed as follows. • We propose a low-complexity NN-based symbol detector for the UWA OFDM system. The network takes the segmented channel response and symbol block as input and integrates the equalization and detection processes. The small input dimension also reduces the requirement for the hidden neurons. The proposed detector shows lower computational complexity than the MMSE and ELM-based detectors. • The NN-based detector is trained offline with a channel dataset containing simulated and real channels. Then the detector can be applied completely independently online with fixed hyperparameters, improving the efficiency of the online receiver. Under the same LS channel estimator, the trained network outperforms the ZF and MMSE equalizers, and the whole receiver is more reliable than the online ELM-based detector in both frequency selective channels in the pool and time-variant shallow sea channels. • To obtain the optimal network structure, the block size of an OFDM symbol is associated with the coherence bandwidth. By testing each network with the input sizes in the range of less than the coherence bandwidth, the optimal hyperparameters can be found. The simulations verify the above configurations.
The remainder of this paper is organized as follows. Section 2 describes the UWA OFDM system. Section 3 discusses the UWA channels and the traditional detectors for symbols suffering from them. The network structure and training strategy are described in Section 4, while the result discussions of simulations and sea trials are included in Sections 5 and 6. Section 7 concludes our work.

Preliminary
An UWA OFDM system with frequency domain equalization is shown in Figure 1. The bit stream b to be transmitted is modulated to symbols with digital modulation. After the inverse fast Fourier transform (IFFT) is performed, the signal is up-converted to the carrier frequency. Then it is transmitted through the channel and suffers from noise. In the receiver, the signal is represented as where s(t), h(t, τ) and z(t) are the transmitted signal, channel impulsive response and additive noise. τ is the channel delay. ⊗ denotes convolution. After being down-converted to the baseband and performing the fast Fourier transform (FFT), the received symbol in the frequency domain can be written as where S, H, and Z are the transmitted symbol, channel transfer function, and additive noise in frequency domain. Usually, the UWA channels and noise are different from those in terrestrial communications. With more powerful recognizable paths and impulsive noise, the OFDM system does not show good performance in shallow water. To reduce the influence of channels on symbols, before detection, the channel should be estimated and used to equalize the symbol with specific algorithms such as ZF and MMSE.

Signals Suffering from UWA Channel with Large Delays
The UWA channel differs from the terrestrial electromagnetic channel [38]. The ray theory reveals that the delay of an underwater channel is decided by the path length and sound speed [7], and in addition, according to [39], the motion of the transmitter/receiver pair, the scattering of the moving sea surface and the refraction due to sound speed variations. For a received symbol, the time-varying UWA channel impulse response (CIR) can be written as where N p is the number of paths, and τ i (t) ≈ τ i − a i t is the time-varying delay of the i-th path, and a i is the Doppler factor. c i (t) is the channel coefficient of each path varying with time. In the receiver, after resampling, FFT, and low-pass filtering [4], the channel function in frequency domain is written as As a result of the slow sound speed and low reflection loss, τ i (t) is large in long distances. These recognizable paths result in a small coherence bandwidth for the OFDM symbol. Figure 2 shows a CIR and corresponding transfer function caught in the water tank of Xiamen University. It can be seen from Figure 2b that the frequency selectivity is severe because of the long delay of the recognizable paths in Figure 2a.

Signal Detection in UWA Environments
For the received signal described in Equation (1), using the minimum squared Euclidian distance is considered to detect the above symbol, the optimization problem can be expressed asŜ (n) = arg min where · represents 2-norm. H(n) is the channel frequency response in any one OFDM symbol. M is the modulation order, and S m (n) is the m-th referred symbol. Further, the squared Euclidian distance can be written as Considering a system utilizing frequency domain equalization (FDE) for better performance, for the signal in Equation (6), a multiplier G(n) is included in the detector, yielding Considering G(n) as the entry of multiplier matrix G, for ZF equalizer, whereĤ is the estimated channel matrix. Multiplying R in Equation (2) with G, and expending the equation, the equalized symbol can be expressed as where g nn is the element of G. It can be seen in Equation (10) that the equalized symbol includes amplified noise which will influence the detection. In the noiseless channel, the second term is zero.
For the MMSE equalizer, there is where σ 2 and P are powers of noise and signal, respectively, and I is the identity matrix. With Equation (11), the MMSE-equalized symbol can be expressed as The first term of Equation (11) includes a factor matrix that only contains positive coefficients. As Figure 3 shows, for a single received point in the decision regions, the elements of this factor matrix linearly scale the received point to draw it closer to the reference point, which does not change its quadrant. Consequently, for the low-level constellations that can decide the symbols according to the quadrants they lie, such as BPSK and QPSK, the MMSE does not perform better than the ZF equalizer. In addition, the MMSE equalizer requires a priori SNR, which is difficult to obtain in time-varying UWA channels. When the frequency selectivity becomes strong, with imperfectly estimated channels, both ZF and MMSE equalizers cannot recover the symbols effectively [40]. However, it is still necessary to develop a more effective frequency domain equalizer in underwater acoustic channels because of the attractive low complexity. Variable NN structures provide new solutions to such interference elimination problems. The NN now has been proved to learn one or more nonlinear processes well with a proper structure [33]. With an intelligent interference simulating model [41], it is possible to train an equalization network offline, which can be applied independently online without extra a priori environment information.

NN-Based Joint Equalization and Detection
Unlike the frequency domain equalizers mentioned above, this paper combines equalization and symbol detection and implements a joint detector with the NN. The detected symbol can be written as where L{·} is the loss function and D[·] represents the process of the proposed network. It has been proved that a simple network structure is enough to well solve the receiver problems [33,42]. Inspired by this, the proposed joint detector utilizes a single-layered network. As shown in Figure 4, after channel estimation, R(n) and H(n) are sent to the network for detection. Moreover, a block-input strategy is proposed to further decrease the complexity of the network. N b is the number of blocks and N L denotes the number of neurons. The structure configurations of the network are described as follows. Input: Before input to the network, the received OFDM symbol R(n) with N subcarriers is firstly divided into N b blocks. Each block contains N c subcarriers. The same process is conducted with the corresponding estimated channel functionĤ(n). Thus, a small reusable network can be designed for each combination of the data and channel block. To further determine the optimal hyperparameters, a proper N c should be defined to balance the computation and accuracy of the network, i.e., two rules are proposed: (1) N c should optimize the performance of the joint detector with estimated channel.
(2) The required number of neurons is positively correlated to the input dimension, and N c should be as small as possible to minimize the computation of the network.
Based on the analysis above, the narrow coherence bandwidth W c of the channel is considered, which always limits the performance of the underwater OFDM system. Because the subcarriers in each coherence frequency band suffer from relatively flat fading, an opportunity is found to find the optimal N c [43]. By setting N c in the range of coherence bandwidth, the joint detection network is able to deal with the symbol blocks separately in a flat fading channel. Figure 5a shows the frequency response of a simple channel with one path. A 1 to D 3 are values of the frequency points. According to [43], for a channel that follows the homogeneous assumption, the coherence bandwidth is inversely proportional to the maximum channel delay T d , which can be approximately represented as For the channel in Figure 5a, W c is easily observed as the frequency range between B 2 and D 3 , written as In practice, the UWA channel contains more paths as shown in Figure 5b, whose frequency response is the sum of more than one paths with time-varying coefficients distributions [39]. With the same estimated W c as Figure 5a, different situations are listed in Table 1 by taking different sections as blocks. It should be noted that the starting point of the first block should always be A 1 , which is also the first subcarrier of an OFDM symbol. When the width of the block W b = W c , block [A 1 , C 2 ] suffers from selective fading because the section includes inflection points of the frequency response. The same result occurs when W b = W c /2 because [B 1 , B 3 ] includes an inflection point. When W b = W c /3, no sections include an inflection point or experience flat fading. This consistency of flat fading is more conducive to the symbol detection [44].
According to the discussion above, there is a W b in the range (0, W c ], which decides the optimal N c , yielding where ∆ f is the frequency interval of two contiguous subcarriers. The NN with input length N c deals with blocks that all suffer from flat fading. Therefore, although the accurate W c is difficult to find, its estimate can be an upper bound for finding the optimal input dimension for the network. Consequently, the input block size N c can be quickly found by going through the coherence bandwidth of the estimated channel. Algorithm 1 shows the steps to find N c . N w c represents the number of subcarriers contained in the range of the empirical coherence bandwidth. The threshold C Th is used to filter the paths with low power which does not affect the performance of the system. I Th contains the indexes of the recognizable paths. BER( · ) represents the process to calculate the BER. Because there is only one optimal N c for each W c that varies slowly in a short duration in fixed locations, steps 7 to 13 can be conducted independently with offline training.
with N c as block length; 9: P bn = BER(b tn , b n ); 10: if P bn < P b 11: P b = P bn ; 12: end if 13: N c = N c − 1; 14: return N c ; After N c is decided, the input matrix is the combination of the symbol block and channel block. To input these complex symbols and channels to the real-value NN, the real and imaginary parts of the symbol and channel blocks are extracted and rearranged, which can be written as where R k b (n) and H k b (n) are the n-th received symbol block and estimated channel block of k-th training batch. B is the batch size. Since H(n) is input directly, the network does not need to learn the changing channel characteristics [25]. Instead, a generative analytical process can be learned to be adaptive to any kind of channel.
Network configurations: As Figure 4 shows, the single-layered network proposed in this paper contains a fully connected layer with N L neurons. To detect a symbol block, the input firstly multiples a weight vector W L , then adds bias vectors B L , yielding where g(·) is the activation function. To transform Y m to the output with required length, another linear map is built: where W out and B out are weights and bias of the output layer. Y contains N c symbols, which are further input with the one-hot format reference symbols to calculate the cross entropy as the loss function, yielding where S onehot ij (n) and Y s ij (n) are the one-hot reference symbol and the corresponding output data. M is the modulation order; softmax[·] is a function to map Y s ij (n) to the range (0, 1) [28]. The use of cross entropy can make the network converge quickly.
Training strategy: For each N c , there are two tasks for the network. One is to find the optimal N L and another is to train the parameter matrix D. Both tasks can be finished offline with one training. Figure 6 shows the training strategy of the network. The samples are constructed with three parts: random data symbols, different types of channels and noise. The channel samples contains Rayleigh distributed ones and underwater acoustic channels collected in the pool, artificial lake, and Wuyuanwan Bay, Xiamen, while the noise includes Gaussian distributed noise and impulsive noise generated by the GAN in [41].
The network for the given input dimension is trained with the range [N bottom , N up ] processed with a step to find the optimal number of neurons N Lopt . N bottom and N up are lower and upper bound of the possible N Lopt , respectively. Meanwhile, the optimal weight matrix D opt trained with N Lopt is memorized. Then both D opt and N L are delivered to the online network to detect the real received symbols. Online applications: Unlike the NN-based detector, which takes the only the received symbol as the input, the estimated channel is included in the input in the proposed detector. This configuration offers more information for the network to detect symbols, determining a low-complexity semi-analytical detection network, which is generalized to different channels. Therefore, the final goal of this study was to train the networks which could be independently applied online without retraining. The simulations and experiments are described and discussed below to demonstrate the performance of the proposed structure.

Complexity Analysis
To analyze the time-complexity of the proposed NN-based detector, it should be firstly noted that in the following simulations and sea trials, the networks were all trained once in advance and were not retrained online. All parameters of the networks was fixed when conducting simulations and experiments. Thus the large batch size of the dataset was not included in the computation. Hence, only N L and N were considered. The computations of both the LS estimator and minimum distance detector were included for the calculation of complexity of other equalizers. Consider the following LS channel estimator The number of multiply-accumulate operations (MACC) of the above process is the number of subcarriers N. For the minimum distance detector, each symbol experiences M times of complex operations, which contains 3MN + 5N MACC.
As a similar implementation of the NN-based equalizer, the ELM-based detector in [34] was considered for performance comparisons. According to [34], the number of MACC of the ELM-based detector was derived and listed in Table 2. Ignoring the addition with constant terms, the time complexities of different detectors are compared in Table 2. It can be seen that the computation of the proposed NN-based detectors is less than that of MMSE for 1 order of magnitude. The complexity of the ELM-based detector is the highest, which is 2 orders of magnitude larger than the proposed NN-based detector. Although the ZF equalizer shows the lowest complexity, it has been proved to be a suboptimal algorithm in noisy channels, which could be substituted by more advanced methods.

Numerical Simulations
The configurations of the simulated system are listed in Table 3. An OFDM system with the bandwidth 5000 Hz was built. BPSK and QPSK were chosen as the digital modulations. The number of subcarriers, which was 384, should be divisible by N c , whereas several possible N c s were chosen to build the networks. Before comparing the performance, the network was firstly trained with mixed samples. As mentioned in Section 4, besides Rayleigh channels generated with MATLAB, the real channels collected in the pool, artificial lake, and Wuyuanwan Bay, Xiamen, were taken as samples. All channel samples were mixed randomly in the proportion 1:1:1:1. For Rayleigh channel samples, the maximum CFO was set to 100 Hz. Moreover, Figure 7a-c shows the real scenarios to collect channels. The average depths of the pool, artificial lake, and testing sea area were 1 m, 5 m, and 8 m, while the depths of the transmitter and the receiver in the three areas were 0.5 m, 0.8 m, and 1.5 m. The only factor which affected the pool channel was the hard wall and bottom made of tiles. In addition, the average wind power in the artificial lake and testing sea area was <level 3. The outdoor tests were all performed in sunny days. Different distances were covered to obtain different maximum delays shown in Figure 7d-f. Furthermore, the power threshold C Th was set to 0.01 to filter the paths with low power. Taking mixture of the AWGN and impulsive noise generated with the GAN in [41] as noise samples, the dataset was finally constructed. Table 4 shows the training parameters for the network. The networks were trained with dynamic SNR and mean square errors (MSE) of the estimated channels. To accelerate convergence, ReLU was taken as the activation function, along with the Adam optimizer. The whole training process was conducted in Python with TensorFlow.  Figure 8 shows the BERs of the given N c changing with N L . The SNR was 25 dB and the assumed MSE of the channel estimated was 0.004. The step size for N L was 4. It can be seen that large input dimensions, such as N c = 12 and N c = 16, required more neurons to reach the best performance. In addition, N c for QPSK was larger than that of BPSK. To be clear, the estimated N Lopt s from Figure 8 are listed in Table 5. For each N c , QPSK needed at least 20 more neurons than BPSK.  It should be noted that the optimal N c of both modulations in Figure 8 is 1. This is because the training set contained multiple CIRs, including Rayleigh channels, with different delays. In this situation, the coherence bandwidth of the channel samples was limited to a small value. To further demonstrate the influence of the coherence bandwidth, the trained networks were further used to detect the symbol from the channels with specific maximum delays. The pool channels in Figure 7a were cut off with lengths of 48, 64, and 77 points, the corresponding N w c s of which were 8, 6, and 5. Figure 9a-f show the BERs with different N c in BPSK-OFDM and QPSK-OFDM systems. The BER curves of SNR = [15,25] dB are enlarged in Figure 9b,d,f. The MSE of the channels estimated was set to 0.01. It can be seen from the figures that the optimal N c varies with modulation and N w c . According to Algorithm 1 , the optimal N c s are listed in Table 6. It can be found that in all situations, the networks with N c > N w c showed poor performance. These results prove the analysis in Section 4, and further demonstrate the feasibility of Algorithm 1. Table 6. The optimal N c for different systems. 4 3 In addition, the networks with optimal N c s were compared with the ZF and MMSE equalizers. The pool channel with N w c = 8 was used. Figure 10 shows the BERs of different equalizers. The proposed network showed lower BERs than the ZF and MMSE equalizers both under perfect channel estimation (MSE = 0) and MSE = 0.01 of the estimated channel. In particular, when SNR = 20 dB and MSE = 0, the BER of QPSK detection network was 18.66% lower than that of the ZF and MMSE equalizers, while the BER of BPSK detection network was 14.23% lower than that of the ZF and MMSE equalizers. When SNR = 20 dB and MSE = 0.01, the BER of QPSK detection network was 29.26% lower than that of the ZF and MMSE equalizers and the BER of BPSK detection network was 22.16% lower than that of the ZF and MMSE equalizers. In addition, it can be seen from Figure 10 that the BER curve of MMSE equalizer for each estimation error is almost the same as the ZF equalizer, which confirms the discussion in Section 2.  [15,20] dB are enlarged in (b,d,f), respectively. Further, the above equalizers were tested with the LS estimation. The ELM-based detector in [35] was also compared. All receivers apply the minimum Euclidean distance method to detect symbols. Figure 11 shows the BER for these detectors. It can be seen that the ELM-based detector showed the worst performance in pool channels. The NN-based receiver still showed low BERs. When SNR = 20 dB, the BERs of NN were lower than ZF and MMSE equalizers for 25.92% and 30.99% under BPSK and QPSK modulations. The above results have proved the efficiency of the proposed NN-based detector. Furthermore, the underwater trials introduced in Section 6 show the practicability of the proposed detector.

Underwater Trials
The above systems were first tested in the pool in Figure 7. The signals were transmitted with the carrier frequency F c = 12 kHz and the sampling frequency F s was 100 kHz. The parameters shown in Table 3 were taken to generate baseband signals. The block-type pilots were used for estimation of the time-invariant channels, and the length of cyclic prefixes was 1/3 of the symbol length. Table 7 lists the BERs of different receivers. It should be noted that not all the NN-based detectors were retrained, which was the same as the trained networks in the last section. The optimal N c s for the NN was found to be 2. This was because the hard wall of the pool caused strong reflections that resulted in a small coherence bandwidth. By adjusting the transmitting power, two groups of signals were tested with SNR = 30 dB and 5 dB. The ELM-based receiver still showed the worst performance. It could also be found that the proposed NN showed low BERs in all conditions, while the ZF and MMSE equalizers showed relatively high BERs. Compared with the simulations, the gap of the performance between the proposed NN and other equalizers became smaller. This was because the delays of the real channels were much longer than the simulated ones, which caused intersymbol interference (ISI) besides intra-symbol interference. Because the NN and traditional equalizers are only designed to eliminate the interference in each symbol, the influence of ISI could not be well equalized. Further, the receivers were tested in real sea. The spot chosen was Xiamen Bay near the location in Figure 7c and the communication distance was 1 km. In addition to the block-type pilots, the comb-type pilots were applied to make more accurate estimations, for the channels were time-variant in the shallow sea. In addition, the LS estimator was used for both pilot types. The SNR was controlled intentionally to 4.5 dB for comparison of the performances in hostile environments, leading to the received signals in Figure 12b. The impulse interference can be observed, which severely affects the performance of the receiver.   Table 8 lists the BERs for block-type pilots, N c = 16, and for comb-type pilots, N c = 4. It can be seen that the received symbols with block-type pilots can hardly be detected with all detectors, although the proposed NN performs best. With comb-type pilots, the results are better. Since the ELM-based detector can only use the block-type pilots, the BERs with comb-type ones are ignored. Nevertheless, the BERs of the NN-based detector are the lowest among all equalizers. These results show the generation of the proposed NN-based detector in different UWA environments. It can be seen from Table 8 that compared with the ZF and MMSE equalizers, the proposed NN-based detector shows less difference than Figure 11. This is because the signals transmitted in sea trials experience more complex interferences caused by the time-variant sound speed field and noise distributions. In this situation, the signals suffer from more distortions than simulations, such as inter-carrier interference (ICI), which should be reduced by other algorithms.
To show the generation of the NN-based detector, the networks were tested with the data transmitted in Xiamen Bay in June 2018. The signals were transmitted at 500 m with level 3 sea conditions. A pair of NI-6341 data acquisition cards were connected with 30 kHz transducers as the transmitter and the receiver, shown in Figure 12b. The data were modulated by QPSK and the numbers of subcarriers were set as 128 and 512. A comb-type pilot was applied for the LS channel estimation. The NN-based detector was applied to replace the original MMSE equalizer and Euclidean distance detector. The SNR at the receiver was estimated as 32 dB. Table 9 shows the BERs of different detectors. Both detectors could detect the symbols well and the NN-based detector outperformed the original detector with both 128 and 512 subcarriers.

Conclusions
In this paper, a low-complexity NN-based detector has been proposed to be implemented in an OFDM system. The segmented channel responses and received symbols are input to the single-layered network, which directly outputs the detected symbols. By associating the network with the assist of coherence bandwidth of the estimated channel, an algorithm is built to find optimal hyperparameters. The networks are all trained offline, which are applied for both simulations and sea trials with fixed parameters. The quantitative simulations have compared the ZF, MMSE equalizers, and the ELM-based detector with the proposed NN-based detector, and the results show that the proposed detector reaches the lowest BER in the tested UWA channels. The same results can be found in the sea trials. With the best performance among the tested equalizers and detectors, the proposed detector has lower computational complexity than MMSE and ELM-based detectors. The proposed NN-based detector needs accurate estimated channels for better performance, which leads future research to focus on the construction of the optimization of the channel estimator.