A Multi-Dimensional Deep-Learning-Based Evaporation Duct Height Prediction Model Derived from MAGIC Data

: The evaporation duct height (EDH) can reﬂect the main characteristics of the near-surface meteorological environment, which is essential for designing a communication system under this propagation mechanism. This study proposes an EDH prediction network with multi-layer perception (MLP). Further, we construct a multi-dimensional EDH prediction model (multilayer-MLP-EDH) for the ﬁrst time by adding spatial and temporal “extra data” derived from the meteorological measurements. The experimental results show that: (1) compared with the naval-postgraduate-school (NPS) model, the root-mean-square error (RMSE) of the meteorological-MLP-EDH model is reduced to 2.15 m, and the percentage improvement reached 54.00%; (2) spatial and temporal parameters can reduce the RMSE to 1.54 m with an improvement of 66.96%; (3) the multilayer-MLP-EDH model can match measurements well at both large and small scales by attaching meteorological parameters at extra height, the error is further reduced to 1.05 m, with 77.51% improvement compared with the NPS model. The proposed model can signiﬁcantly improve the prediction accuracy of the EDH and has great potential to improve the communication quality, reliability, and efﬁciency of ducting in evaporation ducts.


Introduction
An atmospheric duct is a unique phenomenon in the lower atmosphere, and electromagnetic waves can experience less attenuation in the trapped layer by limiting the spread of the wavefront from spherical to cylindrical expansion, where the waves are bent by atmospheric refraction [1,2].A long-range transmission in microwave bands can also be realized.Namely, microwave radio signals may refract in the lower layers of Earth's atmosphere and propagate far beyond the line of sight [3].This feature may be appropriate for communications at sea [4], while the public land mobile network (PLMN) [5,6] is limited due to the special meteorological conditions and terrain features [7,8].Moreover, communication systems using evaporation ducts are expected to become an important means of the sixth-generation communication system until we further understand its distribution characteristics.
As a strong negative vertical humidity gradient near the sea surface, the evaporation duct exists due to the moisture content rapidly decreasing with increasing altitude [9,10].Dramatic effects may be applied to the microwave communication system while transmitting in the ducting layer, especially for frequencies above 1 GHz [11], which may meet the demand for large-bandwidth, high-speed, and long-range applications [12].The evaporation duct frequently occurs over the ocean and the occurrence in the South China Sea (SCS) exceeds 75% [13].However, the spatial and temporal refractivity variations significantly affect shipboard communication performances at sea and near shore [4,14,15].As a result, communication effects may experience several disadvantages, including a specific time loss (about 25% in the SCS based on the above statistical characteristic of the evaporation duct) and an available low antenna height with minimal effects on any land-land or shipbased system, etc., constraining it from becoming a convenient and widely used maritime communication.Therefore, the prerequisite for communication using evaporation ducts is the accurate prediction of transmission effects, which relies on two critical components: the atmospheric refractive condition and the prediction in the given environment [16].The evaporation duct height (EDH) is the characteristic parameter of the refractivity profile.The accurate prediction of the EDH has special significance in the practical design of the evaporation duct communication system and the instrument parameters.The EDH can be directly measured [17,18] and evaluated by numerous methods [19][20][21] and theoretical models [9,[22][23][24][25][26].The directly measured method may take lots of time and effort, and the latter two methods can calculate from meteorological detection at a certain height, but the accuracy needs further improvement.Therefore, a focus topic in the present and the future is to maximize the use of measurement datasets and to improve the prediction accuracy of EDH.
Presently, "artificial intelligent (AI) enhanced operation" has become one of the hotspot directions [14,[27][28][29][30].The combination of AI and the analysis of evaporation ducts has also boosted the accuracy of EDH prediction.In modeling construction, Yan et al., propose a numerical profiling method that adopts the artificial neural network and training data from the remote sensing data and the naval postgraduate school (NPS) model [14].Zhao et al., propose a method based on a multi-layer perception (MLP) of five hidden layers to predict the EDH, and the applicability in different areas is analyzed [28].In short-term prediction, Zhao et al., constructed an EDH prediction model based on a long short-term memory network [29].In addition, Mai et al., introduced the Darwinian evolutionary algorithm and compared the accuracy with the neural network in EDH prediction [30].
In this paper, deep learning methods are utilized to improve the prediction accuracy of the EDH so that the communication system can be better designed and operated.Furthermore, we construct a multi-dimensional EDH prediction model for the first time by blending with spatial and temporal "extra data" during meteorological detection [31].Section II describes the background and previous EDH prediction method and Section III describes the modeling process of the proposed model.Finally, predictions of the proposed model and the theoretical method are compared with the measurements; the effectiveness has also been verified.

Evaporation duct Diagnosis
The refraction in the atmosphere refers to the bending characteristics while the electromagnetic wave propagates in the medium, and the degree can be measured by the where c and v are the propagation speed of the electromagnetic wave in free space and the medium, respectively.Radio refractivity N (N-unit) is usually used in the troposphere to reflect the corresponding spatial structure characteristics.According to the ITU-R Recommendation P.453-14 [32] N = (n − 1) × 10 6 = 77.6 where P is the atmospheric pressure (hPa), T is the absolute temperature (K), and e is the water vapor pressure (hPa).
For the convenience of considering the curvature of the Earth, the modified refractivity M (M-unit) is often utilized as [9,10] where z is the height above the ground (m) and r e is Earth's radius (the average Earth radius is 6371 km).Electromagnetic waves are bent towards the ground by atmospheric refraction, while the vertical gradient of modified refractivity becomes negative (dM/dz < 0).Signals can refract and propagate over the horizon with matching frequencies and angles.

Theoretical Models of EDH
Based on the Monin-Obukhov similarity theory [33,34], the vertical profile of mean wind speed U(m/s), potential temperature θ (K), and specific humidity q (kg/kg) in the surface layer can be calculated.At present, the extensively utilized numerical methods in the evaporation duct prediction include the Paulus-Jeske (PJ) model [22], the Musson-Gauthier-Bruth (MGB) model [23], the Babin-Young-Carton (BYC) model [9], the NPS model [24], and the surface heat budget of the arctic ocean experiment (SHEBA) model [26] are extensively utilized at present.An evaporation duct's modified refractivity (M-profile) can be defined by a limited number of meteorological factors, such as pressure and temperature at the sea surface, relative humidity, temperature, and wind speed at a certain altitude.
The comparison between the BYC model, the NPS model, and the SHEBA model are listed in Table 1.During the calculation, the scale parameters and the thermodynamically roughness height of the sea surface are defined by the COARE algorithm [33], and the profile stability functions calculate the wind speed and temperature under stable conditions.
Taking the NPS model as an example, the advanced air-sea flux algorithm COARE 3.0 is adopted, keeping good consistency with the measured results [35].The input parameters are used to determine the modified refractivity profile, and the altitude with minimum value is the EDH.The vertical profile of air temperature T and specific humidity q at altitude z can be calculated as [24]: where θ * and q * are the characteristic scales of potential temperature and specific humidity, respectively; ψ h is stability functions; z 0t is thermodynamic roughness height; Γ d is the dry adiabatic decline rate; κ is Karman constant; L is the Monin-Obukhov length.
According to theoretical models, the EDH can be calculated with meteorological parameters at the sea surface and at a certain height.The calculation function can be expressed as where T h 0 and P h 0 are the pressure and temperature at the sea surface h 0 , respectively, RH h 1 , T h 1 , and U h 1 are the relative humidity, temperature, and wind speed at the altitude h 1 , respectively.

2007
SHEBA model [26] Grachev and Andreas (SHEBA07) where ξ = z/L is the Monin-Obukhov parameter, used to express the atmospheric stability; z is the altitude; L is the similarity length.

Analysis of Transmission Effects
For long distance transmission in the microwave band, the main transmission mechanisms are normal propagation close to the Earth's surface and troposcatter propagation [36,37].Anomalous propagation of transmission in the ducting layer may apply to communication systems over the ocean, signals may experience less attenuation under appropriate conditions.
The spatial and temporal distribution of meteorological parameters is uneven, leading to the changes of transmission effect changes with the variation of the evaporation duct.As a result, extra propagation loss among the designed communication system may be incurred.To analyze the influence of the variation of evaporation ducts quantitatively, a communication link was designed with the antenna heights fixed at 10 m.A specific position in the Pacific Ocean (30.0 • N, 130.0 • W) was selected, where the annual refractivity was 346.68 N-units [32].The transmission loss diagram at 12 GHz with a distance of 0-500 km is shown in Figure 1.To perform the propagation curves corresponding to the typical EDH varied from 6 m to 18 m, the parabolic equation toolbox (PETOOL) [38,39] with the parabolic equation (PE) method [40,41] has been applied.In Figure 1, the path loss curves varied considerably with different EDH.Path losses increase with the EDH fixed at 6 m and lower than the antenna height, resulting in a poor communication effect.Significant improvement of channel conditions arises when the EDH is between 16 m and 18 m and the path loss fluctuates around 150 dB.Overall, the transmission loss fluctuates with the increase of the EDH.Especially in the range of 10-12 m, a 1 m variation in the height may lead to an increase in the path loss at 500 km of 16.92 dB to 34.00 dB, which brings much uncertainty to the operation of the communication system.This is roughly the same as the results of [38,42].

MAGIC Datasets
To verify the prediction accuracy of existing models and to explore a better method, we use measured meteorological data from the ship-based marine ARM GCSS Pacific cross-section intercomparison (GPCI) investigation of clouds (MAGIC) field campaign.The ship-based MAGIC field campaign, with the marine-capable second ARM mobile facility (AMF2) deployed, lasted for nearly 200 days between Los Angeles, California and Honolulu, Hawaii, which provided high-resolution measured datasets of clouds, precipitation, and marine boundary layer (MBL) [43,44].The ship completed 20 round trips from October 2012 to September 2013.Lots of instruments were deployed to measure meteorological parameters aboard the ship throughout the campaign: a Vaisala weather station, an inertial navigational location and attitude system, an infrared SST autonomous radiometer (ISAR), radiosondes, etc. [44].
The meteorological parameters of the MAGIC datasets collected in this paper mainly include temperature, pressure, relative humidity, wind speed, and direction measured by the marine meteorological system (MARMET) at approximately 27 m above sea level; the sea surface skin temperature (SSST) measured by the ISAR; the ship location by a navigation system.The time resolution of the MARMET and the ISAR devices is 1 min.The time distribution of the sounding data collected from MAGIC datasets is shown in Figure 2. In addition, standard radiosondes (Vaisala model MW-31, SN E50401) were launched at 1 m to measure the vertical profiles of temperature, pressure, relative humidity, and wind speed and direction.Meteorological data at different altitudes were also collected using the Vaisala radiosonde (MW-31), with 0.5 Hz vertical resolution at fixed times [31].As a result, 571 sets of radiosonde data were formed.

Data Processing
Using the boat measurements of the MARMET and ISAR devices at the sea surface and 27 m above sea level and the radiosonde-collected data during their ascent, we can obtain meteorological parameters for at least seven altitudes at each geographic location.The sea surface relative humidity (RH) was set at 98% [43].As shown in Table 2, the measured datasets are preprocessed before modeling, with invalid datasets removed.In addition, we selected 476 effective datasets from 571 collections of radiosonde sounding data.Then, the modified refractivity index at seven layers can be obtained based on the effective meteorological datasets.A least-squares curve fit was applied to each of the 476 measurements.Furthermore, we got the M-refractivity profile by a log-linear function [9,10,45].
where M is the modified refractivity (M-units) and h is the altitude (m).f 0 , f 1 , and f 2 are coefficients that can be calculated for a least-squares best fit.The constant 0.001 was used to prevent the curve from blowing up at zero altitudes [9,10].
Figure 3 shows the calculating process of the EDH, modified refractivity M for every 0.1 m between the surface and 40 m altitude based on MAGIC datasets, and the height at which the minimum M is achieved is the EDH.

Reliability Assessment of Theoretical Models
Meteorological parameters, including the pressure and temperature at the surface, the relative humidity, the temperature, the wind speed in the air, and the altitude, will be considered in the calculation of the evaporation duct characteristics using theoretical models [24].The statistical root-mean-square error (RMSE) of the EDH predictions of the BYC model, the NPS model, and the SHEBA model based on the MAGIC datasets are described in Table 3.The minimum RMSE is 4.52 m by the NPS model.Let x and y represent measured EDH and predicted EDH of the NPS model, and the fitting line is y = 0.42x + 5.52, far from the evaluation criteria y = x, as shown in Figure 4.The EDH calculated by the NPS model does not match the measured data well.According to Figure 1, the RMSE exceeds 4 m, which may lead to a transmission loss error of more than 100 dB, which may bring significant deviation to the receiving effect of the transmission system.

Modeling Method
Theoretical models are generally based on the Monin-Obukhov similarity theory and are constrained by some basic physical boundary layer assumptions [3].On the contrary, the neural network training prediction method can be derived entirely from original data.Therefore, it is more suitable for the natural atmospheric environment and will not be constrained by theoretical assumptions.
Considering that MAGIC has special characteristics from other similar experiments: (1) the experimental positions were spatially repeatable (the ship completed 20 round trips); (2) the radiosonde data were concentrated at several hours (it launched every 6 h); (3) the experiment instruments were set at multilayers, which have great data background both in time and space.Therefore, combine the experimental data with the neural network by adding the spatial data, such as latitude, longitude, and meteorological parameters in multilayers, and the temporal data, such as experiment time, to construct new datasets as training input for the prediction model.
Artificial intelligence originated in the 20th century and has been used in various industries, but it is seldom used in EDH prediction [27][28][29][30].MLP is a kind of artificial neural network (ANN) with a forward structure [46][47][48] that maps a group of input vectors to a group of output vectors.The MLP consists of multiple layers and their neurons are fully connected to the next layer.It has a high nonlinear global function and powerful adaptive and self-learning ability, which is suitable for finding the characteristics of EDH prediction data in multi-dimensions.Here, the MLP model is considered to implement the construction of the training network.

Principle of the MLP
The MLP has universal approximation property [46].Theoretically, an MLP network composed of a linear output layer and at least one hidden layer with activation functions can describe any function from a finite dimensional space to another with arbitrarily high precision with sufficient hidden neurons supplied.Each node in the MLP is the neuron with a nonlinear activation function, except the input node.MLP is an extension of perceptron, which overcomes the weakness of not recognizing linear non-fractional data.
Compared with the single-layer perceptron, the hidden layer of MLP changes from one to multiple.The training purpose of MLP is to make the network approximate the function that needs to be fitted.During the training process, the information is carried out from the input layer to the hidden layers and then to the output layer.The input layer is responsible for receiving the characteristics of the training data and is connected to the hidden layer with weight parameters.In contrast, the output layer is the target value that the training is expected to achieve through the hidden layers to realize the nonlinear mapping of the input space.
A typical MLP training process is as follows: (1) the weights are randomly allocated; (2) the neural network is activated by using all features of the training datasets from the input layer and then the output value is obtained through forwarding propagation; (3) the error is calculated between the output and the target value and the weight is updated by backpropagation; (4) the training is repeated until the output error is lower than the established standard.The trained MLP network can accept new input datasets at the end of this process.

Modeling
The essence of prediction is complex regression function construction and a multidimensional EDH prediction model can be constructed with the "extra data" in the experi-ment.The essence of MLP is also a nonlinear function mapping from input vector to output, similar to the model we tried to train.The advantages of MLP in learning and in processing nonlinear global data may solve the regression problem of meteorological characteristics.With a reasonable network structure and hyperparameters combined with enough training data, the performance of MLP can be excellent compared with the theoretical model.
While constructing the complete dataset of the MLP model, spatial data, temporal data, and meteorological data at multiple altitudes, including temperature, pressure, wind speed, and RH at the data measurement location of the MAGIC field campaign, were collected as a set of modeling data.To complete a comprehensively trained network for the validation of testing datasets and the generality of the method, we should select the training dataset that would cover the main features of the total dataset.A commonly accepted hold-out approach [49] is a 7:3 ratio between the training and the testing set.Namely, the training and the testing set proportion is 70% and 30% of the total dataset.Therefore, the first 334 groups of data in about 12 round trips were selected and randomly reordered as training datasets.The remaining 142 groups of data were used as a testing dataset.
By classifying and selecting the corresponding parameter information with the training datasets, meteorological-MLP-EDH, spatial-MLP-EDH, temporal-MLP-EDH, spatialtemporal-MLP-EDH, and multilayer-MLP-EDH models were constructed.The modeling process is shown in Figure 5. (1) Meteorological-MLP-EDH As a comparison with the theoretical models, the meteorological-MLP-EDH model takes the temperature (T) and the pressure (P) at the sea surface (h 0 ); temperature, wind speed (U), and RH in the air (h 1 ); measured altitudes in each training dataset as the input parameters.The mapping output is the corresponding EDH and the calculation function can be expressed as (2) Spatial-MLP-EDH The spatial data such as latitude and longitude may positively affect the prediction results of the experiment ship completing 20 round trips.The input vector of the spatial-MLP-EDH model takes the same parameters as the meteorological-MLP-EDH model.Furthermore, the spatial parameters of the experimental positions with latitude (λ) and longitude (ϕ) in the MAGIC campaign are also used as additional information to supply the feature of the selected meteorological parameters on a complete path.
(3) Temporal-MLP-EDH The radiosonde data were collected every 6 h in the MAGIC campaign.The temporal information may have a positive effect on prediction accuracy.With the measured time (UT), new datasets can be collected to construct the temporal-MLP-EDH model, implying the feature of the selected meteorological parameters at the specific time.
(4) Spatial-Temporal-MLP-EDH The spatial-temporal-MLP-EDH model is a three-dimensional regression function consisting of spatial-temporal information and meteorological parameters at a single layer at sea surface and air.The input values supply the feature of selected meteorological parameters at a specific time on a complete path.
(5) Multilayer-MLP-EDH In addition, we constructed a four-dimensional regression function multilayer-MLP-EDH with meteorological parameters located on another layer.The sensitivity and accuracy of predicted results have been explored.The input vector mainly consists of spatialtemporal information and meteorological parameters at multiple layers with the altitudes of h 0 , h 1 , and h 2 , which implies the feature of selected meteorological parameters over a wide vertical range at a specific time on a complete path.
During the modeling process, the design of MLP and the selection of corresponding parameters will also greatly influence the prediction accuracy of the training data, so the related parameters need to be adjusted systematically.The section for MLP design mainly includes the activation function, loss function, optimization algorithm, and network structure [50].
(1) Activation Function In the hidden layer of MLP, the activation function is to introduce nonlinear changes to enhance the approximation ability of the neural network [28].It uses differentiable functions and a back-propagation algorithm for effective learning.The most commonly used activation functions include rectified linear unit (ReLU), logistic sigmoid function, radial basis function (RBF), etc.In this paper, ReLU was used so that it can be tuned in a biomimetic way.The problem of gradient explosion and gradient disappearance is avoided by more efficient gradient descent and backpropagation [51].ReLU function can be expressed as ReLU(x) = max(0, x) where x is the input data that the neuron received.
(2) Loss Function In addition to the activation function, the loss function also needs to be defined to evaluate the difference between the output of the current network and the expected result.The network will update the weight parameter automatically according to the difference so that the whole network can fit the nonlinear mapping relationship as much as possible.
The general loss function mainly includes mean squared error (MSE), cross-entropy (CE), etc.The CE function is usually chosen when facing the problem of image classification and recognition.MSE is mainly used to deal with data prediction and inversion, as in this paper, and its calculation function is where n is the dataset number, y i is the measured EDH, and y p i is the predicted EDH.

(3) Optimization Algorithm
The original intention of the optimization algorithm is to define the parameters to be optimized, to create the objective function, to set the learning rate, etc.Then, the descent gradient is calculated and iterated according to the gradient.
Stochastic gradient descent with momentum (SGDM) and adaptive moment estimation (Adam) are the most commonly used optimization algorithms [52].SGDM can reach the optimal global solution, but it has strict requirements on the learning rate and is easy to stop at the saddle point, which is suitable for reliable initialization parameters.Meanwhile, with the progress of training, the speed of the SGDM method will slow down and the learning rate needs to be manually adjusted.Sometimes, it will converge to the optimal local value and the training results will also be affected.Adam has the advantages of fast speed, small memory requirements, and adaptive learning rates for different parameters.It is good at handling sparse gradients and non-stationary objects and is more suitable for large datasets and high-dimensional spaces to be processed in this paper.Using the Adam function will eventually converge to the optimal global value by automatically adjusting the learning rate.Therefore, the Adam function is finally selected as the optimization algorithm given the inversion problem to be solved in this paper.The initial learning rate is set as 0.0001.
(4) Network Structure MLP introduces one-to-multiple hidden layers based on the single-layer neural network; the appropriate hidden layers can be selected according to the original intention.
For the data input module, the hold-out method [49] was used to randomly divide the 476 sets of measurements into fixed mutually exclusive datasets; the proportion is 70% in the training set and 30% in the testing set.To avoid the impact of deviations introduced in the partitioning process, we tried to maintain the spatial and temporal consistency of the training and the testing set.The first 334 groups of data in about 12 round trips were selected and randomly reordered as training datasets and the remaining 142 groups of data were used as a testing dataset.
The selection of hyperparameters is complicated and engineering work and network hyperparameters, including the hidden layers, the number of neurons in each layer, the batch size, and the number of training epochs, are introduced during the modeling process.
The number of hidden layers is essential to the hyperparameter in the MLP design, which is directly related to the function approximation capability of the network.However, excessive hidden layers may lead to overfitting by learning extra characteristics of the training datasets.Therefore, in the parameter adjustment experiment, we explored the parameter ranges during the parameter selection: the number of hidden layers (1-8) and the number of neurons per hidden layer (1-300).In the end, we selected an MLP with four hidden layers by a large number of computer experiments; the neurons in each layer were 100, 50, 20, and 5, respectively.
When constructing the EDH prediction model, it is necessary to consider that its design performs well on training data and can generalize on new input datasets.A deep learning model with too many parameters and few training datasets is easily overfitting during the training progress.The specific performance of overfitting is as follows: the loss function of the model is small in the training data and the prediction accuracy is high; however, the loss function of the testing data is relatively large and the prediction accuracy is low.In deep learning, regularization strategies are designed to reduce test errors, which may come at the expense of increasing training errors.
(1) Early stopping The regularization strategy most commonly used in deep learning is called early stopping.When the training has sufficient representation ability and even overfits the model, the training error will gradually decrease with time, but the verification error will rise as a consequence.The early stopping strategy means storing a copy of the model parameters after each validation error improvement.The algorithm terminates when the validation error does not improve further within a predetermined number of cycles.
(2) L2 regularization L2 regularization is one of the means to prevent overfitting.The model complexity is controlled by limiting the parameter range space, thus overfitting can be avoided.In this paper, L2 regularization is adopted for the convenience of derivation and optimization.
(3) Dropout Dropout can be a choice for training deep neural networks.The concept of dropout makes the model more generalized by stopping the activation of a particular neuron with a certain probability, thus it will not fully connect to local features.In addition, the interaction between neurons in the hidden layer can be reduced.

Results and Discussion
To evaluate the accuracy and improvement of the prediction model, we introduced three evaluating standards as follows: (1) bias, which reflects the deviation from the measurements; (2) variance, which reflects the stability and robustness of the prediction model; (3) improvement, which reflects the enhancement compared with the original model.Meanwhile, three performance indexes were also introduced to measure better the bias of multi-dimensional EDH prediction models: the RMSE, the mean absolute error (MAE), and the coefficient of determination (R 2 ).In addition, the variance of prediction error (Var) and the improvement (σ) are also used to assess the accuracy of predictions.The definitions and characteristics of these indexes are listed in Table 4.

Index
Definition Characteristic Evaluate the absolute deviation between the predicted value y p i and measured value y i , it is not susceptible to extreme values, where n is the number of samples.
Evaluate the conformance of fitting the estimated regression equation, it indicates the degree of linear correlation between the predicted and measured value.
Evaluate the deviation of the prediction error e and the stability of the accuracy of the predictions.

Generalization Performance of Spatial-Temporal Models Based on MLP
In order to better analyze the robustness of the trained model, testing datasets were used for prediction accuracy analysis.Meanwhile, the number of floating-point operations (FLOPs) is utilized to compare the computational load of the algorithm, considering that the number of input parameters used for models ( 9)-( 13) is different [53].The analysis results are shown in Figures 6 and 7 and Table 5.
where I and O are the input and output neuron numbers.
It can be seen that: (1) In Figure 6a, the trained meteorological-MLP-EDH model with the same input parameters as the NPS model has a better-matched degree with the measured data.The RMSE decreases from 4.67 m to 2.15 m and the percentage improvement reaches 54.00%.In addition, the MAE and variance all improve, while the coefficient of determination R 2 remains at a low level with the promotion of the MLP.The RMSE of the meteorological-MLP-EDH model exceeds 2 m so that the maximum variation of transmission loss at 500 km could exceed 120 dB, according to Figure 1.(2) The prediction curve of the model fits much closer to the measurements by continuously adding spatial information (such as latitude and longitude) and temporal information (such as UT).The blue bar in the diagram, which symbolizes absolute deviation, gradually decreases.While the RMSE in Figure 6d has been greatly improved, the RMSE of the spatial-MLP-EDH, the temporal-MLP-EDH, and the spatial-temporal-MLP-EDH is 1.84 m, 1.75 m, and 1.54 m, and the coefficient R 2 has also made furtherly progress.The corresponding percentage improvement reached 60.53%, 62.53%, and 66.96%, respectively.Notably, introducing spatial and temporal parameters has little effect on the variation results.In Figure 6 and Table 5, the spatial-temporal-MLP-EDH essentially agrees with the measured EDH, but it still fails to match the local maximum.Overall, the model has excellent generalization ability after training 70% of the original datasets and maintains good consistency in the testing datasets.Overall, the EDH prediction model based on MLP can maintain good consistency with the measurements at a large scale.However, a significant difference exists when predicting small-scale fluctuations, such as local maximum and minimum.Moreover, an optimization model with low bias and variance is always preferable based on MLP.
The training and testing datasets were collected from 20 repeated trips along one path and the experiment time was covered diurnal cycles.With the spatial parameters of the experimental positions with latitude and longitude and temporal parameters every 6 h in the MAGIC campaign introduced, the prediction accuracy of the model is gradually improved, indicating that the spatial and temporal variability is significant.By extracting much more "hidden information" from "extra data" in the training process, the spatial variability of the three-dimensional geographic information and the temporal variability of the diurnal cycles are repeatedly learned and memorized based on the MLP method.With the constructed multi-dimensional deep learning model, the geographic and time domain feature can be extracted, which supplies an improvement in EDH prediction.The variance achieves another reduction with meteorological parameters in multilayers and decreases to 1.02 m.The trained model can match the trend of measurements at a large scale; meanwhile, the maximum and minimum values of the measurements at a small scale can also become significantly matched.According to statistical analysis, the RMSE of the multilayer-MLP-EDH method reached 1.05 m and the improvement percentage reached 77.51%, compared with the NPS model.Furthermore, the computational load of this algorithm (FLOPs) is 2.27 times as much as the meteorological-MLP-EDH model, which reached 8.64 × 10 13 .
The overall trend of predicted EDH by NPS differs significantly from the measurements, mainly because air-sea coupling conditions limit the NPS model.The prediction accuracy is hard to maintain when the air-sea temperature difference (ASTD) is greater than 0 [14].Table 6 shows the statistical RMSEs in stable and unstable conditions, and the prediction error of the NPS model increases when ASTD > 0. The multilayer-MLP-EDH method is maintained in RMSE, which reflects the consistency of the proposed method in dealing with different conditions.As shown in Figure 9, the fitting line between the predicted and the measured data changes from y = 0.42 x + 5.52 of the NPS model to y = 0.93 x + 0.99, close to y =x.Therefore, this method has better operability by setting meteorological instruments at two different heights (the cabin and the deck, for instance).The EDH predicted error could reach nearly 1 m combined with the sea surface meteorological parameters.The predicted RMSE of EDH of the theoretical method is 4.67 m, which may lead to the uncertainty range of path loss exceeding 120 dB at the 500 km transmission range.For instance, as shown in Figure 1, the path loss can increase from the original design of 179.12 dB at a predicted EDH of 11 m to exceed 300 dB at a true EDH of nearly 6 m.This state will leave the transmission system in an unstable situation.However, a significant improvement arises when single-layer models based on MLP become involved.The predicted deviation of EDH decreases to 1.54-2.15m, corresponding to a path loss variation from 162.20 to 213.12dB.The prediction accuracy of the evaporation duct channel continues to improve with multilayers.Furthermore, the uncertainty of path loss is reduced by 16.92 dB on the single-layer models.Therefore, the multilayer-MLP-EDH model can be essential in designing a communication system using the evaporation duct.
Table 7 provides a summary and comparison of the performance of EDH prediction, with the AI method introduced.From the comparison results, the four-dimensional regression function multilayer-MLP-EDH with meteorological parameters located on another layer proposed in this paper has the advantage of extracting the spatial-temporal information and the meteorological parameters at multiple altitudes in the training process.At the same time, a wider application range, higher precision, and model generalization are also achieved.Furthermore, the proposed model has great potential for enhancing the communication quality, reliability, and efficiency of ducting in evaporation ducts.

Conclusions
Low altitude atmospheric refractive conditions significantly affect the performance of shipboard communications at sea and near shore [12].The accurate prediction of the EDH is thus crucial in the demonstration, design, development, operation, and maintenance management of the communication system under this mechanism.Based on the MLP deeplearning method, the multidimensional deep-learning model was proposed to improve the prediction accuracy of EDH.First, the meteorological-MLP-EDH model was designed, which improved the prediction accuracy by 54.00%, with the same input parameters as the NPS model.The spatial-temporal-MLP-EDH model has gone one step further by superimposing the spatial-temporal "extra data" in the experiment.As a result, it can be essentially in agreement with measurements at large scales and the predicted RMSE is 1.54 m, with a 66.96% percentage improvement compared with the NPS model.Lastly, the multilayer-MLP-EDH model with the temperature, RH, and wind speed at an additional height was trained, significantly matching measurements at large and small scales.According to statistical results, the predicted RMSE can reach 1.05 m and the percentage improvement reached 77.51%.
The proposed model in this paper can break through the limitations of theoretical models by extracting much more "hidden information" from "extra data" in the training process, significantly improving EDH prediction accuracy.As a result, the proposed model has great potential for enhancing the communication quality, reliability, and efficiency of ducting in evaporation ducts.
The models constructed in this paper are based on 476 sets of MAGIC data in the Pacific Ocean; the training and testing datasets are limited to a sea area of 21.2197 • N, 33.6001 • N, 118.3299 • W, 157.7416 • W at specific experimental time intervals.Future experiments should be performed to more completely validate the models.In addition, measurements should be made at comprehensive coverage, massive data acquisition, and high spatial and temporal resolution to improve the constructed model.Furthermore, the distribution of EDH in high precision, detailed resolution, and broad coverage with the improved proposed model would be valuable to the communication system using evaporation ducts over the ocean.

Figure 1 .
Figure 1.The transmission loss diagram at 12 GHz with a distance of 0-500 km.

Figure 2 .
Figure 2. The spatial and temporal distribution of MAGIC measurements.

Figure 3 .
Figure 3.The calculating method of the EDH.

Figure 4 .
Figure 4.The EDH fitting line by the MAGIC datasets and inverted by the NPS model.

( 3 )
The statistical results in Figure7show the deviation variation of the abovementioned models.The box of each frequency represents the upper and the lower quartiles of the deviations and the horizontal line in the middle of the box is the median of deviations.The black line connected with the colored box shows the confidence interval of the deviations.Diamond symbols of corresponding colors represent outliers that deviate from the confidence range.The variation range of each model changes on a small scale, but the median value of deviation changes from -1.57 m of the meteorological-MLP-EDH model to 0.13 m of the spatial-temporal-MLP-EDH model, which is essentially in agreement with the measurements on a large scale.

Figure 6 .
Figure 6.Generalization performance of the EDH prediction models based on MLP.

Figure 7 .
Figure 7. Statistical results of EDH prediction models based on MLP.

4. 2 .
Generalization Performance of Multilayer Model Based on MLP The RMSE of the spatial-temporal-MLP-EDH model has improved to 1.54 m and the parameters as a coefficient of determination and variance of prediction error still have room for improvement.Adding the temperature, RH, and wind speed at an additional height, new datasets with the original parameters may improve prediction accuracy.The comparison results with measured data, EDH predicted by the NPS model, and the multilayer-MLP-EDH model are shown in Figure 8.

Figure 9 .
Figure 9. Scatter plot of measured EDH against modeled EDH using the multilayer-MLP-EDH model.

Table 1 .
The prediction methods of evaporation ducts.

Table 2 .
The dataset preprocessing process.

Table 3 .
The statistical RMSEs of three theoretical methods.

Table 5 .
Equation and characteristics of the performance criteria.

Table 6 .
The statistical RMSEs in different conditions.

Table 7 .
Comparison of different models in EDH prediction.