A Deep Learning Method for Short-Term Dynamic Positioning Load Forecasting in Maritime Microgrids

Featured Application: Application of deep learning techniques to dynamic positioning in maritime microgrids for power management system. Abstract: The dynamic positioning (DP) system is a progressive technology, which is used in marine vessels and maritime structures. To keep the ship position from displacement in operation mode, its thrusters are used automatically to control and stabilize the position and heading of vessels. Hence, the DP load forecasting is already an essential part of DP vessels, which the DP power demand from the power management system (PMS) for thrusting depends on weather conditions. Furthermore, the PMS is used to control power generation, and prevent power failure, limitation. To perform station keeping of vessels by DPS in environmental changes such as wind, waves, capacity, and reliability of the power generators. Hence, a lack of power may lead to lower DP performance, loss of power, and position, which is called shutdown. Therefore, precise DP power demand prediction for maintaining the vessel position can provide the PMS with su ﬃ cient information for better performance in a complex decision-making process for the DP vessel. In this paper, the concept of deep learning techniques is introduced into DPS for DP load forecasting. A Levenberg–Marquardt algorithm based on a nonlinear recurrent neural network is employed in this paper for predicting thrusters’ power consumption in sea state variations due to challenges in power generation with the relative degree of accuracy by combining weather parameter dependencies as environmental disturbances. The proposed method evaluates with three traditional forecasting methods through a set of practical real-time DP load and weather parametric data. Numerical analysis has shown that with the proposed method, the future DP load behavior can be predicted more accurately than that obtained from the traditional methods, which greatly assists in operation and planning of power system to maintain system stability, security, reliability, and economics.


Introduction
Operation in deep waters is one of the most challenging in marine industries due to the risk of maneuvering in sea conditions. To avoid using anchor chains, the speed and position of vessels must be for the DP vessel is that the crucial load, such as thrusters demand in rough to extreme sea state, might have an upper power consumption ramp rate than the capacity of the power generators and can potentially increase the risk of blackouts. To avoid losing the position due to power blackouts in high sea state, DPS has superiorities than other key loads that exist on the bus feeders. Typically, DPS reduces the thruster's power dramatically to avoid dynamic instability in power systems [41]. For example, the uncertainties in electrical power demand for the power management system (PMS) to supply power for DPS in maritime operations. In marine the vessels, the PMS is an advanced controller to define the optimum power flow according to DP power demand in the future operating condition. In the shipboard power plants, PMS is employed to control the generator, prevent power failure, constraints, load flow, and shedding. The DPS is given the existing power by PMS to compute corresponding commands force and direction for the thrust allocation system. Consequently, unknown DP load demand due to uncertainties of sea conditions in the future may be a problem for PMS to optimize power generators [42].
To provide high-quality energy for safe and economical entry into ships, ship owners face many technical and economic problems, such as the operational planning and control of the shipboard power systems. Optimization methods with the considerable cost-saving prospect are used for the optimal planning and operation of dynamic loads and modern drive systems [43]. To approach this target, predicting future thrust force is crucial requirements. It is therefore very important to predict the power consumption of the thrusters. The DP loading estimation period can be for one year or one month for long-term load forecasts (LOTLF), medium-term load forecasts (METLF), and one hour or a day ahead for the short-term load forecasts (SOTLF). Typically, the proposed forecasting methods are applied to determine production capability or power generation, distribution, program planning, and maintenance schedules, etc. [44,45]. The SOTLF is essential to control and plan for the DP shipboard power system, which can be used as input for the power flow or fault analyses. The DP power demand forecasting can consider the impact of climate changes on the power system because of the characteristics of thruster consumers. The previous method for load forecasting performance by Fourier series or tendency curves regarding time functions. Moreover, an automatic regressive-moving average (ARIMA) method has been proposed to establish the load behavior [46]. Furthermore, they are used as an exponential smoothing model for METLF in the existence of power demand predictions.
The authors in [47,48] proposed a time variable, linear estimation models, such as automatic regression (AR) and an automatic variable for modeling uncertainty and load forecasting of consumer demand. In [49], the author used the time-frequency technique known as an empirical decomposition for analyzing the short term power predicting based on the historical load data. The time series modeling is explained in terms of ARIMA and multiple analyses for SOTLF [50]. In the presence of a large data set, an artificial neural network (ANN)-based SOTLF model is used in [51,52]. In [53,54], an ANN-based Levenberg-Marquardt (LM) backpropagation (BP) algorithm for SOTLF and METLF is used. The METLF-based ANN model is discussed using a monthly historical data set [55]. A fuzzy based NN based BP algorithm with particle optimization method is used for METLF [56,57].
Another combination load forecasting method which imposes in [58] as statistical and physical constraints. In [59], BP-based METLF analysis features such as complexity of parameter, operating speed, and convergence have been proposed. Moreover, the linear, statistical, dynamic, fuzzy logic and AI-based models are similarly used to predict wind speeds. Extensive analysis of wind speed and electric load prediction is investigated in literature [60] for LOTLF based on nonlinear recurrent neural networks (NRNN). In addition, the models are minimized by using gradient descent (GDC) algorithms to decrease the NN error signals locally. Furthermore, researchers in time series modeling techniques should determine the significance of the selected input parameter and their patterns in the data. Moreover, in [60], the authors are reviewed wind predicting methods by using the NN based automatic regression with recessive exogenous (ARX), automatic regression moving average with recessive exogenous (ARMAX), nonlinear automatic regression with recessive exogenous (NARX), and NN based LM-BP algorithms to regulate the weight of neurons for SOTLF of wind.
In this paper, the concept of deep learning techniques is executed based NN-NARX algorithm as a recurrent network to predict the SOTLF in DPS for operational planning, and optimal dispatching between distributed generators. The precise DP demand forecasting and power estimation for compensating the vessel motion are considered. In this method to provide the necessary information about load changes for better decision-making of the PMS in sea disorders. Hence, the DP load profile is predicted by applying the autocorrelation methods as the climate-sensitive load, which is demonstrated by the proposed technique to overcome the difficulty of previous methods. With the aim of this, an LM-BP-NARX network with external inputs as hybrid modeling is proposed in this paper for SOTLF. The sea disturbances as exogenous inputs are considered to enhance the precision of DP load prediction. The suggested method is used for predicting the hourly, daily, and sea state pattern recognition for short-term thrusting power, that the LM-BP is employed to train the weights of NARX neurons collection. The performance of the proposed method is demonstrated and compared with three traditional forecasting methods through a set of practical real-time DP load and weather parametric data. Furthermore, a literature review in [61] is presented for the comparison between the conventional and intelligence control methodologies for the DPS controller structure improvements in offshore ships.
This paper is organized as follows. Section 2 reviews the artificial neural network. The proposed method is described in Section 3 to train, validate, and test of predicted DP load in different sea states. Section 4 shows the evaluation performance of the proposed method. Finally, Section 5 terminates a brief survey of the proposed work.

Review of Artificial Neural Networks
The concept of artificial neural networks has been developing considerably. Due to the nonlinearity nature of the neural network, it can learn deeply from the unknown environments and globally estimate the property of neural networks in the controlled methods which are extremely appropriate for solving problematic signal processing. The challenge of measuring the area of neural networking models is to classify the NN configurations, which have been designed to forecast actual model problems. It is essential to recognize the nature of the DP problem. Therefore, the neural network is suitable and can be used for solving the DP problem. Furthermore, it is also significant to evaluate the effect of neural networks on efficiency and reliability. Another main subject is to assess the NN learning models and features and determine those algorithms for solving signal-processing problems [62,63].
An ANN is a computing system consists of several simple and strongly interconnected processing elements that process information by reacting to its dynamic state to external inputs. The investigation of ANN models has recently accelerated because they have the potential to provide solutions to some of the problems that have been arisen in the standard series of computer science in the field of computer science and artificial intelligence. Neural networks are better suited for human-like performance in areas such as speech processing, image recognition, device vision, robot control, and more. Figure 1a shows an overview of a general feedback network which is commonly using in the ANN model. There are n inputs as shown in Figure 1a, X = [x 1 . . . x n ] with corresponding weights w 1 to w n . The inputs signal is defined with a fixed value as the bias term as shown in Figure 1a, where, b = 1, w = [w 1 . . . w n ] is the weight matrix of inputs vector [x 1 . . . x n ] T . Therefore, the NN net function is calculated as:

The Architecture of Multilayer Perceptron
The ANN can be determined as an exceedingly interconnected array of processing elements named neurons. An extensively ANN model used by highly neurons connection known as the multi-layered perceptron (MLP) for deep learning of neurons [63]. Figure 1b shows the MLP type ANN feed-forward, a layered network of neurons contains one or more input, output layers, and hidden layers. Hence, each layer in an MLP has numerous neurons, which each neuron in the hidden layers is connected to the neurons in the neighbor layers with the nonlinear activation function and various weights. The input layer signals defined by weights and flow via the hidden layers to arrive at the output layer passing through the nonlinear or linear transfer functions termed as an activation function. Most of the common activation functions for MLP include sigmoid and hyperbolic function as shown in Table A1 in Appendix A. Figure 1b shows a typical MLP design for the popular MLP configuration, in which connections are made only between the neurons of successive layers in the network. Each circle symbolizes a single neuron in this shape. The vertical structure of the neurons in the layers is defined as the hidden layer 1, the hidden layer 2, and the output layer. A neuron via hidden layers is connected to neighbor neurons with different layers between input and output. The input neuron x i connects to hidden neurons h i by weight w ij and h i connecting to output neurons y i by weight w jk . The multilayer perceptron executes mathematically nonlinear estimation by using sigmoidal functions as hidden units and linear weights. During the network learning, the weights of neuron vectors feature are continuously adjusted to reduce the error signal corresponding to the desired output, and the estimated network output. The circle's activation function in the MLP is given by the sigmoid function and is the standard activation function in feedforward neural networks. To compute the hidden-layer neurons output signal passing the summed input signal through a sigmoid function as shown in Figure 1a,b, where the sigmoid function is defined as f (u) = 1 1+e −u/t . Hence, the hidden layer is calculated as follows: To define the value of output neurons based on hidden layers output as represented in (1):

Back Propagation Algorithm
The proposed backpropagation training method is that how a neural network can adjust or tune the weight of the neurons itself to control a nonlinear dynamic system. The backpropagation learning technique is an extension of the Widrow-Hoff algorithm to adapt the weight of the synapses by error modification rule, which is termed as generalized delta rule (GDR) for training the layered perceptron weights. The sum of weighted inputs to the neuron passes the activated function, and the signal output is generated. The generated error signal decreases the difference between the target and the weights w ij of the network outputs error. The signal error at the output layer spreads regressively to the hidden layer and it goes to beyond the input layer. Due to the regressive propagation of the error signal, the GDR is named as the error backpropagation (EBP) algorithm.

The Back-Propagation Training Algorithm of MLP
The multilayered neural network consists of hidden layers and the training method is much more difficult. A key method to adjust the weights of neurons is spread over an MLP model. A single-layered MLP structure shown in Figure 2 contains a single neuron to present the back-propagation training procedure. According to Figure 2, a neuron has been divided into two parts. The first summation part computes the neuron function u, and the second part is the nonlinear activation function  The network output ∧ y is compared with the desired output value y, and their difference is calculated as an error signal e = y − ∧ y. Assumed the inputs and outputs samples are {(x(n)),(y(n)); 1 ≤ n ≤ n}, and the error BP training for feeding n inputs through the MLP network and calculates the corresponding output { ∧ y; 1 ≤ n≤ n}. Hence, the initial estimate for the weights vector matrix is represented by a summation square error as follows: The square error E b in (4) regulates the weights matrix w to decrease the error signal. Fundamentally, these algorithms are defined as follows: where w i (t + 1) is the modified value of the present weights of w i (t), and ∆w i (t) is the form of the differing algorithm in (5). This leads to a problem of optimizing non-linear squares. To reduce the square error E b , the weights matrix-vector is to be adjusted. To solve the problem in (4), there are several types of nonlinear optimization algorithms. Since this is a nonlinear mean square optimization problem, the gradient descent (GD) algorithm regulates the weights corresponding to the gradient error as the basis of the EBP learning algorithm. The derived scalar quantity E b concerning discrete weights can be defined as follows: Hence where δ(n) is the error signal e = y − ∧ y reduced through the derivative of the activation function. Therefore, the number of modifications is required to be adopted to the weight w i for the given input x i (n). Consequently, an MLP with a single layer of the neuron is adjusted based on the overall weight change in (5) as the following formula: where δ(n) is the error signal of output, as determined by: To perform the tuning of the weights in MLP, new symbolizations in Figure 3 are assumed to individual neurons at diverse layers. The network active function and their outputs corresponding to the training term kth of the jth neuron in the (l − 1) layer are indicated by u(n) respectively. The output of the previous neuron is passed through the ith neuron of the lth layer as a synaptic weight member w l ij (t) as shown in Figure 3. ∂w ij l , as gradient error (GE): (12) In (12), the output ∧ y(n) (l−1) is estimated through spreading on the kth train model of x i (n)th input of MLP weights is adjusted into w l ij (t), and the delta error (DE) phrase δ l i (n) has to be calculated. Figure 4 illustrates how to delta error δ l i (n) is iteratively computed from δ l+1 m (n) and the (l + 1)th layer weights. Where ∧ y(n) l i is passed through all m numbers of neurons in (l + 1)th layer of NN. Hence, the error of BP formula is defined by (13) which calculates the DE from the output layer rearward to the input layer, overtaken the hidden layers.
The delta error will be updated according to the adapted formulation as follows: where η and µ are named as the learning ratio and the momentum continuous parameter, respectively. To define the past weight changes effect in (14), the second term of the equation is the gradient of the mean square error corresponds to w l ij (t). Moreover, the third term is defined as a momentum term that gives a procedure to reduce the mean-square error by adjusting the weight size adaptively. The last term in (14) is a minor random noise rate that will have a slight effect since the second or third terms have higher magnitudes. Giving noise to the NN during training will help the learning algorithm more robustness, which results in decrease generalization error, and faster learning.

Recurrent Dynamic Neural Network
The structure of the learning algorithm and weights adjustment at the input layer of MLP feedforward networks for dynamic networks have been so far discussed. The nonlinear autoregressive network with NARX is a recurrent dynamic network, with feedback connections enclosing several layers of the network. The NARX model is defined on the linear ARX model [64], which is used commonly in time-series modeling. ANN is employed in various applications such as smart grids. For example, NN based NARX is implemented in [64] to estimate the electrical load demand, wind speed, and state of battery charging (SOC). Furthermore, a technique is proposed in [65,66] to predict the PV power generation by applying the NN-NARX in smart grids. Choosing an accurate category of the NN is very essential and it is specified the type of system nature. Especially, the network type is influenced by the correlations between inputs and the outputs system nature. Accordingly, the feedforward neural networks system is applied when the relationship between input and output is static. On the other hand, when the relationships depend on the weather conditions and sea state parameters as external inputs. For this reason, we have proposed the recurring neural network as a proper superior due to memory functions that are embedded in their neurons. The NN-NARX is classified as a recurrent neural network (RNN). To compare with the other conventional RNN such as time series-NN, nonlinear automatic regressive (NAR) prove proposes that NN-NARX has characteristic advantages in speed convergence, accuracy, and capability of the deep learning algorithm [67]. Moreover, it should be mentioned that the other types of RNN might be also executing well, but due to environmental disturbances, such as sea conditions as external disturbances, which can affect inaccuracy of the load demand prediction in real-time or online conditions. As a result, it is also important to consider these disturbances as external inputs during neurons weight adjusting as we have noted in Section 3. Moreover, we should mention, the comparative performance of the three different kinds of RNN methods in the paper is investigated. To illustrate that this type of RNN is precise for solving the PMS challenges in DP vessels, we are used the NN-NARX model. For achieving this purpose, we have trained the NN with the offline procedure. Hence, the speed of convergence is not a significant problem in this method but, the convergence speed with minor step size plays a key role to have a successful training process in the NARX application.
For many applications, recurrent or operational training processes are not executed, and they are applying offline. Nevertheless, a precise trained NN is anticipated. In the next sections, the deviation error concerning the evaluated and predicted values by the NN-NARX performs initially in the DP controller, then through the communications appears in the PMS system for dispatching economically. Therefore, due to the precision of trained NN, accurate operation of the expert NN, the sea state historical relationship between inputs and the output of the NN-NARX is executed in this work to predict DP power demand in the various weather condition. The results show that the proposed NN-NARX based on LM-BP has a precise operation to approximate the trusting force command as a sensitive load related to weather situations. Therefore, it persuades the concept of employing the NN-NARX in this paper. The initial mathematical correlation of the proposed method in the nonlinear time-amplitude scheme is defined as: where y(n, t), x(n, t) and F(x) represent kth output sample, kth input term, and nonlinear activation function, respectively. Furthermore, Td x and Td y are integer numbers of the inputs and outputs of the NN memorial order, where they are made known the tapped delay lines (TDLs) in the RNN, respectively. Hence, the equation value of the output y(n, t) in (15) is preceded by the previous values of the time delay tapped outputs and inputs. To estimate the output of the feedforward NN-NARX model, a nonlinear activation function f(x) is applied. As a result, for the training process of DP load, the preceding values of the NARX output, and input, a feed-forward series-parallel architecture is used as shown in Figure 5. Consequently, the desired output signal ∧ y(n, t) is defined in (16) based on previous values of the output signal and external input signal such as sea state as environment disturbances. Furthermore, the mathematical model of output of NARX is computed as: where ∧ y(n, t), b out , b hid , ∧ W out and f ∧ out are the predicted value of the outputs, bias vectors, hidden bias, output matrix of weights, and activation functions of the output layers, respectively. Additionally, W hid and f hid are the matrix of weights and activation functions of the hidden layers, correspondingly.
Typically, the NN feedforward learning algorithm based on LM-BP is formulated as [68]: In (17), ∇y n (w ij ) T and ∇ 2 y n (w ij ) are the vector of gradient and the error function of the hessian matrix, hence the dw ij is defined as: where ∇ 2 y n (w ij ) = J T J + S. The J is specified as the Jacobian matrix, which includes the initial network derived signal error, which is related to adjust the weights of the neuron. By considering the directional propagation algorithm, (17) is computed in (17), where S expresses the second-order derivative function ∇ 2 y n (w ij ). By ignoring the S in (17), the equation is solved by the Gauss-Newton method. We can also obtain results for the hessian matrix by the following approximation in (19).
To define the size of the µ coefficient of I in (19) as the unit specify matrix, a scalar quantity is applied. Hence, The LM-BP algorithm is used to solve the BP scaling problem, which frequently tends to minimize the converge of the local error in preference to the global error. Moreover, BP has appropriate performance on the simple training problem; on the other hand, the BP performance will be decreased, by increasing the data and data-sizing complexity, therefore BP cannot perform discontinuously. For DP load predicting, we have faced in the environmental disturbance similar problem.

The Proposed Method
As we mentioned in Section 2, to justify the NARX equations as the proposed method and its computational procedure in (15), the following dynamic load values are used for one-day ahead prediction, where y(n,t) = {y(n,t) ∈ t = 1, 2, . . . , 24} as the present DP load demand at day n, time t, ∧ y(n, t) = { ∧ y(n, t) ∈ t = 1, 2, . . . , 24} as the predicted DP load demand at day n, time t by the NARX network, w(n,t); is the weight vector at day n, time t and the F(U) is the nonlinear vector function representing ANN. Furthermore, the Table 1 shows the environmental disturbances according to Figure 6 where the W h1 (n, t), W V2 (n, t), W S3 (n, t), W D4 (n, t), D(n, t), H(n, t) are determined as wave height (m), wave swing (deg), wind speed(m/s), wind swing (deg), head swing (deg), and ship displacement (deg) as external inputs in x(n,t) = {x(n,t) ∈ t = 1, 2, . . . , 24} in (15) and (16) respectively. Also tapped delay lines (TDL) values Tdy and Tdx for the NARX inputs which we have used for training the weights in (13)-(17) as shown in Figure 7 with daily DP load forecasting as 24, 48, 72, hourly from 1 to 24, and weekly 168 have been selected, respectively.

Parameter
Description Range DP L DP power (MW) y(n, t) 0-7 W h1 (n, t) wave height (m) heading deg (Ө) x 6 (n, t) 0-360 Figure 6. Historical data patterns of weather parameters, and Vessel displacement for the period one month of offshore operation. To train, validate, and test the proposed RNN method for DP load forecasting, we used real-time data from the noon report of IFDC [69]. The data noon has collected from the actual DP load profile for January 2020. The dataset also includes weather information of the sea state of the Caspian Sea, such as wave height, wind speed, wave, and wind directions, which are shown in Figure 6 and Table 1. The sea state parameters affect the thrusting power demand as a dynamic load profile, which changes with the vessel motion. To define the DP load-forecasting problem and optimal dispatching between generators in various sea conditions, we used an hourly sea state and DP load data to train the proposed NN model. The hourly data is the average of the 'thirty' 2 min dataset, which we collected each 2-min data set during the DP operation.
For that reason, a precise range of input parameters increases the DP demand forecasting in sea state more accuracy. Furthermore, we used the correspondence study on the historical data of thrust power consumption that determines the nonlinear and dynamic dataset of historical data, contains hourly, daily, and weekly in the DP load predicting model. The DP load demand dataset is analyzed and the load profiles are classified according to weather conditions in Table 2. The past DP load demands and the profiles, which are the current DP load, the weather condition, and ship displacement are included affect the current load. For instance, environmental disturbances such as wave, wind, and their patterns affect the DP load demand, which leads to vessel movements. Hence, the following dynamic propulsion model is forecasted essential load demand for one hour, one day, and one week for PMS according to the Caspian Sea state, where the data has collected in a DP ship. To make the NARX, NAR, and NN with TDL, as NN-Time Series schemes which are used in the MATLAB deep learning toolbox [70]. Furthermore, the percentage of external inputs as we revealed in Table 1 is defined as 70%, 15%, and 15% for training, validating, and testing, respectively. After the definition of the percentage, the number of hidden layers neurons are adjusted 25. Finally, according to the parameters which we have defined, the NN is trained and implemented offline to predict the desired target. Table 2. Weather parameters data patterns based on the code of sea states (data from [36]).  [15][16][17][18][19][20] [5.5-9.5] [5][6][7] According to the collected data, as shown in Figure 6, we observed a non-linear relationship between the sea state parameters, such as wind, wave, and hourly DP load profile. Therefore, we have modeled the sea disturbances on the DP load profile as shown in Figure 8a,b. Hence, we applied hourly data of wind and wave variations and ship displacement as external inputs (disturbances) in (15) and (16) for high accuracy and speed convergence of the NARX-NN to train weights in MLP based LM-BP in (17) for DP load forecasting. To analyze the performance of the training procedure by using the proposed method, we have validated and tested the trained data set based on actual thruster's power consumption. Figure 9 displays the regression plots of STELF for the hourly, daily, and weekly for training, validation, and testing epochs. The regression results show that the training, validating, and testing are fairly successful where the data sets are trained more than 0.99, validated and tested values of trained parameters are around 0.99 and 0.98 at the 11 iterations. In addition, the majority of the vast data of the training collection produces result output on the edge of the first-order line, which is correlated to the best fitting connection. Accordingly, the relationship between the proposed method and target fitting line has shown that it can have a fairly good performance concerning the actual and predicted DP power demands by using the LM-BP-NARX technique. Regression figures are achieved for every single training process similarly. In our case study, we evaluate LM-NARX, and the others time series NN based on following two performance measurement indicators are defined as follow: (a) mean absolute percentage error (MAPE), and (b) error variance (δ), given as:

Test Results and Discussions
This paper considers three scenarios to evaluate the performance of the proposed method. Scenario 1 is the operation in sea conditions where the DP load in different sea states is evaluated to ensure that the proposed strategy has higher performance compared with other time series-NN schemes. Scenario 2 is a weekly load forecast in various sea conditions where the performance of DP load in the weekly thrusting profile is analyzed, which indicates the prediction of DP profile changes for days of the weeks. Finally, in Scenario 3, hourly load forecasting is proposed to estimate the peaks of DP load for thrusting the vessel at the multiples of 24-h lags at rough to high sea conditions. In addition, the performance of the NN-NARX estimate in comparison with other categories of time series NN schemes. In the continuation of this section, the results of the three executed scenarios are compared with diverse types of predictors.

Scenario 1: Operation at Sea State
The performance of the proposed strategy to estimate average DP load patterns for the various sea condition for a month as shown in Figure 8a,b, the hourly thrusting load patterns during environmental disturbance from calm to high, such as peak load periods January 2020. We are observed that the deviations in the DP load demands for different days of the months. As shown in Figure 8b, the entire peaks load in the graph occurs in the rough to high sea conditions. Typically, the DP load demand is comparatively less on the calm-slight situation than the moderate weather condition. On calm situation days, DP load is lower than the other moderate, and rough conditions days. The results have obtained for four representative sea conditions. The weather conditions have classified as inputs, and output of the NARX model with the 168, 336, and 504 lead times as determined in Table 1 and Figure 6 for 31 days SOTLF based on sea conditions. The backpropagation learning constant and the momentum ratio in (14) have set to 0.75 and 0.1 as an initial guess, respectively. In Scenario 1, the regulating tolerance for the weights vector is selected as 0.005. The results are presented in Figure 10a,b and Table 3 in terms of the performance measurement indicators (PMIs) defined in (20) and (21). Accordingly, the average values of the MAPE and δ for LM-NARX-NN are about 0.08% and 0.13 MW, respectively. In the meantime, the maximum DP load demand error and percentage values belong to TDL-NN, which are 0.2 MW and 0.13%, respectively. In contrast, the NAR has a more accurate performance as shown in Table 3 with the δ of 0.17 MW and MAPE of 0.12% than those for the TDL-NN. However, as mentioned in Section 3, due to lack of external inputs the weather distortion cannot be considered during the offline network training.

Scenario 2: Weekly Load Forecast in Various Sea Conditions
As we have defined in Section 3, the proposed method is trained by applying the dataset for the previous seven existing days for the DP load profile. For instance, to predict the weekly load pattern, the delayed 7 × 24-h DP load profile is used as an output, and the next three latest days for thrusting demand, and environmental distortion of 72 h for inputs data, therefore we applied the numbers of TDL 24, 48 and 72 in (15) and (16), respectively. When the NN training has executed, then the NARX model progresses with one day ahead to forecast the upcoming of the 24-h DP load by employing the previous 72-h sea state data and parameters as given in Tables 1 and 2.
The total number of the input and hidden layers of neurons are adjusted as 7 and 25, respectively. To improve the weights vector fluctuations, one hidden layer is used with 25 different numbers of neurons and the weight vector convergence in a predefined tolerance. In this case, Table 4 has analyzed the average weekly DP load demand, which shows the changes in DP load profile through several days of the week. We have observed that the DP load profile has strong relation through weekdays, such as peak load periods in high sea state as shown in Figure 11a. In some of the weekdays, the thrusting demand is lower than the other weekdays due to the Calm-Slight sea state. The error percentage of DP load in MW presents in Figure 11b. The accuracy of the NARX and the minimum error percentage in Table 4 Table 4 show that for the NAR-NN it has a more accurate performance during the weekly prediction of DP demand in sea disturbances in a month rather than TDL-NN with δ of 0.16 MW and MAPE of 0.17%, respectively. Furthermore, the NAR-NN can predict the value of DP demand to approach approximately to that obtained from the NARX-NN. Nevertheless, as mentioned in Section 3, due to uncertainties of the sea and weather conditions it would be necessary to consider disturbances as external inputs to the NARX-NN for learning procedure to recognize the sudden changes in order to obtain better results.

Scenario 3: Hourly Load Forecasting
Another significant characteristic of the DP load profile has shown in Figure 10a which provides the autocorrelation function of the hourly DP demand over four weeks. The peaks of DP profile indicate for thrusting demand at the multiples of 24-h delays at rough to high sea conditions, which shows that the load demand at the identical hours has a high relationship with each other dependent on the sea state and hours of the days as shown in Figure 8a,b. The DP load demand is categorized into hours-day and sea state patterns. The weight vector has approximated at each time intervals by applying the previous DP demand dataset for each pattern as in (15) and (16). When the weights matrix at day n, time t has been determined in (14) and (17), the thrusting force command is predicted with the DP profile data set of previous days as well as the estimated DP demand data by the network for the same day at previous time steps in (16). Hence, the adjustment of the weights vector tolerance is decreased to 0.0006 and the dataset lengths Tdx and Tdy are defined to two. Therefore, the sea state and vessel movement parameters (total 7 inputs) into the NARX model at times t, t − 1, and t − 2 of the past two days to predicted DP demand for time t + 1 of the same day. The NN with 25 hidden layers with seven neurons is used in this case.
The results have analyzed in Table A2 with details on hourly based for one-day ahead forecasting and presented in Figure 12a,b by the performance measurement indicators in (20) and (21) Table A2. The proposed method illustrates significant accuracy, minimum error percentage, and reliability in thrust force estimating models compared with other conventional time-series patterns. To ensure the performance of the proposed method, the speed of convergence in the NN-training run-time is also tested and compared with those obtained from NAR and TDL-NN methods.

Conclusions
The DP load demand predicting is already an essential part of DP vessel controllers, which the power demand from PMS for thrusting depends on weather conditions. As a result, unknown DP load demand in the future has predicted based on sea state by NARX-NN model. In this paper, we have implemented the concept of deep learning techniques, which we employed the NARX-NN as a recurrent network for DPS for operational planning, and economically dispatching between distributed generators. Hence, accurate DP load and power prediction for fixing the vessel position in this method can help the PMS to better performance in a difficult decision-making procedure for all DP vessels. The LM-NN based NARX has evaluated on real DP load profile in a PSV for thrusting the vessel position in various sea states. The proposed method confirms significant precision and defensibility in predicting technique compared with other conventional NN-time series structures. Moreover, we have trained the NARX base on the environmental disturbances as external dataset inputs to enhance the accuracy of the DP load predicting in different sea states. Furthermore, we have modernized the DP controller to forecast load by NARX-NN as a dynamic neural network, including recruited delay lines that they used for nonlinear filtering and prediction. This innovative method provides an integrated DP controller with PMS to increase the prediction accuracy of the DP load demand with the aim of enhancing operational planning in different sea conditions. In addition, three scenarios are considered for hourly, daily, and weekly load forecasting in different sea states. By using the environment disturbances dataset as external inputs in the NARX-NN, which effects on DP load demand, the proposed method can allow the PMS to increase the accuracy of short-term load prediction and operational planning for DP vessel power plant.

Conflicts of Interest:
The authors declare no conflict of interest. Table A1. Summary of neural network activation functions.

Activation Functions
Formula a = f(u) Derivatives Comments