Prediction of Waterway Cargo Transportation Volume to Support Maritime Transportation Systems Based on GA-BP Neural Network Optimization

Abstract: Water transportation is an important part of comprehensive transportation and plays a critical role in a country’s economic development. The world’s cargo transportation is dominated by waterway transportation, and maritime transportation systems (MTS) are the main part of the waterway transportation system. The flow of goods plays a key role in the economic development of the ports along the route. The sustainable development of maritime transportation, the maritime transportation economy, and the environment therefore has great practical significance. In this paper, the principle of the BP (back propagation) neural network is used to predict the freight transportation volume of China’s waterways, and the genetic algorithm (GA) is used to optimize the BP neural network, so as to construct the GA-BPNN (back propagation neural network) prediction model. Experiments on collected and processed data of China’s waterway cargo transport volume show that prediction accuracy is significantly improved, which supports the reliability of the method. The experimental methods and results can provide reference information for the optimization, upgrade, and more scientific management of sustainable MTS in China and internationally, provide key information for port cargo handling plans, help optimize port layout, and improve transportation capacity and efficiency.


Introduction
Water transportation of goods is one of the main modes of modern transportation, and the MTS plays a key role in the water transportation system [1,2]. According to the navigation area of ships, waterway cargo transportation can be divided into inland waterway transportation and sea transportation, which are the basic forms of waterway transportation. Waterway cargo transportation refers to the activity in which the carrier collects freight between domestic coastal ports, between coastal and inland river ports, and between inland river ports, and is responsible for transporting the goods consigned by the shipper from one port to another by water. Sea transportation, another important form, uses ships to move goods between ports in different countries and regions through sea lanes.
Compared with railway and road transportation, waterway transportation uses natural rivers and oceans as transportation channels, so the energy and resources consumed per unit of transport are small. Waterway transportation also causes much less damage to natural resources and the environment and produces fewer harmful gases and other wastes, which makes it a relatively environmentally friendly mode of transportation. In recent years the low-carbon economy has developed steadily, and under the national carbon peaking and carbon neutrality goals, the relatively low energy consumption and low pollution of water transportation are clear advantages. With its large capacity, low energy consumption, and low pollution, water transportation is the best choice for "green shipping" and has promising prospects.
Water transport has several advantages, including (1) large single shipment volume; (2) low transportation cost per unit of mileage; (3) small investment per unit of route mileage and savings in land resources; (4) high labor productivity; (5) ease of international trade; and (6) long average transport distance.
The MTS is an important part of waterway transportation and plays an important role in a country's economic development [3,4]. Maritime transport is very important to the global economy, as it accounts for approximately 80% of global trade [5][6][7][8]. JS Park et al. argue that the regional economy is closely related to port throughput: a cargo port promotes regional economic growth only when its throughput is sufficient, and it has an inhibiting effect when throughput is insufficient. The economic development of a port can provide employment opportunities for the nearby area, reduce the regional unemployment rate, and promote the development of the regional economy, and the impact of the maritime transport system on the world economy is greater than that of air and land transport [9][10][11]. At present, the world economy is being hit by COVID-19, which has had a great impact on water transport, including a general increase in freight rates and effects on oil prices [12][13][14]. Ocean freight is one of the most important modes in the global transportation network. The maritime transport system comprises waterways, ports, and landside connections. The maritime transport system consists of 7241 nautical miles of routes. The MTS relies on the existence of sea lanes for maritime services and helps to ensure fair competition in trade and commerce. The MTS has multiple intermodal connections, such as airport-ferry connections and ferry-to-train connections. Many kinds of goods are transported by water, and bulk cargoes such as coal, petroleum and its products, mineral building materials, metallic ores, and nonmetallic ores occupy the main position. Water transportation and land transportation are closely related, so forecasting water transportation is necessary and also provides an important reference for land transportation.
The forecast of water transportation is based on the transportation needs of national economic and social development. Its tasks are to identify the goals and ways of developing transportation capacity, and to study the reasonable distribution of transportation volume among transportation modes and the construction of a comprehensive transportation network, which together form the basis for a reasonable structure of the transportation industry. For example, where waterway and railway transportation are combined, accurate forecasts make it possible to rationalize the network and the quantity of shipments.
Most of the world's freight is transported by waterway, so research on cargo flow through waterway transport ports has great practical significance. At present, highways, railways, and other transportation modes are developing rapidly, and water transportation faces fierce market competition; some shipping markets have been stagnant for a long time, and their prospects are not optimistic. Some shipping lines have excess capacity, shipping enterprises generally have low profits, and some even suffer serious losses. According to statistics from relevant departments in China, China invested in 4933 km of new railway lines in 2020. At the end of 2021, China's ports had 20,867 berths for production terminals, a year-on-year decrease of 1275, and China had 125,900 water transport vessels in 2021, down 0.7 percent year-on-year. According to the United Nations Conference on Trade and Development (UNCTAD) [8], global seaborne trade shrank by 3.8% in 2020. Due to the continuing impact of the COVID-19 pandemic and the situation in Russia and Ukraine, there is still some uncertainty in the market, and volatility will increase. Many ports can accommodate large ships, are well located, and have high route accessibility; these advantages give them high service capacity and provide a strong business basis for the maritime transport system. However, some ports are far from the main routes and have few direct connections to them, so their business capacity is low. According to the relevant literature and data analysis, a considerable number of ports have excess service capacity, especially under the impact of COVID-19. Oversupply is becoming increasingly obvious, and the risk of port congestion is increasing, leading to low utilization of some routes and ports.
However, the service capacity of some ports cannot meet transportation needs: port capacity is low, and ship congestion often occurs [15][16][17]. At the same time, because waterway freight is slow, cargo is diverted to road and railway transportation. Compared with railway and road transportation, the quality of shipping services by waterway is not high: delivery periods are long and the punctuality rate is low, which reduces customer satisfaction, costs enterprises their stable customer base, and weakens their competitiveness. Therefore, improving the vitality of shipping enterprises and promoting the healthy development of waterway transportation is a key problem that needs to be solved at present. It is necessary to forecast the volume of freight transported by waterways. Such forecasts help optimize port and route planning and distribution in the MTS, give managers information for optimizing the system, and help the government and shipping companies plan the transportation of goods and avoid transportation risks. Forecasting the volume of waterway cargo transportation is of great significance for ensuring that the transportation industry adapts to the development of the economy.
This significance has three main aspects: (1) the forecast can serve as a strong reference for government departments when adjusting the waterway transport capacity structure and formulating waterway transport plans and the corresponding macro shipping policies; (2) it provides guidance and a theoretical basis for shipping enterprises when choosing business scale and market investment direction; and (3) the forecast data provide important reference information and a decision-making basis for the sustainable MTS.

Literature Review
In terms of conventional forecasting methods, some scholars use the economic indicators of the forecast object as input variables to find the relationship between the variables and the forecast sequences, using methods such as principal component analysis and multivariate adaptive regression splines [18][19][20]. Niu Z et al. applied the grey model GM (1,1) to the short-term forecasting of railway passenger traffic, and the experiments showed that, rather than using GM (1,1) as a single forecasting model, the forecasting error can be reduced by using a combined model [21]. Scholars such as MiChaelw took the amount of grain transported by railway as the original data. Because the data are highly volatile, regression modeling is difficult, so a time series model was proposed and tested for predicting the data [22].
A variety of forecasting methods can be applied to the original time series, such as establishing forecasting models based on the analysis of factors such as outliers and situational changes, optimizing traditional forecasting methods, or using integrated forecasting models [23]. For example, SARIMA is a forecasting method based on time series. Farhan J used this model to predict the container throughput of international container ports, and the experiment demonstrated the validity of the SARIMA model [24]. Awah P C et al. provided a practical method for predicting the actual handling capacity and attracted maximum container throughput of ports based on time series through random forest (RF) and multilayer perceptron (MLP) models [25].
Many scholars use neural network methods to make predictions, mainly relying on the asymmetric principle of the BP neural network, and these methods are usually combined with machine learning. To improve the operational efficiency and energy efficiency of shipping, Zhi Yung Tay et al. analyzed the application of big data analysis and machine learning to port ships, using supervised and unsupervised machine learning systems to analyze and preprocess shipping-related data. The study shows that machine learning methods can handle complex data and sets out the advantages and disadvantages of supervised and unsupervised machine learning for operational efficiency and energy efficiency, which can provide a reference for mitigating the adverse effects of climate change [26]. Pocajt V et al. used selected sustainability indicators to predict the municipal waste generation (MWG) of countries with different development levels through a neural network prediction method. The experimental results show that the model is suitable for national MWG prediction [27]. Rahman et al. used different data-driven ANN models to forecast renewable energy and found that these models can be applied to future renewable energy forecasts [28]. Niedbała G. et al. used a neural network prediction model to predict rapeseed yield. The experimental results are reliable, and the yield can be improved by reducing the dosage of mineral fertilizers [29]. Barrera J M et al. used a neural network to predict the energy output of solar panels. The experimental results show that the model is suitable for predicting the energy output of the target solar panels and that the results are reliable [30]. Using the BP neural network algorithm and the MATLAB toolbox, W Jiang et al. proposed a new product reliability prediction model and used it to predict the reliability parameters of an example, obtaining a better prediction effect [31]. Taking the Baishuihe landslide in the Three Gorges Reservoir as an example, HT Long et al. used the BP neural network to predict landslide deformation. The results show that the predictions of the BP neural network model are highly accurate [32]. Lúcia Moreira et al. used data on ship routes to train a neural network to predict ship speed and fuel consumption. The experimental results show that the neural network prediction model has good adaptability and good accuracy in predicting ship speed and fuel consumption [33]. Tamara A. Volkova et al. used an artificial neural network to correct the position coordinates of a ship when it is close to a water structure, which then helped predict the trajectory of the navigation section during ship maneuvering [34]. Michalis Chondros et al. developed an artificial neural network model suitable for flood risk prediction in coastal areas, which is helpful to the development and utilization of ships and routes in the MTS [35].
However, most scholars use other methods combined with the BP neural network algorithm to make predictions before using BP neural network to make predictions, indicating that a single asymmetric neural network has certain shortcomings, and it needs to be optimized by combining other algorithms. For example, the container throughput prediction method based on the ARIMA-BP neural network by Zhang Y et al. can improve the accuracy of container throughput [36]. Zhang L et al. proposed a constrained optimization method based on the BP neural network in another study. By combining the fitting and optimization of the BP network, the application of the BP neural network was expanded. The optimization method is effective [37]. Arsad has established a performance prediction system based on neural network and linear regression with the students of Universiti Teknologi MARA as the research object [38]. Zhang Q et al. predicted traffic flow based on a wavelet neural network and IFOA's hybrid frame model (IFOA-WNN), which provided sufficient information for the formation of symmetric traffic flow. The experimental results showed that the model has higher prediction accuracy and stability [39]. Lee C Y et al. established a feature selection process composed of MIV to extract features as a feature database and used the PSO-BPNN model for fault diagnosis. The results show that the model is effective [40]. Juan Fang et al. constructed a deep neural network fusion-based collaborative filtering recommendation algorithm (CF-DNNF) to improve the recommendation performance of the collaborative filtering algorithm, and the experimental results show that the accuracy of the CF-DNNF model is significantly improved [41]. Zhao Y et al. combined the gray prediction model with the BP neural network model to improve the prediction accuracy of water traffic accidents, and the experiment proved that the Gray-BP model has less error, higher prediction accuracy and better stability [42]. 
Cheng W et al. used particle swarm optimization to optimize the BP network. The experiments of this study show that the PSO-BP algorithm can improve the prediction accuracy of network traffic and accelerate the convergence speed of the BP network [43]. Ma S et al. established a prediction model by combining the factor analysis method and neural network method, to improve the feasibility and accuracy of the prediction model of blasting vibration velocity. The research experiment proves that the improved BP neural network prediction model has better prediction accuracy [44]. Shi L et al. combined particle swarm optimization (PSO) and principal component analysis (PCA) to compensate for the shortcomings of the BP neural network. In this model algorithm, PCA is mainly used to process the original data. The experimental results show that the BPNN optimized by PSO has higher accuracy than the single BPNN [45]. Ding, HW et al. proved through simulation experiments that using KPCA method to reduce the data dimension and modify the initial value and loss function of the BP neural network can improve the learning ability of the BP neural network, and the learning accuracy is improved [46]. Muhammad Nasir Amin et al. used a neural network and ANFIS to predict the compressive strength of VAM by the sixfold symmetry of concrete failure [47].  [50]. Yumin Su et al. used the long short-term memory (LSTM) of the recurrent neural network to perform a real-time prediction algorithm for the vertical acceleration of the ship and used Python to predict the data. The results show that the recurrent neural network prediction model is effective [51].
Genetic algorithms can search for optimal solutions during evolution. Ordinary iterative training can leave a neural network stuck in a local minimum or cycling without progress, so that training effectively stalls; GA is a global optimization algorithm that can overcome this phenomenon [52]. In order to improve the inventory bonus, Xiaoning Li et al. used GA to eliminate relatively redundant features in the optimal solution of the model, which further illustrates the superiority of GA [53]. Dunjing Yu et al. used GA to optimize the nonlinear predictive controller of a ship trajectory tracking model. The experiments showed that GA improved the efficiency and accuracy of the controller [54].
The above literature shows that a single BPNN has both advantages and disadvantages. Its advantage is a strong nonlinear mapping ability, which makes it outstanding at solving problems with complex internal mechanisms. The self-learning and adaptive ability of the BP neural network is also very strong, and the learning results are stored as the weights of the network. These advantages can improve the prediction accuracy. However, its shortcomings affect the prediction results: as the fit to the training data improves, the prediction ability can decrease, that is, so-called "overfitting" occurs. The algorithm also easily falls into local extreme values, which results in failed network training and slow convergence.

BP Neural Network
A neural network is a machine learning model whose algorithm is inspired by the way neurons in the human brain process information. It is built from many neurons, each of which can be regarded as an independent learning unit: a neuron takes certain features as input and produces an output according to its own model. The neural network has an input layer, a hidden layer, and an output layer. The input layer receives the sample values. The hidden layer and the output layer are computed through an activation function, and the layers are connected by weight matrices. The hidden layers are not directly observable and act like a black box; there can be more than one of them. The output layer gives the desired classification or regression result. Figure 1 shows a neural network with a 3-layer structure. The structure is highly symmetrical: the neurons within a hidden layer are interchangeable, no matter how many hidden layers there are. If all parameters are set to 0, the weight on every edge is 0, so the neurons in each hidden layer have the same parameters and receive the same updates. After each update their weights are still the same, which keeps the network in a symmetric state. As a consequence, forward propagation and back propagation give identical results for all neurons in a layer, the symmetry cannot be broken, and the network cannot learn distinct features: each hidden layer effectively behaves like a single neuron. Therefore, the parameters cannot all be initialized to 0, nor to any other common value, because we have to "break the symmetry". Since the residual connection breaks the symmetric state of the network, the representation ability of the network is improved.
As the depth of the network increases, the weight matrix degenerates and the network degenerates.
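The symmetry argument above can be checked numerically. The following is a minimal sketch, assuming a toy 3-4-1 network without thresholds, a sigmoid hidden layer, a linear output, and synthetic data; none of these choices come from the paper. It trains the same network once from identical initial weights and once from random initial weights, then checks whether the hidden neurons ended up with identical weight columns.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(w_hidden, w_out, x, y, lr=0.5, steps=100):
    """Plain batch gradient descent on a toy 3-H-1 network (no thresholds)."""
    w_hidden, w_out = w_hidden.copy(), w_out.copy()
    for _ in range(steps):
        h = sigmoid(x @ w_hidden)                  # hidden outputs, shape (N, H)
        err = h @ w_out - y                        # linear output minus target
        grad_out = h.T @ err / len(x)
        grad_hidden = x.T @ ((err @ w_out.T) * h * (1 - h)) / len(x)
        w_out -= lr * grad_out
        w_hidden -= lr * grad_hidden
    return w_hidden, w_out

def columns_identical(w):
    """True if every hidden neuron has the same incoming weights."""
    return bool(np.allclose(w, w[:, :1]))

rng = np.random.default_rng(0)
x = rng.normal(size=(32, 3))                       # synthetic inputs (assumption)
y = np.sin(x.sum(axis=1, keepdims=True))           # synthetic targets (assumption)

# Case 1: every weight starts at the same value, so the symmetry is never broken.
w_h_const, _ = train(np.full((3, 4), 0.1), np.full((4, 1), 0.1), x, y)

# Case 2: random initial weights break the symmetry.
w_h_rand, _ = train(rng.normal(scale=0.5, size=(3, 4)),
                    rng.normal(scale=0.5, size=(4, 1)), x, y)

print(columns_identical(w_h_const), columns_identical(w_h_rand))
```

In the constant-initialization case the four hidden neurons remain copies of one another, which is the situation that random initialization (or, later in this paper, GA-selected initial weights) avoids.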
Cong Fang [55] proposed an analytical model that "strips" (peels) the layers of a neural network and offered a new perspective on the symmetric structure of deep neural networks, as well as on their generalization performance and robustness. Cong Fang [56] identified, through theoretical analysis, a brand-new phenomenon called Minority Collapse: when some classes in the training sample have many examples and others have few, the highly symmetric simplex equiangular tight frame structure of the neural network is broken, and the classes with more samples dominate the loss function.
The BP neural network uses residual connections, which break the symmetric state of the network and improve its representation ability. Therefore, the optimized BP neural network is used in this paper to predict China's waterway cargo transportation volume.
The BP neural network is an asymmetric network. Its learning process includes forward propagation of the signal and back propagation of the error. In forward propagation, the sample enters at the input layer, and the signal passes through the hidden layer and leaves from the output layer. When the output value does not match the expected value, the error is backpropagated: it is passed back toward the input layer, layer by layer, through the hidden layer and distributed to all units of each layer, which provides the basis for correcting the weights of each unit. The structure of the BP neural network is shown in Figure 2. The number of nodes in the input layer is M and the number of nodes in the output layer is L. Connection weights link the input layer nodes to the hidden layer nodes and the hidden layer nodes to the output layer nodes. Figure 2 shows that the input and output values of the BP network can be regarded as the independent and dependent variables of a nonlinear function, respectively, so the BP neural network represents a functional relationship from M independent variables to L dependent variables.

Process of Signal Forward Propagation
The input of the ith hidden layer node.
The output of the ith hidden layer node.
The input of the kth node of the output layer.
The output of the kth node of the output layer.
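For reference, the four quantities listed above are commonly written as follows for a three-layer network. This is a sketch in assumed notation rather than the paper's own equations: $x_j$ are the M inputs, $q$ is the number of hidden nodes, $w_{ij}$ and $w_{ki}$ are the input-to-hidden and hidden-to-output weights, $\theta_i$ and $a_k$ are the hidden and output thresholds, and $\phi$ and $\psi$ are the hidden and output activation functions.

```latex
\begin{align}
  net_i &= \sum_{j=1}^{M} w_{ij}\, x_j + \theta_i  && \text{(input of hidden node } i\text{)} \\
  y_i   &= \phi(net_i)                             && \text{(output of hidden node } i\text{)} \\
  net_k &= \sum_{i=1}^{q} w_{ki}\, y_i + a_k       && \text{(input of output node } k\text{)} \\
  o_k   &= \psi(net_k)                             && \text{(output of output node } k\text{)}
\end{align}
```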

Error Backpropagation Process
In error backpropagation, the output error of the neurons in each layer is calculated first, and the gradient descent method is then used to modify the weights and thresholds of each layer, so that the final output value moves closer to the expected value.
The quadratic error criterion function for each sample p is Ep, and the overall error criterion function of the system over all training samples is E. The error gradient descent method then computes the corrections in sequence: the correction of the output layer weights, the correction of the output layer thresholds, the correction of the hidden layer weights, and the correction of the hidden layer thresholds.
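The correction sequence described above can be sketched in code. The example below is a minimal illustration, assuming a sigmoid hidden layer, a linear output layer, the per-sample quadratic error E_p = ½ Σ (t − o)², and a 3-11-1 network with an arbitrary sample; the function name `bp_step` and the learning rate are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bp_step(params, x, t, lr=0.05):
    """One gradient-descent update on a single sample.

    Updates the arrays in `params` in place and returns the quadratic
    error E_p measured before the update.
    """
    w_h, b_h, w_o, b_o = params
    # Forward propagation.
    y = sigmoid(w_h @ x + b_h)           # hidden layer output
    o = w_o @ y + b_o                    # output layer output (linear, assumed)
    e_p = 0.5 * np.sum((t - o) ** 2)     # quadratic error for this sample
    # Backward propagation of the error.
    delta_o = o - t                                  # dE_p / d(output net input)
    delta_h = (w_o.T @ delta_o) * y * (1.0 - y)      # dE_p / d(hidden net input)
    # Corrections, in the order given in the text.
    w_o -= lr * np.outer(delta_o, y)     # output layer weights
    b_o -= lr * delta_o                  # output layer thresholds
    w_h -= lr * np.outer(delta_h, x)     # hidden layer weights
    b_h -= lr * delta_h                  # hidden layer thresholds
    return e_p

rng = np.random.default_rng(1)
params = (rng.normal(scale=0.5, size=(11, 3)), np.zeros(11),
          rng.normal(scale=0.5, size=(1, 11)), np.zeros(1))
x = np.array([0.2, 0.5, 0.8])            # illustrative input (assumption)
t = np.array([0.6])                      # illustrative target (assumption)
errors = [bp_step(params, x, t) for _ in range(50)]
print(errors[0], errors[-1])
```

Repeating the step on the same sample should drive the error toward zero, which is a quick sanity check that the signs of the corrections are right.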

Disadvantages of the BP Algorithm
The BP algorithm is a common method for training feedforward neural networks, but it has the following two disadvantages:
1. The initial solution randomly generated by the BP algorithm has a great impact on the performance of the algorithm, so the algorithm is unstable.
2. The BP algorithm uses the gradient descent method, which tends to converge too slowly and may even fall into a local minimum and fail to converge.

Increase the Momentum Term in the BP Algorithm to Accelerate Learning Speed
In this paper, the momentum term (11) commonly used in the BP neural network algorithm is modified to (12), which helps accelerate convergence.
The momentum term Δw(t − 1) carries the adjustment experience accumulated in earlier iterations. When the error gradient falls into a local minimum, the gradient → 0 but Δw(t − 1) ≠ 0, so the weights keep moving and the algorithm can escape the local minimum, which speeds up iterative convergence.
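To illustrate the role of the momentum term, the sketch below implements the commonly used form Δw(t) = −η · gradient + α · Δw(t − 1). It shows only this standard form; the paper's modified version (12) is not reproduced here, and the names `momentum_update`, `lr` (η), and `alpha` (α) are assumptions made for the example.

```python
import numpy as np

def momentum_update(w, grad, prev_delta, lr=0.1, alpha=0.9):
    """Standard momentum step: combine the current gradient with the previous correction."""
    delta = -lr * grad + alpha * prev_delta
    return w + delta, delta

w = np.array([0.5, -0.2])
prev_delta = np.array([0.05, -0.01])     # correction carried over from the last iteration
flat_grad = np.zeros(2)                  # gradient has vanished (flat region or local minimum)

w_new, delta = momentum_update(w, flat_grad, prev_delta)
print(w_new - w)                         # the weights still move, by alpha * prev_delta
```

Even though the gradient is zero, the weights are still adjusted by the remembered correction, which is the mechanism the text relies on to leave flat regions and shallow local minima.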

Genetic Algorithm (GA)
The GA was proposed by John Holland in the 1970s. Its basic principle draws on the natural selection and genetic mechanisms of biological evolution, and the essence of the algorithm is to search for the optimal solution during an evolutionary process. Figure 3 shows the main flowchart of the genetic algorithm, which consists of six parts. (1) Initialization: the relevant parameters are set, the generation counter g is set to 0, the maximum number of generations is set to G, and NP individuals are randomly generated as the initial population P(0). (2) Individual evaluation: the fitness of each individual in the population P(g) is calculated. (3) Selection: the selection operator is applied to the population and, according to individual fitness, excellent individuals are selected to be inherited by the next generation. (4) Crossover: the crossover operator is applied to the population. (5) Mutation: the mutation operator is applied to the population, and for the selected individuals some gene values are changed to other alleles with a certain probability. After these operations, population P(g) yields the next-generation population P(g + 1), whose individuals are sorted by fitness value in preparation for the next genetic operation. (6) Termination: when g > G, the individual with the maximum fitness is output as the optimal solution and the algorithm terminates; when g ≤ G, g = g + 1 and the algorithm re-evaluates the individuals to calculate their fitness values.
The genetic algorithm has strong adaptability: it does not need gradients or other problem-specific information during iteration, it is not hindered by flat regions, and the selection operator can eliminate candidate models that have fallen into local minima. The genetic algorithm also has excellent global search performance: a population contains multiple individuals that search globally in parallel, which improves the chance that the optimal model is found in the end. The genetic algorithm can therefore help the BP neural network avoid falling into a local minimum, thus improving the accuracy of the model.
The advantages of using GA to optimize BP lie in the following three points: (1) Genetic algorithms do not easily fall into local optima when searching in space and can easily obtain the global optimum solution. (2) The genetic algorithm is particularly suitable for dealing with complex nonlinear problems, because the conventional algorithm adopts gradient descent, the search direction is fixed, and the genetic algorithm adopts the overall search strategy. (3) The genetic algorithm adopts a parallel search mechanism, which has a small amount of calculation and more processing modes.
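The flow of the genetic algorithm can be sketched as a short loop. The example below is only an illustration on a toy fitness function: the operators used here (keeping the fitter half as parents, uniform crossover, Gaussian mutation, and keeping the single best individual) are simple stand-ins chosen for brevity, not the operators used in this paper, and `run_ga` and its parameter values are hypothetical.

```python
import numpy as np

def run_ga(fitness, dim, pop_size=50, generations=100, pc=0.8, pm=0.05, seed=0):
    """Minimal real-coded GA that maximizes `fitness` over vectors of length `dim`."""
    rng = np.random.default_rng(seed)
    pop = rng.uniform(-1.0, 1.0, size=(pop_size, dim))     # initialization: P(0)
    for g in range(generations):                           # stop after G generations
        fit = np.array([fitness(ind) for ind in pop])      # individual evaluation
        pop = pop[np.argsort(fit)[::-1]]                   # sort, best individual first
        parents = pop[: pop_size // 2]                     # selection: fitter half
        children = []
        while len(children) < pop_size - 1:
            a, b = parents[rng.integers(len(parents), size=2)]
            child = a.copy()
            if rng.random() < pc:                          # crossover with probability Pc
                mask = rng.random(dim) < 0.5
                child[mask] = b[mask]
            mutate = rng.random(dim) < pm                  # mutation with probability Pm per gene
            child[mutate] += rng.normal(scale=0.1, size=int(mutate.sum()))
            children.append(child)
        pop = np.vstack([pop[:1], np.array(children)])     # next generation, best one kept
    fit = np.array([fitness(ind) for ind in pop])
    return pop[np.argmax(fit)], float(fit.max())

# Toy problem (assumption): the optimum is the vector with every component equal to 0.3.
best, best_fit = run_ga(lambda v: -np.sum((v - 0.3) ** 2), dim=5)
print(best, best_fit)
```

No gradient of the fitness function is used anywhere in the loop, which is the property the text highlights when contrasting GA with gradient descent.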

Construction of the GA-BP Neural Network Model
During BP neural network training, the weights and thresholds are updated through forward propagation of the data and backpropagation of the error. The weights and thresholds used in the first forward pass must be initialized, and the usual practice in deep learning is to obtain these initial parameters by random initialization. After the initial parameters are selected, the gradient descent algorithm uses them as the starting point to optimize and update the parameters. The weight training process of the BPNN is essentially an optimization problem over a complex function, and the normal way to obtain the weights is to adjust them gradually during training according to fixed rules.
Optimization algorithms fall into two categories: deterministic algorithms and heuristic algorithms. A deterministic algorithm uses mathematical methods to find the optimal solution, and the result is related to the initial point of the derivation and is generally a definite value. A heuristic algorithm is inspired by the laws of biological evolution in nature. Its main idea is to approach the optimum iteratively, and the result of the optimization is a value that meets the requirements of engineering accuracy (that is, close to the theoretical optimum).
In the above process, as a deterministic algorithm, the convergence of the gradient descent algorithm has been proven, but the convergence value is not necessarily the global optimum, which is related to the initial parameter value (the starting point of the gradient descent algorithm). Since random initial parameters may not be the optimal starting point (meaning both accurate training and reliable prediction), the reliability and stability of the trained model are greatly affected by the initial random parameters. As a heuristic algorithm, the genetic algorithm GA has a very good global search ability, and GA is introduced to solve this problem.
The main steps are as follows:
(1) Use floating-point numbers to encode the weights and thresholds of the neural network.
(2) In the coding space, randomly generate an initial population.
(3) Using the training samples, calculate the fitness value of each individual in the population according to Formula (13), where m is the number of training samples, f(i) is the fitness value of the ith genetic individual, y_ij is the expected output of the jth training sample, and ŷ_ij is the actual output of the jth training sample.
(4) Apply selection and crossover. In this paper, tournament (competitive) selection is used: a certain number of individuals are randomly drawn from the population, the best of them is selected as a parent, and this operation is repeated until the selection is complete. Arithmetic crossover (linear crossover) is used for the crossover, that is:
child1 = α · parent1 + (1 − α) · parent2
child2 = α · parent2 + (1 − α) · parent1
where α is a random number in [0,1], parent1 and parent2 are corresponding components of the parent individuals, and child1 and child2 are the corresponding components of the child individuals.
(5) Generate a new generation of the population.
(6) Repeat steps (3) to (5). When the evolution reaches N generations, the individuals with the best fitness are retained. After the algorithm ends, the optimal individual in the final population is decoded to obtain the weights and thresholds of the optimized BP neural network.
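The encoding, selection, crossover, and decoding steps above can be sketched as follows. This is a minimal illustration under stated assumptions: the chromosome layout (hidden weights, hidden thresholds, output weights, output thresholds for a 3-11-1 network), the tournament size k = 3, and the random placeholder fitness values are all hypothetical, and the function names are not from the paper.

```python
import numpy as np

def tournament_select(pop, fitness, k, rng):
    """Draw k individuals at random and return the fittest of them."""
    idx = rng.choice(len(pop), size=k, replace=False)
    return pop[idx[np.argmax(fitness[idx])]]

def arithmetic_crossover(parent1, parent2, rng):
    """Arithmetic (linear) crossover with a random mixing coefficient alpha in [0, 1)."""
    alpha = rng.random()
    child1 = alpha * parent1 + (1 - alpha) * parent2
    child2 = alpha * parent2 + (1 - alpha) * parent1
    return child1, child2

def decode(chrom, n_in=3, n_hid=11, n_out=1):
    """Split a flat floating-point chromosome into BP weights and thresholds."""
    sizes = [n_hid * n_in, n_hid, n_out * n_hid, n_out]
    w_h, b_h, w_o, b_o = np.split(chrom, np.cumsum(sizes)[:-1])
    return w_h.reshape(n_hid, n_in), b_h, w_o.reshape(n_out, n_hid), b_o

rng = np.random.default_rng(0)
chrom_len = 11 * 3 + 11 + 1 * 11 + 1               # 56 genes for a 3-11-1 network
pop = rng.uniform(-1.0, 1.0, size=(50, chrom_len)) # random initial population
fitness = rng.random(50)                           # placeholder fitness values (assumption)

p1 = tournament_select(pop, fitness, k=3, rng=rng)
p2 = tournament_select(pop, fitness, k=3, rng=rng)
c1, c2 = arithmetic_crossover(p1, p2, rng)
w_h, b_h, w_o, b_o = decode(c1)
print(w_h.shape, b_h.shape, w_o.shape, b_o.shape)
```

In a full GA-BP run the placeholder fitness would be replaced by a value computed from the network's error on the training samples, and the decoded weights of the best individual would seed ordinary BP training.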

Data Processing
The empirical analysis data come from the statistical database of the Ministry of Transport of China [57]. The data consist of the monthly waterway freight volume for the whole of China. This paper selects the monthly data from January 2015 to December 2021 and forecasts based on these data. Taking the throughput of three consecutive months as the sample input and the throughput of the following (fourth) month as the output, and sliding this window forward one month at a time, the 84 monthly values yield 81 sets of samples. The first 66 sets are used as the training set, and the last 15 sets are used as the test set. Table 1 shows the data processing results.
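The sample construction described above can be illustrated with a short sketch. The function name `make_windows` and the placeholder series are hypothetical; the series below only stands in for the 84 monthly values and does not contain real data.

```python
def make_windows(series, n_inputs=3):
    """Build (inputs, target) pairs: n_inputs consecutive months -> next month."""
    samples = []
    for start in range(len(series) - n_inputs):
        inputs = list(series[start:start + n_inputs])
        target = series[start + n_inputs]
        samples.append((inputs, target))
    return samples

# Placeholder for 84 monthly values (January 2015 to December 2021); not real data.
monthly = list(range(84))
samples = make_windows(monthly)

# Chronological split: first 66 samples for training, last 15 for testing.
train, test = samples[:66], samples[66:]
print(len(samples), len(train), len(test))  # 81 66 15
```

The split is chronological rather than random, so the test samples always lie after the training samples in time.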

Population size NP
The population size affects the efficiency of the algorithm. If NP is too small, the optimization performance of the GA is reduced; if NP is too large, the computation becomes expensive. NP is therefore usually chosen in the range 10-200.

Crossover probability Pc
Pc controls how often the crossover operation is applied. A Pc that is too large or too small makes the algorithm unstable; Pc is generally taken as 0.25 to 1.00.

Mutation probability Pm
In general, a lower Pm can reduce the possibility of loss of important genes in the population, so Pm usually takes a value of 0.001 to 0.1.

Number of generations G
The terminating number of generations G is the stopping condition of the genetic algorithm. The value of G depends on the specific problem and is typically between 100 and 1000.
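As a rough illustration, the four control parameters and their typical ranges can be collected in one place. The class and function names are hypothetical, and the default values for Pc, Pm, and G are illustrative choices within the stated ranges, not settings reported in this paper; only the population size of 50 comes from the experiment described in the text.

```python
from dataclasses import dataclass

# Typical ranges quoted in the text for each GA control parameter.
TYPICAL_RANGES = {
    "population_size": (10, 200),   # NP
    "crossover_prob": (0.25, 1.00), # Pc
    "mutation_prob": (0.001, 0.1),  # Pm
    "generations": (100, 1000),     # G
}

@dataclass(frozen=True)
class GAParams:
    population_size: int = 50    # NP, the value used in the experiment
    crossover_prob: float = 0.8  # Pc, illustrative assumption
    mutation_prob: float = 0.05  # Pm, illustrative assumption
    generations: int = 100       # G, illustrative assumption

def outside_typical_ranges(params):
    """Return the names of parameters that fall outside the typical ranges."""
    return [name for name, (lo, hi) in TYPICAL_RANGES.items()
            if not lo <= getattr(params, name) <= hi]

print(outside_typical_ranges(GAParams()))  # []
```

The ranges are guidelines rather than hard limits, so the helper only reports out-of-range parameters instead of raising an error.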
In this experiment, the optimization training is carried out in MATLAB according to the process shown in Figure 4, using GA-BP neural network code written for this purpose. A major feature of the genetic algorithm is its speed. The initial population should not be too large, otherwise the computation slows down; the initial population of the GA is therefore set to 50.
The BP neural network adopts the basic three-layer structure. The number of hidden layer nodes is determined using the empirical Formula (16). The activation function is the S-type (sigmoid) function. Compared with a linear function, its main advantage is nonlinearity: when the neurons use a sigmoid activation function, the network output is also a nonlinear function of the input. The training function is trainlm, which converges fastest for medium-scale BP neural networks and is the default training algorithm of the system, which reduces the amount of calculation in training.
Figure 5 shows the neural network training interface. In this experiment, the input, hidden, and output layers have three, eleven, and one nodes, respectively. From the analysis of Figure 5 and Table 1, it can be seen that the optimal number of hidden layer nodes is 11, which gives the better training effect, with a corresponding mean square error of 0.041436. Table 2 shows the determination process of the hidden layer nodes.
As shown in Figure 6, the goodness of fit is 89.675% for the training samples, 80.74% for the test set, and 87.554% overall. This fit indicates that the network trains well and that the model can be used to predict China's waterway cargo transportation volume.
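To make the 3-11-1 structure concrete, the sketch below computes one forward pass through such a network and counts its weights and thresholds, which is the number of floating-point genes a GA individual would need to encode. It is an illustration only: the function names are hypothetical, and the linear output neuron is an assumption, since the text specifies the sigmoid activation without stating which layers use it.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def forward(x, w_hidden, b_hidden, w_out, b_out):
    """Forward pass: sigmoid hidden layer, linear output (assumed)."""
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    return sum(w * h for w, h in zip(w_out, hidden)) + b_out

def n_parameters(n_in, n_hidden, n_out):
    """Total number of weights and thresholds in a three-layer network."""
    return n_in * n_hidden + n_hidden + n_hidden * n_out + n_out

print(n_parameters(3, 11, 1))  # 56
```

With three inputs, eleven hidden nodes, and one output, each GA individual is therefore a vector of 56 real numbers (33 input-to-hidden weights, 11 hidden thresholds, 11 hidden-to-output weights, and 1 output threshold).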
Figure 7 shows that the values predicted by the GA-improved BP neural network are more accurate than those of the unimproved BP neural network, and that the error of the GA-BP model is smaller than that of the BP model. This shows that the GA plays a significant role in optimizing the BP neural network.
Table 3 compares the prediction error of the BP neural network with that of the GA-BP neural network. The error of the GA-optimized network is smaller than that of the plain BP neural network, which supports the feasibility of the optimization. The mean absolute percentage error of the optimized BP neural network is 7.281% lower than that of the unoptimized neural network.
Table 4 shows the prediction results for the 15 test samples, comparing the GA-improved neural network with the unimproved neural network. The experiment shows that the BP neural network optimized by the GA is more accurate.
Table 5 gives the waterway cargo transportation volume in China for the 12 months of 2022, as predicted by the GA-BPNN model proposed in this paper.
Figure 8 shows the trend of China's waterway cargo transportation over the 12 months of 2022 as forecast by the GA-BPNN model. There is an upward trend throughout the year. Since the Chinese Spring Festival mostly falls in February, the model also captures the impact of the holiday: the lowest predicted value, 56,950, is for February, which is in line with the actual situation in China. The model has a strong learning ability, which supports its reliability. In summary, Tables 3 and 4 and Figures 5-8 all show that the prediction accuracy of the GA-BP neural network is generally higher than that of the BP neural network.
Figure 9 shows the growth rate of China's 12-month forecast for 2022 compared with the same period in 2021.
It can be seen that four months in 2022 are forecast to increase compared with the same period in 2021, and the year as a whole shows a growth trend. The forecast can therefore provide decision-making information for the MTS: preparations can be made in advance for months of expected growth, shipping lanes can be cleared in advance, port congestion and the probability of overcapacity in ports and shipping routes can be reduced, and the overall efficiency of the MTS and of water transportation can be improved.
Figure 10 shows China's full-year totals for 2015-2021, the full-year forecast for 2022, and the annual growth rates. There is an increasing trend from 2015 to 2022, and the value predicted by the model in this paper is in line with this trend, which further illustrates the adaptability and reliability of the model. The growth rate in 2020 was 2.1%, a decrease compared with the previous year, mainly due to the global spread of COVID-19, which led to a downturn in the maritime transport market, increased port congestion, and reduced shipping capacity. Although the growth rate increased in 2021, COVID-19 was still spreading around the world, so the growth rate forecast by the model in this paper is lower. This provides important warning information that allows MTS managers and enterprises to prepare in advance for possible risks and reduce economic losses as much as possible.
Figure 11a shows the forecast 2022 full-year cargo throughput of the ten ports that ranked highest in China in 2021. The first is Ningbo Zhoushan Port, with a predicted value of 124,211. Figure 11b shows the proportion of the predicted cargo throughput of the top ten ports in the total waterway cargo transportation volume in 2022.
Among the top ten ports, Ningbo Zhoushan Port accounts for the largest proportion of cargo throughput; the top ten ports together account for approximately 78% of the total, and all other ports for 22%. The distribution of these ports along the coast of China is shown in Figure 11c. The top ten ports are mostly in the eastern and northern coastal areas, and these areas are mostly connected to the sea routes in the MTS. These ports hold an important position on the routes of the MTS, so it is necessary to forecast their cargo throughput. The forecasts can provide important information for optimizing and upgrading the MTS so that it can better cope with emergencies and with the downturn in the shipping economy caused by the still-spreading COVID-19 pandemic.
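The error comparison between the two models and the year-on-year growth rates discussed in this section rest on two simple quantities, sketched below. The function names are hypothetical, the mean absolute percentage error is written in its standard form (the exact error definition used in Table 3 is assumed, not confirmed), and the numbers are made up for illustration; they are not values from the tables or figures.

```python
def mape(actual, predicted):
    """Mean absolute percentage error, in percent."""
    return 100.0 * sum(abs(a - p) / abs(a)
                       for a, p in zip(actual, predicted)) / len(actual)

def growth_rate(current, previous):
    """Growth of `current` over `previous`, in percent."""
    return 100.0 * (current - previous) / previous

# Hypothetical values for illustration only.
actual = [100.0, 200.0, 400.0]
bp_pred = [90.0, 230.0, 380.0]
ga_bp_pred = [98.0, 205.0, 396.0]

print(mape(actual, bp_pred) > mape(actual, ga_bp_pred))  # True
print(growth_rate(102.0, 100.0))  # 2.0
```

A lower MAPE means a more accurate model, so an improvement appears as a drop in this value.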

Discussion and Conclusions
Forecasting waterway cargo traffic has important implications for the study of sustainable MTS. Water transportation is a basic need and an important link in economic development, and it holds an irreplaceable position in transportation owing to its wide coverage, low investment in waterways, strong transportation capacity, small footprint, and low cost. Good forecasts of waterway cargo traffic help improve and upgrade the MTS. They also help optimize the overall transportation system, enable seamless integration with rail, road, and air transportation systems, better protect the MTS, and maintain higher levels of maritime traffic.
Dry bulk transportation is an important component of the MTS and one of the main forms of water transportation. Therefore, it is of great significance to predict the volume of waterway cargo transportation. In 2021, the global dry bulk shipping volume continued to grow, but the growth rate slowed. According to the forecast of relevant institutions, the global dry bulk shipping trade volume in 2022 will be 5.46 billion tons, a year-on-year increase of 1.6%. The predicted dry bulk shipping volume accounts for 65.3% of the annual forecast value. Within it, the growth rates of iron ore, coal, and nickel ore remain relatively stable, and the growth rate of seaborne shipments of soybeans and bauxite continues to increase. All of this provides important research information and ideas for the study of MTS.
Container transportation in the MTS is another important form of water transportation. The spillover effect of the container market running at a high level still exists, and dry bulk freight rates are still recovering from the bottom of the cycle. In 2022, overall global container shipping demand will maintain strong momentum. The World Trade Organization (WTO) predicts that the global merchandise trade volume will increase by 4.7% this year. A number of domestic and foreign institutions and shipping companies predict that international container shipping demand will grow by 4-6% in 2022. Because COVID-19 is still spreading in many countries, congestion at some of the world's major ports has not shown obvious signs of improvement. This will have a relatively long-term impact on the stability and smoothness of the international maritime logistics supply chain, directly affecting the effective supply of shipping capacity and thus freight rates. Therefore, forecasting waterway cargo transportation volume is particularly critical, and the development of COVID-19 and port congestion will be the main factors determining the direction of the waterway cargo transportation market.
Forecasting waterway cargo transportation volume can provide a strong reference for the government departments of relevant countries when they adjust the structure of waterway transportation capacity, formulate waterway transportation plans, and set sustainable macro shipping policies. It can also serve as a basis for the planning and investment decisions of shipping enterprises, helping them formulate business strategies and sustainable development strategies. The forecast therefore supports reasonable planning of waterway transportation, the upgrading and optimization of the MTS, and sound, sustainable management decisions by transportation enterprises. At the same time, it can provide a scientific decision-making reference for government departments in water transportation investment and other areas. Among the various forecasting methods, this paper chooses a more intelligent one, namely BP neural network forecasting. A BP neural network is a kind of asymmetric neural network. The fundamental reason is that the residual method introduced in the weight calculation breaks the symmetrical structure of the network, which helps improve the representation ability of the network as the network grows, and also improves its computing power and convergence speed. In addition, a momentum term is added to accelerate the convergence of the algorithm, and the running speed of the algorithm is improved by modifying the formula of the commonly used momentum term.
This paper first explained the principle of neural networks and the significance of waterway cargo transportation, and introduced the commonly used BP neural network. Next, we introduced the main methods for forecasting China's waterway freight volume, together with their goals and problems. Then, we considered the advantages and disadvantages of a single BP neural network and of a genetic algorithm. The BP neural network prediction model converges slowly and is prone to local optima, and the genetic algorithm can make up for these shortcomings of the asymmetric neural network: the GA converges quickly and increases the chance of escaping local optima, which helps the algorithm find the global optimal solution. Finally, the genetic algorithm was combined with the BP neural network to construct the prediction model, and the relevant data were collected to conduct experiments. The simulation experiments show that, compared with the single traditional BP neural network prediction model, the GA-BP prediction model improves convergence speed and prediction accuracy and reduces the possibility of the BP neural network prediction model falling into local minima. Shipping companies can use the forecast results of this method as a reference for related work and management, and the method can meet their requirements for forecast accuracy to a large extent. It can help make the forecasting of waterway cargo transportation more effective, more economical, and safer.
When the BP neural network model for waterway cargo transportation volume is optimized by the GA, the prediction result is closer to the real waterway cargo transportation volume, which makes the model suitable for waterway cargo transportation volume prediction. In the experiment, the prediction accuracy is improved by 7.281%, and the average error is 3.2148%.
However, the method in this paper still has room for optimization. External factors such as political factors, economic forms, weather, and inventory also affect waterway transportation to some degree, although they are difficult to quantify and rationalize. Incorporating them is a direction for future research and for subsequent model tuning.