Adaptive MPC Based Real-Time Energy Management Strategy of the Electric Sanitation Vehicles

: Electric sanitation vehicles have increasingly been applied to cleaning work due to the requirement of air pollution control. The power distribution and energy management strategy (EMS) inﬂuence the vehicle’s performance a lot both in the aspects of cleaning effect and electricity consumption. Aiming to improve energy economy and ensure clean tasks, ﬁrst, the electricity consumption percentages of the vehicle onboard devices are analyzed and the main contributors are clariﬁed, and the power requirement model of the working motor is built based on experimental data. Second, a universal modeling method of garbage distribution on the road surface is proposed, which implements a nonlinear autoregressive neural network as the predictor. Third, an adaptive model predictive control (AMPC)-based EMS is proposed and veriﬁed. The results show the AMPC method can accurately predict the garbage density and the proposed EMS can approximate the energy consumption of the DP-based EMS with little deviation. Compared to conventional rule-based EMS, the AMPC-based EMS achieved a 15.5% decrease in energy consumption as well as a 14.6% decrease in working time.


Introduction
Faced with air pollution and energy security concerns, the electrification process of conventional vehicles is overwhelming, and sanitation vehicles are no exception. The performance of an electric sanitation vehicle, which has a working motor system and a driving motor system, is strongly affected by the used energy management strategy (EMS). However, the single and rough rule-based EMS conducted by the driver fails to realize the ideal equilibrium between sanitation tasks and electricity consumption. Thus, how to elaborate an appropriate EMS, which can be implemented in real time and satisfy both clean effect and energy economy requirements, is critical in practical application.
the driving motor and the working motor operated separately. Namely, the torque split was not allowed through the planet sets in the transmission. The clutch was kept closed, and the working motor drove three clean devices consisting of a high-pressure water pump, a powerful vacuum collector, and a sweeper to clean the road surface. Since the particular principles of the clean process were not the point, the energy consumed by the three devices was simplified as an integration equaling that of the working motor.    Appl. Sci. 2021, 11, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/applsc the driving motor and the working motor operated separately. Namely, the torque split was not allowed through the planet sets in the transmission. The clutch was kept closed, and the working motor drove three clean devices consisting of a high-pressure water pump, a powerful vacuum collector, and a sweeper to clean the road surface. Since the particular principles of the clean process were not the point, the energy consumed by the three devices was simplified as an integration equaling that of the working motor.
(a) (b)     The electric sanitation vehicle had two operating modes: One was driving mode at high speed, the other was working mode at low speed. For the case in the working mode, the driving motor and the working motor operated separately. Namely, the torque split was not allowed through the planet sets in the transmission. The clutch was kept closed, and the working motor drove three clean devices consisting of a high-pressure water pump, a powerful vacuum collector, and a sweeper to clean the road surface. Since the particular principles of the clean process were not the point, the energy consumed by the three devices was simplified as an integration equaling that of the working motor. Figure 3 shows the typical velocity profiles of the sanitation vehicle, which could be divided into the scenario transition condition part and the working condition part. The former part occurred in the interim when the vehicle performed such manipulations as adding water, emptying the garbage, and moving at high speed. Herein, concentrating on the working part, it is indicated that the velocity was maintained at about 8 km/h most of the time. A slightly higher speed ranging from 10 km/h to 22 km/h was adopted by the vehicle when moving to the target field or turning at the road's end were required, which correspond to the segment from 2000 s to 2300 s.  Figure 3 shows the typical velocity profiles of the sanitation vehicle, which divided into the scenario transition condition part and the working condition former part occurred in the interim when the vehicle performed such manipu adding water, emptying the garbage, and moving at high speed. Herein, concen the working part, it is indicated that the velocity was maintained at about 8 km the time. A slightly higher speed ranging from 10 km/h to 22 km/h was adopt vehicle when moving to the target field or turning at the road's end were requir correspond to the segment from 2000 s to 2300 s. Although the scenario transition condition part was not of concern, the en sumption analysis covered the whole driving cycle to reasonably evaluate the power components. Generally speaking, the energy stored in the battery was c by the driving motor (DM), the working motor (WM), an air compressor (AC braking, a steering motor vehicle (SM), and a DCDC (DC-to-DC converter) conne low-voltage accessories. Figure 4 illustrates the power curves of these compon pled from the cleaning routine. Specifically, under the driving mode, the work was kept off. The driving motor consumed power varying from −120 kW to 1 which being positive means that battery energy was consumed by the driving m being negative means that the electricity was regenerated by the driving motor f ing and restored to the battery. While under the working mode, the working mo ily consumed power at about 50 kW, and extremely low operating power was c by the driving motor due to the low velocity. In Figure 5, the operating states of the rotating speed and the torque of the working motor are displayed. Although the scenario transition condition part was not of concern, the energy consumption analysis covered the whole driving cycle to reasonably evaluate the principal power components. Generally speaking, the energy stored in the battery was consumed by the driving motor (DM), the working motor (WM), an air compressor (AC) used for braking, a steering motor vehicle (SM), and a DCDC (DC-to-DC converter) connected with low-voltage accessories. Figure 4 illustrates the power curves of these components sampled from the cleaning routine. Specifically, under the driving mode, the working motor was kept off. The driving motor consumed power varying from −120 kW to 120 kW, in which being positive means that battery energy was consumed by the driving motor and being negative means that the electricity was regenerated by the driving motor from braking and restored to the battery. While under the working mode, the working motor steadily consumed power at about 50 kW, and extremely low operating power was consumed by the driving motor due to the low velocity. In Figure 5, the operating states consisting of the rotating speed and the torque of the working motor are displayed.

Analysis of the Energy Consumption
In Figure 4, even though the peak power overpassed 20 kW, it is indicated that the steering motor and air compressor merely occasionally consumed that power for a relatively short time. The energy consumption proportions of these devices are shown in Figure 6. Taking the whole driving cycle into consideration, the sum of energy consumed by the two motors took a proportion up to 89%. Consequently, aiming at a better energy economy of the sanitation vehicle, only these two motors' operating states were analyzed and targeted in the optimization method to acquire optimal control sequences.   In Figure 4, even though the peak power overpassed 20 kW, it is indicated that th steering motor and air compressor merely occasionally consumed that power for a rel tively short time. The energy consumption proportions of these devices are shown in Figure 6. Taking the whole driving cycle into consideration, the sum of energy consume by the two motors took a proportion up to 89%. Consequently, aiming at a better energ economy of the sanitation vehicle, only these two motors' operating states were analyze and targeted in the optimization method to acquire optimal control sequences.   In Figure 4, even though the peak power overpassed 20 kW, it is indicated that the steering motor and air compressor merely occasionally consumed that power for a relatively short time. The energy consumption proportions of these devices are shown in Figure 6. Taking the whole driving cycle into consideration, the sum of energy consumed by the two motors took a proportion up to 89%. Consequently, aiming at a better energy economy of the sanitation vehicle, only these two motors' operating states were analyzed and targeted in the optimization method to acquire optimal control sequences.     In Figure 4, even though the peak power overpassed 20 kW, it is indicated that the steering motor and air compressor merely occasionally consumed that power for a relatively short time. The energy consumption proportions of these devices are shown in Figure 6. Taking the whole driving cycle into consideration, the sum of energy consumed by the two motors took a proportion up to 89%. Consequently, aiming at a better energy economy of the sanitation vehicle, only these two motors' operating states were analyzed and targeted in the optimization method to acquire optimal control sequences.   In this study, the vehicle was considered a discrete-time dynamic model. The required tractive torque T de (Nm) and demand driving power P de (kW) are obtained by [23]

Working
where M represents the mass of the vehicle, kg; g represents the gravational acceleration, m/s 2 ; f represents the rolling resistance coefficient; A represents the frontal area, m 2 ; C d represents the air resistance coefficient; r represents tire radius, m; δ represents rotational mass conversion factor; β represents the inclination angle of the road; a v represents the acceleration of the vehicle, m/s 2 ; v represents the speed of the vehicle, km/h, η represents the mechanical efficiency of the transmission system, δ represents conversion ratio of vehicle rotational mass, i 0 represents the product of final drive ratio and ring drive ratio of the planet set, and i g represents the transmission ratio, which was set to be the first gear due to the low velocity under working mode.
Since the vehicle was only propelled by the driving motor under the working mode, the demand torque of the vehicle approximately equaled that of the motor.

Modeling the Electric Motors and Battery Pack
Since the dynamics of the electric motor were not what this study concerned, the demand power of the two motors and the battery can be calculated by where T m represents the torque of the motor, Nm; n m represents rotating speed, rpm; P m and P bat represent the demand power (kW) of the motor and the battery, respectively; being positive (P + m ) means the motors consumed electricity from the battery, and being negative (P − m ) means the motors regenerated electricity to store in the battery; η dis and η cha represents the discharging and charging efficiency of the battery, respectively; and η m stands for the motor efficiency, which could be obtained by the look-up tables of the motors shown in Figure 7.
where represents the mass of the vehicle, kg; represents the gravational acceleration, m/s 2 ; represents the rolling resistance coefficient; represents the frontal area, m 2 ; represents the air resistance coefficient; represents tire radius, m; represents rotational mass conversion factor; represents the inclination angle of the road; represents the acceleration of the vehicle, m/s 2 ; represents the speed of the vehicle, km/h, represents the mechanical efficiency of the transmission system, represents conversion ratio of vehicle rotational mass, 0 represents the product of final drive ratio and ring drive ratio of the planet set, and represents the transmission ratio, which was set to be the first gear due to the low velocity under working mode.
Since the vehicle was only propelled by the driving motor under the working mode, the demand torque of the vehicle approximately equaled that of the motor.

Modeling the Electric Motors and Battery Pack
Since the dynamics of the electric motor were not what this study concerned, the demand power of the two motors and the battery can be calculated by where represents the torque of the motor, Nm; represents rotating speed, rpm; and represent the demand power (kW) of the motor and the battery, respectively; being positive ( + ) means the motors consumed electricity from the battery, and being negative ( − ) means the motors regenerated electricity to store in the battery; and ℎ represents the discharging and charging efficiency of the battery, respectively; and stands for the motor efficiency, which could be obtained by the look-up tables of the motors shown in Figure 7. Some important characteristic curves of the battery, including the open-circuit voltage and discharging resistance, are shown in Figure 8. Considering the battery health protection, the characteristics of the battery should have changed steadily [24]. Hence, the battery SoC (state of charge) was set to vary from 0.3 to 0.8.  Some important characteristic curves of the battery, including the open-circuit voltage and discharging resistance, are shown in Figure 8. Considering the battery health protection, the characteristics of the battery should have changed steadily [24]. Hence, the battery SoC (state of charge) was set to vary from 0.3 to 0.8. Assuming that the effects of temperature on the battery are not involved, a simplified battery model is adopted with appropriate precision, which is described as [25]   (5) where , , and represent the open-circuit voltage (V), the internal resistance (Ω), and the capacity of the battery (Ah), respectively. Specifically, the increment of the SoC signifies the variation of vehicle electricity consumption.

Modeling the Power Requirement of the Working Motor
In practical applications, the operating state of the working motor only had three options (or three fixed rotating speeds), which are named general mode, standard mode, and extreme mode. The nominal parameters of the three modes are displayed in Table 2, which were measured when the vehicle was stationary. Since the working motor was controlled based on a certain rotating speed, the output torque changed against the resistance force from the road. To investigate how garbage density on the road interacts with vehicle energy consumption, a series of field experiments, which were devised based on actual operations of drivers, were conducted to establish the power requirement model of the working motor. The real garbage was replaced with sand and rubble in the experiments. Figure 9 displays the main experimental results. It is indicated that the residues after cleaning were basically consistent or even less, despite having more and more "garbage" Assuming that the effects of temperature on the battery are not involved, a simplified battery model is adopted with appropriate precision, which is described as [25] dSoC where V bat , R bat , and Q bat represent the open-circuit voltage (V), the internal resistance (Ω), and the capacity of the battery (Ah), respectively. Specifically, the increment of the SoC signifies the variation of vehicle electricity consumption.

Modeling the Power Requirement of the Working Motor
In practical applications, the operating state of the working motor only had three options (or three fixed rotating speeds), which are named general mode, standard mode, and extreme mode. The nominal parameters of the three modes are displayed in Table 2, which were measured when the vehicle was stationary. Since the working motor was controlled based on a certain rotating speed, the output torque changed against the resistance force from the road. To investigate how garbage density on the road interacts with vehicle energy consumption, a series of field experiments, which were devised based on actual operations of drivers, were conducted to establish the power requirement model of the working motor. The real garbage was replaced with sand and rubble in the experiments. Figure 9 displays the main experimental results. It is indicated that the residues after cleaning were basically consistent or even less, despite having more and more "garbage" on the road surface. From Figure 9, it seems the "garbage" could not be cleaned completely. However, due to far less existing garbage in real scenarios, hardly any residues would be left after cleaning. Setting the quantity of "garbage" in experiments to be significantly more than the real would achieve a clearer visual comparison. on the road surface. From Figure 9, it seems the "garbage" could not be cleaned completely. However, due to far less existing garbage in real scenarios, hardly any residues would be left after cleaning. Setting the quantity of "garbage" in experiments to be significantly more than the real would achieve a clearer visual comparison. The detailed results of the experiments are summarized in Table 3, where the performance was evaluated by the proportions of residues in the total amount of garbage.  As mentioned before, the working motor was controlled based on rotating speed, which is formulated as where is the working motor output torque, is the rotating speed, and represents the resistance from the road surface, which is theoretically related to the velocity, garbage density, and the rotating speed of the working motor.
Based on the experiment data, it can be summarized that the variations of vehicle speed and garbage density exerted very limited impact on the motor torque, which is also testified in Figure 5. Consequently, assuming being immune from those minor factors, the relationship between power and the rotating speed of the working motor could be approximated mathematically.
Furthermore, it is evident the clean force, which ranged from 0~1, representing normalized cleaning ability of the vehicle, increased with the power of the motor but decreased with the vehicle velocity. Hence, the discount factor dependent on velocity is defined and the cleaning effect of the vehicle is formulated as The detailed results of the experiments are summarized in Table 3, where the performance was evaluated by the proportions of residues in the total amount of garbage. As mentioned before, the working motor was controlled based on rotating speed, which is formulated as where T wm is the working motor output torque, n wm is the rotating speed, and F resist represents the resistance from the road surface, which is theoretically related to the velocity, garbage density, and the rotating speed of the working motor. Based on the experiment data, it can be summarized that the variations of vehicle speed and garbage density exerted very limited impact on the motor torque, which is also testified in Figure 5. Consequently, assuming being immune from those minor factors, the relationship between power and the rotating speed of the working motor could be approximated mathematically.
Furthermore, it is evident the clean force, which ranged from 0~1, representing normalized cleaning ability of the vehicle, increased with the power of the motor but decreased with the vehicle velocity. Hence, the discount factor γ dependent on velocity is defined and the cleaning effect of the vehicle is formulated as where δ garbage represents the garbage density on the road, g/m 2 ; γ represents the discount factor; φ(.) represents the functional relationship between clean force and motor power; and ψ res represents the garbage residue, g/m 2 . All functional relationships, which include f (.) in Equation (6) and φ(.), γ(, ) in Equation (7), are modeled with the non-linear least square algorithm. The modeling method is formulated as: where m is the number of samples, y i is the real value, f (.) represents the relationships to be established, and β is a vector whose elements are decided by minimizing the S. The power requirement model of the working motor is illustrated in Figure 10.
1, x FOR PEER REVIEW 9 of 23 where represents the garbage density on the road, g/m 2 ; represents the discount factor; (. ) represents the functional relationship between clean force and motor power; and represents the garbage residue, g/m 2 . All functional relationships, which include (. ) in Equation (6) and (. ), (, ) in Equation (7), are modeled with the non-linear least square algorithm. The modeling method is formulated as: where is the number of samples, is the real value, (. ) represents the relationships to be established, and is a vector whose elements are decided by minimizing the . The power requirement model of the working motor is illustrated in Figure 10.

Modeling the Garbage Distribution on the Road Surface
Conventionally, garbage density with the unit as kg/m 2 is used to describe the amount of garbage, which is a broad category including waste and dust. The target street where the sanitation vehicles work was a typical one stretching from downtown to the suburb. Through field observation, it was concluded that the garbage density distribution was significantly influenced by the green belts, which periodically appeared along the roadside, bringing fallen plant tissues, and commercial regions with strong fluidity. Thus, the garbage distribution model was established as the following steps: Step 1: cleaning levels ranging from 1 to 4 were defined to preliminarily classify the different segments of the road. Segments labeled with the high cleaning level had more garbage. Specifically, level 1 aimed at relatively clean segments without green belts alongside; level 2 was applied to the segments with some fallen tissues or garbage; then, segments marked level 3 had more garbage or leaves that were simple to clean. However, there were more or less some segments covered by materials such as oil stains or squashed dirt, making it tougher to wash thoroughly, for which the highest level 4 was defined. Some examples are shown in Figure 11.

Modeling the Garbage Distribution on the Road Surface
Conventionally, garbage density with the unit as kg/m 2 is used to describe the amount of garbage, which is a broad category including waste and dust. The target street where the sanitation vehicles work was a typical one stretching from downtown to the suburb. Through field observation, it was concluded that the garbage density distribution was significantly influenced by the green belts, which periodically appeared along the roadside, bringing fallen plant tissues, and commercial regions with strong fluidity. Thus, the garbage distribution model was established as the following steps: Step 1: cleaning levels ranging from 1 to 4 were defined to preliminarily classify the different segments of the road. Segments labeled with the high cleaning level had more garbage. Specifically, level 1 aimed at relatively clean segments without green belts alongside; level 2 was applied to the segments with some fallen tissues or garbage; then, segments marked level 3 had more garbage or leaves that were simple to clean. However, there were more or less some segments covered by materials such as oil stains or squashed dirt, making it tougher to wash thoroughly, for which the highest level 4 was defined. Some examples are shown in Figure 11.  Step 2: The garbage density distribution was a result of multiple factors involving both natural and artificial influence. However, a qualified model could be established based on the fact that garbage density was probably continuous and followed historical patterns. Therefore, statistically, the garbage density on a certain level of segment tended to be normally distributed, whose mean value and standard deviation could be assumed reasonably.
Step 3: A certain garbage density value generated from the normal distribution model in step 2 was probably maintained for tens of meters on the segment, which could similarly be estimated based on the actual condition.
Step 4 resulted from multiple uncertain factors, and the fluctuation of the garbage density was inevitable. Hence, different sets of white Gaussian noise were added to the garbage model to reflect the randomness of the real scenario.
The modeling process is illustrated by a flow chart shown in Figure 12.  It is worth noting that, apart from segment levels 1 to 3 that were affected by regular factors such as artificial green belts, the segments marked level 4 were almost unforeseeable, occurring less but more randomly and continuing for a much shorter distance in practice. Step 2: The garbage density distribution was a result of multiple factors involving both natural and artificial influence. However, a qualified model could be established based on the fact that garbage density was probably continuous and followed historical patterns. Therefore, statistically, the garbage density on a certain level of segment tended to be normally distributed, whose mean value and standard deviation could be assumed reasonably.
Step 3: A certain garbage density value generated from the normal distribution model in step 2 was probably maintained for tens of meters on the segment, which could similarly be estimated based on the actual condition.
Step 4 resulted from multiple uncertain factors, and the fluctuation of the garbage density was inevitable. Hence, different sets of white Gaussian noise were added to the garbage model to reflect the randomness of the real scenario.
The modeling process is illustrated by a flow chart shown in Figure 12. Step 2: The garbage density distribution was a result of multiple factors involving both natural and artificial influence. However, a qualified model could be established based on the fact that garbage density was probably continuous and followed historical patterns. Therefore, statistically, the garbage density on a certain level of segment tended to be normally distributed, whose mean value and standard deviation could be assumed reasonably.
Step 3: A certain garbage density value generated from the normal distribution model in step 2 was probably maintained for tens of meters on the segment, which could similarly be estimated based on the actual condition.
Step 4 resulted from multiple uncertain factors, and the fluctuation of the garbage density was inevitable. Hence, different sets of white Gaussian noise were added to the garbage model to reflect the randomness of the real scenario.
The modeling process is illustrated by a flow chart shown in Figure 12.  Figure 13 displays one of the possible garbage distribution models. It is worth noting that, apart from segment levels 1 to 3 that were affected by regular factors such as artificial green belts, the segments marked level 4 were almost unforeseeable, occurring less but more randomly and continuing for a much shorter distance in practice.  Figure 13 displays one of the possible garbage distribution models. It is worth noting that, apart from segment levels 1 to 3 that were affected by regular factors such as artificial green belts, the segments marked level 4 were almost unforeseeable, occurring less but more randomly and continuing for a much shorter distance in practice.

The Prediction Method
Since the vehicle was clean task-oriented, any optimal manipulations were radically decided by the garbage density ahead on the road surface. Considering the vehicle velocity was almost constantly under the working mode, the garbage density sequence in the spatial domain could be equivalently regarded as a time series. Therefore, on the spatial prediction horizon, the predictor could anticipate the garbage density series with the methods dealing with time series. To make the description uniform , the rest of this study discusses problems from the perspective of the spatial domain.
Due to the high autocorrelation of the garbage density distribution, the future values could be forecasted using the past values in the same series by the predictor. The autoregressive relationship to be modeled can be formulated as where ( ) represents the value of the garbage density series at displacement ; ( − 1), ( − 2),… represent the past values of the series; (. ) represents the unknown non-linear relationships to be approximated, represents the number of past values, and _ stands for the error of the approximation. However, for the garbage density series, the relationships (. ) might have varied with different segments of the road, which made the predictors using a Markov chain [15] or ARIMA [11] suitable for nonstationary "time" series perform poorly. Hence, the nonlinear autoregressive neural network (NARNN) was implemented, which deals effectively with "time" series characterized by frequent variations and high nonstationarity. The discrete, non-linear, autoregressive model build by NARNN can be precisely written as [26]: (11) where represents the number of the past values (or delays) used as the input to predict; represents the number of hidden layer neurons in NARNN with the activation function defined as (. )-specifically, each neuron can approximate a unique relationship (. ); represents the parameter corresponding to the weight of the connection between the input neural cell and the hidden cell ; stands for the weight of the connection between the hidden cell and the output cell; and 0 and 0 are the constants respectively corresponding to the hidden cell and the output cell. The general working principle of the predictor is shown in Figure 14.

The Prediction Method
Since the vehicle was clean task-oriented, any optimal manipulations were radically decided by the garbage density ahead on the road surface. Considering the vehicle velocity was almost constantly under the working mode, the garbage density sequence in the spatial domain could be equivalently regarded as a time series. Therefore, on the spatial prediction horizon, the predictor could anticipate the garbage density series with the methods dealing with time series. To make the description uniform, the rest of this study discusses problems from the perspective of the spatial domain.
Due to the high autocorrelation of the garbage density distribution, the future values could be forecasted using the past values in the same series by the predictor. The autoregressive relationship to be modeled can be formulated as where δ(k) represents the value of the garbage density series at displacement k; δ(k − 1), δ(k − 2), . . . represent the past values of the series; f (.) represents the unknown non-linear relationships to be approximated, p represents the number of past values, and ε_t stands for the error of the approximation. However, for the garbage density series, the relationships f (.) might have varied with different segments of the road, which made the predictors using a Markov chain [15] or ARIMA [11] suitable for nonstationary "time" series perform poorly. Hence, the non-linear autoregressive neural network (NARNN) was implemented, which deals effectively with "time" series characterized by frequent variations and high nonstationarity. The discrete, non-linear, autoregressive model build by NARNN can be precisely written as [26]: where p represents the number of the past values (or delays) used as the input to predict; m represents the number of hidden layer neurons in NARNN with the activation function defined as φ(.)-specifically, each neuron can approximate a unique relationship f (.); β ij represents the parameter corresponding to the weight of the connection between the input neural cell i and the hidden cell j; α j stands for the weight of the connection between the hidden cell j and the output cell; and β 0j and α 0 are the constants respectively corresponding to the hidden cell and the output cell. The general working principle of the predictor is shown in Figure 14. To guarantee the calculation efficiency, the learning rule of the NARNN uses the Levenberg-Marquardt backpropagation-type algorithm procedure (LMBP), which is regarded as the fastest of the category. Furthermore, the number of hidden layers and neurons per layer for NARNN is completely flexible and can be optimized through a trialand-error procedure to get the best performance [27].
Increasing the number of neurons helps the NARNN simulate more complex relationships and improve the generalization ability, while decreasing that number alleviates the computation burden and contributes to the real-time requirement. Consequently, the NARNN was set to have 20 neurons in 1 hidden layer, which could well trace the variation tendency of the garbage density, and the delay of the spatial series was set to be 11 steps optimally. Established on the Matlab platform, the open-loop training process of the NARNN is shown in Figure 15. It is critical to bear in mind that the essence of ensuring the performance of a neural network is whether broad possible instances are covered in the training set. Since the garbage distribution model constituted segments involving all likely scenarios, the training set was generated from the former 75% of data in the established model ( Figure 13). Thus, the parameters of the NARNN predictor mentioned in the Equation (11)    To guarantee the calculation efficiency, the learning rule of the NARNN uses the Levenberg-Marquardt backpropagation-type algorithm procedure (LMBP), which is regarded as the fastest of the category. Furthermore, the number of hidden layers and neurons per layer for NARNN is completely flexible and can be optimized through a trial-and-error procedure to get the best performance [27].
Increasing the number of neurons helps the NARNN simulate more complex relationships and improve the generalization ability, while decreasing that number alleviates the computation burden and contributes to the real-time requirement. Consequently, the NARNN was set to have 20 neurons in 1 hidden layer, which could well trace the variation tendency of the garbage density, and the delay of the spatial series was set to be 11 steps optimally. Established on the Matlab platform, the open-loop training process of the NARNN is shown in Figure 15.
To guarantee the calculation efficiency, the learning rule of the NARNN uses th venberg-Marquardt backpropagation-type algorithm procedure (LMBP), which i garded as the fastest of the category. Furthermore, the number of hidden layers and rons per layer for NARNN is completely flexible and can be optimized through a and-error procedure to get the best performance [27].
Increasing the number of neurons helps the NARNN simulate more complex tionships and improve the generalization ability, while decreasing that number allev the computation burden and contributes to the real-time requirement. Consequently NARNN was set to have 20 neurons in 1 hidden layer, which could well trace the vari tendency of the garbage density, and the delay of the spatial series was set to be 11 optimally. Established on the Matlab platform, the open-loop training process o NARNN is shown in Figure 15. It is critical to bear in mind that the essence of ensuring the performance of a n network is whether broad possible instances are covered in the training set. Since the bage distribution model constituted segments involving all likely scenarios, the trai set was generated from the former 75% of data in the established model ( Figure 13). T the parameters of the NARNN predictor mentioned in the Equation (11) could be upd with supervised learning. The performance of the trained neural network is shown in ure 16 and the final precision was appropriate, with a mean square error (MSE) equ 1.35. Details of training performance of NARNN predictor are shown in Figure 17. It is critical to bear in mind that the essence of ensuring the performance of a neural network is whether broad possible instances are covered in the training set. Since the garbage distribution model constituted segments involving all likely scenarios, the training set was generated from the former 75% of data in the established model ( Figure 13). Thus, the parameters of the NARNN predictor mentioned in the Equation (11) could be updated with supervised learning. The performance of the trained neural network is shown in Figure 16   After the open-loop training was accomplished, as Figure 18 displays, the network was converted to a closed loop for prediction. Supplied with the initial values, the future garbage density ( + 1), ( + 2), … ( + ) series could be iteratively forecasted using the latest past values, including the predicted ones [28]. Nevertheless, with the increase of prediction horizon length, the error inevitably accumulates because of the iteration process. To restrain it within an acceptable range, the NARNN predictor updates the past values with the real detected ones as the prediction horizon moves forward, which can be realized with a visual sensor used to estimate the garbage density. The length of the prediction zone was set to be 20 m.
In addition, two adaptive measures for the NARNN predictor were taken: For one thing, aiming at saltation tended to cause inaccurate prediction, especially those between level 4 and the others. One possible solution was to lessen the length of the prediction zone to 10 m when detecting such dramatic changes [29]. The demonstration of real-time feasibility is completed in Section 3.2.
For another, nerual networks trained by different training sets can be switched to better adapt to improve the generalization, when a certain pattern of the garbage distribution was recognized [30]. Different types of training sets could be obtained from the garbage model statistically established on the observations from different garbage condition roads.
Principal parameters of the NARNN predictor are listed in Table 4. After the open-loop training was accomplished, as Figure 18 displays, the network was converted to a closed loop for prediction. Supplied with the initial p values, the future garbage density δ(k + 1), δ(k + 2), . . . δ(k + L) series could be iteratively forecasted using the latest past p values, including the predicted ones [28]. Nevertheless, with the increase of prediction horizon length, the error inevitably accumulates because of the iteration process. To restrain it within an acceptable range, the NARNN predictor updates the past values with the real detected ones as the prediction horizon moves forward, which can be realized with a visual sensor used to estimate the garbage density. The length of the prediction zone was set to be 20 m. After the open-loop training was accomplished, as Figure 18 displays, the n was converted to a closed loop for prediction. Supplied with the initial values ture garbage density ( + 1), ( + 2), … ( + ) series could be iteratively for using the latest past values, including the predicted ones [28]. Nevertheless, w increase of prediction horizon length, the error inevitably accumulates because o eration process. To restrain it within an acceptable range, the NARNN predictor u the past values with the real detected ones as the prediction horizon moves fo which can be realized with a visual sensor used to estimate the garbage densi length of the prediction zone was set to be 20 m.
In addition, two adaptive measures for the NARNN predictor were taken: For one thing, aiming at saltation tended to cause inaccurate prediction, es those between level 4 and the others. One possible solution was to lessen the lengt prediction zone to 10 m when detecting such dramatic changes [29]. The demon of real-time feasibility is completed in Section 3.2.
For another, nerual networks trained by different training sets can be swit better adapt to improve the generalization, when a certain pattern of the garbag bution was recognized [30]. Different types of training sets could be obtained fr garbage model statistically established on the observations from different garbage tion roads.
Principal parameters of the NARNN predictor are listed in Table 4.   In addition, two adaptive measures for the NARNN predictor were taken: For one thing, aiming at saltation tended to cause inaccurate prediction, especially those between level 4 and the others. One possible solution was to lessen the length of the prediction zone to 10 m when detecting such dramatic changes [29]. The demonstration of real-time feasibility is completed in Section 3.2.
For another, nerual networks trained by different training sets can be switched to better adapt to improve the generalization, when a certain pattern of the garbage distribution was recognized [30]. Different types of training sets could be obtained from the garbage model statistically established on the observations from different garbage condition roads.
Principal parameters of the NARNN predictor are listed in Table 4. To avoid the overlap, the NARNN took the remaining 25% of the data in the model as the test set. The prediction result of the NARNN is shown in Figure 19. Some magnified details are displayed in Figure 20 to highlight the predictor performance in particular regions.
Appl. Sci. 2021, 11, x FOR PEER REVIEW 14 of 23 details are displayed in Figure 20 to highlight the predictor performance in particular re gions.
Compared with the ARIMA model, which sometimes either overshot or dropped in advance [11], the NARNN seized the changing tendency of predicted series a little better As Figure 20 shows, predicted values were a little more responsive and better anticipated the trends in the transitional areas from 400 m to 500 m or from 2400 m to 2600 m whereas in an area from 1600 m to 2200 m, the range of predicted values better covered the real ones. In other words, the random fluctuations of real values in the flat distribution area were truly reflected.  For sanitation vehicles, such improvement might be critical because the constraints on garbage residue are so strict that a few wrong forecasts probably make the vehicle fai to accomplish clean tasks, which are manifested in Section 4.  Figure 20 to highlight the predictor performance in particular regions.
Compared with the ARIMA model, which sometimes either overshot or dropped in advance [11], the NARNN seized the changing tendency of predicted series a little better. As Figure 20 shows, predicted values were a little more responsive and better anticipated the trends in the transitional areas from 400 m to 500 m or from 2400 m to 2600 m, whereas in an area from 1600 m to 2200 m, the range of predicted values better covered the real ones. In other words, the random fluctuations of real values in the flat distribution area were truly reflected.  For sanitation vehicles, such improvement might be critical because the constraints on garbage residue are so strict that a few wrong forecasts probably make the vehicle fail to accomplish clean tasks, which are manifested in Section 4.
Statistically, the mean and the maximum values of the root mean square error (RMSE, depicting deviations between real and predicted value) of all prediction zones were used Compared with the ARIMA model, which sometimes either overshot or dropped in advance [11], the NARNN seized the changing tendency of predicted series a little better. As Figure 20 shows, predicted values were a little more responsive and better anticipated the trends in the transitional areas from 400 m to 500 m or from 2400 m to 2600 m, whereas in an area from 1600 m to 2200 m, the range of predicted values better covered the real ones. In other words, the random fluctuations of real values in the flat distribution area were truly reflected.
For sanitation vehicles, such improvement might be critical because the constraints on garbage residue are so strict that a few wrong forecasts probably make the vehicle fail to accomplish clean tasks, which are manifested in Section 4.
Statistically, the mean and the maximum values of the root mean square error (RMSE, depicting deviations between real and predicted value) of all prediction zones were used to evaluate the performance of the NARNN, which is illustrated in Figure 21.  For sanitation vehicles, such improvement might be critical because th on garbage residue are so strict that a few wrong forecasts probably make th to accomplish clean tasks, which are manifested in Section 4.
Statistically, the mean and the maximum values of the root mean square depicting deviations between real and predicted value) of all prediction zon to evaluate the performance of the NARNN, which is illustrated in Figure 21

The Framework of AMPC-Based EMS
Endowed with the prediction ability by the NARNN predictor proposed above, an AMPC based EMS can be established to optimize the power distribution and ensure clean tasks. Therefore, a control-oriented system can be expressed as In the formulation, the vehicle acceleration a v m/s 2 and the rotating speed of the working motor n wm (rpm) are considered the control variables, noted as a vector → u = [a v , n wm ]; the SoC of the battery, vehicle velocity v (km/h), and working time t (s) are regarded as state variables, noted as a vector → x = [SoC, v, t]; and the garbage density δ g/m 2 is considered a disturbance variable, noted as d = δ predict , which is predicted by the NARNN predictor.
With the spatial step (∆x) set to 1 m, for each spatial step, the discrete cost-to-go function J n is written as Equation (13): where J n (kW·h) represents the cost function on the prediction zone at the n step, which minimizes the electricity consumption of the sanitation vehicle represented by that of the working motor and the driving motor; L represents the length (m) of the prediction zone; v(n) represents the velocity (km/h) per spatial step; and the P wm (n) and the P dm (n) represent the electric power (kW) consumed by the working motor and the driving motor per spatial step, respectively. As mentioned before, the former can be calculated by Equations (3) and (6), whereas the latter one can be obtained by Equation (2).
Since the solution to this problem is a multistage process requiring a sequence of interrelated decisions, a local DP solver was implemented on the prediction zone to obtain optimal control series [30,31]. Based on the power requirement model (Equations (6) and (7)) and the predicted garbage, the DP optimized the cost function through relationships expressed as v(k + 1) = 3.6· 2a(k)∆x + (v(k)/3.6) 2 (14) additionally, the parameters in the optimization process should be subject to the following constraints: where vectors v(k), SoC(k), and a(k ) represent vehicle velocity, km/h, battery SoC, As, and vehicle acceleration, m/s 2 at spatial step k, respectively; T dm_lb , T dm_ub , T wm_lb , and T wm_ub represent the lower and upper bound torque of the driving motor and the working motor, N·m, respectively; n dm_lb , n dm_ub , n wm_lb , and n wm_ub represent the lower and upper bound rotating speed of the driving motor and the working motor, rpm, respectively; I bat and V bat represent the current and voltage of the battery; and ψ res and ψ de_res represent the residue and required residue of the garbage density, which was set to 13 g/m 2 referring to governmental regulations. Setting the initial 11 spatial delays, the predictor continues the prediction process by updating the past 11 values with the detected real values. After having the first one of the acquired optimal control sequences performed, the vehicle updates state variables and repeats the whole process as moving to the next spatial step. The working principle of the AMPC-based EMS is illustrated in Figure 22.
Appl. Sci. 2021, 11, x FOR PEER REVIEW 16 of 23 _ represent the residue and required residue of the garbage density, which was set to 13 g/m 2 referring to governmental regulations.
Setting the initial 11 spatial delays, the predictor continues the prediction process by updating the past 11 values with the detected real values. After having the first one of the acquired optimal control sequences performed, the vehicle updates state variables and repeats the whole process as moving to the next spatial step. The working principle of the AMPC-based EMS is illustrated in Figure 22. It is believed that the increase in the prediction zone length helps to improve global optimization but leads to a heavy computation burden. To testify the real-time implemen- It is believed that the increase in the prediction zone length helps to improve global optimization but leads to a heavy computation burden. To testify the real-time implementation feasibility, the proposed AMPC-based EMS was implemented in an industrial personal computer (IPC, Inter Core i7-3610QM @2.3GHz) shown in Figure 23. The vehicle was assumed to travel at 8 km/h due to almost constant velocity. Then, 20 m for the prediction zone equaled to 9 s on the time domain. The calculation time of the AMPC-based EMS on each prediction zone ranging from 0.3~0.45 s was far less than the predicted range (3%), which was also suitable for the shortened 10 m prediction zone because the calculating time was saved simultaneously. Thus, its real-time performance was well substantiated. It is believed that the increase in the prediction zone length helps to improve global optimization but leads to a heavy computation burden. To testify the real-time implementation feasibility, the proposed AMPC-based EMS was implemented in an industrial personal computer (IPC, Inter Core i7-3610QM @2.3GHz) shown in Figure 23. The vehicle was assumed to travel at 8 km/h due to almost constant velocity. Then, 20 m for the prediction zone equaled to 9 s on the time domain. The calculation time of the AMPC-based EMS on each prediction zone ranging from 0.3 s~0.45 s was far less than the predicted range (3%), which was also suitable for the shortened 10 m prediction zone because the calculating time was saved simultaneously. Thus, its real-time performance was well substantiated.  Additionally, other existing EMSs, including the rule-based EMS and the DP-based EMS, were proposed to comprehensively evaluate the energy economy performance of the AMPC-based MPC.

Other Existing EMSs
First, a rule-based EMS was proposed based on the experience of drivers working on the target road. Those rules can be roughly summarized as two combinations of the vehicle velocity and the working motor rotating speed: One was 8 km/h grouped with 1700 rpm; the other was 4 km/h coupled with 1900 rpm. With visual judgment, drivers chose the former combination to deal with ordinary situations involving segments marked from levels 1 to 3, whereas the latter one was especially executed to deal with situations that level 4 describes. Although well adapted to different scenarios, this conservative EMS was expected to realize more economic improvement.
The simple manipulation logic is illustrated in Figure 24. Secondly, assuming the garbage density distribution is completely known, which uses the garbage model as a substitution, a DP-based EMS can be proposed. Applied to the global rather than the local prediction zone, the DP-based EMS had the same kinetic relationships and constraints as the local DP solver in the AMPC framework. Thus, the cost function of the DP-based EMS is expressed as where X 0 and X n are the start and the final point of the solution domain decided by the garbage distribution model. In the last section, the optimization result represented by the SoC variation of the DP-based EMS is used as a benchmark to evaluate the electricity consumption performance of the AMPC-based EMS.
Appl. Sci. 2021, 11, x. https://doi.org/10.3390/xxxxx www.mdpi.com/jou 1700 rpm; the other was 4 km/h coupled with 1900 rpm. With visual judgment chose the former combination to deal with ordinary situations involving s marked from levels 1 to 3, whereas the latter one was especially executed to d situations that level 4 describes. Although well adapted to different scenarios, servative EMS was expected to realize more economic improvement. The simple manipulation logic is illustrated in Figure 24.  Secondly, assuming the garbage density distribution is completely know uses the garbage model as a substitution, a DP-based EMS can be proposed. Ap the global rather than the local prediction zone, the DP-based EMS had the sam relationships and constraints as the local DP solver in the AMPC framework. T cost function of the DP-based EMS is expressed as    Figure 24 shows garbage residue distributions under different control strategies. fixed control variable combinations, which were the standard and the extreme mod the working motor coupled with vehicle speed at 8 km/h, were also included in the c parison. Some critical details of Figure 24 are magnified in Figure 25.

Comparison of the Garbage Residue
From the perspective of mission accomplishment, except for the fixed stand mode, all strategies met the requirement with varying degrees. It is noted that DP and AMPC took full advantage of the regulation by restricting the residues within a nar area very close to the threshold. By contrast, the rule-based strategy performed more sirably because of the low average of residues. Besides, there is no doubt that the f extreme mode can realize the best cleaning effect regardless of economic energy cost Specifically, similar to the DP result, the adaptive AMPC strategy left residues a higher and in a slenderer range, partly because the limited prediction zone disenabled AMPC to foresee the garbage variation on a larger spatial scale. As mentioned in Sec 3.1, in the parameter tuning process of the NARNN predictor, the AMPC occasionally a few residues exceed the limit, manifesting the necessity of the accuracy improvem compared with the ARIMA model.
Statistical indicators for each strategy are summarized in Table 5, where the unit g/m 2 .   From the perspective of mission accomplishment, except for the fixed standard mode, all strategies met the requirement with varying degrees. It is noted that DP and the AMPC took full advantage of the regulation by restricting the residues within a narrow area very close to the threshold. By contrast, the rule-based strategy performed more desirably because of the low average of residues. Besides, there is no doubt that the fixed extreme mode can realize the best cleaning effect regardless of economic energy cost.
Specifically, similar to the DP result, the adaptive AMPC strategy left residues a little higher and in a slenderer range, partly because the limited prediction zone disenabled the AMPC to foresee the garbage variation on a larger spatial scale. As mentioned in Section 3.1, in the parameter tuning process of the NARNN predictor, the AMPC occasionally had a few residues exceed the limit, manifesting the necessity of the accuracy improvement compared with the ARIMA model.
Statistical indicators for each strategy are summarized in Table 5, where the units are g/m 2 .  Compared TO the rule-based strategy, optimization methods such as DP and the AMPC decreased battery SoC more evenly. Particularly, each time a sanitation vehicle encountered segments marked level 4, the expended electric energy increased dramatically, which is highlighted in the enlarged red rectangle. As a result, more segments that were difficult to clean up occurred on the road, and more energy was "wasted" by the rule-based strategy. For two fixed working modes, on the one hand, it is appropriate that the extreme mode consumed the most energy. On the other hand, it seems that the standard mode performed a little worse than DP but won on simplicity by making some sacrifices on garbage residue.
The results of the different strategies are evaluated in Table 6 with the rule-based one used as the baseline.  Compared TO the rule-based strategy, optimization methods such as DP and the AMPC decreased battery SoC more evenly. Particularly, each time a sanitation vehicle encountered segments marked level 4, the expended electric energy increased dramatically, which is highlighted in the enlarged red rectangle. As a result, more segments that were difficult to clean up occurred on the road, and more energy was "wasted" by the rule-based strategy. For two fixed working modes, on the one hand, it is appropriate that the extreme mode consumed the most energy. On the other hand, it seems that the standard mode performed a little worse than DP but won on simplicity by making some sacrifices on garbage residue.
The results of the different strategies are evaluated in Table 6 with the rule-based one used as the baseline.

Comparison of Other Variables
The distribution of optimal control variable sequences is shown in Figure 27. Compared with the MPC-based EMS, the rotating speed of working motor and velocity under DP control was more dispersed and had a slightly lower average, which reflects that DP tended to have the control variables change evenly and realize the global optimization because of grasping global information.
Appl. Sci. 2021, 11, x. https://doi.org/10.3390/xxxxx www.mdpi.com/journal/applsc fices on garbage residue. The results of the different strategies are evaluated in Table 6 with the rule-based one used as the baseline.

Comparison of Other Variables
The distribution of optimal control variable sequences is shown in Figure 27. Compared with the MPC-based EMS, the rotating speed of working motor and velocity under DP control was more dispersed and had a slightly lower average, which reflects that DP tended to have the control variables change evenly and realize the global optimization because of grasping global information.  As shown in Figure 28a, for one thing, the general mode of the working motor was dismissed by all three strategies, indicating that excessively low power of the working motor is not adequate for cleaning jobs in the target environment. For another, based on the distribution of optimization method, the regular rotating speed set at about 1700 rpm in the rule-based strategy proved to be reasonable for a fixed deterministic strategy. However, optimal results indicated that more accurate and flexible adjustments ranging from 1675 rpm to 1750 rpm tend to be a better choice.
ci. 2021, 11, x FOR PEER REVIEW 20 of 23 As shown in Figure 28a, for one thing, the general mode of the working motor was dismissed by all three strategies, indicating that excessively low power of the working motor is not adequate for cleaning jobs in the target environment. For another, based on the distribution of optimization method, the regular rotating speed set at about 1700 rpm in the rule-based strategy proved to be reasonable for a fixed deterministic strategy. However, optimal results indicated that more accurate and flexible adjustments ranging from 1675 rpm to 1750 rpm tend to be a better choice.
In terms of velocity, which is shown in Figure 28b, both optimization strategies had a higher range than the rule-based one. Because higher velocity meant a greater discount on the cleaning effect of the working motor, there was redundant power of the rule-based strategy that could be spared. Hence, regular vehicle speed might be more desirably set at 8.5 km/h for ordinary segments like level 1-level3, and at 7 km/h for the extreme conditions. Moreover, due to a lack of flexibility on the adjustment of vehicle velocity, the rule-based strategy failed to decrease the consumed power in some regions as 500-700 m or 2200-2500 m, which are shown in Figure 29a, leading to unnecessary energy loss. Details about vehicle operating states are shown in Figure 29. It is noted that the DPbased EMS performed a few more combinations of high rotating speed and low velocity than the AMPC, which explains the reason why the SoC under the AMPC decreased a little more slowly but had more garbage residue left than DP during the early stage in In terms of velocity, which is shown in Figure 28b, both optimization strategies had a higher range than the rule-based one. Because higher velocity meant a greater discount on the cleaning effect of the working motor, there was redundant power of the rule-based strategy that could be spared. Hence, regular vehicle speed might be more desirably set at 8.5 km/h for ordinary segments like level 1-level3, and at 7 km/h for the extreme conditions. Moreover, due to a lack of flexibility on the adjustment of vehicle velocity, the rule-based strategy failed to decrease the consumed power in some regions as 500-700 m or 2200-2500 m, which are shown in Figure 29a, leading to unnecessary energy loss. Details about vehicle operating states are shown in Figure 29. It is noted that the DPbased EMS performed a few more combinations of high rotating speed and low velocity than the AMPC, which explains the reason why the SoC under the AMPC decreased a little more slowly but had more garbage residue left than DP during the early stage in Figures 25 and 26.
Additionally, considering the sanitation vehicle equipped a full water-filled tank, frequent changes of acceleration should be avoided in pursuit of driving comfort. Hence, it is rational that both DP and the AMPC restricted acceleration to fluctuating slightly between 0.5 m/s 2 and −0.5 m/s 2 , as shown in Figure 30.  As shown in Figure 28b, due to the higher average velocity, the AMPC proposed in this study lessened operating time from 35 min to 29 min, which was a 16.7% improve- Details about vehicle operating states are shown in Figure 29. It is noted that the DPbased EMS performed a few more combinations of high rotating speed and low velocity than the AMPC, which explains the reason why the SoC under the AMPC decreased a little more slowly but had more garbage residue left than DP during the early stage in Figures 25 and 26.
Additionally, considering the sanitation vehicle equipped a full water-filled tank, frequent changes of acceleration should be avoided in pursuit of driving comfort. Hence, it is rational that both DP and the AMPC restricted acceleration to fluctuating slightly between 0.5 m/s 2 and −0.5 m/s 2 , as shown in Figure 30.  As shown in Figure 28b, due to the higher average velocity, the AMPC this study lessened operating time from 35 min to 29 min, which was a 16.7 ment by contrast with the rule-based strategy.Time cost of each strategy is sh 7. Eventually, some guidelines for driver manipulations were concluded sonably adjusting regular driving speed to be a little higher and decreasing t highest rotating speed of the working motor as much as possible. In additio pure manual operation, accurate and flexible manipulations of the real-time A EMS could be realized with the assistance of automatic driving.

Conclusions and Future Work
The main conclusions can be drawn as follows: 1. Through the energy consumption analysis of principal devices onboa cluded that the working motor and the driving motor were the main which took up to 89% of the total electricity consumption. 2. Based on the practical observation of the target route, a universal garb tion modeling method can be concluded, which covers almost all scena level 1 to level 4. As shown in Figure 28b, due to the higher average velocity, the AMPC proposed in this study lessened operating time from 35 min to 29 min, which was a 16.7% improvement by contrast with the rule-based strategy. Time cost of each strategy is shown in Table 7. Eventually, some guidelines for driver manipulations were concluded, such as reasonably adjusting regular driving speed to be a little higher and decreasing the use of the highest rotating speed of the working motor as much as possible. In addition, instead of pure manual operation, accurate and flexible manipulations of the real-time AMPC-based EMS could be realized with the assistance of automatic driving.

Conclusions and Future Work
The main conclusions can be drawn as follows: 1.
Through the energy consumption analysis of principal devices onboard, it is concluded that the working motor and the driving motor were the main contributors, which took up to 89% of the total electricity consumption.

2.
Based on the practical observation of the target route, a universal garbage distribution modeling method can be concluded, which covers almost all scenarios marked level 1 to level 4.

3.
A clean task-oriented AMPC-based EMS using the NARNN as the predictor is proposed, and its real-time feasibility was verified. Compared with the rule-based EMS, it was substantiated that the AMPC-based EMS saved electricity by 15.53%, and well approximated the global optimization of the DP-based EMS. Additionally, the working time was lessened from 35 min to 29 min (17.14%) by the AMPC-based EMS.
Future work might focus on cleaning trajectory optimization for sanitation vehicles fleets or developing unmanned sanitation vehicles that automatically recognize the garbage type with visual sensors and better conduct manipulations obtained by optimized control strategies. Since the battery is aging with the practical application, the EMS considering battry state of health is also worthwhile to explore in future studies.