Energy-Efficient Speed Profile Optimization and Sliding Mode Speed Tracking for Metros

Abstract: Nowadays, most metro vehicles are equipped with an automatic train operation (ATO) system, and a speed control method that combines cruise speed planning with proportional-integral-derivative (PID) control is widely used. Automation has thus been achieved, but energy efficiency can still be improved. This paper presents an improved artificial bee colony algorithm for speed profile optimization with coast mode and an adaptive terminal sliding mode method for speed tracking. Specifically, a multi-objective optimization model is established that considers energy consumption, comfortableness, and punctuality. Then, a novel artificial bee colony algorithm, named regional reinforcement artificial bee colony (RR-ABC), is designed to search for the optimal speed profile with coast mode; several improvements are made to speed up convergence and to avoid local optima. For speed-tracking control, an adaptive terminal sliding mode controller (ATSMC) is used to reduce the speed error and to improve robustness and energy saving. In addition, a disturbance observer (DOB) is designed to improve the anti-interference ability and robustness of the system, which also helps to reduce the speed error and save energy. Finally, the line and train data of Qingdao Metro Line 6 are used for simulation, which demonstrates the effectiveness of the study. Compared with conventional algorithms, the energy saving rate is approximately 9.55% for RR-ABC with coast mode and 7.58% for ATSMC + DOB.


Introduction
In the modern metro system, the quality of operation, including energy saving, punctuality, and comfortableness, is the core concern [1]. With the popularization of the Automatic Train Operation (ATO) system, the automation of train operation has been achieved. However, the performance of the algorithms used in ATO systems still leaves room for improvement [2]. Inaccuracy and high energy consumption seriously affect the quality of train operation.
A metro system needs to switch operation modes frequently because trains start and stop frequently. The ATO system uses the planned speed profile as a reference for issuing control commands, such as acceleration, braking, and coasting [3]. In practice, the PID controller is widely used to realize speed control, and the control force fluctuates when the coasting mode is needed [4]. This problem eventually leads to larger speed errors and poor energy saving. To improve the tracking quality, many papers have tried to find a better substitute for the PID controller. A predictive controller has been presented to track the speed profile. In Reference [5], an integral sliding mode control (ISMC) is used to design distributed adaptive control strategies. In Reference [6], a tracking controller is formulated to resolve errors caused by unmodeled effects.
In addition, for the intelligent evolutionary algorithm mentioned above, there are always two problems to be solved. Firstly, maintaining good population diversity to avoid a local optimal solution while evolving rapidly is always a problem [33]. Secondly, the initial value of the population has a significant impact on the optimization result, so how to obtain a better initial value is worth studying.
To solve these problems, the artificial bee colony (ABC) algorithm, which shows good global search ability, may be used in speed profile optimization [34]. However, though good at global search, the ABC algorithm is poor in local search. When it is close to the optimal solution, the search efficiency drops sharply in a local region. To make up for this deficiency, a regional reinforcement artificial bee colony (RR-ABC) algorithm is proposed, to optimize the speed profile. By using chaotic mirror initialization, local search strategy, and the elite group guidance strategy, the RR-ABC algorithm maintains better population diversity and enhances search efficiency.
The specific structure is as follows. Section 2 shows the dynamic model and some special restrictions of metro. Section 3 establishes a speed profile optimization calculation model which considers energy consumption, comfortableness, and punctuality. In Section 4, the RR-ABC algorithm is proposed, to optimize the speed profile. In Section 5, to track the speed, we propose the ATSMC+DOB controller. In Section 6, the validity of the method is verified based on the practical data of Qingdao Metro Line 6. Finally, we conclude the study in Section 7.

Dynamic Characteristics of Metro Trains
There are two common dynamic models: the multi-particle model and the single-particle model. In the multi-particle model, each vehicle of the train is treated as an independent mass point. As a result, the force deviation caused by train length and the power transmission between different vehicles are considered in the model. However, this model has a complicated modeling process and is also difficult to calculate [35]. Because metro trains are lighter and shorter, this level of accuracy is not necessary. In contrast, the single-particle model regards the entire train as one mass point, which simplifies the modeling process. A large number of related studies have adopted the single-particle model [36,37].
The single-particle train dynamics model in Reference [36] is used in this paper. The schematic diagram of the forces acting on the train is shown in Figure 1. In this model, $m$ is the train mass, which is considered constant throughout the journey. Regarding the decomposition of the forces, $F_t$ denotes the traction force, $F_b$ denotes the braking force, $R_c$ denotes the additional resistance of the line, which mainly comes from the slope, and $R_b$ denotes the basic running resistance of the train, mainly from the track and air. The dynamics model can be written as follows: where $v$ denotes the speed, $x$ is the position, $R_c(x)$ is the additional resistance, $R_b(v)$ is the basic resistance, and $\gamma$ is the rotary mass coefficient, which is usually a given constant.
The rotary mass coefficient, $\gamma$, compensates for the moment of inertia. The kinetic energy of the train has two parts: the translational energy of the train and the rotational energy of the wheels. Therefore, when only the horizontal movement is considered, the moment of inertia needs to be compensated. The coefficient can be obtained by dividing the converted weight of the moment of inertia by the total weight. Since this value is difficult to determine, empirical values are generally used in simulation. Therefore, $m(1 + \gamma)$ is the corrected total mass of the train.
For traction, the maximum force that the train can provide is as follows: where $F_t(v)$ is the actual force and $\mu_t$ is the control factor. For braking, similar to traction, the calculation formula can be written as follows:
The basic running resistance can be written as follows: where $r_1$, $r_2$, and $r_3$ are empirical parameters determined by experiment. The additional resistance can be written as follows: where $w_i$ is the additional resistance of the slope, $w_r$ is the additional resistance in curves, and $w_s$ is the additional resistance in tunnels.
Generally, the resistance in curves and tunnels is not considered in the study, and the slope resistance is given as: where $m$ is the train mass, $g$ is the acceleration of gravity, and $\theta$ is the slope value. The external characteristic curves (maximum force) of train traction and braking are shown in Figure 2, and Equations (2), (4), and (6) are verified against actual experiments on Qingdao Metro Line 6.
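As a rough illustration of the single-particle force balance described above, the following Python sketch computes the net acceleration of the train. The Davis-type quadratic form of the basic resistance, the $mg\sin\theta$ slope term with $\theta$ in radians, the SI units, and all function names are assumptions made for illustration; they are not taken from the paper's equations.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2


def basic_resistance(v, r1, r2, r3):
    """Assumed Davis-type form: R_b(v) = r1 + r2*v + r3*v^2 (in newtons)."""
    return r1 + r2 * v + r3 * v * v


def slope_resistance(m, theta):
    """Assumed slope term: R_c = m*g*sin(theta), theta in radians."""
    return m * G * math.sin(theta)


def net_acceleration(v, f_t, f_b, m, gamma, theta, r1, r2, r3):
    """Acceleration dv/dt of a point mass with corrected mass m*(1 + gamma).

    f_t and f_b are the applied traction and braking forces in newtons.
    """
    net_force = (f_t - f_b
                 - slope_resistance(m, theta)
                 - basic_resistance(v, r1, r2, r3))
    return net_force / (m * (1.0 + gamma))


# Hypothetical values: coasting (no traction, no braking) on level track
# gives a negative acceleration caused by the basic resistance alone.
a_coast = net_acceleration(v=10.0, f_t=0.0, f_b=0.0, m=1000.0, gamma=0.1,
                           theta=0.0, r1=100.0, r2=0.0, r3=1.0)
```

Coasting corresponds to `f_t = f_b = 0`, so the speed decays only through the resistance terms, which is what makes the coast mode attractive for energy saving.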

Fitness Calculation Function about Speed Profile Optimization
This part establishes the optimization model of the profile, including the energy consumption calculation model, penalty function about punctuality, and comfortableness. These factors will be discussed separately below.

Energy Consumption Calculation Function
As shown in Figure 3, because of the speed limits, the operation of the metro on the entire line is divided into $N$ intervals, where $v_i^0$ ($i = 1, 2, \ldots, N$) represents the starting speed of the train in interval $i$ and also the ending speed in interval $i − 1$; $v_1^0$ and $v_{N+1}^0$ represent the starting speed and the arrival speed, respectively, and both are obviously zero because the train departs from and arrives at a standstill. As shown in Figure 4, interval $i$ is split into $n$ subintervals, which have the same running time, $\Delta t$. The symbols used for subinterval $n$ are described in Table 1.

Symbol    Description
$F_n$     traction force
$E_n$     energy consumption
$a_n$     acceleration
$X_n$     length of the subinterval

Therefore, the length, $L_i$, and the time, $T_i$, of interval $i$ can be described as follows: The total length, $S_{total}$, and the total time, $T_{total}$, of all intervals are as follows: The traction energy consumption, $E_j$, of a subinterval can be calculated as follows: In Equation (11), the length of the subinterval, $X_j$, can be calculated as follows: Thus, the traction energy consumption of interval $i$ is the sum of $E_j$ over its subintervals.
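The subinterval bookkeeping described above can be sketched in Python. The sketch assumes constant acceleration within each subinterval, so that the subinterval length is $v\Delta t + \frac{1}{2}a\Delta t^2$, and it takes the energy of a subinterval as traction force times length; both forms, and the function names, are illustrative assumptions rather than the paper's exact equations.

```python
def subinterval_length(v, a, dt):
    """Distance covered in one subinterval under constant acceleration a."""
    return v * dt + 0.5 * a * dt * dt


def interval_totals(v_start, accels, traction_forces, dt):
    """Accumulate length, time, and traction energy over one interval.

    accels and traction_forces hold one value per subinterval; traction
    force is assumed to be zero while coasting or braking.
    Returns (length, time, energy, end_speed).
    """
    v = v_start
    length = 0.0
    energy = 0.0
    for a, f in zip(accels, traction_forces):
        x = subinterval_length(v, a, dt)
        energy += f * x      # work done by the traction force
        length += x
        v += a * dt          # speed at the end of the subinterval
    return length, len(accels) * dt, energy, v


# Hypothetical example: two 1 s subintervals at 1 m/s^2 with 100 N traction.
totals = interval_totals(0.0, [1.0, 1.0], [100.0, 100.0], 1.0)
```

Summing the returned lengths, times, and energies over all $N$ intervals gives the line totals used by the optimization model.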

Punctuality Penalty Function
Punctuality is a key factor in evaluating the quality of metro operation. The current trains run on time according to the timetable issued by the ATS system. The running time between stations is already specified, and the error between the actual running time and the specified running time needs to be limited within a required error. According to the actual operation requirements of the metro, the maximum allowable time error is generally 6 s.
Define the punctuality judgment function of the train as follows: where $T_{error}$ is the arrival time error, $T_D$ is the planned running time, and $T_R$ is the actual running time.
As shown in Figure 5, drawing on the Gaussian model, the punctuality penalty function is established as follows: where $R_t$ is the time penalty factor, $T_E$ is the maximum allowable time error (set to 6 s here), and $\rho$ is the sensitivity control coefficient, which controls the penalty level.

Comfortableness Penalty Function
Comfortableness is another important indicator of subway operation. Generally, the rate of change of acceleration is used to evaluate comfortableness.
Define the acceleration change rate as $\mathrm{Jerk} = \mathrm{d}a/\mathrm{d}t$, where Jerk is the rate of change of acceleration, $a$ is the acceleration, and $t$ is the running time. As shown in Figure 6, similar to the punctuality penalty function, the comfortableness penalty function is established as follows: where $R_{com}$ is the comfortableness penalty factor, $J_d$ is the maximum allowable rate of change of acceleration, and $\rho$ is the sensitivity control coefficient.
As shown in Table 2, according to the international standard ISO 2631 [38], the value of $J_d$ is set to 1. The larger the acceleration change rate, the greater the comfort penalty, so that non-conforming values in the result can be eliminated.
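To make the two penalty terms more concrete, the sketch below computes a finite-difference jerk series and a Gaussian-shaped penalty that is zero inside the allowed tolerance and rises smoothly toward 1 outside it. The exact penalty expressions of the paper are not reproduced here; the shape `1 - exp(-excess^2 / (2*rho^2))` and the function names are assumptions chosen only to illustrate how a tolerance and a sensitivity coefficient $\rho$ can interact.

```python
import math


def jerk_series(accels, dt):
    """Finite-difference approximation of Jerk = da/dt between samples."""
    return [(a1 - a0) / dt for a0, a1 in zip(accels, accels[1:])]


def gaussian_penalty(value, limit, rho):
    """Assumed penalty shape: 0 within the limit, approaching 1 beyond it.

    value: quantity being checked (time error in s, or jerk)
    limit: maximum allowable magnitude (e.g. 6 s, or J_d = 1)
    rho:   sensitivity; a smaller rho makes the penalty rise faster
    """
    excess = max(abs(value) - limit, 0.0)
    return 1.0 - math.exp(-(excess * excess) / (2.0 * rho * rho))


# Hypothetical usage: a 5 s time error is inside the 6 s tolerance,
# an 8 s error is outside it and is penalized.
p_ok = gaussian_penalty(5.0, 6.0, 2.0)
p_late = gaussian_penalty(8.0, 6.0, 2.0)
worst_jerk = max(abs(j) for j in jerk_series([0.0, 0.5, 0.5], 0.5))
```

The same function can serve both punctuality (limit $T_E$) and comfortableness (limit $J_d$), which mirrors the statement that the two penalty functions are built in a similar way.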

Fitness Calculation Function Based on Double Penalty Mechanism
As considered in Sections 3.1-3.3, the fitness calculation function based on double penalty mechanism is summarized as follows:

Emergency Braking Intervention (EBI) Curve and Warning Speed Limit
For safety reasons, a metro line must apply several kinds of speed-limit protection. First, the line has a fixed speed limit. In addition, the metro has a protection mechanism which ensures that emergency braking is feasible at any position, so that the train cannot cross the fixed speed limit; this mechanism is called emergency braking intervention (EBI). To ensure that the metro does not exceed the fixed speed limit during emergency braking under any circumstances, the EBI protection considers the following extreme case:

1. At first, the train is accelerating at the maximum rate. After an emergency braking command is issued, the train still maintains the maximum acceleration during the transmission delay and the traction cutoff delay, because the traction cannot be cut off instantly and the transmission delay is always present.

2. Since the braking system needs some time to receive instructions and gradually build up braking force, it takes some time to reach the maximum force from 0. During this period, the train is considered to be coasting.
Therefore, as shown in Figure 7, the emergency braking process is divided into three stages:
Stage 1: The train accelerates with the maximum traction acceleration, and the acceleration time is the sum of the traction cutoff delay and the braking system action delay.
Stage 2: After the train traction is cut off, the emergency braking force needs a certain time to reach its nominal value, and the train is considered to coast during this period.
Stage 3: The train brakes along the emergency braking curve.
Finally, because the single-particle model is used, the size and length of the train are ignored. However, this simplification brings safety risks, so we design a warning speed limit based on the EBI curve.
As shown in Figure 8, when the train moves from a low-speed-limit zone to a high-speed-limit zone, it needs to reserve a buffer equal to the length of the train. When the train moves from a high-speed-limit zone to a low-speed-limit zone, the same buffer is required. According to the above principle, taking the section from Zhuomashan station to Heluobu station of Qingdao Metro Line 6 as an example, the resulting speed limit curves are shown in Figure 9.
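The three-stage emergency braking process described above can be sketched with elementary kinematics. The sketch assumes constant acceleration in each stage, a constant (possibly zero) coasting deceleration, and hypothetical parameter names; the numerical values are not from the paper.

```python
def ebi_worst_case(v0, a_max, t_delay, t_build, b_em, a_coast=0.0):
    """Peak speed and distance to standstill after an emergency brake command.

    v0:      speed when the command is issued (m/s)
    a_max:   maximum traction acceleration, kept during stage 1 (m/s^2)
    t_delay: total delay before traction is cut off (s)
    t_build: time the brake needs to build up force, train coasts (s)
    b_em:    emergency braking deceleration, positive (m/s^2)
    a_coast: deceleration while coasting, positive or zero (m/s^2)
    """
    # Stage 1: traction still applied at maximum acceleration.
    v1 = v0 + a_max * t_delay
    s1 = v0 * t_delay + 0.5 * a_max * t_delay ** 2
    # Stage 2: coasting; stop early if the train would reach standstill.
    t2 = t_build if a_coast <= 0.0 else min(t_build, v1 / a_coast)
    v2 = v1 - a_coast * t2
    s2 = v1 * t2 - 0.5 * a_coast * t2 ** 2
    # Stage 3: emergency braking down to standstill.
    s3 = v2 ** 2 / (2.0 * b_em)
    return v1, s1 + s2 + s3


# Hypothetical values: 20 m/s, 1 m/s^2 traction, 1 s delay, 1 s build-up,
# 1.25 m/s^2 emergency deceleration.
peak_speed, stop_distance = ebi_worst_case(20.0, 1.0, 1.0, 1.0, 1.25)
```

Because the peak speed `v1` exceeds the speed at which the command was issued, an intervention threshold has to sit below the fixed speed limit by at least `a_max * t_delay` under these assumptions.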

Energy-Saving Strategies and Coast Choices
According to actual operating conditions and theory, coasting is an effective strategy for train energy saving. The coast strategy we designed is shown in Figure 10, in which two different coast intervals are set.

Coast Interval $x_1$
• Features: (1) Driving from a high-speed-limit zone to a low-speed-limit zone; (2) Maintaining a constant speed in the next low-speed-limit zone.
• Strategy selection: From the coast point, let the train keep coasting until point $S_1$, which gives the speed at $S_1$, $v_{S1}$.
(1) If $v_{S1} > v_3$, then brake to $v_3$; (2) If $v_{S1} < v_3$, then once $v = v_3$, let the train run at the constant speed $v_3$.

Coast Interval $x_2$
• Features: (1) Driving from a high-speed-limit zone to a low-speed-limit zone; (2) Braking in the next low-speed-limit zone.
• Strategy selection: From the coast point, let the train keep coasting until point $S_2$, which gives the speed at $S_2$, $v_{S2}$.
(1) If $v_{S2} > v_4$, then brake to $v_4$; (2) If $v_{S2} \le v_4$, then keep coasting until the profile meets the braking curve.
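The two selection rules above amount to a small decision procedure. The sketch below is illustrative only: it assumes a constant coasting deceleration to estimate the speed at the end of the coast section, and the function names and return strings are hypothetical. The source does not state what happens when $v_{S1}$ equals $v_3$ exactly; the sketch treats that case as constant-speed running.

```python
import math


def coast_speed(v0, distance, decel):
    """Speed after coasting a given distance at constant deceleration."""
    return math.sqrt(max(v0 * v0 - 2.0 * decel * distance, 0.0))


def action_interval_x1(v_s1, v3):
    """Coast interval x1: the next zone is run at constant speed v3."""
    return "brake to v3" if v_s1 > v3 else "hold v3 once reached"


def action_interval_x2(v_s2, v4):
    """Coast interval x2: the next zone ends with braking."""
    return "brake to v4" if v_s2 > v4 else "coast until braking curve"


# Hypothetical numbers: coast 150 m from 20 m/s at 0.5 m/s^2.
v_s1 = coast_speed(20.0, 150.0, 0.5)
choice = action_interval_x1(v_s1, 12.0)
```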

Regional Reinforcement Artificial Bee Colony (RR-ABC) Algorithm
The ABC algorithm simulates the process of a bee colony searching for the honey source with the highest honey content. It has few control parameters and good global convergence. However, when the search approaches the optimal solution and becomes local, the search efficiency drops sharply [39,40]. As evolution proceeds, population diversity is gradually lost, and the search efficiency is significantly reduced at the end of evolution. To balance the global and local search capabilities and to improve the efficiency of ABC, a more efficient ABC algorithm, called the RR-ABC algorithm, is proposed. The principles and steps of the RR-ABC algorithm are as follows:

(1) Model initialization: Chaos mirror initialization
Because a chaotic map is random-like and sensitive to initial conditions, we use it to extract information from the decision space and increase the diversity of the population. In this part, the sine map is used to iteratively generate chaotic variables. The chaotic initialization Equation is as follows: where $ch_1$ is a random number between 0 and 1; $k$ is the iteration index, $k = 1, 2, \ldots, D − 1$, and $j = 1, 2, \ldots, D$; $x_{ij}$ is the $j$-th dimension of the $i$-th honey source; and $x_{ijmax}$ and $x_{ijmin}$ are the upper and lower bounds of the source.
Use Equation (19) to perform the chaos initialization and obtain the $i$-th honey source $x_i = (x_{i1}, x_{i2}, \ldots, x_{iD})$, then mirror it to get the mirror honey source $mir_i = (x_{iD}, \ldots, x_{i2}, x_{i1})$. The fitness values of each initial solution and its mirror are then compared one by one, and the better individual is selected for the initial population.
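A minimal Python sketch of chaotic mirror initialization is given below. It assumes the sine map $ch_{k+1} = \sin(\pi\, ch_k)$ as the chaotic iteration, the same bounds for every dimension, and an objective that is minimized; these choices and the function names are assumptions, since the exact form of Equation (19) is not reproduced here.

```python
import math
import random


def chaotic_source(dim, lo, hi, rng):
    """Generate one honey source from a sine-map chaotic sequence."""
    ch = rng.uniform(0.01, 0.99)           # random seed value in (0, 1)
    source = []
    for _ in range(dim):
        source.append(lo + ch * (hi - lo))  # map chaotic value into bounds
        ch = math.sin(math.pi * ch)         # assumed sine-map iteration
    return source


def init_population(n_sources, dim, lo, hi, objective, seed=0):
    """Keep the better of each chaotic source and its mirror image."""
    rng = random.Random(seed)
    population = []
    for _ in range(n_sources):
        x = chaotic_source(dim, lo, hi, rng)
        mirror = x[::-1]                    # (x_iD, ..., x_i2, x_i1)
        population.append(min(x, mirror, key=objective))
    return population


# Hypothetical objective whose value depends on the order of the entries,
# so that a source and its mirror generally differ in quality.
def weighted_sum(x):
    return sum((i + 1) * v for i, v in enumerate(x))


pop = init_population(5, 4, 0.0, 10.0, weighted_sum, seed=1)
```

Reversing a vector only keeps it feasible when all dimensions share the same bounds; with per-dimension bounds, the mirrored source would need to be clipped or rescaled.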

(2) Employed bee stage: Evolutionary dimension adjustment strategy
In the employed bee and onlooker bee stages, the honey source is updated through Equation (20); in the original method, only one dimension of a honey source is changed at each update, which is inefficient. To solve this problem, we use a new parameter, $W$, to control the number of dimensions that are changed in each evolution. The new search Equation is as follows: where $w \in \{1, 2, \ldots, W\}$, the parameter $W$ controls the number of dimensions changed for each individual, and $W \in \{1, 2, \ldots, D\}$.
Every time an employed bee searches for a new honey source, it estimates the potential of the source (calculates the fitness value of the objective function): where $f_i$ is the objective function value of the $i$-th honey source and $fit_i$ is the fitness value of the $i$-th honey source. The employed bee chooses the better honey source by comparing the fitness values of $X_i$ and the candidate source $X_i'$ (greedy choice). The larger the fitness value, the smaller the corresponding objective function value, which indicates a better honey source and a higher probability of finding better honey sources around it.
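The multi-dimension update and greedy choice can be sketched as follows. The sketch uses the neighbor-based search equation and fitness mapping of the standard ABC algorithm, which are assumed here to be the basis of the paper's equations; the function names, the clipping to bounds, and the test objective are illustrative.

```python
import random


def fitness(f):
    """Standard ABC fitness mapping: smaller objective, larger fitness."""
    return 1.0 / (1.0 + f) if f >= 0 else 1.0 + abs(f)


def employed_bee_step(pop, i, n_dims, objective, lo, hi, rng):
    """Perturb n_dims dimensions of source i and keep the better source.

    Returns True if the candidate replaced the current source.
    """
    x = pop[i]
    # Pick a random neighbor k different from i.
    k = rng.choice([j for j in range(len(pop)) if j != i])
    candidate = list(x)
    for j in rng.sample(range(len(x)), n_dims):
        phi = rng.uniform(-1.0, 1.0)
        moved = x[j] + phi * (x[j] - pop[k][j])
        candidate[j] = min(max(moved, lo), hi)   # keep inside the bounds
    # Greedy choice between the current source and the candidate.
    if fitness(objective(candidate)) > fitness(objective(x)):
        pop[i] = candidate
        return True
    return False


# Hypothetical usage on a sphere function with W = 2 changed dimensions.
def sphere(x):
    return sum(v * v for v in x)


demo_pop = [[1.0, 2.0, 3.0], [0.5, -1.0, 2.0], [-2.0, 0.0, 1.0]]
demo_before = sphere(demo_pop[0])
employed_bee_step(demo_pop, 0, 2, sphere, -5.0, 5.0, random.Random(0))
```

Changing several dimensions per step moves a source further in one update, while the greedy choice guarantees that the objective value of a source never gets worse.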

(3) Onlooker bees stage: Search radius adjustment strategy
As mentioned in "Model of employed bees", the employed bee will search within a certain radius of the initial solution. In the early stage of evolution, because the honey source x i is far from the real solution, the use of a larger search radius can accelerate the convergence rate; when the honey source evolves to the final stage, it is closer to the real solution, and reducing the search radius can improve the search accuracy. Therefore, using a variable search radius and adjusting the size of the search radius can significantly speed up the convergence rate.
This paper adopts a strategy in which the search radius changes with the individual threshold: as the threshold, $Limit_i$ (the number of times that honey source $x_i$ has not been selected), increases, the search radius continues to decrease. The Equation is as follows: where $r_{iw}$ is the search radius of honey source $x_{iw}$, $\varphi$ is the basic value of the search radius, $Limit_{iw}$ is the threshold of honey source $x_{iw}$, and $Limit$ is the maximum allowed threshold. The new search Equation is as follows:
Remark 1. In the traditional ABC algorithm, as evolution proceeds, more nectar sources are continuously explored on a global scale. However, because the quality of the preliminary search results is unknown, each explored nectar source needs to be searched more deeply, which seriously affects the search efficiency in the later stages of evolution, even though it helps to avoid local optimal solutions. In addition, for classic biomimetic algorithms, the quality of the initial values usually greatly affects the evolution process, while a simple random process can easily produce low-quality initial values. The flowchart comparing the classic method and the reinforcement method (Figure 11) shows that dynamically adjusted evolutionary dimensions and search radius are adopted. In the later stages of the search, attention is concentrated on fewer, high-quality nectar sources, and better efficiency is obtained.
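The idea of a search radius that shrinks as a source's trial counter grows can be illustrated with a very small sketch. The linear decay used below is an assumption; the paper's actual radius equation is not reproduced here, and the names are hypothetical.

```python
def search_radius(base_radius, trials, max_trials):
    """Assumed linear decay of the search radius with the trial counter.

    base_radius: basic value of the search radius
    trials:      current threshold counter of the honey source
    max_trials:  maximum allowed threshold
    """
    ratio = min(max(trials / max_trials, 0.0), 1.0)
    return base_radius * (1.0 - ratio)


# The radius is largest for a fresh source and shrinks as trials accumulate.
radii = [search_radius(1.0, t, 10) for t in (0, 5, 10)]
```

A wide radius early on favors fast progress toward promising regions, and a narrow radius later favors precise local refinement, which is the trade-off described in the text.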

Speed Profile Optimization Model
Based on the above discussion, the speed curve optimization model is summarized as follows:

Adaptive Terminal Sliding Mode Controller
This section designs a speed tracking controller to track the optimized speed profile. As mentioned before, compared with the PID controller, which is widely used in modern industrial control, the sliding mode controller responds quickly and is robust to system parameter changes and external interference. However, sliding mode control produces chattering when dealing with uncertain factors. This kind of high-frequency chattering not only affects the accuracy of the control, but can also cause the system to oscillate or become unstable.
Adaptive terminal sliding mode control not only has strong robustness, but also lets the system state converge to the ideal trajectory within a limited time. In addition, it also has a strong parameter adaptive processing function, to ensure that the system will not have unnecessary continuous switching when the parameters are uncertain. In order to design a speed-tracking controller with excellent comprehensive performance, this paper designs an adaptive terminal sliding mode controller with disturbance observer.

Dynamics Model of Speed Tracking
This part models and transforms the known fixed time delay. Here, the time delay is compensated through the coefficients of the sliding mode control law, so that the system can be treated as a delay-free system when the sliding mode control law is designed in the following subsections.
As mentioned in Section 2, the ideal dynamics model is Equation (1); for convenience of expression, the model with speed, $v$, as its independent variable is rewritten as a model with time, $t$, as its independent variable. Furthermore, the traction force factor, $F_t(v)$, and the braking force factor, $F_b(v)$, are replaced by the actual generated traction/braking force, $F_u(t)$, and the model is summarized as follows:
In the actual control system, due to the response delay, the model needs to be written as a first-order lag model: where $a_c$ is the control acceleration, which is produced by the controller; $a_t$ is the target acceleration; $\tau$ is the response time constant; and $T_d$ is the control transmission delay. For the subsequent kinetic description, we define the following Equation: Therefore, the acceleration formed by the basic resistance of the train and the additional resistance can be expressed as follows: Furthermore, according to Equation (30), we can get the following: where $d(t)$ is the resistance acceleration.
Most existing metro controllers use a stepless control mode, and the control system model is shown in Figure 12. To facilitate the design of the subsequent controllers, the Padé approximation is used to rewrite the delay term with parameter $T_d$ in Equation (28): where $\lambda$ is a constant related to the delay, $T_d$. The Laplace transform of Equation (28) is: For ease of expression, Equation (33) is rewritten as follows:
As we can see, if the response time constant, $\tau$, and the delay, $T_d$, are known and remain unchanged, then $q_1$, $q_2$, and $q_3$ are all constants. In the following, $q_1$, $q_2$, and $q_3$ are used to design the sliding mode control law, so the delay is absorbed into the coefficients of the control law, and its effect is compensated in this way.
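To visualize the actuator behavior described above, the sketch below simulates a first-order lag with a pure transmission delay in discrete time. It assumes the transfer behavior $a_c(s)/a_t(s) = e^{-T_d s}/(\tau s + 1)$ and simulates the delay directly with a buffer instead of using the Padé approximation; the function name and the step size are illustrative.

```python
from collections import deque


def simulate_lag(target_accels, tau, t_d, dt):
    """Response a_c to a target acceleration sequence a_t.

    tau: response time constant (s); t_d: transmission delay (s);
    dt:  simulation step (s). Uses forward-Euler integration.
    """
    n_delay = int(round(t_d / dt))
    buffer = deque([0.0] * n_delay)      # holds commands still in transit
    a_c = 0.0
    response = []
    for a_t in target_accels:
        buffer.append(a_t)
        delayed = buffer.popleft()       # command issued t_d seconds ago
        a_c += dt * (delayed - a_c) / tau
        response.append(a_c)
    return response


# Hypothetical test: unit step with tau = 0.3 s and T_d = 0.9 s.
step_response = simulate_lag([1.0] * 1000, tau=0.3, t_d=0.9, dt=0.01)
```

The response stays at zero for the first 0.9 s and then rises exponentially toward the target, which is the behavior the controller has to compensate for.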

Design of Sliding Mode Terminal Controller
We define the state error as follows: where $e_1$ is the train position error, $e_2$ is the train speed error, $x_r$ is the reference position of the optimized profile, and $v_r$ is the reference speed of the optimized profile. Differentiate Equation (35) and substitute it into Equation (31), and the resulting error state space Equation is as follows:
To improve the tracking accuracy, the designed sliding mode surface introduces the train position error, $e_1$, and the speed error, $e_2$, to ensure fast error convergence. The designed terminal sliding mode function is as follows: where $\beta > 0$, and $p$ and $q$ are positive odd integers with $1 < p/q < 2$. The sliding mode controller is designed below. Differentiate the sliding mode function (37) and expand the terms involving $\dot{v}$ to get the following:
By combining Equations (39) and (40), and substituting Equation (40) into the first derivative (38) of the sliding mode function, we can obtain the following: Based on the principle of sliding mode control, the control input has the following form: where $u_{eq}$ is the equivalent control term of the system, and $u_n$ is the nonlinear switching control term of the system. Setting Equation (41) to zero gives the following: Let $D$ denote the lumped resistance term. The resistance is smooth and bounded, so its first and second derivatives are bounded, and the basic running resistance of the train, $f_b(t)$, is bounded; hence $D \le D_{max}$, where $D_{max}$ is the bound. According to the design principle of sliding mode control, the nonlinear switching term is designed as $u_n = −k\,\mathrm{sgn}(s_m)$, and the terminal sliding mode control input is then as follows: where $k > 0$ is the control gain. In addition, since the resistance terms are unknown in actual operation, that is, $D$ is unknown but bounded, the actual controller should be changed to the following: where $q_4 = q_1 c$; $q_5 = q_2 b$; $q_6 = q_2 c$; and $k > D_{max}$. Select a Lyapunov function of the following form: Let $k_0 = k − D_{max}$, differentiate Equation (46), and substitute the controller, Equation (45), into it: The system is stable, but in order to reduce the control burden of the switching term and reduce the possibility of chattering, a parameter adaptation mechanism is introduced.
First modify the controller to the following form: where $\hat{q}_1, \hat{q}_2, \hat{q}_3, \hat{q}_4, \hat{q}_5, \hat{q}_6, \hat{a}, \hat{b}, \hat{c}$ are the estimated values of the parameters $q_1, q_2, q_3, q_4, q_5, q_6, a, b, c$, respectively. Construct the following Lyapunov function: Substituting Equation (48) into Equation (49) gives the following: To satisfy the Lyapunov stability condition, Equation (50) is rearranged, and the resulting parameter adaptation law is as follows: As shown in Equations (50) and (51), the discontinuous switching function $k\,\mathrm{sgn}(s_m)$ results in discontinuous control signals that keep the train on the sliding surface. However, due to resistance deviation, measurement deviation, and other factors, the train cannot stay exactly on the sliding surface, which eventually causes chattering. To avoid chattering, the signum function $\mathrm{sgn}(s_m)$ in Equation (47) is replaced with the saturation function $\mathrm{sat}(s_m)$, where $\varphi$ is the saturation width.
Generally speaking, the distinguishing feature of typical sliding mode control is the use of a signum function. After the introduction of a saturation function, the controller will no longer be a classic sliding mode controller. According to References [41,42], the sat function implies a pseudo sliding mode in fact.
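The sliding surface and the saturation function can be sketched in a few lines. The sketch assumes the common nonsingular terminal sliding mode form $s_m = e_1 + \frac{1}{\beta} e_2^{p/q}$, which is consistent with the stated conditions on $\beta$, $p$, and $q$ but is not confirmed to be the paper's exact Equation (37); the function names and numerical values are illustrative.

```python
import math


def odd_ratio_power(x, p, q):
    """x**(p/q) for odd p and q, defined for negative x as well."""
    return math.copysign(abs(x) ** (p / q), x)


def sliding_surface(e1, e2, beta, p, q):
    """Assumed terminal sliding function: s = e1 + e2^(p/q) / beta."""
    return e1 + odd_ratio_power(e2, p, q) / beta


def sat(s, width):
    """Saturation function: linear inside the boundary layer, +/-1 outside."""
    return max(-1.0, min(1.0, s / width))


def switching_term(s, k, width):
    """Smoothed switching control u_n = -k * sat(s / width)."""
    return -k * sat(s, width)


# Hypothetical values: position error 1 m, speed error -1 m/s.
s_m = sliding_surface(1.0, -1.0, beta=2.0, p=5, q=3)
u_n = switching_term(s_m, k=0.8, width=0.1)
```

Inside the boundary layer the switching term varies continuously with $s_m$, which removes the sign flips that cause chattering at the cost of some robustness.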

Disturbance Observer
The introduction of the saturation function reduces the robustness of the system, while the line interference may be large during train operation, especially on ramps. The disturbance observer (DOB) is one of the most widely used robust control tools and can effectively improve robustness and disturbance rejection [43,44]. Therefore, a disturbance observer is introduced in this part.
The disturbance observation value is introduced into Equation (48), which is transformed into the following form: As shown in Figure 13, the actual model and the nominal model are as follows: According to the principle of the disturbance observer, by considering the transfer functions from the inputs $u_r$ and $d$, we can get the following: Furthermore, the expression of the disturbance observation can be obtained as follows: According to Equation (56), if $G_n(s) = G(s)$, the required disturbance value can be accurately observed, but in the actual system, there may be a deviation between $G_n(s)$ and $G(s)$. From Equation (56), the deviation of the estimated value mainly comes from the error of the model: where $\Delta G(s) = G_n(s) − G(s)$, $Q(s)$ is the low pass filter, and $Q(s) = 1$. According to Equation (57), if there is an error in the model, the obtained observations will also have a certain deviation. The magnitude of the model error determines the magnitude of the deviation of the observations. After the disturbance observer is introduced, the required gain of the switching function can be reduced to a certain extent. In general, the error of the train model is small, so the disturbance observer can improve the anti-interference ability of the system and reduce the switching gain, which helps to avoid chattering.
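The principle of the disturbance observer can be illustrated with a deliberately simplified discrete-time sketch. It assumes a nominal plant $\dot{v} = u + d$ (a pure integrator), a first-order low-pass filter for $Q(s)$, and a constant disturbance; these simplifications and all names are assumptions and do not reproduce the train model or the filter used in the paper.

```python
def run_dob(inputs, d_true, dt, t_filter):
    """Estimate a disturbance acceleration d from input and measured speed.

    inputs:   sequence of control accelerations u
    d_true:   actual (unknown to the observer) disturbance acceleration
    dt:       sample time (s)
    t_filter: time constant of the low-pass filter Q (s)
    """
    v = 0.0
    d_hat = 0.0
    estimates = []
    for u in inputs:
        v_next = v + dt * (u + d_true)        # "actual" plant response
        # Inverse nominal model: measured acceleration minus applied input.
        raw = (v_next - v) / dt - u
        # First-order low-pass filter suppresses measurement noise.
        d_hat += dt / t_filter * (raw - d_hat)
        estimates.append(d_hat)
        v = v_next
    return estimates


# Hypothetical case: constant resistance of -0.2 m/s^2 on a ramp.
d_estimates = run_dob([0.5] * 2000, d_true=-0.2, dt=0.01, t_filter=0.1)
```

Once the estimate has converged, it can be subtracted from the control input, so the switching gain only needs to cover the residual estimation error instead of the full disturbance.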

Experimental Result
To verify the validity, this section uses the interval between Zhuomashan station and Heluobu station in Qingdao Metro Line 6 as the simulation object.

Basic Test of RR-ABC
The vehicle data and line data (slope and speed limit) used are shown in Table 3, Table 4, and Table 5, respectively.

Table 3. Vehicle data.
Vehicle Parameters                     Value
Formation                              6 trains
Mass-AW2 (ton)                         339.6
Length (m)                             120
Basic resistance parameter ($r_1$)     9.067
Basic resistance parameter ($r_2$)     0
Basic resistance parameter ($r_3$)     0.001334

To verify the effectiveness of the RR-ABC algorithm, we compare it with the standard ABC algorithm and the genetic algorithm (GA), respectively. We set the given interval running time to 147 s and use different evolutionary algorithms for testing. For each algorithm, we conducted three repeated experiments. The results are shown in Figure 14. Figure 14a is the comparison of the RR-ABC algorithm and the standard ABC algorithm; the lines in the figure represent the changing trend of fitness. We can observe the following two results: (1) The initial value of the RR-ABC algorithm is better than that of the standard ABC algorithm; (2) Compared with the standard ABC algorithm, RR-ABC maintains good evolutionary performance in both the early and late stages; especially in the late stage, the evolution efficiency of RR-ABC is significantly higher than that of ABC. This indicates that the RR-ABC algorithm has good local search capabilities. Figure 14b is the comparison of the RR-ABC algorithm and GA; we can observe that GA falls into a local optimal solution twice. Relatively speaking, ABC has good global search capabilities. By combining Figure 14a,b, we can find that the RR-ABC algorithm has better global search capabilities than ordinary evolutionary algorithms. Furthermore, it also improves the local search capabilities, which are poor in the standard ABC algorithm. The RR-ABC algorithm can quickly obtain the optimal solution. Figure 15a is the optimal speed profile, and Figure 15b is the energy-consumption curve. Figure 15a shows the fixed speed limit curve, the economic speed limit curve, and the optimal speed curve. The curve in Figure 15b shows the cumulative energy consumed by the train as it moves.
We can observe the following results: (1) By following the safety speed limit curve, the train always stays below the fixed speed limit curve; (2) The speed profile contains two coast sections; (3) When the train is in the traction section and the cruising section, the energy consumption increases. Furthermore, the rate of increase during traction is higher than during cruising. In contrast, the energy consumption stays level during the coast and braking sections.

Advanced Test about Different Interval Time and Different Mode
To verify the performance of the algorithm under different running times, set the given interval time to 141 and 153 s, respectively. The results are shown in Figures 16 and 17.
In Figure 16, because of the shorter running time, the coast sections are shortened and the energy consumption is higher than for the speed profile shown in Figure 15. In Figure 17, owing to the longer running time, the coast sections are extended and the energy consumption is lower.
The detailed data of the comparison are shown in Table 6. We can see that, as the given time increases, the energy consumption of the train gradually decreases, and the size of each decrease becomes smaller. The reason is that the train can use longer coast sections as the given time increases. However, the energy-saving effect of the coast section is not linear: as the coast section becomes longer, its marginal energy-saving effect diminishes. In addition, we can also see that the indexes for time and acceleration are within the allowable range, which means that the train maintains good punctuality and comfortableness while performing energy-saving driving.

Advanced Test about Different Driving Mode
To further test the performance of the algorithm, we compare the speed profile and energy consumption of the optimal mode with those of the cruising mode, which is widely used in engineering practice.
As shown in Figure 18, with the given interval time set to 147 s for both modes, the profile of the cruising mode holds a constant speed after a period of acceleration, which makes it easy to plan. In comparison, the optimal profile switches to coasting after the acceleration. In addition, the energy consumption of the optimal profile is lower than that of the cruising mode.
Detailed data are listed in Table 7; we can observe the following two results: (1) The energy consumption of the optimal mode is 9.55% lower than that of the cruising mode, so the optimal mode is better at energy saving; (2) The average acceleration rate of the cruising mode is much lower than that of the optimal mode, so although the comfort index of the optimal mode is within the allowable range, the cruising mode is much better in terms of comfortableness.
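For clarity, a saving rate of this kind is usually measured relative to the baseline consumption. A sketch of that definition follows; the symbols \eta, E_{\text{cruise}}, and E_{\text{opt}} are our own notation and are assumed rather than taken from the paper.

```latex
\eta = \frac{E_{\text{cruise}} - E_{\text{opt}}}{E_{\text{cruise}}} \times 100\%
```

Under this assumed definition, \eta = 9.55\% corresponds to E_{\text{opt}} = 0.9045\,E_{\text{cruise}}.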

Basic Tracking Test about ATSMC+DOB Controller under Different Fixed Delays
To verify the effect of delay compensation, four groups of experiments with different time delay values are set up. The delay parameters of each group are shown in Table 8. In modern metro control systems, T is usually less than 0.9 s and τ is usually less than 0.3 s. As shown in Figure 22a,c and Figure 23a,c, after a period of adjustment, the tracking curve always converges to the reference curve. As shown in Figure 22b,d and Figure 23b,d, when the time delay becomes larger, the system applies more acceleration to maintain tracking. In all the given cases, the system achieves accurate tracking. Since the time delay of the actual system generally does not exceed 0.9 s, the sliding mode controller is considered effective in compensating for a known fixed time delay.
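To make the role of the two delay parameters concrete, the following is a minimal sketch of a delayed actuator. It assumes that T is a pure transport delay and τ is a first-order time constant, so that the achieved acceleration a obeys τ·da/dt + a = u(t − T); this interpretation and the function name are our assumptions, and the paper's own model may differ. The sketch uses forward Euler integration and covers only the plant side, not the controller's compensation.

```python
from collections import deque


def simulate_delayed_actuator(commands, dt, delay_T, tau):
    """Response of tau * da/dt + a = u(t - delay_T) to sampled commands.

    commands: commanded acceleration at each time step
    dt:       sample time in seconds (should be well below tau)
    """
    n_delay = int(round(delay_T / dt))   # transport delay in whole samples
    pipeline = deque([0.0] * n_delay)    # commands still "in transit"
    a = 0.0
    response = []
    for u in commands:
        pipeline.append(u)
        u_delayed = pipeline.popleft()   # command issued delay_T seconds ago
        a += dt / tau * (u_delayed - a)  # first-order lag, forward Euler
        response.append(a)
    return response
```

With a unit step command, dt = 0.1 s, T = 0.9 s, and τ = 0.3 s, the response stays at zero for the first nine samples and then rises toward 1, which is the kind of lag the controller has to compensate.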

Tracking Test about ATSMC+DOB Controller
To verify the effect of the ATSMC+DOB controller, it is compared with the PID controller and the ATSMC controller, respectively. The 153 s speed curve is selected as the reference curve, and the comparison results are shown in Figures 19 and 20.
Compared with the ATSMC algorithm, as shown in Figure 19, the disturbance observer significantly reduces the control error, especially when the train is running on a steep slope. Furthermore, compared with the PID algorithm, as shown in Figure 20, the ATSMC+DOB algorithm significantly reduces the speed error, which greatly improves the comfort, punctuality, and parking accuracy of the train.
In addition, regarding energy consumption, which is the focus of our attention, Figure 21 shows that, owing to its precise tracking, the ATSMC algorithm consumes only slightly more energy than the reference curve and is still much better than the PID algorithm, whose energy saving is poor because of its large tracking error. In conclusion, the ATSMC+DOB algorithm achieves good tracking accuracy.
The detailed data are listed in Table 9. The ATSMC+DOB algorithm performs better than the PID algorithm and the ATSMC algorithm in all indicators; the PID algorithm is the worst, far worse than both the ATSMC+DOB algorithm and the ATSMC algorithm. In terms of energy consumption, ATSMC+DOB is 7.58% lower than PID and 0.7% lower than ATSMC. Overall, the ATSMC+DOB algorithm meets the requirements of metros for energy saving, punctuality, and comfortableness.
In addition, as we can see in Figure 19b, ATSMC+DOB has smaller speed errors than ATSMC and converges faster. The detailed data in Table 9 confirm this. The observer thus improves disturbance rejection.
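Speed-error comparisons of this kind can be reproduced from sampled curves with a short helper. The sketch below computes the maximum absolute error and the root-mean-square error between a reference and a tracked speed curve; the choice of these two metrics and the function name are our assumptions, since the exact indicators of Table 9 are not restated here.

```python
import math


def tracking_metrics(reference, actual):
    """Return (max absolute error, RMS error) between two sampled speed curves."""
    if not reference or len(reference) != len(actual):
        raise ValueError("curves must be non-empty and of equal length")
    errors = [a - r for r, a in zip(reference, actual)]
    max_abs = max(abs(e) for e in errors)
    rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
    return max_abs, rmse
```

Applying the same helper to the output of each controller against the same reference curve gives directly comparable numbers.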

Conclusions
This paper studied metro speed profile optimization and speed tracking control, designed the corresponding algorithms, and verified them by simulation based on real data. The conclusions are summarized as follows:
(1) A multi-objective optimization model that calculates energy consumption and takes punctuality and comfortableness as penalty factors is established to optimize the train speed profile. In this way, the comfortableness and punctuality of the metro are taken into account while energy saving is optimized.
(2) An optimization strategy that considers the metro EBI speed limit and the actual volume of the train (warning speed limit) is proposed. An improved ABC algorithm, named the RR-ABC algorithm, is proposed for speed profile optimization. Compared with ordinary algorithms, the RR-ABC algorithm not only has good global search ability to avoid local optimal solutions, but also has excellent local search ability to improve the evolutionary efficiency.
(3) An adaptive terminal sliding mode controller with a disturbance observer (ATSMC+DOB) is designed by introducing a parameter adaptation mechanism and a disturbance observer. The controller has better robustness and disturbance rejection, which leads to a small speed tracking error and good energy saving.
(4) Real data from Qingdao Metro Line 6 were used for verification. The simulation results show that the proposed speed-profile optimization and speed-tracking control are effective.
The optimizations proposed in this paper, based on coasting and various speed limits, are tailored to a specific single interval and are only effective for it, whereas on most metro lines the conditions of each interval are different. Designing a separate strategy for every interval is not a practical approach. Therefore, a more general optimization strategy is worth studying in future research.