1. Introduction
An aero engine is a device that provides power for an aircraft and has a decisive influence on the performance of the aircraft. One of the core problems currently limiting the development of the engine is the vibration of the rotor structure when rotating at high speed. According to the statistics, the number of engine failures caused by vibration accounts for 50% to 60% of the total number of engine failures [
1]. It has been proven that rotor unbalance is the main factor that generates vibration during the operation of an aero engine [
2].
An aero engine fan rotor is composed of a multi-stage disk and multi-stage blades [
3]. Due to the manufacturing error of parts, the centroid of the disk and blade group deviates. If the assembly is done blindly, there is a high probability that the rotor unbalance will be excessive. To avoid the occurrence of the unbalance phenomenon, the most fundamental measure is to change the mass distribution of the rotor, eliminate the unbalance force, so that the rotor is balanced [
4,
5]. In the rotor assembly process, the assembly sequence of the blade affects the deviation of the mass center position of the blade group, and the assembly phase angle of the multi-stage blade and multi-stage disk affects the overall mass distribution of the rotor, both of which are important elements for controlling the overall mass distribution of the rotor. Through blade sorting to make the overall mass center deviation of the blade set smaller and make the blade set mass center position a specific angle (assembly phase angle) relative to the disk mass center position during assembly, the mass center deviation of each rotor part can be made offset so as to reduce the overall rotor unbalance. Therefore, a suitable method for optimizing the balance of the fan rotor assembly process is of great importance for the improvement of the rotor assembly performance.
The current process for optimizing balance in aero engine assembly plants has the following main problems: (1) The blade sorting process only considers flushness and randomly selects pieces from the blades according to the blade code. Due to the blade manufacturing error with individual differences, randomly selected parts not only cause assembly quality dispersion, but poor connection performance consistency and often need to be repeatedly disassembled and adjusted to meet the assembly balance requirements, resulting in low assembly efficiency and high cost due to its “trial and error” nature. (2) The assembly process does not take into account the phase of blade assembly at all levels and the influence of the initial unbalance of the disk on the overall balance of the rotor, which can easily lead to the problem of large eccentricity of the overall mass of the rotor after assembly. (3) The main work step of rotor assembly is the assembly of the blade; because the assembly performance of the blade assembly process cannot be monitored, resulting in a lot of reliance on manual experience, it is difficult to regulate the rotor assembly balance in real time. Even if the installation is carried out in accordance with the pre-determined order of blade arrangement, the phenomenon of excess unbalance occurs, making the rotor performance less reliable and more volatile.
For the optimization of balance in the fan rotor assembly process, this paper proposes a semi-physical simulation method based on reinforcement learning. Firstly, based on the mass moments of individual blades, the diagonal mass moment difference is introduced as a constraint to build a single-stage blade sorting optimization model, and reinforcement learning is used to find the optimal sorting path so that the balance of the single-stage blade after sorting is optimal. Then, on the basis of the initial unbalance of the disk and the unbalance of the single-stage blade set, a multi-stage blade assembly phase optimization model is established, and reinforcement learning is used to find the optimal assembly phase so that the overall balance of the rotor is optimal. Finally, based on the collection of data during rotor assembly, the least-squares fitting method is used to calculate the real-time assembly unbalance, and the actual assembly data are combined with the theoretical data to realize the semi-physical simulation of the optimization of balance during the assembly process.
2. Literature Review
This section discusses the current status of domestic and international research on the following three aspects: rotor balancing methods, blade matching methods, and assembly phase optimization methods.
2.1. Status of Rotor Balance Research
In recent years, domestic and foreign scholars have obtained many results in the field of rotor balancing. Goodman [
6] proposed the least-squares and weighted least-squares methods to minimize the sum of squares of the residual vibrations at each side vibration point measured on the calibration front by applying a set of correction masses to the calibration front. Bin Guangfu et al. [
7] proposed the vibrational circular balance method, which can roughly determine the approximate distribution of the unbalance, substantially improving the balance accuracy. P. Gnieka [
8] proposed a modal equilibrium method without trial weight, where a system of algebraic equations is derived by modal analysis based on the rotor’s vibration pattern and generalized masses and then the optimal counterweight is solved, and the rotor is balanced by increasing the counterweight. Zhengshi Liu et al. [
9] proposed the relative coefficient method on the basis of the influence coefficient method to improve the balance efficiency of the rotor by directly measuring the relative coefficients with a two-channel dynamic signal analyzer.
Relevant domestic and foreign research has improved the balance of rotors to a certain extent, but the calculation method is complicated, and when the number of rotor stages is large, manufacturers face a large workload, it is impossible to monitor the balance of the rotor during the assembly process, and it is easy to have more than one installation and more than one trial.
2.2. Status of Research on Blade Selection Methods
In terms of blade matching methods, Jianmin Yue et al. [
10] used a stepwise tuning simulation algorithm, which gives two models of mass and frequency as ranking schemes for optimization under the requirement of frequency. Tang Shaojun et al. [
11] established the fitness function of the blade ranking problem and the evaluation method of its optimization by dividing the hub into quadrants according to the number of blades and optimizing the ranking of a given set of blades using a genetic algorithm under the constraints of the smallest possible mass difference between quadrants and the large frequency difference between blades, which improved the solution efficiency to some extent. By analyzing the disadvantages of the combinatorial optimization method and traditional genetic algorithm in solving the blade sorting problem, Zhao Desheng [
12] transformed the multi-objective optimization problem into a single-objective optimization problem, used the penalty function method for the design of the fitness function, and established a blade sorting model based on the improved genetic algorithm, and its sorting results were accurate and converged quickly using the improved genetic algorithm. Edward A. Thompson et al. [
13] used a polynomial time-simulated annealing algorithm to solve a large combinatorial optimization problem for blades. Wangbai Pan et al. [
14] combined the hybrid finite-element model (MDFEM), Gaussian regression, and a genetic algorithm to propose a new method for the optimization of detuned impeller blades, which solved the problem of the large computational effort of the enumeration method, improved the optimization efficiency of the detuned impeller, and verified the effectiveness of the method with the example of a turbine.
Domestic and foreign scholars have made great research progress in blade selection methods. However, the blade matching problem is a combinatorial optimization problem, and the difficulty of solving the matching problem grows geometrically with the number of rotor stages and parts. So, it is difficult to find the optimal solution to the problem by traditional methods, and it is easy to fall into the local optimum.
2.3. Research Status of Assembly Phase Optimization Method
In terms of assembly phase optimization, Ding et al. [
15] investigated a multi-stage, rotation-optimized assembly technique with a deviation transfer model for rotating component assembly and demonstrated the feasibility and applicability of applying the optimization objective function to the deviation transfer. Yang [
16,
17,
18,
19,
20] proposed a direct assembly optimization technique to control the transmission accumulation of assembly deviations based on the error transmission model of rotor connection, used statistical analysis to derive the error probability distribution for different assembly qualities [
21], and simulated the assembly example by Monte Carlo method. Mao-Guo Cao [
22] used Powell’s method to optimize the angular mounting position of rotor parts to reduce the unbalance forces and moments acting on the bearings due to the unbalance measure during the high-speed rotation of the engine. On this basis, Lixin Li et al. [
23] further improved the optimization of the balance by optimizing the angular mounting positions of the disks at all levels through a genetic algorithm.
The above method can solve the rotor assembly phase optimization problem to a certain extent, but the influence of the initial unbalance of the disk on the assembly phase is not considered, and the optimization results have errors. Moreover, the traditional optimization algorithm can only search for a certain extreme value point in the solution space, so it is easy to fall into local, optimal solutions.
3. Rotor Semi-Physical Simulation Implementation Framework
The fan rotor blade assembly process is shown in
Figure 1. The fan rotor is a multi-stage disk structure consisting of components such as disks and blades at various levels. The fan rotor assembly process is as follows: firstly, multi-stage disk assembly; secondly, single-stage blade matching, determining the order of blade arrangement and single-stage blade assembly; finally, multi-stage blade combination assembly. The method proposed in this paper treats the multi-stage disk as an already-assembled part in the implementation and assumes that the balance of the multi-stage disk is already in compliance with the requirements. On this basis, the influence of blade sequencing and the multi-stage blade assembly phase angle on the overall rotor unbalance during blade assembly is investigated, and the real-time assembly performance is monitored during the assembly process to provide optimization guidance for the subsequent assembly process.
During the assembly process, the blades need to be assembled according to the pre-determined blade layout sequence. The assembly result of each completed blade assembly is not only related to the current assembly process, but also to the assembly result of the previous step. Each blade affects the overall unbalance of rotor after assembly. In order to ensure that the final rotor balance meets the design requirements and to avoid excess assembly unbalance due to perceived installation errors and other factors, the assembly process needs to be monitored. Since the final overall unbalance is formed after the process is fully completed and the subsequent assembly steps are not performed during the assembly process, the actual measurement data of the subsequent steps cannot be obtained. Therefore, this paper constructs a semi-physical simulation model for single-stage blade assembly, combines the measured data of assembled blades with the theoretical data of unassembled blades, and calculates the overall unbalance of the rotor. The implementation process mainly includes:
(1) Establishing a single-stage blade sequencing optimization model. A mathematical model of single-stage blade sequencing is established with blade mass moment as input, diagonal position mass moment difference as a constraint, and minimum unbalance as the objective, and the blade sequencing order of each stage is optimized based on a reinforcement learning algorithm to reduce the unbalance of the blade set. The blade layout sequence is the main basis for blade assembly, while the blade set unbalance is also an input for the semi-physical simulation model;
(2) Calculation of the unbalance during assembly. As shown in
Figure 2, when assembling to the
tth blade, the previous t−1 work steps are performed, and the measured data are used. The remaining work steps are replaced by theoretical scheduling data. The final unbalance value can be calculated by Equations (1) and (2).
In the formula, is the remaining unbalance calculated by the blade assembly process; is the measured value of the unbalance generated by the assembly steps; is the theoretical data of the unbalance of the later steps not performed;
(3) Optimization of unbalance. If the calculation result of unbalance during assembly exceeds the design requirement, the sequencing of the unassembled blades is re-executed, and the sequence of blades between the t + 1th work step and the nth work step is optimized until the design requirement is satisfied. If the calculation results of unbalance in the assembly process meet the design requirements, the assembly continues in the order of the original blades.
The above single-stage blade assembly is the assembly process because the fan rotor is usually composed of a multi-stage blade and multi-stage disk, so the multi-stage blade assembly process is completed before the fan rotor assembly, and each level of the blade assembly process is the same. In this paper, a multi-stage blade assembly phase optimization model is established to ensure that the overall balance of the final fan rotor meets the requirements. As shown in
Figure 3.
The phase optimization of the multi-stage blade assembly is based on the analysis of the unbalance forces and torques generated by the residual unbalance of each stage’s blade in space synthesis. Taking the residual unbalance of each blade as input and aiming to minimize the unbalance force and torque, a mathematical model of rotor multi-stage blade assembly phase optimization considering the initial unbalance of the disk is established.
4. Single-Stage Blade Sequencing Optimization
The order in which the blades are arranged directly determines the size and direction of the unbalance and is one of the main factors affecting the balance of the fan rotor. The essence of blade sequencing optimization is to make the sum of the centrifugal forces of all blades on the axis of rotation as small as possible, thus, eliminating unbalanced forces and making the rotor’s mass eccentricity smaller. In practice, due to the existence of machining errors, blade geometry, shape profile, and mass center distribution have small differences; it is difficult to achieve a rotor unbalance of 0 only through reasonable blade sorting optimization to make the rotor unbalance as small as possible.
4.1. Single-Stage Blade Sequencing Optimization Modeling
Based on the above analysis, the mechanical modeling of the blade sequencing problem is performed. When the fan rotor is rotating at angular speed
, the centrifugal force of each blade is expressed as:
In the formula, is the centrifugal force of the th blade; is the mass of the th blade; is the vector diameter from the center of rotation of the th blade to the center of mass; and is the angular velocity of rotor rotation.
The purpose of blade sequencing optimization is to eliminate the unbalanced force, that is, to minimize the centrifugal force, as shown in the following formula:
Since the rotational angular velocity of all blades is the same, the minimum centrifugal force is actually to minimize the vector sum of the mass moments of all blades to the axis of rotation, so the objective function is:
In the formula, n is the number of blades; and is the mass moment of the th blade, that is, the product of the distance between the blade centroid and the rotating center. From the derivation of Equations (4) and (5), the following can be seen: the process of single-stage blade sequencing actually balances the mass moment of each blade, and the mass moment vector sum of each blade is the remaining unbalance of the single-stage disk.
In order to make the rotor operation more stable, the deviation range of the blade set should be controlled to avoid the mass moment difference of the same group of blades being too large. According to the enterprise design regulations, the blade diagonal position on the two blades of the mass moment difference cannot exceed a specific value. Therefore, the constraints of the blade sorting model in this paper are shown in the following equation:
In the formula, is the upper limit of the mass moment difference on the diagonal position of the blade at a certain level.
4.2. Reinforcement Learning-Based Optimization Model Solving for Blade Sorting
In this section, a Q-learning learning algorithm is used to solve the blade sorting optimization model problem. Reinforcement learning addresses the entire blade sorting problem and is global in nature. The Q-learning learning algorithm can converge faster than ordinary reinforcement learning algorithms, which can improve the efficiency of blade matching.
4.2.1. Reward and Punishment Function Design
The purpose of reinforcement learning in the blade ranking optimization model is to find a ranking strategy that minimizes the unbalance of blade groups. A good design of the reward and punishment function determines not only how well the model can be rewarded in the long run, but also how quickly the model can learn the ranking strategy. In order to express the reward and punishment function more clearly, the mass moment of the blade is decomposed orthogonally to the x-axis and y-axis, then, as shown in
Figure 4, the sum of the x-axis and y-axis unbalance can be expressed as:
In the formula, denotes the sum of the mass moments in the x-axis direction; denotes the sum of the mass moments in the y-axis direction; denotes the mass moment of the ith blade; and denotes the angle between the center of mass of the ith blade and the positive direction of the x-axis.
In the
Figure 4, the numbers 1–8 indicate 8 blades;
indicates the installation angle of the blades.
Thus, the equation for the unbalance of a single-stage blade can be expressed as:
In the formula, denotes the remaining unbalance after single-stage blade sorting; and denotes the position of the remaining unbalance.
The optimization objective considered in this paper is to minimize the unbalance value, so the reward and punishment function is set as:
4.2.2. Q-Value Update Strategy
R is the immediate reward for each action of the model and considers the long-term impact of each action. The matrix is the knowledge learned from experience, h represents the number of leaves, the rows of the matrix Q represent the current state, and the columns represent the possible actions to reach the next state; the matrix Q is initialized to 0.
Every time the model is trained, the elements of the matrix
Q are updated by the following equation:
In the formula, s is the current state; a is the action; R is the immediate reward; and is the learning variable.
The value of the elements in the matrix Q is then equal to the sum of the immediate reward R under the current state s and action a and the learning variable multiplied by the maximum reward value for all possible actions to reach the next state. The parameter takes a value between 0 and 1, . If is closer to 0, the model tends to consider only the immediate reward; if is closer to 1, the model considers the overall reward with a greater weight.
The blade sorting optimization model learns from experience using the algorithm described above, with each experience equating to a training session. In each training session, the model continuously selects blades, superimposes moments of mass, and receives a reward value once it reaches the target state. The purpose of training is to enhance the experience of the model, represented by the matrix Q. More training results result in a more accurate matrix Q. If the matrix Q is trained several times, then the model will not explore blindly during the blade sorting process but will reach the target state the fastest.
5. Multi-Stage Blade Assembly Phase Angle Optimization
5.1. Multi-Stage Blade Assembly Phase Angle Optimization Modeling
The blade sequencing optimization model established in the previous section can calculate the remaining unbalance after the blade arrangement of each level, while the overall unbalance produced by the aero engine fan rotor is the result of the accumulation of residual unbalance after the assembly of all blades. These residual unbalances produce large unbalance forces and unbalance moments when the rotor is rotating at high speed [
24], and the engine performance is seriously affected under the unbalance forces and moments [
22], as shown in
Figure 5. Take the three-stage blade as an example; after each level of blade assembly, the blade group unbalance is superimposed on the xy plane. In the actual assembly process, changing the blade assembly phase angle can effectively adjust the overall unbalance after multi-stage blade assembly, so it is necessary to consider the impact of blade assembly phase on the overall unbalance of the rotor.
The size of the remaining unbalance after each stage of blade arrangement is
, and
is the initial unbalance of the disk. Because of the unbalanced moment, a deformation force is generated at the journal position of the fan rotor so the position of the journal force in the rotor is recorded. Additionally, the distance coordinates of the other blade sets from the force point are defined as
. Taking the direction
of each stage of blade imbalance as the design variable, the components of the magnitude of the unbalance force on the x- and y-axis are expressed as:
In the formula, is the component of the unbalanced force on the x-axis; is the component of the unbalanced force on the y-axis; and Z is the number of fan blade stages.
The moments of the unbalanced forces around the x-axis and y-axis are expressed as:
In the formula, is the moment of the unbalanced force around the x-axis; and is the moment of the unbalanced force around the y-axis.
The purpose of multi-stage phase optimization is to minimize the amount of unbalance after multi-stage blade assembly, which means that the unbalance forces and moments are minimized. Therefore, the objective function of the multi-stage blade phase optimization model is:
5.2. Multi-Stage Blade Assembly Phase Angle Optimization Model Solving
The objective of reinforcement learning for multi-stage blade assembly phase angle optimization is to be able to find an assembly phase combination that minimizes the overall unbalance. Taking the three-stage blade as an example, this paper assumes that the reference plane where the initial unbalance force of the disk is located coincides with the plane where the second-stage blade is located, and the distance between the first-stage blade and the third-stage blade and the reference plane is considered equal. A coordinate system is established on the datum, and the unbalance force
of the disk is set to coincide with the x-axis, then, the equation of the unbalance force in the x-axis and y-axis can be simplified as:
The equation for the unbalanced moments around the x-axis and y-axis is simplified as:
The objective function of the multi-stage blade phase optimization model can be simplified as:
The optimization objective considered in this paper is to minimize the amount of unbalance after multi-stage blade assembly, that is, to minimize the unbalance forces and moments. Therefore, the Q-learning reward and punishment function is set as follows:
In the formula, are the weights of the reward factors.
The Q-value update strategy for the multi-stage blade phase optimization model is the same as described in
Section 4.2.2. By training the matrix
Q several times, the model reaches the optimal state quickly in the multi-stage blade phase optimization process.
6. Instance Validation
6.1. Experimental Preparation
This paper takes the assembly of a certain stage of an aero engine fan rotor blade as an example verification object. The experimental platform is a common lathe (CS6150), and the rotor material is aluminum alloy (AlSi10Mg). A laser displacement sensor (HG-C1030) is used to measure the displacement of rotor vibration. The sensor is fixed directly to the fixture, and the data collected by the sensor are stored by the microcontroller. The microcontroller is connected to the PC via the USB interface, and the data are displayed on the XCOM software in the PC. The rotor speed is set to 100 r/min, and the sampling frequency of the sensor is 800 HZ. The specific installation of the experimental device is shown in
Figure 6.
Before the experiment starts, the mass moment data of the blade are measured, and the data are obtained by using the mass moment measuring instrument (MW0) from Schenker, Germany, and the blade data are shown in
Table 1. The blades are numbered according to the mass moment data from smallest to largest, as shown in
Figure 7.
6.2. Single-Stage Blade Sequencing Optimization
Experimental validation of the blade sorting optimization model after the blade data and the initial unbalance of the disk are known. The reinforcement learning algorithm is used to obtain the blade ranking order, install the blades on the disk in order, and compare them with the experimental measurements to verify the effectiveness of the blade ranking method.
The order of blade arrangement is shown in
Figure 8, and the black triangle indicates the angle where the remaining unbalance is measured. The numbers in the figure indicate the angular value of the location of the black triangle.
In order to characterize the blade optimization order more intuitively, the ranking order is presented as a numerical sequence, and the ranking order is 30-29-15-32-11-16-9-4-3-6-14-20-25-13-2-23-12-19-7-8-31-27-28-26-17-33-24-21-1-5-10-18-22. In order to verify the optimization effect of reinforcement learning, the results of the remaining unbalance and the diagonal position mass moment differences are obtained by ranking the blades according to several methods mentioned in the literature [
21,
22], as represented in
Table 2.
As can be seen from
Table 2, for Weight Sorted [
25] and Sorting-1 (4skip) [
25] in the blade arrangement process, the remaining unbalance value seriously exceeds the requirements, and the diagonal mass moment difference does not meet the requirements. Work-Shop [
25], Sorting-2 (7skip) [
25], Sorting-3 [
14], Sorting-4 [
14], and Sorting-5 [
14] have a relatively low remaining unbalance under the condition where the diagonal mass moment difference meets the requirement, but the results are still less than satisfactory. Then, a comparison is made between the Q-learning algorithm and the traditional sorting method. As can be seen from the table, the diagonal mass moment differences of the reinforcement learning meet the requirements, and the calculated values of the remaining unbalance are much smaller than those of the traditional ranking methods, thus proving the effectiveness of the proposed method in this paper.
6.3. Single-Stage Blade Sequencing Semi-Physical Simulation
To verify the validity of the semi-physical simulation model according to the blade installation sequence 30-29-15-32-11-16-9-4-3-6-14-20-25-13-2-23-12-19-7-8-31-27-28-26-17-33-24-21-1-5-10-18-22, 14 blades are assembled in advance, and, when blade numbered 2 is installed, it is wrongly installed as blade number 7, as shown in
Figure 9.
Before the blades are assembled, the initial unbalance of the disk is calculated. The initial unbalance cannot be measured directly, so this paper calculates the initial unbalance indirectly by measuring the displacement change of the disk, which is measured as follows:
(1) The change in the radial displacement of the disk during rotation of the rotor is measured by a laser displacement sensor, and the value is recorded;
(2) The actual center of the mass position of the rotor is fitted using the least-squares method [
16] to derive the eccentricity of the disk, as shown in
Figure 10. Black point indicate theoretical rotor center of mass position, red point indicate actual rotor center of mass position;
(3) After the eccentricity distance is known, the initial unbalance of the disk is calculated by Equation (25), and the data are shown in
Table 3.
In the formula, is the initial unbalance of the disk; is the mass of the disk; and is the eccentricity of the disk, that is, the distance between the center of mass of the disk and the axis of rotation.
When the blade numbered 16 is wrongly installed as blade number 13, the overall remaining unbalance is calculated by the semi-physical simulation model to be 549.2, which exceeds the value of permissible unbalance. To avoid repeated disassembly of the blade, the overall unbalance is adjusted by optimizing the order of the unassembled blade, and the blade order is adjusted before and after, as shown in
Table 4.
After calculation, the unbalance of the blade assembly after the wrong adjustment sequence is 17.71, which meets the requirement of balance, indicating that the constructed semi-physical simulation model can realize the monitoring of the unbalance in the blade assembly process.
6.4. Multi-Stage Blade Assembly Phase Angle Optimization
Taking the first-, second-, and third-stage blade data of the same rotor as examples, the optimized blade layout diagram derived by Q-learning reinforcement learning is shown in
Figure 11. After the completion of the blade arrangement at all levels, the assembly phase angle between disk and blades at all levels is optimized through reinforcement learning. The initial unbalance force of the disk and the remaining unbalance force vector of the blades at each level are superimposed, which finally makes the remaining unbalance force and moment of the whole rotor meet the requirements.
The sequence table (counterclockwise) of the first-, second-, and third-level blades optimized using reinforcement learning is shown in
Table 5.
By optimizing the multi-stage blade assembly phase angle through reinforcement learning, the overall residual unbalance force and unbalance moment and the blade assembly phase angle of each stage before and after optimization are compared, as shown in
Table 6.
Analyzing the data in
Table 6, before optimization, the maximum remaining unbalance force calculated with the phase angle of the first-, second-, and third-stage blade set installation is 1.3633 at 0°, the unbalance moment is 1.2523, and the remaining unbalance is 150.12, which obviously exceeds the design value of 100 for the overall remaining unbalance of an aero engine plant. The optimized installation angle of the first-stage blade set relative to the disk is 328°, the angle of the second-stage blade set relative to the disk installation is 113°, and the angle of the third-stage blade set relative to the installation disk is 180°, and the remaining unbalance is 77.17, which meets the overall remaining unbalance requirement. As shown above, it is feasible to optimize the phase angle of multi-stage blade assembly by using reinforcement learning.
7. Conclusions
In this paper, we proposed a semi-physical simulation method based on reinforcement learning to optimize balance in the fan rotor assembly process. Firstly, based on the mass moments of individual blades, the diagonal mass moment difference was introduced as a constraint to build a single-stage blade sorting optimization model, and reinforcement learning was used to find the optimal sorting path so that the balance of the single-stage blade after sorting is optimal. Then, on the basis of the initial unbalance of the disk and the unbalance of the single-stage blade set, a multi-stage blade assembly phase optimization model was established, and reinforcement learning was used to find the optimal assembly phase so that the overall balance of the rotor is optimal. Finally, based on the collection of data during the assembly of the rotor, the least-squares method was used to fit and calculate the real-time assembly unbalance to achieve a semi-physical simulation of the optimization of balance during the assembly process. The validity of the proposed method and model was verified through experiments. However, because the model used in the experiment was a simplified model, which is different from a real fan rotor, the model needs to be optimized in the future to adapt to the on-site fan rotor assembly of enterprises.