An AI ‐ Based Fast Design Method for New Centrifugal Compressor Families

: Limiting global warming’s effects requires a sudden reduction of greenhouse gas emissions to pursue a net ‐ zero carbon growth in the next decades. Along with this energy transition, drastic and rapid changes in demand are expected in many sectors, including the one for centrifugal compressors. In this context, new aerodynamic design processes exploiting the know ‐ how of existing impeller families to generate novel centrifugal compressors could quickly react to demand variations and ensure companies’ success. Modifying the characteristics of existing compressors using a 1D single ‐ zone model is a fast way to exploit this know ‐ how. Besides, artificial intelligence could be useful to highlight relationships between geometrical parameters and performance, thus facilitating the achievement of optimized machines for new applications. Although the scientific literature shows several studies on mono ‐ dimensional approaches, the joint use of a 1D single ‐ zone model with an artificial neural network for designing new impellers from pre ‐ engineered ones remains understudied. Such a model was provided in this paper. An application to the case study of an expander–compressor impeller family derived from other existing natural gas liquefaction one was presented. Results proved that the proposed model enabled developing a new family from an existing one, improving the performance while containing design time and computational efforts.


Introduction
Global warming and the impact of greenhouse gas (GHG) emissions are currently a "hot topic" for the scientific community [1].As highlighted by several research institutions [2,3], the world mean temperature has shown an anomalous increase in the last 40 years.According to the Intergovernmental Panel on Climate Change (IPCC), by the end of the century, the current global warming trend could result in a rise of more than 2 °C compared with pre-industrial levels [4].To limit this increase, 196 Members of the 21st Conference of the Parties adopted, on 12 December 2015, a legally binding international treaty known as the Paris Agreement [5].To achieve this long-term goal and reach carbon neutrality by 2050, it is pivotal to reduce GHG emissions and promote a global energy transition since the energy sector accounts for over three-quarters of them [6].However, considering the IPCC projections and the targets set by the Paris Agreement, the transition needs to be accelerated to meet the 2050 goal [7].Hence, a sudden and drastic change in market needs is expected, along with a rapid product shift in companies' portfolios [8].This will result in a transformation of some industrial sectors like oil and gas, and in a new product demand for renewable energy, electricity, and turbomachinery [9].Regarding the last, a product switching is also expected in the specific field of centrifugal compressors, as evidenced by recent scientific developments on hydrogen, supercritical carbon dioxide (sCO2), and Organic Rankine cycles (ORC).
As new market opportunities for centrifugal compressors are rapidly emerging, the ability to integrate low-emission solutions into existing portfolios could determine the success as market leaders for many centrifugal compressor manufacturers [10].In this context, the new product development (NPD) process assumes a key role since a fast NPD approach could enable companies to achieve high product profitability and good market share [11].Focusing on centrifugal compressors, a fundamental step in the NPD process is represented by aerodynamic design [12], whose models can be distinguished according to the degree of complexity involved in the flow description [13].Starting from scratch up to detailed geometry, these models are progressively adopted and can be divided into four categories [14]: (i) statistical, (ii) one-dimensional (1D), (iii) two-dimensional (2D), and (iv) three-dimensional (3D).Regarding statistical models, they typically enable the prediction of centrifugal compressor performance using few input data related to the compressor geometry [14].A well-known statistical model was introduced by Casey and Robinson [15], where performance curves were obtained with physically based algebraic equations, which, as stated by the authors, can be solved requiring a little prior knowledge of the detailed geometry.However, according to Al-Busaidi and Pilidis [16], this model needs to adjust the empirical coefficients with experimental data and requires prior knowledge of the design and peak efficiencies.These limits can be overcome by adopting 1D methods, which are usually divided into single-and two-zone models [17].Singlezone models assume a uniform flow through the impeller passage and adopt a set of loss correlations to evaluate the performance at mean streamline.Two-zone models, instead, divide the impeller exit flow in a primary isentropic zone and in a secondary nonisentropic region according to Dean's theory [18].A typical example of two-zone model was introduced by Japikse [19], but, as stated by Harley et al. [20], as a large number of estimated inputs are required, successful use of Japikse's model is severely reliant on the staff experience.In addition, both models (single-and two-zone) make an approximation of the real flow nature.Hence, to ensure a higher accuracy without losing generality, aerodynamic design can be carried out using 2D throughflow models, a typical example of which was provided by Casey and Robinson [21].However, such models do not consider cross-flows perpendicular to the analysis surface [13], while the flow inside a centrifugal compressor was demonstrated to be strongly three-dimensional [22].In light of this, 3D models such as computational fluid dynamics (CFD) were introduced to ensure better accuracy and further improve the performance of centrifugal compressors.An example was reported by Cravero et al. [23], where an in-depth flow analysis allowed the authors to increase the stall margins in a small-size centrifugal compressor.However, the use of 3D models leads to greater computational complexity and the need for more skilled staff.
Designing a centrifugal compressor from scratch by sequentially applying the above models is a time-intensive process [24], which may conflict with the need to speed up the introduction of low-emission solutions [25].On the contrary, when fast NPD is required, it might be preferable to derive new centrifugal compressors from pre-engineered families [24].As observed by Hazby et al. [26], this strategy is common when developing new machines, and allows engineers to meet the requirements of emerging applications in a rapid and cost-effective way.As reported by Edwards-Schachter [27], architectural changes of previous designs are typical strategies in product innovation.Therefore, leveraging the available know-how of existing families could accelerate the introduction of novel centrifugal compressors [28].In this prospective, Bygrave et al. [29] highlighted the relevance of 1D models in upgrading a centrifugal compressor pre-engineered family.Casey et al. [30] showed how 1D models allow for a rapid evaluation of many geometries with a low computational effort.Indeed, as stated by the above authors, using CFD analysis in a continuous refinement process could be expensive, tedious, and timeconsuming.For this reason, 1D models are still playing a crucial role in the design of centrifugal compressor [31] and they can save time for further detailed analyses [30].Nevertheless, since many geometries can be quickly evaluated with 1D models, it would be useful for designers to have a tool to support them in choosing the best solution for the specific application.In this regard, the use of artificial intelligence (AI) could be valuable for highlighting relationships that are not obvious to the designer and facilitating the development of new centrifugal compressors from pre-engineered ones.Specifically, artificial neural networks (ANNs) could be used for this purpose.Indeed, the use of an ANN surrogate model (also referred to as "ANN meta-model") offers two benefits: (i) it could highlight hidden patterns not directly understandable from the 1D model [32], and (ii) it could reduce the overall timescale of the design activity [33].Although low-order models (such as 1D models) allow for limited data evaluation compared with CFD analyses, once these models have been validated with experimental or CFD data, characteristic curves can be accurately predicted.Especially in the initial research phase of new possible machines, where detailed knowledge of the flow field is not required, the knowledge of characteristic curves may be sufficient.Furthermore, the adoption of an ANN potentially allows new parameters, information, or constraints to be added later, updating the database and re-training the network.This implies that the meta-model could be trained almost dynamically, incorporating, for instance, manufacturing or sustainability information that could be obtained by other methods.However, to the best of the authors' knowledge, articles showing the joint use of a 1D model and ANN as a fast and computationally efficient NPD approach to derive new centrifugal compressor families from pre-engineered ones are still overlooked in the scientific literature.
Concerning 1D single-zone models, numerous studies are available in the open literature, and the widely adopted sets of loss correlations are those introduced by Aungier [34], Oh [35], and Galvas [36].Khoshkalam et al. [37] recently provided a new set, while Velásquez [38] reviewed the major loss models, finding a suitable set to predict single stage centrifugal compressor performance.Zhang et al. [39] established a method to select a reliable set of loss correlations, and Sundström et al. [40] compared 1D predictions with experimental data to assess the validity range of the simplified model.Furthermore, Kus and Nekså [41] considered a wide range of loss mechanisms to performance prediction of an oil-free CO2 centrifugal compressor.Ameli et al. [42] and Romei et al. [43] underlined the pivotal contribution of 1D single-zone models in designing centrifugal compressors for sCO2 power systems.Regarding the combined use of these models with AIs, few articles are available.Mei et al. [44] joined the Aungier's set with a genetic algorithm for developing a micro-centrifugal compressor for an air conditioner.Li et al. [31] adopted a 1D single-zone model with an adaptive simulated annealing algorithm to optimize a low-pressure stage centrifugal compressor for small gas turbines.Xia et al. [45] implemented a systematic approach of design for sCO2 compressors and carried out three multi-dimensional optimization using particle swarm optimization (PSO) and a genetic algorithm.Wang et al. [46] presented a systemcomponent coupled optimization method based on genetic algorithm and two onedimensional models for sCO2 centrifugal compressor and turbine.Recently, Bourabia et al. [47] proposed an approach to select the geometrical parameters of turbocharger based on a coupling between evolutionary optimization techniques and a 1D model.Similarly, Du et al. [48] combined a 1D single-zone model with a genetic algorithm to optimize a centrifugal compressor for sCO2 Brayton cycle.
Despite the above, the literature lacks approaches to combine a 1D single-zone model with an ANN to rapidly derive new centrifugal compressor.Considering this gap, a new research area focused on the joint use of 1D models and ANNs has recently emerged.Indeed, in 2022, Massoudi et al. [49] presented an approach for the robust design of a centrifugal compressor based on ANN surrogate models trained with database from a 1D model.However, the authors optimized only a single heat pump radial compressor, not designing an entire family.Moreover, they focused on a genetic algorithm (GA) to achieve the optimal set of ANN hyperparameters, while not considering minimizing ANN input parameters.Finally, they performed an optimization with NSGA-III instead of using a random search algorithm, which nevertheless showed results comparable to those of evolutionary algorithms [50] but with fewer parameters to be set.For these reasons, the present paper, complementary to Massoudi et al. [49], showed the application of an AIbased fast design method on an entire family of compressors, adopting the analysis of variance (ANOVA) to reduce ANN input parameters and using a random search algorithm for the optimization phase.As an example case study, the proposed model was here leveraged to achieve the design characteristics of a family of impellers for expandercompressor (EC) applications starting from an existing family used in production of liquefied natural gas (LNG).The main contribution of this article is twofold.At a theoretical level, it investigated the combination of a 1D model with an ANN, providing an NPD approach for new centrifugal compressor design.At a practical level, to it supports designers and practitioners in leveraging the know-how of existing centrifugal compressor families to quickly develop innovative, low-emission solutions and meet energy transition needs.
After providing some introductions embodying the theoretical background relevant to the topics of interest, the remainder of the present paper is as follows.In Section 2, the methodology followed to combine the 1D single-zone model and ANN is described.In Section 3, the proposed model is applied to the aforementioned case study, showing and discussing the achieved results.Finally, in Section 4, some conclusions on this study are offered.

Materials and Methods
Aiming to derive a new pre-engineered family from the ones already present in the company's portfolio, five steps were performed for each stage of the new family.In step 1, a 1D single-zone model was implemented and validated using CFD and experimental data.In step 2, an ANOVA was developed to highlight the impact of geometric parameters on centrifugal compressor performance, thus excluding from the study the non-significant ones.In step 3, first, a range of realistic values was associated with each non-negligible parameter of step 2, obtaining a dataset of possible geometries.Then, the 1D model was applied to each geometry belonging to the dataset, conducting a parametric analysis.In step 4, the results of the parametric analysis were exploited to train and validate an ANN.Next, the ANN was employed to search for a Pareto front of nondominated solutions.Finally, in step 5, the most promising solutions were validated with CFD simulations.Based on CFD results, the optimal solution was identified and used to define the considered stage of the new family of centrifugal compressors for EC applications.The following subsections describe each step in detail, while the results are shown in Section 3. As aforementioned, it is worth remembering that each step described below should be applied once per family stage.

Step 1: 1D Single-Zone Model Implementation and Validation
Considering a single-stage centrifugal compressor composed of an impeller, a diffuser, and a volute (as shown in Figure 1a), a 1D single-zone model was developed based on Harley [51] and Amirante et al. [52].In such a model, the flow properties were computed by solving energy and mass conservation, treating the operating fluid as a perfect gas.The impeller deviation angle was estimated using Qiu's formulation [53], and the pressure drop across the inlet duct (Figure 1a, component A) was not taken into account, being negligible compared to the overall losses.Figure 1b depicts the thermodynamic compression through the stage and the effects on the enthalpy and entropy of the impeller, diffuser, and volute.Concerning the impeller (Figure 1a, component B), internal and external losses were evaluated according to Oh [35] and Aungier [34].Internal losses were divided into the incidence (∆ℎ ), blade-loading (∆ℎ ), mixing (∆ℎ ), skin-friction (∆ℎ ), choke (∆ℎ ), and hub-to-shroud distortion (∆ℎ ) losses.On the contrary, external losses were split into the recirculation (∆ℎ ), disc-friction (∆ℎ ), and leakage (∆ℎ ) losses.Moreover, due to the main effect on impeller performance, incidence, blade-loading, mixing, and skinfriction losses were selected following Oh's study [35].Choke and hub-to-shroud distortion losses were added following Aungier's study [34] to improve the accuracy near the choke condition.In the present work, clearance losses were not considered among the internal losses since the impeller was shrouded.However, these losses could be considered by adding the correlation proposed by Jansen [54].As shown in Figure 1b, internal losses limited the total pressure ratio achievable by the impeller, while external ones caused an entropy increase with no effects on pressure.
Regarding the diffuser (Figure 1a, component C), in the present study, a vaneless configuration was investigated according to Amirante et al. [52].Specifically, the methodology by Amirante et al. [52] and Stuart et al. [55] was used to investigate the pressure drop across the diffuser, while Staniz's correlation [56] was leveraged to estimate the diffuser enthalpy loss.However, in case of a vaned diffuser, a similar approach can be applied, but using the enthalpy losses correlations indicated by Aungier [34].Finally, concerning the volute (Figure 1a, component D), losses were divided into circumferential velocity, meridional velocity, skin-friction, and exit-cone losses according to Hazby et al. [57] and Weber and Koronowski [58].For the sake of clarity, the loss correlations adopted for each component of the centrifugal compressor stage are summarized in Table 1.It is worth mentioning that these correlations have mainly been used with air as the working fluid.However, they have also been validated for other fluids, as reported by Ameli et al. [42] and Meroni et al. [59].[43] the effective number of blades was evaluated as   0.75 , where   _ is the number of main blades and   is the number of splitter blades.
The developed 1D model was validated by comparing its predictions with CFD and experimental data of three different stages characterized by low, medium, and high flow rate, respectively.Comparisons were made in terms of total-to-total pressure ratio ( ), polytropic efficiency (  ), work coefficient (), and polytropic head () defined as follows (Equations (1)-( 4)).

𝛽
, (1) Moreover, the mass flow rate and the rotational speed of each machine were defined by means of the non-dimensional flow coefficient () and peripheral Mach number ( ) as shown in Equations ( 5) and (6).
The comparisons were conducted using measurements of the starting pre-engineered family available in the company.Such measurements included the geometric parameters of the family and their respective characteristic curves in terms of  ,  , , and  (Equations (1)-( 4)), obtained at the impeller and diffuser outlet sections (Figure 1a, sections 2 and 3).For this reason, it was possible to validate the 1D model prediction of impeller and diffuser by comparing it to the results of CFD and empirical data.Conversely, the comparison between 1D model results and experimental measurements was not possible in the case of the volute (Figure 1a, section 4).Instead, another type of validation was performed by comparing 1D volute prediction with a simplified tool developed in-house by the company.Due to a non-disclosure agreement, the in-house tool and its results cannot be described; hence, the volute validation was not shown in the present work.
Concerning CFD analyses, RANS simulations were performed using the TRAF code [65], which is a 3D viscous solver based on the finite volume method for the resolution of Reynolds-Averaged Navier-Stokes equations on structured grids [66].The discretization of convective fluxes was handled by a 2nd order TVD-MUSCL strategy build on the Roe's upwind scheme, while a central difference scheme was used for the viscous fluxes.Wilcox's k-ω model was employed for turbulence closure [67], and a high level of parallelization was ensured exploiting a hybrid OpenMP/MPI architecture [68].The computational domain was discretized using H-type grids for the inlet duct, the rotor, and the diffuser (Figure 2).An overall grid of 3.5 million elements was selected after carrying out a sensitivity analysis, whose results are shown in Table 2. Indeed, increasing the number of elements beyond 3.5 million, the performance of the centrifugal compressor was no longer influenced by the adopted spatial discretization.Furthermore, to improve the predictive capability of the CFD analyses, the external losses shown in Table 1 were added to CFD results during the post-processing phase following the approach of Sundström et al. [40] and Harley et al. [69].Table 3 summarizes the main assumptions of the numerical setup adopted for CFD analyses.Experimental measurements were carried out with air at ambient conditions on a test rig available in the company.Pressure and temperature probes were positioned at the inlet and outlet sections of the impeller and diffuser, assessing their aerodynamic performance with an average error of less than 0.03%, 0.09%, and 0.79%, for temperature, pressure, and flow rate, respectively.As mentioned in Section 2.1, measurements were evaluated in the inlet section of the impeller, and in the outlet sections of both impeller and diffuser.Uncertainty propagation was then performed starting from the relative uncertainty of each sensor by means of in-house company -software.Systematic errors were assumed to be neglected since the pressure transducers are calibrated annually and the adopted thermocouples were provided with a calibration certificate.

Step 2: ANOVA
When looking for a suitable set of independent variables to adopt in a new family of centrifugal compressors, varying geometric parameters with negligible effects on performance could increase the time and the computational cost of the task without any benefit (also exacerbating the ANN training phase).On the contrary, adjusting only significant parameters could reduce the design efforts.In this perspective, four activities were carried out to highlight the significant independent parameters, while excluding from the study the non-significant ones.As a first activity, all the independent input variables used as design variables were identified based on Shouyi et al. [70] and on the company's know-how.However, it is worth mentioning that, as suggested by Aungier [34], unlike Shouyi et al. [70], the impeller axial length was not considered as an independent variable, but computed following the equation in ref. [34].The resulting independent variables are shown in the first column of Table 4.In the second activity, a range of three realistic values was associated to each independent variable.Specifically, the two range extremes (upper and lower bounds) were selected based on experts' knowhow in terms of expected aerodynamics effects and manufacturability issues.The third (intermediate) value was then calculated as arithmetic mean of the upper and lower extremes.In addition, a further comparison was made with the ranges provided by Shouyi et al. [70].The second column of Table 4 shows the specific ranges considered in the present study to achieve the desired EC family.It is worth noting that, since our aim was to achieve a new family of centrifugal compressors with inlet hub diameter and outlet diameter, no range of variation was considered for such independent variables.Instead, constant values were imposed as reported in Table 4. Subsequently, in the third activity, combining the variable values of Table 4, a total of 6561 perturbed geometries were generated and submitted to the 1D model at stall, design, and choke conditions (where the design condition was the point with peak efficiency, the stall condition the point with horizontal tangent of the polytropic head, and the choke condition the point at which the stage polytropic efficiency was reduced by 10%).This was carried out for three different stages characterized by low, medium, and high flow rates.Finally, in the fourth activity, the results of the 1D model predictions were subjected to an ANOVA to assess the impact of each independent variable of Table 4 on the stage performance [71].The ANOVA (with main effect plot) was performed with the Minitab software, allowing variables to assume three discrete values to obtain easily understandable graphs in which the impact could be immediately recognized.Based on the ANOVA results, the non-significant independent variables were excluded from the study (associating them with constant values), while considering in the next steps the remaining significant independent variables.

Step 3: Sobol Sequence
Having removed non-significant parameters, a new dataset was generated to feed and train an ANN as follows.A sample of feasible geometries was generated by associating realistic values with the remaining independent variables.To this aim, unlike the ANOVA (step 2), here variables were assigned by means of a Sobol quasi-random low discrepancy sequence [72], rather than adopting only three discrete values.In this perspective, the value of each independent variable was varied based on Equation (7), where  is the total number of realistic compressor geometries to be created,  is the specific geometry considered,  is the total number of remaining independent variables emerged in step 2,  is the specific independent variable to which we are assigning a value  ,  and  are the variable upper and lower limits, and S depends on the Sobol sequence.
Then, 2500 possible realistic geometries were achieved by combining the values of the independent variables.Next, each of these geometries was analyzed with the 1D model to predict total-to-total pressure ratio, polytropic efficiency, work coefficient, and polytropic head (Equations (1)-( 4)) at section 4 (Figure 1a) for stall, design, and choke conditions.Subsequently, a dataset composed by the values of the significant independent variables and the respective performance predictions was collected and used to train and validate an ANN.The Sobol sequence was adopted to link variables with quasi-random values based on Burhenne et al. [73].Indeed, as stated by the authors themselves, when a large number of independent input variables are affecting a problem, the Sobol sequence is more effective than other sequences such as Latin Hypercube or Monte Carlo sampling.

Step 4: Artificial Neural Network
The obtained Sobol-based dataset emerged in step 3 was used to train and validate an ANN, thus obtaining a response surface capable to reproduce the 1D model behavior.Specifically, a feed-forward ANN with two hidden layers was adopted and trained by means of a back propagation algorithm.By using 70% of the obtained Sobol-based dataset (the remaining 30% was used in equal parts for testing and validation), the training process was carried out in accordance with Riccetti et al. [74], aiming to minimize the difference between the ANN prediction and the 1D model results.To better understand this process, let us consider an input vector (from now on called ) composed of the independent variables achieved in step 2, and an output vector (here referred to as ) composed of the stage performance calculated through the 1D model (Equations (1)-( 4)).Considering the 1D model as a function , the  vector is obtained starting from  as follows (Equation ( 8)).
Starting from the same , the ANN prediction (here referred to as ′) can be defined based on Equation (9), where  represents the behavior that the ANN learned to forecast during the training process.
The ANN response surface  being a surrogate model of the 1D model  , it might be possible that the ANN prediction  differs from the 1D model output  .For this reason, the training process was conducted by minimizing the prediction error  defined as follows.Specifically,  was minimized by recursively performing the training process and updating, for several times (epochs), the weights used to manage the passage of information through neurons.
Once the ANN was trained, a k-fold cross-validation process was performed to understand the ANN reliability and avoid under-fitting and over-fitting issues.This activity completed the ANN construction phase.
Then, exploiting a random search algorithm [75], a new sample of possible geometries was built and given as input to the ANN.Therefore, from the ANN predictions, a Pareto front was found.

Step 5: Validation of Promising Solutions through CFD Analyses
Finally, a further validation of the obtained results was carried out as follows.Some promising solutions of the Pareto front were selected, each of the selected solutions was analyzed with CFD simulations, and the obtained results were compared with the ANN predictions (analyzing the values of total-to-total pressure ratio, polytropic efficiency, work coefficient, and polytropic head).Finally, once we validated the results of the ANN, one of the promising solutions was chosen and designated as a stage of the new centrifugal compressor family.

Results and Discussion
The methodology proposed in Section 2 was applied, as an example case study, to derive a new family of EC impellers from an existing family of LNG centrifugal compressors.For the sake of clarity, instead of showing the results obtained for each impeller of the family, the results are shown for three different stages characterized by low, medium, and high flow, respectively.Indeed, these stages are well representative for the whole family and the obtained considerations could be easily extended to intermediate stages.The following subsections describe and discuss the achieved results following the subdivision provided in Section 2.

Step 1: Validation of 1D Single-Zone Model
The implemented 1D single-zone model was validated by comparing its predictions with CFD and experimental data in terms of  ,  , , and  (Equations ( 1)-( 4)) at sections 2 and 3 of Figure 1a.While in section 4 (volute outlet), due to the absence of available experimental data, the validation was performed by comparing the 1D volute prediction with the company CFD-based in-house tool.It is worth mentioning that for this case study a vaneless diffuser was adopted, but the developed 1D model could also be applied with vaned configurations.Figure 3 shows the  trend for the three stages designed for low (left-hand curves), medium (middle curves), and high flow rate (righthand curves), respectively.At impeller outlet (section 2), a good agreement between the 1D model (blue lines) and the experimental data (white squares) was observed for each machine.The CFD analyses (orange lines) instead showed a slight over-estimation of the total-to-total pressure ratio.A similar behavior was recorded at the diffuser outlet (section 3).The 1D model seemed to follow the trend of the experimental data, except near the stall where the slope variation of the curve was not correctly replicated.However, in the worst case (low flow stage, left-hand curves) the percentage deviation was less than 1.94%.Moreover, it can be observed that the 1D model allowed reaching the stall condition, while the performed RANS simulations were not able to analyze it, especially in the medium and high flow stages.Focusing on the volute output (section 4), the 1D model showed an underestimation of  compared with the CFD-based in-house tool.This discrepancy is more relevant when increasing the flow coefficient at which the stage is designed.In Figure 4, the comparison between the 1D model, the CFD, and the experimental data is conducted in terms of  .In this case, a good agreement was reached at the diffuser outlet (section 3), while the same discrepancies were present at sections 2 and 4. Specifically, the worst percentage deviation of the 1D model was about 4.98% in comparison with experimental data at section 2, and near 3.55% compared with the CFDbased in-house tool at section 4.However, such differences can be considered acceptable, since the 1D model intrinsically simplifies the highly three-dimensional flow field of the centrifugal compressor.Regarding the work coefficient  (Figure 5), the medium and high flow coefficient stages showed a 1D prediction close to the CFD and experimental data.Indeed, the average percentage error in these cases was about 1.15%, while for the low flow stage it rose to 2.88%.In Figure 5 only section 2 is shown, since the diffuser and volute, being static components, do not increase the total enthalpy rise of the stage.The comparison shown in Figure 6 highlights how the 1D model can replicate the experimental data in terms of .Compared with the CFD results, the 1D predictions appear closer to the experimental measurements, as evidenced by an average percentage error of 2.10% instead of 3.25% for the CFD analysis.This trend remained unaltered at the exit of the volute (section 4).Figures 7-9 show the comparison between the 1D model and CFD analyses related to the impeller loss sources for the three stages.As it can be noticed, the loss trends resulting from the 1D model were consistent with those computed with the CFD simulations.Although the skin friction losses (Δℎ ) were underestimated in the 1D model, the overestimation of the incidence losses (Δℎ ) appeared to compensate for this underestimation, providing a correct overall assessment.On the other hand, the other sources of losses were comparable between the 1D and CFD models in terms of values and shapes.Furthermore, from Figures 7-9 it is possible to appreciate the relative impact of each loss source, in accordance with the Oh's findings [35].As a unique difference with Oh's study, the 3D geometry of the realized impellers made the adoption of choke and hub-to-shroud distortion losses proposed by Aungier [34] unavoidable.

Step 2: ANOVA Results
The ANOVA was performed to assess the impact on stage performance of each independent variable of Table 4, thus excluding non-significant ones from the study.To conduct the ANOVA, the values of the independent variables were varied by creating a set of 6561 possible geometries, where  ,  , and the number of blades were fixed to meet the new family requirements.Figures 10 and 11 depict the ANOVA outcomes (in terms of main effect plot) related to  and τ, for the low (blue), the medium (gray), and the high flow coefficient stage (black), respectively.Similar results were achieved for  and ψ but were not reported here for the sake of brevity.Figures 10 and 11 prove that none of the independent variables of Table 4 had a negligible impact on the stage performance.Hence, all independent variables were considered in the next steps of the present study to feed and train the ANNs and derive the new centrifugal compressor family.

Step 3: Sobol Sequence
Exploiting a Sobol sequence (Equation ( 7)), a range of realistic values was associated with each non-negligible independent parameter emerging from step 2 (Sections 2.2 and 3.2).As a result, a dataset of 2500 possible geometries was achieved for each stage of the family.Then, the 1D model was applied to each geometry belonging to the dataset, thus predicting the stage performance, and conducting three parametric analyses (one for each analyzed stage).

Step 4: Artificial Neural Network
The results of each parametric analysis were used to feed and train a related ANN.Consequently, three ANNs were obtained, one for each stage of the family.Figures 12-14 show the results of the ANN training process, comparing the 1D model prediction (blue dots) associated to each test geometry and the respective ANN forecast (red dots), in terms of  and τ.Like in step 3, similar results were achieved for  and ψ but were not reported for the sake of brevity.ANNs showed a good predictive capacity, as evidenced by the forecast absolute errors ( and  ) presented in Table 5 and calculated as the average of differences between each ANN forecast and its related 1D model prediction (expressed as a percentage).It is worth mentioning that the time required on a personal computer (Intel Core i7, 8 cores at 4.80 GHz and 32 GB DDR4 memory at 3200 MHz) to produce the database for the ANN training and the subsequent training phase was less than 6 h.This time was mainly due to the recursive use of the 1D model, which accounted for about 70% of the total time.However, once the ANN was trained, the resulting metamodel predicted the performance of one stage in less than 0.2 s, while the 1D model took two orders of magnitude longer.By using the proposed approach, the initial design time of the three stages was reduced by 75% compared with the procedure currently used in the company.Moreover, if the dataset is large enough, the same ANN could be used to design more than a single stage, further reducing design time.Once the ANNs were trained and validated, such meta-models were used in conjunction with a random search algorithm [75] to predict, for each of the three stages, the performance of a new sample of possible geometries.Based on the achieved predictions, three Pareto fronts were built, one for each stage.
Figures 15-17 depict all the investigated geometries (blue dots), also highlighting the optimal solutions composing the Pareto front (red circles).

Step 5: Validation of Promising Solutions through CFD Analyses
Among the optimal geometries of each Pareto front, five were selected as being the most promising (for a total of fifteen geometries).These machines were, then, validated through CFD analyses.Finally, the three geometries considered by company experts to represent the optimal trade-off between performance and geometric characteristics (e.g., size, manufacturability, mechanical strength, and so on) were selected.These geometries laid the foundation of the new family of centrifugal compressors for EC applications.Figures 18-21 show the comparison between the stage performance of the existing LNG family and the ones of the derived EC family in terms of  ,  , , and .Each derived machine provides better performance than the original family.In addition, the stall and choke margins were extended, thus increasing the operating range of each machine.Finally, the comparisons showed, once again, a good agreement between the 1D model and CFD.

Conclusions
The current efforts related to energy transition, the reduction of greenhouse gas emissions, and other sustainability issues are expected to induce, in the future, lumpy and unpredictable changes in the demand of centrifugal compressors.For this reason, a valuable way to quickly react to demand fluctuations is highly claimed by compressor manufacturers.In this context, the combined use of a 1D single-zone model and ANNs has been identified as a sound way to exploit the know-how of existing impeller families, highlight relationships between geometrical parameters and performance of existing machines, and shorten and streamlining the new product aerodynamic design process by modifying the characteristics of existing compressors to generate new optimized machines.However, the literature lacks studies on the joint use of a 1D single-zone model with an ANN for designing new centrifugal compressor families starting from preengineered ones.
To fill this gap, the present paper proposed a new model (testing it on the case study of a new expander-compressor impeller family derived from existing natural gas liquefaction machines), which involved performing five main steps: (i) a 1D single-zone model was built to predict the performance of the impeller, diffuser, and volute of centrifugal compressors, which was validated through a comparison with experimental data and CFD analyses; (ii) the most relevant independent input variables required to apply the 1D model were identified (leading to eight independent variables).Then, an ANOVA was conducted, proving that all these variables had a consistent (non-negligible) impact on the results of the 1D model predictions; (iii) an extensive parametric analysis was performed for each compressor stage analyzed, investigating three main compressor stages (designed for low, medium, and high flow rate).Then, assigning Sobol quasirandom values to the relevant independent input variables, a total of 2500 geometries were defined for each compressor stage.Next, each geometry was submitted to the 1D single-zone model, defining for each geometry its performance in terms of total-to-total pressure ratio, polytropic efficiency, work coefficient, and polytropic head; (iv) the results of the parametric analyses were used to feed and train an ANN for each compressor stage analyzed.Then, the ANNs forecasts were used to build a Pareto front for each compressor stage analyzed, thus identifying fifteen promising compressor solutions, in agreement with company experts; (v) finally, the performance of these fifteen solutions forecasted by the ANN were validated through a comparison with CFD analyses, also identifying the three geometries (one per each compressor stage analyzed) that represented the optimal trade-off between aerodynamic performance and geometric characteristics.
The results of the present paper showed that the proposed model enables developing a new family of centrifugal compressors starting from an existing one, improving the initial performance while containing design time and computational efforts in favor of further 3D optimizations.Moreover, results proved that the combined use of a 1D singlezone model and an ANN leads to reliable predictions, determining low absolute errors (always below 1.0%) in the forecast of compressor performance (compared to only 1D prediction).Finally, the results of this study showed that exploiting the know-how of existing machines to design new compressor families significantly shortened the new product aerodynamic design process.In fact, applying the new model to the case study here proposed allowed the design of three different compressor stages in less than one week (against one month usually required by the staff of the company investigated to obtain similar results).
Overall, the main contribution of this study is represented by the five-step model based on the combined use of a 1D single-zone model and an ANN.Indeed, to the best of the authors' knowledge, nothing similar has been provided by the literature so far to design new centrifugal compressor families starting from pre-engineered ones, thus shortening the new product aerodynamic design process.
It is worth mentioning that the proposed model can be applied to any kind of centrifugal compressor, although the present study showed an application to the specific case of overhang centrifugal compressors for EC application.However, it is worth noting that the results of the proposed model are based on the following simplifying assumptions: the flow through the compressor stage is one-dimensional, the flow properties are assumed uniform at each stage section, and the loss correlations are assumed to correctly predict the real losses through the compressor stage.
Future developments of this research could be twofold.First, to study the impact that technical specifications (e.g., the skin friction coefficient or the factor for incidence loss), imposed as design input choices, have on the model predictions, especially focusing on the influence of these factors on the 1D single-zone model forecasts.More in detail, we could define if the new proposed model is robust, or it changes its performance accuracy when varying the imposed input parameters.Secondly, to include in the new proposed model the analysis of manufacturing non-conformities, studying from the company databases the non-conformities that occur more frequently, then exploiting artificial intelligence to obtain, among all the possible compressor geometries, the ones whose performance are more robust and reliable against manufacturing uncertainties and variability in geometrical properties.
Authors should discuss the results and how they can be interpreted from the perspective of previous studies and of the working hypotheses.The findings and their

Figure 1 .
Figure 1.Centrifugal compressor stage (a) and its thermodynamic transformation on the h-s plane (b).

Figure 2 .
Figure 2. Example of the selected H-type grid for CFD simulations.

Figure 5 .
Figure 5.Comparison between 1D model predictions, CFD, and experimental data (with error bars) in terms of  at the impeller outlet (section 2).

Figure 7 .
Figure 7. Loss source breakdown evaluated with the 1D model (left) and CFD (right) for the low flow stage.

Figure 8 .
Figure 8. Loss source breakdown evaluated with the 1D model (left) and CFD (right) for the medium flow stage.

Figure 9 .
Figure 9. Loss source breakdown evaluated with the 1D model (left) and CFD (right) for the high flow stage.

Figure 10 .
Figure 10.ANOVA main effect plot of  for low (blue), medium (gray), and high (black) flow stage.

Figure 11 .
Figure 11.ANOVA main effect plot of  for low (blue), medium (gray), and high (black) flow stage.

Figure 12 .
Figure 12.Comparison of 1D prediction (blue dots) and ANN forecast (red dots) under stall, design, and choke conditions for the low flow designed impeller.

Figure 13 .
Figure 13.Comparison of 1D prediction (blue dots) and ANN forecast (red dots) under stall, design, and choke conditions for the medium flow designed impeller.

Figure 14 .
Figure 14.Comparison of 1D prediction (blue dots) and ANN forecast (red dots) under stall, design, and choke conditions for the high flow designed impeller.

Figure 15 .
Figure 15.New geometries investigated with ANN (blue dots) in stall, design, and choke conditions and Pareto front (red circles) for the low flow designed impeller.

Figure 16 .
Figure 16.New geometries investigated with ANN (blue dots) in stall, design, and choke conditions and Pareto front (red circles) for the medium flow designed impeller.

Figure 17 .
Figure 17.New geometries investigated with ANN (blue dots) in stall, design, and choke conditions and Pareto front (red circles) for the high flow designed impeller.

Figure 18 .
Figure 18.Comparison between old LNG and new EC geometries evaluated with the 1D model and CFD in terms of  at the impeller (section 2), diffuser (section 3), and volute (section 4) outlet.

Figure 19 .
Figure 19.Comparison between old LNG and new EC geometries evaluated with the 1D model and CFD in terms of  at the impeller (section 2), diffuser (section 3), and volute (section 4) outlet.

Figure 20 .
Figure 20.Comparison between old LNG and new EC geometries evaluated with the 1D model and CFD in terms of  at the impeller (section 2), diffuser (section 3), and volute (section 4) outlet.

Figure 21 .
Figure 21.Comparison between old LNG and new EC geometries evaluated with the 1D model and CFD in terms of  at the impeller (section 2), diffuser (section 3), and volute (section 4) outlet.

Table 1 .
Loss correlations for impeller, diffuser, and volute performance prediction.
* Note: according to Romei et al.

Table 2 .
Sensitivity analysis of the spatial discretization impact on performance.

Table 3 .
Main assumptions, characteristics, and peculiarities of the adopted numerical setup.

Table 1 .
Geometric parameters adopted as independent variables in the ANOVA and their range of variation.

Table 5 .
ANN absolute error in prediction of polytropic efficiency  and work coefficient  .