Stress Distribution Prediction of Circular Hollow Section Tube in Flexible High-Neck Flange Joints Based on the Hybrid Machine Learning Model

The flexible high-neck flange is connected to the circular hollow section (CHS) tube through welding, and the placement of the weld seam and corresponding stress concentration factor (SCF) are crucial determinants of the joint’s fatigue performance. In this study, three hybrid models combining ant colony optimization (ACO), a genetic algorithm (GA), and grey wolf optimization (GWO) with a random forest (RF) model were developed to predict the stress distribution on the inner and outer walls of the CHS tube under different flange parameter combinations. To achieve this, an automated finite element (FE) analysis program for flexible high-neck flange joints was initially developed based on ABAQUS 2020 software. Parameter combinations were randomly selected within a reasonable range to simulate the nonlinear mechanical behavior of the joint under uniform tension, generating a dataset comprising 5417 sets of data. The accuracy of the FE model was validated through experimental data from the literature. Based on this, feature importance analysis was conducted to reveal the influence of different variable parameters on the stress distribution in the tube of the joint. The flange parameters and tube stress distribution are considered as inputs and outputs, respectively. Three hybrid RF models, specifically ant colony optimization-based random forest (ACO-RF), genetic algorithm-based random forest (GA-RF), and grey wolf optimization-based random forest (GWO-RF), are trained for regression prediction. The results demonstrate that the three hybrid models outperform the original machine learning model in predictive accuracy. The ACO-RF model achieved the highest accuracy with average coefficients of determination (Rmean2) of 0.9983 and 0.9865 on the testing and training sets, respectively. Building upon this foundation, the study developed a corresponding open-source graphical user interface (GUI) as a tool for facilitating computations and visualizing results. Finally, a case study on fatigue damage assessment of a flexible high-neck flange joint in a wind-turbine tower is presented to demonstrate the application of the proposed model in this study.


Introduction 1.Research Background
Flange connections represent a prevalent form of node connection in structural engineering [1].This involves the welding of flange plates to two tubular components, subsequently joined through the use of bolts.The applicability of flange connections spans a wide range, encompassing structures as modest as streetlight poles to those as substantial as wind-turbine towers.Flange nodes can be categorized into various types based on their constructional nuances.Among these, a flexible flange pertains to a constructional w configuration wherein vertical stiffeners are not affixed to the flange plate.This results in an aesthetically pleasing, streamlined profile with reduced welding efforts and a heightened flange planarity.It is worth noting, however, that flexible flanges possess a relatively lower degree of rigidity when compared to their rigid counterparts.For structures enduring fatigue loads, the utilization of flexible flanges is a common practice to circumvent challenges stemming from fatigue initiation imperfections and residual stresses attributed to a greater number of weld seams.
Flexible flanges are typically forged in a single operation, forming both the flange plate and the flange neck, also known as flexible high-neck flanges.Under tensile loads, their characteristic of low stiffness leads to significant nonlinear mechanical behavior.This manifests as the flange plate separating under high tensile stress, generating a prying force, and inducing bending stress in the flange neck and connected tube.The widespread application of flexible high-neck flange connections is dictated by the unique requirements of different engineering projects.These distinctive features result in a wide variety of flange shapes.For instance, flange plates used in wind-turbine towers are often thick, with a substantial distance between the bolt holes and the free edge of the flange plate.These unique characteristics also give rise to variations in the nonlinear mechanical behavior of the flanges.Consequently, flange joints designed to suit specific engineering characteristics may no longer adhere to the basic assumptions of universally applicable flange engineering design methods.Therefore, instances of flange weld seam failures have been documented (such as the wind-turbine tower accident https://www.sohu.com/a/311598972_652081,accessed on 20 September 2023).
The forging dimensions of flexible high-neck flanges can be customized according to specific requirements.However, typically, the height of the neck is kept relatively short to save costs.In fact, for structures facing significant fatigue issues, the height of neck determines the location of the weld seam between the flange and the tube, along with the corresponding stress concentration factor (SCF).This aspect has not yet received adequate attention in current structural design [2].Some designs even overlook the asymmetry of the flange's inner and outer shape, as well as the stress concentration resulting from nonlinear mechanical behavior (i.e., bending stress).Instead, they simply assume that the SCF at this weld seam position is solely influenced by external parameters like the weld bead height and weld toe radius.Therefore, conducting a detailed study of the stress distribution in the tube wall (as shown in Figure 1) for flexible high-neck flange joints under different parameter combinations is essential for devising joint design solutions that prioritize both safety and cost-effectiveness.
Materials 2023, 16, x FOR PEER REVIEW 2 of 31 constructional nuances.Among these, a flexible flange pertains to a constructional w configuration wherein vertical stiffeners are not affixed to the flange plate.This results in an aesthetically pleasing, streamlined profile with reduced welding efforts and a heightened flange planarity.It is worth noting, however, that flexible flanges possess a relatively lower degree of rigidity when compared to their rigid counterparts.For structures enduring fatigue loads, the utilization of flexible flanges is a common practice to circumvent challenges stemming from fatigue initiation imperfections and residual stresses attributed to a greater number of weld seams.Flexible flanges are typically forged in a single operation, forming both the flange plate and the flange neck, also known as flexible high-neck flanges.Under tensile loads, their characteristic of low stiffness leads to significant nonlinear mechanical behavior.This manifests as the flange plate separating under high tensile stress, generating a prying force, and inducing bending stress in the flange neck and connected tube.The widespread application of flexible high-neck flange connections is dictated by the unique requirements of different engineering projects.These distinctive features result in a wide variety of flange shapes.For instance, flange plates used in wind-turbine towers are often thick, with a substantial distance between the bolt holes and the free edge of the flange plate.These unique characteristics also give rise to variations in the nonlinear mechanical behavior of the flanges.Consequently, flange joints designed to suit specific engineering characteristics may no longer adhere to the basic assumptions of universally applicable flange engineering design methods.Therefore, instances of flange weld seam failures have been documented (such as the wind-turbine tower accident https://www.sohu.com/a/311598972_652081,accessed on 20 September 2023).
The forging dimensions of flexible high-neck flanges can be customized according to specific requirements.However, typically, the height of the neck is kept relatively short to save costs.In fact, for structures facing significant fatigue issues, the height of neck determines the location of the weld seam between the flange and the tube, along with the corresponding stress concentration factor (SCF).This aspect has not yet received adequate attention in current structural design [2].Some designs even overlook the asymmetry of the flange's inner and outer shape, as well as the stress concentration resulting from nonlinear mechanical behavior (i.e., bending stress).Instead, they simply assume that the SCF at this weld seam position is solely influenced by external parameters like the weld bead height and weld toe radius.Therefore, conducting a detailed study of the stress distribution in the tube wall (as shown in Figure 1) for flexible high-neck flange joints under different parameter combinations is essential for devising joint design solutions that prioritize both safety and cost-effectiveness.1.2.State-of-the-Art Review 1.2.1.The Design and Calculation Methodology for Flexible Flange Joints Under high tensile stress, flexible flange joints exhibit nonlinear mechanical behavior, thus demanding special attention in the calculation and design process.When the tensile loading exceeds the sum of the compressive forces generated by bolt pretension, two mutually compressed flange plates will separate and open.As the tension continues to increase, the flange plates will rotate around a specific axis, and this rotation axis is different for the inner and outer flanges.Typically, the specific position of the rotation axis usually needs to be determined based on finite element analysis.Relevant design standards, such as GB50135-2017 [3], account for the nonlinear effects of this type of connection.Through the comparison of various typical flexible flange calculations, the rotation axis for the outer flange compression zone is defined at the tangent line inside the tube wall, while the rotation axis for the inner flange is defined at a position two-thirds of the tube radius away from the center.The aforementioned calculations have guided the engineering design of numerous flexible flange nodes.However, in reality, as the application scenarios of flexible flange nodes become increasingly diverse, flanges under existing parameter combinations may have already exceeded the "typical" definitions established during standard formulation.This is especially true for wind-turbine towers with thick flange plates and thin tower cylinder walls, for which further targeted research is needed on the position of the rotation axis and the nonlinear mechanical behavior of the joints.
Furthermore, for flexible flange joints subjected to fatigue loads, targeted efforts should be made to conduct fatigue-resistant design for weld seams and bolts.Standards and specifications such as IIW [4] and DNVGL-RP-C203 [5] define fatigue classes (FAT) for joints similar to flange connections, but they are tailored for cases where the weld seam is located at the root of the flange plate.The difference in the height of the neck in high-neck flanges, i.e., the variation in weld seam position is not considered.Additionally, due to the nonlinear mechanical behavior of flange joints under tensile stress, the corresponding FAT should be strongly correlated with the load mean value of the load cycles endured by the joint.This is an aspect that current engineering algorithms in standards have yet to address.
In summary, existing design standards are gradually becoming inadequate for the refined design of current flexible high-neck flanges.Especially for joint configurations that deviate from traditional calculation assumptions, determining rational assumptions for computational models and selecting appropriate flange design parameters to achieve an optimal design solution is an urgent issue that needs to be addressed.

Application of Machine Learning Models in Structural Design Area
For addressing specialized structural designs that conventional standards may not cover, researchers typically propose corresponding theoretical calculation formulas through mechanism analysis and theoretical derivation.Alternatively, numerical simulations may be utilized to generate a large dataset of computational results, which can be used to fit semi-empirical or empirical formulas.For example, in the case of predicting SCF under different weld seam shapes, Luo et al. [6,7] and Wang et al. [8,9] calculated SCFs for various combinations of shape parameters and misalignments.And parameter fitting methods are then employed to obtain empirical formulas for the SCFs for butt welded joints, cruciform joints, T-shaped joints, and so on.This approach constitutes the origin of various empirical formulas for stress intensity factors, commonly used in the field of fracture mechanics.The same approach can also be applied to the study of flexible high-neck flange joints in this paper.However, this traditional method of parameter fitting usually requires initially defining the basic form of a formula.Subsequently, the values of various parameters in the formula are computed through fitting.The basic form of this formula relies on the researcher's experience, or it may directly involve the use of high-order functions.
The era of data science has brought about a transformation in the development of various disciplines.The increase in computing capability means that previously complex problems no longer require simplified solutions.For the issue of stress distribution in flexible high-neck flange joints under tensile stress, which is influenced by numerous parameters, establishing nonlinear relationships among multiple parameters is currently one of the relatively mature techniques in the field of machine learning, especially when there exists a large dataset.As a crucial component of artificial intelligence, machine learning offers valuable methods for identifying relevant data features [10].In recent years, the use of machine learning for regression prediction involving multiple parameters has become increasingly common, as demonstrated in references [10][11][12][13][14]. Additionally, Table 1 also illustrates the use of machine learning models in various application scenarios.Some machine learning models have shown a remarkable performance in the field of engineering.For example, they have been used to identify the degree of bolt looseness based on the sound generated when tapping the bolts [15], or for rapidly assessing the seismic damage status of steel frames [16].Additionally, in the realm of stress distribution prediction, researchers have also employed models such as convolutional neural networks to create stress distribution prediction models tailored to different objects, e.g., reference [17][18][19], and notable results have been achieved, thereby circumventing the need for complex numerical computation processes.

Authors Machine Learning Model Used Application Scenario
Mangalathu [10] Artificial Neural Network (ANN) Failure model prediction of a round reinforced concrete bridge column Rahman [11] eXtreme Gradient Boosting (XGboost) Prediction of shear strength of steel fiber reinforced concrete beams Kameshwar [13] Decision Tree (DT) Earthquake recovery model of a bridge He [15] K-Nearest Neighbor (KNN) Prediction of loosening state of underwater bolt connection Nguyen [16] Random Forest (RF) Damage assessment of buildings after an earthquake Bolandi [17] Convolutional Neural Networks (CNN) Stress distribution of damaged structural components with stress concentration Liang [18] Deep Learning Model (DL) Stress distribution in human aorta Sepasdar [19] Convolutional Neural Networks (CNN) Nonlinear stress distribution and failure modes in microstructure characterization of composite materials Nguyen [20] Random Forest (RF) Maximum displacement of three pendulum isolation system Nguyen [21] Random Forest (RF) Peak lateral displacement of an isolation system under earthquake action Zhou [22] Random Forest (RF) Assess the liquefaction potential of soil Lele ń [23] Linear Regression (LR) Prediction of vibration amplitude and surface roughness after water jet cutting Ahmed [24] Random Forest (RF) Predict the strength of self-compacting mortar samples It is evident that applying machine learning methods to predict the stress distribution in the CHS tube of flexible high-neck flange joints is technically feasible.Based on an extensive literature review, it is known that among various machine learning models for regression prediction, the random forest model has shown the best performance in multiple cases, as seen in references [14,20,21].However, for the random forest model, certain initial parameters need to be defined before use, and the values of these parameters largely determine the final performance of the model.Currently, these parameters are often selected and manually adjusted based on experience, resulting in a low efficiency and the inability to achieve a globally optimal solution.To address this limitation, researchers have proposed hybrid models that combine traditional optimization algorithms with machine learning models.This approach involves using traditional optimization algorithms to select the initial parameters of the machine learning model, thereby improving the accuracy and modeling efficiency of the model.For the random forest model, some researchers have already proposed hybrid models that incorporate genetic algorithms [22] and grey wolf algorithms [25], all of which have achieved satisfactory results.Although there is currently no specific predictive model for the mechanical behavior of flexible flange joints, the existing achievements in the field of machine learning mentioned above provide crucial references for predicting the stress distribution in the CHS tube of high-neck flexible flange joints in this paper.

Existing Research Gaps
At present, there are few targeted and detailed studies on the influence of different geometric parameters and load levels on the nonlinear mechanical behavior of flange joints.In addition, the research on the nonlinear mechanical behavior of flange joints focuses mostly on bolts, with few studies focusing on the stress distribution on CHS tubes connected to flanges.However, with the increasing use of high-strength steel for lightweight design in engineering applications, the thickness of the CHS tube wall is also on a downward trend, and the stress distribution near the weld seam on tube wall should also be taken into account seriously.In fact, closer to the flange plate, there are higher stress peaks on the tube.This can be attributed to the geometric asymmetry of the flange and the nonlinear mechanical behavior, resulting in a stress concentration effect.This combines with the SCF caused by the weld seam shape to form the total SCF at the weld seam location of the flange joint.For joints subjected to severe fatigue loads, determining the stress distribution on the tube is fundamental for placing weld seams and accurately assessing stress amplitudes at the weld seam location.It necessitates targeted research across a wide range of flange parameters, covering commonly used shapes of flexible flange joints.It will ultimately lead to the development of a stress distribution prediction tool that can be utilized by researchers and engineers.For applications that require user interaction and usage, a graphical user interface (GUI) is especially important.The GUI achieves a more user-friendly and intuitive stress distribution prediction tool by providing graphical elements such as icons, buttons, windows, and menus.

Contribution of the Work
In response to the research gaps mentioned above, this paper conducts a detailed study on the nonlinear mechanical behavior of flexible high-neck flange joints under uniform tensile load, as well as the stress distribution law on the tube wall.This paper firstly compiles a Python program to control ABAQUS 2020 software to perform automated modeling analysis and stress extraction for flexible high-neck flange joints, establishes a dataset consisting of 5417 data, and reveals the typical stress distribution of the tube wall and the influence of different geometric parameters and loads on the stress distribution.On this basis, three hybrid models, namely the ACO-RF model, GA-RF model, and GWO-RF model, were established and trained to predict the stress distribution on the tube wall of a flexible high-neck flange joint.The best performing model was selected to develop a GUI tool for engineers and researchers to predict the stress distribution in the tube wall of flexible high-neck flange joints.Finally, a case study on the fatigue damage assessment of a flexible high-neck flange joint in a wind-turbine tower is presented to demonstrate the application of the proposed model in this study.

Theoretical Overview
In this chapter, an overview of the random forest, ACO-based RF model, GA-based RF model, and GWO-based RF model is provided.

Random Forest Model and Parameters
Random forest, introduced by Leo Breiman [26], draws its inspiration from the early work of Amit and Geman [27].The core idea of RF is based on ensemble learning, where multiple decision trees are combined to enhance model performance.In this approach, each decision tree is constructed using a random subset of the training dataset.By averaging the regression results of each tree, the final prediction of RF is obtained, as illustrated in Figure 2.

Random Forest Model and Parameters
Random forest, introduced by Leo Breiman [26], draws its inspiration from the early work of Amit and Geman [27].The core idea of RF is based on ensemble learning, where multiple decision trees are combined to enhance model performance.In this approach, each decision tree is constructed using a random subset of the training dataset.By averaging the regression results of each tree, the final prediction of RF is obtained, as illustrated in Figure 2. In the RF model, there are numerous parameters that need to be determined before training, which are known as hyperparameters.These hyperparameters include: (1) The RF model allows individual decision trees to use a maximum number of features (max_features).Increasing max_features generally enhances model performance but may reduce algorithm speed.(2) A higher number of decision trees (n_estimators) in the RF model can increase model stability but also raises the overall computational load.(3) The number of leaf nodes at the end of decision trees in the RF model (min_samples_leaf) must be considered.Smaller leaf nodes make it easier for the model to capture noise in the data.(4) The maximum depth of the decision tree in the random forest (max_depth) can limit the complexity and computation of the model.(5) The minimum number of samples needed to repartition the nodes inside the decision tree in a random forest must be considered, which limits the conditions for further partitioning of the subtree.The selection of these hyperparameters is crucial for building an RF model with the best performance and generalization ability, and therefore needs to be carefully adjusted.Different combinations of hyperparameters may apply to different problems and datasets, so trade-offs need to be made in model selection and tuning.

ACO-Base RF Model (ACO-RF)
Using ant colony optimization (ACO) for the selection of the hyperparameter of an RF model can yield stable and highly accurate regression results.The ACO algorithm was initially introduced by Marco Dorigo and his colleagues [28][29][30]  In the RF model, there are numerous parameters that need to be determined before training, which are known as hyperparameters.These hyperparameters include: (1) The RF model allows individual decision trees to use a maximum number of features (max_features).Increasing max_features generally enhances model performance but may reduce algorithm speed.(2) A higher number of decision trees (n_estimators) in the RF model can increase model stability but also raises the overall computational load.(3) The number of leaf nodes at the end of decision trees in the RF model (min_samples_leaf) must be considered.Smaller leaf nodes make it easier for the model to capture noise in the data.(4) The maximum depth of the decision tree in the random forest (max_depth) can limit the complexity and computation of the model.(5) The minimum number of samples needed to repartition the nodes inside the decision tree in a random forest must be considered, which limits the conditions for further partitioning of the subtree.The selection of these hyperparameters is crucial for building an RF model with the best performance and generalization ability, and therefore needs to be carefully adjusted.Different combinations of hyperparameters may apply to different problems and datasets, so trade-offs need to be made in model selection and tuning.

ACO-Base RF Model (ACO-RF)
Using ant colony optimization (ACO) for the selection of the hyperparameter of an RF model can yield stable and highly accurate regression results.The ACO algorithm was initially introduced by Marco Dorigo and his colleagues [28][29][30] in the early 1990s and is inspired by the observed behavior of ants.This algorithm mimics the cooperative behavior of ants in finding the shortest path between their nest and food sources to solve optimization problems.
The ACO-RF hybrid model combines the advantages of both the RF model and the ACO algorithm to optimize the hyperparameters of the RF model, leading to the optimal parameter selection results.The framework of ACO-RF, as shown in the Figure 3a, employs the ACO algorithm to search for the optimal combination of five parameters ('max_features', 'n_estimators', 'min_samples_leaf', 'max_depth', and 'min_samples_split') in the RF model corresponding to the dataset.mization problems.
The ACO-RF hybrid model combines the advantages of both the RF model and the ACO algorithm to optimize the hyperparameters of the RF model, leading to the optimal parameter selection results.The framework of ACO-RF, as shown in the Figure 3a, employs the ACO algorithm to search for the optimal combination of five parameters ('max_features', 'n_estimators', 'min_samples_leaf', 'max_depth', and 'min_sam-ples_split') in the RF model corresponding to the dataset.

GA-Based RF Model (GA-RF)
In addition to the ACO algorithm, this study also employed another optimization algorithm, the genetic algorithm (GA), to optimize the RF model.
GA is a programming technique designed to create optimal solutions for any specific problem [31].These solutions are generated by simulating the natural biological evolution process through the creation of a set of chain-like sequences resembling chromosomes.These chain-like sequences are referred to as individuals, allowing independent evaluation of each solution, followed by some random genetic operations to produce the optimal solution [32].
The second hybrid model and the third hybrid model used in this study, GA-RF [22], share a similar structural framework with the previous ACO-RF hybrid model.The

GA-Based RF Model (GA-RF)
In addition to the ACO algorithm, this study also employed another optimization algorithm, the genetic algorithm (GA), to optimize the RF model.
GA is a programming technique designed to create optimal solutions for any specific problem [31].These solutions are generated by simulating the natural biological evolution process through the creation of a set of chain-like sequences resembling chromosomes.These chain-like sequences are referred to as individuals, allowing independent evaluation of each solution, followed by some random genetic operations to produce the optimal solution [32].
The second hybrid model and the third hybrid model used in this study, GA-RF [22], share a similar structural framework with the previous ACO-RF hybrid model.The framework of the proposed GA-RF is depicted in the Figure 3b, where GA is employed to search for the optimal combination of five parameters in the RF model corresponding to the dataset.

GWO-Based RF Model (GWO-RF)
In addition to the ACO algorithm and GA algorithm, this study also employed another optimization algorithm, grey wolf optimization (GWO), to optimize the RF model.
The GWO algorithm [33] simulates the leadership hierarchy and hunting strategies of grey wolves in nature.This algorithm divides the wolves in the population into different ranks, to simulate a leadership hierarchy.This hierarchy is achieved through a hunting process, consisting of three main steps: searching for prey, surrounding the prey, and attacking the prey.In this simulation, each grey wolf is considered a candidate solution, and they collaborate to complete the hunting task.Among them, the grey wolf that successfully completes the hunting task is regarded as the optimal solution, representing the best individual in the current population.This simulation is inspired by the hunting behavior of grey wolf packs in the natural world.By simulating this process, the GWO algorithm can search for the optimal solutions in the search space.
Therefore, the third hybrid model used in this study, GWO-RF [26], shares a similar structural framework with the previous ACO-RF and GA-RF hybrid model.The framework of the proposed GWO-RF is depicted in the Figure 3c, where GWO is employed to search for the optimal combination of five parameters in the RF model corresponding to the dataset.

Parameter Ranges for the Geometric Model of Flange Joints
To create a database for developing an RF model predicting the stress distribution of CHS tubes in flexible high-neck flange joints, a wide range of parameters has been considered, encompassing several key geometric parameters of the flange joints, as shown in Figure 4 and Table 2.It is worth noting that, in fact, these flange parameters are not entirely independent of each other.Due to the constraints imposed by flange shape, conventional design experience, and regulatory provisions on parameters, certain correlations and dependencies exist among these parameters.Consequently, flange parameters must adhere to the following constraints (see Equations ( 1)-( 6)) for acceptance.Among these, Equation ( 1) limits the common diameter-to-thickness ratio of the steel pipe, Equation ( 2) accounts for the discrepancy between actual and standard bolt preloads, Equations ( 3)-( 5) specify reasonable ranges for flange plate thickness and bolt hole positioning dimensions, and Equation ( 6) sets an upper limit on the number of bolts based on the minimum operational space required for applying bolt preload, where P e represents the standard preloading force of the bolt.Specific values of P e can be referenced in standards such as GB50017-2017 [34] and VDI2230 [35].(2)   1)-( 6).It is important to note that, for the fundamental parameters such as the tube diameter t tube and bolt diameter d bolt , which are determined by designers based on requirements, they follow a uniform distribution.However, other parameters like F a , F b , F c , Pretension, and D tube are obtained by multiplying uniform distributions of fundamental parameters with uniform distributions of multiplier coefficients, as evident from the aforementioned constraints (i.e., Equations ( 1)-( 5)).This results in a deviation from a uniform distribution, as shown in Figure 5.As for the non-uniform distribution of parameters R 1 and Load, it arises from the amalgamation of databases with different parameter ranges.However, it is crucial to emphasize that, upon examination, these non-uniformities will not affect the accuracy of the subsequently trained ML model across the entire range.Under the distribution depicted in Figure 5, each parameter was randomly selected, resulting in a total of 5417 sets of parameter combinations.These combinations were then used for subsequent computational analysis and formed the database for training the hybrid ML model.1)-( 5)).This results in a deviation from a uniform distribution, as shown in Figure 5.As for the non-uniform distribution of parameters 1 R and Load, it arises from the amalgamation of databases with different parameter ranges.However, it is crucial to emphasize that, upon examination, these non-uniformities will not affect the accuracy of the subsequently trained ML model across the entire range.Under the distribution depicted in Figure 5, each parameter was randomly selected, resulting in a total of 5417 sets of parameter combinations.These combinations were then used for subsequent computational analysis and formed the database for training the hybrid ML model.

Numerical Analysis and Data Processing
In this study, ABAQUS 2020 software was utilized to construct the database for machine learning.A Python compiler was employed to invoke ABAQUS for modeling, analyzing, and extracting the stress results of flange joints under various combinations of parameters, as shown in Figure 6.The combinations of flange parameters were randomly selected within the ranges specified in Table 2 by the program.In the numerical model, linear elastic elements were used to simulate the flange joints.Bolts were simulated using beam elements, and multi-point constraints (MPC) were employed to establish the coupling between beam elements and multiple nodes, achieving the coupling between bolts and flanges' plates.A second-order reduced integration element (ABAQUS name: C3D8R) was used for the flanges, and a two-node linear beam element (ABAQUS name: B31) was used for the bolts, and the contact relationship between the flange plates was established with normal hard contact, and the tangential friction coefficient was defined as 0.3.

Numerical Analysis and Data Processing
In this study, ABAQUS 2020 software was utilized to construct the database for machine learning.A Python compiler was employed to invoke ABAQUS for modeling, analyzing, and extracting the stress results of flange joints under various combinations of parameters, as shown in Figure 6.The combinations of flange parameters were randomly selected within the ranges specified in Table 2 by the program.In the numerical model, linear elastic elements were used to simulate the flange joints.Bolts were simulated using beam elements, and multi-point constraints (MPC) were employed to establish the coupling between beam elements and multiple nodes, achieving the coupling between bolts and flanges' plates.A second-order reduced integration element (ABAQUS name: C3D8R) was used for the flanges, and a two-node linear beam element (ABAQUS name: B31) was used for the bolts, and the contact relationship between the flange plates was established with normal hard contact, and the tangential friction coefficient was defined as 0.3.

Numerical Analysis and Data Processing
In this study, ABAQUS 2020 software was utilized to construct the database for machine learning.A Python compiler was employed to invoke ABAQUS for modeling, analyzing, and extracting the stress results of flange joints under various combinations of parameters, as shown in Figure 6.The combinations of flange parameters were randomly selected within the ranges specified in Table 2 by the program.In the numerical model, linear elastic elements were used to simulate the flange joints.Bolts were simulated using beam elements, and multi-point constraints (MPC) were employed to establish the coupling between beam elements and multiple nodes, achieving the coupling between bolts and flanges' plates.A second-order reduced integration element (ABAQUS name: C3D8R) was used for the flanges, and a two-node linear beam element (ABAQUS name: B31) was used for the bolts, and the contact relationship between the flange plates was established with normal hard contact, and the tangential friction coefficient was defined as 0.  Figure 7 depicts the typical stress distribution on the outer side of the tube wall, and the situation on the inner side of the tube wall is analogous.As seen from the figure, under tensile stress, near the root of the flange, the separation of the two flange plates results in bending tensile stress on the outer side of the tube wall and bending compressive stress on the inner side.However, under the loading and boundary conditions, there is an inflection point on the tube wall at a certain distance from the flange root.This leads to the appearance of bending stresses on the other side of the inflection point, which are in the opposite direction to those near the flange region.Finally, in the non-influenced area of the flange joint, the stress in the tube wall returns to the nominal stress, which is the uniform tensile stress applied to the pipe wall.The stress distribution (in terms of the maximum principal stress) of the CHS tube is extracted, starting from the intersection point of the flange plate and the neck of the high-neck flange, and the total length of extraction is chosen to be one times the diameter of tube, as shown in Figure 8. Convergence of the finite element model is achieved by adjusting the global mesh size.It can be observed that as the mesh becomes denser, the stress distribution on the inner and outer walls of the flange tube eventually converges to the same curve, as shown in Figure 9.When the mesh size reaches D tube /25, the stress calculation results have converged.Therefore, for subsequent parameter analysis, D tube /25 is chosen as the mesh size.In addition, the computed results obtained by this modeling method have been compared with the experimental data from reference [36], as shown in Figure 10.It can be seen that the simulation method in this paper has been validated by the experimental data.on the inner side.However, under the loading and boundary conditions, there is an inflection point on the tube wall at a certain distance from the flange root.This leads to the appearance of bending stresses on the other side of the inflection point, which are in the opposite direction to those near the flange region.Finally, in the non-influenced area of the flange joint, the stress in the tube wall returns to the nominal stress, which is the uniform tensile stress applied to the pipe wall.The stress distribution (in terms of the maximum principal stress) of the CHS tube is extracted, starting from the intersection point of the flange plate and the neck of the high-neck flange, and the total length of extraction is chosen to be one times the diameter of tube, as shown in Figure 8. Convergence of the finite element model is achieved by adjusting the global mesh size.It can be observed that as the mesh becomes denser, the stress distribution on the inner and outer walls of the flange tube eventually converges to the same curve, as shown in Figure 9.When the mesh size reaches Dtube/25, the stress calculation results have converged.Therefore, for subsequent parameter analysis, Dtube/25 is chosen as the mesh size.In addition, the computed results obtained by this modeling method have been compared with the experimental data from reference [36], as shown in Figure 10.It can be seen that the simulation method in this paper has been validated by the experimental data.on the inner side.However, under the loading and boundary conditions, there is an inflection point on the tube wall at a certain distance from the flange root.This leads to the appearance of bending stresses on the other side of the inflection point, which are in the opposite direction to those near the flange region.Finally, in the non-influenced area of the flange joint, the stress in the tube wall returns to the nominal stress, which is the uniform tensile stress applied to the pipe wall.The stress distribution (in terms of the maximum principal stress) of the CHS tube is extracted, starting from the intersection point of the flange plate and the neck of the high-neck flange, and the total length of extraction is chosen to be one times the diameter of tube, as shown in Figure 8. Convergence of the finite element model is achieved by adjusting the global mesh size.It can be observed that as the mesh becomes denser, the stress distribution on the inner and outer walls of the flange tube eventually converges to the same curve, as shown in Figure 9.When the mesh size reaches Dtube/25, the stress calculation results have converged.Therefore, for subsequent parameter analysis, Dtube/25 is chosen as the mesh size.In addition, the computed results obtained by this modeling method have been compared with the experimental data from reference [36], as shown in Figure 10.It can be seen that the simulation method in this paper has been validated by the experimental data.

The Performance of the Three Hybrid Models
The performance of the RF hybrid model largely depends on the selected database input variables.In this study, a total of 10 input variables were chosen based on flange , and ex- ternal load size (Load).The output variables represent the stress distribution on the CHS tube wall of a flange joint under different geometric and load parameters, as shown in Figures 8 and 9.The first principal stress at 25 nodes on the inner and outer walls is output as the output variables, totaling 50 stress points for output.To compare the performance improvement of the RF model with ACO, GA, and GWO optimization techniques, the unoptimized RF model was included in the comparison.As shown in Table 3, the values of the RF hyperparameters before and after optimization are provided.It can be observed that different optimization algorithms exhibit significant differences in optimal hyperparameters.

The Performance of the Three Hybrid Models
The performance of the RF hybrid model largely depends on the selected database input variables.In this study, a total of 10 input variables were chosen based on flange , and ex- ternal load size (Load).The output variables represent the stress distribution on the CHS tube wall of a flange joint under different geometric and load parameters, as shown in Figures 8 and 9.The first principal stress at 25 nodes on the inner and outer walls is output as the output variables, totaling 50 stress points for output.To compare the performance improvement of the RF model with ACO, GA, and GWO optimization techniques, the unoptimized RF model was included in the comparison.As shown in Table 3, the values of the RF hyperparameters before and after optimization are provided.It can be observed that different optimization algorithms exhibit significant differences in optimal hyperparameters.

Developing Machine Learning Models 4.1. The Performance of the Three Hybrid Models
The performance of the RF hybrid model largely depends on the selected database input variables.In this study, a total of 10 input variables were chosen based on flange dimensions (t tube , D tube , F c , F a , F b , R 1 ), bolt parameters (d bolt , n bolt , Pretension), and external load size (Load).The output variables represent the stress distribution on the CHS tube wall of a flange joint under different geometric and load parameters, as shown in Figures 8 and 9.The first principal stress at 25 nodes on the inner and outer walls is output as the output variables, totaling 50 stress points for output.To compare the performance improvement of the RF model with ACO, GA, and GWO optimization techniques, the unoptimized RF model was included in the comparison.As shown in Table 3, the values of the RF hyperparameters before and after optimization are provided.It can be observed that different optimization algorithms exhibit significant differences in optimal hyperparameters.The open-source Python package Scikit-Learn 1.0.2[37] was utilized to develop RF models, as well as the other three hybrid models.The entire dataset was divided into training and testing sets.A total of 3791 data points (70% of the entire dataset) were used for building predictive models, while 1626 data points (30% of the entire dataset) were employed to evaluate the predictive model performance.It is worth noting that the training and testing datasets were generated randomly using the "train_test_split" function from Scikit-Learn 1.0.2 to prevent dataset bias.To assess the performance of the four RF models mentioned above, four common metrics were selected and applied: the coefficient of determination (R 2 ), root mean square error (RMSE), Pearson correlation coefficient (r), and mean relative error (MRE).
In the above, y i and ŷi are respectively the target value and the predicted value of data point i, n represents the number of data, and ŷi represents the mean of the target values.Since this study involves a total of 50 output variables, the mean of these output variables is used here as an evaluation metric for the regression models.Figures 11 and 12 respectively illustrate the relationship between the predicted stress distribution of the flange circular pipe wall by four RF models after random sampling and the stress distribution of the CHS tube by the ABAQUS model.Table 4 summarizes the average values of the four parameters defining the regression performance of the RF models.Furthermore, the average values of the four predictive accuracy parameters of the RF models on the test dataset are presented in Figure 13.The average R 2 mean values of their RF models are also provided.On the training dataset, the R 2 mean values for the RF model, ACO-RF model, GA-RF model, and GWO-RF model are 0.9552, 0.9983, 0.9973, and 0.9887, respectively.On the test dataset, the R 2 mean values are 0.9438, 0.9865, 0.9860, and 0.9812, respectively.It can be observed that the ACO-RF model produces the highest R 2 mean values.Compared to the other RF models, the R 2 mean of the ACO-RF on the test dataset is 4.31%, 0.1%, and 0.96% higher than the RF, GA-RF, and GWO models, respectively.

GWO-RF
The open-source Python package Scikit-Learn 1.0.2[37] was utilized to develop RF models, as well as the other three hybrid models.The entire dataset was divided into training and testing sets.A total of 3791 data points (70% of the entire dataset) were used for building predictive models, while 1626 data points (30% of the entire dataset) were employed to evaluate the predictive model performance.It is worth noting that the training and testing datasets were generated randomly using the "train_test_split" function from Scikit-Learn 1.0.2 to prevent dataset bias.To assess the performance of the four RF models mentioned above, four common metrics were selected and applied: the coefficient of determination ( 2 R ), root mean square error (RMSE), Pearson correlation coefficient ( r ), and mean relative error (MRE).
In the above, i y and ˆi y are respectively the target value and the predicted value of data point i , n represents the number of data, and ˆi y represents the mean of the target values.Since this study involves a total of 50 output variables, the mean of these output variables is used here as an evaluation metric for the regression models.Figure 11 and Figure 12 respectively illustrate the relationship between the predicted stress distribution of the flange circular pipe wall by four RF models after random sampling and the stress distribution of the CHS tube by the ABAQUS model.Table 4 summarizes the average values of the four parameters defining the regression performance of the RF models.Furthermore, the average values of the four predictive accuracy parameters of the RF models on the test dataset are presented in Figure 13.The average

R
of the ACO-RF on the test dataset is 4.31%, 0.1%, and 0.96% higher than the RF, GA-RF, and GWO models, respectively.In conclusion, it can be stated that optimization algorithms can significantly enhance the regression performance of RF models.For the dataset created in this study, the ACO algorithm has demonstrated the most outstanding performance.However, it is important to note that the performance of these three optimization algorithms may vary for different datasets.Therefore, when applying these optimization algorithms, it is essential to consider the specific dataset characteristics and the problem context to choose the most suitable optimization approach.

Feature Importance Analysis
The regression model chosen in this study is built based on the RF model, which is an efficient ensemble machine learning method capable of effectively analyzing input feature variables.The order of feature variable importance obtained from the four RF models is shown in Figure 14.The consistent order of importance, from highest to lowest, is load, n bolt , t tube , F a , F c , Pretension, D tube , d bolt , F b , R 1 .In predicting the target variable, the variable with the highest feature importance is load.In the RF model, its importance level is 0.8355; in the ACO-RF model, it is 0.84534; in the GA-RF model, it is 0.84361; and in the GWO-RF model, it is 0.86465.Such high values indicate that load plays an almost decisive role in the regression process of the model, followed by n bolt .In the regression process of the target variable, the feature variable that has the least impact on the model is R 1 , which is close to zero in all four models, indicating that this variable has minimal influence on predicting the target variable.However, it can be affirmed that although the feature importance coefficient of load is relatively high and has a decisive impact on the stress distribution values, the influence of the other parameters should not be overlooked, as they also have a decisive effect on the shape of the stress distribution.In conclusion, it can be stated that optimization algorithms can significantly enhance the regression performance of RF models.For the dataset created in this study, the ACO algorithm has demonstrated the most outstanding performance.However, it is important to note that the performance of these three optimization algorithms may vary for different datasets.Therefore, when applying these optimization algorithms, it is essential to consider the specific dataset characteristics and the problem context to choose the most suitable optimization approach.

Feature Importance Analysis
The regression model chosen in this study is built based on the RF model, which is an efficient ensemble machine learning method capable of effectively analyzing input feature variables.The order of feature variable importance obtained from the four RF models is shown in Figure 14 R , which is close to zero in all four models, indicating that this variable has minimal influence on predicting the target variable.However, it can be affirmed that although the feature importance coefficient of load is relatively high and has a decisive impact on the stress distribution values, the influence of the other parameters should not be overlooked, as they also have a decisive effect on the shape of the stress distribution.

Application of the Proposed GA-RF Model
The proposed GA-RF hybrid machine learning model can be used to predict the stress distribution of the tube wall of flexible high-neck flange joints under arbitrary geometric and load parameters, and fatigue performance evaluation is one of the important application scenarios of this model.This chapter will take the fatigue damage evaluation of a flexible high-neck flange joint in a lattice-type wind-turbine tower structure as an example to demonstrate the specific process of applying this model.

Application of the Proposed GA-RF Model
The proposed GA-RF hybrid machine learning model can be used to predict the stress distribution of the tube wall of flexible high-neck flange joints under arbitrary geometric and load parameters, and fatigue performance evaluation is one of the important application scenarios of this model.This chapter will take the fatigue damage evaluation of a flexible high-neck flange joint in a lattice-type wind-turbine tower structure as an example to demonstrate the specific process of applying this model.

Engineering Background
A large scale ultra-high wind-turbine tower with a hub height of 180 m and a rated power of 5.5 MW adopts a hybrid-type support structure scheme, with a lattice-type tower at the bottom and a tubular tower at the top.The flexible high-neck flange connection is used in the lattice part, as shown in Figure 15.

Application of the Proposed GA-RF Model
The proposed GA-RF hybrid machine learning model can be used to predict the stress distribution of the tube wall of flexible high-neck flange joints under arbitrary geometric and load parameters, and fatigue performance evaluation is one of the important application scenarios of this model.This chapter will take the fatigue damage evaluation of a flexible high-neck flange joint in a lattice-type wind-turbine tower structure as an example to demonstrate the specific process of applying this model.

Engineering Background
A large scale ultra-high wind-turbine tower with a hub height of 180 m and a rated power of 5.5 MW adopts a hybrid-type support structure scheme, with a lattice-type tower at the bottom and a tubular tower at the top.The flexible high-neck flange connection is used in the lattice part, as shown in Figure 15.The fatigue load of the wind-turbine tower during its 20-year service life is obtained under the provisions of the IEC code [38] through wind-turbine integrated load simulation technology.The resulting fatigue load is subsequently converted into the internal forces acting on the tube members at a specific location of the flexible high-neck flange joint through structural calculations.These forces are presented in the form of a rainflow matrix, as shown in the Supplementary Materials of this paper.The rainflow matrix of fatigue load records the range, mean value, and corresponding cycle number of the axial force of the tube member that the flexible high-neck flange joint endures during its service life.

Fatigue Assessment Process
Taking the "effective notch stress-fatigue life" evaluation method recommended by IIW [4] as an example, the fatigue damage of the joint during the 20-year service period is obtained by calculating the notch stress range at the weld toe of the weld seam.The stress of the flexible high-neck flange joint weld under loadings is mainly determined by two parts: the macroscopic shape of the flange joint and the nonlinearity under tensile load, which can be predicted by the hybrid machine learning model proposed in this study, and the local shape of weld seam on neck, which can be calculated using the existing SCF parametric formula, as shown in Figure 16 below.
Therefore, for the fatigue damage calculation of the joint under a certain load cycle in this example, the following flowchart shown in Figure 17 can be used.Similarly, by repeating this process and superimposing the damage values under all load cycles based on Miner's rule [38], the total damage value of the joint can be obtained.
IIW [4] as an example, the fatigue damage of the joint during the 20-year service period is obtained by calculating the notch stress range at the weld toe of the weld seam.The stress of the flexible high-neck flange joint weld under loadings is mainly determined by two parts: the macroscopic shape of the flange joint and the nonlinearity under tensile load, which can be predicted by the hybrid machine learning model proposed in this study, and the local shape of weld seam on neck, which can be calculated using the existing SCF parametric formula, as shown in Figure 16 below.the local shape of weld seam on neck, which can be calculated using the existing SCF parametric formula, as shown in Figure 16 below.

Damage Calculation of Flange Joint
The rainflow matrix of fatigue load of a flexible high-neck flange joint of the above wind-turbine tower is shown in the Supplementary Materials of this paper.The parameters of the flange and weld seam are shown in Tables 5 and 6, respectively.Through the GA-RF model proposed in this study and the parameter formula for SCF of butt welds proposed by the author in [6], the damage under each set of mean load and load range in the rainflow matrix can be calculated with the recommended S-N curve for the effective notch stress approach (i.e., FAT225 recommended by IIW [4]), as shown in Figure 18.Finally, the total damage is calculated by summing up the damages of all sets according to Miner's rule [39], resulting in a value of 0.15.notch stress approach (i.e., FAT225 recommended by IIW [4]), as shown in Figure 18.Finally, the total damage is calculated by summing up the damages of all sets according to Miner's rule [39], resulting in a value of 0.15.

Conclusions
This study developed an automated calculation program using ABAQUS 2020 software to perform stress analysis of flexible high-neck flange joints under various geometric parameter combinations.The accuracy of the finite element model was validated through experimental data collected in the literature.Subsequently, a dataset comprising over 4500 entries within reasonable parameter ranges was generated for training three hybrid models: ACO-RF, GA-RF, and GWO-RF.This enabled the prediction of stress distribution in the CHS tube of flexible high-neck flange joints.
Through training, it was determined that the hybrid models outperform the original RF model in terms of accuracy, while also eliminating the cumbersome process of

Conclusions
This study developed an automated calculation program using ABAQUS 2020 software to perform stress analysis of flexible high-neck flange joints under various geometric parameter combinations.The accuracy of the finite element model was validated through experimental data collected in the literature.Subsequently, a dataset comprising over 4500 entries within reasonable parameter ranges was generated for training three hybrid models: ACO-RF, GA-RF, and GWO-RF.This enabled the prediction of stress distribution in the CHS tube of flexible high-neck flange joints.
Through training, it was determined that the hybrid models outperform the original RF model in terms of accuracy, while also eliminating the cumbersome process of manually adjusting hyperparameters.In terms of predictive performance, the ANT-RF model demonstrated the best performance.As a result, this paper developed an open-source GUI program based on the ANT-RF model for use by engineers and researchers to predict the stress distribution in the CHS tube of flexible high-neck flange joints, as shown in Figure 19 (Download: https://github.com/Qwduang/ANT-RFmodel.git, accessed on 20 September 2023).
The following conclusions can be drawn: 1.
Under tensile stress, the flexible high-neck flange exhibits significant nonlinear mechanical behavior, resulting in bending tensile stress on the outer surface of the CHS tube.Near the flange, high stress peaks can be detected.As the distance from the flange root increases, the stress level rapidly decreases to below the nominal stress, and then returns to the nominal stress level.On the inner surface of the tube, bending compressive stress is formed under tensile stress, resulting in an opposite trend compared to the outer surface.

2.
Feature importance analysis reveals that the most influential parameter on the stress distribution in the tube of flange joint is the magnitude of the tensile force.In addition to this, the number of bolts, wall thickness of the tube, and the positioning dimensions of the bolt holes follow in terms of their respective levels of impact, each playing a non-negligible role in the stress distribution.

3.
For the specific case discussed in this study, the performance of the ACO-RF model stands out as the most outstanding among the three hybrid models with an average coefficient of determination (R 2 mean ) of 0.9865 on the test dataset.The following conclusions can be drawn: 1.Under tensile stress, the flexible high-neck flange exhibits significant nonlinear mechanical behavior, resulting in bending tensile stress on the outer surface of the CHS tube.Near the flange, high stress peaks can be detected.As the distance from the flange root increases, the stress level rapidly decreases to below the nominal stress, and then returns to the nominal stress level.On the inner surface of the tube, bending compressive stress is formed under tensile stress, resulting in an opposite trend compared to the outer surface.2. Feature importance analysis reveals that the most influential parameter on the stress distribution in the tube of flange joint is the magnitude of the tensile force.In addition to this, the number of bolts, wall thickness of the tube, and the positioning dimensions of the bolt holes follow in terms of their respective levels of impact, each playing a non-negligible role in the stress distribution.3.For the specific case discussed in this study, the performance of the ACO-RF model stands out as the most outstanding among the three hybrid models with an average coefficient of determination (  The tool developed in this study can compute the stress distribution on the CHS tube of the flexible flange joint under different parameters and loads.It assists in determining the optimal weld seam location and SCF.It is worth mentioning that the computed results of stress distribution can be normalized into the form of an SCF, characterizing the stress concentration caused by the macro-geometry of the flange and nonlinear mechanical behavior.This SCF, in conjunction with the SCF attributed to weld seam shape and potential misalignments, collectively form the comprehensive SCF for the weld in the flexible flange joint.This comprehensive SCF assessment provides a crucial foundation for conducting fatigue evaluations, employing local stress-life approaches such as the hot-spot stress method, effective notch stress approach, and similar techniques.In the future, building upon the research presented in this paper, further exploration will be conducted on the impact of various flange parameters on bolt stresses.This will lead to the development of a comprehensive predictive model for stress distribution in flexible high-neck flange joints, encompassing both weld seams and bolts.Subsequently, this model will serve as the foundation for assessing the competitive mechanisms of fatigue failure between weld seams and bolts in this joint configuration, as well as for conducting fatigue reliability evaluations at the joint level.

Figure 1 .
Figure 1.The stress distribution in the tube wall for flexible high-neck flange joint.

Figure 5
Figure 5 displays the frequency distribution of each parameter under the constraints of the aforementioned Equations (1)-(6).It is important to note that, for the fundamental parameters such as the tube diameter t tube and bolt diameter d bolt , which are determined by designers based on requirements, they follow a uniform distribution.However, other parameters like F a , F b , F c , Pretension, and D tube are obtained by multiplying uniform distributions of fundamental parameters with uniform distributions of multiplier coefficients, as evident from the aforementioned constraints (i.e., Equations (1)-(5)).This results in a deviation from a uniform distribution, as shown in Figure5.As for the non-uniform distribution of parameters R 1 and Load, it arises from the amalgamation of databases with different parameter ranges.However, it is crucial to emphasize that, upon examination, these non-uniformities will not affect the accuracy of the subsequently trained ML model across the entire range.Under the distribution depicted in Figure5, each parameter was randomly selected, resulting in a total of 5417 sets of parameter combinations.These combinations were then used for subsequent computational analysis and formed the database for training the hybrid ML model.

Figure 5
Figure 5 displays the frequency distribution of each parameter under the constraints of the aforementioned Equations (1)-(6).It is important to note that, for the fundamental parameters such as the tube diameter tube t

Figure 5 .
Figure 5. Frequency distribution of flange geometric parameters.(a) Distribution of variable tube D ;

Figure 5 .
Figure 5. Frequency distribution of flange geometric parameters.(a) Distribution of variable D tube ; (b) Distribution of variable t tube ; (c) Distribution of variable d bolt ; (d) Distribution of variable F c ; (e) Distribution of variable F a ; (f) Distribution of variable F b ; (g) Distribution of variable R 1 ; (h) Distribution of variable Load; (i) Distribution of variable Pretension; (j) Distribution of variable n bolt .

Figure 5 .
Figure 5. Frequency distribution of flange geometric parameters.(a) Distribution of variable tube D ;

Figure 6 .
Figure 6.Numerical modeling and meshing strategy.(a) Component partitioning and contact interaction definition.(b) Meshing strategies.

Figure 7 FrequencyFigure 6 .
Figure 7 depicts the typical stress distribution on the outer side of the tube wall, and the situation on the inner side of the tube wall is analogous.As seen from the figure, under tensile stress, near the root of the flange, the separation of the two flange plates results in bending tensile stress on the outer side of the tube wall and bending compressive stress

Figure 7 .
Figure 7. Schematic of stress components in a flange joint tube wall under uniform tensile stress.

Figure 8 .
Figure 8.Typical FE results and stress distribution on a CHS tube.

Figure 7 .
Figure 7. Schematic of stress components in a flange joint tube wall under uniform tensile stress.

Figure 7 .
Figure 7. Schematic of stress components in a flange joint tube wall under uniform tensile stress.

Figure 8 .
Figure 8.Typical FE results and stress distribution on a CHS tube.Figure 8. Typical FE results and stress distribution on a CHS tube.

Figure 8 . 31 Figure 9 .
Figure 8.Typical FE results and stress distribution on a CHS tube.Figure 8. Typical FE results and stress distribution on a CHS tube.

Figure 10 .
Figure 10.Comparison of experimental and finite element results.
D tube /6 Element size = D tube /14 Element size = D tube /25 Element size = D tube /35 inner stress outer stress Stress distribution on CHS tube/MPa Distance from the base of the flange plate/mm

Figure 10 .
Figure 10.Comparison of experimental and finite element results.

Figure 10 .
Figure 10.Comparison of experimental and finite element results.

Figure 13 .
Figure 13.Prediction accuracy on the testing dataset of four RF models.

Figure 15 .
Figure 15.Schematic diagram of lattice-tubular hybrid wind-turbine support structure.The fatigue load of the wind-turbine tower during its 20-year service life is obtained under the provisions of the IEC code [38] through wind-turbine integrated load simulation technology.The resulting fatigue load is subsequently converted into the internal forces acting on the tube members at a specific location of the flexible high-neck flange joint through structural calculations.These forces are presented in the form of a rainflow matrix, as shown in the Supplementary Materials of this paper.The rainflow matrix of fatigue load records the range, mean value, and corresponding cycle number of the axial

Figure 16 .
Figure 16.Stress composition of flexible high neck flange joint under load cycles.Therefore, for the fatigue damage calculation of the joint under a certain load cycle in this example, the following flowchart shown in Figure17can be used.Similarly, by repeating this process and superimposing the damage values under all load cycles based on Miner's rule [38], the total damage value of the joint can be obtained.

Figure 16 .
Figure 16.Stress composition of flexible high neck flange joint under load cycles.

Figure 16 .
Figure 16.Stress composition of flexible high neck flange joint under load cycles.Therefore, for the fatigue damage calculation of the joint under a certain load cycle in this example, the following flowchart shown in Figure17can be used.Similarly, by repeating this process and superimposing the damage values under all load cycles based on Miner's rule [38], the total damage value of the joint can be obtained.

Figure 17 .
Figure 17.Calculation process of fatigue damage under a certain load cycle.

Materials 2023 ,
16, x FOR PEER REVIEW 20 of 31 manually adjusting hyperparameters.In terms of predictive performance, the ANT-RF model demonstrated the best performance.As a result, this paper developed an opensource GUI program based on the ANT-RF model for use by engineers and researchers to predict the stress distribution in the CHS tube of flexible high-neck flange joints, as shown in Figure 19 (Download: https://github.com/Qwduang/ANT-RFmodel.git, accessed on 20 September 2023).

Figure 19 .
Figure 19.GUI for predicting the stress distribution in the CHS tube of high-neck flexible flange joints.

)
of 0.9865 on the test dataset.

Figure 19 .
Figure 19.GUI for predicting the stress distribution in the CHS tube of high-neck flexible flange joints.

Table 1 .
Machine learning models in various application domains.

Table 2 .
Ranges of flange geometric parameters.

Table 2 .
Ranges of flange geometric parameters.

Table 3 .
The optimal parameters of three hybrid models and unoptimized general parameters.

Table 3 .
The optimal parameters of three hybrid models and unoptimized general parameters.

Table 3 .
The optimal parameters of three hybrid models and unoptimized general parameters.
. The consistent order of importance, from highest to lowest, is load, R .In predicting the target variable, the variable with the highest feature importance is load.In the RF model, its importance level is 0.8355; in the ACO-RF model, it is 0.84534; in the GA-RF model, it is 0.84361; and in the GWO-RF model, it is 0.86465.Such high values indicate that load plays an almost decisive role in the regression process of the model, followed by bolt n .In the regression process of the target variable, the feature variable that has the least impact on the model is1

Table 5 .
Parameters of the flange joint.

Table 6 .
Parameters of the weld seam.

Table 5 .
Parameters of the flange joint.
tube t

Table 6 .
Parameters of the weld seam.

Table A1 .
Detailed prediction accuracy of four RF models in the training datasets.

Table A2 .
Detailed prediction accuracy of four RF models in the testing datasets.