A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model

Bai, Yanyongxu; Mao, Haoyu; Sun, Shaoxuan; Suo, Yu

doi:10.3390/pr14091399

Open AccessArticle

A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model

by

Yanyongxu Bai

¹,

Haoyu Mao

²,

Shaoxuan Sun

² and

Yu Suo

^2,*

¹

School of Mechanical Science and Engineering, Northeast Petroleum University, Daqing 163318, China

²

School of Petroleum Engineering, Northeast Petroleum University, Daqing 163318, China

^*

Author to whom correspondence should be addressed.

Processes 2026, 14(9), 1399; https://doi.org/10.3390/pr14091399

Submission received: 12 March 2026 / Revised: 10 April 2026 / Accepted: 21 April 2026 / Published: 27 April 2026

(This article belongs to the Section Chemical Processes and Systems)

Download

Browse Figures

Versions Notes

Abstract

The phenomenon of CO₂ corrosion of downhole tubing is widespread in oil and gas extraction. Currently, there is a lack of applicable prediction methods for the corrosion rates of different metal tubing in the liquid phase CO₂ environment. To address this issue, this paper systematically investigates the anti-corrosion mechanisms and influencing factors of different metal casings and proposes a deep learning model combining convolutional neural networks and long short-term memory networks. Based on laboratory corrosion experimental data, the model extracts spatial features of parameters affecting the corrosion rate through CNN and captures their temporal dependencies through LSTM. This paper builds a pipe corrosion rate prediction model based on the TensorFlow framework and compares the prediction results with those of the traditional D-W empirical model and the SRV machine learning model. The results showed that the CNN-LSTM model maintained high prediction accuracy regardless of high or low chromium content, with R² reaching 0.83 and 0.94 respectively, solving the problem that existing models have difficulty effectively simulating complex corrosion behavior under flowing corrosive media conditions. The model was verified using the remaining wall thickness of the actual application casing in the field, and the accuracy was over 80%. The established prediction method can be extended to predict the corrosion rate of pipes under similar corrosion conditions.

Keywords:

CO₂ corrosion; 13Cr; CNN-LSTM; corrosion rate prediction

1. Introduction

“Carbon capture, Utilization and storage” (CCUS) is one of the key technical paths for China to achieve its “dual carbon” goals. At present, major oil fields are actively carrying out projects such as super critical CO₂ manufacturing (a technique using super critical CO₂ as manufacturing fluid) to stimulate unconventional reservoirs, CO₂ flooding to enhance crude oil recovery (EOR), and CO₂ geological storage to offshore or onshore depleted oil and gas reservoirs. However, CO₂ has acidic characteristics and is prone to corrosion of the tubing string when flowing in the wellbore, thinning the tubing wall thickness, thereby weakening the pressure-bearing capacity of the wellbore and causing seal failure, resulting in reduced production or even production halt, seriously affecting the safety and stable production of the oilfield. If the rate of pipe corrosion can be accurately predicted, it will help to avoid blind and frequent replacement of the pipe string, thereby significantly reducing workover costs and oilfield development expenses [1].

In response to the actual situation of CO₂ and H₂S corrosion in downhole tubing, many scholars at home and abroad have proposed their own corrosion rate prediction models, among which the more widely used prediction models are: de Waard-Milliams model, E.E. Lyle Jr. Models, etc. Generally speaking, the representative corrosion rate prediction models in the international community can be classified into three types [2,3]. The Norsok model is an empirical prediction model of CO₂ corrosion rate based on more than 2400 sets of experimental data obtained by Nass at the Norwegian Energy Research Institute. The model takes into account the effects of temperature, corrosion product film, CO₂ fugacity, pH value and pipe wall shear force on CO₂ corrosion rate and has a range of application (temperature 20 to 150 °C, pH 3.5 to 6.5) that meets the requirements of most oil and gas fields [4]. Because the model takes into account the protective effect of the corrosion product film (FeCO₃), it is more sensitive to changes in the values of the medium it is in than the de Waard model. The Jepson model is an empirical₂ model for predicting the corrosion rate of CO₂. To address this issue, Cao stablished a horizontal segger flow CO₂ corrosion prediction model in 1997, as segger flow could not be directly used with single-phase flow models [5]. The Fraud number Fr is a dimensionless parameter in fluid mechanics that characterizes the relative magnitudes of the inertial and gravitational forces of a fluid. Mechanically, it represents the contrast between the inertial and gravitational forces acting on the water flow. When Fr > 1, the inertial force dominates the water flow, and the water flow is a jet stream; when Fr is less than 1, gravity is dominant and the flow is slow-flowing; and when Fr = 1, gravity and inertial forces act equally, and the water flow is critical.

Existing prediction methods still face some obvious limitations. First, the prediction experience of most current models is mainly based on carbon steel or low-chromium steel, and when they are used to predict the corrosion of high-chromium carbon steel materials, the accuracy often drops significantly, mainly due to the essential differences in corrosion mechanisms between different materials. A film of corrosion products forms on the surface of common carbon steel or anti-corrosion pipes when they work underground. If the film is unstable or the fluid flow rate is high, the corrosion products will be in a dynamic “formation—dissolution” equilibrium, and the corrosion process will be approximately uniform corrosion. If the product film gradually becomes denser, it will effectively isolate the corrosive medium, causing the corrosion to manifest as deceleration corrosion. Existing models have difficulty effectively simulating and predicting this complex corrosion behavior under flowing corrosive medium conditions [6,7].

There is a highly complex nonlinear correlation between the rate of pipe corrosion and its environmental conditions, a characteristic that makes it difficult for traditional mathematical models based on linear assumptions to achieve ideal prediction accuracy [8]. In recent years, with the rapid development of data science and computing power, an increasing number of researchers have begun to attempt to use data-driven machine learning techniques to construct corrosion prediction models, which have demonstrated significantly better performance than traditional methods in practice. Yin used the random forest algorithm to predict the corrosion behavior of buried pipeline steel, which comprehensively analyzed the characteristic importance of soil parameters and the Pearson correlation among various factors, ultimately achieving high prediction accuracy [9]. The Hatami S team [10] combined the simulated annealing algorithm with the least squares support vector machine (LS-SVM) to propose a hybrid optimization model that further improved the accuracy of predictions in CO₂ corrosion environments. Wang systematically expounded on the application status of machine learning in natural environment corrosion prediction, noting that different algorithms have their own applicable scenarios; for example, linear regression, although having fast training speed, has difficulty capturing complex nonlinear relationships [11]. Neural networks have strong adaptive learning and anti-interference capabilities, but their performance depends on a large number of high-quality samples. Methods such as random forests, support vector machines, and grey relational analysis show better robustness and predictive advantages when dealing with small sample data. Luo further confirmed the feasibility and effectiveness of using deep learning techniques to accurately predict the corrosion rate of pipelines by enhancing CNN and LSTM models based on the attention mechanism [12]. Jiang trained the monitoring data of the corrosion probe using a recurrent neural network (RNN), and the corrosion rate prediction model established could accurately predict the corrosion rate of the pipeline with a mean square error of less than 0.008% between the predicted value and the monitoring data [13]. In summary, the new combination of machine learning methods such as KPCA-SOA-KELM, gray-scale combination model, and APSO-BPNN is widely used in corrosion rate prediction [14,15,16]. But the accuracy in predicting the corrosion rate of corrosion-resistant metal pipes is not high. To address the challenge of scarce and imbalanced corrosion data samples, hybrid models have also been developed by combining feature selection and intelligent optimization algorithms to capture nonlinear relationships in the data and improve prediction accuracy.

This paper selects the CCN-LSTM deep neural network learning method for predicting the corrosion rate of metal pipes. This paper first conducts multi-factor corrosion chamber experiments to obtain experimental data and, at the same time, compares the accuracy of other empirical models with machine learning models and establishes the current corrosion rate prediction method for pipes. Finally, the model was validated based on the actual corrosion rate of the metal pipe applied in the field.

2. Analysis of the Mechanism of CO₂ Corrosion of Casing

CO₂ corrosion is one of the main factors causing casing failure in oil and gas field development, essentially an electrochemical process involving carbonic acid formed by dissolving CO₂. When CO₂ dissolves in water, it follows Henry’s law to form carbonic acid (H₂CO₃), which partially ionizes as a weak acid into H⁺, HCO₃⁻, and CO₃²⁻:

\begin{array}{l} H_{2} C O_{3} ⇌ H^{+} + {HCO}_{3}^{-} \\ {HCO}_{3}^{-} ⇌ H^{+} + C O_{3}^{2 -} \\ {HCO}_{3}^{-} ⇌ H^{+} + C O_{3}^{2 -} \end{array}

Unlike strong acid corrosion, carbonic acid not only provides H⁺ to participate in the cathodic reaction, but its unionized molecules can also directly participate in the reduction reaction, which is an important reason why the corrosion rate of CO₂ is often higher than expected entirely by pH:

2 H_{2} C O_{3} + 2 e^{-} \to H_{2} + 2 {HCO}_{3}^{-}

The anodic reaction is the dissolution of iron:

Fe \to F e^{2 +} + 2 e^{-}

As Fe²⁺ accumulates on the surface of the steel, when the concentration ion product of Fe²⁺ with CO₃²⁻ exceeds the solubility product, the corrosion product FeCO₃ precipitates

F e^{2 +} + C O_{3}^{2 -} \to {FeCO}_{3}^{-}

The formation of the FeCO₃ film has a decisive influence on subsequent corrosion behavior: a dense and complete FeCO₃ film can act as a diffusion barrier, hindering the transmission of corrosive species to the steel substrate, thereby significantly reducing the corrosion rate. In contrast, loose and porous film layers do not provide effective protection and may even induce severe pitting due to local damage. The protective performance of the corrosion product film depends on its microstructure, thickness, compactness and bonding strength to the substrate, which are coupled by multiple factors such as temperature, CO₂ partial pressure, medium flow rate, pH value, and ionic composition.

Researchers have extensively studied the corrosion behavior of carbon steel in CO₂-containing environments. Cr-containing steel forms a Cr-rich corrosion layer that is denser than FeCO₃, which hinders corrosive media migration and improves corrosion resistance [17]. As Cr content increases, the outer corrosion product layer thickens while the inner layer thins [18,19]. This inner layer thickness trend closely follows the change in corrosion rate. The higher Cr content in the product film enhances its compactness and resistance to ion penetration, limiting corrosive media access to the reaction interface and resulting in a thinner inner film. The corrosion product formation process for Cr-containing steel in CO₂ environment can be expressed as

x Cr + y Fe + z C O_{3}^{2 -} + \frac{3 x}{2} O_{2} \to F e_{y} C r_{x} {(C O_{3})}_{z} O_{\frac{3 x}{2}}

Studies have shown that, if the corrosion product film is not dense, Cl⁻ may accumulate at the interface between the corrosion product film and the metal. Due to the passivation effect of Cl⁻, the interface remains active, leading to intensified local corrosion of the metal. Where Cl⁻ is enriched, pitting spreads faster. Cl⁻ ions accumulate at the interface, accelerating pit expansion. And carbon is also enriched at the interface. Due to the significant difference in corrosion potential between carbon and iron, it causes micro-area, inhomogeneous galvanic corrosion and promotes the intensification of corrosion. The pitting autocatalytic process involving Cl⁻ can be described with the following reaction:

{FeCl}_{2} {+ 2 H}_{2} O \to {Fe (OH)}_{2} {+ 2 H}^{+} {+ 2 Cl}^{-}

Geng found, through laboratory experiments, that 13Cr steel shows obvious passivation behavior in the produced fluid of oil fields containing CO₂ [20]. Chen Bin used a self-developed corrosion apparatus to simulate the influence of factors such as temperature, water content, CO₂ partial pressure, and Cr content and found that the most important factors affecting the corrosion rate were temperature and water content, followed by CO₂ partial pressure [21]. In the simulation of actual high-temperature and high-pressure corrosion experiments, temperature was the main factor affecting the corrosion of 13Cr steel, while flow rate, CO₂ partial pressure, and Cl⁻ concentration had less effect on its corrosion [22]. Within a certain temperature range, the corrosion rate of 13Cr steel first increases and then decreases as the temperature rises. The corrosion of 13Cr steel is most severe when the Cl⁻ concentration is 50 g/L [23]. When the temperature reaches 120 °C, the corrosion rate is at its peak and then gradually decreases. Zhu Yu, in his study of the corrosion behavior of 13Cr in geothermal environments, pointed out that the main reason for the increase in corrosion rate with temperature is the increase in corrosion potential, the reduction in polarization resistance, and the decrease in the protective properties of the corrosion film [24,25,26]. With the increase in partial pressure and Cl⁻ concentration, the corrosion rate increases, and the effect of temperature on corrosion shows a pattern of first increasing and then decreasing [27]. Many scholars believe that the corrosion rate of common steel reaches its maximum when the temperature reaches the range of 60–80 °C, while for high Cr content steel, the corrosion rate does not reach its maximum until the temperature reaches 120 °C [28]. The effect of partial pressure of CO₂ on corrosion rate follows an empirical relationship [29,30].

In summary, the corrosion pattern of metal tubing in a CO₂ environment is mainly related to conditions such as temperature, CO₂ partial pressure, formation water mineralization, and flow rate. However, the influence of each factor on the corrosion rate of tubing is not monotonically increasing or decreasing, but mainly depends on the nature of the corrosion film. However, the formation conditions of the corrosion film are influenced by multiple factors. Therefore, corrosion experiments under the influence of multiple factors should be conducted to determine the importance of each influencing factor within a fixed range and provide sufficient training data for the machine learning method.

3. Materials and Methods

3.1. Laboratory Corrosion Experiments

All the corrosion coupons in this experiment were required to be polished and ground to remove the oxide film on the surface before the experiment began, as shown in Figure 1. Based on the usage of casing in the Bohai Oilfield, a field high-temperature and high-pressure corrosion experiment was simulated to study the corrosion behavior of different casing materials. The main materials used were: P110, P110-3Cr, P110-9Cr, and P110-13Cr. All hangers were produced by Shanghai Luosong Electromechanical Equipment Co., Ltd. (Shanghai, China). The size of the corrosion hangers used was 50 mm × 10 mm × 2 mm. Table 1 shows the composition of different steels.

3.2. Experimental Conditions and Procedures

According to the previous analysis, there are many factors affecting the corrosion rate of the metal casing. To conduct a multi-factor analysis, this study selected five influencing factors—metal material, temperature, molar CO₂ content, Cl⁻ concentration, and flow rate—and designed dynamic corrosion experiments under 15 corrosion conditions using the orthogonal experiment method with five factors at three levels. To clarify the relationship between pipe corrosion rate and corrosion time, three time points were taken for each experimental condition to measure the corrosion rate, and triplicate samples were used for each group of experiments to eliminate experimental errors. The specific corrosion conditions are shown in Table 2.

In addition to Cl⁻, the specific formation water ion composition is shown in Table 3.

This experiment was conducted in a high-temperature and high-pressure CO₂ dynamic corrosion experiment apparatus (see Figure 2 for the schematic diagram of the experiment process). The weighed hangers were put into the corrosion instrument, a pressure gauge was connected to the reactor, and the pressure value was recorded inside the reactor at this time. The heating button was turned on until the temperature was heated to the preset corrosion temperature, then the gas pressurization system was turned on. Then the valves, gas tank valves, etc., were connected. Next, CO₂ was introduced and continued for 5 min to expel the air inside the reactor, followed by closing the outlet valve continuing to introduce CO₂ to the preset corrosion CO₂ partial pressure. Then N₂ was introduced to the preset total pressure inside the reactor, the flow rate controller was set and turned on inside the reactor, and corrosion tests were performed for 3 days, 7 days, and 15 days at preset times. After reaching the predetermined corrosion time, the flow rate controller was turned off inside the reactor, then the heating button was turned off on the reactor, followed by waiting for the temperature to drop to room temperature, then depressurizing the reactor, opening the reactor, removing the container in the reactor that held the hangers, taking out the hangers with tweezers, and drying the remaining moisture on the surface of the hangers with absorbent paper and putting each piece into the experimental preservation bag.

The corrosion rate was calculated using the weight loss method with reference to “Corrosion of metals and Alloys—Removal of Corrosion Products from Corrosion specimens” (GB/T 16545-2015), and the specific steps were: (1) Measure the mass of the hanging plate before corrosion; (2) Conduct the corrosion test under preset corrosion conditions; (3) Remove the hangers and remove the corrosion products from their surfaces; (4) Measure the mass of the hangers after corrosion; (5) Calculate the corrosion rate using the following formula.

V_{c o r r} = \frac{365 \times 10^{5} ∆ G}{ρ S t}

In the formula, V_corr is the average corrosion rate, mm/a; ΔG is the weight loss of the sample, g; ρ is the density of the material, g/cm³, S is the contact area, mm², t is the test time, d.

3.3. Corrosion Rate Test Results

The corrosion rate calculation results corresponding to the experimental protocol are shown in Table 4. All four influencing factors have varying degrees of influence on the corrosion rate of the pipe. Further processing of the experimental results is required in order to obtain the weight order of each influencing factor.

The results of the experiment were evaluated by range analysis, that is, by adding up the results under each of the three level conditions contained in each influencing factor to obtain the result K at each level and comparing the maximum K value among the three level conditions under the same factor to determine the optimal level condition.

K_{i}^{X} = \sum X_{i}

In the formula, the X factor represents the type of metal, temperature, molar content of CO₂, concentration of Cl⁻ or flow rate; i (horizontal condition) = 1, 2, or 3.

The range R is the difference between the maximum K value and the minimum K value under the same factor. R can reflect the impact of different levels of variation under that factor on the final result and the order of the factors.

R^{X} = {K_{i}^{X}}_{\max} - {K_{i}^{X}}_{\min}

The range analysis results obtained from the above process are shown in Table 5.

The range R test results show that the metal category has the most significant effect on the corrosion rate of the pipe, and the higher the Cr content of the steel (9Cr, 13Cr), the corrosion rate is significantly lower than that of common carbon steel (P110, N80) and low Cr steel (3Cr). This is mainly due to the dense Cr oxide layer or Cr-containing corrosion product film formed during the corrosion of Cr-containing steel, which effectively hinders the migration of corrosive media to the metal substrate. Temperature is the second most important factor in the corrosion rate, with a peak in the 60–100 °C range. This is because higher temperatures accelerate the electrochemical reaction kinetics, but higher temperatures instead promote the formation of a dense FeCO₃ film, thereby reducing the corrosion rate. The molar content of CO₂ and the concentration of Cl⁻ have similar effects on the corrosion rate, ranking third in importance. An increase in the molar content of CO₂ leads to an increase in the ion concentration of CO₃²⁻, a decrease in the pH of the corrosive medium, and an accelerated corrosion reaction; Cl⁻ accumulates at the interface between the corrosion product film and the metal, and due to the significant difference in corrosion potential between carbon and iron, it causes micro-area inhomogeneous galvanic corrosion and promotes the intensification of corrosion. As a result, even when the molar content of CO₂ is low, if there is sufficient Cl⁻, the corrosion rate of the pipe can reach a level comparable to that under high CO₂ corrosion conditions. The flow rate has a small effect on the corrosion rate of the tubing and is not significant in the measured range of 2–7 m/s.

4. CNN-LSTM Neural Network Prediction Model

A total of 100 valid experimental data points were collected. For a deep learning model of this complexity, this sample size is relatively limited. However, the following strategies were implemented to mitigate overfitting risks: (1) lightweight architecture design with a moderate number of trainable parameters (approximately 150,000 parameters); (2) dropout layers (rate = 0.3) after each convolutional and LSTM layer; (3) early stopping (patience = 50 epochs based on validation loss); (4) L2 regularization (kernel regularizer = 0.001); and (5) k-fold cross-validation (k = 5) to evaluate model robustness. These measures collectively ensure that the model generalizes beyond the training data. Given the inherent challenges of acquiring large-scale corrosion experimental data under high-temperature and high-pressure conditions, the current dataset represents a reasonable foundation for a preliminary yet rigorous evaluation of the proposed deep learning approach. The results should be interpreted as a proof-of-concept demonstration of the CNN-LSTM architecture for corrosion rate prediction under limited data conditions.

4.1. Convolutional Neural Network Model (CNN)

A convolutional neural network (CNN) is a type of feedforward neural network specifically designed to handle data with a grid-like topological structure and has become one of the core models in deep learning due to its outstanding performance in areas such as image recognition, object detection, and signal processing. The core idea of a CNN is to effectively extract the spatially hierarchical features of the input data through local connections, weight sharing, and pooling operations (Figure 3). Its basic structure is typically composed of convolutional layers, activation layers, pooling layers, and fully connected layers stacked together. Extract local patterns of input features through one-dimensional convolution kernels (Conv1D) to capture the synergy between multiple factors. Batch normalization and MaxPooling1D are followed by convolutional layers to enhance the stability and generalization ability of the model.

Its convolution operation can be expressed as

x_{j}^{l} = f (\sum_{x_{i} \in M_{l - 1}} x_{i}^{l - 1} * k_{i j}^{l} + b_{j}^{l})

In the formula, ∗ represents the convolution operation,

x_{i}^{l - 1}

is the j-th output feature map of the l-th convolutional layer,

M_{l - 1}

is the set of feature maps of the l−1 layer,

x_{i}^{l - 1}

is the i-th output feature map of the l−1 layer convolution,

k_{i j}^{l}

is the convolution kernel used in the l-layer convolution operation,

b_{j}^{l}

is the bias of thej-th feature map;

f (\cdot)

is the activation function; ReLU is the activation function used in this paper.

4.2. Long Short-Term Memory Network (LSTM)

Long short-term memory (LSTM) is a particular recurrent neural network (RNN). It aims to address the vanishing and exploding gradients that traditional RNNS face when dealing with long sequence data. Compared with standard RNNS, LSTMS can effectively capture long-range dependencies in time series by introducing sophisticated gating mechanisms to achieve fine-grained control over information flow.

The hidden states of traditional RNNS are only passed step by step through simple loop connections. Information can only flow unidirectionally along time steps, making it difficult to retain important information in early time steps. Moreover, gradient attenuation or surge is prone to occur during the backpropagation process. LSTM addresses this problem by designing three types of gate units—forget gate, input gate, and output gate—and by introducing the cell state. Memory units act as the main channel for information transmission, allowing gradients to flow steadily over long time spans. The gated structure is responsible for dynamically regulating the forgetting, updating and output of information, allowing the network to selectively retain historical information or ignore irrelevant distractions, as shown in Figure 4.

The LSTM consists of an input gate, an output gate, and a forgetting gate, and its operation formula can be expressed as

f_{t} = σ (W_{f} \cdot [h_{t - 1}, x_{t}] + b_{f}) i_{t} = σ (W_{i} \cdot [h_{t - 1}, x_{t}] + b_{i}) o_{t} = σ (W_{o} \cdot [h_{t - 1}, x_{t}] + b_{o}) {\hat{C}}_{t} = \tanh (W_{C} \cdot [h_{t - 1}, x_{t}] + b_{C}) C_{t} = f_{t} C_{t - 1} + i_{t} \cdot {\hat{C}}_{t} h_{t} = o_{t} \cdot \tanh (C_{t})

In the formula,

x_{t}

is the input feature at the current moment,

W_{f}

,

W_{i}

,

W_{o}

,

W_{C}

is the parameter to be trained,

b_{f}

,

b_{i}

,

b_{o}

,

b_{C}

is the training bias term, and the output of the hidden layer at the previous moment is represented as.

The training process of the CNN-LSTM model mainly includes the following steps: First, normalize the input features and target variables to eliminate dimensional differences and improve the model convergence speed; the mean square error is used as the loss function to measure the difference between the predicted value and the true value; the optimization algorithm uses the Adam optimizer, which adaptively adjusts the learning rate to accelerate model convergence; early stop and learning rate decay strategies are introduced to effectively prevent model overfitting and improve generalization ability [28]; Finally, the LSTM output is mapped to the target variable through the fully connected layer, and a linear activation function is used to output the final corrosion rate prediction results. The complete operation process of the CNN-LSTM neural network is shown in Figure 5.

Justification for LSTM Application. Although the dataset does not consist of continuous time-series measurements from a single corrosion process, each data point corresponds to a specific corrosion duration (3, 7, or 15 days). The corrosion rate is defined as the average weight loss per unit time, which inherently represents a time-dependent cumulative effect. The input features include corrosion time as an explicit variable, and the LSTM layer is designed to capture the state evolution across different corrosion time nodes, i.e., how the combined effect of material type, temperature, CO₂ partial pressure, Cl⁻ concentration, and flow velocity leads to different corrosion outcomes at different time scales. This can be conceptualized as processing a “time step” dimension where each sample’s corrosion time serves as the temporal axis. Therefore, applying LSTM to this dataset is methodologically justified, as the model learns the sequential dependency of corrosion progression over time.

5. Results and Discussion

A total of 100 valid experimental data points were collected from the laboratory corrosion tests. To evaluate model robustness and avoid a single data partition, 5-fold cross-validation was performed. The entire dataset was randomly partitioned into five mutually exclusive subsets. In each fold, four subsets were used for training and the remaining one for testing, with performance metrics averaged across the five folds. For direct comparison with baseline models (D-W and SRV), a single train-test split (70 training, 30 testing) was also used, following the same evaluation protocol as the baseline methods. In both cases, 15% of the training data was further held out as a validation set for early stopping. The model was trained for up to 10,000 epochs with early stopping patience of 50 epochs based on validation loss.

The cross-validation results confirmed model stability. For P110 low-chromium steel, the 5-fold cross-validation yielded a mean R² of 0.81 (±0.04), with individual fold R² values ranging from 0.78 to 0.86. For P110-13Cr high-chromium steel, the mean R² was 0.92 (±0.03), ranging from 0.89 to 0.95. These results indicate that the single-split results (R² = 0.83 and 0.94, respectively) are representative rather than being a fortuitous outcome of a particular train-test split

Based on the laboratory corrosion experiment, a total of 100 sets of valid experimental data were collected. Seventy of them were used as training set samples and 30 as test set samples for model training and validation. The number of model training iterations was set to 10,000. In terms of the CNN-LSTM network structure design, the first layer had 128 convolutional kernels, and the second layer had 256 convolutional kernels. The LSTM part adopts a three-layer structure, with 256, 128, and 64 neurons in the hidden layers respectively. Finally, the prediction is output through a fully connected layer with 1 neuron, which corresponds to a single-valued prediction of the corrosion rate.

To verify the effectiveness and superiority of the CNN-LSTM model constructed in this paper, it was compared and analyzed, respectively, with the traditional D-W semi-empirical model and the SRV machine learning model commonly used in current engineering. Two typical casing materials were selected as prediction objects; one was the P110 casing with the lowest chromium content, and the other was the P110-13Cr high-chromium casing which is widely used in the field at present. Among them, the prediction results for the corrosion rate of the P110 casing are shown in Figure 6.

From the comparative analysis of the prediction results, it can be seen that, in the corrosion rate prediction task for low-chromium casing, the prediction accuracy of different models shows significant differences. Overall, the CNN-LSTM hybrid neural network model performed the best, with a coefficient of determination R² reaching 0.83, which was the most accurate among all the comparison models. This is mainly due to the CNN-LSTM model’s ability to effectively extract local patterns from input features and capture long-term dependencies on time series, thereby fitting more accurately the nonlinear degradation behavior of low-chromium casing in corrosive environments. In contrast, the traditional D-W semi-empirical model also showed good predictive power, with an R² of 0.78, slightly lower than CNN-LSTM. As a classic empirical model based on corrosion mechanism, the D-W model can maintain high prediction accuracy even with reasonable parameter correction, demonstrating its superiority in modeling physicochemical processes. However, its linearized modeling approach limits its flexibility in dealing with complex, nonlinear conditions, resulting in slightly inferior accuracy compared to deep learning models. Of the three models, the weakest predictor was the traditional SRV empirical model, with an R² value of only 0.72. To analyze the reasons, SRV models rely on a large amount of historical experimental data for parameter fitting and correction, while the current accumulation of experimental data for low-chromium content tubing is relatively limited and less applied, resulting in a serious shortage of datasets required for model training and validation. The lack of data makes it difficult for the model to capture key corrosion characteristics under real conditions, thereby affecting its prediction accuracy and generalization ability. Figure 6d presents the training and validation loss curves for the CNN-LSTM model on the P110 dataset. The training loss decreases rapidly during the first 1000 epochs and gradually stabilizes after approximately 3000 epochs. The validation loss follows a nearly identical trend, reaching a minimum of 0.0085 at around 4500 epochs. After this point, the validation loss remains stable without significant divergence from the training loss. The final validation loss (0.0100 at 10,000 epochs) is within 8% of the final training loss (0.0031), indicating that the model does not suffer from severe overfitting. This is attributable to the lightweight architecture, dropout layers (rate = 0.3), early stopping (patience = 50 epochs based on validation loss), and L2 regularization (kernel regularizer = 0.001) described in Section 4. The early stopping mechanism would have terminated training at approximately epoch 4500, selecting the model with the lowest validation loss. Similar convergence behavior was observed for the P110-13Cr dataset, with the validation loss stabilizing at 0.0045 after approximately 3500 epochs and maintaining a final validation-to-training loss ratio below 10%.

Figure 7 shows the prediction results of the corrosion rate of high-chromium content casing. When the chromium content in the casing material is high, the overall corrosion rate shows a significant downward trend, mainly due to the dense passivation film formed by chromium on the material surface, which effectively blocks the erosion of the corrosive medium. Among the three prediction models, the CNN-LSTM model constructed in this paper still maintains the highest prediction accuracy, with a coefficient of determination R² reaching 0.94, fully demonstrating the strong generalization ability and stability of the model when dealing with corrosion problems of different materials. A note on the D-W model applicability: The de Waard-Milliams (D-W) semi-empirical model was originally developed based on carbon steel corrosion data and is not theoretically applicable to high-chromium steels (such as 9Cr and 13Cr). The model does not account for the formation of Cr-enriched passive films or the transition from uniform to decelerating corrosion. Therefore, the observed decline in D-W model accuracy for high-chromium steels (R² = 0.68 for 13Cr, compared to 0.78 for P110) is expected and does not imply a failure of the model within its intended domain. The comparison is included here to demonstrate that data-driven models (SRV and CNN-LSTM) can naturally adapt to new material systems through learning from experimental data, whereas mechanistic models require explicit re-parameterization for each new material. To provide a fairer baseline for the high-chromium case, two additional machine learning models were implemented on the same dataset: random forest (RF) and XGBoost. For P110-13Cr, RF achieved an R² of 0.82 and XGBoost achieved 0.84, both outperforming the D-W model but still lower than the CNN-LSTM (R² = 0.94). This further supports the superiority of the proposed deep learning approach, which benefits from both spatial feature extraction (CNN) and temporal dependency modeling (LSTM).

Table 6 is the prediction results for 3Cr and 9Cr steels. In addition to P110 and P110-13Cr, corrosion experiments were also conducted on P110-3Cr and P110-9Cr steels. The prediction performance of the CNN-LSTM model on these two intermediate materials is summarized in Table 6. The model achieved R² values of 0.79 for 3Cr and 0.88 for 9Cr, which are between the values for P110 (0.83) and 13Cr (0.94). This trend is consistent with the corrosion resistance ranking: higher Cr content leads to lower corrosion rates and, importantly, more regular corrosion behavior that is easier for the model to learn. The slightly lower R² for 3Cr (0.79) compared to P110 (0.83) may be attributed to the transitional corrosion behavior of low-chromium steels, where the corrosion product film is neither purely carbonaceous nor fully Cr-enriched, resulting in more complex and less predictable corrosion kinetics.

In contrast, the prediction accuracy of the traditional D-W semi-empirical model shows a significant decline when facing high-chromium content tubing, with its R² value being only 0.68. The root cause of this phenomenon lies in the essential differences in corrosion mechanisms among different materials. A film of corrosion products forms on the surface of ordinary carbon steel or anti-corrosion pipes during their service underground. When the stability of this film is poor or the fluid flow rate is high, the corrosion product film is in a dynamic “formation—dissolution” equilibrium state, at which point the corrosion process is approximately uniform corrosion; however, as the corrosion product film gradually becomes denser, especially in materials with high chromium content, this film can effectively prevent the further intrusion of the corrosive medium, transforming the corrosion process into deceleration corrosion. Existing D-W empirical models, based on traditional corrosion mechanisms, are mainly applicable to uniform corrosion conditions and are difficult to effectively simulate and predict this complex corrosion behavior under flowing corrosive media conditions, especially when the dynamic change of the corrosion product film becomes the dominant factor, and the limitations of the model become prominent. The SRV machine learning model showed a significant improvement in the prediction of high chromium content casing, with an R² value of 0.85, which was significantly better than its performance in the prediction of low chromium casing. This improvement indicates that the SRV model, as a data-driven machine learning method, has a certain degree of adaptive ability when facing new material systems by adjusting the prediction results based on the feature distribution in the training data. However, the prediction accuracy is still lower than that of CNN-LSTM models, indicating that pure machine learning models are still not as good as deep learning models that combine temporal feature extraction capabilities when dealing with complex corrosion mechanism transitions.

Combining the predictions in Figure 6 and Figure 7, it can be seen that the CNN-LSTM model shows the best prediction results for both the P110 casing with the lowest chromium content and the P110-13Cr casing with a higher chromium content. This fully demonstrates the model’s strong adaptability and prediction accuracy in dealing with different materials and corrosion mechanisms. The CNN-LSTM model effectively extracts local patterns from input features through convolutional layers and captures long-term dependencies on time series through LSTM layers, which can more comprehensively depict multi-scale features of the corrosion process, thus maintaining high prediction accuracy when facing changes in corrosion mechanisms caused by variations in material composition.

In addition to R², three other evaluation metrics were computed to provide a more comprehensive assessment of model performance: mean absolute error (MAE), root mean square error (RMSE), and mean absolute percentage error (MAPE). Table 7 summarizes these metrics for all four materials. For P110-13Cr, the model achieved an MAE of 0.008 mm/a, an RMSE of 0.011 mm/a, and a MAPE of 6.2%, indicating high prediction accuracy. For P110, the MAE was 0.042 mm/a, RMSE was 0.058 mm/a, and MAPE was 8.5%. These results further confirm the model’s strong predictive capability across different material types.

The CNN-LSTM hybrid neural network model shows good universality and superiority when dealing with the problem of predicting the corrosion rate of different pipe materials, providing a more reliable solution for the corrosion prediction of oil and gas well casing. The model can be further explored for corrosion prediction in more material systems and under complex conditions in the future.

6. Verification of Field Prediction Results of the CNN-LSTM Model

Application of range analysis results to CNN-LSTM model. The range analysis presented in Section 3.3 (Table 5) identified the order of importance of the five influencing factors: material type > temperature > CO₂ partial pressure > Cl⁻ concentration > flow velocity. This finding was incorporated into the CNN-LSTM model in two ways. First, the material type was one-hot encoded as a categorical variable, giving it higher representational capacity than ordinal encoding, which aligns with its dominant role. Second, during feature normalization, no feature weighting was applied; instead, the model was allowed to learn the relative importance automatically through the training process. The fact that the model achieved higher accuracy for high-Cr materials (9Cr and 13Cr) than for low-Cr materials (P110 and 3Cr) is consistent with the range analysis, which showed that material type is the most influential factor. For high-Cr materials, the corrosion behavior is more regular and predictable, leading to better model performance.

A certain gas field at sea has characteristics such as high formation temperature and high CO₂ content. There are certain pipe string corrosion problems under subsequent conditions such as CO₂ injection, storage, and CO backflow during production. The X well in the block replaced the tubing with 13Cr in 2013 and started production in 2023, and there was obvious corrosion on the inner wall of the tubing string. The CNN-LSTM model was used to was used to predict the corrosion rate at two locations in the tubing well (about 1000 m deep) and at the bottom of the well (about 2000 m deep), to calculate the thinning of the wall thickness and to compare it with the actual wellbore measurement results.

During the service life of the well, the temperature range in the well was 66.9 °C to 72.6 °C, and the temperature at the bottom of the well was 84 °C compared to the formation temperature. The molar content of CO₂ in the produced gas ranged from 24.8% to 59.7%; Cl⁻ concentration is approximately 5000 mg/L. Assuming the fluid flow is equal at different positions inside the tube, it is 0.3 to 1.8 m/s. The prediction results are shown in Figure 8a. From the laboratory experiments, it can be known that the 13Cr pipe shows uniform corrosion in the current gas-phase CO₂ corrosion environment, that is, the corrosion rate does not change with time. The cumulative corrosion depth of the pipe string during service is obtained by adding up the corrosion rates over the years.

The geometric dimensions of the pipe string were measured, as shown in Figure 9, by taking 10 sections for each pipe string and measuring the wall thickness of the A-E, B-F, C-G, and D-H sections, respectively. Given that the initial inner diameter of the string is 100.5 mm, the cumulative corrosion wall thickness of the string during service can be obtained from the pre-corrosion and post-corrosion inner diameters. The average corrosion depth in the well is 0.23 mm, and the average corrosion depth at the bottom of the well is 0.35 mm.

The results show (Figure 10) that the average corrosion depth of the pipe material in the well is less than that of the pipe material at the bottom of the well, the predicted value is close to the field test results, and the accuracy rate of the corrosion depth in the well is 92%. There is an increase in the wall thickness of the casing at the bottom of the well, which is presumed to be related to an error in the factory dimensions. Therefore, after excluding the invalid data, the accuracy is slightly reduced to about 80%. These results confirm the reliability of the proposed prediction method and its applicability to other wells in the block or similar corrosion conditions.

In the process of pipe corrosion, in addition to the four influencing factors selected in the laboratory experiments in this paper, there are other influencing factors such as Cr content in the pipe, water content, concentration of various cations in formation water, etc. It is possible to predict the corrosion rate under other multi-factor conditions by conducting the corresponding laboratory corrosion experiments or by obtaining data based on the relevant experimental results in the existing literature and by adding the input variable Xn and the corresponding corrosion rate Y value to the model.

7. Conclusions

In this study, by conducting multi-factor indoor pipe corrosion experiments, corrosion rate data under different material, temperature, CO₂ partial pressure, Cl⁻ concentration, and flow rate conditions were obtained. Combined with range analysis and CNN-LSTM deep learning model, the influence laws and prediction methods of each factor on pipe corrosion rates were systematically studied, and the following conclusions were drawn:

(1): The range analysis results indicated that the metal type had the most significant effect on the corrosion rate of the pipe, and the higher the Cr content of the steel, the lower the corrosion rate. Among them, the corrosion rate of 13Cr steel was only 2–5% of that of P110 steel. The temperature effect ranks second, with the corrosion rate peaking in the range of 60–100 °C. The influence of CO₂ partial pressure is similar to that of Cl⁻ concentration, and flow rate has the weakest effect. The increase in Cr content enhances the compactness of the corrosion product film and is a key factor in reducing the corrosion rate.
(2): The CNN-LSTM hybrid deep learning model established based on the experimental data effectively extracts the spatial features among multiple factors through the convolutional layer and captures the temporal dependence of the corrosion process through the LSTM layer. In the prediction of P110 low-chromium casing, the R² reaches 0.83, and in the prediction of P110-13Cr high-chromium casing, the R² reaches 0.94. The prediction accuracy was significantly better than that of the traditional D-W semi-empirical model (R² 0.78 and 0.68, respectively) and the SRV machine learning model (R² 0.72 and 0.85, respectively).
(3): The accuracy of the D-W semi-empirical model decreased significantly when predicting the corrosion rate of high chromium content casing, with R² dropping from 0.78 to 0.68, because the model was established based on the uniform corrosion mechanism and had difficulty simulating the complex behavior of the corrosion process changing from uniform to deceleration after the formation of a dense passivation film on the surface of high chromium materials. Although the SRV model is data-driven, it is still inadequate when dealing with time-dependent relationships and nonlinear features, and its accuracy is lower than that of the CNN-LSTM model.
(4): Field application verification of the 13Cr tubing in Well X of a gas field at sea shows that the accuracy of the corrosion depth in the well predicted by the CNN-LSTM model to the actual measured value is 92%, and the accuracy at the bottom of the well is about 80%. The predicted values are in good agreement with the field experimental results. The established method for predicting pipe corrosion rate is reliable and can be extended to other wells in the study block or to predict pipe corrosion rate under similar corrosion conditions.

Author Contributions

Methodology, Y.B., H.M. and S.S.; Software, Y.B., S.S. and Y.S.; Validation, Y.B. and S.S.; Resources, H.M.; Data curation, H.M.; Writing—original draft, Y.B.; Writing—review & editing, Y.B. and Y.S.; Visualization, Y.B.; Project administration, Y.S. All authors have read and agreed to the published version of the manuscript.

Funding

The work presented in this paper was financially supported by the First-Class Discipline Collaborative Innovation Program of Heilongjiang Province (Grant No. LJGXCG2024-F02). The Key Research and Development Program of Heilongjiang Province (Grant No. 2024ZX09C01) is also gratefully acknowledged.

Data Availability Statement

All casing corrosion experimental data employed in this paper are derived from specialized physical simulations and laboratory testing conducted internally by our research group. The aforementioned data involve key technical achievements accumulated by our group over many years; furthermore, certain experimental methods and testing protocols constitute proprietary technologies independently developed by our team. Consequently, these materials are subject to specific confidentiality requirements.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Yang, Y.; Zhou, J.; Hu, H.; Guo, C.; Luo, M.; Ma, J.; Guo, F.; Huang, Z. The development status, economic benefits and future prospects of the CCUS-EOR industry. China Min. Mag. 2020, 34, 190–203. [Google Scholar] [CrossRef]
Wang, S.; Yang, T.; Tian, K.; Guo, X.; Shen, C. Coating requirements for underwater manifold based on NORSOK M501. Gen. Corros. Control 2024, 38, 82–87. [Google Scholar] [CrossRef]
De Waard, C.; Milliams, D.E. Carbonic acid corrosion of steel. Corrosion 1975, 31, 177. [Google Scholar] [CrossRef]
Ren, N.; Shang, Y.; Chen, J.; Li, S. Application of Grey Model in Prediction of Corrosion Rate of oil Well Tubing. Hot Work. Technol. 2020, 49, 34–37. [Google Scholar] [CrossRef]
Cao, Y.; Lin, W.; Feng, J.; Jia, W. Note: Prediction of pipe string corrosion rate in CO₂ well. Gen. Corros. Control 2024, 38, 75–80. [Google Scholar] [CrossRef]
Hu, J.; Zhang, X.; Li, Y.; Hao, L.; Zhang, S. Research on Prediction and Analysis of Corrosion Rate at the Top of a Wet Pipeline. Chem. Equip. Technol. 2024, 45, 68–71. [Google Scholar] [CrossRef]
Liao, K.; Jiang, P.; Liu, X.; Chen, L.; Xian, J.; He, G.; Zou, Q. Establishment of a corrosion prediction model for water transmission pipelines in K shale gas fields. Corros. Prot. 2023, 44, 78–84. [Google Scholar] [CrossRef]
Han, J. Prediction of Corrosion Rate of Gathering and Transportation Pipelines in Daniudi Gas Field based on Multiphase Flow simulation. Gen. Corros. Control 2023, 37, 127–130. [Google Scholar] [CrossRef]
Yin, Z.; Wang, S.; Zhu, Z.; Gu, S.; Ma, S.; Du, Y.; Jiang, S. Research on Key Parameters of Soil Corrosiveness and Prediction Model of Corrosion Rate of Q235 Steel in Beijing area. Chin. J. Eng. Sci. 2023, 45, 1939–1947. [Google Scholar] [CrossRef]
Hatami, S.; Ghaderi-Ardakani, A.; Niknejad-Khomami, M.; Karimi-Malekabadi, F.; Rasaei, M.R.; Mohammadi, A.H. On the prediction of CO₂ corrosion in petroleum industry. J. Supercrit. Fluids 2016, 117, 108–112. [Google Scholar] [CrossRef]
Wang, S.; Ma, S.; Che, K.; Du, Y. The application status of machine learning in the field of corrosion assessment and prediction in natural environments. Chin. J. Corros. Prot. 2023, 43, 441–451. [Google Scholar] [CrossRef]
Luo, Z.; Du, D.; Luo, J.; Wang, X. A Study on the Effectiveness of Combined Models for Predicting Pipe Corrosion Rates: CNN and LSTM models Enhanced by Attention Mechanism. J. Saf. Environ. 2024, 24, 4263–4269. [Google Scholar] [CrossRef]
Jiang, H.; Nurxati, N.; Liu, J. Prediction of Corrosion Rate Based on Recurrent Neural Networ. Corros. Prot. 2023, 44, 78–80. [Google Scholar] [CrossRef]
Fan, P.; Zhang, X. Prediction of Corrosion Rate in subsea Oil and Gas Pipelines Based on KPCA-SOA-KELM. Hot Work. Technol. 2025, 52, 64–68+73. [Google Scholar] [CrossRef]
Yang, B.; Xiao, J.; Ouyang, C.; Zhu, J.; Yan, Y. Application of APSO-BPNN Model in Predicting Corrosion Rates of Iron Materials in Coastal Environments. Corros. Prot. 2024, 45, 72–79. [Google Scholar] [CrossRef]
Zhang, Z.; Liu, X.; Li, Z. Prediction of pipe corrosion rate based on grey combination model. Chin. J. Corros. Prot. 2008, 28, 173–176. [Google Scholar]
Zhao, Y.; Li, X.; Zhang, C.; Zhang, T.; Xie, J.; Zeng, G.; Xu, D.; Wang, F. Investigation of the rotation speed on corrosion behavior of HP-13Cr stainless steel in the extremely aggressive oilfield environment by using the rotating cage test. Corros. Sci. 2018, 145, 307–319. [Google Scholar] [CrossRef]
Xiao, R.; Liu, G.; Liu, B.; Wei, W.; Zhuang, Q.; Jin, S. Research on Prediction of Corrosion Rate in Submarine Pipelines Based on PCA-TSO-BPNN Model. Hot Work. Technol. 2025, 54, 82–88. [Google Scholar] [CrossRef]
Macdonald, D.D. The history of the point defect model for the passive state: A brief review of film growth aspects. Electrochim. Acta 2011, 56, 1761–1772. [Google Scholar] [CrossRef]
Geng, Y. Research on Corrosion Adaptability of Cr 13 Steel for CO₂ Flooding Casing. Master’s Thesis, Northeast Petroleum University, Daqing, China, 2017. [Google Scholar]
Chen, B.; Zhang, Q.; Han, Y.; Xu, G.; Chen, Y. Sensitivity study on Influencing Factors of Corrosion Rate of Oil casing. Oil Field Mach. 2019, 48, 53–57. [Google Scholar] [CrossRef]
Dong, X.; Jiang, Y.; Fu, C.; Zhao, G. Effects of temperature on CO₂ corrosion behavior of Cr 13 stainless steel. Corros. Prot. 2012, 33, 196–198. [Google Scholar]
Zhu, Y. Effects of Temperature on Corrosion Behavior of Super 13Cr Stainless Steel in Geothermal Environment. Master’s Thesis, China University of Petroleum (East China), Qingdao, China, 2024. [Google Scholar]
Wu, P.; Shi, X.; Hu, Q.; Ma, X.; Liu, S.; Zhang, T.; Gu, X. Analysis of corrosion rate of 316L stainless steel in CO₂/oil/water environment. Gen. Corros. Control 2022, 36, 106–109+115. [Google Scholar] [CrossRef]
Zhu, P.; Yan, W.; Li, L.; Duan, B.; Shen, X.; Wang, Q. Study on corrosion resistance of tubing steel in simulated superCr itical CO₂. Surf. Technol. 2016, 45, 167–173. [Google Scholar] [CrossRef]
Zhu, D.; Liu, X.; Lin, Y.; Li, Y.; Zeng, D.; Deng, K. Corrosion behavior of high-temperature and high-pressure acidic gas well casing steel in superCr itical CO₂ environment. Mater. Prot. 2017, 50, 79–84. [Google Scholar] [CrossRef]
Zeng, D.; Dong, B.; Shi, S.; Yu, H.; Chen, Y.; Lu, W. Effects of CO₂ partial pressure on corrosion of 3Cr steel in high-temperature steam environment. J. Iron Steel Res. 2018, 30, 548–554. [Google Scholar] [CrossRef]
Han, D. Study on Corrosion Behavior and Applicability of 3Cr -N80 Steel in CO₂ Injection and Production Environment. Master’s Thesis, China University of Petroleum (East China), Qingdao, China, 2019. [Google Scholar]
Wang, B. Research on the Corrosion and Erosion Mechanism of Cr Alloy Steel in Co₂-Containing Wet Gas Pipelines. Master’s Thesis, University of Science and Technology Beijing, Beijing, China, 2019. [Google Scholar]
He, S.; Wang, B.; Feng, H.; Lu, M.X.; Zhou, D.Z.; Xing, X.J. Corrosion behavior and product film characteristics of S13Cr in ultra-high temperature superCr itical CO₂ environment. Equip. Environ. Eng. 2021, 18, 8–14. [Google Scholar] [CrossRef]

Figure 1. Preparation of oil casing specimens. (a) P110 (b) P110-3Cr (c) P110-9Cr (d) P110-13Cr.

Figure 2. Schematic diagram of the experiment process.

Figure 3. Schematic diagram of the CNN prediction process.

Figure 4. LSTM structure diagram.

Figure 5. Flowchart of the CNN-LSTM neural network operation.

Figure 6. Prediction of P110 casing corrosion rate under different prediction models. (a) CNN-LSTM prediction results; (b) D-W model prediction results; (c) predictions of the SRV machine learning prediction model; (d) loss curves of the CNN-LSTM model on P110 data.

Figure 7. Prediction of P110-13Cr casing corrosion rate under different prediction models. (a) CNN-LSTM prediction results; (b) D-W model prediction results; (c) predictions of the SRV machine learning prediction model.

Figure 8. Prediction of pipe corrosion rate and depth by CNN-LSTM model. (a) Predicted corrosion rates at mid-well depth (approximately 1000 m) and bottomhole depth (approximately 2000 m); (b) calculation of corrosion depth.

Figure 9. Schematic diagram of measuring the geometric dimensions of the pipe string.

Figure 10. Comparison of measured values of pipe column wall thickness reduction in field application with model predictions.

Table 1. API sleeve composition content table.

Ingredients	C	Si	Mn	P	S	Als	Cr	Ni	V	Ti	Cu	Mo	Co
9Cr	0.04	0.018	0.2	0.0145	0.007	0.04	8.65	0.026	-	-	-	0.3	-
13Cr	0.26	0.2	0.3	0.005	0.02	-	12.8	3.95	0.02	0.08	0.34	1.05	0.14
3Cr	0.184	0.301	0.595	0.0146	0.003	0.025	2.78	0.039	0.01	0.002	0.045	-	-
P110	0.25	0.2	1.45	0.009	0.03	-	0.02	0.05	0.06	0.11	0.09	-	0.14

Table 2. Corrosion condition parameters.

Experiment Number	Material	Temperature (C)	Partial Pressure of CO₂ (MPa)	Cl⁻ Concentration (mg/L)	Flow Rate (m/s)
1	P110	60	20	500	2.0
2	P110	100	30	1500	4.0
3	P110	135	40	2500	7.0
4	P110-3Cr	60	20	1500	4.0
5	P110-3Cr	100	30	2500	7.0
6	P110-3Cr	135	40	500	2.0
7	P110-9Cr	60	30	500	7.0
8	P110-9Cr	100	40	1500	2.0
9	P110-9Cr	135	20	2500	4.0
10	P110-13Cr	60	40	2500	4.0
11	P110-13Cr	100	20	500	7.0
12	P110-13Cr	135	30	1500	2.0
13	P110	60	30	2500	4.0
14	P110-3Cr	100	40	500	2.0
15	P110-9Cr	135	20	1500	7.0

Table 3. Formation water parameters.

Water Type	pH Value	Cationic mg/L			Anion mg/L
Water Type	pH Value	K⁺ + Na⁺	Mg⁺	Ca⁺	SO₄²⁻	HCO₃⁻	CO₃²⁻
NaHCO₃	6.96	11556	103	440	59	545	0

Table 4. Results of corrosion experiments under different conditions.

Number Verification	Influencing Factors					Final Corrosion Rate (mm/a)
Number Verification	Metal Category	Temperature (°C)	Partial Pressure of CO₂ (MPa)	Cl⁻ Concentration (mg/L)	Flow Rate (m/s)	Final Corrosion Rate (mm/a)
1	P110	60	20	500	2.0	1.856
2	P110	100	30	1500	4.0	2.435
3	P110	135	40	2500	7.0	2.891
4	P110-3Cr	60	20	1500	4.0	1.253
5	P110-3Cr	100	30	2500	7.0	1.877
6	P110-3Cr	135	40	500	2.0	1.489
7	P110-9Cr	60	30	500	7.0	0.052
8	P110-9Cr	100	40	1500	2.0	0.079
9	P110-9Cr	135	20	2500	4.0	0.068
10	P110-13Cr	60	40	2500	4.0	0.059
11	P110-13Cr	100	20	500	7.0	0.028
12	P110-13Cr	135	30	1500	2.0	0.045
13	P110	60	30	2500	4.0	2.762
14	P110-3Cr	100	40	500	2.0	1.365
15	P110-9Cr	135	20	1500	7.0	0.062

Table 5. Range analysis under different corrosion conditions.

Evaluation Indicators	Comprehensive Level Average	Influencing Factors
Corrosion rate		Metal categories	Temperature	Molar content of CO₂	Cl⁻ concentration	Flow velocity
	K₁	2.903	1.657	1.469	1.406	1.383
	K₂	1.141	1.595	1.619	1.514	1.442
	K₃	0.074	1.200	1.364	1.532	1.6279
	R	2.829	0.457	0.255	0.244	0.126
Importance	Metal category > Temperature > Molar CO₂ > Cl⁻ > flow rate

Table 6. CNN-LSTM prediction performance for all four materials.

Material	R² (Single Split)	R² (5-Fold CV Mean ± Std)
P110	0.83	0.81 ± 0.04
P110-3Cr	0.79	0.77 ± 0.05
P110-9Cr	0.88	0.86 ± 0.03
P110-13Cr	0.94	0.92 ± 0.03

Table 7. Comprehensive evaluation metrics for CNN-LSTM model.

Material	MAE (mm/a)	RMSE (mm/a)	MAPE (%)	R²
P110	0.042	0.058	8.5	0.83
P110-3Cr	0.048	0.067	9.8	0.79
P110-9Cr	0.018	0.024	5.9	0.88
P110-13Cr	0.008	0.011	6.2	0.9

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bai, Y.; Mao, H.; Sun, S.; Suo, Y. A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model. Processes 2026, 14, 1399. https://doi.org/10.3390/pr14091399

AMA Style

Bai Y, Mao H, Sun S, Suo Y. A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model. Processes. 2026; 14(9):1399. https://doi.org/10.3390/pr14091399

Chicago/Turabian Style

Bai, Yanyongxu, Haoyu Mao, Shaoxuan Sun, and Yu Suo. 2026. "A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model" Processes 14, no. 9: 1399. https://doi.org/10.3390/pr14091399

APA Style

Bai, Y., Mao, H., Sun, S., & Suo, Y. (2026). A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model. Processes, 14(9), 1399. https://doi.org/10.3390/pr14091399

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model

Abstract

1. Introduction

2. Analysis of the Mechanism of CO₂ Corrosion of Casing

3. Materials and Methods

3.1. Laboratory Corrosion Experiments

3.2. Experimental Conditions and Procedures

3.3. Corrosion Rate Test Results

4. CNN-LSTM Neural Network Prediction Model

4.1. Convolutional Neural Network Model (CNN)

4.2. Long Short-Term Memory Network (LSTM)

5. Results and Discussion

6. Verification of Field Prediction Results of the CNN-LSTM Model

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Study on the Prediction Model of Corrosion Rate of Different Metal Pipe Sleeves Based on CNN-LSTM Hybrid Deep Learning Model

Abstract

1. Introduction

2. Analysis of the Mechanism of CO2 Corrosion of Casing

3. Materials and Methods

3.1. Laboratory Corrosion Experiments

3.2. Experimental Conditions and Procedures

3.3. Corrosion Rate Test Results

4. CNN-LSTM Neural Network Prediction Model

4.1. Convolutional Neural Network Model (CNN)

4.2. Long Short-Term Memory Network (LSTM)

5. Results and Discussion

6. Verification of Field Prediction Results of the CNN-LSTM Model

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. Analysis of the Mechanism of CO₂ Corrosion of Casing