A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning

Liu, Dawei; Cheng, Shiqing; Wang, Han; Wang, Yang

doi:10.3390/pr13082666

Open AccessArticle

A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning

¹

College of Petroleum Engineering, China University of Petroleum-Beijing, Beijing 102249, China

²

Sinopec North China Petroleum Bureau, Zhengzhou 450006, China

^*

Author to whom correspondence should be addressed.

Processes 2025, 13(8), 2666; https://doi.org/10.3390/pr13082666

Submission received: 16 June 2025 / Revised: 1 August 2025 / Accepted: 13 August 2025 / Published: 21 August 2025

(This article belongs to the Special Issue 2nd Edition of Artificial Intelligent Techniques in the Optimal Operation of Oil and Gas Production Systems)

Download

Browse Figures

Versions Notes

Abstract

This study proposes a multi-task learning-based production capacity prediction model aimed at improving the prediction accuracy for gas wells in multi-layer tight sandstone reservoirs of the Sulige gas field under small-sample conditions. The model integrates mutation theory and progressive hierarchical feature extraction to achieve adaptive nonlinear feature extraction and autonomous feature selection tailored to different prediction tasks. Using the daily average production of each gas-bearing layer during the first month after well commencement and the cumulative production of each gas-bearing layer over the first year as targets, the model was applied to predict the production capacity of 66 gas wells. Compared with single-task models and classical machine learning methods, the proposed multi-task model significantly improves prediction accuracy, reducing the root mean squared error (RMSE) by over 40% and increasing the coefficient of determination (R²) to 0.82. Experimental results demonstrate the model’s effectiveness in environments with limited training data, offering a reliable approach for productivity prediction in complex multi-layer tight sandstone reservoirs.

Keywords:

multi-layered tight sandstone gas reservoirs; gas well productivity prediction; progressive layered extraction; multi-task learning

1. Introduction

Some blocks of the Sulige gas field in China have entered a late stage of production and development. As development progresses, formation pressure gradually decreases, leading to significant declines in gas well production. Currently, the main gas-producing layers in this field have been extensively explored and developed, with a high degree of control over the well network in these areas, resulting in high reserve utilization. However, in the eastern part of the field, some non-dominant gas layers are affected by well control, resulting in scattered, point-like distributions. In certain localized areas, these layers show signs of relative enrichment, high well-encounter rates, and favorable physical properties, indicating potential for production enhancement. Therefore, it is necessary to predict the production capacity of these non dominant gas wells to help further evaluate the potential for gas reservoir development.

Productivity prediction is an important step in evaluating gas well production capacity and in planning gas reservoir development. Currently, there are various methods for capacity prediction of gas wells in multi-layer reservoirs, such as the capacity test-well [1,2,3] and analytical methods [4,5,6,7,8,9,10,11]. The capacity test-well method is widely used in gas field development, but it requires a long testing time and only a few wells can be tested because of its cost. The analytical method evaluates the prediction by deriving the capacity equation, in which there are many assumptions in the capacity equation, and the influence of the physical parameters for each gas-bearing formation on the production cannot be considered sufficiently, which leads to a large deviation in the prediction result from the actual value. Multi-layer tight sandstone gas reservoirs can provide high production after fracturing, but a large difference between the layers of the reservoir will cause gas well production to decrease rapidly. Further, with the extension of production time, the formation pressure decreases and the contribution from each gas-bearing formation to the total production will vary, so using the conventional production prediction method makes it difficult to quickly and accurately predict the production rate for the wells. Therefore, it is necessary to find a suitable method for predicting the production capacity of non-dominant gas wells in tight gas reservoirs.

With the rapid development of artificial intelligence in recent years, many machine learning algorithms have been applied in oil and gas well production prediction [12,13,14,15,16,17,18,19]. Neural networks are among the most widely used machine learning algorithms, with flexible structures that adapt to different gas well productivity prediction tasks. From simple self-organizing maps and backpropagation networks to fuzzy, deep, and hybrid neural networks, various models have been applied to predict gas well capacity. For example, Christian Oberwinkler et al. [20] used a three-layer self-organizing map to predict one-year gas production from 200 tight gas wells, considering reservoir thickness, fluid volume, proppant type, and other features. Shelley et al. [21] established a neural network model based on geological and engineering parameters for oil production prediction from 301 fractured wells. Liu Hong et al. [22] developed a fuzzy neural network for reservoir properties and sand addition, while Guofan Luo et al. [23] created a deep neural network with four hidden layers to predict shale oil output, considering primarily numerical features. Shuhua Wang et al. [24] built deep neural networks with 11 numerical and 7 classification features for a cumulative oil forecast in the Bakken Formation. Besides neural networks, algorithms like support vector machines [25,26], random forests [27,28], and gradient boosting [29,30] have also been used. Support vector machines optimized via algorithms like grey wolf [31] and particle swarm [32] have also been employed to improve model accuracy, especially under limited data conditions.

In summary, gas well productivity prediction studies vary in models, variables, and data volume, but most focus on model training and optimization based on structured data. While sophisticated models can enhance accuracy, data limitations—especially with smaller datasets—restrict the predictive performance of transfer learning approaches.

To address these issues, this study introduces a multi-task approach to mitigate the problem of limited data samples. Based on reservoir properties of multi-layer tight sandstone gas reservoirs, fracturing data from gas wells, and production history data, a feature selection process for gas wells in non-dominant gas layers of tight sandstone gas reservoirs is established. A multi-layer tight sandstone gas reservoir–gas well production prediction method (PLEMT) based on Progressive Layered Extraction (PLE) is proposed, enabling multi-task learning-based production capacity prediction for multi-layer tight sandstone gas reservoirs.

2. Few-Shot Learning Prediction Problems of Traditional Machine Learning Methods

Few-Shot Learning (FSL) is an important research area in machine learning that aims to enable models to learn effectively with only a small number of samples. Machine learning with fewer than 100 target training samples is generally referred to as few-shot learning. When traditional machine learning methods are applied to few-shot learning prediction, the limited amount of data available for training prevents the model from fully learning the features and patterns in the data, potentially leading to issues such as the curse of dimensionality, overfitting, and difficulties in feature selection, resulting in poor prediction outcomes [33,34,35]. Data augmentation, feature selection and dimensionality reduction techniques, ensemble learning, and transfer learning are widely applied in few-shot learning prediction. However, in production capacity prediction research, data augmentation modifies the input data. The effectiveness of oil and gas well production capacity prediction is severely constrained by the correlation between input data and real data; feature selection and dimensionality reduction require operators to possess extensive prior knowledge and experience regarding the target reservoir; ensemble learning necessitates training multiple models suitable for predicting the production capacity of target oil and gas wells, which may involve high computational complexity and significant time consumption; and transfer learning requires data from oil and gas reservoirs with similar characteristics to the target reservoir, which demands high data accuracy. Insufficient sample sizes degrade the performance of few-shot learning for oil and gas well production forecasting.

3. Productivity Prediction Model for Multi-Task Progressive Hierarchical Extraction

3.1. Feature Selection Method

Gas well productivity is affected by many factors, some of which may have a nonlinear relationship with production. In the data collection stage, it is necessary to collect as much as possible the relevant files in the database of the gas field enterprise, and extract the required parameters from the files to establish a gas production database, so as to provide sufficient and high-quality training data for modeling. In this paper, a feature selection process for influencing factors of gas well productivity in tight sandstone reservoirs is proposed, as shown in Figure 1.

3.1.1. Mutation Method to Obtain Daily Production of Each Gas-Bearing Layer

The productivity of multi-layer gas wells is affected by the reservoir properties, fracturing parameters, and production dynamic parameters of the wells. The non-dominant layer of multi-layer tight sandstone gas wells is often combined with the dominant layer for production, and there are differences in gas-bearing layers of each well, so if the reservoir properties and fracturing parameters of each layer are input into the production prediction model, it means that it is necessary to realize “one prediction model for one well”, which greatly increases the prediction difficulty and workload. Additionally, when the reservoir properties of two gas-bearing layers differ significantly, the reservoir properties of the layer with poorer properties are easily excluded when predicting gas well production capacity, leading to missing input data. For example, well A is a gas well with a three-layer tight sandstone reservoir, where the reservoir properties of secondary layer 3 are significantly inferior to those of secondary layers 1 and 2. Between 2009 and 2014, the well underwent four profile tests, which showed that secondary layer 3 contributed 23.2% to 30% of the gas well’s production, as shown in Figure 2.

However, when conducting a Spearman correlation analysis between the commonly used reservoir properties of the well (including porosity, permeability, formation thickness, and gas saturation of each gas-bearing layer) and the cumulative gas production of the well, the Spearman correlation coefficient between the characteristics of layer 3 and the total production was significantly lower than that of the other two gas-bearing layers due to the poor reservoir properties of layer 3. Therefore, the reservoir properties of layer 3 cannot be used as feature inputs for the predictive model. Consequently, in the production capacity prediction of multi-layer tight sandstone gas reservoirs, it is necessary to segment the gas well’s production. By studying the correlation between the production capacity and features of each gas-bearing layer after segmentation, the input features for the production capacity prediction model of multi-layer tight sandstone gas wells can be determined.

Mutation theory, founded by French mathematician René Thom in the 1970s, is a comprehensive use of topology, singularity theory, and structural stability to study the phenomenon of mutation in the internal role of the uncertain system of mathematical disciplines. The theory can also be applied to ranking or preferentially selecting different entities based on the same influencing factors. The most widely used types of mutation include cusp mutation, swallowtail mutation, and butterfly mutation models [36], and the potential function and divergence point set equations are shown in Table 1. In oil and gas research, applying mutation theory involves the topics of reservoir evaluation, production splitting, and target optimization, and the production splitting of gas-bearing layers of multi-layered combined extraction wells has also achieved good results [37,38,39].

The normalization formula can be derived from the divergence point set equation, and then the total mutation affiliation function value of the system can be found, so the state variables and control variables in the normalization formula need to be normalized to between 0 and 1. The normalization formulas for the three commonly used mutation models given above are as follows:

\begin{matrix} Spike mutation : & x_{α} = \sqrt{u}, x_{e} = \sqrt[3]{v} \\ Swallow-tailed mutation : & x_{x} = \sqrt{u}, x_{v} = \sqrt[3]{v}, x_{v} = \sqrt[4]{w} \\ x_{t} = \sqrt{t}, x_{v} = \sqrt[3]{u}, x_{v} = \sqrt[4]{v}, \\ Butterfly mutation : & x_{w} = \sqrt[5]{e} \end{matrix}

First, determine the control variables affecting the mutation model, and sort and classify the influence size; establish the mutation model architecture from bottom to top, i.e., from the indicator layer to the criterion layer, and then to the target layer; next, calculate the mutation affiliation function value according to the mutation model satisfied by each layer within the architecture after normalization of each control variable; and finally determine the system target value for the different evaluation objects.

3.1.2. De-Multicollinearity

Gas well productivity prediction often involves many input features, and high inter-feature similarity can lead to multicollinearity, distorting model estimates or reducing stability and thus causing overfitting. To address this, we apply two sequential screening steps:

(1): Spearman Correlation

We compute the Spearman rank correlation coefficient (

| ρ |

) between each feature and the target variable. Features with

| ρ | > 0.2

are retained to form feature subset A.

(2): Hampel Distance

We then evaluate the full-order nonlinear dependency of each feature on the target using the Hampel identifier. For feature

j

, the Hampel distance

H_{j}

is defined as

H_{j} = \frac{| I_{j} - median (I) |}{1.4826 \times MAD (I)}

(1)

where

H_{j}

is the biased mutual information estimate for feature

j

,

median (I)

is the median of all

I_{j}

values,

MAD (I)

is the median absolute deviation of these

I_{j}

values, and 1.4826 is the normalization factor that makes

MAD

an unbiased estimator of the standard deviation under normality.

By the 3-sigma rule, which states that approximately 99.7% of values in a normal distribution lie within three standard deviations of the mean, features with

H_{j} > 3

are deemed to have significant dependency on the target and are retained to form feature subset B.

Finally, we merge subsets A and B to create the candidate set C. To remove redundancy, we compute pairwise Spearman correlations within C; whenever

| ρ | > 0.8

between two features, we discard the feature with lower task relevance. The remaining features constitute the final subset D.

3.1.3. Feature Importance Evaluation

In this study, the importance scores for each feature were evaluated using three different methods. First, the sequential backward selection method assessed feature importance based on the change in model performance after iteratively removing each feature, combined with importance scores derived from a random forest model. Features with higher importance scores contributed more significantly to the model’s accuracy. Second, recursive feature elimination was used to evaluate feature importance based on the absolute values of coefficients or importance metrics, where features with larger absolute values were considered more influential. Third, the SHAP algorithm quantified the contribution of each feature to individual predictions, and the average of the absolute SHAP values across all samples was used as a global importance measure. The scores from these three methods were normalized using min–max scaling and then combined through a weighted average to generate a comprehensive feature importance ranking. Based on this ranking, the top features were selected to form the feature subset, denoted as E, for subsequent modeling.

3.1.4. Feature Number Optimization

To ensure the robustness of the feature selection process and prevent overfitting, all feature importance rankings and optimal feature subset selections were performed exclusively on the training dataset. For each candidate feature subset, a model was trained solely on training data, and its performance was evaluated within the training set using cross-validation. The feature subset yielding the best validation performance was identified as the optimal set. Importantly, the test dataset was kept completely independent and unused during the feature selection process, thereby avoiding any data leakage. The final model was trained with this selected feature set and directly tested on the test data.

3.2. Progressive Hierarchical Extraction Methods

Multi-task learning aims to enhance the generalization performance by sharing information among multiple tasks and essential in choosing appropriate parameter-sharing methods. Existing parameter-sharing methods in multi-task learning mostly use hard parameter sharing, soft parameter sharing, and Multi-Gate Mixture of Experts (MMOE) and its variants; however, these methods have certain limitations. Hard parameter sharing does not take into account the sensitive relationships and essential differences among tasks, and the model will share features during the training of different task predictions, which will degrade the overall task performance and lead to a negative migration for multi-task learning. The soft-sharing approach requires training a model for each task, which is not parametrically efficient, and the model building and implementation of the parameter-sharing approach requires the operator to have rich prior knowledge. MMOE does not address the issue of co-use of features for different tasks and has high requirements for task relevance.

To effectively harness relevant features and facilitate efficient feature sharing among multiple tasks, this study adopts the Progressive Layered Extraction (PLE) method [40]. PLE aims to progressively extract and fuse features across multiple levels, enabling the model to preserve and leverage task-specific information while enhancing its generalization capability. The structure of the PLE model is shown in Figure 3.

The PLE model typically consists of multiple hierarchical layers, each containing shared experts and task-specific experts. Shared experts are responsible for learning common features across all tasks, whereas task-specific experts focus on capturing unique information for individual tasks. In each layer, gating mechanisms dynamically fuse the outputs from shared and task-specific experts, producing task-specific feature representations. This layered extraction process captures multi-scale, hierarchical task-correlated features.

The primary advantage of this method lies in its ability to integrate multi-level features effectively, avoiding premature feature interference or loss. It enables the model to adaptively share features at different hierarchical levels, boosting both accuracy and robustness in multi-task scenarios. Additionally, the flexible gating mechanism allows tasks to dynamically adjust the contribution of different feature sources, improving overall model adaptability and generalization.

3.3. MTPLE Capacity Forecasting Model

To improve the accuracy of capacity prediction, this study introduces the Multi-Task Progressive Layered Extraction (MTPLE) model. As shown in Figure 4, the MTPLE model leverages multi-task learning and progressive hierarchical feature extraction to build a robust, generalizable forecasting framework.

The model consists of multiple hierarchical layers. Input features are first preprocessed and then passed into expert networks at each layer, which include both shared experts and task-specific experts. A gating mechanism dynamically fuses the outputs of these experts, generating hierarchical features tailored to each task. These features are subsequently processed by task-specific tower networks to produce the final predictions.

The model is designed to address two main prediction tasks: Task 1 involves predicting the daily average production of each gas-bearing layer during the first month after well commencement. Task 2 focuses on predicting the cumulative production of each gas-bearing layer over the first year. The selected features are fed into the MTPLE structure, where different expert networks and shared networks are constructed for the two tasks. The outputs from these networks are fused via gating units and further processed by tower networks to generate task-specific predictions.

To ensure a balance between computational efficiency and predictive accuracy, the extraction network is configured with two layers, and linear activation functions are applied to the expert and shared network layers. The tower networks are implemented as Deep Neural Networks (DNNs) with 64 units each. The initial learning rate is set to 0.001, decaying by 0.2 every 10 iterations, with a maximum of 100 iterations. The performance metric used for network optimization is the coefficient of determination (R²).

4. Example Analysis of Gas Well Productivity Prediction in Multi-Layer Tight Sandstone Gas Reservoirs

4.1. Data Source and Preprocessing

This study analyzes a three-layer tight sandstone gas reservoir. The reservoir comprises 66 production wells, with non-main gas-producing layers including the He 8–1 section, Shan 2–3 section, and Benxi Formation. Based on the collected data, we extracted a categorized set of 43 feature parameters covering key geological, fracturing, and operational factors.

Since there are no missing values, no imputation was performed. Categorical features (such as the production layer location) were encoded using one-hot encoding. Continuous features (such as permeability and pressure) were standardized. Considering that all engineering parameters (such as orifice size, casing pressure, tubing pressure) are at well level, we can use the layer capacity results obtained from the mutation method to perform a “virtual correction” and estimate each layer’s parameters accordingly. Specifically, the method assumes

{Layer Parameter}_{i} \approx Well-level Parameter \times \frac{{Layer}_{i} Productivity}{Total Productivity}

(2)

This approach allows us to derive layer-specific estimates for these well-level parameters based on the production capacity split, providing more realistic features for subsequent modeling. Taking the features for task 1 as an example, they include geological parameters and fracturing parameters for each layer, as well as the virtual layer engineering parameters derived from the mutation method. These features allow for a more precise representation of each layer’s capacity in the first month, providing targeted input variables for the model. Table 2 shows the range of values for some features in task 1.

4.2. Calculation of Gas Well Production by the Mutation Method

The process of producing natural gas is regarded as a mutation phenomenon under the joint influence of reservoir physical properties, fracturing effect, and production dynamic parameters. Utilizing the mutation theory, the gas-bearing layer target state value is obtained to evaluate the production state of each gas-bearing layer. According to the experience of gas reservoir development, the production splitting coefficient of each layer in multi-layer combined gas wells is considered a separate system, controlled by ten internal factors categorized into three subsystems, as shown in Figure 5. The reserve characteristics subsystem (A) includes three factors: effective thickness (A1), porosity (A2), and gas saturation (A3), and is modeled as a swallowtail mutation. The development characteristic subsystem (B) comprises permeability (B1), interlayer interference coefficient (B2), and mid-layer pressure (B3), also forming a swallowtail mutation model. The geological features subsystem (C) consists of sedimentary microfacies (C1), sandstone content (C2), reservoir density (C3), and gas layer depth (C4), which constitute a butterfly mutation model. The three subsystems—reserve characteristics, development characteristics, and geological features—together form the overall mutation system.

Because not all non-dominant gas wells in the target block have undergone gas production profile testing, to verify the rationality of the mutation method, eight gas wells with gas production profile detection records were selected for calculation. Taking Well Y as an example, the calculation process of the mutation method is described. Table 3 shows the values of the influence factor statistics for Well Y.

Taking a relatively discontinuous surface as an example, the reservoir feature subsystem, which includes layer thickness, porosity, and gas saturation, forms a swallowtail discontinuity model. Based on the obtained target values for each layer, M_i, and the relative mutation surface target value, M′, the production splitting coefficient for each layer is calculated using Equation (3).

P_{i} = \frac{M_{i} - M^{'}}{\sum_{i = 1}^{n} (M_{i} - M^{'})}

(3)

x_{A_{1}} = {(0.1398)}^{\frac{1}{2}} = 0.3739

(4)

x_{A_{2}} = {(0.6197)}^{\frac{1}{3}} = 0.8526

(5)

x_{A_{3}} = {(0.9242)}^{\frac{1}{4}} = 0.9805

(6)

A = (x_{A_{1}} + x_{A_{2}} + x_{A_{3}}) / 3 = 0.7356

(7)

The development feature subsystem of this layer constitutes a swallowtail mutation; as calculated above, B = 0.7014. The geological feature subsystem constitutes a butterfly mutation; as calculated above, C = 0.9033. The three subsystems constitute a swallowtail mutation; therefore, the target value of the relative mutation surface system is M = 0.9070. Using the same method, we obtain the system target values for each production layer, as shown in Table 4. Next, we calculate the yield-splitting coefficient through Equation (5), and then obtain the production capacity for each sublayer.

The contribution of gas well productivity to each sublayer was also analyzed using mutation theory for the remaining seven wells. The calculation results show that the error of the mutation method in calculating the layered production contribution and the gas production profile test is less than 15%, which meets the requirements of the engineering calculations, as shown in Figure 5. Therefore, the mutation method was utilized to split 66 joint wells in the non-dominant layer of a multi-layer tight sandstone gas field to obtain the daily production from each gas-bearing layer of the gas wells.

4.3. Feature Selection Results

According to the feature selection process, the reservoir properties and fracturing parameters described in Section 3.1 are used as independent variables. For feature selection, the reservoir properties and fracturing parameters described in Section 3.1 were used as independent variables. The per-layer gas productivity ratios were obtained after layer-wise splitting using the mutation method. These ratios were then used as target variables to derive open-flow capacities and first-year cumulative production for each gas-bearing layer, based on which the indicators for tasks 1 and 2 were selected. Since the feature selection process is the same for different prediction tasks, we only describe in detail the feature selection for task 1.

4.3.1. De-Multicollinearity Results

The Spearman correlation coefficient method was used to evaluate task 1 and 43 features, retaining those with a correlation greater than 0.2 to form feature subset A (22 features). Due to the large number of features, the Spearman correlation plot is not shown in this paper. Simultaneously, Hampel entropy was used for feature evaluation. Features with Hample entropy greater than 3 were selected to form feature subset B (20 features). The Hampel entropy calculation results are shown in Figure 6. The union of A and B was taken to form C (20 features). By combining the Spearman correlation coefficient method to remove redundant features, the number of input features was reduced to 16, forming feature set D.

4.3.2. Results of the Feature Importance Calculation

Comprehensive sequence backward selection, SHAP, and the three methods were used to calculate the importance scores of the features in D. The weighted average of the three methods after 0–1 normalization was used to obtain the final feature importance scores for the 16 features in task 1, as shown in Figure 7.

4.3.3. Determine the Number of Task Features

To determine the number of task-specific features, we ranked features by [feature importance/chosen ranking metric] and then evaluated model performance by sequentially using the top 1, top 2, …, top 16 features as inputs. For each feature subset we trained a Random Forest (max depth = 3, n_estimators = 1000) and recorded the R² on the validation set (see Figure 8). The R² curve peaked at seven features, which therefore constituted the optimal feature set E for task 1. The same procedure produced feature set F for task 2. The final feature set U was formed by taking the union of E and F (duplicates removed) and used as input to the MTPLE model. The resulting input features were casing pressure, gas layer thickness, permeability, fracturing sand intensity, production time, gap nipple size, and total fracturing fluid volume.

4.4. Validity Analysis of MTPLE Model

To validate the effectiveness of the proposed Multi-Task Progressive Learning Environment (MTPLE), we designed a comprehensive experiment to compare its performance with that of single-task neural network models. The single-task models are trained individually at the end of the PLE structure for each task (task 1 and task 2), with identical hyperparameters to ensure a fair comparison.

For data partitioning, the first 56 wells were used as the test set, and the remaining 10 wells as the training set. All neural network models were trained for 1000 epochs to allow sufficient convergence. Given the inherent randomness in neural network training—such as weight initialization and mini-batch sampling—each model configuration was trained 100 times to improve robustness and reduce stochastic variability. This resulted in repeated training of the same model architecture, each time starting from different initial conditions, and generating slightly different models.

Considering the use of 5-fold cross-validation to better evaluate model stability and generalization, the entire training process was conducted across five different data splits, with each fold serving as the validation set once, while the remaining four folds served as training data. Each fold was trained 100 times with different initializations, leading to a total of 5 folds × 100 repetitions = 500 trained models for each task and each model type (MTPLE and single-task).

The performance of the models was assessed by predicting across these 500 models for each sample, and then averaging the results to obtain the final predicted values. This approach minimizes the influence of randomness and provides a more reliable estimate of the models’ predictive capabilities.

The results in Table 5 show that for task 1, the prediction errors were generally less than 15%, while for task 2, errors remained below 20%. These levels of accuracy satisfy the typical requirements for capacity prediction in this geological block. Figure 9 and Figure 10 provide a visual comparison of the 500 predicted values generated by the MTPLE model and single-task models for tasks 1 and 2, respectively, under 5-fold cross-validation, demonstrating the effectiveness and stability of the multi-task approach.

Figure 9 and Figure 10 show the comparison of prediction results between the MTPLE and DNN models for task 1 and task 2, respectively.The dashed black box line represents the distribution of the 10% to 90% quantile predicted values, and the narrower the distribution range, the more stable the model-predicted results. In tasks 1 and 2, the interval of the 90% quantile of the predicted yield of the MTPLE model is narrower than that of the DNN model, and the mean value is closer to the actual yield, which indicates that the MTPLE model plays the role of data enhancement through the information sharing between tasks, and improves the accuracy and stability of the productivity prediction.

Table 6 compares the prediction errors of the MTPLE and DNN models. Compared with the DNN model, the RMSEs of the two tasks of MTPLE are reduced by 40% and 25.35%, respectively, indicating that the prediction performance of MTPLE is significantly better than that of the single-task model.

4.5. Performance Comparison Between MTPLE and Classical Machine Learning Models

To further evaluate the performance of the MTPLE model, four classical machine learning algorithms—K-Nearest Neighbor (KNN), Random Forest (RF), Support Vector Machine (SVM), and XGBoost—were also implemented. All models utilized the feature selection results to independently predict the two main tasks: task 1, predicting the capacity of each gas-bearing layer in the first month, and task 2, predicting the capacity of each layer during the first year. Since these traditional algorithms generally support single-task prediction, separate models were trained for each task. The hyperparameters were optimized via Bayesian optimization, which constructs a probabilistic surrogate model of the objective function to efficiently explore the hyperparameter space. The search ranges for each model’s hyperparameters were set as follows: for KNN, the number of neighbors (K) varied from 1 to 20; for SVM, the kernel included ‘linear’, ‘rbf’, and ‘poly’, with the regularization parameter C set between 0.1 and 1000, and gamma between 0.01 and 10; for Random Forest, the number of trees (n_estimators) ranged from 100 to 1000, and the maximum depth (max_depth) between 5 and 30; for XGBoost, the learning rate (learning_rate) was between 0.01 and 0.2, the maximum depth (max_depth) between 3 and 10, and the number of estimators (n_estimators) between 100 and 1000. Each model was run 500 times with different initializations, and the prediction results were averaged for stability. All models were evaluated through 5-fold cross-validation to ensure robustness and reduce bias.

Figure 11 and Figure 12 show the prediction results of MTPLE and Support Vector Machine, K-Nearest Neighbor, Random Forest, and XGBoost for tasks 1 and 2, respectively. The horizontal coordinates in the figure are the actual initial production and the first-year cumulative production values, respectively, the vertical coordinate is the model prediction value, and the black dashed line is the 45° diagonal; the smaller the angle between the fitted line and the diagonal of the prediction result, the higher the model prediction accuracy. The prediction results of MTPLE are closer to the 45° diagonal for both tasks, and the prediction result is superior to that of classical machine learning algorithms.

Table 7 compares the MSE and R² of the Support Vector Machine, K-Nearest Neighbor, Random Forest, XGBoost, and MTPLE models for tasks 1 and 2, and the MTPLE model achieves the minimum error in both tasks. In the case of limited training data, the traditional machine learning algorithm is limited by the single-task prediction mode, which makes it difficult to fully learn the change-rule of yield; thus, the prediction accuracy is significantly lower than that of the MTPLE model.

The prediction model proposed in this paper can be used not only for predicting the productivity of gas wells in non-dominant multi-layer tight sandstone gas reservoirs, but also for predicting the productivity of other multi-layer gas wells and multi-layer oil wells after changing the appropriate input characteristics.

5. Influence of Different Characterization Categories on the Results of Production Capacity Prediction

Reservoir physical properties represent the geological characteristics of gas wells, and fracturing parameters reflect the fracture formation and expansion status, which in turn affect the production capacity of gas wells, and engineering parameters represent the impact of changes in the working system on the production capacity of gas wells. To clarify the influence of different feature categories and to reveal the relationship law between different features and production capacity, the feature categories listed above were input into the MTPLE prediction model to judge the influence of different feature categories on the production capacity of gas wells.

Figure 13 compares the prediction performance of the MTPLE model with different feature category input. The production dynamic parameters contribute the least to the performance of task 1, which is caused by the relatively consistent pressure drop in each well at the beginning of production and the small change in the gas nipple size. The production dynamic parameters have the largest contribution to task 2, which may be due to the good correlation between the casing pressure and gas well productivity in the long-term gas well production, while operational changes, such as switching on and off wells and replacing the gas nipple, provide dynamic information related to fluctuations in production, which constrains the overall trend in the long-term production of gas wells. Geological parameters have a small effect on both tasks 1 and 2, which reflects the strong heterogeneity and strong interval variability of reservoirs in tight sandstone gas reservoirs. The large difference between the mean and median values of the reservoir physical properties in Table 1 indicates that the distribution of these characteristics is more sporadic between the maximum and the minimum values, whereas the difference between the mean and median of the gas well open-flow capacity and the first-year cumulative production is very small, which results in a small difference between the correlation of these characteristics and the open-flow capacity and the first-year cumulative production. This results in a poorer correlation between these other features and the open-flow capacity and first-year cumulative production, and there may be human errors in the initial entry. The influence of fracturing parameters on tasks 1 and 2 is larger, which indicates that the fracturing parameters can reflect the production status of gas wells during the initial and long-term production periods, and reasonable fracturing parameters have a greater influence on the productivity capacity of gas wells in multi-layered tight sandstone reservoirs.

6. Conclusions

(1): This study describes the Multi-Task Progressive Layered Learning (MTPLE) model for predicting gas well productivity in multi-layer tight sandstone reservoirs. The model incorporates various feature engineering techniques, such as Hampel entropy and Spearman correlation analysis, to extract deep-level features and uncover underlying associations between features and variables. The progressive hierarchical feature extraction effectively avoids interference from weakly correlated information across different tasks. Further, it enhances the model’s ability to leverage coupling relationships within the data, resulting in more accurate reflection of changes in the open-flow capacity and the first-year cumulative production of multi-layer gas wells, thereby improving the model’s stability.
(2): Compared with single-task models, the MTPLE model improves the R² values by 20% and 9% for initial production and first-year cumulative production of 50 gas wells in a certain block, respectively. This indicates that the model effectively reduces prediction errors and enhances prediction accuracy under few-shot learning conditions.
(3): Compared with traditional machine learning models, the MTPLE model significantly improves the prediction accuracy of open-flow capacity and first-year cumulative production for the same 50 gas wells, demonstrating that multi-task learning greatly enhances the model’s generalization ability and effectively alleviates the challenges posed by limited data samples.
(4): The analysis of different feature categories for gas well productivity prediction shows that production dynamic parameters and fracturing parameters have a greater impact on long-term well capacity, while fracturing parameters greatly influence initial capacity. Conversely, geological parameters have a smaller effect, mainly due to the high heterogeneity of the tight sandstone reservoirs.

Author Contributions

Conceptualization, S.C. and D.L.; methodology, D.L.; software, D.L.; validation, H.W. and Y.W.; formal analysis, D.L.; investigation, H.W.; resources, H.W. and Y.W.; data curation, H.W.; writing—original draft preparation, D.L.; writing—review and editing, D.L. and Y.W.; visualization, D.L.; supervision, S.C.; and project administration, H.W. and Y.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Han Wang was employed by Sinopec North China Petroleum Bureau. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflicts of interest.

References

Chen, Y. A simple method to determine the absolute unobstructed flow rate of gas wells. Nat. Gas Ind. 1987, 7, 59–63. [Google Scholar]
Wang, W.; Shen, P.; Ma, X.; Jia, A. Research on the Analysis Method of Gas Well Production Test Data for Low Permeability Gas Reservoirs. Nat. Gas Ind. 2005, 30, 76–78+153. [Google Scholar]
Shi, J.; Li, Q.; Zhang, L.; Sun, X.; Sun, Z.; Liu, S. Reasons and correction methods for abnormal production capacity indicator curves in multi-layer co production gas wells. Nat. Gas Ind. 2018, 38, 50–59. [Google Scholar]
Kucuk, F.; Karakas, M.; Ayestaran, L. Well Testing and Analysis Techniques for Layered Reservoirs. SPE Form. Eval. 1986, 1, 342–354. [Google Scholar] [CrossRef]
Kucuk, F.; Wilkinson, D. Transient PressNre Behavior of Commingled Reservoirs. SPE Form. Eval. 1991, 6, 111–120. [Google Scholar] [CrossRef]
Yue, J.; Duan, Y.; Chen, W.; Wang, S.; Liu, Q. Study on the productivity of fractured gas wells with multiple vertical fractures. Daqing Pet. Geol. Dev. 2004, 3, 46–48+91–92. [Google Scholar]
He, Y.; Xu, L.; Lv, W.; Yang, Z.; Hao, M. Analysis of Production Capacity of Fractured Wells in Low Permeability and Permeable Reservoirs. Spec. Oil Gas Reserv. 2006, 59–61+107. [Google Scholar]
Liang, B.; Li, M.; Zeng, F.; Du, K. Research on Production Capacity Analysis Methods for Tight Gas Reservoirs. Fault Block Oil Gas Fields 2005, 30–33+90–91. [Google Scholar]
Wang, T.; Yu, H.; Zhao, P.; Li, J.; Liu, R.; Kou, S.; Wang, J.; Liao, S. Evaluation of post fracturing production capacity of tight gas wells based on unstable pressure well testing analysis. Spec. Oil Gas Reserv. 2023, 30, 122–130. [Google Scholar]
Bai, W.; Cheng, S.; Wang, Y.; Cai, D.; Guo, X.; Guo, Q. Prediction method for unstable production of multiphase flow in tight condensate gas wells. Pet. Explor. Dev. 2024, 51, 154–160. [Google Scholar] [CrossRef]
Wang, X.; Feng, D.; Li, X.; Yan, Y.; Guo, X.; Qiao, X.; Xue, B.; Ma, C.; Wang, Q.; Lei, K.; et al. Evaluation of tight gas well productivity considering time-varying effects: A case study of Yan’an gas field. J. Pet. 2019, 40, 1358–1367. [Google Scholar]
Han, K.; Wang, W.; Fan, D.; Yao, J.; Luo, F.; Yang, C. Production prediction of atmospheric shale gas wells based on the coupling of production decline and LSTM. Oil Gas Reserv. Eval. Dev. 2023, 13, 647–656. [Google Scholar] [CrossRef]
Lin, H.; Sun, X.; Song, X.; Meng, C.; Xiong, W.; Huang, J.; Liu, H.; Liu, C. Research on shale gas well production prediction model based on improved artificial neural network. Oil Gas Reserv. Eval. Dev. 2023, 13, 467–473. [Google Scholar] [CrossRef]
Pang, L.; Wang, Y.; Jiang, W.; Wang, Y.; Gao, G.; Wang, X. Research on Short Production Cycle Carbonate Gas Well Production Prediction Based on Machine Learning. Spec. Oil Gas Reserv. 2023, 30, 134–141. [Google Scholar]
He, Y.; He, Z.; Tang, Y.; Qin, J.; Song, J.; Wang, Y. Production evaluation and prediction of shale gas wells based on machine learning. Pet. Drill. Prod. Technol. 2021, 43, 518–524. [Google Scholar] [CrossRef]
Ma, X.; Fan, Y. Production prediction model of fracturing vertical well based on machine learning. Math. Pract. Underst. 2021, 51, 186–196. [Google Scholar]
Bai, W.-P.; Cheng, S.-Q.; Guo, X.-Y.; Wang, Y.; Guo, Q.; Tan, C.-D. Oilfield analogy and productivity prediction based on machine learning: Field cases in PL oilfield, China. Pet. Sci. 2024, 21, 2554–2570. [Google Scholar] [CrossRef]
Li, X.; Ma, X.; Xiao, F.; Xiao, C.; Wang, F.; Zhang, S. Time-series production forecasting method based on the integration of Bidirectional Gated Recurrent Unit (Bi-GRU) network and Sparrow Search Algorithm (SSA). J. Pet. Sci. Eng. 2022, 208, 109309. [Google Scholar] [CrossRef]
Wang, Y.; Cheng, S.; Zhang, F.; Feng, N.; Li, L.; Shen, X.; Li, J.; Yu, H. Big Data Technique in the Reservoir Parameters’ Prediction and Productivity Evaluation: A Field Case in Western South China Sea. Gondwana Res. 2021, 96, 22–36. [Google Scholar] [CrossRef]
Oberwinkler, C.; Ruthammer, G.; Zangl, G.; Economides, M.J. New tools for fracture design optimization. In Proceedings of the PE International Symposium and Exhibition on Formation Damage Control, Lafayette, LA, USA, 18–20 February 2004. SPE-86467. [Google Scholar]
Shelley, R.; Oduba, O.; Melcher, H. Machine learning and artificial intelligence provide insights for Wolfcamp completion design. In Proceedings of the SPE Hydraulic Fracturing Technology Conference and Exhibition, Virtual, 4–6 May 2021. [Google Scholar]
Liu, H.; Zhao, J.; Hu, Y.; Zhang, S.; Liu, J. Prediction of hydraulic fracturing effectiveness using T-S fuzzy neural network. Fault Block Oil Gas Field 2002, 35-38, 91. (In Chinese) [Google Scholar]
Luo, G.; Tian, Y.; Bychina, M.; Ehlig-Economides, C. Production optimization using machine learning in Bakken shale. In Proceedings of the 6th Unconventional Resources Technology Conference, American Association of Petroleum Geologists, Houston, TX, USA, 23 July 2018; pp. 2174–2197. [Google Scholar]
Wang, S.; Chen, Z.; Chen, S. Applicability of deep neural networks for production forecasting in Bakken shale reservoirs. J. Pet. Sci. Eng. 2019, 179, 112–125. [Google Scholar] [CrossRef]
Li, C. Analysis and Prediction of Factors Affecting Capacity of ZT Shale Gas Wells. Master’s Thesis, Xi’an Petroleum University, Xi’an, China, 2020. [Google Scholar]
Li, Y. Study on Main Controlling Factors of Coalbed Methane Well Production and Prediction Based on Machine Learning methods. Master’s Thesis, China University of Petroleum, Beijing, China, 2017. [Google Scholar]
Song, X. Research on Hydraulic Fracturing and Capacity Prediction of Volcanic Reservoir Horizontal Wells Based on Machine Learning Methods. Master’s Thesis, China University of Petroleum, Beijing, China, 2020. [Google Scholar]
Wang, S.; Chen, S. Insights into fracture stimulation design in unconventional reservoirs based on machine learning modeling. J. Pet. Sci. Eng. 2019, 174, 682–695. [Google Scholar] [CrossRef]
Makhotin, I.; Koroteev, D.; Burnaev, E. Gradient boosting for enhancing hydraulic fracturing efficiency in unconventional reservoirs. J. Pet. Explor. Prod. Technol. 2019, 9, 1919–1925. [Google Scholar] [CrossRef]
Wu, H. Research on Hydraulic Fracturing Optimization Models for Shale Gas Wells Based on Machine Learning. Master’s Thesis, China University of Petroleum, Beijing, China, 2020. [Google Scholar]
Song, X.; Liu, Y.; Ma, J.; Wang, J.; Kong, X.; Ren, X. Capacity prediction based on support vector machine optimized via grey wolf algorithm. Rock Oil Gas Reserv. 2020, 32, 134–140. [Google Scholar]
Xu, H. Improved Particle swarm Optimization Algorithm and Its Application in Predicting Coalbed Methane Capacity. Master’s Thesis, China University of Mining and Technology, Xuzhou, China, 2013. [Google Scholar]
Kokol, P.; Kokol, M.; Zagoranski, S. Machine learning on small size samples: A synthetic knowledge synthesis. Sci. Prog. 2022, 105, 1–16. [Google Scholar] [CrossRef]
Larracy, R.; Phinyomark, A.; Scheme, E. Machine Learning Model Validation for Early Stage Studies with Small Sample Sizes. In Proceedings of the 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), Mexico, Mexico, 1–5 November 2021; pp. 2314–2319. [Google Scholar] [CrossRef]
Li, C.; Wang, L.; Li, J.; Chen, Y. Application of multi-algorithm ensemble methods in high-dimensional and small-sample data of geotechnical engineering: A case study of swelling pressure of expansive soils. J. Rock Mech. Geotech. Eng. 2024, 16, 1896–1917. [Google Scholar] [CrossRef]
Dai, Y.; Li, Z.; Wu, D. Application of mutation theory in seismic data reservoir prediction. Nat. Gas Ind. 2006, 47-49+158-159. [Google Scholar]
Tian, L.; Shen, Z.; Liu, L.; Wang, M. Research on Production Split Methods for Multi-layer Overlying Tight Sandstone Gas Reservoirs. Nat. Gas Explor. Dev. 2016, 39, 41–44+6-7. [Google Scholar]
Zhang, Y.; Cheng, S.; Shi, W.; Yan, J.; Zheng, R.; Wang, S. Production Split Method for Multi-layer oint Production Wells and Its Application in the Daniu Gas Field. Pet. Drill. Prod. Technol. 2019, 41, 624–629. [Google Scholar] [CrossRef]
Fu, Q.; Xue, G.; Ren, C.; Lin, R.; Luo, J. Application of a New Method for Splitting Production from Multi-layer oint Production Wells in the W Oilfield. Block Oil Gas Fields 2019, 26, 512–515. [Google Scholar]
Tang, H.; Liu, J.; Zhao, M.; Gong, X. Progressive layered extraction (PLE): A novel multi-task learning (MTL) model for personalized recommendations. In Proceedings of the 14th ACM Conference on Recommender Systems, New York, NY, USA, 22–26 September 2020; pp. 269–278. [Google Scholar]

Figure 1. Selection process of production capacity prediction features for multi-layer tight sandstone gas wells.

Figure 2. Gas production ratio of each layer in well A.

Figure 3. PLE modeling framework.

Figure 4. MTPLE Capacity Forecasting Model framework.

Figure 5. Mutation method calculation results.

Figure 6. Hampel entropy calculation results for task 1.

Figure 7. Feature importance score for task 1.

Figure 8. Method to determine the number of input features for task 1.

Figure 9. Comparison of prediction results between the MTPLE and DNN models in task 1. (a) Acutal value vs. predicted values for the MTPLE model in task 1; (b) Acutal value vs. predicted values for the DNN model in task 1.

Figure 10. Comparison of prediction results between the MTPLE and DNN models in task 2. (a) Acutal value vs. predicted values for the MTPLE model in task 2; (b) Acutal value vs. predicted values for the DNN model in task 2.

Figure 11. Predicted vs. actual production values for task 1 by MTPLE and classical machine learning models.

Figure 12. Predicted vs. actual production values for task 2 by MTPLE and classical machine learning models.

Figure 13. Comparison of production capacity prediction results for different feature categories. (a) Comparison of MTPLE prediction performance across different feature categories for Task 1; (b) Comparison of MTPLE prediction performance across different feature categories for Task 2.

Table 1. Potential function of mutation type and its divergence point set equation.

Mutation Type	Control Variable	Potential Function	Divergence Point Set Equation
Spike mutation	2	$V (x) = x^{2} + μ x^{2} + v x$	$v = - 6 x^{2}, v = - 8 x^{3}$
Swallow-tailed mutation	3	$V (x) = \frac{1}{4} x^{4} + \frac{1}{2} a x^{2} + b x$	$v = - 6 x^{2}, v = 8 x^{3}, w = - 3 x^{4}$
Butterfly mutation	4	$\begin{array}{l} V (x) = \frac{1}{6} x^{5} + \frac{1}{4} t x^{4} + \frac{1}{3} u x^{3} \\ + \frac{1}{2} v x^{2} + w x \end{array}$	$\begin{array}{l} t = - 10 x^{2}, u = 20 x^{3}, \\ v = - 15 x^{4}, w = 4 x^{5} \end{array}$

Table 2. The range of values for some features in task 1.

Feature Category	Feature Name	Maximum Value	Minimum Value	Average Value
Geological factors	Gas layer thickness (m)	15.8	7.66	6.61
	Permeability (10⁻³ μm²)	1.3	0.21	0.78
	Porosity (%)	13.33	3.5	7.55
	Gas saturation (%)	67.6	8.2	44.3
Fracturing factors	Flowback rate (%)	100	24.28	74.75
	Fracturing sand intensity (m³/m)	67.6	8.2	44.24
	Perforating thickness (m)	10	2	5.62
Operational factors	Production time (h)	720	564.5	708.6
	Gas nipple size (mm)	10	6.5	8.73
	Casing pressure (MPa)	29.8	6.4	18.35
	Tubing pressure (MPa)	29.8	5.3	14.31

Table 3. Values of the influence factor statistics for Well Y.

	Gas Well Layer	Gas Layer Thickness/m	Porosity	Gas Saturation	Permeability/mD	Inter Layer Interference Coefficient	Depth Pressure in the Middle of the Gas Layer/Mpa	Sedimentary Microfacies	Sandstone Content	Density/ (g·cm⁻³)	Depth in the Middle of the Gas Layer/m
Original value	He 8–1	9.3	5.68%	74.22%	1.73	0.5	22.51	2	98.50%	2.51	3037.3
	Shan 1–1	3.8	3.52%	72.76%	1.38	1	23.34	1	100.00%	2.74	3148.7
	Shan 1–2	1.3	4.64%	78.73%	0.65	2	23.37	1	100.00%	2.75	3153.6
	Taiyuan	2.2	3.59%	72.80%	0.86	8	23.45	1	95.60%	2.71	3164.2
	Mutation	1.3	3.52%	72.76%	0.65	8	22.51	1	95.60%	2.75	3164.2
Dimensionless value	He 8–1	1	0.01	0.0094	1.00	2	0.9600	1	0.99	1	1.0000
	Shan 1–1	0.41	0.0062	0.0092	0.79	1	0.995 2	0.5	1.00	0.9	0.9647
	Shan 1–2	0.14	0.0082	0.01	0.37	0.5	0.996 8	0.5	1.00	0.91	0.9633
	Taiyuan	0.24	0.0063	0.0092	0.49	0.13	1.000 0	0.5	0.96	0.93	0.9600
	Mutation	0.14	0.0062	0.0092	0.37	0.13	0.960 0	0.5	0.96	0.9	0.9600

Table 4. System target value of each production layer for Well Y.

	He 8_1	Shan 1_1	Shan 1_2	Taiyuan	Relative Mutation Surface
Reserve characteristic feature A	0.9976	0.9078	0.8772	0.8804	0.8577
Development feature B	1.0270	0.9878	0.9293	0.9027	0.8885
Geological feature C	0.9997	0.9790	0.9791	0.9784	0.9749
System target value	1.0081	0.9582	0.928 5	0.9205	0.9070

Table 5. Comparison of actual and predicted values in task 1.

Well	Actual Value	Predicted Value	Deviation
W1	2.21	2.00	9.29%
W2	4.32	3.98	7.92%
W3	1.89	2.12	12.23%
W4	2.21	2.13	3.48%
W5	1.67	1.57	5.79%
W6	3.87	3.49	9.70%
W7	1.21	1.12	7.11%
W8	0.98	1.10	12.11%
W9	2.33	2.04	12.45%
W10	1.96	1.84	6.11%

Table 6. Comparison of mean prediction errors between the MTPLE and DNN models.

Task	Evaluation Indicators	MTPLE	DNN
Task 1	RMSE (10⁴ m³/d)	0.4	0.61
Task 1	R²	0.82	0.68
Task 2	RMSE (10⁴ m³)	74.832	126.96
Task 2	R²	0.78	0.62

Table 7. Comparison of mean prediction errors between MTPLE and classical machine learning.

Task	Evaluation Indicators	MTPLE	KNN	RF	SVM	XGBoost
Task 1	MSE(10⁴ m³/d)	0.09	0.33	0.23	0.23	0.19
Task 1	R²	0.78	0.52	0.65	0.66	0.68
Task 2	MSE(10⁴ m³)	7700.28	23,032.52	19,626.20	12,302.93	12,963.266
Task 2	R²	0.77	0.48	0.52	0.61	0.6

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, D.; Cheng, S.; Wang, H.; Wang, Y. A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning. Processes 2025, 13, 2666. https://doi.org/10.3390/pr13082666

AMA Style

Liu D, Cheng S, Wang H, Wang Y. A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning. Processes. 2025; 13(8):2666. https://doi.org/10.3390/pr13082666

Chicago/Turabian Style

Liu, Dawei, Shiqing Cheng, Han Wang, and Yang Wang. 2025. "A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning" Processes 13, no. 8: 2666. https://doi.org/10.3390/pr13082666

APA Style

Liu, D., Cheng, S., Wang, H., & Wang, Y. (2025). A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning. Processes, 13(8), 2666. https://doi.org/10.3390/pr13082666

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Method for Predicting Gas Well Productivity in Non-Dominant Multi-Layer Tight Sandstone Reservoirs of the Sulige Gas Field Based on Multi-Task Learning

Abstract

1. Introduction

2. Few-Shot Learning Prediction Problems of Traditional Machine Learning Methods

3. Productivity Prediction Model for Multi-Task Progressive Hierarchical Extraction

3.1. Feature Selection Method

3.1.1. Mutation Method to Obtain Daily Production of Each Gas-Bearing Layer

3.1.2. De-Multicollinearity

3.1.3. Feature Importance Evaluation

3.1.4. Feature Number Optimization

3.2. Progressive Hierarchical Extraction Methods

3.3. MTPLE Capacity Forecasting Model

4. Example Analysis of Gas Well Productivity Prediction in Multi-Layer Tight Sandstone Gas Reservoirs

4.1. Data Source and Preprocessing

4.2. Calculation of Gas Well Production by the Mutation Method

4.3. Feature Selection Results

4.3.1. De-Multicollinearity Results

4.3.2. Results of the Feature Importance Calculation

4.3.3. Determine the Number of Task Features

4.4. Validity Analysis of MTPLE Model

4.5. Performance Comparison Between MTPLE and Classical Machine Learning Models

5. Influence of Different Characterization Categories on the Results of Production Capacity Prediction

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI