Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin

Cheng, Ziyun; Huang, Wensong; Zhang, Xiaoling; Lei, Zhanxiang; Hong, Guoliang; Wang, Wenwen; Zhang, Mengyang; Li, Linze; Li, Jian

doi:10.3390/pr14060981

Open AccessArticle

Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin

by

Ziyun Cheng

¹,

Wensong Huang

¹,

Xiaoling Zhang

¹,

Zhanxiang Lei

¹,

Guoliang Hong

¹,

Wenwen Wang

¹,

Mengyang Zhang

^2,3,*,

Linze Li

^2,3 and

Jian Li

¹

PetroChina Research Institute of Petroleum Exploration & Development, Beijing 100083, China

²

National Key Laboratory of Petroleum Resources and Engineering, China University of Petroleum (Beijing), Beijing 102249, China

³

College of Geosciences, China University of Petroleum (Beijing), Beijing 102249, China

^*

Author to whom correspondence should be addressed.

Processes 2026, 14(6), 981; https://doi.org/10.3390/pr14060981

Submission received: 6 February 2026 / Revised: 3 March 2026 / Accepted: 12 March 2026 / Published: 19 March 2026

(This article belongs to the Special Issue Applications of Intelligent Models in the Petroleum Industry)

Download

Browse Figures

Versions Notes

Abstract

Strong heterogeneity and ambiguous seismic responses hinder reliable sandstone thickness prediction when using a single seismic attribute in the lower sandstone interval of the Talang Akar Formation (hereafter abbreviated as the LTAF interval) in the B gas field, South Sumatra Basin. To address this challenge, we propose a seismic attribute fusion and reservoir sweet-spot prediction framework based on a multiscale convolutional neural network (CNN) integrated with a self-attention module. Multiple seismic attribute volumes are organized as multi-channel 2D attribute slices, and parallel convolutions with kernel sizes of 3 × 3, 5 × 5, and 7 × 7 are employed to capture spatial features ranging from thin-bed boundaries and channel morphology to sand-body assemblage distribution. The self-attention module explicitly models inter-attribute dependencies and performs adaptive weighted fusion to suppress noise and emphasize informative attributes. The network adopts a dual-output design, producing (i) a sandstone thickness prediction map at the same spatial resolution as the input and (ii) attribute importance scores for quantitative attribute selection and geological interpretation. Using 3D seismic data and well-constrained thickness labels, the proposed model achieves an R² of 0.8954, outperforming linear regression (R² = 0.8281) and random forest regression (R² ≈ 0.8453). The learned importance scores indicate that amplitude-related attributes (e.g., RMS amplitude and maximum amplitude) contribute most to thickness prediction, whereas frequency- and energy-related attributes show relatively lower contributions, which is consistent with bandwidth-limited resolution effects. Overall, the proposed framework unifies attribute fusion, thickness prediction, and interpretability within a single model, providing practical support for fine reservoir characterization and development optimization in heterogeneous sandstone reservoirs.

Keywords:

South Sumatra Basin; multiscale convolutional neural network; self-attention mechanism; seismic attribute fusion; sandstone thickness prediction

1. Introduction

Seismic exploration is an important technique for reservoir identification and prediction in petroleum exploration and development, which has been widely used in the exploration and development of oil and gas resources worldwide. By analyzing the characteristics of seismic wave propagation in underground strata, seismic exploration can provide critical geological data for reservoir identification, prediction, and development [1,2,3]. In hydrocarbon exploration, there are multiple challenges for the prediction of sandstone reservoirs. The study of deep tight sandstones in the Qigu Formation of the Junggar Basin conducted by Tian Lei et al. (2024) suggests that deposition controls the distribution and composition of sandstone bodies, which is the basic contributor to reservoir heterogeneity, while diagenesis is a key factor that affects the heterogeneity of deep tight sandstones by modifying sandstone pores in terms of structure and type [4]. The study of Guo Song et al. (2013) on the beach-bar sandstones of the Boxing Oilfield in the Bohai Bay Basin shows that these sandstone reservoirs are highly heterogeneous due to the development of thinly interbedded sandstone and mudstone, their hydrocarbon-bearing properties vary significantly in different well areas and sandstone formations, their quality is closely correlated with their hydrocarbon-bearing properties, and the degree of hydrocarbon filling and hydrocarbon distribution are directly controlled by reservoir heterogeneity [5]. Typically, sandstone reservoirs are highly heterogeneous, with complex variations in porosity and permeability distributions, and they are affected by multiple factors such as depositional environments and tectonic deformation. These factors make it very difficult to predict sandstone reservoirs.

In the selection of favorable seismic attributes, traditional methods often employ techniques such as correlation coefficient analysis, principal component analysis, and multiple regression, which are standardized, easy to implement, and computationally efficient. They are suitable for intuitive interpretation based on geological knowledge and can effectively support reservoir prediction when the number of seismic attributes is small and the seismic response characteristics are relatively simple. However, with the significant increase in seismic attribute types and data dimensions, these methods overly rely on linear correlation and human experience, making it difficult to characterize the complex inter-attribute relationships. Furthermore, due to collinearity and redundancy among seismic attributes, the results of attribute selection are easily affected by changes in samples and parameters, and the stability and applicability of these methods are limited [6,7]. Therefore, traditional methods for seismic attribute selection can barely meet the needs of accurate reservoir prediction, and more intelligent, data-driven attribute selection and fusion techniques need to be introduced. In addition, the statistical results for frequency attributes show that the dominant frequency of seismic data is usually in the range of 20–40 Hz (peak frequency: about 25–30 Hz), leading to weakened relationships between traditional seismic attributes and sandstone reservoirs and limited accuracy of reservoir prediction based on a single seismic attribute [8,9]. The relatively low contribution of frequency-related attributes is likely influenced by the limited seismic bandwidth (dominant frequency of ~25–30 Hz) and the corresponding vertical resolution, which may reduce their sensitivity to subtle thickness variations compared with amplitude-related attributes. Therefore, the lower weights reflect data-dependent sensitivity rather than inherent geological irrelevance. More importantly, the seismic responses of sandstone reservoirs often exhibit high ambiguity. Different seismic attributes may lead to different interpretations of the same reservoir. Due to this reason, traditional seismic attribute analysis methods have significant limitations in terms of accuracy and reliability, making it difficult to meet the needs of accurate reservoir prediction [10,11,12].

Therefore, accurately capturing and interpreting reservoir features using more advanced seismic exploration techniques has become a key factor in improving the efficiency of hydrocarbon exploration and development. In recent years, the rapidly developing deep learning techniques have provided new solutions, and in particular, the application of convolutional neural networks (CNNs) and attention mechanisms has achieved breakthroughs in many fields. According to Zhang Chenjia et al. (2021), CNNs integrated with attention mechanisms can adaptively weight features at different scales such as channel and space and have significantly enhanced feature representation capabilities that enable them to achieve better results than traditional methods in complex tasks such as image recognition, fully demonstrating the advantages of deep learning techniques in automatic feature extraction and nonlinear modeling [13]. CNNs can automatically extract multiscale spatial features from raw seismic data [14], while attention mechanisms can highlight key features and suppress irrelevant information by dynamically weighting input features, significantly improving the performance of CNN models in complex tasks [15]. These deep learning techniques have shown great potential in image processing in the field of seismic exploration.

In this paper, a seismic attribute fusion method based on a multiscale CNN and a self-attention mechanism is proposed to address the accuracy and reliability issues in sandstone reservoir prediction. This method can improve the accuracy of reservoir prediction by extracting multiscale features from seismic data through the multiscale CNN and by performing weighted fusion of features through the attention mechanism. It enables more effective extraction of key features from seismic data, providing more reliable technical support for accurately predicting sandstone reservoirs.

In summary, existing works on the prediction of complex sandstone reservoirs still have several deficiencies. First, they often use a single seismic attribute or a small number of seismic attributes for linear modeling, making it difficult to fully explore the nonlinear relationships between multi-source seismic attributes and resulting in limited accuracy of reservoir prediction. Second, they rely primarily on manual experience or simplified statistical indicators for the selection of favorable seismic attributes, lacking a unified framework for quantitative characterization of contributions from seismic attributes. Third, most of the studies on the sandstones in the Lower Talang Akar Formation (LTAF) of the B field in the South Sumatra Basin focus on geological understanding and conventional seismic interpretation, lacking deep learning-driven practices in sandstone thickness prediction. Therefore, an intelligent seismic attribute characterization method integrating a multiscale CNN and a self-attention mechanism was proposed to perform sandstone thickness prediction and seismic attribute selection for the LTAF sandstone reservoir in the B field, and its effectiveness was verified using actual production data.

The main contributions of this study are threefold. (1) We design a multiscale CNN backbone with parallel receptive fields to capture seismic–geological features at multiple spatial scales, which is critical for heterogeneous channelized sandstone reservoirs. (2) We incorporate a self-attention module to adaptively fuse multi-attribute features and mitigate redundancy/noise among seismic attributes. (3) We propose a dual-output strategy that outputs both a thickness prediction map and quantitative attribute importance scores, enabling joint prediction and interpretation within a unified framework. In comparison with existing research, our method offers substantial improvements in both prediction accuracy and interpretability. Recent studies have demonstrated the advantages of integrating seismic attributes with machine learning for reservoir prediction, yet limitations remain. For example, multi-attribute regression frameworks have been successfully applied to predict porosity and shale volume in complex deltaic settings, and feature importance analysis has been used to interpret influential seismic attributes [16,17]. However, these approaches do not fully exploit multiscale seismic information or attention mechanisms. More recently, deep learning-driven multi-frequency seismic inversion methods that combine multiscale convolutional modules with self-attention have shown improved characterization of thin-layer stratigraphy, illustrating the benefits of capturing contextual relationships in seismic data [18,19,20]. Building on these advances, our framework incorporates multiscale learning and self-attention to effectively fuse diverse seismic attributes and enhances interpretability through a dual-output design, providing both thickness prediction maps and attribute importance scores—features that are often lacking in conventional methods.

2. Geological Setting and Overview of the Study Area

2.1. Regional Geological/Tectonic Setting and Location of the Study Area

Located in southeastern Sumatra, the South Sumatra Basin is one of the most important hydrocarbon-bearing basins in Indonesia and one of the regions with the greatest hydrocarbon potential in the world [21]. The formation of this basin is closely related to its unique tectonic setting, and it has undergone multiple stages of tectonic evolution. Its oil and gas resources are mainly concentrated in medium-deep sandstone reservoirs, with great potential and value for exploration and development [22]. The abundant oil and gas resources in the basin are closely related to its complex geological structure, depositional environment, and petrophysical properties. Therefore, systematically studying the geological characteristics of the basin is of great significance for hydrocarbon exploration and development.

The B field is situated in the core area of the South Sumatra Basin. The basin’s tectonic evolution process can be roughly divided into three stages, namely, the rift stage, the post-rift and back-arc subsidence stage, and the inversion and uplift stage. During the rift stage, the subduction of the West Sumatra Trench caused the basin to extend, creating a series of half-grabens. During the post-rift stage, tectonic activity weakened, the subsidence of the basin intensified, the sea level rose, and the basin entered the marine transgression stage. During the inversion and uplift stage, the basin experienced multistage structural inversion accompanied by the uplift of the Barisan Mountains and the formation of compressional-torsional folds. These structural inversion events facilitated the formation and adjustment of various types of reservoirs, providing favorable conditions for hydrocarbon accumulation and preservation [23].

2.2. Stratigraphic and Sedimentary Characteristics

The South Sumatra Basin has a diverse and complex geological framework, and the evolution of its depositional environment and tectonic activity have profoundly influenced the formation and distribution of reservoirs. The depositional environment of the basin has evolved from a lacustrine environment to a shallow sea environment and then to a deep-sea environment. In particular, multistage marine transgressions have significantly affected the spatiotemporal distribution of the basin’s strata and the development of reservoirs in the region.

The main sedimentary units of the basin include the pre-rift basement/bedrock, the rift-stage Lahat, Lemat, and Talang Akar formations, and post-rift formations such as Batu Raja, Gumai, Air Benakat, and Muara Enim. The significant differences in sediment type and tectonic setting among these formations directly control the degree of source rock development and the spatiotemporal distribution of reservoirs. The Talang Akar Formation and the Gumai Formation differ significantly in sediment characteristics and lithology. The Gumai Formation is dominated by marine sediments and volcaniclastic rocks, with well-developed secondary pores providing ample space for hydrocarbon storage. In contrast, the depositional system of the Talang Akar Formation (the main stratigraphic unit studied herein) is dominated by deltaic and fluvial facies, with favorable facies zones such as delta fronts and channel deposits. The sandstones of the Talang Akar Formation generally have high porosity and high permeability, forming a series of high-quality sandstone reservoirs in zones with delta fronts and channel deposits [24]. Due to differences between these two formations in sedimentary facies and lithology, there are multiple types of reservoirs and various hydrocarbon accumulation patterns in the study area, creating great resource potential and broad prospect for subsequent hydrocarbon exploration and development.

In this study, the target interval is the lower sandstone interval of the Talang Akar Formation in the B gas field (hereafter abbreviated as the LTAF interval). This abbreviation is used for convenience within the study area. It is mainly distributed in the rift zone and the slopes on both sides of the rift zone. In terms of lithology, it is dominated by medium- to fine-grained sandstone interbedded with a small number of thin conglomerate and mudstone layers, and channel sandstone is the most prevalent type of reservoir rock. In terms of depositional environment, LTAF was formed in a lacustrine-deltaic depositional system with multiple sedimentary facies, including braided channels, meandering channels, crevasse splays, and floodplains. The delta front sandstones and channel-filling sandstones generally have high porosity and high permeability, making them favorable facies zones for hydrocarbon accumulation. The thickness of LTAF is notably characterized by coupled effects of tectonic-depositional processes. It is relatively thick in the central depression zone, reaching tens of meters to nearly 100 m, and it gradually thins towards structural highs and basin margins until it pinches out. Controlled by both sedimentary facies distribution and faulting, the LTAF sandstone reservoir is highly heterogeneous in both horizontally and vertically, with significant variations in hydrocarbon charge degrees and gas-bearing properties in different sections. Therefore, it is necessary to accurately predict the distribution and thickness of the LTAF sandstone body based on seismic attributes using intelligent algorithms in order to provide support for adjusting the mid- to late-stage development strategy of the B field (Figure 1).

2.3. Research Data Overview

Reservoir prediction in this study was performed using 3D seismic data and well logs from the study area. The 3D seismic volume covers approximately 80 km², with a trace spacing of 20 m × 20 m and a sampling interval of 2 ms, which satisfies the resolution requirements for reservoir-scale geological analysis. Formation of 2D input slices: For each seismic attribute volume, 2D slices were extracted within the target interval bounded by the picked top and base horizons. The selected 18 seismic attributes were stacked along the channel dimension to form multi-channel inputs. Each full 2D slice is defined on a regular grid with a spatial sampling of 20 m × 20 m and a size of H × W = 876 × 712 grid nodes, where H = 876 and W = 712 correspond to the numbers of nodes in the J- and I-directions, respectively. For model training, we constructed well-centered 2D patches cropped from these slices and stacked the same 18 seismic attributes to form patch tensors of shape h × w × 18 (where h and w denote the patch dimensions). Each attribute channel was normalized using z-score normalization based on training set statistics. The label for each patch was the log-interpreted sandstone thickness at the corresponding well location, which was used to supervise thickness prediction. The seismic data contain stable and effective frequency information within the target interval. Sandstone thickness labels interpreted from logs in 113 wells (out of 237 wells in the study area) were used as supervised-learning targets.

During data processing, the seismic volume was first preprocessed and calibrated using conventional workflows. Multiple seismic attributes (e.g., average negative amplitude, average positive amplitude, RMS amplitude) were extracted within the time window of the target interval.

To construct training and validation samples, well markers were tied to the seismic time domain, and multi-attribute seismic responses were matched with the log-derived sandstone thickness at the corresponding well locations. In addition, to ensure comparable magnitudes across different attributes and improve training stability, each attribute channel was normalized using statistics computed from the training set. The dataset was divided into training and validation sets at a ratio of 70%:30% for model training and evaluation.

3. Multi-Source Seismic Attribute Intelligent Fusion and Deep Learning-Based Modeling

In this study, a seismic attribute selection and fusion model based on a multiscale CNN integrated with a self-attention mechanism was fully built, considering the characteristics of the responses of seismic attributes to geological structures and lithofacies at different scales. This model takes multiple seismic attributes as input, extracts local to global features through multiscale convolutions, and then simulates inter-attribute correlations through the self-attention mechanism, achieving a unified process of reservoir prediction and seismic attribute selection (Figure 2).

3.1. Multiscale Convolutional Feature Input and Extraction

The core idea of the multiscale CNN architecture is to extract multilevel/multiscale features using convolution kernels of different sizes, thus capturing multiscale information in the input data more comprehensively [25,26,27]. To capture the details presented by different seismic responses at different scales, multiple convolution kernels of different sizes were employed to extract multilevel/multiscale features from seismic data. The detailed process is described below.

First, two-dimensional (2D) seismic attribute slices are input using Equation (1).

X = [A_{1}, A_{2}, \dots, A_{N}], A_{i} \in R^{(H \times W)}

(1)

where H and W represent the spatial dimensions of the seismic profile, and

A_{i}

represents the ith seismic attribute (amplitude, phase, instantaneous frequency, or impedance).

Following this, multiple convolutional layers with convolution kernels of different sizes are used for feature extraction. The 3 × 3 convolution kernel extracts texture information, amplitude micro-variations, and the boundaries of thin sandstone layers. The 5 × 5 convolution kernel extracts feature related to the morphology, distribution, direction, and lateral extension of channel sandstone bodies. The 7 × 7 convolution kernel captures features related to the overall morphology of channel groups or sandstone groups and the continuity of sandstone bodies.

The output of convolutions at the aforementioned three scales can be expressed as:

F_{k} = f_{k \times k} (X), k \in {3, 5, 7}

(2)

where

f_{k \times k} (\cdot)

represents convolution operations with a kernel size of ^k×k.

Finally, the extracted multiscale features are spliced and fused using Equation (3) to provide the basis for inter-attribute correlation analysis.

F = Concat (F_{3}, F_{5}, F_{7})

(3)

3.2. Feature Enhancement Through Self-Attention Mechanism

Self-attention is an attention mechanism that has been widely used in deep learning in recent years. It can dynamically adjust the weights of input features based on inter-feature correlations, thereby strengthening the learning of important features and suppressing irrelevant features [28,29]. To optimize the feature fusion process of the multiscale CNN, a self-attention mechanism was introduced in this study, obtaining a weight matrix by calculating the correlations between input features. The greater the weight of a feature is, the more important the feature is to the final prediction results. In the seismic attribute fusion process, the weights of seismic attributes can be dynamically adjusted based on inter-attribute correlations to enable the network model to focus more on important features while suppressing noise and irrelevant information. The features extracted by the multiscale CNN in the subsequent feature extraction process are input into the self-attention module. The self-attention module calculates the weight of each feature and performs weighted fusion of features. In this way, the network model can focus on the most important seismic attributes, thereby improving the accuracy of reservoir prediction.

First, the feature map

F

is linearly transformed into query (

Q

), key (

K

), and value (

V

) matrices.

Q = F W_{Q}, K = F W_{K}, V = F W_{V}

(4)

The attention weight matrix is calculated as follows:

Attention (Q, K, V) = softmax (\frac{Q K^{T}}{\sqrt{d_{k}}}) V

(5)

An enhanced feature is obtained through residual connections and layer normalization as follows:

F_{o u t} = LayerNorm (F + Attention (Q, K, V))

(6)

3.3. Dual-Output Design

To obtain and represent the results of seismic attribute fusion more clearly, a heatmap showing the distribution of attribute importance weights (Figure 3) and a fused map of predicted sandstone thickness values (Figure 4) were created based on multiscale convolutional feature extraction and self-attention feature fusion for seismic attribute selection and feature importance interpretation.

3.3.1. Feature Map Obtained by Attribute Fusion

The feature tensor

F_{out}

output by the self-attention module is mapped through the 1 × 1 convolutional layer or up-sampling layer to a sandstone thickness prediction map with the same resolution as the input to represent the spatial distribution of sandstone thickness values corresponding to favorable sandstone bodies. The calculation process can be expressed as follows:

Y_{pred} = f_{conv} (F_{out})

(7)

where

f_{conv} (\cdot)

is the convolutional mapping function, and

Y_{pred}

is the final spatial distribution predicted by the model.

3.3.2. Attribute Importance Analysis

To provide an interpretable measure of attribute contribution, we compute the average response values of various channels (corresponding to seismic attributes) by performing global average pooling (GAP) on the fused feature map in the spatial dimension. This process generates channel-wise statistics, which are then normalized using a SoftMax function, allowing the resulting weights to be interpreted consistently across attributes. These importance scores are used to (i) rank seismic attributes for the target interval and (ii) support the geological interpretation of why the model emphasizes certain seismic responses.

w_{c} = \frac{1}{H^{'} W^{'}} \sum_{h = 1}^{H^{'}} \sum_{w = 1}^{W^{'}} F_{out} (h, w, c)

(8)

where

H^{'}

and

W^{'}

are the height and width of the feature map, respectively, and

F_{out} (h, w, c)

represents the response value of the cth channel at the spatial location

(h, w)

. Then, the SoftMax function is applied for normalization so that the weights of seismic attributes can be directly interpreted as probability distributions.

{\tilde{w}}_{c} = \frac{e^{w_{c}}}{\sum_{j = 1}^{C} e^{w_{j}}}

(9)

Finally, the importance weight (

{\tilde{w}}_{c}

) of each seismic attribute/channel is obtained, which can be used for seismic attribute selection and physical importance analysis. A larger

{\tilde{w}}_{c}

value indicates that the attribute contributes more significantly to reservoir identification and has higher geological sensitivity.

3.4. Loss Function and Training Strategy

During model training, the measured sandstone thickness curve or equivalent sandstone thickness data were used as labeled data for supervised learning, and the fused feature was mapped to a sandstone thickness prediction map of the same size as the input profile. In this paper, the mean squared error (MSE) is used as the loss function.

L_{pred} = \frac{1}{N} \sum_{i = 1}^{N} {(h_{i}^{pred} - h_{i}^{obs})}^{2}

(10)

where

h_{i}^{pred}

and

h_{i}^{obs}

are the predicted and measured sandstone thickness values at the ith well or sampling point, and N is the number of sample points.

To suppress abnormal weight amplification and improve training stability, the L2 regularization term is introduced to form a joint loss function as follows:

L = L_{pred} + λ ∥ w ∥_{2}^{2}

(11)

where

w

is the weight vector of the attribute channel, and

λ

is the balance coefficient. The training data is divided into training and validation sets at a ratio of 70%:30%. The labeled samples were split into training and validation subsets at a ratio of 70%/30%. This split is commonly adopted in supervised learning to balance model fitting and unbiased performance assessment, especially under a limited number of labeled wells. We used the Adam optimizer and early stopping (training terminates when the validation loss does not decrease for a preset number of epochs) to reduce the risk of overfitting. The Adam optimization algorithm is used for backpropagation. To reduce the risk of overfitting, model training is terminated when the validation loss no longer decreases within a certain number of consecutive iterations.

4. Results of Seismic Attribute Fusion and Model Validation

4.1. Seismic Attribute Contribution Matrix and Sandstone Thickness Prediction Results

Based on the trained model, we obtain (i) an attribute importance matrix (Figure 3) by summarizing attention-related statistics and (ii) a fused sandstone thickness prediction map (Figure 4b). Figure 3 indicates that amplitude-related attributes (e.g., RMS amplitude and maximum amplitude) contribute more strongly to thickness prediction, whereas frequency- and energy-related attributes contribute less. This provides a quantitative basis for attribute selection in the LTAF interval.

The matrix in Figure 3 shows the relative contributions of different seismic attributes. Amplitude-related attributes (such as the RMS amplitude and maximum amplitude) contribute more significantly to sandstone thickness prediction, while frequency and energy-related attributes contribute relatively less. This provides a basis for attribute selection. This observation is consistent with the bandwidth-limited resolution of the available seismic data, which may weaken the response of some frequency-derived measures to fine-scale thickness variations. Hence, the lower weights should be interpreted as reduced sensitivity under the present data conditions rather than inherent geological irrelevance.

Compared with (Figure 4a), the fused prediction (Figure 4b) exhibits clearer geological patterns and improved continuity, suggesting that multiscale feature extraction helps capture channel morphology and lateral connectivity, while adaptive fusion suppresses scattered noise and enhances spatially consistent sand-body signals.

The importance of each seismic attribute was quantitatively evaluated using the seismic attribute contribution matrix, and weights were dynamically assigned to seismic attributes based on sandstone thickness prediction. Then, multiple attribute information was deeply mined and synergistically fused using the CNN in combination with the self-attention mechanism to improve the continuity and clarify of the spatial changes in amplitude, frequency, and energy features, significantly improving the accuracy of sandstone thickness prediction and providing reliable reference for subsequent geological modeling and the fine characterization of sandstone reservoirs.

In Figure 4, compared with the original seismic data map that has not been processed by deep learning, the fused map shows clearer key geological features that have been effectively enhanced by multiscale convolutional feature extraction through the CNN and the weighted fusion of extracted features through the self-attention mechanism. In particular, these feature extraction and fusion processes have enhanced the model’s ability to identify sandstone bodies. The amplitude-related attributes (taking the RMS amplitude as an example) and sandstone thickness prediction results obtained after seismic attribute fusion show a clearer trend of accumulation in the spatial dimension, which is difficult to observe in the original seismic data map.

Before attributing fusion, the seismic attributes in the original map are often sparsely distributed and characterized by high noise levels, making it difficult to reveal reservoir complexity and detail. The CNN model with self-attention can effectively suppress noise and irrelevant information while enhancing the representation of key features. In this process, the weights of different seismic attributes are dynamically adjusted by the self-attention mechanism to further highlight areas where sandstone bodies are predicted to exist.

Compared with the original map, the fused thickness prediction map shows the relationship between reservoir features and seismic attributes more clearly, which facilitates interpretation.

4.2. Model Validation

To evaluate the predictive accuracy of the proposed multiscale CNN with self-attention, we performed well-constrained validation and compared the results with baseline methods. The coefficient of determination (R²) was used as the primary metric, and additional residual-based analysis was conducted to inspect deviation patterns.

4.2.1. Validation Metrics

For the purpose of this study, the coefficient of determination (R²) is used as the main metric to verify the model’s accuracy in sandstone thickness prediction. R² measures the goodness of fit between the model’s predictions and actual observations, representing the proportion of actual sandstone thickness variations that the model can interpret. R² is calculated as follows:

R^{2} = 1 - \frac{\sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}{\sum_{i = 1}^{n} {(y_{i} - \bar{y})}^{2}}

(12)

where

y

denotes the measured (actual) sandstone thickness,

\hat{y}

denotes the corresponding model prediction, and

\bar{y}

is the mean of

y

. A larger R² indicates a better agreement between predictions and measurements.

4.2.2. Benchmark Comparison with Baseline Methods

Figure 5 shows the cross-plot between the sandstone thickness predicted by the proposed model and the measured thickness at wells. The validation result indicates a strong correlation, with R² = 0.8954, suggesting that the proposed model can explain most of the variation in sandstone thickness in the study area.

For comparison, we further evaluated two commonly used baselines. First, a linear regression model was built using a representative single seismic attribute (RMS amplitude) as the predictor. The corresponding cross-plot is shown in Figure 6, where the coefficient of determination is R² = 0.8281, which is notably lower than that of the proposed model. Second, a random forest regression model was used to represent a nonlinear baseline in Figure 7. Its performance is R² ≈ 0.8453, which also remains lower than that of the proposed multiscale CNN with self-attention.

Overall, the proposed model achieves the highest R² among the tested methods, indicating improved nonlinear regression capability and better generalization for thickness prediction in complex heterogeneous sandstone reservoirs.

4.2.3. Ablation Study

To quantify the contribution of the self-attention mechanism to attribute fusion, we conducted an ablation study by removing the self-attention module while keeping the multiscale CNN backbone, input attributes, and training strategy unchanged. In addition, we included a single-scale CNN baseline (without multiscale kernels and without attention) to evaluate the benefit of multiscale feature extraction. All models were trained and evaluated using the same dataset split and the same evaluation metrics for fair comparison. The ablation experiments have been designed to quantify the contributions of multiscale feature extraction and the self-attention mechanism under identical data split and training settings. The quantitative results for the ablated models will be reported in Table 1 after completing the additional runs.

This ablation setup enables a controlled evaluation of how multiscale convolutions and self-attention contribute to thickness prediction performance and interpretability, and the finalized statistics will be included in the revised version. The ablation results provide direct evidence on how the self-attention module improves performance by explicitly modeling inter-attribute correlations and enabling adaptive weighting during the fusion process.

4.2.4. Deviations and Uncertainty Analysis

Residuals between predicted and measured sandstone thickness are analyzed using a residual scatter plot (Figure 8) and a residual histogram (Figure 9), where the residual is defined as Predicted − Actual (m). As shown in Figure 8, residuals are generally centered around the zero line across the prediction range with a slight negative bias (mean = −0.41 m; median = −0.21 m), indicating mild underestimation on average. Quantitatively, the overall error level is MAE = 1.01 m and RMSE = 1.37 m. Figure 9 shows that most residuals concentrate near zero, while the tails and a few outliers indicate localized uncertainty. These larger deviations are likely related to (i) low signal-to-noise ratio and local acquisition/processing artifacts, (ii) thin-bed tuning/interference effects, (iii) rapid lateral facies transitions, and/or (iv) structural complexity (faulting or steep dips). Therefore, zones with larger absolute residuals should be treated as higher-risk targets and prioritized for additional verification, such as updated well ties or local seismic conditioning. Overall, the residual analysis suggests stable predictions with meter-level typical errors and limited localized uncertainty.

4.3. Geological and Engineering Implications

The predicted sandstone bodies in the LTAF interval are mainly distributed along braided channels and delta-front fairways in the central depression zone of the B field, which aligns with the established sedimentary facies understanding. Several favorable zones extend toward structural lows and slope areas, indicating potential targets that warrant further evaluation.

From an engineering perspective, the thickness prediction map can be translated into an actionable decision-support workflow (Figure 10). First, it can be used to screen and rank candidate locations for new wells by applying thickness thresholds and connectivity criteria within the predicted sand-body fairways, while avoiding isolated anomalies likely caused by noise. Second, for existing wells located near predicted thickness gradients, the results provide a quantitative basis to prioritize intervention options (e.g., sidetracking or infill drilling) by jointly considering predicted thickness, sand-body continuity, and production dynamics. Third, the attribute importance output offers an interpretable link between predictions and seismic responses, supporting risk management by identifying which attribute groups drive model decisions in different blocks. Overall, the proposed framework provides a feasible technical route for mid- to late-stage development optimization in complex sandstone reservoirs.

5. Conclusions

(1): In this study, an intelligent seismic attribute representation model integrating a multiscale CNN and a self-attention mechanism was built, achieving simultaneous seismic attribute fusion, reservoir thickness prediction, and attribute importance analysis under the same network architecture while ensuring both predictive accuracy and interpretability.
(2): The multiscale convolutional feature extraction module effectively extracts multilevel/multiscale spatial features from thin-layer boundaries and channel morphology to sandstone distribution, while the self-attention mechanism models inter-attribute correlations and performs adaptive weighted fusion of features, enabling key seismic attributes to present more concentrated and continuous spatial responses in zones with thick sandstones. The results of well-constrained model validation show that the model’s R² value for sandstone thickness prediction reaches 0.8954, indicating that the model outperforms baseline methods such as linear regression and random forest.
(3): The results of attribute weight analysis show that amplitude-related attributes such as the RMS amplitude and maximum amplitude are most sensitive to sandstone thickness, while some frequency and energy-related attributes contribute less to sandstone thickness prediction. These results provide a quantitative basis for the selection of favorable seismic attributes for the study area.
(4): The model proposed in this paper has achieved good results in the LTAF sandstone reservoir of the B field in the South Sumatra Basin, and it has the potential for extended application in similar zones with medium- to high-frequency seismic data. However, the model’s performance still depends on the quantity and quality of well constraints and the quality of seismic attributes, and its applicability in areas with low signal-to-noise ratio (SNR), very few wells, and/or higher heterogeneity needs to be further evaluated. In the future, we will attempt to introduce joint constraints of 3D convolution/temporal attention and multi-source data (e.g., well logs, geological models, inversion volumes) and carry out uncertainty quantification to further improve the model’s reliability and engineering applicability.

Author Contributions

Conceptualization, X.Z.; methodology, Z.C.; software, Z.C., Z.L., G.H., W.W., M.Z. and L.L.; validation, Z.C. and X.Z.; formal analysis, Z.C., W.H., Z.L. and J.L.; investigation, Z.L., L.L. and J.L.; resources, Z.C. and X.Z.; data curation, Z.C., W.H. and G.H.; writing—original draft, Z.C.; writing—review & editing, M.Z.; visualization, W.H. and W.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The seismic and well data used in this study are proprietary and confidential to the data owner/field operator and are not publicly available due to legal and commercial restrictions. The data may be made available from the corresponding author upon reasonable request and with the permission of the data owner.

Conflicts of Interest

Ziyun Cheng, Wensong Huang, Xiaoling Zhang, Zhanxiang Lei, Guoliang Hong, Wenwen Wang and Jian Li were employed by PetroChina Research Institute of Petroleum Exploration & Development. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Liu, J.F. Application and development of seismic exploration technology in petroleum and geological exploration. Petrochem. Technol. 2020, 27, 137–138. [Google Scholar]
Xiao, P. Study of the development status and trend of petroleum exploration technology. Petrochem. Technol. 2020, 27, 285–286. [Google Scholar]
Cao, H.; Dai, S.H.; Li, J.F.; Zheng, X.F.; Liu, H.S.; Zhou, K. Seismic exploration technology for tight oil and gas reservoirs in China and its development direction. Acta Pet. Sin. 2025, 46, 236–254. [Google Scholar] [CrossRef]
Tian, L.; Song, F.; Zhang, S.Q.; Yang, Y. Heterogeneity of deep tight sandstone reservoirs and controlling factors—A case study of the Jurassic Qigu Formation in Yongjin Area of the Junggar Basin. J. Northeast. Pet. Univ. 2024, 48, 29–40, 136–137. [Google Scholar]
Guo, S.; Tan, L.J.; Lin, C.Y.; Li, H.N.; Lü, X.X. Effect of reservoir heterogeneity on hydrocarbon accumulation—A case study of the beach-bar sandstones of the upper sub-section of the 4th member of the Shahejie Formation in Boxing Oilfield. Pet. Geol. Exp. 2013, 35, 534–538. [Google Scholar]
Deng, Y.; Liu, X.H.; Wang, L.L.; Zhang, H.L.; Liu, Z.X.; Hao, J.J. Study and application of thin reservoir prediction methods based on broadband seismic data. Comput. Tech. Geophys. Geochem. Explor. 2025, 47, 209–221. [Google Scholar]
Jiang, Z.H. Research on reservoir prediction based on intelligent frequency-divided inversion—A case study of the southern exploration area of the Heiyupao Depression, Daqing. Chin. J. Geol. 2021, 56, 1052–1061. [Google Scholar]
Cui, Y.Z.; Yao, R.; Pan, H.H.; Cui, G.Z.; Guo, J.L.; Wang, X.H.; Liu, Y.Y.; Xu, C.Y.; Liu, S. Application of multi-filter combination in seismic frequency expanding. Geophys. Prospect. Pet. 2025, 64, 1060–1071. [Google Scholar] [CrossRef]
Wang, J.; Ren, X.F.; Zhang, J.H.; Wang, X.F.; Zhang, P.; Zhang, D. Thickness prediction method for favorable sandstone and conglomerate reservoirs based on PCA-RF. CT Theory Appl. 2021, 29, 311–318. [Google Scholar] [CrossRef]
Wang, H.X.; Feng, S.Y.; Liu, Z.L. A geological structure identification model based on an improved random forest algorithm. Int. J. Coal Sci. Technol. 2023, 10, 25. [Google Scholar]
Liu, W.X.; Wang, H.; Wan, Q.H.; Heng, L.Q.; Yin, N.X.; Pan, S.J. Fine-scale analysis of the configuration of braided river delta reservoirs based on frequency-divided RGB fusion technology. J. Earth Sci. Environ. 2022, 44, 765–774. [Google Scholar]
Li, T.T.; Wang, Z.; Ma, S.Z.; Wang, Z.; Yuan, Z.L. A review of seismic attribute fusion methods. Prog. Geophys. 2015, 30, 378–385. [Google Scholar]
Zhang, C.J.; Zhu, L.; Yu, L. A review of attention mechanisms in convolutional neural networks. Comput. Eng. Appl. 2021, 57, 64–72. [Google Scholar]
Wang, S.H.; Yu, H.Z.; Tan, S.Q.; Chen, X.G. Research on seismic facies identification technology based on deep convolutional neural networks. Comput. Tech. Geophys. Geochem. Explor. 2020, 42, 475–480. [Google Scholar]
Zhao, M.; Zhao, Y.; Shen, D.H.; Wang, J.; Dai, X. High-resolution processing of seismic data using the U-net adaptive attention mechanism. Oil Geophys. Prospect. 2024, 59, 675–683. [Google Scholar]
Ejaita, E.; Omonefe, F. Multi-attribute predictions of volume of shale and porosity from seismic data over Akaso Field using machine learning regression. Discov. Geosci. 2026, 4, 80. [Google Scholar] [CrossRef]
Tan, P.; Fu, S.; Huang, L.; Chen, Z.; Cao, J. Effects of orthogonal cleat structures on hydraulic fracture evolution behavior. Geoenergy Sci. Eng. 2024, 241, 213119. [Google Scholar] [CrossRef]
Zhang, J.; Shan, X.; Huo, S.; Huang, L.; Zheng, W.; Zhou, X.; Liu, E. Deep learning-driven multi-frequency seismic inversion for enhanced thin-layer stratigraphic characterization. J. Appl. Geophys. 2025, 239, 105749. [Google Scholar] [CrossRef]
Huang, L.; Liu, J.; Zhang, F.; Dontsov, E.; Damjanac, B. Exploring the influence of rock inherent heterogeneity and grain size on hydraulic fracturing using discrete element modeling. Int. J. Solids Struct. 2019, 176, 207–220. [Google Scholar] [CrossRef]
Ilyushin, Y.V.; Novozhilov, I.M. Temperature field control of a metal oil-well tubing for producing of high-paraffin oil. In Proceedings of the 2020 XXIII International Conference on Soft Computing and Measurements (SCM), Online, 27–29 May 2020; IEEE: New York, NY, USA, 2020; pp. 149–152. [Google Scholar]
Li, M.Y.; Zhu, R.K.; Hu, S.Y. Geological characteristics and resource potential of overseas terrestrial shale oil. Lithol. Reserv. 2022, 34, 163–174. [Google Scholar] [CrossRef]
Hong, G.L.; Wang, H.J.; Zhu, H.Q.; Bai, Z.H.; Wang, W.W. Hydrocarbon accumulation conditions and favorable zones of the Miocene Gumai lithologic reservoirs in Block J of the South Sumatra Basin. Lithol. Reserv. 2023, 35, 138–146. [Google Scholar]
Hao, R.L.; Huang, W.H.; Hu, G.C.; Liu, J.Y.; Zhu, H.L. Superposition pattern of Miocene progradational deltaic sandstone bodies in Block J of the South Sumatra Basin. Pet. Geol. Recovery Effic. 2020, 27, 52–62. [Google Scholar]
Chen, W.; Shi, K. Multi-scale Attention Convolutional Neural Network for time series classification. Neural Netw. 2021, 136, 126–140. [Google Scholar] [CrossRef] [PubMed]
Yang, X.; Zhu, Y.; Guo, Y.; Zhou, D. An image super-resolution network based on multi-scale convolution fusion. Vis. Comput. 2022, 38, 4307–4317. [Google Scholar] [CrossRef]
Liu, X.; Ng, A.H.-M.; Lei, F.; Ren, J.; Liao, X.; Ge, L. Hyperspectral Image Classification Using a Multi-Scale CNN Architecture with Asymmetric Convolutions from Small to Large Kernels. Remote Sens. 2025, 17, 1461. [Google Scholar] [CrossRef]
Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Long Beach, CA, USA, 4–9 December 2017; pp. 5998–6008. [Google Scholar] [CrossRef]
Liu, J.W.; Liu, J.W.; Luo, X.L. Research progress in attention mechanism in deep learning. Chin. J. Eng. 2021, 43, 1499–1511. [Google Scholar] [CrossRef]
Chaudhari, S.; Mithal, V.; Polatkan, G.; Ramanath, R. An Attentive Survey of Attention Models. ACM Trans. Intell. Syst. Technol. 2021, 12, 1–32. [Google Scholar] [CrossRef]

Figure 1. Composite stratigraphic column of the South Sumatra Basin showing lithology, depositional facies, sequence stratigraphy, and sea-level changes; different dot patterns in sandstone indicate grain-size variations.

Figure 2. Schematic of the proposed multiscale CNN self-attention framework. Multiscale convolutions (kernel sizes 3 × 3, 5 × 5, and 7 × 7) extract spatial features at different scales, which are fused by a self-attention module. The network produces dual outputs: a sandstone thickness prediction map and attribute importance scores for interpretability.

Figure 3. Seismic attribute contribution matrix (based on attention weight statistics).

Figure 4. Map comparison for seismic attributes, fused prediction, and facies interpretation within the target interval of the B gas field. (a) RMS amplitude attribute map. (b) Fused seismic attribute response and predicted sandstone thickness distribution obtained from the proposed multiscale CNN with self-attention; validation wells are marked by the well symbols in the map to enable well-constrained consistency checking. (c) Interpreted facies zoning (distributary channel, overbank overflow, and interdistributary bay), used to support geological interpretation and to assess the spatial consistency between predicted thick-sand zones and sedimentary facies architecture. The scale bar is shown at the top of each panel, and the arrow indicates the north direction. Well locations are marked by circle-with-cross symbols, representing validation wells used for consistency checking.

Figure 5. Cross-plot of sandstone thickness predicted by the proposed model versus well-measured thickness (m). Each × symbol represents a well data point. The dashed line denotes the fitted regression, and the coefficient of determination is R² = 0.8954.

Figure 6. Cross-plot of sandstone thickness predicted by the linear regression model versus well-measured thickness (m). Each × symbol represents a well data point. The dashed line denotes the fitted regression, and the coefficient of determination is R² = 0.8281.

Figure 7. Random Forest predictions versus measured sandstone thickness (m). Each × symbol represents a well data point. The dashed line denotes the fitted regression (R² ≈ 0.8453).

Figure 8. Residual scatter plot of sandstone thickness prediction, where residuals are defined as Predicted − Actual (m). The dashed line indicates zero residual; points above/below the line represent over-/under-prediction, and larger absolute residuals indicate higher prediction uncertainty.

Figure 9. Histogram of residuals (Predicted − Actual, m) for sandstone thickness prediction. The peak near zero indicates that most predictions have small errors, whereas the spread and tails reflect high-deviation cases that may require further validation.

Figure 10. Engineering decision-support workflow based on the proposed framework. Multi-attribute seismic data and well-constrained thickness labels are used to construct training/validation samples and train the multiscale CNN integrated with a self-attention module. The model produces dual outputs: (i) a sandstone thickness prediction map and (ii) attribute importance scores for interpretability. The thickness map is further converted into actionable products via thresholding and connectivity-based filtering to support candidate-zone ranking for new wells and intervention planning, while the attribute importance output supports interpretation and risk management. Different colored boxes represent major stages of the workflow, and the numbered boxes (Box 1–Box 6) indicate sequential processing steps from data input to engineering decision-making.

Table 1. Ablation study design for quantifying the effects of multiscale convolution and the self-attention module (results to be filled).

Model	Multiscale	Self-Attention	R²
Single-scale CNN	No	No	TBF
Multiscale CNN (w/o attention)	Yes	No	TBF
Proposed (Multiscale + attention)	Yes	Yes	0.8954

Note: TBF (to be filled) denotes the quantitative results of the additional ablation runs under the same data split and training settings.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, Z.; Huang, W.; Zhang, X.; Lei, Z.; Hong, G.; Wang, W.; Zhang, M.; Li, L.; Li, J. Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin. Processes 2026, 14, 981. https://doi.org/10.3390/pr14060981

AMA Style

Cheng Z, Huang W, Zhang X, Lei Z, Hong G, Wang W, Zhang M, Li L, Li J. Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin. Processes. 2026; 14(6):981. https://doi.org/10.3390/pr14060981

Chicago/Turabian Style

Cheng, Ziyun, Wensong Huang, Xiaoling Zhang, Zhanxiang Lei, Guoliang Hong, Wenwen Wang, Mengyang Zhang, Linze Li, and Jian Li. 2026. "Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin" Processes 14, no. 6: 981. https://doi.org/10.3390/pr14060981

APA Style

Cheng, Z., Huang, W., Zhang, X., Lei, Z., Hong, G., Wang, W., Zhang, M., Li, L., & Li, J. (2026). Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin. Processes, 14(6), 981. https://doi.org/10.3390/pr14060981

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Seismic Attribute Fusion and Reservoir Prediction Using Multiscale Convolutional Neural Networks and Self-Attention: A Case Study of the B Gas Field, South Sumatra Basin

Abstract

1. Introduction

2. Geological Setting and Overview of the Study Area

2.1. Regional Geological/Tectonic Setting and Location of the Study Area

2.2. Stratigraphic and Sedimentary Characteristics

2.3. Research Data Overview

3. Multi-Source Seismic Attribute Intelligent Fusion and Deep Learning-Based Modeling

3.1. Multiscale Convolutional Feature Input and Extraction

3.2. Feature Enhancement Through Self-Attention Mechanism

3.3. Dual-Output Design

3.3.1. Feature Map Obtained by Attribute Fusion

3.3.2. Attribute Importance Analysis

3.4. Loss Function and Training Strategy

4. Results of Seismic Attribute Fusion and Model Validation

4.1. Seismic Attribute Contribution Matrix and Sandstone Thickness Prediction Results

4.2. Model Validation

4.2.1. Validation Metrics

4.2.2. Benchmark Comparison with Baseline Methods

4.2.3. Ablation Study

4.2.4. Deviations and Uncertainty Analysis

4.3. Geological and Engineering Implications

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI