1. Introduction
Carbonate reservoirs represent one of the most significant hydrocarbon-bearing systems worldwide [
1,
2,
3,
4]. They are extensively developed in major basins across the Middle East, North Africa, and China, including the Tarim, Sichuan, and Ordos basins. These reservoirs play a key role in reserve growth and sustained production in unconventional and deep hydrocarbon exploration [
5,
6]. In contrast to clastic reservoirs, carbonate reservoirs undergo far more complex formation and alteration processes. Their pore systems are controlled by variations in depositional facies, diagenetic processes, dissolution, and multiple phases of tectonic deformation. As a result, carbonate reservoirs develop diverse and multi-scale pore structures, including fractures, matrix pores, and vugs. In particular, fracture–vuggy carbonate reservoirs, formed by the combined development of structural fractures and dissolution-enlarged voids, display pronounced heterogeneity, discontinuity, and anisotropy [
7,
8]. Their seismic signatures—often including bead-like strong reflections, chaotic events, and localized energy anomalies—pose substantial challenges for accurately predicting and characterizing key petrophysical parameters such as porosity.
The Tarim Basin represents China’s key region for deep to ultra-deep marine carbonate exploration and production [
9,
10]. The XX well block is located on the western flank of the Tabei Uplift in northern Tarim. It lies within a NE-trending strike-slip fault system between the Halaha–Tang Depression and the Shuntogole Low Uplift. This area is a typical zone where fault-controlled fracture–vuggy carbonate reservoirs are extensively developed. Multi-phase strike–slip faulting coupled with prolonged hydrothermal alteration has generated a composite “fault–dissolution–vug” reservoir system along major faults and associated secondary faults [
8]. Reservoirs are typically buried deeper than 7000 m and exhibit highly complex spatial architectures. Drilling and production results show that high-yield wells are concentrated in fault–karst composite zones and in areas with intense fracture and vug development. Reservoir properties such as porosity and permeability can change abruptly over short distances, reflecting pronounced heterogeneity and discontinuity. Moreover, seismic data at such depths often suffer from low signal-to-noise ratios and limited bandwidth. These issues are mainly caused by strong attenuation and pronounced lateral velocity variations. The limited availability of well control and the susceptibility of logs to high-temperature, high-pressure environments and unstable borehole conditions further constrain the spatial continuity and quantitative reliability of conventional seismic inversion and attribute analysis [
11]. Consequently, achieving high-accuracy porosity prediction for the fracture–vuggy carbonate reservoirs in the XX well block—under complex structural settings and strong reservoir heterogeneity—has emerged as a critical scientific challenge for deep-target inversion and sweet-spot evaluation in the Tarim Basin.
Over the last two decades, substantial efforts have been devoted both domestically and internationally to improving the identification and prediction of carbonate reservoirs. Early investigations primarily employed seismic attribute analysis and spectral decomposition to delineate vug bodies and fracture-enhanced zones [
12,
13]. For instance, studies by Wang et al. and Hendry et al. showed that spectral decomposition combined with impedance inversion can reveal dissolution features, yet the vertical resolution remains fundamentally constrained by seismic wavelength, limiting the detection of fine-scale pore structures [
14,
15]. The introduction of geostatistical inversion and facies-controlled inversion later shifted reservoir prediction from qualitative characterization toward more quantitative modeling. For instance, Li et al. proposed a two-stage facies-controlled inversion workflow, while Liu et al. incorporated structure tensors with coherence attributes to improve impedance inversion in carbonate settings [
16,
17]. Despite these advances, most existing approaches still rely heavily on statistical correlations among seismic attributes. The underlying rock-physics mechanisms are often underutilized, which limits predictive reliability. The fundamental rock-physics mechanisms that govern carbonate reservoirs are often underutilized, limiting predictive reliability.
In recent years, the rapid development of artificial intelligence, especially deep learning technology, has provided new ideas for seismic inversion and reservoir parameter prediction under complex reservoir conditions [
18,
19,
20,
21]. Convolutional neural networks (CNNs) and their encoder–decoder variants such as U-Net have been extensively applied to seismic facies classification, fracture detection, lithology identification, and porosity prediction [
22]. However, the intrinsic architectural constraints of CNNs hinder their ability to model the long-range dependencies essential for characterizing fracture–vug systems. Transformer architectures, driven by self-attention mechanisms, excel in complex image recognition by enabling global context modeling and dynamically emphasizing salient features [
23]. Transformer architectures have recently gained attention in geophysical applications. They are particularly effective at modeling long-range dependencies through self-attention mechanisms. The hybrid TransUNet architecture integrates U-Net’s multi-scale local feature extraction with the Transformer’s global contextual awareness. This enables effective detection of diverse “point–patch–band” fracture/vug patterns and offers a promising pathway for porosity prediction and fracture quantification in complex carbonate reservoirs. Qi et al. developed a CNN–BiLSTM–Transformer hybrid model for porosity estimation in carbonate reservoirs [
24]. Yan et al. applied a TransUNet architecture integrating Transformer, U-Net, and dual-attention mechanisms to the detection of carbonate and karst cave systems [
25]. However, in ultra-deep and structurally complex areas such as the XX well block, the limited quantity of well control severely restricts purely data-driven approaches, increasing the risk of local overfitting and degrading model performance when extrapolating between wells [
26,
27,
28]. Moreover, without explicit physical constraints, deep models may achieve statistically high accuracy yet still generate porosity distributions inconsistent with known rock-physics relationships or geological trends, compromising interpretability and limiting generalization [
29]. Therefore, integrating carbonate-specific rock-physics mechanisms with sparse well-log constraints offers a critical pathway toward improving the reliability and geological realism of porosity prediction [
30,
31].
Building on these insights, this study focuses on the fracture–vuggy carbonate reservoirs in the XX well block of the Tarim Basin and proposes a porosity prediction method that integrates rock-physics constraints with well-log supervision. The workflow is grounded in a carbonate-specific rock-physics model that defines quantitative links between porosity, lithology, fluid properties, and elastic parameters. These relationships are incorporated as prior constraints into a unified seismic–well-log learning and inversion framework. In addition, high-confidence porosity logs are used as supervision targets to calibrate the network, enabling a synergistic integration of mechanism-driven and data-driven constraints. By jointly leveraging multi-attribute seismic data and rock-physics responses, the method delivers porosity predictions with improved geological realism and lateral continuity, even in settings with limited well control and pronounced reservoir heterogeneity. The case study from the XX well block demonstrates that the proposed hybrid strategy is effective in identifying highly porous fracture–vug zones and evaluating sweet spots. It offers valuable technical support for fine-scale characterization and development planning of deep carbonate reservoirs.
3. Results
3.1. Study Area Overview
The Tarim Basin represents one of China’s most significant provinces for deep and ultra-deep marine carbonate exploration. In its northern sector, thick platform carbonates of the Middle–Lower Ordovician are extensively developed, forming a core target interval for deep hydrocarbon reserve growth and stable production. The XX well block lies on the western flank of the Tabei Uplift in northern Tarim Basin, situated within a NE-trending strike–slip system between the Halaha–Tang Depression and the Shuntogole Low Uplift. Bounded by the Luntai Uplift to the north, the Manjaer Depression to the south, and the Yingmaili High to the west, it constitutes one of the most prospective deep hydrocarbon accumulation zones in the western Tabei Uplift.
The Middle–Lower Ordovician interval comprises thick platform limestones, dolomites, and their diagenetic derivatives. Overprinting effects of facies variations, diagenesis, hydrothermal alteration, and multiple tectonic episodes have produced a complex reservoir architecture characterized by multi-scale, multi-type fractures, pores, and vugs. Multi-phase strike–slip and transpressional deformation has led to the widespread development of “fault–dissolution–vug” composite reservoirs along major and subsidiary faults, forming a typical fault-controlled fracture–vug system. Hydrothermal dissolution along steep, deep-seated faults has produced extensive collapse structures, vertical caverns, and characteristic bead-like strong reflections within the Yijianfang and upper Yingshan formations—representing the dominant high-yield reservoir types in the region. Reservoir burial depths commonly exceed 7000 m in the XX block, with some wells deeper than 8000 m, exhibiting the typical “ultra-deep, ultra-high pressure, and ultra-high temperature” attributes. Intense tectonic–karst overprinting has produced extreme heterogeneity, with abrupt lateral and vertical changes in porosity, permeability, and fracture intensity. High-output wells occur mainly within fault–karst composite belts and fracture–vug zones, whereas nearby wells may be low-yielding or ineffective, complicating efforts to delineate reservoir connectivity, effective thickness, and spatial distribution.
The FI7 fault belt of the XX block lies within a prime segment of the Tabei strike–slip system and represents a key target zone for reserve enhancement and production in fault-controlled fracture–vug carbonate reservoirs. Numerous wells have yielded high oil and gas flows from Ordovician fault-controlled fracture–vug reservoirs, indicating that the FI7 zone benefits from favorable hydrocarbon charge, efficient fault-conduit pathways, and strongly developed fracture–vug storage space. However, the FI7 fault belt exhibits pronounced segmentation, with substantial variations in fault activity, dip angle, linkage patterns, and karst modification along the fault plane. These factors lead to highly complex reservoir boundaries and intricate internal cave–fracture–pore systems, as well as secondary faults acting as fluid conduits or barriers, representing major sources of uncertainty in reservoir prediction.
The available geophysical dataset comprises high-resolution 3D seismic data over the FI7 belt, supplemented by sparse well logs, mud logs, and test data from several ultra-deep wells. Sparse well control and log distortions caused by high-temperature, high-pressure, and unstable borehole conditions further reduce the spatial continuity and quantitative reliability of porosity prediction using single-attribute or conventional inversion approaches. The challenge is even greater in fracture–vuggy carbonates, where seismic signatures frequently include bead-like strong reflections, chaotic events, and localized energy anomalies, further complicating porosity inversion.
Figure 2 presents a representative post-stack seismic section, in which large bead-like strong reflections along the fault trace reveal vertical vug/cave accumulations and collapse features. Considering the intricate tectonic–sedimentary–karst coupling and data limitations, this study focuses on the fracture–vuggy carbonate reservoirs of the FI7 fault belt, systematically investigating the development mechanisms and seismic response characteristics of “fault–dissolution–vug” composite systems. By incorporating rock-physics constraints and integrating sparse well-log porosity data with multi-attribute seismic information, we establish a high-resolution porosity prediction and reservoir-structure characterization framework tailored for ultra-deep carbonates. This provides robust technical support for sweet-spot evaluation, reservoir geological modeling, and development planning.
3.2. Evaluation of Porosity Prediction Performance
The study area is covered by a 3D seismic survey, and this work utilizes a 3D seismic dataset acquired across the entire block. Beyond standard processing steps—static correction, amplitude recovery, denoising, multiple suppression, deconvolution, velocity analysis, and prestack migration—the dataset was further enhanced using structural-oriented filtering, spectral optimization, and imaging-quality improvement techniques. These steps significantly improved the continuity and resolution of deep reflectors, yielding a high-SNR, well-imaged 3D post-stack seismic volume suitable for subsequent attribute extraction, inversion, and detailed reservoir characterization.
Figure 3 illustrates the 3D seismic coverage and the distribution of representative wells across the study area, providing constraints on fault geometry and reservoir lateral heterogeneity. The straight line marks a selected inter-well profile that highlights the seismic response between wells and the lateral variations in the target interval.
Figure 4 presents the post-stack seismic section along this inter-well line. Major fault structures and strong-amplitude reflections of the target reservoir are clearly imaged, providing essential support for well–seismic calibration, seismic facies analysis, and subsequent reservoir prediction. The inter-well line is oriented along the main controlling fault and the dominant reservoir trend, comprising 2081 CDPs.
Figure 4a displays CDPs 1–1040, while
Figure 4b covers CDPs 1401–2081. Four wells are positioned along the line at evenly distributed CDP locations, as summarized in
Table 1. The spatial arrangement of wells along the profile effectively captures faulted zones and favorable reservoir belts, while representing different structural settings and reservoir distributions. This configuration offers strong constraints for well–seismic calibration, facies interpretation, and the training and validation of porosity prediction models.
The well logs from WELL1, WELL2, and WELL3 together with their corresponding near-well seismic traces were used to construct the training dataset, whereas WELL4 served as the blind-well validation dataset.
Figure 5 illustrates the porosity logs from the four wells. Rock-physics modeling also requires water saturation and clay content; however, as this study focuses primarily on porosity prediction, these parameters were assigned layer-wise empirical values derived from statistical analyses of logs within the study area. Comparison of predicted porosity with core data shows that the model responds strongly within vuggy intervals. For core porosity exceeding 6%, the predictions reliably capture the associated density drop and the increase in acoustic slowness. In intervals affected by borehole enlargement (CALI anomalies), the incorporation of deep-learning feature construction and derivative-based constraints prevents spurious deviations, demonstrating the strong robustness of the predictions. Furthermore, in sections showing abrupt density drops or rapid increases in acoustic slowness, the model consistently identifies fracture–vuggy reservoir intervals, effectively avoiding the “over-smoothed misclassification” issue typical of purely data-driven models. The Physics-Constrained TransUNet was trained through iterative optimization on the training dataset, and the loss-function convergence curve is presented in
Figure 6. As training proceeded, the loss value decreased steadily and stabilized at around epoch 30, indicating that the network had effectively learned the nonlinear seismic–porosity mapping. The embedded physics-based constraints helped suppress overfitting and improved the geological plausibility of the inversion results. Once convergence was achieved, the optimal network parameters were selected for predicting porosity in the target well.
The trained model was applied to the blind well WELL4, and the corresponding porosity prediction results obtained using the proposed Physics-Constrained TransUNet method are shown in
Figure 7. The predicted porosity log exhibits good agreement with the interpreted well-log porosity in both overall trends and local variations, accurately delineating high- and low-porosity intervals of the reservoir. In particular, the method demonstrates strong conformity within key reservoir sections, indicating its capability to capture relevant geological features. For comparison,
Figure 8 presents the porosity prediction results derived from an empirical formula commonly used for carbonate reservoirs. A comparison between
Figure 7 and
Figure 8 shows that, although the empirical approach provides a reasonable approximation of the general porosity trend, it tends to produce overly smooth results and fails to adequately resolve local fluctuations and small-scale heterogeneity. In contrast, the proposed method not only preserves the large-scale trend but also achieves superior recovery of fine-scale porosity variations, better reflecting the complex pore structure of fracture–vuggy carbonate reservoirs.
To quantitatively evaluate the prediction performance,
Table 2 reports the Pearson correlation coefficient (PCC), mean relative error (MRE), and coefficient of determination (R
2) metrics for both the proposed method and the empirical approach at WELL4. The results indicate that the Physics-Constrained TransUNet consistently outperforms the empirical method across all evaluation metrics, demonstrating improved prediction accuracy and stability. Furthermore,
Table 3 compares the correlations between measured porosity logs and elastic parameters (P-wave velocity, S-wave velocity, and density) with those between predicted porosity and the corresponding elastic parameters. The comparable correlation patterns observed in these two cases indicate that the predicted porosity retains physically consistent relationships with elastic properties. This consistency further validates the effectiveness and geological reliability of the proposed method from a rock-physics perspective.
Applying the trained Physics-Constrained TransUNet model to the seismic data along the selected inter-well line yields the porosity prediction section shown in
Figure 9.
Figure 10 presents the porosity prediction slice extracted from the main target interval. The results demonstrate that the proposed model exhibits clear advantages in predicting fracture–vuggy carbonate reservoirs. The predicted section captures the lateral continuity of porosity with high clarity and effectively identifies highly heterogeneous features such as fracture–dissolution composite bodies that exhibit significant scale variations. In complex fracture–vug development zones, the model accurately delineates the position, extent, and morphology of porosity anomalies, outperforming traditional attribute stacking and conventional inversion techniques by a substantial margin. Overall, the model demonstrates strong sensitivity and stability in identifying fracture–vuggy reservoirs and characterizing porosity distributions, offering a robust technical foundation for the fine-scale evaluation of carbonate reservoirs.
4. Discussion
The Physics-Constrained TransUNet proposed in this work exhibits strong predictive performance in ultra-deep fracture–vuggy carbonate reservoirs. However, several aspects related to its theoretical foundations, geological applicability, and practical deployment warrant further examination. Methodologically, the incorporation of rock-physics constraints narrows the solution space of the deep-learning model, enabling predictions that more closely follow realistic geological patterns at both large and fine scales. In settings with sparse well control and strong heterogeneity, purely data-driven models often struggle with unstable inter-well extrapolation and blurred porosity transitions. In contrast, the physical constraints impose a physically meaningful evolution pathway for elastic responses, enhancing lateral continuity and stability near fracture–karst systems. It is important to acknowledge that rock-physics models rely on idealized assumptions. Features such as anisotropic pore systems, variable fracture apertures, and scale effects of large karst cavities are difficult to capture accurately. As a result, deviations may still arise near extreme geological bodies such as large collapse zones or giant caverns. Moreover, because logging data in ultra-deep carbonate reservoirs are sparse, key parameters such as water saturation and clay content often rely on empirical assignments. This inevitably weakens the rigor of the physical constraints and may introduce additional uncertainty.
Regarding the coordination between data characteristics and network architecture, TransUNet effectively integrates CNNs’ local feature extraction with Transformers’ long-range dependency modeling, providing advantages in identifying complex fracture–vug systems. Nevertheless, inherent limitations of post-stack seismic—restricted bandwidth, limited resolution, and wavelet instability—mean that small-scale features at depth may still be inadequately resolved. Although structural-oriented filtering, spectral enhancement, and other processing techniques have improved seismic data quality, the fundamental physical limitations of seismic imaging continue to restrict prediction accuracy. Furthermore, although physical constraints mitigate overfitting, deep-learning networks still rely on adequate near-well training data. In areas with extremely sparse well control, network expressiveness becomes limited—a widespread challenge in deep carbonate reservoir studies.
In this study, a multi-well training and validation strategy was adopted to better reflect realistic field-application scenarios under limited well control. Multiple wells within the study area were jointly used to construct the training dataset, while an additional well (WELL4) was reserved exclusively as an independent blind well for generalization assessment. This strategy allows the model to learn shared seismic–porosity relationships across different wells while providing a more stringent and practical evaluation of inter-well extrapolation capability than random sample-level splitting alone. Given the strong heterogeneity of fracture–vuggy carbonate reservoirs and the limited number of available wells, this approach offers a reasonable balance between training robustness and validation reliability. Potential extensions, such as systematic cross-validation across wells, may further enhance statistical representativeness when more well data become available.
Overall, the proposed collaborative inversion framework substantially enhances both the geological plausibility and spatial continuity of porosity predictions, though opportunities for further improvement remain. Future research may advance in three directions: (1) incorporating digital rock models or multi-scale CT imaging to better capture the true complexity of multi-type pore structures; (2) integrating fracture- and pore-sensitive attributes—such as pre-stack AVAZ, spectral attenuation, and elastic impedance—into the dataset to improve the model’s ability to resolve complex reservoirs; (3) embedding uncertainty quantification techniques within the deep-learning framework to assess prediction confidence, offering more interpretable support for reservoir risk assessment and development planning.
In summary, integrating physical constraints with deep learning provides a promising new pathway for porosity prediction in fracture–vug carbonate reservoirs. The approach shows significant potential, yet continued progress is needed in improving rock-physics realism, enhancing model generalization, and better integrating multi-source geological and geophysical data.
5. Conclusions
This study addresses key challenges in the XX well block of the Tarim Basin—strong heterogeneity in fracture–vuggy carbonate reservoirs, sparse well control, and limited seismic resolution—by developing a Physics-Constrained TransUNet framework that combines rock-physics constraints with the representational advantages of deep learning. The workflow begins by constructing a rock-physics model tailored for marine carbonates, integrating mineral composition, pore structure, water saturation, and fluid properties into a unified quantitative framework that ensures physical consistency between porosity, elastic properties, and seismic responses. These rock-physics priors, together with regularization terms and well-log supervision, are embedded within the TransUNet architecture to form a hybrid “physics-driven plus data-driven” collaborative inversion framework. Case study results demonstrate stable convergence of the model and accurate delineation of key reservoir features—including vugs, fractures, and high-porosity zones. Compared with traditional attribute-based methods and conventional inversion, the proposed approach achieves superior well-log matching and markedly improved lateral continuity between wells. The predicted inter-well section and target-interval porosity slice further confirm the model’s sensitivity to fault–dissolution–vug systems and its strong capability in identifying complex reservoir architectures, favorable reservoir zones, and potential sweet spots. In summary, the physics-constrained deep-learning framework significantly enhances both the accuracy and geological plausibility of porosity prediction in fracture–vuggy carbonate reservoirs, offering a robust and scalable solution for fine characterization and development planning in ultra-deep, complex reservoir settings.