1. Introduction
Electrical imaging logging technology plays a crucial role in petroleum exploration and development, providing geologists with detailed information about wellbore formations through high-resolution resistivity measurements [1,2]. This technology can accurately identify geological features such as fractures, pores, and sedimentary structures, which is of significant importance for reservoir evaluation [3,4,5,6]. Micro-resistivity imaging logging instruments, as one type of advanced device, use multiple small electrodes on multi-pad arrays to emit currents into the wellbore wall. The changes in current caused by different rock compositions, structures, and contained fluids reflect the variations in resistivity at various points along the wellbore wall, as shown in Figure 1. These changes are converted into high-resolution images, which not only help in intuitively understanding the microstructure of the wellbore formations but also provide key data support for subsequent geological interpretation and reservoir evaluation. Although significant progress has been made in electrical imaging logging technologies, such as Schlumberger’s fullbore formation micro-imager (FMI), the incomplete match between the instrument pad size and the wellbore diameter leads to blank strips during data acquisition. These blank areas affect the integrity and continuity of the images, posing challenges to the accuracy of fracture identification and stratigraphic continuity interpretation. Therefore, it is essential to fill in the blank strips in electrical imaging logging images.
In recent years, researchers have explored various approaches to tackle this challenge. Traditional methods for inpainting blank strips can be broadly classified into two categories: those based on morphological features and those based on detail features. For instance, Hurley et al. [7] made a groundbreaking contribution by using actual wellbore images for training, incorporating multipoint statistical techniques and the filtersim algorithm [8] to create full-borehole images with geostatistical properties, offering a quantifiable theoretical framework for filling in the gaps. Zhang et al. [9] introduced a hybrid approach that effectively balanced rapid inpainting with the preservation of geological feature continuity by combining inverse distance weighting interpolation with pattern matching algorithms. Luo et al. [10] enhanced the Criminisi algorithm [11] by incorporating a texture priority mechanism, thereby broadening the applicability of traditional structural repair techniques. Despite these advancements in specific contexts, common limitations persist: parameter sensitivity that restricts generalization, unreliable matching in complex texture areas, and inadequate adaptation to multi-scale geological structures. In contrast, deep learning offers a paradigm shift by leveraging hierarchical feature extraction and end-to-end learning frameworks to autonomously capture multi-scale information. These limitations have thus driven researchers toward more adaptive deep learning solutions.
The rapid advancement of computer vision and deep learning technologies has brought revolutionary changes to many fields [12,13,14,15,16]. Scholars have begun to actively explore the application of these technologies to electrical imaging logging image processing in order to solve problems that traditional methods cannot overcome. For example, Chen et al. [17] proposed a fully convolutional neural network based on the U-Net architecture [18], which extracts underlying statistical features from a small number of samples through parameter optimization, recovers details through skip connections, and introduces dilated convolutions to enhance multi-scale feature capture. Zhang et al. [19] used an improved U-Net model to learn from a large number of blank strip filling samples, computing the fill from the original features of the image and the labeled area to be filled, thereby achieving automatic filling of blank strips. Du et al. [20] adopted a deep generative network method based on attention constraints to fill in the blank strips in logging electrical imaging, replacing the standard convolutions of traditional convolutional neural networks with deformable convolutions and thereby improving the model’s ability to handle images with complex structures. Cao et al. [21] used an improved method based on generative adversarial networks [22] to repair partially missing regions in imaging logging images. They constructed a generator based on the fully convolutional network (FCN) and enhanced the repair of missing parts through depthwise separable convolution residual blocks, Inception modules, multi-scale feature extraction, and spatial attention mechanisms, together with global and local discriminator networks. Sun et al. [23] used a multi-scale generative adversarial network to fill in the blank strips in electrical imaging. Su et al. [24] applied a deep learning method based on fast Fourier convolution to the repair of blank strips in electrical imaging logging images; the network enhances the recognition of geological features by perceiving the spatial-frequency-domain periodicity of image features. Wang et al. [25] proposed a method combining a deep filling image repair algorithm with histogram equalization to restore full-borehole electrical imaging logging images, improving overall image quality, especially the texture consistency at the edges. Zhong et al. [26] proposed a prior-guided adaptive generative adversarial network (GAN) method to improve the repair quality of defective areas in logging images, enhancing the recovery of image continuity features by training the GAN on a large number of natural images. Yang et al. [27] proposed a deep learning method named LogMAT based on a hybrid architecture, which leverages multi-stage pretraining, an incremental completion strategy, and a cylindrical boundary constraint to achieve high-quality filling of missing pixels in various borehole images. Despite the progress achieved by the above methods, four core challenges remain: first, single-stream or ordinary dual-stream models struggle to coordinate structural and textural features, leading to fracture artifacts and detail distortion; second, the limited receptive field of traditional convolutions restricts global context modeling, causing blurring in large repaired regions; third, static feature fusion strategies introduce redundant noise, weakening the semantic consistency of fracture edges; fourth, the models are overly dependent on specific data distributions, resulting in large fluctuations in cross-block generalization error. Therefore, developing a method that adapts to marine geological conditions, coordinates multi-scale structure-texture features, achieves precise filling of blank strips, and maintains cross-scale semantic consistency is the core objective of this study.
Based on the characteristics of borehole imaging diagrams and geological features [28,29,30], this study proposes a dual-stream network repair method that combines wavelet transform and spatial-channel convolution. The method addresses the challenging problem of repairing blank strips in electrical imaging logging images through a three-stage progressive strategy. The first stage targets the complete absence of ground-truth data in engineering scenarios by constructing a texture-aware data prior algorithm for initial filling. It also introduces a manual verification step in which geological experts check fracture morphology and stratigraphic continuity, screening candidate samples that conform to geological laws and establishing a training foundation that balances data availability and geological plausibility. The second stage builds on the dual-stream network architecture [31] for core algorithm innovation and collaborative optimization: (1) A cascaded wavelet transform convolution module [32] is designed to enhance the generator, expanding the receptive field through multi-level wavelet decomposition to capture global low-frequency information and strengthen the multi-scale representation of cave edges and multi-angle fractures. (2) A spatial-channel bidirectional gated convolution [33] is designed to enhance feature fusion, achieving semantically consistent reconstruction of texture and structure features while suppressing noise through feature redundancy compression and a dynamic fusion mechanism. (3) A composite loss function is constructed, integrating reconstruction loss, perceptual loss [34], style loss, and adversarial loss [35], forming an optimization objective driven by geological constraints. The third stage evaluates the method’s performance through a multi-dimensional validation system: quantitative analysis of fracture topological continuity indicators in comparative experiments, verification of the contribution of the core modules in ablation experiments, evaluation of geological generalization ability through cross-block testing, and verification of fracture interpretation consistency against core observations. This forms a complete technology chain from data preprocessing to model optimization and engineering verification.
2. Geological Overview
Buried hills, defined as paleo-topographic highlands unconformably overlain by younger strata, constitute critical hydrocarbon reservoir targets in basin exploration. Their fracture zones and complex pore structures considerably heighten the exploration challenges in the deepwater Qiongdongnan and Pearl River Mouth Basins of the northern South China Sea. Recent exploration successes there have underscored their strategic importance [36], as illustrated by the basins in Figure 2.
The Qiongdongnan Basin exhibits a NE–SW extensional structural pattern, lying between the Xisha Uplift and the Hainan Uplift. Its evolution comprises two phases, Paleogene rifting and Neogene sagging, forming four first-order tectonic units distributed from north to south: the Northern Depression, Central Uplift, Central Depression, and Southern Uplift. The Cenozoic stratigraphy comprehensively documents the transition from continental to marine sedimentation, with the Lingtou and Yacheng formations constituting the primary source rock layers.
In contrast, the Pearl River Mouth Basin extends 800 km along the continental margin, characterized by a six-segment structural zonation. Its Zhu-I Depression developed upon the Yanshanian granitic basement (170–90 Ma), genetically linked to the South China continental margin. The stratigraphic sequence includes the Upper Cretaceous Shenhu Formation overlain by Paleocene Wenchang and Enping formations. Geophysical data reveal a basement dominated by intermediate-acid intrusive rocks, locally intercalated with basaltic volcanic rocks.
Consequently, addressing the prevalent blank strip issues in electrical imaging logging under these complex structural settings, the inpainting method proposed in this study will provide new technical support for deepwater hydrocarbon exploration in the South China Sea.
3. Methodology
3.1. Data Collection and Preprocessing
In this study, both the training data and the test data were collected from six wells in the Buried Hills reservoir, labeled A, B, C, D, E, and F for convenience. The data from these six wells were sequentially processed for imaging, screening, and cropping, resulting in a total of 2642 original electrical image logs with a standardized imaging scale of 1:10. The image data were acquired using Halliburton Company’s (Houston, TX, USA) micro-resistivity scanner imager (XRMI) and Schlumberger Ltd.’s (Houston, TX, USA) fullbore formation micro-imager (FMI) logging tools. The XRMI tool is equipped with 150 micro-electrodes (6 pads, each containing 25 button electrodes arranged in two vertical rows). The FMI tool comprises 192 measuring electrodes distributed across 4 primary pads and 4 secondary pads, with 24 electrodes per pad (12 electrodes per row in two staggered rows) arranged to optimize borehole wall coverage. Fifty representative images were randomly selected as the test set, and the remaining 2592 images were used as the training set. To obtain complete electrical image logs without blank strips while preserving the original texture and structure features, this study proposes a texture-aware data prior algorithm to preprocess the 2592 training images. After screening by professional personnel, the final effective training set contained 2500 images.
The overall processing steps for data acquisition are shown in Figure 3 and can be divided into two steps. The first step involves processing the raw logging data for imaging, followed by sequential segmentation and mask extraction operations on the imaging results. The second step is the process of the texture-aware data prior algorithm, which uses texture features (LBP) and structural gradient fields for model fusion and data priors. This process includes three key stages: priority calculation, texture-aware patch matching, and progressive repair updates. Among these, the priority calculation integrates confidence propagation, structural gradient fields, and texture features to construct a comprehensive evaluation index, defined by Equation (1):
where C(p) represents the confidence of the current pixel point p, reflecting the reliability of known information; D(p) is the data term, reflecting the continuity of image structural features; and F(p) is the repair-front detection term, identifying the boundary features of the area to be repaired. The data term D(p) is calculated by combining the geometric constraints of the normal field and the gradient field, as shown in Equation (2):
where ∇I⊥ denotes the orthogonal (isophote) gradient operator, n(p) is the normal field vector, and Gc(p) is the composite gradient field combining the original gradient with the LBP texture gradient, as shown in Equation (3):
In the above equation, ∇Ix and ∇Iy are the gradient components of the original image in the x and y directions, respectively; ∇Tx and ∇Ty are the gradient components of the LBP texture feature in the x and y directions, respectively; and α is the fusion weight coefficient. During the patch matching stage, a multi-modal similarity measurement method is adopted, simultaneously considering color space differences and texture feature distances, as shown in Equation (4):
Here, the first term compares the features of patches Ψp and Ψq in the Lab color space, and the second term compares their local binary pattern (LBP) histogram features; ∥⋅∥2 is the Euclidean distance, ∥⋅∥H is the histogram intersection distance, and β is set to 0.7 to balance the contributions of color and texture.
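For illustration, the following Python sketch computes a patch distance of the form described above. The concrete feature definitions (raw Lab pixel values for the color term, a uniform-LBP histogram for the texture term) and the way β weights the two terms are assumptions made for the example, not the exact implementation used in this study.

```python
import numpy as np
from skimage.color import rgb2lab
from skimage.feature import local_binary_pattern

def patch_similarity(patch_p, patch_q, beta=0.7, lbp_points=8, lbp_radius=1):
    """Illustrative multi-modal patch distance in the spirit of Equation (4):
    Lab colour difference combined with an LBP-histogram term.
    Feature definitions are assumptions, not the authors' exact implementation."""
    # Colour term: Euclidean distance between the Lab representations of the patches,
    # normalised by the number of values so it is comparable to the texture term.
    lab_p, lab_q = rgb2lab(patch_p), rgb2lab(patch_q)
    d_color = np.linalg.norm(lab_p - lab_q) / lab_p.size

    # Texture term: histogram-intersection distance between uniform-LBP histograms
    # computed on the grayscale versions of the patches.
    gray_p, gray_q = patch_p.mean(axis=-1), patch_q.mean(axis=-1)
    lbp_p = local_binary_pattern(gray_p, lbp_points, lbp_radius, method="uniform")
    lbp_q = local_binary_pattern(gray_q, lbp_points, lbp_radius, method="uniform")
    bins = lbp_points + 2
    h_p, _ = np.histogram(lbp_p, bins=bins, range=(0, bins), density=True)
    h_q, _ = np.histogram(lbp_q, bins=bins, range=(0, bins), density=True)
    d_texture = 1.0 - np.minimum(h_p, h_q).sum()   # 1 - histogram intersection

    # beta = 0.7 balances colour against texture, as stated in the text;
    # the convex-combination form is an assumption.
    return beta * d_color + (1.0 - beta) * d_texture
```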
The confidence update introduces a texture complexity factor, as shown in Equation (5):
where Tcomplex(q) is the local texture complexity, calculated from the variance of the LBP features, and N is the patch size. This mechanism effectively maintains the consistency of high-texture areas during the repair process.
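The sketch below illustrates the two quantities involved: the LBP-variance texture complexity Tcomplex(q) as described in the text, and one plausible Criminisi-style confidence update attenuated by that complexity. The update form and the scaling constant kappa are assumptions made for the example; Equation (5) itself is not reproduced here.

```python
import numpy as np
from skimage.feature import local_binary_pattern

def texture_complexity(gray_patch, lbp_points=8, lbp_radius=1):
    """T_complex(q): variance of the LBP codes inside the patch, as described in the text."""
    lbp = local_binary_pattern(gray_patch, lbp_points, lbp_radius, method="uniform")
    return float(np.var(lbp))

def update_confidence(conf_map, mask, center, half_n, t_complex, kappa=1.0):
    """One plausible Criminisi-style confidence update for a just-filled N x N patch:
    the average confidence of the originally known pixels in the patch, attenuated by
    the local texture complexity. The attenuation form and kappa are assumptions."""
    r, c = center
    sl = np.s_[r - half_n:r + half_n + 1, c - half_n:c + half_n + 1]
    known = mask[sl] == 0                       # mask == 0 marks originally known pixels
    n_pixels = conf_map[sl].size                # N * N
    base = conf_map[sl][known].sum() / n_pixels
    new_conf = base / (1.0 + kappa * t_complex)
    conf_map[sl][mask[sl] == 1] = new_conf      # assign to the newly filled pixels
    return conf_map
```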
In summary, through the texture-aware data prior algorithm, combined with multi-modal similarity measurements and confidence updates, a high-quality training set with geological rationality was constructed, providing a reliable data foundation for subsequent network training.
Figure 3. Schematic diagram of dataset creation. The first step illustrates the workflow of data imaging processing, segmentation, and mask extraction operations. The second step outlines the process flow of the texture-aware data prior algorithm.
3.2. Overall Network Architecture
The network architecture of the proposed electrical imaging logging image inpainting method, based on the collaborative optimization of wavelet and spatial-channel convolution in a dual-stream network, is shown in Figure 4. The overall model is divided into three parts: the first part is a generator enhanced by wavelet transform convolution, the second part is a feature fusion module enhanced by spatial-channel convolution, and the third part is a discriminator composed of two parallel branches for structure and texture. The generator is responsible for generating restored images from damaged ones, while the feature fusion module enhances the consistency and expressiveness of features through bidirectional gated feature fusion in the spatial and channel domains (SCBi-GFF) and context feature aggregation (SC-CFA). The discriminator evaluates the realism and quality of the generated images. Additionally, a multi-objective joint loss function is designed to guide the training of the entire network, ensuring that the generated images are highly consistent with real images in terms of texture and structure.
3.2.1. Wavelet Transform Enhanced Generator Design
In the task of restoring blank strips in electrical imaging well logs, the generator needs to reconstruct high-fidelity images with geological significance from locally missing input data. Traditional fully convolutional generators are limited by their fixed-scale receptive field and single-frequency feature extraction, making it difficult to simultaneously capture low-frequency stratigraphic structures, fracture morphology, and pore distribution, which can lead to misalignment or texture artifacts in the restored regions. To address this issue, this work proposes a generator enhancement architecture based on wavelet transform convolutions, which achieves refined restoration through a multi-band orthogonal decomposition, convolution, and inverse-transformation mechanism. As shown in Figure 4a, the generator adopts a dual-stream U-Net variant, divided into a texture stream and a structure stream that work collaboratively within an encoder-decoder framework. During the encoding phase, the texture stream extracts multi-scale texture features via partial convolutions, while the structure stream uses edge information as prior knowledge to extract global constraints. In the decoding phase, the texture decoder combines high-level features from the structure branch to generate structure-constrained textures, and the structure decoder integrates features from the texture branch to generate texture-guided structures. To further enhance the generator’s ability to process different frequency components, we introduce wavelet transform convolutions (WTConv) into the generator.
Figure 5 illustrates the operation of wavelet transform convolutions, showing how they enlarge the model’s receptive field through a two-level wavelet decomposition. Specifically, the original input image undergoes two wavelet transformations (WT), decomposing it into a low-frequency component and high-frequency components in the horizontal, vertical, and diagonal directions. The low-frequency part obtained after the second wavelet decomposition preserves the overall structure and trend of the image, while the three high-frequency components carry the detailed information in the horizontal, vertical, and diagonal directions, respectively. Although only a small 3 × 3 convolution kernel is applied to the low-frequency component, because this component has already been decomposed twice, the effective receptive field covers a larger 12 × 12 region of the original image. This mechanism not only significantly expands the receptive field of the convolution layer but also enables the model to capture more contextual information at lower computational cost, thereby helping it to better understand the overall structure of the image and improve the restoration result. By separating information at different frequencies, wavelet transform convolutions enable the model to effectively process image data at different scales, providing stronger feature extraction capability and higher restoration accuracy for tasks such as restoring blank strips in electrical imaging well logs.
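The following PyTorch sketch illustrates the mechanism: a depthwise two-level Haar decomposition, small depthwise 3 × 3 convolutions on the sub-bands, and an inverse transform that recombines them. The Haar filter choice, the per-sub-band depthwise convolutions, and the simple recombination are assumptions made for illustration and do not reproduce the exact WTConv implementation of [32].

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def _haar_filters(channels):
    # Orthonormal 2D Haar analysis filters (LL, LH, HL, HH), repeated once per channel.
    ll = torch.tensor([[0.5, 0.5], [0.5, 0.5]])
    lh = torch.tensor([[0.5, 0.5], [-0.5, -0.5]])
    hl = torch.tensor([[0.5, -0.5], [0.5, -0.5]])
    hh = torch.tensor([[0.5, -0.5], [-0.5, 0.5]])
    filt = torch.stack([ll, lh, hl, hh]).unsqueeze(1)   # (4, 1, 2, 2)
    return filt.repeat(channels, 1, 1, 1)               # (4*C, 1, 2, 2)

class WTConv2d(nn.Module):
    """Sketch of a wavelet-transform convolution: Haar DWT -> depthwise 3x3
    convolutions on the sub-bands (recursively on LL) -> inverse DWT."""
    def __init__(self, channels, levels=2):
        super().__init__()
        self.channels, self.levels = channels, levels
        self.register_buffer("filt", _haar_filters(channels))
        # One depthwise 3x3 convolution over the four sub-bands at each level.
        self.band_convs = nn.ModuleList(
            [nn.Conv2d(4 * channels, 4 * channels, 3, padding=1, groups=4 * channels)
             for _ in range(levels)])

    def _dwt(self, x):     # (B, C, H, W) -> (B, 4C, H/2, W/2)
        return F.conv2d(x, self.filt, stride=2, groups=self.channels)

    def _idwt(self, y):    # (B, 4C, H/2, W/2) -> (B, C, H, W)
        return F.conv_transpose2d(y, self.filt, stride=2, groups=self.channels)

    def forward(self, x):
        bands, lls = [], x
        for level in range(self.levels):
            y = self._dwt(lls)                 # decompose the current low-frequency part
            y = self.band_convs[level](y)      # small depthwise conv in the wavelet domain
            b, _, h, w = y.shape
            y = y.view(b, self.channels, 4, h, w)   # per-channel sub-bands: LL, LH, HL, HH
            lls = y[:, :, 0]                   # LL goes to the next decomposition level
            bands.append(y)
        # Reconstruct from the deepest level back to the original resolution.
        recon = lls
        for y in reversed(bands):
            y = y.clone()
            y[:, :, 0] = recon                 # replace LL with the reconstruction so far
            recon = self._idwt(y.view(y.shape[0], -1, y.shape[3], y.shape[4]))
        return recon

# Usage: out = WTConv2d(channels=64, levels=2)(torch.randn(1, 64, 64, 64))
```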
Thus, wavelet transform convolutions, by performing convolutions in the wavelet domain, can effectively capture the multi-scale features of images, enhancing the generator’s modeling capabilities for image details and structures. This ensures the accuracy and consistency of the restoration results in both low-frequency stratigraphic structures and high-frequency details. Experimental results show that this method significantly improves the structural integrity and texture clarity of the restored regions when handling the task of restoring blank strips in electrical imaging well logs.
3.2.2. Spatial-Channel Convolution Enhanced Feature Fusion Design
The two-dimensional resistivity image obtained from electrical imaging well logging data exhibits significant stratigraphic texture directionality and locally anisotropic characteristics, which places high demands on the spatial detail restoration and channel feature discrimination capabilities of the reconstruction algorithm. Therefore, building on the SCConv module and the CTSDG framework for feature fusion, this paper constructs the SCBi-GFF and SC-CFA modules. These modules jointly optimize texture and structure generation quality, thereby addressing the issue of discontinuous fracture details in the images.
SCConv is an efficient convolution module that uses a joint reconstruction strategy across the spatial and channel dimensions, significantly reducing computational cost while enhancing feature expression capability. As shown in Figure 6, this module consists of a spatial reconstruction unit (SRU) and a channel reconstruction unit (CRU) connected in sequence. Its core idea is to dynamically model the distribution of redundancy, explicitly separate and reconstruct effective features, and optimize the interaction efficiency of multi-modal information. In the spatial dimension, the SRU adopts a three-step "separation-threshold-reconstruction" strategy. The input feature X first undergoes group normalization to generate a channel scaling factor γ, which quantifies the spatial information density of each channel. The normalized weights W are then mapped to the range (0, 1) via a Sigmoid function and binarized using a fixed threshold T = 0.5, dividing the features into information-dominant regions and redundancy-dominant regions. To avoid the information loss caused by simply discarding redundant features, the SRU reconstructs the spatially refined feature Xw through cross-addition and concatenation operations, suppressing smooth background redundancy while retaining inter-regional context. In the channel dimension, the CRU further optimizes features through a "split-heterogeneous-fusion" strategy. After spatial refinement, the feature Xw is split into high- and low-information-density branches at a preset ratio α = 0.5: the high branch uses group-wise convolution (GWC) and pointwise convolution (PWC) in parallel to extract multi-scale semantic features, while the low branch retains shallow-layer details through a lightweight PWC and skip connections. Global average pooling is then applied to the two types of features to extract channel statistics, and their attention weights are adaptively fused using SoftMax, ultimately outputting the refined channel feature. This design balances computational efficiency and expressive power through heterogeneous paths: sparse connections in the high branch reduce parameters, while the reuse mechanism in the low branch avoids detail loss.
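A simplified sketch of the SRU’s separate-threshold-reconstruct mechanism is given below; the gating formula, the fixed 0.5 threshold, and the two-way channel split used for cross-addition are loose approximations of SCConv [33] rather than its exact code, and the CRU stage is omitted.

```python
import torch
import torch.nn as nn

class SRU(nn.Module):
    """Simplified spatial reconstruction unit: separate -> threshold -> reconstruct.
    A loose sketch of the mechanism described above, not the exact SCConv code.
    Assumes an even number of channels (needed for the two-way channel split)."""
    def __init__(self, channels, groups=4, threshold=0.5):
        super().__init__()
        self.gn = nn.GroupNorm(groups, channels)
        self.threshold = threshold

    def forward(self, x):
        xn = self.gn(x)
        # Channel scaling factor gamma quantifies the information density per channel.
        gamma = self.gn.weight / self.gn.weight.sum()
        w = torch.sigmoid(gamma.view(1, -1, 1, 1) * xn)
        info_mask = (w >= self.threshold).float()   # information-dominant part
        red_mask = 1.0 - info_mask                  # redundancy-dominant part
        x1, x2 = info_mask * x, red_mask * x
        # Cross-addition of the two channel halves, then concatenation (reconstruction).
        x11, x12 = torch.chunk(x1, 2, dim=1)
        x21, x22 = torch.chunk(x2, 2, dim=1)
        return torch.cat([x11 + x22, x12 + x21], dim=1)
```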
The SCBi-GFF module is based on a bidirectional gated feature fusion framework. It takes the structural feature Fs and the texture feature Ft as inputs through dual branches and embeds SCConv into the feature preprocessing stage to achieve sparse contextual modeling. As shown in Figure 7, the input structural feature Fs and texture feature Ft first pass through the SCConv module, whose joint spatial-channel reconstruction mechanism performs prior dimensionality reduction and redundancy filtering, suppressing noise interference and strengthening the expression of key regions. On this basis, the module constructs a bidirectional gated dynamic modulation mechanism: through cross-path feature interaction, the structural feature Fs and the texture feature Ft are combined element-wise and passed through Sigmoid functions to generate the gating weight matrices Gt and Gs, which quantify, respectively, the degree to which texture details supplement the structural features and the geometric constraint that structural contours impose on the texture features. Learnable parameters α and β are then introduced to dynamically modulate the original features, generating the optimized structural and texture features; α and β are initialized to zero to stabilize early training convergence and gradually learn the optimal fusion ratio of cross-modal features through gradient backpropagation. After modulation, the features are fed back to the original paths via skip connections, forming a closed-loop optimization circuit that forces the network to strengthen the semantic consistency between structural and texture features during iterations and avoids local misalignment. Finally, the two paths are fused through channel concatenation and a nonlinear transformation to generate the fused feature Fb, which deeply integrates the global constraints of the structural contours with the multi-scale contextual information of the texture details. Through the gating mechanism, the complementarity of the two is explicitly balanced, enabling the restored region to present high-fidelity texture transitions while maintaining geometric rationality.
The SCBi-GFF module dynamically balances structural and textural features through bidirectional gating, ensuring semantic consistency and high-fidelity texture restoration. Its closed-loop optimization and adaptive parameter learning enable precise alignment of multi-scale geological patterns.
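The following sketch captures the gating and zero-initialized modulation described above; the 1 × 1 convolutions standing in for the SCConv preprocessing and the shapes of the gating and fusion layers are assumptions made for illustration, not the paper’s exact implementation.

```python
import torch
import torch.nn as nn

class SCBiGFF(nn.Module):
    """Sketch of the bidirectional gated feature fusion (SCBi-GFF) described above."""
    def __init__(self, channels):
        super().__init__()
        self.pre_s = nn.Conv2d(channels, channels, 1)   # placeholder for SCConv on the structure path
        self.pre_t = nn.Conv2d(channels, channels, 1)   # placeholder for SCConv on the texture path
        self.gate_t = nn.Conv2d(2 * channels, channels, 3, padding=1)
        self.gate_s = nn.Conv2d(2 * channels, channels, 3, padding=1)
        self.alpha = nn.Parameter(torch.zeros(1))       # initialised to zero, as stated in the text
        self.beta = nn.Parameter(torch.zeros(1))
        self.fuse = nn.Sequential(
            nn.Conv2d(2 * channels, channels, 3, padding=1), nn.ReLU(inplace=True))

    def forward(self, f_s, f_t):
        f_s, f_t = self.pre_s(f_s), self.pre_t(f_t)     # redundancy filtering / denoising step
        cat = torch.cat([f_s, f_t], dim=1)
        g_t = torch.sigmoid(self.gate_t(cat))           # how strongly texture detail supplements structure
        g_s = torch.sigmoid(self.gate_s(cat))           # how strongly structure constrains texture
        f_s_mod = f_s + self.alpha * (g_t * f_t)        # modulated structural feature
        f_t_mod = f_t + self.beta * (g_s * f_s)         # modulated texture feature
        return self.fuse(torch.cat([f_s_mod, f_t_mod], dim=1))   # fused feature F_b
```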
The SC-CFA module is based on a context aggregation framework and combines SCConv to couple feature redundancy suppression with multi-scale contextual modeling. As shown in Figure 8, the input feature first passes through the SCConv module, where the SRU separates important from unimportant regions and the CRU compresses redundant channels, producing an optimized intermediate feature F that provides a denoised foundation for subsequent processing. The feature then enters the context matching and reconstruction stage, where attention weights are constructed through patch similarity calculations to guide feature recombination and generate an initial repair feature Frec. This feature is further fed into a multi-scale feature aggregation branch consisting of four parallel dilated convolutions (dilation rates of 1, 2, 4, and 8) that extract contextual information at different scales. Independent weight generators dynamically produce pixel-level weights W1, W2, W4, and W8 for the four branches; through element-wise multiplication, these weights adaptively reweight each scale’s features, which are then fused into a global perception feature through element-wise addition. The fused features are restored to the original spatial resolution via deconvolution and combined with the original input features through skip connections for cross-layer feedback, preserving consistency and avoiding information loss. Through SCConv’s redundancy suppression, multi-scale dynamic weighting, and cascaded context reconstruction, the whole process achieves hierarchical feature fusion from local details to long-range dependencies, reducing redundancy while enhancing the semantic rationality and texture continuity of the restored region.
The SC-CFA module hierarchically aggregates multi-scale features through dynamic weighting and redundancy suppression, enhancing long-range dependency modeling while preserving texture continuity and geological coherence.
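The multi-scale aggregation branch can be sketched as follows; the layer widths and the use of Sigmoid-normalized pixel-level weights are assumptions, and the SCConv preprocessing and context matching stages are omitted.

```python
import torch
import torch.nn as nn

class MultiScaleAggregation(nn.Module):
    """Sketch of the multi-scale branch of SC-CFA: four parallel dilated convolutions
    (rates 1, 2, 4, 8) whose outputs are re-weighted by learned pixel-level weights
    and summed. Layer sizes are illustrative assumptions."""
    def __init__(self, channels):
        super().__init__()
        rates = (1, 2, 4, 8)
        self.branches = nn.ModuleList(
            [nn.Conv2d(channels, channels, 3, padding=r, dilation=r) for r in rates])
        # One pixel-wise weight generator per dilation rate (W1, W2, W4, W8).
        self.weights = nn.ModuleList([nn.Conv2d(channels, 1, 1) for _ in rates])

    def forward(self, f_rec):
        out = 0
        for branch, weight in zip(self.branches, self.weights):
            feat = branch(f_rec)               # contextual features at one dilation rate
            w = torch.sigmoid(weight(feat))    # dynamically generated pixel-level weight
            out = out + w * feat               # element-wise weighting and additive fusion
        return out
```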
3.2.3. Discriminator Network Design
The discriminator network adopts a dual-branch architecture, as shown in Figure 4c, consisting of two parallel branches that process RGB images and edge-grayscale composite inputs, respectively. The texture branch employs a five-layer convolutional structure (the first three layers use 4 × 4 convolutions with stride 2 for downsampling, while the latter two layers maintain resolution) to hierarchically extract features and evaluate texture authenticity. The structure branch introduces residual blocks to preprocess edge information and processes the concatenated edge and grayscale maps using convolutional operations of the same hierarchy, focusing on constraining geometric rationality. Both branches utilize spectral normalization to stabilize training. Final feature fusion is achieved by concatenating the outputs along the channel dimension.
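A sketch of the texture branch with spectral normalization is shown below; the channel widths, activation choices, and patch-level output are assumptions, and the structure branch (residual preprocessing of the edge-grayscale input) is only indicated in the closing comment.

```python
import torch
import torch.nn as nn
from torch.nn.utils import spectral_norm

class TextureDiscriminator(nn.Module):
    """Sketch of the texture branch described above: five convolutional layers, the
    first three downsampling with 4x4 stride-2 kernels, the last two keeping
    resolution, all spectrally normalised. Channel widths are assumptions."""
    def __init__(self, in_channels=3, base=64):
        super().__init__()
        def sn_conv(cin, cout, k, s, p):
            return nn.Sequential(
                spectral_norm(nn.Conv2d(cin, cout, k, stride=s, padding=p)),
                nn.LeakyReLU(0.2, inplace=True))
        self.features = nn.Sequential(
            sn_conv(in_channels, base, 4, 2, 1),        # downsample x2
            sn_conv(base, base * 2, 4, 2, 1),           # downsample x4
            sn_conv(base * 2, base * 4, 4, 2, 1),       # downsample x8
            sn_conv(base * 4, base * 4, 3, 1, 1),       # keep resolution
            spectral_norm(nn.Conv2d(base * 4, 1, 3, stride=1, padding=1)))  # patch scores

    def forward(self, rgb):
        return self.features(rgb)

# The structure branch would mirror this stack but take the concatenated edge map and
# grayscale image (2 channels) as input after residual-block preprocessing; the two
# branches' outputs are then concatenated along the channel dimension.
```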
3.2.4. Loss Function Design
For the blank strip inpainting task in electrical imaging logging, a multi-objective joint loss function is introduced to enhance visual quality, structural consistency, and semantic rationality of the restored results. The loss components include reconstruction loss, perceptual loss, style loss, and adversarial loss.
Reconstruction loss: Based on the L1 norm, this loss constrains pixel-level alignment between the inpainted image Iout and the ground-truth logging image Igt, suppressing global blurring while preserving high-frequency geological response features. This avoids lithological misjudgment caused by low-frequency deviations and ensures accurate reconstruction of physical quantities such as resistivity:
Lrec = ∥Iout − Igt∥1,
where ∥⋅∥1 denotes the L1 norm.
Perceptual loss: Leveraging a pretrained VGG-16 network [37], this loss extracts multi-level semantic features to constrain the similarity between generated and real images in deep feature space, enhancing the semantic coherence of formation interfaces and pore structures:
where ϕi represents the activation maps from the i-th pooling layer of VGG-16.
Style loss: This loss constrains the local texture distribution via Gram-matrix differences between generated and real images, suppressing non-geological pseudo-texture noise and abrupt resistivity patterns:
where ψi is the Gram matrix of the feature maps.
Adversarial loss: This loss introduces a discriminator D to improve visual realism through adversarial training while avoiding excessive smoothing or artifacts:
where Egt and Eout denote the structural edge maps of the real and generated images, respectively.
The total multi-objective joint loss is defined as
Ltotal = λrecLrec + λpercLperc + λstyleLstyle + λadvLadv,
where the λ terms denote the weights of each loss, experimentally set to λrec = 1.0, λperc = 0.2, λstyle = 50, and λadv = 0.05.
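The composition of the total loss with the weights quoted above can be sketched as follows; the specific norms for the perceptual and style terms and the non-saturating form of the generator’s adversarial term are assumptions, since only the loss components and their weights are specified in the text.

```python
import torch
import torch.nn as nn

def gram_matrix(feat):
    """Gram matrix of a feature map, used by the style loss."""
    b, c, h, w = feat.shape
    f = feat.view(b, c, h * w)
    return torch.bmm(f, f.transpose(1, 2)) / (c * h * w)

def total_loss(i_out, i_gt, vgg_feats_out, vgg_feats_gt, d_fake_logits,
               w_rec=1.0, w_perc=0.2, w_style=50.0, w_adv=0.05):
    """Illustrative composition of the multi-objective loss with the weights quoted in
    the text. The norms and the adversarial formulation (non-saturating generator loss
    on the discriminator logits) are assumptions, not the paper's exact equations."""
    l_rec = nn.functional.l1_loss(i_out, i_gt)                        # reconstruction (L1)
    l_perc = sum(nn.functional.l1_loss(fo, fg)                        # perceptual (VGG-16 features)
                 for fo, fg in zip(vgg_feats_out, vgg_feats_gt))
    l_style = sum(nn.functional.l1_loss(gram_matrix(fo), gram_matrix(fg))
                  for fo, fg in zip(vgg_feats_out, vgg_feats_gt))     # style (Gram matrices)
    l_adv = nn.functional.softplus(-d_fake_logits).mean()             # generator adversarial term
    return w_rec * l_rec + w_perc * l_perc + w_style * l_style + w_adv * l_adv
```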
3.3. Workflow
The workflow of the proposed dual-stream network based on wavelet and spatial-channel convolution collaborative optimization for blank strip inpainting in electrical imaging logging images is illustrated in Figure 9. First, six wells from the Buried Hills reservoir were selected as the dataset and divided into training and test sets to establish a benchmark data framework. The texture-aware data prior algorithm was then applied to pre-fill the training set images, followed by manual validation and selection of valid data by expert staff. Next, the dual-stream network was constructed, and the training set images with their corresponding masks were fed into the model for training. The hyperparameters used in the model are listed in Table 1. Finally, the optimal trained weights were loaded to evaluate the model on the 50 test set images, including comparative experiments and ablation experiments. To further validate the effectiveness of the proposed method, cross-region generalization analysis was performed using logging data from low-permeability hydrocarbon reservoirs, supplemented by core sample comparisons.
5. Discussion
5.1. Pixel-Wise Correlation Statistical Analysis
Through the collaborative design of wavelet transforms and spatial-channel convolutions, this study addresses the key challenges of multi-scale feature extraction and texture consistency in the task of blank strip restoration for electrical imaging logging images. The architecture adopts a complementary dual-stream optimization mechanism of structure flow and texture flow (Figure 4), and experiments demonstrate that it simultaneously enhances the geometric rationality and texture clarity of repaired regions: the structure-flow network ensures the topological continuity of fracture morphology, while the texture-flow network achieves subpixel-level texture reconstruction through multi-scale feature fusion, effectively overcoming the fracture artifacts common in models such as U-Net.
To quantify the statistical consistency between the repaired results and the original images, this study introduces a pixel-distribution correlation analysis: the pixel values of the repaired regions are divided into intervals, and their frequency distributions are compared with those of the original images using the Pearson correlation coefficient. Statistical analysis of nine representative samples of the proposed method’s repaired results in Figure 10, Figure 12 and Figure 14 (see Figure 16a–i) shows that the pixel-distribution correlation coefficients between repaired and original images range from 0.921 to 0.997, demonstrating high statistical consistency. These results confirm the dual advantages of the method in complex geological image restoration: it maintains both the spatial continuity of structural features in local regions and the statistical distribution patterns of sedimentary textures at a global scale. Notably, the highest correlation coefficient (0.997) appears in the sample with multiple fracture structures (corresponding to Figure 14b), as shown in Figure 16i, illustrating the dual-stream network’s capability for collaborative modeling of multi-scale geological features. While achieving precise local structural restoration, the method effectively preserves the consistency of the images’ global statistical properties. This "local-global" dual-constraint mechanism significantly improves the reliability of geological interpretation of electrical imaging logging data.
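The correlation check can be reproduced with a few lines of Python; the number of bins is an assumption, as the text only specifies that pixel values are divided into intervals and the resulting frequency distributions are compared with the Pearson correlation coefficient.

```python
import numpy as np
from scipy.stats import pearsonr

def pixel_distribution_correlation(repaired, original, n_bins=32):
    """Sketch of the pixel-distribution correlation check described above: bin the pixel
    values of the repaired and original images and compare the two frequency
    distributions with the Pearson correlation coefficient. The bin count is an assumption."""
    rng = (min(repaired.min(), original.min()), max(repaired.max(), original.max()))
    h_rep, _ = np.histogram(repaired, bins=n_bins, range=rng, density=True)
    h_org, _ = np.histogram(original, bins=n_bins, range=rng, density=True)
    r, p_value = pearsonr(h_rep, h_org)
    return r, p_value
```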
5.2. Sensitivity to Key Hyperparameters
Although the proposed dual-stream image restoration framework demonstrates notable advantages in structural preservation and texture detail reconstruction, it still exhibits a degree of dependency on hyperparameters at the model design level, which requires further investigation. As shown in Figure 17, the model’s performance varies significantly under different wavelet decomposition levels and SCConv expansion rate configurations, indicating a certain sensitivity to key hyperparameters.
In terms of the number of wavelet decomposition levels, configurations with 1, 2, 3, and 4 levels were examined. The results reveal that a two-level decomposition achieves the best overall performance, with the lowest MAE and the highest SSIM and PSNR scores. This indicates that it can effectively capture high-frequency textures while suppressing checkerboard artifacts, without a notable increase in memory consumption. In comparison, a single decomposition level is insufficient for detail extraction, while three- and four-level decompositions tend to introduce information redundancy and a risk of detail loss. These results suggest that an optimal range exists for the depth of wavelet decomposition in structural restoration tasks.
Regarding the SCConv expansion rate, four sequences—(2, 4, 8), (4, 8, 16), (8, 16, 32), and (16, 32, 64)—were compared. Among them, the (2, 4, 8) configuration strikes the best balance between receptive field enlargement and computational complexity control, resulting in the lowest MAE and the highest SSIM and PSNR scores. This shows that moderate expansion rates are effective in enlarging the receptive field while preserving local structural details. In contrast, larger expansion rates such as (4, 8, 16) enhance the expressive capacity of feature channels but are prone to introducing excessive smoothing artifacts during reconstruction, leading to structural degradation and overall performance decline. Extreme configurations like (16, 32, 64) exhibit near-degraded performance across multiple metrics, likely due to high memory demands and unstable gradient propagation, further indicating that excessively large expansion rates are not advisable.
In conclusion, although the proposed model surpasses existing approaches in several evaluation metrics, its robustness still depends, to some extent, on the careful tuning of critical hyperparameters.
6. Conclusions
This study proposes a dual-stream network for electrical imaging log blank-band restoration based on wavelet–spatial channel collaborative optimization. The main innovations of this method in electrical imaging restoration are summarized as follows:
- (1)
A dual-stream complementary optimization mechanism integrating structure-flow and texture-flow: The structure-flow ensures the topological continuity of fracture morphology. The texture-flow achieves subpixel-level texture reconstruction through multi-scale feature fusion, effectively resolving fracture artifacts common in single-stream or conventional dual-stream models, while simultaneously improving the geometric rationality and texture clarity of repaired regions.
- (2)
Innovative introduction of the WTConv module: A multi-band orthogonal decomposition-convolution-inversion mechanism enables refined restoration, significantly expanding the receptive field of convolutional layers. This allows the model to capture more contextual information at lower computational costs, better understand the overall image structure, and improve processing capabilities for different frequency components.
- (3)
Design of SCBi-GFF and SC-CFA modules: Joint optimization of texture and structural generation quality collaboratively addresses discontinuous fracture details. Through cross-path feature interaction and dynamic parameter weighting, semantic consistency of structural-textural features is enhanced, avoiding local distortions.
Through these innovations, comprehensive evaluations were conducted, including comparative experiments, ablation studies, and cross-block generalization tests. Quantitative analysis demonstrates that the proposed method achieves MAE = 6.893, SSIM = 0.779, and PSNR = 19.087, significantly outperforming existing mainstream models. Ablation studies systematically reveal the synergistic effects between modules, confirming the effectiveness of the dual-stream complementary architecture and multi-band optimization design. Cross-block generalization tests further validate the model’s strong robustness. Additionally, fracture consistency verification based on core slice comparisons confirms that the inpainted results do not interfere with geological interpretation and effectively eliminate false fracture indicators, providing more reliable data support for downstream reservoir modeling tasks. In future work, the research team intends to extend this framework to 3D electrical imaging data restoration, integrate geological parameters such as fracture aperture and occurrence, and achieve intelligent spatially continuous reconstruction of fracture systems in complex reservoirs. Meanwhile, we also plan to introduce a multi-modal fusion approach by integrating complementary logging data and combining it with our model to promote joint feature learning.