Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation

Lu, Cai; Qu, Liyuan; Liu, Jijun; Gao, Jianbo

doi:10.3390/app15116360

Open AccessArticle

Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation

¹

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

²

School of Resources and Environment, University of Electronic Science and Technology of China, Chengdu 611731, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(11), 6360; https://doi.org/10.3390/app15116360

Submission received: 24 April 2025 / Revised: 28 May 2025 / Accepted: 30 May 2025 / Published: 5 June 2025

Download

Browse Figures

Versions Notes

Abstract

:

Effective vertical seismic profiling (VSP) of upgoing and downgoing wave separation is essential for high-quality imaging. However, VSP wavefield separation is particularly challenging under complex geological conditions. Existing solutions encompass one derived from the mathematical characteristics of upgoing and downgoing waves, employing signal decomposition methodologies, and another that utilizes data-driven machine learning techniques, achieving wavefield separation by training sample data to identify the distinct characteristics of upgoing and downgoing waves. This study introduces a VSP wave-separation method using signal knowledge representation, primarily by constructing knowledge representations of upgoing and downgoing waves. Physics-informed recurrent neural network FWI and Poynting vector physical knowledge representation yielded accurate velocity models. Axial gradient information was utilized to construct morphological knowledge representations of upgoing and downgoing waves. Directional differentiation knowledge representations were established based on kinematic characteristic disparities between upgoing and downgoing waves in the time-depth domain. These wave knowledge representations (KRs) built a dual convolutional autoencoder. Its distinct branches extracted up/down wave information, while the KRs, transformed into loss functions, enabled knowledge-driven unsupervised VSP wave separation. The proposed methodology was validated using a homogeneous layer and Marmousi models, demonstrating the effective separation of upgoing and downgoing waves from the VSP seismic records.

Keywords:

knowledge representation; unsupervised model; vertical seismic profile (VSP); wavefield separation

1. Introduction

Seismic waves are fundamental to geophysical exploration, serving as the primary carriers of information about the Earth’s subsurface. These elastic waves, predominantly categorized into compressional (P-waves) and shear (S-waves), propagate through geological media with velocities and amplitudes that are sensitive to variations in rock properties, fluid content, and structural features. Seismic exploration methods, therefore, aim to generate and record these waves to image subsurface structures and characterize reservoir properties. Vertical Seismic Profiling (VSP) is a crucial borehole seismic technique in seismic data acquisition. It typically involves deploying an array of receivers at various depths within a wellbore to record the seismic signals generated by a source positioned at or near the surface. This acquisition geometry allows VSP to directly record both the downgoing wavefield and the upgoing wavefield. The direct measurement of the downgoing wavefield provides invaluable information for determining the seismic source wavelet, accurate interval velocities, and formation attenuation characteristics, while the upgoing wavefield carries detailed information about reflectors and structural features around the wellbore. Consequently, compared to surface seismic exploration, VSP generally offers superior signal-to-noise ratios and higher vertical resolutions. The VSP thus enables more precise measurements of formation velocity parameters, P- and S-wave properties, anisotropy, amplitude information, and lithological characteristics, thereby significantly enhancing seismic data-interpretation capabilities in structural, stratigraphic, and lithological analysis [1]. In VSP-based data processing, the separation of upward and downward wavefields is a prerequisite for seismic migration imaging [2]. Upgoing waves typically contain valuable information from subsurface reflectors. A clean upgoing wavefield forms the basis for performing Amplitude Versus Angle (AVA) or Off-set (AVO) analysis to identify fluid properties, calculating formation absorption attenuation (Q-value) parameters to evaluate reservoir characteristics and achieving accurate well-seismic time–depth calibration. Downgoing waves, on the other hand, mainly consist of direct waves and multiples [3]. Among these, clear direct waves provide precise seismic wave travel time information, serving as crucial input for calculating accurate interval velocities and establishing reliable velocity models. This is vital for subsequent migration imaging and depth–domain interpretation [4,5]. Furthermore, analyzing the downgoing waves and the multiples they generate aids in understanding the mechanisms of multiple generation within seismic data, subsequently guiding the suppression of multiples in surface seismic data processing. Downgoing waves can also be utilized for extracting the seismic wavelet, performing deconvolution to enhance resolution, and similarly, for estimating the attenuation characteristics of the formation media. In summary, the accuracy of VSP wavefield separation directly dictates the quality of the input data for subsequent processing steps and ultimately determines the reliability and geological significance of the final interpretation results. However, complex geological conditions frequently result in overlapping wavefields contaminated with noise, thereby complicating conventional separation methods [6].

Traditional wavefield-separation approaches can be classified into two categories. The first comprises time–depth separation methods, including Singular Value Decomposition (SVD) [7] and median filtering [8]. SVD leverages differences in seismic wave lateral coherency; however, due to its non-orthogonal nature, SVD performs poorly with VSP wavefields. Median filtering utilizes distinct dip angles in the time-depth domain but requires precise first-break picking and may introduce frequency distortion in high-frequency signals. The second category encompasses transform-domain filtering methods, primarily f-k filtering [9] and Radon transformation [10,11]. These methods separate wavefields by converting time-depth domain data into specific parameter domains. However, both approaches suffer from truncation effects and spatial aliasing, which potentially compromise separation accuracy through manual window selection. These challenges stem partly from the interference of wavefields within VSP records and the potentially time-varying spectral characteristics of the signal itself. Accurately characterizing and simulating such complex signal properties is itself a research focus; for instance, Genovese and Palmeri utilized wavelet transforms to generate random seismic ground motion records with non-stationary characteristics, aiming to more realistically reflect the complex nature of seismic signals [12].

Recently, there has been increasing interest in deep learning for complex non-linear optimization problems [13,14,15]. Deep learning has successfully addressed the challenges of seismic inversion [16] and denoising [17] in seismic data processing. Compared to the limitations of traditional methods, neural networks possess powerful non-linear fitting capabilities. They are able to learn from data and model the complex and highly non-linear mapping relationship between the mixed wavefield and the separated up/downgoing wavefields. This gives them a theoretical advantage in handling complex wavefield superposition problems that are difficult for traditional linear methods to solve. Data-driven Convolutional Neural Networks (CNNs) [5] and Generative Adversarial Networks (GANs) [18], including variants like CGANs with specialized convolution blocks, effectively capture weak wavefield characteristics and mitigate the effects of amplitude disparity. However, the performance of these approaches depends heavily on the quality of the training data and their generalization capabilities. Although Tao [6] and Margrave et al. [19] addressed this limitation using synthetic dataset generation, challenges persist in practical applications due to limited real-world samples. Dalai [20] proposed an unsupervised deep learning method for denoising receiver function data. This approach is based on an encoder–decoder framework, with its core idea being that the network learns the inherent structure of the signal by reconstructing the noisy input itself rather than a clean target during training, thereby achieving denoising and effectively eliminating the need for large-scale labeled data. Lu [21] employed a dual-convolutional autoencoder architecture to realize unsupervised wavefield separation in VSP data. The pursuit of robust unsupervised and physics-guided methods continues to be an active area of research, with investigations into hybrid model-data-driven strategies [22] and the application of physics-informed principles to further constrain network learning.

This study proposes a knowledge-guided unsupervised deep learning method (KGCAE, Knowledge-Guided Convolutional Autoencoder) whose core innovation lies in constructing and integrating multi-dimensional knowledge representations to guide the training process. These include (1) physics-derived representations based on physical principles, (2) morphological representations reflecting structural priors, and (3) directional discrepancy representations characterizing wave propagation patterns. By translating these knowledge representations into loss functions to guide the wavefield separation process, the method aims to combine the representational power of deep learning with physical constraints for up/down-going wave separation, overcoming limitations of traditional approaches and purely data-driven supervised methods.

The proposed KGCAE leverages its dual-autoencoder architecture and unsupervised, knowledge-driven learning paradigm to eliminate dependence on large, labeled datasets, which is typically required by supervised deep learning methods. Through the integration of diverse knowledge representations, this approach guides the network to generate physically plausible and structurally coherent separation results, particularly under challenging geological conditions with severe wavefield aliasing and low signal-to-noise ratios. This knowledge-guided mechanism effectively constrains the solution space, avoiding non-physical artifacts or stability issues common in purely data-driven or morphology-based unsupervised methods, thereby achieving more robust and stable separation performance. However, the method’s performance depends on the accuracy of the input knowledge representations. The physical knowledge relies on the quality of the velocity model obtained through full-waveform inversion (FWI), while the morphological knowledge currently requires manual dip picking.

This study aims to achieve VSP upgoing and downgoing wavefield separation by establishing a comprehensive multi-dimensional knowledge representation system for signal analysis and constructing a knowledge-driven separation framework based on a dual convolutional autoencoder, transforming the knowledge representation system into constraint loss functions applied to the autoencoder. An unsupervised separation of the upgoing and downgoing waves was achieved. Systematic validation experiments conducted on both the homogeneous layer and Marmousi models demonstrated that the proposed methodology effectively processes the VSP upgoing and downgoing wavefield separations under complex geological conditions, effectively avoiding the reliance on large-scale labeled samples inherent in traditional data-driven methods.

2. Knowledge Representation for VSP Wavefield Separation

This section will detail the construction of knowledge representations for the target signals, used to guide the subsequent VSP upgoing and downgoing wave separations. This study extracts and utilizes characteristic information of the wavefield from multiple perspectives, constructing multiple forms of knowledge representations. These include physical knowledge representations derived from the energy flux density vector (Poynting vector), morphological knowledge representations based on shape priors, and directional difference knowledge representations based on wavefield propagation characteristics. These knowledge representations collectively form a comprehensive system for wavefield description.

2.1. Physical Knowledge Representation Based on Energy Flow

The core principle for physically distinguishing upgoing and downgoing waves lies in their direction of energy propagation. This study leverages the energy flux density vector, commonly known as the Poynting vector e, to capture this directional information [23,24,25]. The extraction process involves three key steps: first, obtaining an approximate subsurface velocity model through Full Waveform Inversion (FWI); second, performing elastic wave forward modeling using this model; and third, calculating the Poynting vector from the simulated wavefield to derive the knowledge representations.

To obtain accurate geological model parameters, we adopted a Physics-Informed Recurrent Neural Network (PIRNN)-based full-waveform-inversion method [26]. The geological model parameters mainly comprise the P- and S-wave velocities (

c_{p}, c_{s}

) and density

ρ

, which are key parameters describing the elastic properties of underground media. We can establish the mapping relationship between the geological model parameters and the resulting VSP wavefield. Specifically, we assume that the P- and S-wave velocities in the geological model are represented as

c_{p} \in R^{N \times M}

and

c_{s} \in R^{N \times M}

respectively, where N and M represent the number of vertical and horizontal sampling points. The forward modeling process can be expressed abstractly as

\tilde{X} = F (c_{p}, c_{s}),

(1)

where

\tilde{X} \in R^{T \times N}

represents the VSP wavefield obtained through forward modeling,

T

is the number of time sampling points, and

F (\cdot)

represents the forward process, which is specifically implemented using the PIRNN method. To invert the geological model parameters,

c_{p}

and

c_{s}

, using the observed VSP wavefield,

X

, as the label, the inversion problem of the geological model parameters can be converted into a constrained optimization problem:

{\begin{array}{l} \min_{c_{p}, c_{s}} | | X - \tilde{X} | | \\ subject to \tilde{X} = F (c_{p}, c_{s}) \end{array}}_{,}

(2)

where

\tilde{X} \in R^{T \times N}

is the observed VSP wavefield. By solving the above optimization problem, we can obtain the optimal solution of the geological model parameters,

c_{p}^{*} a n d c_{s}^{*}

, and obtain the approximate solution of the seismic wavefield,

{\tilde{X}}^{*}

, through forward modeling. To achieve VSP wavefield simulation, the PIRNN incorporates the physical constraints of the wave equation into the neural network using the RNN’s time-stepping mechanism for forward modeling. Through the backpropagation algorithm, the PIRNN can invert the geological model parameters from the observed VSP wavefield, thereby providing a foundation for the subsequent upgoing- and downgoing wave knowledge representation.

After obtaining

c_{p}^{*}

and

c_{s}^{*}

, the approximate solution of the seismic wavefield,

{\tilde{X}}^{*}

, is obtained through a forward modeling process. During this process, the energy flux density vector [27,28] is introduced to describe the propagation direction and distribution of energy within the wavefield. We simulated the wavefield propagation in the two-dimensional depth-offset space. The behavior of the displacement field

u = (u_{x}, u_{z})

can be effectively described by decomposing it using the dilatation scalar

θ = \nabla \cdot u

and the rotation vector

ω = \nabla \times u

. The elastic wave equation, expressed in terms of these P-wave and S-wave related potentials, is given by [28,29]

\frac{\partial^{2} u}{\partial t^{2}} = {c_{p}^{*}}^{2} \nabla θ - {c_{s}^{*}}^{2} \nabla \times ω,

(3)

where the estimated velocities

c_{p}^{*}

and

c_{s}^{*}

are used. Solving Equation (3) numerically yields an approximate full wavefield solution

X^{*}

. From this solution, we can derive the necessary quantities for energy analysis, including the particle velocity

v = \frac{\partial u}{\partial t}

the dilatation scalar

θ

, and the rotation vector

ω

.

Lastly, based on the simulated wavefield

X^{*}

, we calculate the energy flux density vector,

e

, to understand the direction and distribution of energy propagation [27,28]. The vector e is composed of P-wave and S-wave components,

e_{p}

and

e_{s}

, respectively:

\{\begin{cases} e_{p} = - θ v_{p} \\ e_{s} = v_{s} \times ω \end{cases},

(4)

where

e_{p} = (e_{p x}, e_{p z})

is the energy flux density vector of the P waves and

e_{s} = (e_{s x}, e_{s z})

is the energy flux density vector of the S waves.

v_{p}

is the P-wave particle vibration velocity and

v_{s}

is the S-wave particle vibration velocity. Equation (4) clarifies the interrelationship between the P- and S-waves during time propagation and elucidates the mechanism whereby they couple with the dilatation scalar and rotation vectors. The total energy flux density vector is expressed as follows:

e = e_{p} + e_{s} .

(5)

The energy flux density vector,

e

, not only describes the direction and intensity of energy propagation but also provides a basis for distinguishing between upgoing and downgoing waves. From Equation (5), the P-wave component,

e_{p}

, is related to the dilatation scalar and the P-wave particle vibration velocity field,

v_{p}

, whereas the S-wave component,

e_{s}

, is related to the rotation vector,

ω

. We utilize the sign of the vertical component of the P-wave energy flux vector,

e_{p z}

, to determine the primary wave propagation direction. A positive

e_{p z}

generally corresponds to upward energy flow, and a negative value to downward flow. This directional information allows us to mask the seismic wavefield approximate solution,

X^{*}

, to extract the knowledge representation of the upward and downward wavefields:

\begin{array}{l} U_{r p} (t, z_{r}) = \{\begin{array}{l} {\tilde{X}}^{*} (t, z_{r}), & if e_{p z} \geq 0 \\ 0, & otherwise \end{array} \\ D_{r p} (t, z_{r}) = \{{\begin{array}{l} {\tilde{X}}^{*} (t, z_{r}), & if e_{p z} < 0 \\ 0, & otherwise \end{array}}_{,} \end{array}

(6)

where

U_{r p}

represents the knowledge representation of the upward P-waves,

D_{r p}

represents the knowledge representation of the downward P-waves, and

e_{p z}

is the vertical component of the P-wave energy flux density vector,

e_{p}

. Figure 1 shows the extraction process of the physical knowledge representation, where (a) and (e) represent the parts of the P-wave energy flux density vector, with

e_{p z} > 0

and

e_{p z} < 0

obtained through the above forward process, showing the distribution of the upward and downward waves. By applying this equation to the seismic wavefield approximate solution,

X^{*}

, in (c), we can obtain the physical knowledge representation of the upgoing and downgoing waves in (d) and (e), respectively. This knowledge representation, which is derived directly from physical models, has a clear physical meaning and can reflect the direction of wavefield propagation to provide important information on physical constraints for subsequent upgoing and downgoing wave separation.

2.2. Morphological Knowledge Representation

In near-VSP data, odd-order-reflected waves (downward waves) typically propagate downward, whereas even-order-reflected waves (upward waves) typically propagate upward. This difference in the direction of propagation confers distinct apparent velocity and slope characteristics for the upward and downward waves in the depth-time domain. Specifically, the upgoing wavefield forms a negative-slope inverse pattern in the depth-time domain, whereas the downgoing wavefield forms a positive-slope direct pattern. As shown in Figure 2, the near-offset VSP record of a simple layered model reveals that upgoing and downgoing waves have a set of axial dip angles with the horizontal interface, which is denoted as

α = \{α_{i}| 1 \leq i \leq p, α_{i} \in R\}

and

β = \{β_{i}| 1 \leq i \leq q, β_{i} \in R\}

. To utilize these morphological features to constrain the upgoing and downgoing wavefields, we introduced a knowledge representation based on morphological priors. This representation aims to emphasize the axial continuity and slope differences between the upward and downward wavefields, thereby improving the accuracy of wavefield separation.

To extract this knowledge representation, we used a modified Sobel operator to calculate the gradients in the axial direction. The Sobel operator is a commonly used discrete differential operator for calculating the approximate gradients of images and signals. It detects edges and directional changes in images using convolution operations. Figure 3a,b shows the basic Sobel operators, which represent the horizontal and vertical convolution kernels, respectively. To accommodate the slope characteristics of the upward and downward wavefields, the Sobel operator must be modified to capture the gradient information in different directions. Specifically, we rotated the Sobel operator as follows:

\nabla_{α_{1}} = \cos α_{1} * A + \sin α_{1} * B,

(7)

where

α_{1}

represents the rotation angle,

A

and

B

represent the horizontal and vertical Sobel operators shown in Figure 3a,b, and

\nabla_{α_{1}}

represents the rotated gradient operator. Taking

α_{1} = 45 °

as an example, the rotated Sobel operator is shown in Figure 3c, which can effectively capture the axial gradient information of the downgoing waves that form a 45° angle with the horizontal plane.

Lu et al. [21] proposed a gradient knowledge representation based on morphological priors. Let

\tilde{U}

be the network’s prediction for upgoing waves and

\tilde{D}

be the prediction for downgoing waves. The gradient along the axial direction is calculated as follows:

G (α, \tilde{U}) = m i n \{\nabla_{α_{i}} * \tilde{U}| 1 \leq i \leq p, p \geq 1\} G (β, \tilde{D}) = m i n \{\nabla_{β_{i}} * \tilde{D}| 1 \leq i \leq q, q \geq 1\},

(8)

where

*

represents convolution operation; min represents element-wise minimum operation;

α = \{α_{i}| 1 \leq i \leq p, α_{i} \in R\}, β = \{β_{i}| 1 \leq i \leq q, β_{i} \in R\}

represents the axial dip angles of upgoing and downgoing wavefields in the depth-time domain;

\nabla_{α_{i}}

and

\nabla_{β_{i}}

represent the modified Sobel operators that are constructed in accordance with the axial-dip angle sets of upgoing and downgoing waves; and

G (α, \tilde{U})

and

G (β, \tilde{D})

are the calculated gradient information matrices. As upgoing and downgoing waves have characteristics of axial slope differences and axial continuity, the wavefields should be smooth along their axial directions (which imply small variations in their axial directions). This knowledge representation can be used to construct loss functions based on

G (α, \tilde{U})

and

G (β, \tilde{D})

, guiding the network to optimize axial continuity during training, thus achieving morphological constraints on upgoing and downgoing waves. Figure 3 shows the basic horizontal and vertical Sobel operators (Figure 3a,b) and an example of a Sobel operator rotated by 45° (Figure 3c), which form the basis for constructing directionally sensitive gradient detectors. Figure 4 illustrates the process of extracting the morphological prior-knowledge representation. First, the axial dip angle sets,

α

and

β

, of the upgoing and downgoing waves are manually picked from the original VSP data. Then, modified Sobel operators are constructed based on the dip angle sets. Finally, the axial gradient information is extracted from the predicted upgoing and downgoing wavefields,

\tilde{U}

and

\tilde{D}

, using their corresponding modified Sobel operators.

2.3. Directional Difference Knowledge Representation

In the VSP records, the upgoing and downgoing waves exhibited significant directional differences. The zero-mean characteristic of the VSP wavefield is shown in the VSP wavefield diagram of the layered model in Figure 5. There is a notable difference between the upward and downward waves, which can be represented by the angle between them; the wavefield mean approaches zero. Based on this observation, we introduced cosine similarity to characterize this knowledge representation. Cosine similarity is defined as the cosine of the angle between two vectors, with values ranging from [–1, 1].

When two vectors are orthogonal, the cosine similarity is 0, and when two vectors are identical or opposite, the value is 1 or –1. However, because the VSP wave-separation problem scenario includes negative signals, we retained the cosine similarity in the interval [0, 1] through an absolute value operation. Let

\tilde{U}

be the predicted upgoing wavefield and

\tilde{D}

be the predicted downgoing wavefield. Then, the cosine similarity is defined as

\cos (\tilde{U}, \tilde{D}) = \frac{\tilde{U} \cdot \tilde{D}}{‖\tilde{U}‖ ‖\tilde{D}‖},

(9)

where

\tilde{U} \cdot \tilde{D}

represents the dot product of the two wavefields, and

‖\tilde{U}‖

and

‖\tilde{D}‖

represent the L2 norms of the two wavefields, respectively. In the knowledge representation of wavefield difference, we used the absolute value of the cosine similarity as a measure of the directional difference:

L_{d i r e c t i o n} = |\cos (\tilde{U}, \tilde{D})| .

(10)

3. Methodology

3.1. Modeling of Upgoing and Downgoing Wave Separation Problem

The goal of signal separation is to recover the unknown upgoing wavefield,

U \in R^{T \times N}

, and downgoing wavefield,

D \in R^{T \times N}

, from the observed VSP record,

X \in R^{T \times N}

. This study employed a U-Net architecture using two independent networks to predict the upward and downward wavefields. When deriving the solution, physical knowledge representations (

U_{r p}

and

D_{r p}

), morphological knowledge representations, and wavefield directional difference knowledge representations were explicitly introduced as constraints to enhance the physical consistency and accuracy of the wavefield separation. Specifically, our modeling of the upgoing and downgoing wave-separation problems is as follows:

\begin{matrix} a r g m i n \\ θ_{u}, θ_{d} \end{matrix} \underset{Reconstruction}{\underset{⏟}{{| | X - \tilde{X} | |}_{h}}} + λ_{1} \underset{Physical rules}{\underset{⏟}{{| | U_{r p} - \tilde{U} | |}_{h}}} + λ_{2} \underset{Physical rules}{\underset{⏟}{{| | D_{r p} - \tilde{D} | |}_{h}}} + λ_{3} \underset{Morphological}{\underset{⏟}{{| | G (α, \tilde{U}) ⊙ G (β, \tilde{D}) | |}_{F}}} + λ_{4} \underset{Direction Difference}{\underset{⏟}{|\cos (\tilde{U}, \tilde{D})|}} s . t . \tilde{X} = \tilde{U} + \tilde{D} \tilde{U} = G_{u, θ_{u}} (X) \in R^{T \times N} \tilde{D} = G_{d, θ_{d}} (X) \in R^{T \times N},

(11)

where

X

represents the original VSP record,

\tilde{X}

represents the predicted reconstructed wavefield, and the remaining terms are the constraints introduced from the knowledge representations, including the constraints in the physical, morphological, and propagation-direction difference. Furthermore,

| | \cdot | |

：

R^{T \times N} \to R^{+}

is an operator for measuring error,

⊙

represents element-wise multiplication,

G_{u ，} G_{d}

：

R^{T \times N} \to R^{T \times N}

represent the two-branch network-mapping functions for predicting upgoing and downgoing wavefields, which recover

\tilde{U}

and

\tilde{D}

from the original wavefield

X

, and

θ_{u}

,

θ_{d}

represent the parameters of

G

.

This modeling process not only needs to measure the reconstruction error between the observed record,

X

, and the reconstructed wavefield,

\tilde{U} + \tilde{D}

, but also the error between the predicted wavefields,

\tilde{U}

and

\tilde{D}

, and their knowledge representations. This constrains the continuity and directional consistency of the wavefield to ensure the physical reasonableness of wavefield separation.

3.2. Network Structure

This study adopts a dual convolutional autoencoder network architecture guided by signal knowledge representation, called the Knowledge-Guided Convolution Autoencoder (KGCAE), to achieve upgoing and downgoing wavefield separation [21]. Figure 6 shows the constructed autoencoder network architecture, where Net-U and Net-D are responsible for predicting the upgoing and downgoing wavefields, respectively. The input was the original VSP record and the output was the predicted upgoing and downgoing wavefields of the same shape as the original record.

Using Net-U as an example, the specific structures and parameters are listed in Table 1. Here, we assumed that the original input VSP record had the shape (1600, 100, 1), representing 1600 time points, 100 detectors, and 1 channel, and Input data are normalized into the [−1, 1] interval for better convergence. The overall structure consisted of encoding and decoding layers. The encoding layer is employed for feature extraction and compression of VSP data, as illustrated in Figure 7. It specifically comprises three convolutional layers and two pooling layers. The first convolutional layer is set to 32 channels to capture basic, low-level features; an excessive number of channels would increase computational redundancy. The second layer expands to 64 channels to encode more complex, high-level features, thereby enhancing the feature representation capability. The third layer compresses the channel count back to 32, aiming to avoid excessive feature redundancy while retaining critical information for the decoding layer. To balance the receptive field and simultaneously reduce the loss of boundary information, the convolutional kernel size is set to 3 × 3. The pooling layers utilize 2 × 2 max pooling to enhance the feature extraction capability. The decoding layer comprised 4 convolutional and 2 upsampling layers for data feature reconstruction and recovery. The channel counts for the convolutional layers were set to 32, 64, 32, and 1, respectively; the final layer outputs single-channel reconstructed data with dimensions consistent with the original input VSP data. The convolutional kernel size was 3 × 3. The upsampling layers employed 2 × 2 nearest-neighbor upsampling. We use the tanh activation function instead of ReLU because the value interval contains both positive and negative values, The use of the ReLU function would lead to value overflow. Figure 7 shows the upgoing and downgoing wave-separation processes.

3.3. Loss Function

To utilize knowledge representations for guiding wave separation, this study modeled them as a loss function,

L

, which includes the reconstruction, representation, gradient, and cosine losses. The specific loss function is defined as follows:

L = L_{r e c} + λ_{1} L_{r e p} + λ_{2} L_{g r a d} + λ_{3} L_{c o s},

(12)

where

L_{r e c}

represents the reconstruction loss that measures the difference between the original and reconstructed wavefields,

L_{r e p}

represents the representation loss, which measures the difference between the predicted and physics-knowledge-represented wavefields,

L_{g r a d}

represents the gradient loss obtained from the morphological knowledge representation, emphasizing the wavefield continuity, and

L_{c o s}

represents the cosine loss obtained from the knowledge representation of the difference in the propagation direction, used to reduce the aliasing phenomena in wavefield separation. Next, we describe the principles of each loss function and their roles in wavefield separation.

3.3.1. Reconstruction Loss

The reconstruction loss,

L_{r e c}

, was used to measure the difference between the sum of the upgoing wavefield,

\tilde{U}

, and the downgoing wavefield,

\tilde{D}

, predicted by the dual convolutional autoencoder and the original VSP record,

X

, specifically defined as

L_{r e c} = {||X - \tilde{U} - \tilde{D}||}_{h},

(13)

where

| | \cdot | |_{h}

represents the error measure based on the Huber function, which is specifically defined as:

f_{h} (s) = \{\begin{matrix} \frac{1}{2} * s^{2}, i f |s| \leq d \\ d * |s| - \frac{1}{2} * d^{2}, i f |s| > d \end{matrix} .

(14)

The Huber function uses squared loss for small errors and linear loss for larger errors, thus combining the advantages of both the mean squared error and absolute error. This effectively reduces the sensitivity to outliers. The introduction of the reconstruction loss ensures that when the model predicts the upgoing and downgoing wavefields, the overall wavefield can accurately reconstruct the original observed data. This loss term encourages the model to maintain the overall consistency and physical reasonability of the wavefield during the separation of the upward and downward waves.

3.3.2. Morphological Knowledge Representation Loss

The gradient loss,

L_{g r a d}

, is derived from morphological knowledge and is used to measure the axial continuity of the upward and downward wavefields predicted by the network, specifically defined as

L_{g r a d} = {||G (α, U) ⊙ G (β, D)||}_{F}^{2},,

(15)

where

⊙

represents element-wise multiplication,

| | \cdot | |_{F}

represents the L2 norm, and

α = \{α_{i}| 1 \leq i \leq p, α_{i} \in R\}, β = \{β_{i}| 1 \leq i \leq q, β_{i} \in R\}

represent the slope sets of upgoing and downgoing wavefields, respectively.

Lu [21] built a dual convolutional autoencoder for wavefield separation based on the gradient loss function, called dualCAE, and achieved certain results. However, the gradient loss is based only on morphological priors; the dip-angle sets of the upward and downward wavefields must be manually selected. For wavefield separation with a complex morphology, the stability is relatively poor and requires complex parameter adjustments. In the next section, we introduce the representation loss and improve the separation results by incorporating physical rules.

3.3.3. Physical Knowledge Representation Loss

The representation loss,

L_{r e p}

, is derived from physical knowledge and aims to enhance the accuracy and stability of the upgoing and downgoing wavefield separation by introducing physical rules. The representation loss measures the difference between the model-predicted upward wavefield,

\tilde{U}

, and downward wavefield,

\tilde{D}

. The knowledge representation wavefields,

U_{r p}

and

D_{r p}

, are extracted based on the energy flux density vectors, specifically defined as

L_{r e p} = {||U_{r p} - \tilde{U}||}_{h} + {||D_{r p} - \tilde{D}||}_{h},

(16)

where

| | \cdot | |_{h}

represents the error measurement using the Huber function. By introducing representation loss, the model not only focuses on the overall wavefield reconstruction but also emphasizes the consistency between the predicted wavefields and the knowledge representation of the upward and downward wavefields extracted through physical methods. This loss ensures that, during the separation of the upgoing and downgoing waves, the model can accurately capture the physical characteristics and propagation directions of the wavefield, thereby improving the separation accuracy. Furthermore, in complex geological structures, the representation loss provides additional constraints that prevent unreasonable wavefield reconstructions during separation, thereby enhancing the stability of the separation process.

3.3.4. Directional Difference Knowledge Representation Loss

The Cosine Loss,

L_{c o s}

, is derived from the knowledge representation of the differences in propagation direction and is primarily used to further optimize the directional differences between the upgoing and downgoing wavefields. The cosine loss is calculated as follows:

L_{c o s} = \frac{1}{N} \sum_{i = 0}^{N - 1} {||c o s (u_{i}, d_{i})||}_{2},

(17)

where

N

is the number of receivers,

u_{i}

and

d_{i}

represent the values of the predicted upgoing wavefield,

\tilde{U}

, and downgoing wavefield,

\tilde{D}

, at the i-th receiver from the autoencoder, respectively, and

| | \cdot | |_{2}

denotes the L2 norm of a vector. The introduction of cosine loss aims to create distinct differences between the upgoing and downgoing wavefields, reduces aliasing phenomena in wavefield separation, and ensures that the separated wavefields conform better to actual physical laws.

4. Numerical Examples

This section evaluates the effectiveness of the proposed up/down-going wave separation method, which is based on target signal knowledge representation, through numerical experiments. Two velocity models were selected for the experiments: a simple horizontally layered geological model and the two-dimensional (2D) elastic Marmousi model. We conducted ablation studies and comparative experiments on the horizontally layered velocity model and also plotted comparison curves showing single-trace separation results from different methods. On the Marmousi model, comparative tests were performed against the traditional f-k method to validate the effectiveness of knowledge representation in the process of separating up- and down-going waves. All numerical simulations and model training were completed under a unified hardware environment, utilizing an NVIDIA A6000 GPU computing platform and implemented using the TensorFlow deep learning framework.

4.1. Validation on Homogeneous Layered Velocity Model

The horizontally layered velocity model consisted of four distinct layers with P-wave, S-wave, and density parameters, as illustrated in Figure 8b. Velocity and density inversion interfaces formed within a four-layer geological structure are crucial for studying phenomena such as the generation of reflected waves and inter-layer multiples. The synthetic model used in this research aims to provide a clear and controllable environment to validate and demonstrate the core ideas and methods of our study. Although the density values are on the lower side relative to their corresponding P-wave velocities according to Gardner’s empirical formula, this allows for the inclusion of atypical conditions under specific geological settings, such as high porosity or particular fluid fills, thereby effectively supporting the core argument of this method. The observation system is illustrated in Figure 8b, which features a source point positioned 50 m below the ground with a 12 m offset. Receivers were distributed at depths ranging from 250 to 4250 m, with 10 m intervals between each receiver. A Ricker wavelet with a dominant frequency of 50 Hz was used as the source signal. The VSP data were generated through elastic wave forward modeling; the modeling parameters are listed in Table 2. The resulting raw VSP records are presented in Figure 8a, with the corresponding f-k spectra shown in Figure 8c, where the energy concentration is apparent at approximately 50 Hz.

4.1.1. Ablation Experiments on the Homogeneous Layered Model

In accordance with Equation (11), the loss function weights for the experiments in this section were configured as listed in Table 3. Each case runs for 10,000 iterations, with each epoch taking 100 ms.

To validate the effectiveness of physical knowledge representation and gradient loss in constraining the upgoing and downgoing wavefields, three scenarios were examined:

Application of reconstruction loss and physical knowledge representation loss;
Application of reconstruction loss, Physical Knowledge Representation Loss, and Morphological Knowledge Representation Loss;
Application of reconstruction loss, Physical Knowledge Representation Loss, Morphological Knowledge Representation Loss, and Directional Difference Knowledge Representation Loss.

Figure 9a–h shows the experimental results for the three cases. Specifically, (a) and (e) show the knowledge representation of the layered model; (b) and (f) illustrate the separation results from the first case; (c) and (g) present the outcomes from the second case; and (d) and (h) display the results from the third case. The first and second rows represent the separated upgoing and downgoing wavefields, respectively. Based on (a) and (e), the knowledge representation of the upgoing and downgoing wavefields was not entirely accurate. As the energy flux density vectors can only reflect the total energy information of the wavefield, the wave knowledge representation derived from their directions exhibits notable separation artifacts. Consequently, significant axial discontinuities appear at the convergence of upgoing and downgoing waves. Figure 9b,f demonstrates the network output when applying reconstruction and representation losses. Although the output should align with its knowledge representation when only representation loss is applied, it notably improved the axial discontinuity issue. This enhancement is attributed to parameter sharing in the CNNs and the multiscale invariance of the wave axial characteristics. However, the separation artifacts remained visible in the output. Figure 9c,g shows the network output under the second configuration. With the addition of gradient regularization, the separation artifacts in the wavefields were effectively reduced, although not completely eliminated, whereas the axial continuity of the wavefields was enhanced. However, due to the strong residual energy in the first arrival portion of the upgoing wave representation, some downgoing wave residuals persisted in this region. Figure 9d,h presents the network output under the third configuration. The incorporation of cosine loss encourages the model to learn distinctly different directional features for the upgoing and downgoing waves, thereby improving the separation accuracy. The results demonstrated significantly enhanced wavefield separation with cleaner separation outcomes.

4.1.2. Comparative Experiments on the Homogeneous Layered Model

To further investigate the effectiveness of the proposed KGCAE methodology, we conducted wavefield separation on the aforementioned simple-layered model and VSP data, comparing it with the conventional f-k filtering method and the morphological feature-guided (dualCAE) approach [21]. Table 4 lists the hyperparameter configurations for KGCAE and dualCAE methods. The proposed KGCAE method runs for 10,000 iterations, with each epoch consuming 100 ms. For the f-k filtering, a rectangular window was applied in the f-k domain. The separation was performed based on the sign of the apparent velocity associated with the wave energy: energy corresponding to negative apparent velocities was attributed to the upgoing wavefield, while energy with positive apparent velocities was assigned to the downgoing wavefield.

Figure 10 presents a comparison of the separation results from the three methods. The f-k filtering results are shown in (a) and (d), (b) and (e) display the separation results using the dualCAE method, (c) and (f) show the results obtained using the proposed KGCAE methodology. The corresponding frequency-beam diagrams are illustrated in (g–l). From a comprehensive perspective, three methodologies successfully achieved distinct wavefield separation, mataining axial continuity and zero-mean characteristics in both the upward and downward waves. However, as observed in Figure 8c, spatial aliasing exists in the raw VSP data. When the energy distributions of upgoing and downgoing waves overlap near zero apparent velocity, energy leakage occurs in the separation results (Figure 10a,d). Additionally, ringing artifacts may emerge during the conversion from the frequency-wavenumber (f-k) domain to the time-depth domain, as highlighted within the dashed box in Figure 10a. In contrast, the proposed KGCAE method demonstrated superior performance in terms of energy intensity and separation clarity. A notable advantage of our methodology lies in its ability to process raw VSP data directly without domain transformation. It automatically allocates frequency-domain aliasing to its respective upgoing and downgoing components without requiring explicit frequency-domain filtering, thereby effectively mitigating spectral aliasing challenges. The energy is correctly partitioned into upgoing and downgoing wavefields, reducing leakage and yielding clearer f-k spectra in the separated wavefields. Compared to conventional f-k filtering and dualCAE methods, this approach achieves higher fidelity.

Comparing the convergence processes of KGCAE and dualCAE algorithms, we focused on the reconstruction loss evolution rather than the total loss, because the loss function compositions and weights differ. As illustrated in Figure 11, the blue curve (dualCAE) exhibits significant fluctuations during convergence, whereas the red curve (KGCAE) demonstrates notably smaller oscillations. The reconstruction loss convergence curves indicate that the proposed method surpasses dualCAE in terms of convergence speed and stability. The f-k method, being a direct filtering process based on the described criterion, does not have an analogous iterative convergence behavior.

4.2. Validation on Marmousi Model

To evaluate the performance and stability of our proposed KGCAE method against the dualCAE method [21] and F-K filtering method for complex structures, this study employed a widely recognized Marmousi model.

The Marmousi model used in this experiment is shown in Figure 12. Specifically, Figure 12a–c represents the P-wave velocity, S-wave velocity, and density distributions of the Marmousi model, respectively. The observation system configuration, as indicated in Figure 12a, consists of a source positioned at the surface with a 40 m offset while receivers are distributed underground at intervals of 5 m, spanning depths from 0 to 500 m. A Ricker wavelet with a dominant frequency of 30 Hz was employed as the source signal. The VSP data were generated through elastic wave forward modeling, with the specific simulation parameters detailed in Table 5. The resulting VSP forward modeling record is presented in Figure 12d, and the corresponding f-k spectra are shown in Figure 12e. The f-k spectrum clearly demonstrates the presence of distinct upgoing and downgoing wave energies in the original VSP record, with energy concentrated around the 30 Hz frequency band.

Comparative Experiments on the Marmousi Model

Comparative experiments were conducted using the dualCAE method and F-K filtering method. The hyperparameters for the KGCAE method and the dualCAE method were configured as listed in Table 6, in accordance with the loss function defined in Equation (12). KGCAE is trained for 10,000 iterations at 90 ms per epoch.

Figure 13a,b illustrates the physical knowledge representation of the upgoing and downgoing waves, respectively, demonstrating the wavefield distribution extracted based on the energy flux density vectors. Figure 13c,d shows the frequency–beam spectra of (a,b). Figure 14 shows the separation results and their frequency-wavenumber spectra for three methods applied to the Marmousi model.

Figure 14a,d,g,j displays the separation results and their f-k spectra using the F-K filtering method. As evident from the f-k spectrum of the original VSP data in Figure 12c, severe spatial aliasing exists between upgoing and downgoing waves. Since F-K filtering separates wavefields by distinguishing opposite apparent velocity directions, noticeable aliasing artifacts remain in the separation results, as indicated by the black arrows in Figure 14d where upgoing wave components contaminate the downgoing wavefield. Moreover, similar to the comparative experiments on the homogeneous layered model, the F-K filtering results still exhibit leakage and ringing effects. Figure 14c,f,i,l shows the separated upgoing and downgoing waves and their frequency–wavenumber spectra obtained by the KGCAE while Figure 14b,e,h,k shows the corresponding results from the dual CAE method, demonstrating that, compared to the dual CAE, the KGCAE more effectively utilizes the axial and numerical distribution information from the knowledge representation of the wavefield. This leads to clearer axial characteristics in the separated wavefields, better preservation of the zero-mean properties, and more effective elimination of high-frequency noise, thereby maintaining the frequency characteristics of the original signal. This indicates that the KGCAE exhibits superior performance in maintaining physical consistency and authentic signal properties when processing complex wavefields.

Although the dualCAE method achieves relatively clean separation results through morphological prior constraints with clear and continuous axial characteristics in the target wavefield, it has certain limitations. Specifically, the first arrival portion of the upgoing wavefield appears chaotic; DC noise contamination is evident at the wavefield boundaries due to edge effects (as shown in Figure 14b,e). The frequency–wavenumber spectra further confirm these observations, revealing the limitations of dualCAE in managing boundary effects, particularly in complex geological structures, where the separation performance is less stable and precise than that of KGCAE. Furthermore, Figure 15 presents a comparative analysis of the reconstruction loss,

L_{r e c}

, for both methods, with the red curve representing KGCAE and the blue curve representing dualCAE. Analysis revealed that the proposed method demonstrated faster convergence and greater stability during training. This indicates that constraining the wavefield separation through physics-based knowledge representation leads to a more efficient optimization, thereby more rapidly achieving stable separation results.

5. Conclusions

In this article, we constructed a VSP upgoing and downgoing wavefield-separation method utilizing signal knowledge representation. First, our methodology involved establishing a signal knowledge representation system and its acquisition methodology, followed by the construction of a dual convolutional autoencoder knowledge-driven separation framework. By transforming the knowledge representation into constraint loss functions, an effective unsupervised separation of the upward and downward waves was achieved. While this study utilized a dual convolutional autoencoder framework, the proposed signal knowledge representation system and its corresponding constraint losses could potentially be adapted to guide other neural network architectures, such as U-Net, for VSP wavefield separation. Systematic validation experiments on horizontally layered and Marmousi models demonstrate that this method effectively addresses the VSP wavefield separation challenges under complex geological conditions.

However, this method has several limitations. The accuracy of physical knowledge representation acquisition depends on the precision of the full waveform inversion, whereas the axial dip angles in morphological knowledge representation require manual picking. Furthermore, although the knowledge representation of the propagation directional difference reflects the orthogonality of the upgoing and downgoing waves, their propagation directions in the actual VSP data are not completely orthogonal. This potentially affects the generalization capability of the model. To address these limitations, future studies should employ more robust inversion algorithms to obtain a more accurate representation of physical knowledge and investigate intelligent feature extraction algorithms for automated morphological feature recognition.

Author Contributions

Conceptualization, C.L.; formal analysis, J.G.; investigation, J.G.; methodology, C.L., L.Q. and J.L.; software, C.L. and L.Q.; validation, J.G.; writing—original draft, J.L.; writing—review and editing, J.L. All authors have read and agreed to the published version of the manuscript.

Funding

The research was supported by the National Natural Science Foundation of China (Grant No. 42474168).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

We are grateful to the reviewers for their valuable comments.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CNN	Convolutional Neural Network
GAN	Generative Adversarial Network
FWI	Full Waveform Inversion
KGCAE	Knowledge-Guided Convolution Autoencoder
ReLU	Rectified Linear Unit
SVD	Singular Value Decomposition
VSP	vertical seismic profiling

References

Stewart, R.R.; Huddleston, P.D.; Kan, T.K. Seismic versus sonic velocities: A vertical seismic profiling study. Geophysics 1984, 49, 1153–1168. [Google Scholar] [CrossRef]
Hinds, R.C.; Anderson, N.L.; Kuzmiski, R.D. VSP Interpretive Processing: Theory and Practice; Society of Exploration Geophysicists: Houston, TX, USA, 1996; Volume 1, p. 214. [Google Scholar] [CrossRef]
Blias, E. VSP wavefield separation: Wave-by-wave optimization approach. Geophysics 2007, 72, T47–T55. [Google Scholar] [CrossRef]
Suprajitno, M.; Greenhalgh, S.A. Separation of upgoing and downgoing waves in vertical seismic profiling by contour-slice filtering. Geophysics 1985, 50, 950–962. [Google Scholar] [CrossRef]
Aminzadeh, F. A recursive method for the separation of upgoing and downgoing waves of vertical seismic profiling data. Geophysics 1986, 51, 2206–2218. [Google Scholar] [CrossRef]
Tao, B.; Yang, Y.; Zhou, H.; Wang, Y.; Lyu, F.; Li, W. Deep learning-based upgoing and downgoing wavefield separation for vertical seismic profile data. Geophysics 2023, 88, D339–D355. [Google Scholar] [CrossRef]
Freire, S.L.M.; Ulrych, T.J. Application of singular value decomposition to vertical seismic profiling. Geophysics 1988, 53, 778–785. [Google Scholar] [CrossRef]
Stewart, R.R. Median filtering: Review and a new F/K analogue design. J. Can. Soc. Explor. Geophys. 1985, 21, 54–63. [Google Scholar]
Treitel, S.; Shanks, J.L.; Frasier, C.W. Some aspects of fan filtering. Geophysics 1967, 32, 789–800. [Google Scholar] [CrossRef]
Yang, Y.; Lu, J.; Wang, Y. Vertical seismic profile wavefield separation using median filtering constrained by the linear radon transform. Appl. Sci. 2018, 8, 1494. [Google Scholar] [CrossRef]
Moon, W.; Carswell, A.; Tang, R.; Dilliston, C. Radon transform wave field separation for vertical seismic profiling data. Geophysics 1986, 51, 940–947. [Google Scholar] [CrossRef]
Genovese, F.; Palmeri, A. Wavelet-based generation of fully non-stationary random processes with application to seismic ground motions. Mech. Syst. Signal Process. 2025, 223, 111833. [Google Scholar] [CrossRef]
Seydoux, L.; Balestriero, R.; Poli, P.; Hoop, M.; Campillo, M.; Baraniuk, R. Clustering earthquake signals and background noises in continuous seismic data with unsupervised deep learning. Nat. Commun. 2020, 11, 3972. [Google Scholar] [CrossRef] [PubMed]
Wang, B.; Zhang, N.; Lu, W.; Wang, J. Deep-learning-based seismic data interpolation: A preliminary result. Geophysics 2019, 84, V11–V20. [Google Scholar] [CrossRef]
Wei, Y.; Li, Y.E.; Zong, J.; Yang, J.; Fu, H.; Sun, M. Deep learning-based P-and S-wave separation for multicomponent vertical seismic profiling. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5908116. [Google Scholar] [CrossRef]
Sun, J.; Niu, Z.; Innanen, K.A.; Li, J.; Trad, D.O. A theory-guided deep-learning formulation and optimization of seismic waveform inversion. Geophysics 2020, 85, R87–R99. [Google Scholar] [CrossRef]
Dong, X.; Lin, J.; Lu, S.; Huang, X.; Wang, H.; Li, Y. Seismic shot gather denoising by using a supervised-deep-learning method with weak dependence on real noise data: A solution to the lack of real noise data. Surv. Geophys. 2022, 43, 1363–1394. [Google Scholar] [CrossRef]
Cao, D.; Chen, X.; Jia, Y.; Jin, C.; Fu, X. Upgoing and downgoing wavefield separation in VSP data using CGAN based on asymmetric convolution blocks. J. Geophys. Eng. 2024, 21, 1511–1525. [Google Scholar] [CrossRef]
Margrave, G.F. Stratigraphic Filtering and Q Estimation. CREWES Res. Rep. 2014, 26, 18. [Google Scholar]
Dalai, B.; Kumar, P.; Srinu, U.; Sen, M.K. De-noising receiver function data using the unsupervised deep learning approach. Geophys. J. Int. 2022, 229, 737–749. [Google Scholar] [CrossRef]
Lu, C.; Mu, Z.; Zong, J.; Wang, T. Unsupervised VSP up-and downgoing wavefield separation via dual convolutional autoencoders. IEEE Trans. Geosci. Remote Sens. 2023, 62, 5900315. [Google Scholar] [CrossRef]
Wen, Y.; Qian, F.; Guo, W.; Zong, J.; Peng, D.; Chen, K.; Hu, G. VSP Upgoing and Downgoing Wavefield Separation: A Hybrid Model-Data Driven Approach. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5908014. [Google Scholar] [CrossRef]
Aki, K.; Richards, P.G. Quantitative Seismology; W. H. Freeman and Company: San Francisco, CA, USA, 2002. [Google Scholar]
Goodier, J.N.; Timoshenko, S. Theory of Elasticity; McGraw-Hill: New York, NY, USA, 1970. [Google Scholar]
Ma, D.; Zhu, G. Numerical modeling of P-wave and s-wave separation in elastic wavefield. Oil Geophys. Prospect. 2003, 38, 482–486. [Google Scholar]
Lu, C.; Wang, Y.; Zou, X.; Zong, J.; Su, Q. Elastic full-waveform inversion via physics-informed recurrent neural network. IEEE Trans. Geosci. Remote Sens. 2024, 62, 4510616. [Google Scholar] [CrossRef]
Kiselev, A.P. Energy flux of elastic waves. J. Math. Sci. 1982, 19, 1372–1375. [Google Scholar] [CrossRef]
Tang, H.-G.; He, B.-S.; Mou, H.-B. P-and S-wave energy flux density vectors. Geophysics 2016, 81, T357–T368. [Google Scholar] [CrossRef]
Virieux, J.; Etienne, V.; Cruz-Atienza, V.; Brossier, R.; Chaljub, E.; Coutant, O.; Garambois, S.; Mercerat, D.; Prieux, V.; Operto, S.; et al. Modelling seismic wave propagation for geophysical imaging. In Seismic Waves-Research and Analysis; Kanao, M., Ed.; IntechOpen: London, UK, 2012. [Google Scholar]

Figure 1. Physical knowledge representation extraction process. (a) Energy flux density vector with

e_{p z} > 0

. (b) Energy flux density vector with

e_{p z} < 0

. (c) Approximate solution,

X^{*}

, of the seismic wavefield. (d) Upgoing wave knowledge representation,

U_{r p}

. (e) Downgoing wave physical knowledge representation,

D_{r p}

.

Figure 1. Physical knowledge representation extraction process. (a) Energy flux density vector with

e_{p z} > 0

. (b) Energy flux density vector with

e_{p z} < 0

. (c) Approximate solution,

X^{*}

, of the seismic wavefield. (d) Upgoing wave knowledge representation,

U_{r p}

. (e) Downgoing wave physical knowledge representation,

D_{r p}

.

Figure 2. Near-offset VSP record of a simple layered model.

α_{1} a n d α_{2}

represent the axial dip angles between upgoing waves and the horizontal interface, whereas

β_{1} a n d β_{2}

represent the axial dip angles between downgoing waves and the horizontal interface.

Figure 2. Near-offset VSP record of a simple layered model.

α_{1} a n d α_{2}

represent the axial dip angles between upgoing waves and the horizontal interface, whereas

β_{1} a n d β_{2}

represent the axial dip angles between downgoing waves and the horizontal interface.

Figure 3. Basic Sobel operators and rotated Sobel operator. (a) Horizontal Sobel operator. (b) Vertical Sobel operator. (c) Sobel operator rotated by 45°.

Figure 4. Flowchart presenting morphological knowledge extraction. (a) Morphological knowledge representation of upgoing waves; (b) morphological knowledge representation of downgoing waves; (c) Sobel operator for upgoing waves; (d) Sobel operator for downgoing waves.

Figure 5. Diagram illustrating the wavefield directional differences and zero-mean characteristics. The mean value of the VSP wavefield is

9.199 \times 10^{- 8}

, where

α

represents the propagation difference between upgoing and downgoing waves, which is characterized by cosine similarity.

Figure 5. Diagram illustrating the wavefield directional differences and zero-mean characteristics. The mean value of the VSP wavefield is

9.199 \times 10^{- 8}

, where

α

represents the propagation difference between upgoing and downgoing waves, which is characterized by cosine similarity.

Figure 6. Dual Convolutional Autoencoder Architecture (KGCAE). The input is the original VSP data, X, with Net-U and Net-D predicting the upgoing and downgoing wavefields, respectively. The predicted wavefields, Outputs U and D, are regularized using representation loss, gradient loss, cosine loss, and reconstruction loss derived from knowledge representations.

Figure 7. Workflow of the proposed Knowledge-Guided Convolutional Autoencoder (KGCAE) for unsupervised VSP wavefield separation. The diagram illustrates the process where original VSP data are input into the KGCAE. The model’s dual autoencoder networks, Net-U and Net-D, then predict the upgoing and downgoing wavefields, respectively. This separation is guided by a composite loss function derived from physical, morphological, and directional difference knowledge representations, in addition to a reconstruction loss.

Figure 8. Parameters of homogeneous-layer model and its beam diagram. (a) Generated raw VSP records. (b) P-wave velocity, S-wave velocity, and density parameters of the velocity model. (c) Beam diagram of the raw VSP records.

Figure 9. Comparison of separation results for homogeneous-layer VSP records under three conditions. (a–d) show the knowledge representation of upgoing waves and the separated upgoing wavefields in three conditions. (e–h) show the knowledge representation of downgoing waves and the separated downgoing wavefields in three conditions.

Figure 10. Comparison of wavefield-separation results on homogeneous-layer VSP data using three methods. (a,d) Predicted upgoing and downgoing wavefields using the F-K filtering method. (b,e) Predicted upgoing and downgoing wavefields using the dualCAE method. (c,f) Predicted upgoing and downgoing wavefields using the KGCAE method. (g,j) Frequency–beam spectra of (a,d). (h,k) Frequency–beam spectra of (b,e). (i,l) Frequency–beam spectra of (c,f).

Figure 11. Reconstruction loss of dualCAE and KGCAE method on the homogeneous model.

Figure 12. Marmousi model and its forward modeling VSP data. (a) P-wave velocity; (b) S-wave velocity; (c) density; (d) VSP data obtained from forward modeling of the Marmousi geological model; and (e) frequency beam of (d). The spectral observation system has been marked in (a).

Figure 13. The physical knowledge representation of the Marmousi model. (a,b) Physical Knowledge representation of upgoing and downgoing wavefields; (c,d) frequency–beam spectra of (a,d).

Figure 14. Comparison of VSP wavefield separation results on the Marmousi model using three methods. (a,d) predictions of upgoing and downgoing wavefields using the F-K filtering method; (b,e) predictions of upgoing and downgoing wavefields using the dualCAE method; (c,f) predictions of upgoing and downgoing wavefields using the KGCAE method; (g,j) frequency–beam spectra of (a,d); (h,k) frequency–beam spectra of (b,e); and (i,l) frequency–beam spectra of (c,f).

Figure 15. Comparison of reconstruction loss convergence for the proposed KGCAE method (red curve) and the dualCAE method (blue curve) during training on the Marmousi model. The x-axis represents training iterations, and the y-axis displays the reconstruction loss value. The KGCAE method demonstrates notably faster convergence and greater stability throughout the training process.

Table 1. Structure and Parameter Configuration of Each Layer in the Net-U (net1) Convolutional Autoencoder.

Layer			Output Channels	Kernel Size	Stride	Activation	Output Shape
Input			1	-	-	-	1600 × 100 × 1
Encoder Layer	Conv2D		32	3 × 3	1 × 1	tanh	1600 × 100 × 32
	MaxPool		-	2 × 2	2 × 2	-	800 × 50 × 32
	Conv2D		64	3 × 3	1 × 1	tanh	800 × 50 × 64
	Conv2D		32	3 × 3	1 × 1	tanh	800 × 50 × 32
	MaxPool		-	2 × 2	2 × 2	-	400 × 25 × 32
Decoder Layer		Conv2D	32	3 × 3	1 × 1	tanh	400 × 25 × 32
		UnSampling2D	-	2 × 2	-	-	800 × 50 × 32
		Conv2D	64	3 × 3	1 × 1	tanh	800 × 50 × 64
		Conv2D	32	3 × 3	1 × 1	tanh	800 × 50 × 32
		UnSampling2D	-	2 × 2	-	-	1600 × 100 × 32
		Conv2D	1	3 × 3	1 × 1	tanh	1600 × 100 × 1

Table 2. Homogeneous model forward parameters.

Parameters	Forward Modeling
Central Frequency of Source	50 Hz
Sampling Time	4 ms
Location of launch point	50 m below the ground
Offset	12 m
Distance between adjacent receivers	10 m
Receivers’ distribution range	Range: from 250 to 4250 m underground
Depth of model	4500 m
Width of model	50 m

Table 3. Loss function parameter settings for three cases.

Parameters	Case 1	Case 2	Case 3
$λ_{1}$	$0.2$	$0.2$	$0.2$
$λ_{2}$	$0$	$1 \times 10^{- 4}$	$1 \times 10^{- 4}$
$λ_{3}$	$0$	$0$	$1 \times 10^{- 3}$
$α$	$0$	$(129 °, 139 °)$	$(129 °, 139 °)$
$β$	$0$	$(41 °, 52 °)$	$(41 °, 52 °)$
Epoch	10,000	$10,000$	$10,000$

Table 4. Simulation parameters of KGCAE and dualCAE.

Parameters	KGCAE	dualCAE
$λ_{1}$	$0.2$	$0$
$λ_{2}$	$1 \times 10^{- 3}$	$1 \times 10^{- 3}$
$λ_{3}$	$1 \times 10^{- 4}$	$1 \times 10^{- 4}$
$α$	$(129 °, 139 °)$	$(129 °, 139 °)$
$β$	$(41 °, 52 °)$	$(41 °, 52 °)$
$E p o c h$	$10,000$	$10,000$

Table 5. Marmousi forward parameters.

Parameters	Forward Modeling
Central Frequency of Source	$30 H z$
Sampling Time	$5 m s$
Offset	$40 m$
Distance between adjacent receivers	$5 m$
Receivers’ distribution range	0 to 500 m underground
Depth of model	$500 m$
Width of model	$500 m$

Table 6. Simulation parameters of KGCAE and dualCAE.

Parameters	KGCAE	dualCAE
$λ_{1}$	$0.2$	$0$
$λ_{2}$	$1 \times 10^{- 3}$	$1 \times 10^{- 3}$
$λ_{3}$	$1 \times 10^{- 4}$	$1 \times 10^{- 4}$
$α$	$(96 °, 102 °)$	$(96 °, 102 °)$
$β$	$(78 °, 81 °)$	$(78 °, 81 °)$
$E p o c h$	$10,000$	$10,000$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lu, C.; Qu, L.; Liu, J.; Gao, J. Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation. Appl. Sci. 2025, 15, 6360. https://doi.org/10.3390/app15116360

AMA Style

Lu C, Qu L, Liu J, Gao J. Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation. Applied Sciences. 2025; 15(11):6360. https://doi.org/10.3390/app15116360

Chicago/Turabian Style

Lu, Cai, Liyuan Qu, Jijun Liu, and Jianbo Gao. 2025. "Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation" Applied Sciences 15, no. 11: 6360. https://doi.org/10.3390/app15116360

APA Style

Lu, C., Qu, L., Liu, J., & Gao, J. (2025). Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation. Applied Sciences, 15(11), 6360. https://doi.org/10.3390/app15116360

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Upgoing and Downgoing Wavefield Separation in Vertical Seismic Profiling Guided by Signal Knowledge Representation

Abstract

1. Introduction

2. Knowledge Representation for VSP Wavefield Separation

2.1. Physical Knowledge Representation Based on Energy Flow

2.2. Morphological Knowledge Representation

2.3. Directional Difference Knowledge Representation

3. Methodology

3.1. Modeling of Upgoing and Downgoing Wave Separation Problem

3.2. Network Structure

3.3. Loss Function

3.3.1. Reconstruction Loss

3.3.2. Morphological Knowledge Representation Loss

3.3.3. Physical Knowledge Representation Loss

3.3.4. Directional Difference Knowledge Representation Loss

4. Numerical Examples

4.1. Validation on Homogeneous Layered Velocity Model

4.1.1. Ablation Experiments on the Homogeneous Layered Model

4.1.2. Comparative Experiments on the Homogeneous Layered Model

4.2. Validation on Marmousi Model

Comparative Experiments on the Marmousi Model

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI