To validate the effectiveness and robustness of the proposed first-arrival picking method based on OVMD–WTD and fractal box-counting dimension analysis, a simplified homogeneous and isotropic medium model was constructed. A constant background wave velocity was assumed, and complex wave propagation factors such as dispersion, multiples, and dipping interfaces were not included. This idealized and controlled environment allows for a focused evaluation of the algorithm’s performance under low signal-to-noise ratio conditions. Synthetic microseismic source signals were generated, and six geophones were placed at various positions and orientations around the source to simulate the wave propagation process. To approximate real observation conditions, random noise was superimposed onto the recorded signals to create noisy data. The proposed OVMD–WTD method was then applied for denoising, followed by first-arrival picking using the fractal box-counting dimension method. The extracted first-arrival times were used as input parameters for the PSO algorithm to perform source localization. This framework enables a systematic assessment of the proposed method’s capability for signal processing and localization accuracy in noisy environments.
In the signal spectrum prior to noise contamination, the signal energy is concentrated in the low-frequency band and exhibits a pronounced low-frequency peak. The spectrum decays smoothly, indicating that the signal is dominated by low-frequency components and is essentially noise-free. This well-defined spectrum indicates that the signal is pure and that its frequency-domain characteristics are relatively simple. In the spectrum after noise is introduced, the low-frequency peak decreases markedly, with noticeable broadening—particularly in the high-frequency range—where pronounced fluctuations and irregularities appear. High-frequency noise renders the signal spectrum unsmooth, increasing the complexity of its frequency-domain characteristics; numerous small-amplitude fluctuations emerge, indicating that noise now exerts a significant influence on the signal. Therefore, mitigating noise in the signal is imperative.
3.2.1. Signal Decomposition and Reconstruction
Prior to noise reduction, the signal is initially decomposed by OVMD to determine its optimal modal number, K. In variational mode decomposition (VMD), the modal number K is critical because an appropriate value balances decomposition accuracy and the faithful extraction of the underlying frequency components. If K is set too low, the signal’s high-frequency components may not be adequately decomposed, causing information loss; if K is too high, over-decomposition can occur, producing redundant modes or noise and degrading overall performance. The procedure begins with one mode (K = 1) and sequentially increases K while recording each mode’s center frequency until K = 11 is reached. Frequency analysis reveals that, for 1 ≤ K ≤ 9, the center frequencies of the IMFs are uniformly distributed and well separated, which results in a clearer allocation of frequency content. Therefore, mode numbers within this range efficiently capture the signal’s distinct spectral components. At K = 11, although the overall frequency distribution remains valid, changes in the center frequencies of higher-order modes are negligible, indicating over-decomposition that may introduce high-frequency noise or redundant information. Specifically, when K = 9, the center frequency of the corresponding mode stabilizes near 4710 Hz, and the frequency components are clearly separated and accurately captured, confirming K = 9 as the optimal modal number for this signal. The detailed center frequency values are shown in
Table 1.
Table 1 lists the central frequencies of each intrinsic mode function (IMF1 to IMF11) obtained during the OVMD process for noise-contaminated synthetic microseismic signals, under varying numbers of decomposition modes. K denotes the selected number of modes for decomposition. Each column labeled IMF corresponds to one mode, with the value representing its central frequency in the frequency domain. These central frequencies are derived from the spectral characteristics calculated through the variational optimization process, reflecting the dominant frequency components contained within each mode. All data were generated based on a synthetic microseismic event scenario.
After determining the optimal number of modes (K = 9), the signal is decomposed using VMD. The detailed decomposition process and the corresponding spectra of each mode are presented in
Figure 7.
Figure 7 presents the nine intrinsic mode functions (IMF1–IMF9) obtained by applying OVMD to the noise-contaminated signal from Geophone 1.
Figure 7a presents the nine intrinsic mode functions (IMFs) obtained by applying OVMD to the noise-contaminated signal from Geophone 1. Each curve in the figure represents the variation in an individual mode, illustrating the time-varying characteristics of different frequency components throughout the signal. This demonstrates the capability of OVMD to effectively decompose non-stationary signals.
Figure 7b displays the spectral information of each mode presented in
Figure 7a. The spectra were obtained by applying the FFT to each IMF component, revealing the energy distribution of each mode in the frequency domain. It can be observed that the IMFs exhibit clear band separation in the frequency domain, indicating that OVMD effectively extracts the multi-scale features of the signal.
IMF1–IMF3 primarily capture the signal’s low- and mid-frequency components. In microseismic signals, low-frequency content typically reflects steady seismic activity or sustained vibrational features, whereas high-frequency components often correspond to noise or rapid environmental disturbances. Thus, IMF1 and IMF2 embody the principal microseismic vibrational modes, making these bands the primary focus in microseismic signal processing. IMF4 and IMF5 mark the transition into the mid-frequency range; their spectra exhibit multiple peaks, suggesting they capture higher-frequency variations within the signal. IMF6–IMF11 progressively cover the high-frequency region; in particular, IMF9–IMF11 concentrate energy above 3.5 kHz. Such high-frequency spectra typically reflect rapid signal fluctuations, including noise or high-frequency oscillations, indicating that these modes chiefly capture the signal’s high-frequency content and noise.
The frequency components transition progressively from low to high from IMF1 to IMF11. IMF1–IMF3 mainly capture the signal’s low-frequency components; IMF4 and IMF5 represent the mid-frequency portion; and IMF6–IMF11 capture high-frequency components, which may include noise or high-frequency oscillations. This decomposition enables the separation of distinct frequency components in the original signal, allowing for the clearer analysis of signal variations and the impact of noise and other disturbances.
After completing OVMD, the signal’s distinct frequency components become available for further processing and analysis. To obtain microseismic signals with minimal interference, we remove the noisy portions of each mode using wavelet threshold denoising. First, each mode is subject to wavelet decomposition, which separates the signal into multiple scales. Each wavelet scale represents a distinct frequency band, enabling the precise localization of frequency information within the hierarchical decomposition. In this study, the sym4 wavelet basis, known for its good symmetry and compact support, was selected to enhance the preservation of microseismic signal transients while reducing the risk of phase distortion. It is particularly suitable for suppressing high-frequency noise and extracting signal details. Based on the spectral characteristics of microseismic signals and the requirements of multi-scale analysis, the number of wavelet decomposition levels was set to four to enable efficient frequency band separation and avoid excessive information decomposition. For thresholding, a soft thresholding method was applied to the wavelet coefficients. This approach avoids introducing excessive artificial oscillations during signal processing and provides smoother denoising performance, especially under strong high-frequency noise conditions. After thresholding, we reconstruct the denoised signal via inverse wavelet transform, using the low-frequency coefficients at level N and the quantized high-frequency coefficients from levels 1 to N. The denoised signal obtained using wavelet thresholding, along with its spectrum, is compared with the results from other denoising methods.
Figure 8 and
Figure 9 present the time-domain waveforms and the corresponding frequency spectra of signals processed by four different denoising methods, respectively: (a) the OVMD–WTD joint denoising method, (b) Butterworth filtering, (c) elliptic filtering, and (d) the EMD–wavelet joint denoising method. By comparing the performance of each method in both time and frequency domains, the overall effectiveness in terms of signal fidelity and noise suppression can be intuitively evaluated.
According to the filtering results, the OVMD–wavelet joint denoising method demonstrates superior performance the other methods in microseismic signal denoising. It effectively suppresses high-frequency noise while preserving the integrity of low-frequency components. Its steeper transition band enables the efficient separation of noise from useful signals, minimizing residual noise. In contrast, the other three methods—Butterworth low-pass filtering, elliptic filtering, and EMD–wavelet joint denoising—also reduce high-frequency noise but are limited by wider transition bands and slower attenuation rates. As a result, residual high-frequency noise remains, degrading signal purity and quality. Among them, the EMD–wavelet method loses fine signal details, particularly in rapidly changing segments, leading to significant differences from the original signal. Overall, the OVMD–wavelet joint denoising method offers clear advantages in both denoising and signal reconstruction, particularly for microseismic signal processing and accurate event detection. It retains fine signal details while effectively suppressing noise interference.
3.2.2. Fractal Box Dimension and Initial Time Detection
First-arrival picking is critical in microseismic monitoring. It directly influences source localization accuracy and aids in identifying the event type, analyzing subsurface properties, and estimating event magnitude. Accurate first-arrival picking enables more precise source localization and event classification. After signal reconstruction, the first-arrival time can be extracted from the denoised signal.
The fractal dimension is a metric used to characterize signal complexity and self-similarity. In microseismic signals, the occurrence of a first-arrival event often leads to a sharp change in signal complexity, typically reflected as a sudden fluctuation in the fractal dimension. By quantifying local complexity, this method effectively captures the abrupt point corresponding to the first-arrival wave. Therefore, analyzing the signal’s fractal box dimension facilitates the accurate identification of the first-arrival time. Compared with traditional methods, the fractal box dimension better accommodates the nonlinear and nonsmooth nature of microseismic signals, offering stronger noise robustness and higher computational efficiency. It can accurately locate the first-arrival time even under low signal-to-noise ratio conditions. An adaptive algorithm is employed for window selection, allowing the automatic determination of optimal window size based on signal characteristics. This improves the accuracy of first-arrival picking.
The first-arrival picking process consists of the following steps:
- (1)
Adaptive window size selection and signal segmentation
During signal preprocessing, an adaptive windowing mechanism is introduced. Based on signal length and dynamic characteristics, the method adjusts segmentation flexibly by setting minimum and maximum window size limits. Compared to traditional fixed-window methods, the adaptive window dynamically adjusts the segmentation scale according to local signal features, effectively avoiding information loss or redundancy and improving the accuracy of first-arrival picking.
- (2)
Fractal dimension calculation
For each windowed signal segment, the fractal dimension is estimated using the box-counting method to quantify local signal complexity. By scanning across multiple box scales, the method performs multi-scale analysis by counting the number of boxes covering the signal at each scale, and the fractal dimension is then obtained via curve fitting. This method leverages the signal’s nonsmooth characteristics across scales and provides a quantitative indicator for detecting significant change points.
- (3)
Significant change point detection based on fractal dimension difference
By computing the difference in fractal dimension between adjacent windows, points of significant signal variation are identified. When the difference exceeds a predefined threshold, the corresponding point is marked as a significant change point. Since the first-arrival time typically corresponds to a transition from a stable to a rapidly changing state, this approach effectively detects the key features associated with the first-arrival time.
- (4)
First-arrival picking and verification
The first-arrival time is located by identifying the earliest point with a significant fractal dimension change. Combined with the adaptive windowing strategy, the method adjusts segmentation at different stages of the signal to ensure precise first-arrival time extraction. The accuracy and robustness of the method are validated by comparing the extracted first-arrival times with calibrated references in wavefield snapshots.
The fractal dimension is highly sensitive to local structural variations in signals, and the first-arrival time of seismic waves typically corresponds to a significant jump in its distribution. In this study, a sliding window approach is employed to dynamically calculate the local fractal dimension, enabling the identification of abrupt changes and the determination of first-arrival times. These arrival times are subsequently used for source localization inversion, thereby improving the spatiotemporal accuracy of microseismic event localization.
In the calculation of fractal dimension, the selection of box size
is a key factor affecting both the accuracy and stability of the results. The choice of
must strike a balance among several criteria: (1) sufficient scale resolution to ensure sensitivity to abrupt features in the microseismic signal; (2) avoidance of overly large values that may obscure features or cause misinterpretation across scales; and (3) prevention of excessively small scales that may lead to numerical oscillation or insufficient statistical sampling. To address these requirements, a multi-scale box-counting strategy is adopted, selecting a representative set of
values that span the dominant frequency (or period) range of the signal. This range is designed to capture the primary temporal features of the signal while avoiding the loss of short-duration information or the introduction of cross-period confusion. The selected ε values are used to compute a linear relationship between
and
, from which the best-fitting linear segment is identified via least-squares regression. The slope of this segment is then taken as the local fractal dimension, enabling the multi-scale quantitative characterization of signal complexity.
To comprehensively illustrate the temporal evolution of seismic wave propagation following source excitation in the numerical simulation, multiple key time points were selected to generate wavefield snapshots. In the figure, the color scale represents the spatial distribution of wave amplitude, allowing for a clear visualization of wave propagation paths and the arrival times of the wavefronts at each geophone. This figure not only provides the physical basis for applying the fractal box-counting method to first-arrival picking but also serves as a visual reference for validating the accuracy of the subsequent event localization, further supporting the reliability and effectiveness of the proposed approach. The wavefield snapshot from the numerical simulation is shown in
Figure 10.
To accurately extract the first-arrival time from the signals received by each geophone, this study employs the fractal box-counting dimension method to quantify local variations in signal complexity. Several key parameters are configured to enhance the accuracy and adaptability of the algorithm: the box sizes are set to [2, 4, 6, 8, 10, 12, 14, 15] to cover the signal plot and count the number of boxes containing waveform structures, thereby characterizing complexity variations across multiple scales. An adaptive sliding window mechanism is introduced for segmented analysis, with a window length range of 1 to 15 and a step size of 1, allowing the analysis scale to dynamically adjust based on local signal features. The computation length for each segment is set to 15 to balance stability and resolution. During the sliding window traversal, abrupt points are identified by calculating the difference in fractal dimension between adjacent windows. To ensure both accuracy and computational efficiency, the threshold for fractal dimension variation is set to 0.1; when the difference between two consecutive windows exceeds this threshold, the corresponding position is regarded as a significant change in signal complexity. To ensure physical plausibility, only the first point that meets the criterion—located in the early part of the signal—is extracted and designated as the effective first-arrival time.
This parameter configuration enhances the method’s stability and generalization across different signal structures, while maintaining a balance between computational efficiency and detection precision. The effectiveness of the proposed method is illustrated in
Figure 11, where
Figure 11a shows the denoised synthetic signal, and
Figure 11b displays the first-arrival picking process. The red marker in
Figure 11b indicates the first-arrival point identified by the adaptive fractal box-counting method.
It can be observed from the table that the first-arrival times obtained using the fractal box-counting dimension closely match those indicated by the wavefield snapshots at the corresponding detector locations, with a maximum deviation of only 0.005 s.
In addition, to more comprehensively evaluate the accuracy of the proposed method in first-arrival picking, we calculated the RMSE between the picked arrival times and the theoretical values at each detection point. This metric quantitatively reflects the overall average picking error and provides a more convincing basis for the subsequent analysis of localization accuracy. The error is calculated using the RMSE formula as follows:
where
denotes the number of seismic sensors (geophones),
is the observed first-arrival time extracted from the
-th geophone, and
represents the corresponding theoretical first-arrival time for the
-th geophone.
Based on the theoretical arrival times and the results obtained from the fractal box-counting dimension analysis presented in the table, the calculated RMSE of the first-arrival times is approximately 0.00430 s. This indicates that the proposed method demonstrates high picking accuracy in the time domain, providing a solid foundation for improved localization precision in subsequent analysis.