1. Introduction
In seismic surveys, noise in the subsurface tends to interfere with effective signals. Its signal is weak but widely distributed and haphazard, and we often resort to various techniques to eliminate it. However, waves generated by vibrations from underground sources will carry information about underground tectonics as they propagate. If noise can be used, it can act as a substitute for an active source. In some cases where active source excitation is not possible, such as near cities, it is sufficient to set up geophones to receive passive sources of signals. At the same time, passive acquisition is cost-effective because there is no need to excite seismic sources. Furthermore, due to the rich frequency range of subsurface noise sources, there are a large number of low-frequency signals [
1]. Therefore, passive sources are more advantageous than active sources in large-scale, deep seismic surveys.
After acquiring the passive source seismic record, we can use seismic interferometry [
2,
3] to reconstruct the passive source seismic record into a virtual shot record similar to the active source seismic shot record. However, using seismic interferometry to reconstruct passive seismic data inevitably results in coherent noise and artifacts. The noise has a greater impact on deteriorating the reconstruction of seismic data when the attenuation of the medium is high [
4]. Removing the effects of this coherent noise and artifact disturbances is an ongoing effort by geophysicists. The F-K [
5] filtering-based processing can suppress the coherent noise to some extent, but it may cause damage to the effective wave. Suppressing the coherent noise in the Radon domain has also been tried several times [
6,
7,
8], but there is still a problem of low resolution. Rabiner et al. [
9] created a median filtering denoising method and later produced many derivative algorithms [
10,
11]. They can operate on pixels on the image, but their small windows have no effect on coherent noise and artifacts, and large windows will blur effective signals.
The above methods can remove coherent noise and artifacts to some extent. However, they can harm the effective signal or suppress it poorly. The selection of parameters will have a large impact on the processing effect, which is somewhat subjective. In addition, this method cannot effectively recover the effective waveform.
Deep learning learns deep features of images and other information by training neural networks. LeCun [
12] and others first invented a convolutional neural network and achieved good results in handwritten digit recognition. The AlexNet, invented by Hinton et al. [
13], has achieved good results in image recognition and won the championship in the image recognition contest, setting off an upsurge of in-depth learning. Since then, various deep learning networks have sprung up in the public’s view, such as VggNet [
14], ResNet [
15], FCN [
16], DnCNN [
17], and UNet [
18]. These networks have achieved good results in classification and image segmentation. Based on ensuring the accuracy of manual recognition and conventional methods, they have higher processing efficiency, so they have been widely developed in various fields.
In recent years, thanks to the upgrading of computer hardware resources, deep learning algorithms have been widely used in the geophysical field. Gu et al. [
19] realized low-frequency reconstruction in full-waveform inversion based on deep learning. Parasyris et al. [
20] synthetic data generation for deep learning-based inversion for velocity model building. Zhang et al. [
21] used interactive salt segmentation from 3D seismic images using saltisnet3d. Tao et al. [
22] acoustic impedance inversion of seismic imaging profiles using self-attentive U-Net. Xiong et al. [
23] use SafeNet to identify seismic disturbances. Sun et al. [
24] accomplished low-frequency extrapolation of multicomponent data in Elastic FWI using deep learning. Wang et al. [
25] used the MCMC inverse problem method of neural networks to perform numerical simulations in GPR cross-hole full waveform inversion. Liu et al. [
26] used fine-tuned FPN to achieve microseismic first-arrival pickup. Lou et al. [
27] proposed MCDL to achieve seismic volume dip estimation. Dou et al. [
28] used the “MDA GAN“ of the adversarial network to realize 3D seismic data interpolation and reconstruction.
In this research, a convolutional neural network is used to identify and suppress the coherent noise and artifacts of the virtual shot record. We obtain passive seismic records with a small number of sources and passive seismic records with a large number of sources through forward simulation. Using seismic interferometry, they are reconstructed as virtual shot records, respectively. The reconstructed records of passive seismic records with fewer sources are used as training data, and the reconstructed records of passive seismic records with more sources are used as training labels. The network is utilized to mine the features of passive data with better passive seismic source effects. Using neural networks to suppress coherent noise and artifacts and restore waveforms in parts where waveforms are not continuous enough. For the virtual shot records with an uneven source distribution, we use the virtual shot records with a wide source distribution as labels. At this time, the task of the network is not only to suppress coherent noise and artifacts and restore the ductility and continuity of the waveform, but also to restore the event of linear intersection to the event of double curvature. Although they are two tasks, we use the same network model to implement them and take a large number of evenly distributed virtual shot records as the training labels for the two tasks. Therefore, if the above two conditions are not ideal when we are collecting passive sources, we can reduce their impact on our seismic records and finally obtain better results, which improves the applicability of passive seismic exploration.
  3. Numerical Example
  3.1. Enhanced Reconstruction Results for a Small Number of Seismic Sources
In this paper, virtual shot records are obtained from twenty velocity models. The velocity model as shown in 
Figure 2 will be used to obtain passive seismic records, which will be reconstructed into virtual shot records to be used as test data. These velocity models used to generate the training and test sets are randomly generated. To demonstrate the generalizability of the method, the test set is not included in the training set. Here the grid size of the velocity model is 2 m, the geophone spacing is 1 grid spacing, and the sampling interval is set to 1 ms to receive a total of 200 passive seismic records. Random noise sequences were used for the seismic sources. Each velocity model contains 128 seismic traces, and since the intercorrelation method allows the construction of shot points at any geophone point, 128 virtual shot point seismic records can be generated. To address the poor results of reconstructed records due to the small number of sources, we used them as training data and used reconstructed records with a large number of evenly distributed sources as labels. To compress the training cost and reduce unnecessary training, we only show the virtual shot records of the first 800 sampling points after reconstruction. We show the two source distributions as training sets and training labels under the velocity model (see 
Figure 3). The virtual shot records used as training data are obtained by reconstructing the passive source seismic records under the source distribution shown in 
Figure 3a. Under the distribution of seismic sources shown in 
Figure 3b, we reconstructed the virtual shot records as training labels. By inputting training data and training labels into the network, we can obtain a trained network. This network can process the waveforms of the virtual shot records.
Our computing device was a Nvidia Quadro RTX 4000, and our neural network framework was TensorFlow-GPU 2.6.0 with 8 GB of video memory. To improve the generalization of the network, we set the batch size to 4, the learning rate to 0.00001, and gave a large dropout value of 0.5. We selected 2560 seismic records as training data and 256 seismic records as test data. After 1000 iterations, the loss of the network converges, and training is complete. The loss and decline of the network are shown in 
Figure 4. We fed the test data into the network, and the results are shown in 
Figure 5 and 
Figure 6.
To test whether the performance of our designed network is enhanced compared to the original network, we set the UNet neural network with attention mechanisms as the control group to test the improvement effect. By comparison, we can see that the coherent noise is effectively suppressed by the processing of the neural network. To solve the above problem that passive virtual shot records as labels may still be noisy, we found that both the training set and its labels contain a certain amount of coherent noise during the training process. However, due to the differences in objective factors such as their source locations and numbers, the training data and labels corresponding to the same virtual shot recordings under the same velocity model are similar in terms of effective waveforms when reconstructed, whereas the coherent noise in the virtual shot recordings is unmatchable. This also leads to the fact that our neural network does not learn the invalid features of the relevant noise during training. Although the labels also inevitably contain some coherent noise, which is an interesting finding in our suppression of coherent noise. Similarly, it is difficult for the neural network to learn the features of the artifacts because the features recorded by the virtual shot vary in each training data, and the corresponding virtual shot labels contain few or no artifacts in the same position. As a result, the artifacts are eliminated. In contrast, since the effective signals on the training data and corresponding labels are similar, our neural network can learn their features. Therefore, the weaker effective waveforms in the virtual shot records of a small number of sources are better protected and enhanced. In addition, where waveform discontinuities are present, our network complements them, restoring waveforms with good continuity.
Although these networks can suppress coherent noise and artifacts effectively, they also extract effective signals. However, it can be clearly seen that the improved network can suppress noise more thoroughly. The continuity of the same waveform axis is better and clearer, which also proves that our improvement of the network is effective.
In this section, we experiment with neural networks to reconstruct records of unevenly distributed seismic sources. Using the velocity model from the previous section, we can concentrate the subsurface seismic sources in a small region for numerical simulation. The resulting passive seismic source records are reconstructed into unevenly distributed virtual shot records. The distribution of seismic sources is shown in 
Figure 7.
After 400 iterations, our network was trained, and we fed the test data into the network, which was processed as shown in the figure. To test the generalizability of our network under different models and the processing performance for virtual shots located at different positions, we selected records of virtual shots located in multiple locations for display (see 
Figure 8 and 
Figure 9).
Similar to the previous section, we can still use both of our neural networks to suppress coherent noise and artifacts. We can revert to a reasonable hyperbolic homogeneous axis in the virtual shot records on the side without the source distribution; we suppress the spurious homogeneous axis on the side with the source distribution. At the same time, the problem of insufficient continuation of the waveform is well resolved. We analyze that since there are many linear intersecting homography features in the training set corresponding to the hyperbolic homography axes in the training labels, it is possible to learn the corresponding features. The reason that coherent noise with artifacts can be suppressed is also similar to the previous section. In the tracts with the right side of the seismic distribution, our neural network recovers its waveform even though the training data are broken waveforms.
In the testing of uneven source distribution, we can see that both networks can effectively suppress coherent noise and artifacts. However, when carefully comparing the processed waveforms, we found that the virtual records processed by the improved network showed clearer and more continuous waveforms.
  3.2. Virtual Shot Record Processing under a Complex Model
In the experiment with the simple layered model, we have achieved good processing results for simple seismic records. When the velocity model becomes complex, the seismic record will inevitably become complex. In order to test the effect of our method on processing complex seismic records, we used the virtual shot record generated by the complex velocity models to train the neural network. The complex velocity models are shown in 
Figure 10, and their processing results are shown in 
Figure 11 and 
Figure 12.
Through the above comparison before and after processing, we can find that in the face of complex models, even if the number of sources is sufficient and the distribution is uniform, the virtual shot record is not perfect. In the face of imperfect processing tasks, the network processing effect is still good.
In the case of a small number of sources, our network can still remove a large number of virtual events and coherent noise from the virtual shot record and effectively restore the discontinuous events to continuous events.
In the case of dealing with unevenly distributed sources, the network also removes a large number of spurious events and coherent noise from the virtual shot recordings. It will also effectively reduce linear intersection events to hyperbolic events that conform to physical laws.
It can be seen that our method has achieved good results not only in the simple layered model but also in the complex model. This result also proves that this method has wide applicability.
We can see that the waveforms in the wavefield become more complex in spite of the fact that the velocity model becomes complex, resulting in the waveforms in the wavefield becoming more complex. Our network still suppresses the coherent signal at a low level. For the virtual shot records of a small number of sources, the waveforms are clearer with our network processing. For the virtual shot records of sources distributed only on one side, the waveform anomalies are restored to a reasonable shape by our network processing.
After achieving good results in the processing of virtual shot records for conventionally small numbers of sources and for conventionally unevenly distributed sources, we tried to process the virtual shot records for extremely distributed sources. The two extremely distributed sources are shown in 
Figure 13. They are the sources located in the shallow space and the sources located in the narrow range, respectively.
After processing using neural networks, the results are shown in 
Figure 14 and 
Figure 15. It can be seen from the comparison figure that the virtual shot records corresponding to these two extreme signal source distributions contain a large number of spurious events and coherent noise. Moreover, the false reflections are stronger in the reconstructed records, and the ductility of the effective signal reflection waveform is not enough.
After processing with the neural network, we successfully restored the waveform of the effective signal to a reasonable level. We can see from 
Figure 13 and 
Figure 14 that the effective signal in the processed wavefield is stronger. It can be seen that our network can still recover effective signals from the artifacts and coherent noise.
  4. Discussion
In passive-source seismic exploration, the number and distribution of subsurface sources are very random. The traditional seismic interferometry needs to satisfy the uniform distribution of the sources and be at a certain depth to reconstruct a high-quality virtual shot record. And the number and distribution location of undesirable seismic sources have a great impact on the virtual shot record.
Some conventional classical interferometric methods, such as inverse convolutional seismic interferometry [
30], and algorithms developed on this basis, such as multidimensional deconvolutional seismic interferometry [
35], can mitigate the effect of the inhomogeneous distribution of sources on the virtual seismic recordings to some extent. However, all of these methods require strict constraints on the uniform distribution of the geophones and the excision of the direct waves (which is very difficult to achieve for the noise sources). On this basis, the inverse is applied to the records received by all geophones. This operation is not only computationally huge, but the process is extremely unstable. These methods also do not give good results for specific source distributions (such as sources distributed in a narrow range on the survey line). In the method using deep learning, Sun et al. [
36] used neural networks to make the virtual seismic record under uneven source distribution and learn the signal of the active source. Not only were a lot of artifacts eliminated, but the waveforms were also clearer. However, there is no discussion of the extreme case of distributed sources or the case of a small number of sources. And the use of active source records for labeling may cause some loss of low-frequency signals.
Compared with the improved seismic interferometry of the inverse convolution class, the cross-correlation-based seismic interferometry has fewer limitations, is computationally stable, and does not require strict uniform arrangement of geophones, cumbersome resection of direct waves, or computationally unstable inverse operations. And using the virtual seismic records of passive sources as labels for the training of the network can not only retain the low-frequency signals better but also train better due to the higher degree of similarity. Therefore, this method can improve the application effect of passive source seismic exploration to a certain extent on the basis of no additional requirements on the observation system and no loss of low-frequency signals. In this paper, we only show four of these cases of seismic source inactivity. After processing according to our network, the influence of the number and distribution of sources on the virtual seismic record can be reduced to some extent.
Only four of these inactive cases are shown in this paper. After the processing based on our network, the influence of the distribution of the seismic sources on the virtual shot records can be reduced to a certain extent. Therefore, this method can improve the application of passive source seismic exploration to some extent.
In the future development trend, with the future development of computing equipment and deep learning technology, there will be neural networks with better processing effects, which can solve the influence of the distribution of more complex seismic sources on the virtual shot record.
  5. Conclusions
In this paper, a wavefield enhancement technique based on a convolutional neural network for passive seismic reconstruction records is proposed. The reconstructed virtual source records with a small number of sources and uneven source distribution are fed into the trained convolutional neural network, and the virtual source records with suppressed artifacts and coherent noise are obtained. The unphysical axes are restored to hyperbolic axes, and the low-frequency advantage of passive seismic surveys is retained.
Compared with reconstructed virtual shot recordings with a better number and distribution of sources, we retain accurate waveform information and low-frequency features with a lower proportion of noise. The network also has good generalization ability.
For passive sources that are not active in the subsurface, including the case where the number of sources is small and unevenly distributed, we provide a technical approach that ensures the effectiveness of passive seismic survey methods. It ensures high quality passive seismic reconstruction records when the sources are not widely distributed. The actual processing efficiency is high, and real-time monitoring can be realized.