1. Introduction
In seismic exploration, passive seismic signals often present a haphazard distribution characteristic and are often removed as noise, resulting in the loss of effective information. In recent years, with the in-depth research of geophysicists, it is found that even though the passive seismic signal is weak and unstable in energy, it carries rich subsurface information due to its wide distribution and low frequency, which follows the same propagation law as the active-source seismic waves, and if it can be utilized, it will make up for many shortcomings of active-source seismic exploration, such as higher cost, limited exploration depth, and concentrated frequency distribution. In some areas, such as cities, nature reserves, farmland, etc., active-source seismic exploration cannot be used, but passive-source seismic exploration can still be applied.
For the utilization of conventional passive seismic data, we generally use seismic interferometry for seismic record reconstruction [
1]. Seismic data channel set reconstruction using seismic interferometry not only can obtain regular seismic records similar to those of active sources but also the application is very flexible. We can reconstruct virtual shot records with arbitrary geophone locations according to the requirements. In recent years, with the increasingly widespread application of passive seismic surveys, many seismic interferometry reconstructions based on different theories have been discovered, such as the multidimensional inverse folded product method [
2], the sparse inversion method [
3], receiver-pair seismic interferometry [
4], and so on. Among them, the cross-correlation method reconstruction [
5], proposed by Wapenaar through cross-correlation, has been widely used in the industry due to its stable effect and simple operation.
However, in passive seismic exploration, due to the uncertainty of subsurface passive sources, there is no guarantee whether they vibrate and for how long when the sources are not abundant, or for some reasons the passive seismic records are mixed with other noises that interfere with our reconstruction, which leads to the time period when the sources are abundant and widely distributed potentially being only a very short period of time. So, it is often necessary to receive longer passive seismic records, even weeks of continuous recording are selected for reconstruction by seismic interferometry and superimposed to improve the signal-to-noise ratio in order to increase the intensity of the effective waves and reduce the impact of coherent signals on imaging or inversion. Not only does it take a long time to wait for reception but it also requires labor to screen the suitable recording time periods, and in addition, the reconstruction of passive seismic records with long periods has certain requirements for computing equipment.
In recent years, with the rapid development of computer hardware and software, the use of deep learning techniques for data processing has gradually come into people’ s view, and data processing has gradually become automated. Convolutional neural networks [
6] were first introduced to solve the problems of image recognition and have been tried for use with autoencoders [
7] and DnCNN [
8] for noise removal and signal recognition, which can achieve comparable or even better results than traditional noise removal methods. Convolutional neural networks emerged mainly to solve problems that consume a lot of time and effort of processors or problems where processing means temporarily fail to achieve better results. The input dataset and labels are processed by means of a designed deep convolutional neural network, which enables the network to go through a three-way cyclic process of forward propagation, backward propagation, and calculation of errors, and finally obtain one or more reasonable functions that constitute a mapping from the dataset to the labels. Using this method, processors can achieve the processing results we need without obsessing about the complex geological conditions and the applicability of various empirical equations.
Deep learning has also shown powerful data processing capabilities in seismic exploration. Zhu et al. used a deep neural network called Deep Denoiser for denoising and decomposing seismic signals, which still gave good results even in a high-noise background [
9]. Ross et al. used deep learning to accomplish automatic identification of P-wave sources and picking of arrival time [
10]. Sun Jing et al. successfully used deep learning theory for coherent noise suppression of marine seismic data [
11]. Valentin et al. extracted horizon surfaces from 3D seismic data using deep learning [
12]. Using the U-Net network, Song Huan et al. input seismic data containing multiple waves into a successfully trained neural network, which can directly output seismic data after multiple wave suppression [
13]. Sergei et al. create a method of geologic body interpretation from seismic data based on deep learning [
14]. It can be seen that data processing of seismic data using deep learning is not only widely used but also has outstanding results and high adaptability.
As a result, this paper investigates a weak signal extraction of reconstruction results of passive seismic records with short periods by using convolutional neural networks, using only a seismic record with longer period and its reconstruction results as training labels to extract the buried effective signal from the complex coherent noise, reduce the effect of coherent noise, and suppress spurious waveforms. In this way, passive seismic records with short periods are utilized to obtain the effect of reconstructed records with higher signal-to-noise ratios. There is no need to superimpose multiple reconstructed passive seismic records with long period, and only a short recording time is required to achieve the desired effect. Therefore, we do not need to receive passive-source seismic signals for a long time, which greatly improves acquisition efficiency. In addition, in the field acquisition process, the use of a shorter period of passive seismic record can also achieve real-time monitoring of the passive exploration with high quality, grasp real-time acquisition information, and determine in advance whether the acquisition area has detection value. In urban construction, before and after the earthquake can also play the role of real-time monitoring of underground structures.
3. Numerical Examples
3.1. Layered Model
In this paper, we first use a simple and small four-layered model to verify the feasibility of this idea experimentally. The velocity model we used is shown in
Figure 1. The random waveforms used are shown in
Figure 2.
We used 2000 randomly distributed subsurface passive sources excited at random time points below 300 m. Once excited, these point sources will vibrate continuously for random periods of time, and the resulting waves are finally received by the geophone. The waveforms are formed by convolution of low-frequency Ricker wavelet with a main frequency of 12 Hz and random sequences. Twelve hertz is just a random value we chose during the experiment to meet the low-frequency range, if necessary, we can also choose a lower or higher center frequency. In general, the smaller the grid spacing and sampling interval, the higher the resolution of the seismic record, so we set a grid spacing of 1 m and a sampling interval of 0.5 ms and recorded continuously for 150 s. The resulting passive seismic records with 5 s and passive seismic records with 100 s are shown in
Figure 3.
Next, we use the cross-correlation method to reconstruct them, and the resulting reconstructed seismic records are shown in
Figure 4. It can be clearly seen that the waveforms of the seismic record with long period are clearer and more continuous, and the intensity of coherent noise is lower. The waveform of the reconstructed seismic record is incomplete, and the weak signal is almost buried in the coherent noise, which will affect the subsequent imaging and inversion if used directly.
From the above results, we know that the reconstruction results of passive seismic records with short periods have lower signal-to-noise ratio, stronger coherent noise, weaker effective signal, and more spurious events compared with the reconstruction results of passive seismic records with long periods. This is due to the fact that we used a large number of noise sources, and they can all continue to vibrate for a period of time, so the quality of the noise in each small period of time is not very different. The longer the recording time is, the more effective signals are received and the better the reconstructed seismic record is.
We want to suppress the coherent noise in the reconstruction results of passive seismic records with short periods, so that they have a high signal-to-noise ratio similar to that of the reconstruction results of passive seismic records with short periods. Therefore, we selected two conventional denoising methods for testing. As shown in
Figure 5, when it is processed using median filtering, a small filtering window leads to little processing effect, and a large filtering window causes the effective signal to be mixed with the strong coherent signal. When it is processed using f-k filtering, only a small amount of the effective signal is enhanced, and most of the signal becomes weaker. Conventional methods cannot achieve good processing results, so we try to adopt a deep learning approach for testing.
The deep learning network model we used is shown in
Figure 6. Although the excitation time and duration of underground sources are random, the virtual shot seismic records of adjacent locations in similar time periods are always similar, so their contribution to the neural network is not large. Therefore, we select seven receiving point positions that are far away from each other and select virtual shot records reconstructed from four segments of noise records with a certain interval for each position as the training set. Additionally, the training labels use the reconstructed results of passive seismic records with long periods of virtual shot locations in corresponding virtual shot locations, respectively; the test set uses the reconstructed results of passive seismic records with short periods of virtual shots reconstructed in other locations different from the training set to test the performance of the completed training network.
The deep learning framework used in our experiment is tensorflow-gpu, version 1.8. The number of epochs is set to 500, the activation function is tanh, and the dropout is given as 0.6 to prevent overfitting on the one hand and to give more expectation to the potential of the neural network on the other. The optimizers use adam. Among them, the main factors that affect the training time are the size of the dataset and the number of epochs. The training took a total of 8 h, and the processing of the test data took less than 1 min. The result of the test data processed by the neural network and the difference between it and the reconstruction result of the corresponding passive seismic records with long periods are shown in
Figure 7.
After comparison, we can find that in the effective signals buried in the reconstruction results, the passive seismic records with short periods are extracted, which effectively improves the signal-to-noise ratio and lays a good foundation for improving the imaging accuracy in the future. Comparing the reconstruction results of passive seismic records with long periods, we can also find that the level of coherent noise is lower in our processed reconstruction record. Observing the difference profile, we can find that the effective waves are effectively extracted, and the difference profile basically contains no information of effective waves. Only the coherent noise interference of the reconstruction record of passive seismic records with long periods and the lateral interference generated during the deep learning processing exist. Therefore, we can confirm that this means of data enhancement reconstruction results of passive seismic records with short periods is feasible and effective. The disadvantage is that some parts of the reflected wave will show weak amplitude, but its partial effect can be attenuated by gain.
3.2. Complex Model Numerical Experiment
From the processing of the simple-layered model, we can see that good results can be obtained by using convolutional neural networks to process reconstructed records of passive seismic records with short periods. Then, is it possible to achieve good results for the processing of complex models?
We use a complex velocity model as shown in
Figure 8 for our experiments. We use 192 channels for reception, and we randomize the source excitation location and time point in order to realistically simulate the real situation of a passive seismic record in the subsurface. In order to test the effect of the model on different waveforms, a random noise source was used for this test tremor source so that it occasionally appears to be sparser or denser during the recording period. The experimental results are shown in
Figure 9 and
Figure 10. Here, we select the reconstruction records of passive seismic records with short periods, processing results of passive seismic records with short periods, passive seismic records with long periods, and active-source forward modeling results. The virtual shots located at channel 60 and channel 96, respectively, are for display. In order to facilitate the observation of details, we only capture the reconstruction records of the first 1.0 s, which are rich in waveforms and have a large impact on the subsequent processing for observation and comparison.
Figure 11 shows a comparison of the data in one channel when the virtual shot is located in the 60th channel.
By comparison, we can find that the coherent noise, which is originally comparable with the effective signal energy, is well suppressed after processing, and the noise level is even lower than that of reconstruction of passive seismic records with long period, while the weak effective signal is well retained. The spurious events in the recordings are also suppressed effectively. Although there is still a gap between the waveforms in the processed recordings and the active-source recordings, the critical reflected waves are well displayed. Comparing the data of the channels, the processed reconstructed channel set of passive seismic records with short periods cannot only match better with the reconstructed channel set of passive seismic records with long periods in terms of phase, but also, it is closer in terms of amplitude. Generally speaking, the more far away from the surface the coherent noise interferes with the channel set, and our network can suppress it very well. In comparison with the simple laminar model, although the results of our complex model processing are not as highly restored, and the strength and continuity of the effective signals after processing are not as strong as those of the reconstruction records of passive seismic records with long periods, our model still effectively extracts the effective signal and reduces the intensity of the coherent noise, which is even weaker than that of the reconstruction records of passive seismic records with long periods. With this method, enhancement of the effective signal and reduction in the effects caused by coherent noise are guaranteed. Additionally, only a short period of recording time is required to achieve this, significantly reducing the dependence of passive seismic record exploration methods on recording time.
4. Discussion
Method selection of seismic interferometry: There are three common seismic interferometry methods. They are cross-correlation, multidimensional deconvolution, and cross-coherence. Among them, the multidimensional deconvolution method and cross- coherence method have certain effects on the suppression of multiple and surface waves, although they are clearer in waveform display. However, the cross-correlation method also has the advantages of a simple formula, small calculation, and stable effect. For this experiment, it is possible to choose which seismic interferometry method to make the dataset with. We chose the cross-correlation method just to obtain the dataset faster and more stably.
Selection of neural network framework: For the experiment of enhancing the weak signal of passive seismic records with short periods, we only used the most traditional convolutional neural network and achieved good results. Maybe there will be more suitable and effective convolutional neural networks in the future, and we will keep trying this.
Setting of hyperparameters: It is widely known that the hyperparameters of a convolutional neural network include the number of convolutional layers, the number of convolutional kernels contained in each layer, the learning rate, dropout, etc. When the number of convolutional layers is small, the model cannot learn enough features, and too many will lead to slow learning but little improvement. The number of convolutional kernels per layer depends on the computing hardware and the size of the input and output. Too small a learning rate leads to slow learning and overfitting, too large will lead to loss shock and nonconvergence. Too small a dropout leads to overfitting, and too large a dropout leads to underfitting. Epoch is the opposite of dropout, too small leads to underfitting, and too large leads to overfitting. Too small a batch size leads to wasted memory and reduced efficiency, too large leads to convergence of the model to some local extrema. After many attempts, the hyperparameters in this paper are reasonable and finally achieve good results.
In this paper, we propose a deep-learning-based method for weak signal recognition and coherent noise suppression of passive seismic records with short periods. Using reconstructed records of passive seismic records with long periods as the label and reconstructed records of short-period passive seismic records as the training set, the obtained network has a good noise reduction and signal recognition capability. After using this network for reconstruction records of passive seismic records with short periods, the processing results have a high signal-to-noise ratio. It makes the shorter records of conventional passive seismic records available, which greatly improves the utilization of passive seismic records and can effectively reduce the recording time required for acquisition and improve acquisition efficiency.
In addition, the method can be used for real-time monitoring of the quality of passive seismic exploration because it can extract effective signals for passive seismic records with short periods, and we can avoid the situation where the passive sources are collected in areas where the subsurface seismic sources are not abundant, or where the data are not available due to the short reception time of the instrument. It also has some application value for construction, earthquake monitoring, groundwater flow, dam structure monitoring, and other work that requires continuous real-time monitoring of subsurface tectonic changes.