Improved Dempster–Shafer Evidence Theory for Tunnel Water Inrush Risk Analysis Based on Fuzzy Identiﬁcation Factors of Multi-Source Geophysical Data

: Water inrush is one of the most important risk factors in tunnel construction because of its abruptness and timeliness. Various geophysical data used in actual construction contain useful information related to groundwater development. However, the existing approaches with such data from multiple sources and sensors are generally independent and cannot integrate this information, leading to inaccurate projections. In addition, existing tunnel advanced geological forecast reports for risk projections interpreted by human operators generally contain no quantitative observations or measurements, but only consist of ambiguous and uncertain qualitative descriptions. To surmount the problems above, this paper proposes a tunnel water inrush risk analysis method by fusing multi-source geophysical observations with fuzzy identiﬁcation factors. Speciﬁcally, the membership function of the fuzzy set is used to solve the difﬁculty in determining the basic probability assignment function in the improved Dempster–Shafer evidence theory. The prediction model of efﬂuent conditions fuses seismic wave reﬂection data, ground penetrating radar data, and transient electromagnetic data. Therefore, quantitative evaluations of the efﬂuent conditions are achieved, including the strand water, linear water, seepage and dripping water, and anhydrous. Experimental evaluations with a typical tunnel section were conducted, in which the state of the groundwater from a series of geological sketch reports in this sectionpaper were used as ground truth for veriﬁcation. The experimental results revealed that the proposed method not only has high accuracy and robustness but also aligns well with different evidence effectively that generally contradicts manual interpretation reports. The results from 12 randomly selected tunnel sections also demonstrate the generalization abilities of the proposed method.


Introduction
China has the largest number of tunnels at the largest scale worldwide.However, in some remote, complex, and difficult mountainous areas in western China, the construction of long and deep buried tunnels running hundreds of kilometers is still enormously challenging [1].Because of the sudden and destructive characteristics of tunnel water inrush, construction equipment and materials in the tunnel can be buried, which delays the construction period and seriously threatens the life and safety of construction personnel [2][3][4].In addition, surface water resources outside the cave are depleted, thereby destroying the surface ecological environment.Therefore, how to use the existing data to correctly and efficiently predict the water abundance of the surrounding rock in front of the tunnel face is of great significance.
Advanced geological prediction can obtain the location and scale of water-rich areas in front of the tunnel face in advance and provide reasonable excavation methods and effective pre-support and pre-reinforcement measures for construction to reduce the possibility of a tunnel water inrush disaster [5][6][7].It is one of the most important and basic means in current tunnel construction technology.Geophysical detection results are widely used in tunnel advanced geological prediction because of high precision, wide detection range, and high efficiency [8][9][10].Based on the difference in the physical properties between the tunnel's surrounding rock and the water-rich area, the geophysical exploration method analyzes the variation law of the geophysical field to predict the location and scope of water inrush [11,12].The equipment used in the geophysical prospecting method is portable and does not affect the excavation of the tunnel face.Commonly used methods are seismic wave reflection, ground penetrating radar (GPR), and transient electromagnetic methods (TEM).The seismic reflection wave method mainly transmit seismic wave signals, and the wave speed and amplitude of seismic waves are severely attenuated when encountering a large amount of water, so the water-rich areas ahead can be judged according to the reflected signals [13,14].The widely used equipment in the seismic wave reflection method is the tunnel seismic prediction system (TSP) developed and produced by Amberg [15]; the generated data is also called TSP data.The principle of GPR is to transmit electromagnetic wave signals through a transmitting antenna and then identify structures such as karst and water-rich areas by analyzing the waveform of electromagnetic wave propagation [16].TEM is widely used to predict water enrichment because it can obtain the apparent resistivity distribution in front of the tunnel face according to the principle of electromagnetic induction [17].Different geophysical methods have various advantages, disadvantages, prediction ranges, and accuracies.Because of this, the prediction results of the surrounding rock's effluent from different geophysical data sometimes differ.Therefore, the fusion method of multi-source geophysical exploration data is not only the current development direction but also an important means to solve the problem of tunnel disasters under complex geological conditions.
Most existing fusion methods are used to establish a comprehensive advanced forecasting system, but this method depends on manual identification [18][19][20].It is challenging to meet the project requirements using a comprehensive forecast.Therefore, there are still some difficulties in fusing various geophysical data containing uncertain information to form a more accurate analysis result of water output, mainly including some characteristics of multi-source geophysical data itself, namely heterogeneity and complex correlation.
(1) Heterogeneity of the multi-source geophysical data.Different advanced geological prediction methods have been implemented for the same tunnel face to detect the water outlet of the tunnel.The TSP data store the attenuation parameters of seismic waves propagating in the tunnel, including longitudinal wave velocity, shear wave velocity, Poisson's ratio of rock, Young's modulus, and positive and negative reflection interfaces of seismic waves.The GPR data store the distribution of the amplitude, frequency, event axis, and energy of the electromagnetic wave signal propagation in the tunnel's surrounding rock.Additionally, the TEM data measure the resistivity distribution of the surrounding rock.In addition, these parameters are presented in different formats (tables, images).Based on the generation method of the data field, the propagation characteristics of the wave, signal processing method, and characteristic parameters are all different.Therefore, we require a complementary method of fusing multi-source geophysical data to extract more valuable information.(2) Complex correlation characteristics of multi-source geophysical data.Although the detection objects of different data are all water output conditions of the same tunnel mileage section, it is difficult to have a unified method to correlate these data.In one case, the water-rich region deduced from the change in the physical property parameters of the TSP data reflect the absence of water in the electromagnetic waveform attenuation characteristics of the GPR data or high resistance to the resistivity value of the TEM data.Therefore, there may be contradictions in the performance results of these data in water-rich areas and there is no way to determine the internal relationship between the characteristics of these data.This was determined by the multi-source heterogeneity of the data.Nevertheless, the characteristics of these data are closely related to the water outflow in this area, and it is difficult to make full use of the interpreted results of these geophysical data to establish a unified water inrush risk analysis model.
To solve these problems, we propose an improved Dempster-Shafer evidence theory (D-S theory) for tunnel water inrush risk analysis based on the fuzzy characteristics of multi-source geophysical data.The D-S theory is an effective means to deal with uncertain information, which can realize multi-source information fusion without prior information, and is widely used in expert systems, pattern recognition, intelligence analysis, and other fields [21][22][23][24].In the face of evidence conflict, the results obtained using classical D-S theory are often ambiguous [25].At the same time, the basic probability assignment (BPA) is the basis of the D-S theory, and its value represents the credibility of a given result.The corresponding fusion results are different with different methods of constructing the BPA [26].The construction of BPA remains a question worth studying.Jiang et al. obtained a BPA based on an improved similarity measure between generalized fuzzy numbers [27].Tang et al. proposed a triangular fuzzy number model membership function to generate a generalized BPA [28].There is no fixed mode for the determination of BPA, and it is based on practical applications to obtain the proper BPA.The interpretation of the results of different geophysical data is ambiguous, and a considerable amount of sample data is accumulated during tunnel excavation.For this reason, an improved fuzzy D-S theory model integrating multi-source geophysical data was proposed in this study.
In summary, by inputting the TSP, GPR, and TEM data, the membership function of different physical property identification parameters in each type of data was established, and the fuzzy comprehensive function was applied to different membership functions to obtain the water-rich judgment of each geophysical data.Finally, the improved D-S theory synthesis rules were used to integrate the three types of geophysical identification results, and the water effluent status of the section was determined, namely strand water, linear water, seepage water, or no water.Finally, the data from 12 tunnel sections were randomly selected to establish three groups of comparative experiments.The classical D-S theory, fuzzy D-S theory, and improved fuzzy D-S theory were used to conduct experiments in different areas, and it was concluded that the prediction accuracy of this model was higher than that of other methods.
The remainder of this paper is organized as follows.Section 2 introduces the characteristics of three types of geophysical data and their corresponding fuzzy identification factors and fusion methods.In Section 3, a tunnel was selected for experimental research and the fusion prediction results of the effluent of surrounding rock were obtained.Section 4 compares the results of a single data source and three different fusion methods.Section 5 presents the conclusions of this study.

Multi-Source Geophysical Data
In this study, three types of tunnel advanced prediction geophysical data were used to conduct a fusion analysis of water inrush risk aiming to make full use of the potential information and the associated value of each data.Figure 1 shows in detail the various characteristics of the three types of data, including the physical field, detection range, physical parameters, and characteristics of the field.The dashed boxes represent the physical parameters of each dataset, which are also the basis for the fusion of this study.

TSP Data
Figure 2 shows the schematic of the TSP observation system.The general observation system consisted of 24 blast holes and 2-4 receivers.The buried depth and inclination of the blast hole and receiver are shown in the cross-section in Figure 2. First, the seismic wave produced by the explosive blast in the hole propagates around, and reflection occurs when it meets the interface of different wave-impedance media.By converting the reflected information into electrical signals and performing a series of processes, including observation system editing, tunnel modeling, data setting, time-varying height, bandpass filtering, first break picking, direct wave adjustment, Qanalysis, reflected wave extraction, P-S-wave separation, 3D-modeling, velocity analysis, depth migration, reflector extraction, and inversion results, the changes in P-wave velocity, S-wave velocity, P-S-wave velocity ratio, Poisson's ratio, dynamic Young's modulus, and the distribution of the reflection interface were obtained [15].The results are shown in Figure 3.

TSP Data
Figure 2 shows the schematic of the TSP observation system.The general observation system consisted of 24 blast holes and 2-4 receivers.The buried depth and inclination of the blast hole and receiver are shown in the cross-section in Figure 2.

TSP Data
Figure 2 shows the schematic of the TSP observation system.The general observation system consisted of 24 blast holes and 2-4 receivers.The buried depth and inclination of the blast hole and receiver are shown in the cross-section in Figure 2. First, the seismic wave produced by the explosive blast in the hole propagates around, and reflection occurs when it meets the interface of different wave-impedance media.By converting the reflected information into electrical signals and performing a series of processes, including observation system editing, tunnel modeling, data setting, time-varying height, bandpass filtering, first break picking, direct wave adjustment, Qanalysis, reflected wave extraction, P-S-wave separation, 3D-modeling, velocity analysis, depth migration, reflector extraction, and inversion results, the changes in P-wave velocity, S-wave velocity, P-S-wave velocity ratio, Poisson's ratio, dynamic Young's modulus, and the distribution of the reflection interface were obtained [15].The results are shown in Figure 3. First, the seismic wave produced by the explosive blast in the hole propagates around, and reflection occurs when it meets the interface of different wave-impedance media.By converting the reflected information into electrical signals and performing a series of processes, including observation system editing, tunnel modeling, data setting, timevarying height, bandpass filtering, first break picking, direct wave adjustment, Q-analysis, reflected wave extraction, P-S-wave separation, 3D-modeling, velocity analysis, depth migration, reflector extraction, and inversion results, the changes in P-wave velocity, Swave velocity, P-S-wave velocity ratio, Poisson's ratio, dynamic Young's modulus, and the distribution of the reflection interface were obtained [15].The results are shown in Figure 3.The tunnel face mileage of the section shown in Figure 3 was 325.6 m, and the forecast range was 325.6~+ 234 m.The tunnel face lithology was granite with a relatively broken rock mass, and the surrounding rock grade was Ⅴ.According to the change in parameters, the result of the unfavorable geological condition is divided into four parts, as shown in Figure 3.The segmented prediction results obtained by the interpreters were as follows: 1. Range 325.6~+ 311 m: The surrounding rock is basically the same as the current face, partially broken, with more developed joints, containing water (mainly linear and strand-shaped water).2. Range 311~+ 275 m: The surrounding rock is mainly fragmented, partially fragmented, with developed joints, and densely developed fissures, containing water (increased local water volume), and there is a risk of water inrush locally.Sufficient attention must be paid during the construction process.3. Range 275~+ 255 m: the surrounding rock is relatively broken, with relatively developed joints, and contains water (mainly linear and strand-shaped water).The tunnel face mileage of the section shown in Figure 3 was 325.6 m, and the forecast range was 325.6~+ 234 m.The tunnel face lithology was granite with a relatively broken rock mass, and the surrounding rock grade was V.According to the change in parameters, the result of the unfavorable geological condition is divided into four parts, as shown in Figure 3.The segmented prediction results obtained by the interpreters were as follows: 1.
Range 325.6~+ 311 m: The surrounding rock is basically the same as the current face, partially broken, with more developed joints, containing water (mainly linear and strand-shaped water).

2.
Range 311~+ 275 m: The surrounding rock is mainly fragmented, partially fragmented, with developed joints, and densely developed fissures, containing water (increased local water volume), and there is a risk of water inrush locally.Sufficient attention must be paid during the construction process.

3.
Range 275~+ 255 m: the surrounding rock is relatively broken, with relatively developed joints, and contains water (mainly linear and strand-shaped water).

4.
Range 255~+ 234 m: the surrounding rock is relatively broken, with relatively developed joints and fissures, and contains water (the local water volume increases).

GPR Data
The basic principle of the geological radar method is to work together through a transmitting antenna, a receiving antenna, and a host computer.When collecting sample points, the control unit first sends a control signal to the transmitter and the receiver.After receiving the signal, the transmitter transmits an electromagnetic pulse wave of a determined main frequency to a measurement point through the transmitting antenna (T).In the process of propagation through various media, the electromagnetic pulse wave meets the physical interface of different media (the difference interface of resistivity and dielectric constant), and wave reflection occurs [29].The reflected wave is received by the receiving antenna (R). Figure 4a shows the GPR detection principle, and Figure 4b shows the corresponding waveform generation principle.

GPR Data
The basic principle of the geological radar method is to work together through a transmitting antenna, a receiving antenna, and a host computer.When collecting sample points, the control unit first sends a control signal to the transmitter and the receiver.After receiving the signal, the transmitter transmits an electromagnetic pulse wave of a determined main frequency to a measurement point through the transmitting antenna (T).In the process of propagation through various media, the electromagnetic pulse wave meets the physical interface of different media (the difference interface of resistivity and dielectric constant), and wave reflection occurs [29].The reflected wave is received by the receiving antenna (R). Figure 4a shows the GPR detection principle, and Figure 4b shows the corresponding waveform generation principle.A GPR machine is easily disturbed by machines and pipelines in tunnels.Currently, it is primarily used for the detection and prediction of karst caves, water-bearing zones, and fractured zones [30].It uses the change in the dielectric constants of different media to make judgments, which requires a rich detection experience.The radar image contains rich information about the detected body, which is qualitatively interpreted according to the characteristics of the radar image (such as the dense zone of joints, voids, reflection interface, etc.). Figure 4c shows the GPR waveform with a mileage of 332.8~+ 306 m, from which the interpreter draws the following conclusions.The reflected GPR signal in this section is strong, and the waveform is relatively chaotic.The conclusion is that the surrounding rock is partially broken with more developed joints and contains water (mainly in the form of linear and strand-shaped water).Among them, the 327~+ 320 m sections have densely developed joints.

TEM Data
The transient electromagnetic method uses eddy currents generated by the electromagnetic induction principle to detect objects with good conductivity and is often used to detect metal orebodies in the early stage.During tunnel construction, the electrical conductivity of the water-rich area is stronger than that of the surrounding dry area; therefore, it is widely used to explore water [31].The working principle of the transient electromagnetic method is shown in Figure 5.A GPR machine is easily disturbed by machines and pipelines in tunnels.Currently, it is primarily used for the detection and prediction of karst caves, water-bearing zones, and fractured zones [30].It uses the change in the dielectric constants of different media to make judgments, which requires a rich detection experience.The radar image contains rich information about the detected body, which is qualitatively interpreted according to the characteristics of the radar image (such as the dense zone of joints, voids, reflection interface, etc.). Figure 4c shows the GPR waveform with a mileage of 332.8~+ 306 m, from which the interpreter draws the following conclusions.The reflected GPR signal in this section is strong, and the waveform is relatively chaotic.The conclusion is that the surrounding rock is partially broken with more developed joints and contains water (mainly in the form of linear and strand-shaped water).Among them, the 327~+ 320 m sections have densely developed joints.

TEM Data
The transient electromagnetic method uses eddy currents generated by the electromagnetic induction principle to detect objects with good conductivity and is often used to detect metal orebodies in the early stage.During tunnel construction, the electrical conductivity of the water-rich area is stronger than that of the surrounding dry area; therefore, it is widely used to explore water [31].The working principle of the transient electromagnetic method is shown in Figure 5.By passing a pulse current through the transmitting coil, the generated magnetic field generates an eddy current when encountering a good conductor, such as a rich water body, and then generates a secondary magnetic field.The receiving coil, in turn, generates a new induced current under the influence of the secondary magnetic field [32].The front apparent resistivity distribution map is obtained by further analysis.Figure 6a shows the apparent resistivity results for the 305.2~+ 235 m mileage section, with a tunnel face mileage of 305.2 m and a transmitting coil mileage of 307 m.The black dotted frame indicates the range of the tunnel, and the red dotted frame represents the area with low resistivity, which is also considered the area with a higher risk of water inrush.Figure 6b shows the relevant parameters of the collected data.The prediction conclusion drawn by the interpreters based on this result is that the water volume of the surrounding rock in this section is essentially the same as that of the current tunnel face, and the water volume is still large, mainly linear and strand effluent.

Fuzzy Identification Factors
Considering the physical parameters of shear wave velocity   , the ratio of longitudinal wave velocity to shear wave velocity   /  , Poisson's ratio in TSP data, waveform parameter amplitude variation, in-phase axis continuity, reflected signal intensity in GPR data, and apparent resistivity in TEM data, these parameters are fundamental factors for the interpreter to judge the water outlet of the tunnel; they directly reflect the affluent state of the surrounding rock, which we call the identification factors in this study.
These identification factors changed correspondingly in water-rich areas.For example, when encountering water-rich areas, the   of seismic waves often drops signifi- By passing a pulse current through the transmitting coil, the generated magnetic field generates an eddy current when encountering a good conductor, such as a rich water body, and then generates a secondary magnetic field.The receiving coil, in turn, generates a new induced current under the influence of the secondary magnetic field [32].The front apparent resistivity distribution map is obtained by further analysis.Figure 6a shows the apparent resistivity results for the 305.2~+ 235 m mileage section, with a tunnel face mileage of 305.2 m and a transmitting coil mileage of 307 m.The black dotted frame indicates the range of the tunnel, and the red dotted frame represents the area with low resistivity, which is also considered the area with a higher risk of water inrush.By passing a pulse current through the transmitting coil, the generated magnetic field generates an eddy current when encountering a good conductor, such as a rich water body, and then generates a secondary magnetic field.The receiving coil, in turn, generates a new induced current under the influence of the secondary magnetic field [32].The front apparent resistivity distribution map is obtained by further analysis.Figure 6a shows the apparent resistivity results for the 305.2~+ 235 m mileage section, with a tunnel face mileage of 305.2 m and a transmitting coil mileage of 307 m.The black dotted frame indicates the range of the tunnel, and the red dotted frame represents the area with low resistivity, which is also considered the area with a higher risk of water inrush.Figure 6b shows the relevant parameters of the collected data.The prediction conclusion drawn by the interpreters based on this result is that the water volume of the surrounding rock in this section is essentially the same as that of the current tunnel face, and the water volume is still large, mainly linear and strand effluent.

Fuzzy Identification Factors
Considering the physical parameters of shear wave velocity   , the ratio of longitudinal wave velocity to shear wave velocity   /  , Poisson's ratio in TSP data, waveform parameter amplitude variation, in-phase axis continuity, reflected signal intensity in GPR data, and apparent resistivity in TEM data, these parameters are fundamental factors for the interpreter to judge the water outlet of the tunnel; they directly reflect the affluent state of the surrounding rock, which we call the identification factors in this study.
These identification factors changed correspondingly in water-rich areas.For example, when encountering water-rich areas, the   of seismic waves often drops signifi- Figure 6b shows the relevant parameters of the collected data.The prediction conclusion drawn by the interpreters based on this result is that the water volume of the surrounding rock in this section is essentially the same as that of the current tunnel face, and the water volume is still large, mainly linear and strand effluent.

Fuzzy Identification Factors
Considering the physical parameters of shear wave velocity v s , the ratio of longitudinal wave velocity to shear wave velocity v p /v s , Poisson's ratio in TSP data, waveform parameter amplitude variation, in-phase axis continuity, reflected signal intensity in GPR data, and apparent resistivity in TEM data, these parameters are fundamental factors for the interpreter to judge the water outlet of the tunnel; they directly reflect the affluent state of the surrounding rock, which we call the identification factors in this study.
These identification factors changed correspondingly in water-rich areas.For example, when encountering water-rich areas, the v s of seismic waves often drops significantly, v p /v s rises, Poisson's ratio increases, the amplitude of the reflected signal of electromagnetic waves changes strongly, the continuity of the in-phase axis is poor, the signal strength is strong, and the apparent resistivity is significantly reduced.However, these parameters are relative changes concerning the effluent of the tunnel face because different lithologies, degree of joint development, and degree of rock fragmentation will affect the magnitude of these parameter values.Therefore, in the advanced geological prediction report, the manually interpreted forecast analysis is usually a description of the changes in these parameters, such as rising significantly, falling slightly, and other fuzzy terms.There is no quantitative mapping between these fuzzy terms and effluent results, resulting in different interpreters drawing different conclusions.Therefore, this study proposes a fusion method based on fuzzy characteristics to form a risk analysis model from fuzzy identification to effluent situations.

Fusion Methods
Given the description of the uncertain conclusions in the existing advanced prediction technical reports, such as little change in wave velocity, high Poisson's ratio, and strong GPR waveform reflection signal, these words are a fuzzy qualitative concept, obtained by professional interpreters.Therefore, it is highly specialized and requires comprehensive interpretation by experts with rich geophysical knowledge and geological experience to correctly identify the effluent condition of the tunnel.In addition, different geophysical exploration data are typically interpreted by different geophysical exploration experts.Owing to their strong subjectivity, identification results are often inconsistent.To fully explore the relationship between various data sources and improve the accuracy of the identification results, this study combined fuzzy sets and the improved D-S theory integrating TSP, GPR, and TEM data to establish a fusion model to obtain more accurate water output forecast results.The fusion principle model is shown in Figure 7.
Remote Sens. 2022, 14, x FOR PEER REVIEW 8 of 20 cantly,   /  rises, Poisson's ratio increases, the amplitude of the reflected signal of electromagnetic waves changes strongly, the continuity of the in-phase axis is poor, the signal strength is strong, and the apparent resistivity is significantly reduced.However, these parameters are relative changes concerning the effluent of the tunnel face because different lithologies, degree of joint development, and degree of rock fragmentation will affect the magnitude of these parameter values.Therefore, in the advanced geological prediction report, the manually interpreted forecast analysis is usually a description of the changes in these parameters, such as rising significantly, falling slightly, and other fuzzy terms.There is no quantitative mapping between these fuzzy terms and effluent results, resulting in different interpreters drawing different conclusions.Therefore, this study proposes a fusion method based on fuzzy characteristics to form a risk analysis model from fuzzy identification to effluent situations.

Fusion Methods
Given the description of the uncertain conclusions in the existing advanced prediction technical reports, such as little change in wave velocity, high Poisson's ratio, and strong GPR waveform reflection signal, these words are a fuzzy qualitative concept, obtained by professional interpreters.Therefore, it is highly specialized and requires comprehensive interpretation by experts with rich geophysical knowledge and geological experience to correctly identify the effluent condition of the tunnel.In addition, different geophysical exploration data are typically interpreted by different geophysical exploration experts.Owing to their strong subjectivity, identification results are often inconsistent.To fully explore the relationship between various data sources and improve the accuracy of the identification results, this study combined fuzzy sets and the improved D-S theory integrating TSP, GPR, and TEM data to establish a fusion model to obtain more accurate water output forecast results.The fusion principle model is shown in Figure 7.The input data consist of three types of geophysical data.Taking TSP data as an example, the fuzzy physical parameters used to judge water richness in TSP data are shear wave velocity v s , the ratio of longitudinal wave velocity to shear wave velocity v p /v s , and Poisson's ratio, which are also called the identification factors in this study.The value of each identification factor is calculated by its respective membership function, and then the fuzzy comprehensive function acts on these three factors.Finally, the comprehensive identification result of the TSP on water effluent is obtained.The prediction results of the GPR and TEM data were obtained similarly.The prediction results of each type of data may be different, and the improved evidence theory can fully solve the conflict between evidence to obtain a more reliable fusion result; thus, the final water-rich state result is predicted.

Membership Functions of Different Identification Factors
Fuzzy set theory was developed by L.A. Zadeh in 1965 [33].In the language system of the real world, many words such as "young", "very" and "almost" are fuzzy concepts.In the theory proposed by Zadeh, the fuzzy set A on the domain U is determined, and the following mapping µ A is called the membership function of A [34].
x is a certain identification factor, and µ A (x) is the degree of belonging of x to A, referred to as the degree of membership.In this study, there were four fuzzy sets corresponding to the degree of water discharge in front of the tunnel face: A = {Strand water}, B = {Linear water}, C = {Seepage and dripping water}, and D = {Anhydrous}.The identification factors were divided into three categories according to the three types of geophysical data, represented by x, y, and z.In the TSP, the identification factors v s , v p /v s and Poisson's ratio are expressed as x 1 , x 2 , x 3 , respectively.In the GPR, the amplitude of the reflected wave signal changes, the continuity of the event axis, and the signal strength of the reflected wave are expressed as y 1 , y 2 , y 3 , respectively.There is only one identification factor, the apparent resistivity in the TEM, denoted as z.
Through the collection of a large number of advanced geological prediction reports, the membership functions of these identification factors can be approximated by a ridge distribution or semi-ridge distribution function using statistical analysis [35]. Figure 8 shows the membership functions of the different identification factors obtained from statistics.
The horizontal axis in Figure 8a represents the change in the shear wave velocity in front of the face relative to that at the face.Note that this time it is not based on the magnitude of the velocity value, but on the relative change; because the rock and lithology are different, the inherent propagation velocity is different.The water-rich situation can only be determined according to this change.The vertical axis in Figure 8a represents the corresponding membership degrees of the different velocity change values in the four fuzzy sets.Similarly, the horizontal axes in Figure 8b-d represent the changes in v p /v s , Poisson's ratio, and apparent resistivity, respectively.The change in the identification factors corresponding to the four effluent grades was based on the tunnel face as the linear effluent.If the effluent conditions of the tunnel face are different, the membership function must be recalculated.Because the identification factors are all changed values, when the basis is different, the range of change is also different.For the identification factors of GPR data with its non-numerical characteristics, after analyzing the existing reports, the study gives their membership value as shown in Table 1.The horizontal axis in Figure 8a represents the change in the shear wave velocity in front of the face relative to that at the face.Note that this time it is not based on the magnitude of the velocity value, but on the relative change; because the rock and lithology are different, the inherent propagation velocity is different.The water-rich situation can only be determined according to this change.The vertical axis in Figure 8a represents the corresponding membership degrees of the different velocity change values in the four fuzzy sets.Similarly, the horizontal axes in Figure 8b-d represent the changes in   /  , Poisson's ratio, and apparent resistivity, respectively.The change in the identification factors corresponding to the four effluent grades was based on the tunnel face as the linear effluent.If the effluent conditions of the tunnel face are different, the membership function must be recalculated.Because the identification factors are all changed values, when the basis is different, the range of change is also different.For the identification factors of GPR data with its non-numerical characteristics, after analyzing the existing reports, the study gives their membership value as shown in Table 1.After obtaining the membership degree of each type of identification factor for the four effluent fuzzy sets, the prediction results of geophysical data to the effluent condition are output using the fuzzy synthesis function to fuse various kinds of factors.The fuzzy synthesis function is defined as follows [36] S[µ 1 , µ 2 , µ 3 , . . .
where µ 1 ∼ µ n are the membership values of the different identification factors for the same type of geophysical data.

BPA Based on the Degree of Membership for the Improved D-S Theory
Whether it is the classical D-S theory or the improved D-S theory, the BPA function is obtained through relevant experience in a specific environment, and different BPA functions have quite different fusion results, which makes the fusion results susceptible to subjective influence.Considering that the prediction conclusion of the water outlet in front of the tunnel face has a fuzzy concept, a probability distribution function generation method based on the degree of membership was proposed.
The classical D-S theory was first proposed by Dempster, the so-called upper and lower limit probability derived from multi-valued mapping, and later developed by Shafer to make evidence theory a complete method for dealing with uncertainty problems [37].It takes the BPA function instead of probability as a metric.In the D-S theory, Θ is used to represent an identification frame, which consists of a series of mutually exclusive objects, namely, [38] In this study, there are three identification frameworks for three kinds of geophysical data categories, and the objects in each identification framework are four uncertain expressions of effluent, namely, strand effluent, linear water, seepage and dripping water, and anhydrous.Where 2 Θ is the set composed of all subsets of Θ, and the BPA function ϕ is defined as the mapping of 2 Θ → [0, 1] , which satisfies the following [39]      where A is an arbitrary subset of the identification space and ϕ(A) represents the degree of trust in hypothesis set A in a certain environment.The three types of geophysical data were judged independently for their respective identification frames and then combined using the Dempster combination rule.The Dempster combination rule is defined as follows.Suppose ϕ 1 and ϕ 2 are two different BPA functions; then, their orthogonality sum where, K = ∑ x∩η=φ ϕ 1 (x) × ϕ 2 (y), K is the conflict factor, which reflects the conflicting degree of the evidence; (1 − K) −1 is the normalization factor, and this combination rule is equivalent to assigning conflicts to each set in equal proportions in the combination.Classical D-S theory cannot resolve serious conflicts between evidence, such as evidence The results after fusion are ϕ(A) = 0, ϕ(B) = 1, ϕ(C) = 0, which is inconsistent with the actual situation.When combining multi-source geophysical data, owing to the subjectivity of manual interpretation, the results of different data identified as water outflow conditions can be completely contradictory; therefore, the evidence theory needs to be improved.After a series of discoveries by Sun [41], when there are n number of evidence sources, the corresponding BPA functions are ϕ 1 , ϕ 2 , . . .ϕ n .At this time, let the conflict coefficient between evidence sources i and j be k ij , and the improved D-S theory is as follows In Equation (6), The equation holds that when there is a conflict between the evidence, some of the conflicting parts are still useful, and information other than useful is given to the unknown part, which is represented as the set X. ε indicates the credibility of the evidence, which is a useful part of the conflict.The larger ε is, the larger the useful information of the conflicting part.
Finally, the membership degrees obtained using different identification factors are used as the value of the probability distribution function for the fusion application of the improved evidence theory, and a more accurate prediction result can be obtained.

Study Area
A tunnel in China was selected as the study area, as shown in Figure 9.The tunnel exit section overlies the Quaternary system slope residual silty clay, coarse breccia, gravel soil, Quaternary Pleistocene moraine layer coarse (fine) breccia, gravel soil, and the soil layer is about 8-15 m thick.The underlying bedrock of the tunnel is coarse-grained biotite monzonitic granite with strong tectonics and active faults.In addition, two sets of shear conjugate joints are developed, which have different degrees of water conductivity and water abundance and are also the main migration and storage structures of mountain groundwater.The surrounding rock of the tunnel is mainly adamellite, dark gray, partly brownish yellow, and grayish white.It is mainly weakly weathered, with local strong weathering.The joint fissures are relatively developed and extend well.

Result of Effluent Condition
In the mileage sections 325~+ 234 m, the identification factors of the 13 parts changed.Therefore, a comprehensive prediction of water inrush was made for each part.Here, we took the first section, 352~+ 311 m, as an example to describe the experimental process in detail.Section 2.1 introduced the process and results of obtaining three types of data in this mileage section and briefly described the results of manual interpretation.Here, the improved fuzzy D-S theory model was used to fuse these three types of data and auto- The ridge of the mountain where the tunnel is located is cut by several northeast surface trenches.The experimental section is located in the part passing through the trench, which is conducive to rainfall or snow melting catchment and concentrated in-filtration.
In terms of hydrogeology, the entrance of the tunnel is recharged by river tributaries, a water-rich strip is formed on the right side of the tunnel under the influence of river tributaries, and there is a direct hydraulic relationship.Therefore, the tunnel area has rich water-bearing conditions, which may also cause water inrush disasters.It is reasonable to select this tunnel as the experimental area to predict the quantitative water abundance in front of the tunnel face during excavation.The mileage of this section of the tunnel is 325~+ 234 m, and the advanced geological prediction method in the construction drawing design is geophysical (seismic wave reflection, geological radar, and transient electromagnetic methods).

Result of Effluent Condition
In the mileage sections 325~+ 234 m, the identification factors of the 13 parts changed.Therefore, a comprehensive prediction of water inrush was made for each part.Here, we took the first section, 352~+ 311 m, as an example to describe the experimental process in detail.Section 2.1 introduced the process and results of obtaining three types of data in this mileage section and briefly described the results of manual interpretation.Here, the improved fuzzy D-S theory model was used to fuse these three types of data and automatically obtain comprehensive results.
According to the corresponding membership function in Figure 6, the membership values of the three types of identification factors of the TSP data to the effluent fuzzy set are listed in Table 2. Using the fuzzy synthesis function, it was determined that the membership degree of the TSP data to fuzzy set A was µ A (TSP) =0.099, fuzzy set B µ B (TSP) = 0.676, fuzzy set C µ C (TSP) = 0.905 and fuzzy set D µ D (TSP) = 0.454.The fuzzy set with the largest degree of membership was the effluent state identified by the TSP data.Combined with the results in Table 2, we can see that the recognition result of a single identification factor was different from the comprehensive membership degree of the fusion of multiple recognition factors.Table 3 shows the membership values of three types of identification factors of GPR data to the effluent fuzzy set; similarly, µ A (GPR) = 0.700, µ B (GPR) = 0.900, µ C (GPR) = 0.633, and µ D (GPR) = 0.433.It can be seen that the identification result of GPR was different from that of the TSP.The recognition result of GPR was linear effluent, whereas TSP was seepage and dripping water.Table 4 shows the membership value of the apparent resistivity identification factor of the TEM data to the effluent fuzzy set and directly obtained µ A (TEM) = 0.000, µ B (TEM) = 0.655, µ C (TEM) = 0.500, µ D (TEM) = 0.050.The prediction result of single TEM data was the same as the GPR data but different from the TSP data, and the membership value of strand effluent was zero.In summary, for the identification framework Θ = {A, B, C, D}, the degree of support of different geophysical data for the water effluent situation is shown in Table 5.The results were normalized and fused.The classical D-S theory, fuzzy D-S theory, and improved fuzzy D-S theory were used to fuse the data, and the fusion results are presented in Table 6.Table 6 shows the fusion results of the fuzzy D-S theory and improved fuzzy D-S theory were all linear water.However, if the fuzzy D-S theory fusion method was used, because the BPA function value of TEM data to fuzzy set A was zero, no matter how large the values of TSP and GPR were, the final fusion result was still zero, which is contrary to the actual fusion situation.
When using the classical D-S theory fusion method, because the value of the BPA function depends on experience, it is highly likely that the BPA functions given by different interpreters would be different, resulting in contrasting results.The conclusions of the existing advance prediction reports are similar to those described in Section 2.1.The professionals can directly infer the water outflow state from different geophysical data according to the interpretation results, including strand water, linear water, seepage and dripping water, and anhydrous.Then, the conclusions of different geophysical data are fused with the classical D-S theory.Based on this, the fusion result here is strand water.
In this study, the water effluent conditions in this section were verified using the geological sketch of the tunnel face.The geological sketch of the tunnel face involves observing and recording the surrounding rock of the exposed tunnel, which mainly includes the size, state, rock strength, water gushing state, crack width, karst development degree, and design surrounding rock grade [42,43].Therefore, the prediction results can be verified based on the water-gushing state.Figure 10 shows a part of the geological sketch of the face and the corresponding pictures in the mileage sections 325~+ 311 m.A two-step excavation method was adopted in this section of the tunnel, and the content of the report was a description of the geological conditions of the upper half of the tunnel face.The report indicated that the water output of this series of tunnel faces was approximately 30 L/(min•10 m), which was evaluated as a linear effluent and was consistent with the experimental results of the fusion model.When the mileage was 318 m in Figure 10, the experimental results only showed that most of the water in this entire section was linear water, but there was still strand water in the lower right corner of the tunnel face.Although the amount of water was not large, advanced support was needed, such as leading a small conduit and pipe shed.
When the space between rock blocks is filled with water, the stability of the surrounding rock is reduced, and the tunnel is prone to deformation and instability collapse.Therefore, as an important unfavorable geological body that threatens the safety and efficient excavation of the tunnel, water-rich area requires more accurate control of the effluent in front of the tunnel.The fusion method proposed in this study can effectively predict the water outflow to reduce the occurrence of water inrush during tunnel construction.
Here, is a detailed explanation of the prediction process for the 325~+ 311 m mileage segments.Figure 11 shows all the prediction results of the 325~+ 234 m mileage segments, including the single prediction result of each geophysical dataset and the prediction results of the three different methods.It can be observed that our method has a higher fusion When the mileage was 318 m in Figure 10, the experimental results only showed that most of the water in this entire section was linear water, but there was still strand water in the lower right corner of the tunnel face.Although the amount of water was not large, advanced support was needed, such as leading a small conduit and pipe shed.
When the space between rock blocks is filled with water, the stability of the surrounding rock is reduced, and the tunnel is prone to deformation and instability collapse.Therefore, as an important unfavorable geological body that threatens the safety and efficient excavation of the tunnel, water-rich area requires more accurate control of the effluent in front of the tunnel.The fusion method proposed in this study can effectively predict the water outflow to reduce the occurrence of water inrush during tunnel construction.
Here, is a detailed explanation of the prediction process for the 325~+ 311 m mileage segments.Figure 11 shows all the prediction results of the 325~+ 234 m mileage segments, including the single prediction result of each geophysical dataset and the prediction results of the three different methods.It can be observed that our method has a higher fusion accuracy than the other methods.In fact, the accuracy of the fuzzy D-S theory was very close to that of our proposed method, but there is a situation where it cannot handle conflicts, resulting in a lack of robustness.

Discussion
In the experimental part, the 325~+ 311 mileage section was selected to predict the effluent, and it was concluded that the improved fuzzy D-S theory prediction model proposed in this study was consistent with the actual situation, while the fuzzy D-S theory could not make a reasonable prediction of the evidence, and the traditional D-S theory was less accurate.
To generalize the results, three methods were used to predict the effluent of another 12 random sections; the predicted results are shown in Figure 12. Figure 12a shows the histogram of the statistical results.The abscissa was 12 sets of mileage data, and ordinates 1-4 corresponded to anhydrous water, seepage and dripping water, linear water, and strand water, respectively.The results of the three single geophysical data and the three fusion methods were compared to those of the actual effluent.From the accuracy results in Figure 12b, the prediction accuracy of TEM was higher than that of TSP and GPR data, but lower than that of other fusion methods.Then, the improved fuzzy D-S theory was the same as the fuzzy D-S theory.However, the fuzzy D-S theory fusion result of the seventh group of samples was not a number (NaN).This was because the denominator was zero when calculating the attribution factor, which made the calculation meaningless.This situation was caused by the conflict of prediction results between different geophysical data, which was avoided by the improved fuzzy D-S theory.However, the BPA function of the classical D-S theory was based on the experience of on-site interpreters, making the prediction results unstable and fluctuating, and the accuracy was only 66.7%.

Discussion
In the experimental part, the 325~+ 311 mileage section was selected to predict the effluent, and it was concluded that the improved fuzzy D-S theory prediction model proposed in this study was consistent with the actual situation, while the fuzzy D-S theory could not make a reasonable prediction of the evidence, and the traditional D-S theory was less accurate.
To generalize the results, three methods were used to predict the effluent of another 12 random sections; the predicted results are shown in Figure 12. Figure 12a shows the histogram of the statistical results.The abscissa was 12 sets of mileage data, and ordinates 1-4 corresponded to anhydrous water, seepage and dripping water, linear water, and strand water, respectively.The results of the three single geophysical data and the three fusion methods were compared to those of the actual effluent.From the accuracy results in Figure 12b, the prediction accuracy of TEM was higher than that of TSP and GPR data, but lower than that of other fusion methods.Then, the improved fuzzy D-S theory was the same as the fuzzy D-S theory.However, the fuzzy D-S theory fusion result of the seventh group of samples was not a number (NaN).This was because the denominator was zero when calculating the attribution factor, which made the calculation meaningless.This situation was caused by the conflict of prediction results between different geophysical data, which was avoided by the improved fuzzy D-S theory.However, the BPA function of the classical D-S theory was based on the experience of on-site interpreters, making the prediction results unstable and fluctuating, and the accuracy was only 66.7%.
the same as the fuzzy D-S theory.However, the fuzzy D-S theory fusion result of the seventh group of samples was not a number (NaN).This was because the denominator was zero when calculating the attribution factor, which made the calculation meaningless.This situation was caused by the conflict of prediction results between different geophysical data, which was avoided by the improved fuzzy D-S theory.However, the BPA function of the classical D-S theory was based on the experience of on-site interpreters, making the prediction results unstable and fluctuating, and the accuracy was only 66.7%.Similarly, according to the comparative experiments of these 12 groups, it was proven that the method proposed in this paper is feasible for the fusion of multi-source tunnel geophysical data and has high accuracy and robustness.In general, from the experimental results of 12 groups randomly selected here and the tunnel mileage section selected in Section 3.2, the method proposed in this study has good performance in predicting water yield and is superior to the prediction results of single geophysical data and other fusion methods.Moreover, since the BPA value was obtained based on the change degree of different identification factors relative to the tunnel face, this fusion method is applicable to different surrounding rock lithology, and only needs to obtain the identification factor value at the tunnel's face.Therefore, the prerequisite for the use of this method is the correct acquisition of the initial identification factor, which is suitable for different tunnel environments.

Conclusions
To predict tunnel water inrush geological disasters, we proposed a prediction model combining multi-source geophysical exploration data to solve the discrepancy between the prediction results and reality caused by the subjectivity and uncertainty of traditional single data.Compared to single data, the fusion results were more dependable and stable.In addition, the fusion model proposed in this study has great practical value in several aspects.

1.
The model improves the automation of geophysical data interpretation and can reduce the number of interpreters used in tunnel construction, thus reducing cost.

2.
The results of this study can also be used as auxiliary reference information for interpreters, prompting the careful examination of problems when the conclusions of the model are inconsistent with those of interpreters.

3.
The prediction results of the water effluent obtained by the model in this study have high accuracy, robustness, and important reference values for practical engineering applications.

Figure 1 .
Figure 1.Features of multi-source geophysical data.

Figure 2 .
Figure 2. Schematic diagram of the TSP observation system.

Figure 1 .
Figure 1.Features of multi-source geophysical data.

Figure 2 .
Figure 2. Schematic diagram of the TSP observation system.

Figure 2 .
Figure 2. Schematic diagram of the TSP observation system.

Figure 5 .
Figure 5.The working principle of the transient electromagnetic method.

Figure 5 .
Figure 5.The working principle of the transient electromagnetic method.

20 Figure 5 .
Figure 5.The working principle of the transient electromagnetic method.

Figure 8 .
Figure 8. Membership functions of different identification factors.

Figure 8 .
Figure 8. Membership functions of different identification factors.

20 Figure 10 .
Figure 10.Geological sketch of a series of tunnel face in the mileage 325~+ 311 m.

Figure 10 .
Figure 10.Geological sketch of a series of tunnel face in the mileage 325~+ 311 m.

Figure 12 .
Figure 12.Three methods to predict the water output of 12 sections.

Figure 11 .
Figure 11.Prediction results of the effluent conditions for mileage 325~+ 234 m.

Figure 12 .
Figure 12.Three methods to predict the water output of 12 sections.Figure 12. Three methods to predict the water output of 12 sections.

Figure 12 .
Figure 12.Three methods to predict the water output of 12 sections.Figure 12. Three methods to predict the water output of 12 sections.

Table 1 .
Membership degree of GPR identification factors to effluent fuzzy sets.

Table 1 .
Membership degree of GPR identification factors to effluent fuzzy sets.

Table 2 .
Membership degree of TSP identification factors to fuzzy sets.

Table 3 .
Membership degree of GPR identification factors to fuzzy sets.

Table 4 .
Membership degree of GPR identification factors to fuzzy sets.

Table 5 .
Support degree of different geophysical data for water abundance.

Table 6 .
Fusion results from the three different methods.