Figure 3 presents the computation process of this research. Primarily, it predicts the typical observation points along the strike section, so the typical observation points should be determined first. Then, the x-coordinates of these points are computed from their locations; the influence radius is acquired based on the mining depth and the influence angle, and the time influence parameter is achieved according to the advancing rate and critical mining dimension. Based on the three computed parameters, the time function method can be conducted accordingly. The ground characteristic coefficient can be determined based on slope, curvature, soil, geology and land use status. The mountainous time function method can be conducted based on the time function method, slope and ground characteristic coefficient for the typical observation points. Taking the mountainous time function as the state equation, the EnKF method is achieved through several steps using an iteration process. Finally, the performance of the mountainous time function and the EnKF methods is evaluated through subsidence curves, error curves and error indexes.
Based on the workflow, the typical observation points along the strike section are determined first; then, the mountainous time function method is presented, and the EnKF method is established by using the mountainous time function method as the state equation; finally, the error indexes are selected to quantify and estimate the prediction quality based on the two methods.
3.1. Determination of Typical Observation Points
There are many observation points along the section. In this research, the subsidence process is modeled and predicted through typical observation points along the strike section. To determine the typical observation points, the x-coordinates of the points should be acquired first.
According to the open-off cut position and the offset distance (15 m in this research) of the workface, the x-coordinates of the observation points along the strike section were computed based on their locations and are shown in
Table 3.
Table 3 shows that the distances along the x-axis are approximate for each two adjacent points, about 20–30 m. The approximate distances provide good conditions for understanding the subsidence process along the x-axis.
The observation points are surveyed using the global positioning satellites (GPS) method from 13 June 2018 to 15 July 2020, for 19 phases in total. The mean error of the GPS surveying method is deemed as 4 mm in this research [
44]. Based on the x-coordinates and the GPS monitoring data of the observation points, the subsidence curves can be drawn to show the subsidence process along the x-axis, as shown in
Figure 4.
Figure 4 shows the subsidence process of the observation points along the strike section of the workface. The subsidence values increase with the workface propulsion for all points. As for the subsidence curves, they go through a falling and rising process for all the phases, especially in the later phases. Of course, there are some outliers, whose values do not obey the common rules of change. For example, there are a few points whose subsidence values are apparently higher than 0 mm. Moreover, some curves present evident undulations along the strike section, such as the curves on 6 July 2018 and 24 October 2018.
The influence radius (
r) can be computed using the following equation [
45]:
In Equation (1), H refers to the mining depth, and β refers to the influence angle.
Through computation, the influence radius is shown to be about 216.667 m in the workface. Meanwhile, the maximum subsidence values are predicted for the observation points using the probability integral method [
46].
Based on the influence radius, the observed and predicted subsidence values and the x-coordinates of the observation points, the typical observation points were selected along the strike section of the workface. They are A21, A24, A28, A29 and A36; the observed and predicted maximum subsidence values are shown in
Table 4.
Table 4 shows that the observed and predicted subsidence values are approximate for some points, such as A28; meanwhile, others have evident deviation, such as A21 and A24. The predicted maximum subsidence values increase as the x-coordinates increase, which is not suitable for the observed maximum subsidence values.
The reasons why the five typical points were selected are as follows. The x-coordinate of A21 is approximate to 0 m, which, in theory, refers to the open–off cut position. The x-coordinate of A24 is 100.574 m, about half of the influence radius, so it has the highest horizontal deformation and curvature values. The X coordinate of A28 is 219.384 m, which is approximate to the influence radius. A29 has the highest observed subsidence value among all the points. As for A36, it has the highest predicted subsidence value of all the points.
3.2. Mountainous Time Function Method
The mountainous time function method used in this research applies the time function method in conjunction with the mountainous conditions. It is the time function method applied to mountainous regions.
The time function method was presented by Knothe in 1953 [
7,
11]. Its equations are as follows:
In Equations (2)–(5),
W(
t) is the subsidence value corresponding to the time
t;
W0 is the final subsidence value;
φ(
t) is the time function;
m is the mining height;
q is the subsidence coefficient;
α is the dig angle of the coal seam;
c is the time influence parameter;
v is the advance rate, which can be calculated with
Table 2;
L1 is the critical mining dimension, which is 100 m in this research.
The above time function method (Equations (2)–(5)) is suitable for flat topography. If it is used in a mountainous region, the subsidence value at coordinate
x can be transformed as the following equations [
23]:
Some parameters of Equations (6) and (7) are explained above, such as r. As for the other parameters, W′(x) is the subsidence value at the observation point due to coal mining in a mountainous area; W(x) is the subsidence value in flat ground with same geological conditions; F(x) is used to simplify the Equation (6); Dx is the ground characteristic coefficient, which is determined by the characteristics at the location of the observation points, such as the slope, curvature, land use, geological conditions and so on; Wm is the maximum subsidence value; α is the inclination angle of the ground trend surface. A, P and t are slip influence parameters, whose values are determined to be 2π, 2 and π, respectively, in this research.
As shown in
Figure 3, there are 19 observation phases in this research. Therefore, the working face can be divided into 18 elements. The propulsion status, including the propulsion distance and the advance rate, are computed in
Table 5.
In
Table 5, the propulsion distance is the mining length between the current phase and the last phase, and the advance rate is the propulsion distance divided by the days during the two phases. From
Table 5, we can see that the propulsion distance varies significantly at different phases. For example, it is 237.85 m on 15 April 2020, while 13.47 m on 6 July 2018. Meanwhile, the advance rate is also different at different phases.
For the 18 elements of the workface, the maximum progressive subsidence can be computed for the moment
t using the following equation [
7,
23]:
Using the superposition principle, the progressive subsidence of the observation points along the strike section with
x coordinate at time
t can be computed using the following equations:
where
Based on Equations (9) and (10), the subsidence process can be predicted at any time by using the parameters in the workface. This is the mountainous time function method.
3.3. EnKF Method
The mountainous time function method can predict the subsidence process of the observation points along the strike section at any time. Meanwhile, the subsidence values at the locations of the observation points are measured using the GPS method in 19 phases. To improve the prediction accuracy, it is critical to assimilate the GPS observation data into the prediction process.
As a sequential assimilation method, the Kalman filter method is widely used in acquiring the least square estimation of a system state [
47]. The covariance matrix of this method is updated using a linear model, so its use is limited to linear models with small scales. To solve this problem, the EnKF method is presented based on the random forecast theory [
28,
38]. The EnKF method estimates the statistical moment of the system state through constructing samples, so as to assimilate the data with different sources. Hence, the EnkF method is widely used in many fields with large scale and non-linear problems, such as weather forecasting, ocean dynamics, hydrology, and so on [
48,
49].
As a method of estimating analysis error covariance and background error covariance using the ensemble, the basic idea of the EnKF method is as follows [
50]: through initializing a set of system state samples as the background ensemble, the analysis ensemble can be achieved by updating each element of the background ensemble using observation information, so as to estimate the real mean and covariance values of the state. Then, the background ensemble at the next moment can be acquired by transferring the sample ensemble through the system model. The specific computation process of the EnKF method in this research is as follows:
Firstly, the state and measuring equations are listed as follows:
In Equations (11) and (12), Xk and Xk−1 are the state variables at times k and k−1, respectively; A and B are the parameter matrix of the system state; Uk is the controlling value of the system; Zk is the measuring vector; H is the parameter matrix of the measuring equation, which is a unit matrix in this research; Wk and Vk are the system error vector and measuring error vector, whose covariances are determined as Q (25 mm2) and R (16 mm2), respectively, in this research.
The initial state ensemble can be acquired by adding stochastic disturbance to the initial state variables:
In Equation (13), ui is the background error, which is subject to the normal distribution with a mean of 0 and a variance of R.
Then, the state variable prediction is conducted based on the mountainous time function method, which is derived from Equation (9) and shown as the following:
The state variable can be predicted using Equation (14). In this equation, W(x, tk) is the predicted ensemble at the moment tk acquired from the analyzed subsidence value at moment tk−1; W(x, tk−1)′ is the analyzed ensemble at the moment tk−1, computed from the observation information at moment tk−1; ωi is the model error, which is subject to the normal distribution with a mean of 0 and a variance of Q.
The ensemble error covariance (
Pe) is computed using the following equation:
In Equation (15), T is the transpose symbol; A′ = A − is the ensemble disturbance; is the mean value of the ensemble; N is the number of ensemble elements.
The gain matrix
K can be acquired using the following equation:
The state variable can be updated using the following equation:
In Equation (17), W(x, tk)′ is the analyzed ensemble at moment tk computed from the observation information at moment tk; Dk is the observation information at moment tk.
Finally, the ensemble error covariance is updated using the following equation:
In Equation (18), Pa is the updated ensemble error covariance; Pe is the predicted ensemble error covariance; Re is the observed error covariance.
Overall, the larger the number of the set is, the better the assimilation quality is. Referencing the relevant literature, the number of the set was determined as 100 in this research [
48,
49,
50].
Through the iteration computation process from Equations (13)–(18), the ground subsidence of the observation points can be predicted using the EnKF assimilation method. It is executed using Matlab R2014b software in this research.
3.4. Error Indexes Selection
Based on the relevant research [
10,
26,
51], four error indexes were selected to quantify the prediction quality. They are the mean error (
ME), the mean absolute error (
MAE), the root mean square error (
RMSE) and the mean absolute percentage error (
MAPE). The equations for these error indexes are as follows:
In Equations (19)–(22), n is the total number of the values, which is 19 in this research, corresponding to 19 phases of the typical observation points; x is the prediction value acquired by both the mountainous time function and the EnKF methods; y is the real value, which is regarded as the assimilated value acquired by the EnKF method.