Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints

Zhang, Can; Ma, Liang; Liu, Wenying

doi:10.3390/min15020194

Open AccessEditor’s ChoiceArticle

Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints

by

Can Zhang

¹

,

Liang Ma

^2,* and

Wenying Liu

^1,*

¹

Department of Materials Engineering, University of British Columbia, 309-6350 Stores Road, Vancouver, BC V6T 1Z4, Canada

²

Clean Energy Innovation Research Centre, National Research Council Canada, 4250 Wesbrook Mall, Vancouver, BC V6T 1W5, Canada

^*

Authors to whom correspondence should be addressed.

Minerals 2025, 15(2), 194; https://doi.org/10.3390/min15020194

Submission received: 18 December 2024 / Revised: 3 February 2025 / Accepted: 11 February 2025 / Published: 19 February 2025

(This article belongs to the Section Environmental Mineralogy and Biogeochemistry)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Mining activities generate substantial amounts of waste rock, which are often disposed of in waste rock piles. Drainage from these piles can pose serious environmental risks. It is crucial to reliably predict drainage properties in order to effectively manage them. In previous work, we developed a machine learning model to predict waste rock drainage quantity using weather monitoring data as the input and drainage flow rate as the output. However, this model lacked physical constraints, limiting its interpretability, reliability, and applicability. In this study, we introduced a new machine learning model designed with physical constraints to improve the predictions of drainage quantity. This new model incorporates a weather refining sub-model and integrates physical constraints to enhance the overall reliability of the model predictions. The weather refining sub-model transforms primary weather features (total precipitation and temperature) into secondary features (rainfall, snowmelt, and evaporation) through established mathematical relationships. These secondary features were then used as inputs for the machine learning model to predict drainage quantity. To embed physical principles within the machine learning model, we integrated a water balance equation into the neural network architecture and modified the loss function accordingly. In addition, we included an adjustable bias term to optimize the balance between model performance and interpretability. Compared with our previous model, the incorporation of physical constraints into the machine learning model improved the accuracy of the drainage quantity predictions. More importantly, this approach ensures that the model outputs adhere to physical laws, thereby enhancing its interpretability, reliability, and applicability.

Keywords:

drainage flow model; hybrid machine learning; water balance; loss function modifications

1. Introduction

Large amounts of waste rocks with diverse sizes and compositions are generated by mining activities and disposed of in waste rock piles. The drainage from these piles, which often contain acid, elevated concentrations of sulfate, and heavy metal(loid)s, can pose serious environmental risks and must be properly managed. Understanding the hydrology of waste rock piles is a crucial component of effective mine waste management [1]. Numerous studies have been conducted to characterize the hydrologic properties of these piles [2,3,4,5]. The highly heterogeneous nature of waste rock piles results in the formation of vastly different flow paths, causing the infiltration rate to vary significantly, ranging from a few hours for preferential flow to several years for matrix flow to fully penetrate a waste rock pile [5,6,7,8]. Consequently, drainage flow is influenced not only by the present surface infiltration rate but also by those from previous periods.

Significant efforts have been made to develop models for predicting drainage quantity, which can generally be classified into three categories. The first category includes the basic water balance models that correlate drainage quantity with precipitation and surface infiltration/run-off on a monthly or annual basis [9]. While useful, these models only provide rough estimates of drainage quantity based on weather monitoring data. The second category consists of the models that incorporate fundamental principles, such as mass conservation, and constitutive equations, such as the Richards equation, to describe water movement in unsaturated porous media [10,11]. These models offer a more detailed and accurate prediction of the dynamic processes within waste rock piles [12]. Reactive transport models have also been developed to simulate drainage flow within heterogeneous waste rock piles [13,14,15]. However, it is often challenging for these models to fully capture the complex interactions among various factors influencing drainage flow, even though different methods such as the probability density functions of physical and geochemical properties have been applied to simulate different degrees of heterogeneities [16]. In general, the application of these models requires comprehensive characterizations of each geologic unit within a waste rock pile to reliably predict drainage quantity. These characterizations can be prohibitively expensive or impossible to obtain.

Emerging machine learning models fall into the third category. Most of these models correlate drainage hydrological properties with directly monitored weather characteristics, such as temperature and precipitation [17,18,19]. Given that the relationship between weather characteristics and drainage quantity is often complex, these models typically use deep learning algorithms. However, training these models requires a large volume of data, which is often unavailable, to ensure that these models do not violate certain physical phenomena. To reduce the amount of data required for training, prior knowledge based on the first principles has to be introduced into a model to improve the performance of the learning algorithms [20]. A common method for introducing physics into neural networks is by adding regularization terms derived from partial differential equations into the loss function [21]. However, the use of partial differential equations requires characterization data that are often difficult to obtain in the field, which can undermine the effectiveness of these physics-informed neural networks. For example, when introducing the Richards equation into neural networks, the data required on soil water retention curves and unsaturated hydraulic conductivities are typically difficult to access, especially for waste rock piles with a high degree of heterogeneity.

In this research, we developed a machine learning model with physical constraints to predict drainage quantity. We applied a weather refining sub-model to first transform the directly monitored primary weather features into secondary ones, which were then used as inputs for the machine learning model. The novelty of this machine learning model lies in the integration of the water balance equation and the inclusion of the physics-based constraints derived from partial derivatives within the loss function. This approach ensures that each term in the water balance equation adheres to fundamental physical laws. The model was then trained and tested with data from three monitoring stations.

2. Methodology

The methodology developed in this study for simulating drainage quantity consists of two sequential components: a non-machine learning weather refining sub-model and a machine learning drainage quantity model with physical constraints, as illustrated in Figure 1. The weather refining sub-model uses primary weather features, such as daily temperature and precipitation, as inputs. The primary weather features are directly measured by weather monitoring stations in the vicinity of waste rock piles. This sub-model transformed the primary weather features into secondary ones using established mathematical relationships. These secondary weather features were then used as inputs for the machine learning drainage quantity model (hereafter referred to as the “drainage quantity model”). The target (output) of the drainage quantity model is the drainage flow rate. Since drainage quantity is directly influenced by rainfall, snowmelt, and evaporation, transforming primary weather features into secondary features better reflects the underlying relationships between weather conditions and drainage. This transformation improves the physical interpretability of the model and simplifies the integration of physical constraints into the model. The weather monitoring stations and the waste rock piles from which drainage flow rate data were collected for model training are described in a previous article [19].

2.1. Weather Refining Sub-Model

The weather refining sub-model transforms primary weather features into secondary ones, which are then used as inputs for the drainage quantity model. To enhance data reliability, primary weather features from multiple weather monitoring stations near the waste rock piles were combined in the analysis. The missing value segments of varying lengths were addressed using appropriate interpolation methods to ensure a continuous dataset. After preprocessing, the data were used as inputs for the weather refining sub-model, which generated the secondary weather features through the following steps: (1) temperature adjustment; (2) precipitation differentiation; (3) snowmelt simulation; and (4) evaporation estimation.

Temperature adjustment: Although the weather monitoring stations are located near the waste rock piles, the elevation difference may introduce discrepancies between the recorded weather data and the actual conditions on the piles. To address this, the lapse rate, which describes the rate of temperature decrease with the increasing altitude, was used to adjust the recorded temperature data to reflect the actual ambient temperature on the piles [22]. A constant lapse rate was assumed for each waste rock pile. The calculation of the lapse rate was based on the values reported in the literature for regions at similar latitudes, as well as the observed snow cover disappearance dates recorded at the mine site [23].

Precipitation differentiation: The two forms of precipitation, rainfall and snowfall, have distinct impacts on water infiltration into waste rock piles, making it essential to differentiate between them. While weather stations provide precise precipitation data, elevation differences and the resulting temperature variations can influence the actual form of precipitation on the waste rock piles. For this study, it was assumed that the total precipitation amounts on the waste rock piles matched that recorded by the weather stations. However, the form of precipitation was determined based on the surface temperature, measured two meters above ground. Using the adjusted surface air temperature derived in the previous step, the precipitation was classified as snowfall below a threshold surface temperature and as rainfall above this threshold surface temperature. As suggested by the literature, the threshold temperature could vary between 0 and 1.7 °C. In this study, considering the limited snowfall recorded at the mine site, this threshold temperature was fixed at 1.1 °C [24]. The recategorized precipitation data provided a more accurate representation of the rainfall and snowfall distributions on the waste rock piles.

Snowmelt simulation: Rainfall contributes directly to net infiltration into waste rock piles, whereas snowfall undergoes a more complex process involving accumulation and melting. To capture these processes, a snowmelt model was developed to transform past snowfall into daily snowmelt data. The model calculates the theoretical maximum daily snowmelt using Equation (1), under the assumption of sufficient snow accumulation. The actual snowmelt for a given day is then determined as follows: if the snow cover for that day is not fully depleted, the actual snowmelt equals the theoretical maximum; or if the snow cover for that day is depleted, the actual snowmelt equals the remaining snow from the previous day.

The degree-day factor method, adapted from LISFLOOD (April 2020) (a hydrological rainfall-runoff model), was applied to calculate the theoretical maximum daily snowmelt, as shown in Equation (1) [25]. This model accounts for the contribution of rainfall to the snowmelt process.

M = C_{m} (t) \cdot (1 + 0.01 \cdot R \cdot ∆ t) (T_{a i r} - T_{m e l t}) \cdot ∆ t, when T_{a i r} > T_{m e l t}

(1)

In Equation (1), M is the amount of snowmelt in mm; C_m (t) is the effective degree-day factor in mm/(°C‧day), which varies over time; R is the amount of rainfall (mm) during the time period Δt; and T_air and T_melt are the average air temperature (°C) and the snowmelt temperature over the time period Δt, respectively. Theoretically, Δt can be any time duration, and in this research, it was set to one day. The amount of snowmelt M was assumed to be zero when T_air ≤ T_melt, meaning no snowmelt occurs if the air temperature is lower than or equal to the snowmelt threshold. Additionally, it is assumed that the degree-day factor C_m (t) follows a sinusoidal function with a period of one year. Its peak occurs at the summer solstice and its trough occurs at the winter solstice, reflecting seasonal variations in the snowmelt process.

Evaporation estimation: The daily evaporation rate in this research was estimated using a simplified version of the standard Penman equation proposed by [26]:

E \approx 0.047 R_{S} \sqrt{T + 9.5} - 2.4 {(\frac{R_{S}}{R_{A}})}^{2} + 0.09 (T + 20) (1 - \frac{R H}{100}), for T + 9.5 > 0

(2)

where E is the daily evaporation rate in mm/day; R_S is the solar radiation reaching the ground surface; R_A is the extraterrestrial solar radiation in MJ/(mm‧day); T is the daily mean temperature in °C; and RH is the relative humidity (%). If T + 9.5 ≤ 0, the daily evaporation rate E is considered zero. The solar radiation reaching the ground surface (R_S) was estimated by downscaling the observed historical solar radiation data using the ClimateNA software package [27]. The extraterrestrial solar radiation (R_A) was calculated using theoretical formulas based on the Julian date and the latitude of the waste rock pile [26].

2.2. Machine Learning Drainage Quantity Model

2.2.1. Conceptual Model

The machine learning drainage quantity model developed in this research was built using TensorFlow version 2.3.0 in Python version 3.8.19. The model was developed based on the water balance principle for a waste rock pile, where the drainage quantity at any given time is determined by the contributions of rainfall, snowmelt, and evaporation. The schematic design of the model is illustrated in Figure 2. Before being used as inputs for the drainage quantity model, the secondary weather features—rainfall, snowmelt, and evaporation—generated by the weather refining sub-model were preprocessed using the rolling average method. This method calculates average values over varying time windows. Previous studies have shown that the use of this method smooths out high-frequency fluctuations and allows the model to use a broader range of measurements without increasing the number of time steps [18,19,28]. Additionally, waste rock volume was included as a feature due to its influence on water percolation through the waste rock pile.

Through their respective neural networks (NN_Rain, NN_Snow, NN_Evap, NN_WRV, and NN_Bias), explained in detail in the following section, the inputs were transformed into the effective contributions of rainfall (

{\hat{R}}_{e f f}

), snowmelt (

{\hat{S}}_{e f f}

), evaporation (

{\hat{E}}_{e f f}

), and a bias term (

\hat{B i a s}

). Specifically,

{\hat{R}}_{e f f}

,

{\hat{S}}_{e f f}

, and

{\hat{E}}_{e f f}

represent the contributions to the drainage flow rate from rainfall, snowmelt, and evaporation, respectively. Drainage quantity was then calculated as the sum of the effective contributions of rainfall and snowmelt, minus the effective contribution of evaporation, with the addition of the bias term.

2.2.2. Neural Network Architecture Design

As outlined in the previous section, NN_Rain, NN_Snow, NN_Evap, NN_WRV, and NN_Bias represent the neural networks applied in the drainage quantity model. They were developed using five Gated Recurrent Units (GRUs), a type of modern recurrent neural networks [29]. GRUs were chosen because they are simpler and have fewer parameters to determine compared to the long short-term memory (LSTM) method. This makes them particularly well suited for handling smaller time-series datasets, such as the waste rock drainage data used in this study. Any of the neutral networks used could theoretically be replaced by other neural networks, such as LSTM cells or fully connected neural networks (FCNNs) with an extra flatten layer.

The inputs for the neutral networks were arranged as a three-dimensional tensor with the shape of (N, D, M), where N is the number of samples, D is the number of time steps, and M is the number of features. NN_Rain, NN_Snow, and NN_Evap are designed to use only their respective weather feature sequences as inputs, while NN_WRV uses waste rock volume as its input (Equations (3)–(6)). The processes by which these neural networks transform inputs to outputs are represented as the following functions:

f_{{N N}_{R a i n}}

,

f_{{N N}_{S n o w}}

,

f_{{N N}_{E v a p}}

, and

f_{{N N}_{W R V}}

. The outputs of these neutral networks are denoted as y_rain, y_snow, y_evap, and

{\vec{y}}_{W R V}

, respectively. It is important to note that y_rain, y_snow, and y_evap are scalar values, whereas

{\vec{y}}_{W R V}

is a three-dimensional vector. The components of

{\vec{y}}_{W R V}

serve as scale factors that adjust y_rain, y_snow, and y_evap to derive the effective contributions of rainfall (

{\hat{R}}_{e f f}

), snowmelt (

{\hat{S}}_{e f f}

), and evaporation (

{\hat{E}}_{e f f}

) to drainage quantity, as shown in Equations (7)–(9). These adjustments were specifically designed to account for the influence of waste rock volume on water percolation through the waste rock pile.

y_{r a i n} = f_{{N N}_{R a i n}} (R_{t}, R_{t - 1}, \dots, R_{t - D + 1})

(3)

y_{s n o w} = f_{{N N}_{S n o w}} (S_{t}, S_{t - 1}, \dots, S_{t - D + 1})

(4)

y_{e v a p} = f_{{N N}_{E v a p}} (E_{t}, E_{t - 1}, \dots, E_{t - D + 1})

(5)

{\vec{y}}_{W R V} = {(y_{W R V, 1}, y_{W R V, 2}, y_{W R V, 3})}^{T} = f_{{N N}_{W R V}} (V_{t}, V_{t - 1}, \dots, V_{t - D + 1})

(6)

\hat{R_{e f f}} = y_{W R V, 1} \cdot y_{r a i n}

(7)

\hat{S_{e f f}} = y_{W R V, 2} \cdot y_{s n o w}

(8)

\hat{E_{e f f}} = y_{W R V, 3} \cdot y_{e v a p}

(9)

The NN_bias, denoted by the function

f_{{N N}_{B i a s}}

, was designed to predict the bias term (

\hat{B i a s}

), as shown in Equation (10). The inputs for NN_Bias include all the weather features as well as the waste rock volume.

\hat{B i a s} = f_{{N N}_{B i a s}} (R_{t}, R_{t - 1}, \dots, R_{t - D + 1}, S_{t}, S_{t - 1}, \dots, S_{t - D + 1}, E_{t}, E_{t - 1}, \dots, E_{t - D + 1}, V_{t}, V_{t - 1}, \dots, V_{t - D + 1})

(10)

Eventually, the predicted mine waste rock drainage quantity (

\hat{Q}

) is calculated by Equation (11):

\hat{Q} = {\hat{R}}_{e f f} + {\hat{S}}_{e f f} - {\hat{E}}_{e f f} + \hat{B i a s}

(11)

To ensure meaningful physical interpretations, y_rain, y_snow, y_evap, and the three components of the vector

{\vec{y}}_{W R V}

should all be non-negative. To achieve this, LeakyReLU was chosen as the activation function for the output layers of NN_Rain, NN_Snow, NN_Evap, and NN_WRV. Although LeakyReLU may produce negative values, these values are typically very close to zero and can be treated as zero for water balance calculations. In contrast, the activation function for the output layer of NN_Bias was set to be linear, because the bias term can have both positive and negative values. This design ensures that each output is appropriately constrained in the model.

2.2.3. Loss Function Design

The loss function is a crucial component in machine learning, which quantifies the difference between the predicted outputs of a machine learning algorithm and the actual target values. In this study, apart from the careful choice of activation functions mentioned above, further physical constraints were embedded in the loss function to ensure that the correlations between the input and the output of the neural networks—NN_Rain, NN_Snow, and NN_Evap—were non-negative. The model training process involves minimizing the loss function by adjusting the internal parameters of the neutral networks represented by the matrix θ. The optimal θ, denoted as θ_best, is the value at which the loss function is minimized, as shown in Equation (12).

θ_{b e s t} = \arg \min_{θ} L o s s (X, Q, \hat{Q}, {\hat{R}}_{e f f}, {\hat{S}}_{e f f}, {\hat{E}}_{e f f}, \hat{B i a s})

(12)

where Loss represents the loss function, X represents the assembly of all inputs, and Q represents the observed drainage flow rate.

The loss function used in this study was designed to encompass three terms, defined as an empirical loss, a physical loss, and an extra regularization loss, as shown in Equation (13):

{L o s s = L_{e m p i r i c a l} + L}_{p h y s i c a l} + L_{r e g u l a r i z a t i o n}

(13)

The empirical loss was computed using the mean square error (MSE), which is the typical method of measuring model performance in supervised machine learning. Specifically, this term measures the difference between the model predicted and the observed drainage flow rates, as shown in Equation (14).

L_{e m p i r i c a l} = \frac{1}{N^{’}} \sum_{i = 1}^{N^{’}} {(Q_{i} - {\hat{Q}}_{i})}^{2}

(14)

where

N ’

is the batch size, and

Q_{i}

and

{\hat{Q}}_{i}

are the observed and predicted drainage flow rate, respectively.

To enforce non-negative correlations between the output and the input of the neural networks, a physical constraint was incorporated into the model by introducing a physical loss term into the loss function. This term penalizes negative correlations by increasing its contribution to the loss in proportion to the magnitude of the negative correlation. The design of the physical loss term is shown in Equation (15):

L_{p h y s i c a l} = λ_{p} \cdot \frac{1}{N^{’}} \sum_{i = 1}^{N^{’}} \frac{1}{D} \sum_{j = 1}^{D} [- \min (0, \frac{\partial {\hat{R}}_{e f f, i}}{\partial R_{i j}}) - \min (0, \frac{\partial {\hat{S}}_{e f f, i}}{\partial S_{i j}}) - \min (0, \frac{\partial {\hat{E}}_{e f f, i}}{\partial E_{i j}})]

(15)

where

R_{i j}

,

S_{i j}

, and

E_{i j}

are the rainfall, snowmelt, and evaporation features for sample i and time step j; the scale factor λ_p is a positive hyperparameter that ensures a physical loss value (L_physical) comparable in magnitude to the empirical loss value (L_empirical). Like the other hyperparameters, the value of λ_p needs to be fine-tuned. If λ_p is set too low, the penalty for negative correlations becomes negligible. Conversely, if λ_p is set too high, it can dominate the loss function, hindering the reduction of empirical loss during training and ultimately compromising model performance.

Finally, an extra regularization loss term was introduced into the loss function to minimize the reliance of the model output on the bias term. The bias term (

\hat{B i a s}

) predicted by NN_Bias serves two purposes: firstly, it captures the interactions among weather features; and secondly, it balances performance and interpretability of the model. Generally, a higher bias term improves performance but reduces interpretability. The model training process aims to minimize the reliance of the model predictions on the bias term, thereby maximizing the reliance on the effective contributions of the three weather futures. The bias term only intervenes when the effective contributions of weather futures fail to satisfactorily predict the drainage quantity. This extra regularization loss term, shown in Equation (16), was introduced to achieve this.

L_{r e g u l a r i z a t i o n} = λ_{r} \cdot \frac{1}{N^{’}} \sum_{i = 1}^{N^{’}} {\hat{B i a s}}_{i}^{2}

(16)

The scale factor λ_r is a positive hyperparameter controlling the contribution of the bias term to the model output. The contribution of the bias term can be manually adjusted or automatically adjusted by the model itself during the training process. A larger λ_r reduces the magnitude of the bias term, meaning that model performance is sacrificed for a better model interpretability, and vice versa. During training, if the overall model performance is satisfactory, that is, the values of the empirical loss (L_empirical) and the physical loss (L_physical) are relatively small, the optimizer of the model reduces the regularization loss (L_{regularization}) to achieve a smaller overall Loss, resulting in a smaller absolute value of the bias term. On the other hand, if the model performs poorly during training, the optimizer increases the L_{regularization} for a lower overall Loss, resulting in a higher absolute value of the bias term.

Furthermore, all five neutral networks were trained simultaneously, making them learn to collaborate during training. Besides predicting drainage quantity, the neural networks can also discern the contributions of various weather features to drainage quantity.

2.2.4. Model Tuning

For the three drainage monitoring stations studied, the dataset available was divided into a training and test set. The data from the last year of monitoring were chosen as the test set and the remaining data were used as the training set. The test set was completely isolated from the training and validation processes. The training set was randomly and evenly divided into five folds for cross-validation to determine the best hyperparameter combination. Specifically, a Bayesian optimization algorithm was adopted to more efficiently search for the best combinations of hyperparameters [30]. The hyperparameters were searched simultaneously during cross-validation. The best combinations of hyperparameters for each drainage monitoring station are shown in Table 1. The Adam optimizer was chosen as the optimization algorithm for model training [31]. During model training, the batch size was fixed at four and did not participate in the model tuning process. A new model was built with the best hyperparameter combination and trained on all five folds to take full advantage of the monitoring data. Finally, the model was asked to give predictions in the test set for evaluation.

3. Results and Discussion

3.1. Model Training and Testing

The drainage quantity model was applied to three waste rock drainage monitoring stations, using the input data collected from nearby weather monitoring stations. Table 2 provides details on the distances and elevation differences between the three waste rock piles and their respective weather monitoring stations.

Figure 3 compares the predicted and observed (measured) drainage flow rates during both model training and testing. The results indicate that the drainage quantity model generally fits the observed drainage flow rates well. However, the model fails to capture some peak flow events due to a lack of training data for such events. These peak flows, which typically last from a few hours to a day, posed a challenge for obtaining sufficient site measurement data for training the model. The trained model was successfully applied to predict the drainage flow rates using available weather monitoring data. The predicted drainage flow trend closely follows the historical measurements, suggesting that the model outputs are stable and reliable.

The performance of the drainage quantity model was quantitatively evaluated by two metrics: the root mean square error (RMSE) and Nash–Sutcliffe efficiency (NSE), as shown in Table 3. The RMSE measures the average difference between the predicted and observed drainage flow rates, with lower RMSE values indicating better model performance. Compared with our previously published model [19], the RMSE of the present model is reduced by 38% for Station 1, 44% for Station 2, and 9% for Station 3 in their respective test sets. However, the RMSE alone cannot be used to compare model performance across stations, as it shares the same unit as the flow rate and varies with the average flow rate across stations. Therefore, the NSE was also calculated to evaluate model performance across the different stations. An NSE value closer to one indicates better model performance. Based on the NSE values of the test sets, the model performed equally well for Stations 1 and 2, and slightly worse for Station 3.

The bias term (

\hat{B i a s}

) was automatically adjusted during the training process to compensate for discrepancies between the predicted and the observed drainage flow rates. Equation (17) was used to calculate the contribution of the bias term to the final prediction. The monthly contributions of the bias term to the drainage quantity predictions are shown in Figure 4. On average, over the 12 months, the contribution of the bias term is 9% for Station 1, 31% for Station 2, and 18% for Station 3. The contributions of the bias term to the model output were considered acceptable, indicating that the drainage quantity predictions rely primarily on the effective contributions of the three weather features.

Bias Contribution = \frac{|\hat{B i a s}|}{|\hat{R_{e f f}}| + |\hat{S_{e f f}}| + |\hat{E_{e f f}}| + |\hat{B i a s}|} \times 100 %

(17)

3.2. Sensitivity Tests

The trained model is designed to provide physically meaningful predictions, even when subjected to weather conditions that were previously unseen during training. To assess this, two sets of sensitivity tests were conducted on the test sets: one involving increasing the temperature (elevated temperature) and the other involving increasing the total precipitation (elevated precipitation). These elevated primary weather features were used as inputs for the weather refining model to generate new sets of secondary weather features. These secondary features were then used as inputs for the trained drainage quantity model to predict drainage flow rates for the three drainage monitoring stations. The outputs of the model under these two scenarios were compared with the original predictions for each station. The results are shown in Figure 5.

When the temperature is elevated, peak flow events occur earlier, which can be attributed to an earlier onset of snowmelt. As a result of the early snowpack depletion and increased evaporation at higher temperatures, the predicted flow rates in the summer are lower than those in the original predictions. For the elevated precipitation scenario, a greater amount of snow accumulates on top of the waste rock piles compared to the original estimation. As a result, the predicted flow rates are higher from June onward, due to a larger snowpack and a slower depletion of the snowpack than in the original predictions. The sensitivity test results demonstrate that the drainage model produces physically meaningful predictions when weather conditions are altered.

3.3. Verification of the Non-Negative Correlations by Monotonicity Test

After simultaneously training all five neural networks, NN_Rain, NN_Snow, and NN_Evap were isolated for monotonicity tests to verify that the physical loss term (L_physical) described in Equation (15) successfully enforced non-negative correlations between the inputs (rainfall, snowmelt, and evaporation) and the outputs (the effective contributions of rainfall, snowmelt, and evaporation). The monotonicity tests were conducted with two drainage quantity models: one was trained without the physical loss term, and the other was trained with it. The test without the physical loss term was achieved by setting λ_p to zero in Equation (15). To perform the monotonicity tests, the input for each neutral network was manually increased, referred to as the “elevated input”. The resulting output was then standardized to obtain its z-score, which was compared with the z-score of the output with the original input. The results of the monotonicity tests are presented in Figure 6. To ensure consistency, the mean and standard deviation used for standardization were calculated from the output corresponding to the original input of each neural network.

The neural networks trained without the physical loss term exhibit some negative correlations, as some data points fall below the diagonal line, which is undesirable. In contrast, the neural networks trained with the physical loss term show a satisfactory non-negative monotonicity, with all data points positioned along or above the diagonal line. The results highlight the importance of including the physical loss term in the loss function to maintain non-negative correlations.

3.4. The Impact of Current Weather Conditions on Future Drainage Quantities

The model outputs presented above use weather data up to the present day to predict the drainage quantity for the present day. It is practically valuable to evaluate how the accuracy of the model changes when using past and current weather data to predict future drainage quantities. To investigate this, the concept of ‘lag days’ was introduced to represent the time difference (in days) between the most recent weather input data and the predicted drainage quantity. For example, one lag day means that the model uses weather data up to the present day to predict the drainage flow rate for the following day. By this definition, the model outputs presented above were generated on lag day zero, as the model uses weather data up to the present day to predict the drainage flow rate for the present day.

A new set of models was trained using the same methodology but with varying lag days, and their performance was evaluated against the test set. The results are shown in Figure 7. As expected, the performance of the model, as measured by the NSE values shown in Figure 7, decreases as the number of lag days increases. However, the rate of performance deterioration varies across the three stations. As the number of lag days increases, the performance of the model at Station 1 deteriorates the slowest, while the performance deteriorates the fastest at Station 2. At Station 3, the model’s performance declines gradually for the first three lag days, followed by a more rapid deterioration thereafter.

These tests enhanced our understanding of which aspects of the drainage flow rate are most influenced by recent rainfall and/or snowmelt events. For example, as shown in Figure 8, a peak flow observed at the end of June was accurately predicted by the original model with zero lag days. However, when the lag days were increased to three, the model completely missed this peak. This suggests that a rainfall or snowmelt event occurring within the three days directly contributed to the peak flow, and the model with a three-day lag failed to predict it due to the absence of critical recent data.

4. Conclusions

This study introduces a machine learning model with embedded physical constraints to predict mine waste rock drainage quantities. Compared with our previously published model, the novelty of this model is that it incorporates physical constraints to ensure that the model outputs obey the established physical laws. Weather monitoring data and drainage flow rate measurements from three monitoring stations were used for model training and testing.

The model comprises two main components: a weather refining sub-model and the machine learning drainage quantity model. First, the weather refining sub-model transforms the primary weather features directly measured by monitoring stations—temperature and precipitation—to the secondary ones, including rainfall, snowmelt, and evaporation. These transformations are based on well-established mathematical relationships. The machine learning model uses these secondary weather features and the waste rock volume as inputs. Through their respective neural networks, these inputs were transformed to the effective contributions of rainfall, snowmelt, and evaporation to drainage quantity, with an extra bias term. The machine learning model incorporates physical constraints by embedding a water balance equation and including a physical loss term in the loss function. The water balance equation and the regularization loss term ensure that the drainage quantity is mainly determined by the effective contributions of rainfall, snowmelt, and evaporation. The physical loss term enforces the non-negative correlations between the inputs and outputs of the three neural networks associated with the weather features. Additionally, the inclusion of a bias term achieves a balance between model performance and interpretability.

The model was successfully trained and validated using the provided datasets. The results from the monotonicity test confirmed that the inclusion of the physical loss term in the loss function successfully enforced non-negative correlations between the inputs and outputs of the three neural networks associated with the weather features. The sensitivity tests show that the model generates physically meaningful predictions, even under previously unseen weather conditions. The incorporation of physical constraints into the model enhances its interpretability, reliability, and applicability when compared with our previously published model. The drainage quantity model developed in this study will be a part of the subsequent development of a drainage quality model.

Author Contributions

Conceptualization, C.Z., L.M. and W.L.; methodology, C.Z., L.M. and W.L.; software, C.Z.; validation, L.M. and W.L.; formal analysis, W.L.; investigation, C.Z. and L.M.; resources, W.L.; data curation, C.Z.; writing—original draft preparation, C.Z.; writing—review and editing, L.M. and W.L.; visualization, C.Z.; supervision, L.M. and W.L.; project administration, W.L.; funding acquisition, L.M. and W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research is financially supported by the National Research Council Canada (NRC) Digital Health and Geospatial Analytics program (DHGA-117-1).

Data Availability Statement

In accordance with the policy, the code script for the model described in this paper cannot be open-sourced. However, we are committed to fostering academic collaboration and are willing to share the project codes for academic purposes. Interested parties are encouraged to contact the corresponding author for further details.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wolkersdorfer, C.; Nordstrom, D.K.; Beckie, R.D.; Cicerone, D.S.; Elliot, T.; Edraki, M.; Valente, T.; França, S.C.A.; Kumar, P.; Lucero, R.A.O.; et al. Guidance for the integrated use of hydrological, geochemical, and isotopic tools in mining operations. Mine Water Environ. 2020, 39, 204–228. [Google Scholar] [CrossRef]
Tremblay, G.A.; Hogan, C.M. (Eds.) Mine Environment Neutral Drainage (MEND) Manual 5.4.2d: Prevention and Control; Canada Centre for Mineral and Energy Technology: Ottawa, ON, Canada, 2001; Available online: http://mend-nedem.org/wp-content/uploads/5-4-2dVolume4_PreventionControlL.pdf (accessed on 17 August 2023).
Lefebvre, R.; Hockley, D.; Smolensky, J.; Geélinas, P. Multiphase Transfer Processes in Waste Rock Piles Producing Acid Mine Drainage: 1: Conceptual Model and System Characterization. J. Contam. Hydrol. 2001, 52, 137–164. [Google Scholar] [CrossRef]
Nichol, C.F. Transient Flow and Transport in Unsaturated Heterogeneous Media: Field Experiments in Mine Waste Rock; The University of British Columbia: Vancouver, BC, Canada, 2002. [Google Scholar]
Nichol, C.; Smith, L.; Beckie, R. Field-Scale Experiments of Unsaturated Flow and Solute Transport in a Heterogeneous Porous Medium. Water Resour. Res. 2005, 41, W05018. [Google Scholar] [CrossRef]
Brusseau, M.L.; Rao, P.S.C. Modeling Solute Transport in Structured Soils: A Review. Geoderma 1990, 46, 169–192. [Google Scholar] [CrossRef]
Flury, M.; Yates, M.V.; Jury, W.A. Numerical analysis of the effect of the lower boundary condition on solute transport in lysimeters. Soil Sci. Soc. Am. J. 1999, 63, 1493–1499. [Google Scholar] [CrossRef]
Smith, L.; López, D.; Beckie, R. Hydrogeology of Waste Rock Dumps; Victoria, B.C., Ed.; British Columbia Ministry of Energy, Mines and Petroleum Resources: Cranbrook, BC, Canada; CANMET: Ottawa, ON, Canada, 1995. [Google Scholar]
Gélinas, P.; Lefebvre, R.; Choquette, M. Monitoring of Acid Mine Drainage in a Waste Rock Dump. In Proceedings of the Secondary International Conference on Environmental Issue and Management of Waste in Energy and Mineral Production, Calgary, AB, Canada, 1–4 September 1992; Volume 6, pp. 747–757. [Google Scholar]
Ramasamy, M.; Power, C.; Mkandawire, M. Numerical Prediction of the Long-term Evolution of Acid Mine Drainage at a Waste Rock Pile Site Remediated with an HDPE-lined Cover System. J. Contam. Hydrol. 2018, 216, 10–26. [Google Scholar] [CrossRef]
King, M. Groundwater and contaminant transport modelling at the sydney tar ponds. In Proceedings of the 56th Annual Canadian Geotechnical Conference and 4th Joint IAH-CNC/CGS Groundwater Specialty Conference, Winnipeg, MB, Canada, 28 September–1 October 2003; pp. 10–26. [Google Scholar]
Hendrickx, J.M.H.; Flury, M. Uniform and Preferential Flow Mechanisms in the Vadose Zone. In Conceptual Models of Flow and Transport in the Fractured Vadose Zone; The National Academies Press: Washington, DC, USA, 2001; pp. 149–187. [Google Scholar]
Pedretti, D.; Mayer, K.U.; Beckie, R.D. Stochastic Multicomponent Reactive Transport Analysis of Low Quality Drainage Release from Waste Rock Piles: Controls of the Spatial Distribution of Acid Generating and Neutralizing Minerals. J. Contam. Hydrol. 2017, 201, 30–38. [Google Scholar] [CrossRef]
Pedretti, D.; Vriens, B.; Skierszkan, E.K.; Baják, P.; Mayer, K.U.; Beckie, R.D. Evaluating Dual-Domain Models For Upscaling Multicomponent Reactive Transport in Mine Waste Rock. J. Contam. Hydrol. 2022, 244, 103931. [Google Scholar] [CrossRef]
Pedretti, D.; Mayer, K.U.; Beckie, R.D. Controls of Uncertainty in Acid Rock Drainage Predictions from Waste Rock Piles Examined through Monte-Carlo Multicomponent Reactive Transport. Stoch. Environ. Res. Risk Assess. 2020, 34, 219–233. [Google Scholar] [CrossRef]
Smith, L.; Beckie, R. Hydrologic and Geochemical Transport Processes in Mine Waste Rock. In Environmental Aspects of Mine Wastes; Mineralogical Association of Canada: Haywood County, NC, USA, 2003; Volume 31. [Google Scholar]
Jiang, C.; Zhu, S.; Hu, H.; An, S.; Su, W.; Chen, X.; Li, C.; Zheng, L. Deep Learning Model Based on Big Data for Water Source Discrimination in an Underground Multiaquifer Coal Mine. Bull. Eng. Geol. Environ. 2022, 81, 26. [Google Scholar] [CrossRef]
Ma, L.; Huang, C.; Liu, Z.S.; Morin, K.A.; Aziz, M.; Meints, C. Artificial Neural Network for Prediction of Full-scale Seepage Flow Rate at the Equity Silver Mine. Water. Air. Soil Pollut. 2020, 231, 179. [Google Scholar] [CrossRef]
Zhang, C.; Ma, L.; Liu, W. A machine learning approach for prediction of the quantity of mine waste rock drainage in areas with spring freshet. Minerals 2023, 13, 376. [Google Scholar] [CrossRef]
Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-Informed Machine Learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
Tartakovsky, A.M.; Marrero, C.O.; Perdikaris, P.; Tartakovsky, G.D.; Barajas-Solano, D. Physics-informed deep neural networks for learning parameters and constitutive relationships in subsurface flow problems. Water Resour. Res. 2020, 56, e2019WR026731. [Google Scholar] [CrossRef]
Huschke, R.E. Glossary of meteorology. Weatherwise 1960, 13, 69–86. [Google Scholar] [CrossRef]
Kittel, C.; Kroemer, H.; Scott, H.L. Thermal Physics, 2nd ed. Am. J. Phys. 1998, 66, 164–167. [Google Scholar] [CrossRef]
Laramie, R.L.; Schaake, J.C. Simulation of the Continuous Snowmelt Process; Cambridge Ralph, M., Ed.; Parsons Laboratory for Water Resources and Hydrodynamics, Massachusetts Institute of Technology: Cambridge, MA, USA, 1972; Available online: https://hdl.handle.net/1721.1/142971 (accessed on 3 July 2024).
van der Knijff, J.M.; Younis, J.; de Roo, A.P.J. LISFLOOD: A GIS-Based Distributed Model for River Basin Scale Water Balance and Flood Simulation. Int J Geogr Inf Sci. 2010, 24, 189–212. [Google Scholar] [CrossRef]
Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration—Guidelines for Computing Crop Water Requirements—FAO Irrigation and Drainage Paper 56; FAO: Rome, Italy, 1998; Available online: https://www.fao.org/4/x0490e/x0490e00.htm (accessed on 3 July 2024).
Wang, T.; Hamann, A.; Spittlehouse, D.; Carroll, C. Locally Downscaled and Spatially Customizable Climate Data for Historical and Future Periods for North America. PLoS ONE 2016, 11, e0156720. [Google Scholar] [CrossRef] [PubMed]
Ma, L.; Huang, C.; Liu, Z.S.; Morin, K.A.; Aziz, M.; Meints, C. The Correlation between Drainage Chemistry and Weather for Full-Scale Waste Rock Piles Based on Artificial Neural Network. J. Contam. Hydrol. 2021, 239, 103793. [Google Scholar] [CrossRef] [PubMed]
Cho, K.; van Merriënboer, B.; Bahdanau, D.; Bengio, Y. On the Properties of Neural Machine Translation: Encoder–decoder Approaches. In Proceedings of the SSST 2014—8th Workshop on Syntax, Semantics and Structure in Statistical Translation, Doha, Qatar, 25 October 2014. [Google Scholar] [CrossRef]
Bergstra, J.; Yamins, D.; Cox, D.D. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16–21 June 2013. [Google Scholar]
Kingma, D.P.; Ba, J.L. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015—Conference Track Proceedings, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]

Figure 1. A schematic of methodology developed in the present study for simulating waste rock drainage quantity.

Figure 2. A schematic of the neural network structure design for the drainage quantity model.

Figure 3. Comparison of model outputs and measured drainage flow rates during model training and testing for (a) Station 1; (b) Station 2; and (c) Station 3.

Figure 4. The monthly contribution of the bias term to the drainage quantity prediction for the three drainage monitoring stations studied.

Figure 5. Comparison of the sensitivity test results (one involving increasing the temperature and the other involving increasing the total precipitation) with the original predictions of the test sets for (a) Station 1; (b) Station 2; and (c) Station 3.

Figure 6. Non-negative monotonicity means that all data points are positioned along or above the diagonal line. (a) Monotonicity test results for the drainage quantity model with the physical loss term in the loss function; (b) Monotonicity test results for the drainage quantity model without the physical loss term in the loss function.

Figure 7. The decrease in the NSE scores of the test sets with an increasing number of lag days for the three drainage monitoring stations.

Figure 8. Comparison of the drainage flow rates predicted by the original model, predicted by the model with three lag days, and the observed values of the test set of Station 1.

Table 1. The best combination of hyperparameters for models trained for three waste rock drainage stations. Units are the dimensionality of the output space from the recurrent GRU layer; λ_p and λ_r are two scale factors mentioned in Equations (15) and (16).

	Units	Learning Rate	λp	λr
Station 1	14	0.0005	1.5	1
Station 2	32	0.0005	1.5	0.1
Station 3	8	0.0005	1.5	0.1

Table 2. Distance (in kilometers) and elevation differences (in meters) between three waste rock piles and their respective weather monitoring stations.

	Distance (km)	Elevation Differences (m)
Station 1	49.0	1060
Station 2	49.2	1060
Station 3	5.0	660

Table 3. The RMSE and NSE for evaluating the performance of the drainage quantity model during model training and testing.

	¹RMSE (m³/s)		²RMSE (m³/s)		²NSE
	Train	Test	Train	Test	Train	Test
Station 1	0.4449	0.9175	0.5330	0.5701	0.8052	0.8904
Station 2	0.9033	1.0108	0.4920	0.5660	0.9244	0.9082
Station 3	0.2545	0.4400	0.4387	0.3985	0.8426	0.8616

¹RMSE is for our previously published model and is used as the benchmark. ²RMSE and ²NSE are the two metrics to evaluate the current model.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, C.; Ma, L.; Liu, W. Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints. Minerals 2025, 15, 194. https://doi.org/10.3390/min15020194

AMA Style

Zhang C, Ma L, Liu W. Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints. Minerals. 2025; 15(2):194. https://doi.org/10.3390/min15020194

Chicago/Turabian Style

Zhang, Can, Liang Ma, and Wenying Liu. 2025. "Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints" Minerals 15, no. 2: 194. https://doi.org/10.3390/min15020194

APA Style

Zhang, C., Ma, L., & Liu, W. (2025). Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints. Minerals, 15(2), 194. https://doi.org/10.3390/min15020194

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Prediction of Mine Waste Rock Drainage Quantity Using a Machine Learning Model with Physical Constraints

Abstract

1. Introduction

2. Methodology

2.1. Weather Refining Sub-Model

2.2. Machine Learning Drainage Quantity Model

2.2.1. Conceptual Model

2.2.2. Neural Network Architecture Design

2.2.3. Loss Function Design

2.2.4. Model Tuning

3. Results and Discussion

3.1. Model Training and Testing

3.2. Sensitivity Tests

3.3. Verification of the Non-Negative Correlations by Monotonicity Test

3.4. The Impact of Current Weather Conditions on Future Drainage Quantities

4. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI