2.1. Structural and Practical Identifiability
Depend on the types of model identifiability, there are various examples and techniques to address the issues of model identifiability [
29,
30,
31,
32,
33,
34,
35,
36,
37,
38,
39,
40]. Here, we offer our perspective on this issue. Consider a dynamical system of the form: 
        where 
 represents the state variables, 
 represents the measurable output (e.g., the data), 
 represents the (control) input vectors, for example the administered drug, and 
 represent the set of constant parameters. Note that while 
 can contain time-varying parameters, for a biological system, this complexity can usually be avoided by using a functional representation of the time-varying parameters or explicitly modeling the underlying processes that drive the temporal changes. Thus, for simplicity, we take 
 to contain only constant parameters. The general definition of model identifiability follows [
29,
31].
Definition 1  (Model identifiability). The dynamic system given by Equations (2) and (3) is identifiable if  can be uniquely determined from the given input  and measurable output . If a system is not identifiable, then it is unidentifiable. Furthermore, if  does not contain error, then the identifiability of the system is referred to as structural identifiability. Otherwise, it is referred to as practical identifiability.
 The first example that we gave is an instance of structural unidentifiability and the second is a case of practical unidentifiability. We expand on what it means for a dynamical model to be identifiable with respect to a measurable output. For this next example, we will look at a one dimensional logistic equation
        
Here, 
r and 
K are the intrinsic rate of growth and carrying capacity, respectively, for the population denoted by 
x. Let 
 and 
 be two sets of parameters and assume perfect measurement 
. We will also assume that 
 is known. Then, if 
 and 
 results in the same models dynamics, then
        
For the condition above to hold for all 
, we must have 
 and 
, which implies 
 for almost all 
, except for possibly a set of measure zero. For instance, if 
 or 
K, then the system is at its steady state, making the aforementioned comparison obsolete. 
To put it simply, if a model is identifiable, then two sets of parameters that give the same model dynamics must be identical. This means that an identifiable model does not have potential issue in 
Figure 1. Instead, the uncertainty in model forecast is solely dependent on the uncertainty in the data.
We remark that the above system does not contain a control term; however, for biological systems, if the identifiability of the system without the control can be studied, then adding the control term afterward usually does not change its structural identifiability, see the example given by Eisenberg and Jain [
31]. What we showed here is an example of a direct test of model identifiability, originally used by Denis-Vidal and Joly-Blanchard [
41]. While the direct test method is often not used in practice, it serves as an intuitive description for model identifiability from a dynamical system perspective.
To test for model structural identifiability, software built on differential algebra theory, such as DAISY, is the gold standard [
42]. However, structural identifiability does not guarantee practical identifiability, which is necessary for clinical application. The practical identifiability of a model should be studied with data, in particular using the Fisher information matrix or profile likelihood [
24,
31]. In these scenarios, the available data dictates the formulation of the model. However, because mathematical models are often developed independent of the data collection, modelers often must sacrifice certain realistic aspects of the model to keep it identifiable relative to the available data. Reversely, if one first builds a set of candidate models and finds out the required data for accurate model identification, then it may be possible to obtain these data during the collection process. We consider the latter case an ideal scenario, where mathematical modelers and clinicians can collaborate effectively.
  2.2. Observing-System Simulation Experiment via Monte Carlo Method
In the ideal scenario mentioned above, Monte Carlo simulation experiment is our tool of choice to obtain the information on the data required for a model to be identifiable. First, we introduce the statistical model [
43]:
        where 
 is the measurement, 
 is the vector of parameters estimated from 
 observations at time 
. Assuming no model error, then the general form of the measurement error is
        
        with 
, 
 taken to be independent and identically distributed random variables with mean 0 and variance 
. For biological application, it is reasonable to expect the measurement error to be proportional to the measurement itself, so we fix 
, giving us a relative error. The steps of the Monte Carlo simulation method follow [
29,
44].
- Determine the appropriate set of true parameters  for the simulation. 
- Numerically solve the ODE model to obtain the measurements at desired time points. 
- Generate M sets of simulated data from the statistical model (6) and (7) with a Gaussian error structure and a chosen standard deviation  around mean 0. 
- Fit the model to each of the M simulated data sets to obtain the parameter estimates . Here, we take M to be 200 sets. 
- Calculate the average relative estimation error (ARE) for each element of  -  as
             - 
            where  -  and  -  are the k-th element of  -  and  - , respectively. 
- Repeats steps 2 through 5 with increasing . 
A model is practically identifiable if the 
ARE is less than the variance 
, meaning we want the error in the parameter estimation to be less than the error in the data. When the variance 
 is 0% and 
ARE is sufficiently close to 0%, then the model is considered to be structurally identifiable. We borrow the idea from the observing-system simulation experiment where we will use different hypothetical sets of data to test whether they can help us identify key parameters [
45,
46]. By continually restricting the amount of information we have from the data, we can approximate the threshold of information required for model identification.
The MC simulation approach is not error-free. One such limitation comes from choosing the initial guesses. For example, if we start our initial guess close enough to the true set of parameters, then the effect of the error  may be limited. However, if we have our guesses far away from the true set of parameters, then the numerical optimization may become trapped in some local minimum away from the true estimate or the parameters may not be sensitive enough to be estimable. One can start with random samples of initial guesses to have a better chance at reaching the true estimates. However, this sampling approach does not inherently deal with the issue of the insensitive parameters. Here, we pick the initial guesses randomly within 50% of the true value. In order to rule out any parameters that are not sensitive with respect to the tolerance of the numerical optimization schemes, we carry out the MC approach for each individual parameter with error-free data (). Any parameters that cannot be refitted within reasonable ARE will be eliminated (or become fixed) from the pool of free parameters. The remaining parameters are deem to be sensitive enough for the numerical scheme. As we amp up the tolerance of the numerical scheme, eventually we should be able to fit all parameters when no error is present in the data.
  2.3. Two Mathematical Models for Prostate Cancer
Many mathematical models for prostate cancer have been developed in the past two decades [
1,
18,
47] with many recent studies focused on immuno- and chemo-treatments of prostate cancer [
48,
49,
50,
51,
52,
53,
54,
55,
56]. Here, we divert from this trend and instead use two simpler mechanistic models to demonstrate the concept of model identifiability in practice. Both models contain a clear prognostic parameter that keeps track of cancer progression, which greatly simplifies their structure. Using these models, we aim to show that even if the model itself may not be identifiable, having a model-based prognostic parameter allows modelers to focus the resource to identify these key parameters. This would be helpful in practical settings due to limitation in data acquisition.
A cancer stem cell model. Cancer stem cells propel cancer’s therapeutic resistance and are thought to be a primary factor in the initiation and progression of prostate cancer [
57,
58,
59,
60]. Utilizing the mathematical model below, in conjunction with the stem cell hypothesis, could provide a better understanding of prostate cancer’s acquisition of castration resistant cells and their heterogeneity within a mass. Prostate cancer stem cells are thought to express little to no androgen receptors, giving them the ability to multiply their population without a hormone requirement [
61]. Resistance is achieved with cancer stem cells’ ability to thrive in the absence of androgen, which provides a means for cancer to continue to evolve during and after intervention with intermittent androgen deprivation therapy [
12,
17,
62].
 Prostate cancer stem cells continue to rapidly divide after treatment, either asymmetrically to form differentiated cells or symmetrically to form additional stem cells. The production of differentiated cells results in negative feedback of the production of stem cells. However, unlike stem cells, differentiated cells are affected negatively by androgen deprivation therapy. The ability to withstand androgen deprivation is just one of the many contributing factors that give rise to the renewal of stem cells. For instance, mitochondrial fission factor expression plays a role in the evolution and multiplicity of prostate cancer stem cells [
63].
Here, we use a novel model built upon this concept for prostate cancer from the studies by Brady-Nicholls et al. [
12,
17,
62]. The model consider three compartments, the cancer stem cells (
S), the differentiated cancer cell (
D), and the PSA byproduct (
P). While it is simpler in structure, the model has shown promises in its applicability.
        
The cancer stem cell population S is assumed to divide at a rate  to produce either one stem cell and one cancer cell with probability , or two cancer cells. This division has a negative feedback from the differentiated cancer cells, which takes the form . The cancer cell is killed by the drug at a constant rate , where  denotes the application of the drug. PSA is produced by cancer cells at a rate , which is cleared from the blood stream at a rate .
Since the drug applications for these model, 
u and 
, are known input. For simplicity, we can treat them as constant. Since they are known, their variation in time should not affect the identification of the other factors. Additionally, in practice, the drug application would be fixed for a certain period of time depending on the specific treatment. We take the following parameter values as the true values for our study: 
 (dimensionless), 
 (dimensionless), 
, 
, 
, and 
 [
62].
A cell quota cancer model. Prostate cancer cells require androgen for growth, which is why the effect of androgen is regularly incorporated into prostate cancer model [
1,
64,
65]. However, the quantitative connection between androgen and prostate cancer growth is not well characterized, leading to various functional forms used for this purpose.
 Here, we use a cancer model that integrates the effect of androgen based on a stoichiometric modeling framework [
15,
66,
67]. The model was developed in a series of studies that highlight the importance of androgen dynamics in prostate cancer growth [
11,
13,
16,
26,
27,
28,
64,
68,
69]. In this model, cancer independence to androgen is modeled as a variable explicitly and can be used as an indicator of cancer growth. Meade et al. later expanded on this idea to build a more biologically realistic model of cancer growth for predicting treatment failure [
16]. Despite its simplicity, the model is founded on established biological principle and can capture and predict the dynamics of cancer progression.
        
The cancer population, denoted by 
x, grows based on the Droop cell-quota model. The death rates are contributed by an androgen dependent term, 
, and a density dependent term, 
. Here, 
 is the maximal androgen dependent death rate for the cancer. The authors assume that the cancer cells lose their dependence on androgen at a rate 
, which can be interpreted as the “rate of gaining androgen independence”. With this interpretation, under androgen deprivation therapy, the treatment would gradually become ineffective. 
Q and 
P are the intracellular androgen level and serum PSA, respectively. The dynamics of 
Q is governed by an influx of serum androgen and the uptake of cancer cells. 
 and 
 represent the rates at which androgen is being produced by the testes and the adrenal gland, respectively, with the drug application denoted by 
u. 
P is assumed to be produced as a baseline by normal cells, but mainly by cancer cells, and is cleared from the blood stream at a constant rate. We take the following parameter values as the true values for our study: 
 (dimensionless), 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, and 
 [
13,
15].