1. Introduction
Cancer is one of the most complex diseases, and as such, scientists are deeply involved in studying not only the biological nature and interactions of the processes involved in cancer but also in creating mathematical models of these processes. Chemotherapy remains a cornerstone of cancer treatment but is often associated with significant drawbacks, including severe toxicity, adverse side effects, and a lack of specificity in targeting tumor cells while sparing healthy tissues. Conventional chemotherapy protocols do not account for individual patient characteristics. As a result, patients may experience suboptimal treatment outcomes, either due to under-dosing, which fails to eradicate the tumor, or excessive dosing, which leads to unnecessary toxicity and immune suppression.
Therefore, the identical cancer treatment applied to a large group of patients can effectively cure only some of them [
1,
2]. Often, we are unable to fully explain the mechanisms why the treatment fails in some patients [
3]. Due to a complex interplay of factors such as the type, aggressiveness, and stage of cancer, patient genetics, diet, immune system status, body weight, and gender, the efficiency of cancer therapies can vary significantly [
4,
5]. Another important factor affecting the response of the organism is the chemotherapy substance chosen for the treatment [
6].
The design of chemotherapy dosing protocols for cancer patients often relies on empirical determination or systematic design based on population models [
7,
8,
9,
10]. These models typically derive their parameters from clinical data across large groups of patients. As a result, chemotherapy doses and treatment outcomes have a mean-population character, failing to account for individual patient variability [
11]. Consequently, the application of such therapy carries risks, potentially hindering the success of chemotherapy [
12]. To compensate for inter-subject variability by aligning the model parameters with the observed patient’s response to the treatment, advanced techniques are necessary [
13].
These issues have given rise to the concept of personalized (precision) medicine (PMed), a new healthcare approach that seeks to address why patients with similar clinical symptoms often exhibit different responses to the same treatment [
14]. Personalized medicine focuses on tailoring treatment to the individual characteristics of each patient [
15]. This approach leverages a set of patient-specific information or characteristics to design customized therapeutic strategies [
16]. Unlike the “one-size-fits-all approach”, which aims to develop treatments for the average person, precision medicine focuses on the differences between individuals [
17]. This shift towards a patient-centered approach has the potential to improve survival rates, enhance quality of life, and reduce the burden of chemotherapy-related complications.
In the case of model-based personalization, an adequately parsimonious mathematical model is required [
18,
19]. Numerous mathematical models have been developed to describe tumor growth, its interactions with the immune system, and responses to various cancer therapies, including chemotherapy [
9,
20,
21,
22,
23,
24], immunotherapy [
25,
26,
27], radiotherapy [
28], anti-angiogenesis [
29], and their combinations [
8,
30,
31]. A comprehensive review of cancer modeling can be found in [
9,
32,
33].
Current efforts in the field are focused on two distinct approaches to developing personalized therapies:
Iteratively and heuristically adjusting the therapy during treatment. This suggests an adaptive, trial-and-error approach where therapy is modified based on patient response. This is common in clinical practice but may lack a formal mathematical framework.
Rigorous approach using a mathematical model to guide therapy, followed by model-based optimization to refine treatment strategies.
The latter can be split into two steps [
34]:
Personalization of the model: This step involves identifying the model parameters that correspond to the individual patient, transforming the population model into a personalized one.
Personalization of the treatment: This step involves determining the optimal chemotherapy dosing protocol tailored to the patient based on the personalized model.
Several studies have explored the integration of machine learning (ML) into oncological treatment, particularly in pharmacogenomics and chemotherapy personalization or by integrating medical imagining [
35]. The studies [
36,
37] demonstrated the potential of ML in analyzing large-scale genomic, proteomic, and other omics datasets to identify genetic markers associated with drug responses. However, a drawback of these studies is that the methods used remain relatively simple, relying on heuristic or data-driven models without a strong mechanistic foundation.
The studies [
38,
39] focus on the widely used Gompertz tumor growth model, employing methods such as least squares minimization, maximum likelihood estimation, and the Extended Kalman Filter (EKF). While these methods offer robust estimation techniques, they require precise knowledge of process noise and measurement noise parameters to ensure optimal results. This presents a major limitation, as these noise parameters are often unknown in practical applications.
The studies [
40,
41] employ nonlinear function optimization techniques to estimate the parameters of a fourth-order tumor growth model. These methods rely on cost functions based on the least-squares criterion, where parameter values are chosen to minimize the sum of squared differences between simulated and measured tumor volumes. However, the issue is that fitting the model response to the data does not ensure that parameters are estimated accurately.
The studies [
42,
43] explore the combination of state and parameter estimation techniques, comparing the Extended Kalman Filter (EKF) with Moving Horizon Estimation (MHE). These approaches augment the state vector with the parameter vector and use optimization techniques to minimize the root-mean-square error between predicted and measured tumor volumes. Both methods require accurate noise modeling, which remains an open problem in practical applications.
The study in [
44] applies Monte Carlo Approximate Bayesian Computation (ABC) to simultaneously perform model selection and parameter estimation for five different tumor growth models. This approach provides a probabilistic framework for quantifying uncertainty in both model choice and parameter values. However, a key drawback is the high computational cost, which may limit scalability.
Finally, the most relevant studies [
45,
46,
47] employ artificial neural networks to estimate tumor model parameters using sparse measurement data and prior information about parameter intervals. These approaches leverage in silico-generated training data spanning large parameter intervals and incorporate randomized parameters to improve generalization. Two different neural networks were trained, to estimate the pharmacodynamic and the pharmacokinetic parameters. The most significant limitation is that the inputs of the neural networks are directly the measurements of the tumor volume, which hinders scalability and necessitates the use of a four-layered neural network structure. Besides that, these studies did not develop any personalized treatment strategies.
Therefore, to reflect the current gaps in the state of the art, the strategy proposed in this paper is to obtain personalized model parameters by evaluating the patient’s response to an initial dose of chemotherapy with a multilayer perceptron artificial neural network (MLP NN) with only one hidden layer. The proposed approach does not utilize an MLP NN as a replacement for the original biological dynamic model or as a digital twin [
48]. Instead, it employs the neural network as a mapping function that directly relates the subject-specific parameters to their effects on the individual state responses. This method effectively bypasses the need for numerically solving the model differential equations when estimating model parameters and does not require making assumptions about the noise characteristics. Moreover, we shifted from pure model personalization addressed in the literature towards treatment personalization, thus completing the whole clinical scheme.
In the context of the state-of-the-art review presented, the most significant novel contributions of this paper are as follows:
A methodology to personalize (estimate) uncertain parameters of the dynamic model of tumor growth, chemotherapy and the immune system based on analyzing the clinical response of an oncology patient to a single chemotherapy dose using the pretrained MLP NN.
A parametrization technique to approximate the full sampled state responses with an analytical parametric function, resulting in a significant reduction in input data dimension and training complexity of the MLP NN, allowing for MLP NN with only one hidden layer.
Combination of the model personalization with the model-based optimization of chemotherapy dosing protocol ensuring personalized optimal cancer treatment.
In contrast to traditional parameter identification techniques, which typically rely on numerical optimization of a quadratic criterion to minimize residuals between the measured and simulated model responses, or a likelihood function [
49,
50], the proposed approach ensures faster parameter estimation through the evaluation of the MLP NN output.
Finally, the proposed approach focuses on fitting the parameter values themselves rather than fitting the state responses, which will ultimately result in better general validity of the estimated models than in traditional approaches.
2. Materials and Methods
2.1. Model of Cancer and Chemotherapy
In this section, a mathematical model of tumor growth dynamics, chemotherapy, and the interactions with the immune system will be presented. The model itself and the values of its nominal (mean-population) parameters will be adopted from [
23].
The adopted model is not limited to a particular chemotherapeutic drug or cancer subtype. It focuses on the dynamics of a single solid tumor, without accounting for further proliferation or metastatic development. This standard phenomenological model describes the dynamics of four state variables representing the following quantities in a patient with cancer:
natural killer cell population
cytotoxic T cells population
tumor cell population
amount of anticancer drug in the tumor site
Natural killer (NK) cells are a key component of the innate immune system, responsible for detecting and eliminating abnormal cells, including tumor cells and virus-infected cells, without prior sensitization. Unlike adaptive immune cells, NK cells do not rely on antigen recognition but instead use a balance of activating and inhibitory receptors to distinguish between healthy and malignant cells. When NK cells recognize a target cell with reduced major histocompatibility complex (MHC) class I expression—a common feature of tumor cells—they release cytotoxic granules containing perforin and granzymes, leading to cell lysis.
Cytotoxic T lymphocytes (CTLs), also known as CD8+ T cells, are key players in the adaptive immune system and play a vital role in tumor immunosurveillance. Unlike natural killer (NK) cells, CTLs must be activated through antigen presentation by antigen-presenting cells (APCs) via MHC class I molecules. Once activated, they recognize and attach to tumor cells displaying tumor-specific or tumor-associated antigens. This leads to the induction of apoptosis in target cells, either through the release of perforin and granzymes or by engaging death receptor signaling pathways.
The considered mathematical model is defined by differential equations [
23]
Input
represents the rate of chemotherapy administration.
The state variables form the state vector
defined as
The adopted model is based on the following assumptions [
23]:
Tumor cell populations are reduced by both immune effector cells and chemotherapy.
Effector cell numbers decline due to natural degradation, their consumption in tumor cell destruction, and the impact of chemotherapy.
Chemotherapeutic agents influence both tumor cells and immune effector cells through a mass-action process.
Administering the drug at a higher constant rate may lead to a greater reduction in both tumor cells and immune effector cells.
The growth of natural killer cells follows a logistic model as , while their suppression occurs through interactions with tumor cells, defined by . Cytotoxic T cells are activated only in response to tumor presence while being generated via a bilinear term . Their natural decline follows a linear process , and they are further inactivated upon interaction with tumor cells, represented by . Tumor cell dynamics is modeled using a logistic growth function . Tumor elimination occurs through the action of both NK cells and cytotoxic T cells, with respective terms and accounting for their effects.
The chemotherapy drug follows first-order pharmacokinetics and influences all three cell populations—NK cells, cytotoxic T cells, and tumor cells—via a mass-action mechanism. Each cell type exhibits a specific susceptibility to the drug, characterized by distinct mortality rates with coefficients , , and , respectively.
The parameters of the model defined by (1) can be gathered into the parameter vector
as
The nominal values of the model parameters and their biological interpretations are summarized in
Table 1. Considering these nominal parameters, we define the nominal parameter vector
as
Each patient developing cancer is unique, hence the parameter vector typically varies significantly between the subjects within the population. This makes the evolution of state variables hard to predict without model personalization for the particular patient and their cancer phenotype.
Because model (1) is relatively broadly applicable to various chemotherapeutic agents and cancer subtypes, if the focus is on a single solid tumor, this motivates personalization by tailoring the model parameters to reflect the specific characteristics of the cancer type, the patient’s physiological conditions, and the properties of the administered chemotherapeutic drug. Key characteristics of the chemotherapeutic agent are encapsulated in the coefficients , , , which quantify its selectivity in targeting specific cells. The aggressiveness of the cancer is characterized by parameter , representing the tumor growth rate. Meanwhile, the immune system’s effectiveness in combating the tumor is determined by a complex interplay of additional model parameters such as the growth rate of natural killer cells , the activation rate of cytotoxic T cells , capturing the dynamic interactions between the tumor and the immune response.
The roles of this mathematical model are the following:
To substitute for in vivo data by providing the simulated state responses for various parameters, dose sizes, and initial conditions when creating the training and validation datasets.
To provide the simulated state responses when evaluating the cost function of the chemotherapy dosing protocol optimization.
Since the model in (1) is nonlinear, the state responses
cannot be obtained analytically. Because of this, the only way to obtain the state responses
is by numerical solution of the differential equations given by (1), using Runge–Kutta methods [
51] for numerical integration.
2.2. Statistical Properties of the Model Parameters
Consider that the parameter vector given by (3) is a random variable following a multivariate normal distribution, which describes the statistical properties related to the inter-subject parametric variability in the population of patients with cancer.
The mean value is given by vector
as
The second statistical characteristic is the covariance matrix
defined as
The diagonal elements of
represent the variances of the individual parameters, hence defining the measure of parametric uncertainties. The off-diagonal elements represent the covariances between pairs of parameters, which quantify the degree to which these parameters change together.
To generalize the theoretical analysis, we will introduce a new dimensionless random vector
following the standard normal distribution, which can be noted as
Then, the parameter vector
will be related to random vector
via matrix
and the nominal vector
as
Dimension
represents the number of random factors affecting the parameter vector
. According to (8), the covariance matrix
defined by (6) obtains
Considering the confidence level
, the random vector
will lie in the symmetrical confidence interval
with
probability, where
is the
quantile of the standard normal distribution and
is the vector of ones.
Based on (8), we can claim that the parameter vector
lies in the confidence interval
Then, the absolute interval parametric uncertainty obtains
Equation (12) will be used to determine the entries of matrix
based on the assumptions about the relative interval parametric uncertainties given in percent by
as
where
is the index of parameter and
, which increments if
.
2.3. Parametrization of the State Responses
It is an important fact that the sequences of sampled state responses for and the sample time cannot be effectively used as inputs to the MLP NN. Although theoretically possible, it is very unpractical due to the dimensions, i.e., the large number of inputs to the MLP NN. Besides that, it is very likely that the MLP NN would not be able to sufficiently generalize the information comprised in such raw data and thus provide a reliable parameter estimate.
Therefore, the proposed strategy will be to extract the features of the sampled state responses
by parametrizing them. The parametrization technique will be based on an approximation of the state responses by a parametric analytical function while determining its parameters in terms of linear regression. After the parametrization, the sampled state responses are described by vectors
,
,
of
parameters as
This approach will lead to a significant reduction in the training complexity and the memory footprint since
.
Consider a parametrizable weighted sum of exponential functions
as
where
,
, …,
are the function parameters to be fitted and
are appropriately chosen tuning coefficients.
This function will approximate signal , which substitutes for the state responses , , available in the form of finite sampled sequences for . The absolute term in (15) is involved to reflect nonzero steady states of model (1), allowing for a nonzero finite limit .
Equation (15) can be noted in linear form
where
is the regression vector.
Based on (16), we can formulate the linear regression problem
where
is the regression matrix and
is the vector of samples.
The parameter vector
will be obtained in terms of the least squares method analytically as [
52]
2.4. Artificial Neural Network
2.4.1. Structure of the Network
We chose the structure of multilayer perceptron artificial neural network having the output layer with linear activation function and one hidden layer comprising
neurons with hyperbolic tangent sigmoid activation function [
53]. The structure of this MLP NN is depicted in
Figure 1.
The learning algorithm for training this MLP NN was the classical Levenberg–Marquardt error backpropagation [
54]. The functionality of the MLP NN, including the output calculation, and the weights and biases adaptation (training) are implemented by the functions of Matlab Deep Learning Toolbox (R2023b,
https://www.mathworks.cn/products/deep-learning.html) [
55].
2.4.2. Inputs and Outputs
The input vector
to the MLP NN will be formed by vectors of parameters
,
,
obtained by the parametrization of the state responses
respectively, the initial state vector
and the administered leading dose size
as
If
is the dimension of
,
,
according to (14), then
.
The reason for providing the information on is to ensure parameter estimation under the assumption of an arbitrary initial state. By taking the initial state into account explicitly, the MLP NN will be capable of separating its effect on the state responses from the effect of parametric variations.
It is important to mention that by providing the dose size and the initial state , which involves the initial drug amount , the information on the chemotherapy administered within the current and previous dosing periods (cycles) also enters the personalization. A significant (nonzero) dose is mandatory to properly capture the effects of chemotherapy. This enables the parameters , , to be estimated.
The model personalization itself consists of evaluating vector by the MLP NN to provide vector representing the estimate of . Subsequently, the estimate of model parameter vector is determined from according to (8).
The MLP NN can be seen as a nonlinear multivariate vector function
[
53].
where
is a vectorized hyperbolic tangent function,
,
are the matrices of weights in the hidden and the output layer, respectively, and
,
are the vectors of biases in the hidden and the output layers, respectively. The number of neurons in the hidden layer
was determined based on the heuristic rule of thumb
, which is satisfactory for
.
2.4.3. Generating the Training Dataset
The dataset for training of the MLP NN was generated according to the procedure described as follows. The parameters given by (3) corresponding to virtual oncology subjects were generated randomly while considering their statistical distribution as defined before. The relative interval of parametric uncertainties of the individual model parameters was chosen as
Notice that in (21), only
of a total of 14 parameters in the parameter vector
are considered uncertain, while the others remain nominal, i.e., fixed across the population. In particular, parameters
are considered uncertain and vary among the subjects. This choice is strongly supported by analyzing the magnitudes and the linear dependencies of the corresponding sensitivity functions of the state variables with respect to the parameters as presented in
Appendix A.
The sensitivity analysis allowed us to understand the percentual changes in which parameters have an observable effect on the state variables. Then, we could adjust the relative interval parametric uncertainties (21) according to the magnitudes of the sensitivities (see
Figure A1,
Figure A2 and
Figure A3) to ensure that their effects are realistically balanced. It means that if the state sensitivity to the
-th parameter is relatively low, the corresponding parametric uncertainty should be higher than that of a parameter with much greater sensitivity.
However, it is important to acknowledge that the uncertainties given by (21) were chosen without the support of clinical results or data, as these are not available in the literature or in the public domain. Experimentally determining the actual statistical distribution of the parameters is beyond the scope of this paper, as it requires large studies with real subjects.
Considering the
confidence level, the entries of matrix
were determined according to (13) as
The structure of matrix
implies that the covariance matrix
defined by (9) will be diagonal, which means that the random variations in the model parameters are not mutually correlated within the population.
To create the training dataset, there will be generated a total of 1500 different virtual patients. More specifically, for each of these subjects in the virtual population, a random vector
will be generated considering standard normal distribution (zero mean and unit covariance matrix). The corresponding parameter vector
will be calculated according to formula (8), considering the mean
given by the values in
Table 1 and matrix
given by (22). After that, the states of each virtual patient will be simulated by numerically solving the differential Equation (1) using the Runge–Kutta 4-th order numerical integration method [
51] to obtain the sampled state responses
for a given
and the dosing period (cycle duration)
. This simulation (numerical solution) will be repeated for 100 different initial conditions and leading dose sizes
. The initial conditions
will also be randomly generated, assuming the normal distributions with the mean values equal to the corresponding stable equilibrium states
and the standard deviations are given as
, respectively. The leading dose size
for each entry of the dataset was generated as a uniformly distributed random number from the interval 0.1–1.5. The training dataset thus includes
entries. After obtaining the simulated state responses, their parametrization will be performed to obtain the individual input vectors
defined by (19).
The full training dataset was divided randomly into three subsets within the Matlab Deep Learning Toolbox: 70% for training, 15% for validation, and 15% for testing.
2.4.4. Validation by the Covariance Matrix of the Estimate Error
The aim of the validation is to assess the personalization performance of the trained MLP NN using new data. The validation dataset with entries was generated randomly by following the same procedure as in the case of the training dataset but with a different seed for the random number generator.
By evaluating the output of the MLP NN for all entries of the validation dataset, the estimates will be determined according to (20) for and the estimated residuals will be calculated as .
The validation and performance assessment will be carried out in a statistical framework by analyzing the covariance matrix of the estimated residuals. The covariance matrix of the estimated residual
will be defined as
From the statistical perspective, personalization should minimize the variances of the residuals , while ensuring statistical unbiasedness.
In practice,
will be obtained as the sample covariance matrix
considering
entries of the validation dataset as [
56]
Since
follows the standard normal distribution with
as defined by (7), the personalization is successful if the variances of the residuals are significantly lower than one. This can be noted as
The above can be interpreted such that the aim of personalization is not to provide the exact values of the unknown model parameters but to lower their uncertainties. Another condition is that the mean value of the residuals must be close to zero to ensure the estimate unbiasedness, which can be noted as
2.4.5. Confidence Interval of the Parameter Estimate
To be able to assess the accuracy of the estimated dimensionless vector or the estimated parameter vector , which both represent the point estimates, it is useful to determine their confidence intervals.
Considering the confidence level
and the sample covariance matrix
, the confidence interval of
obtains
The confidence interval is symmetrical because
for the normal distribution. Then, considering (8), we can find the confidence interval of the model parameter vector
as
Based on the above, the interval uncertainty of the estimated parameter vector
obtains
It can be claimed that the personalization should provide the parameter estimate with the uncertainty
satisfying
Substituting
according to (12) and
according to (29) yields
which confirms (25).
2.5. Nonlinear Least Squares Method to Minimize the Model Response Error
A more conservative approach to the personalization of model (1) is the least squares minimization of the model simulation error with respect to the data in the form of sampled state responses. This approach considers a quadratic cost function to minimize the sum of the squared errors/residuals between the provided sampled states
and the fitted model responses
obtained by numerical solution of (1). The corresponding cost function has the form
where
,
,
are the weights.
It is important to remark that since the model responses
,
,
are not linear with respect to the estimated vector
, the linear regression cannot be used to find
minimizing (32). This also means that the cost function given by (32) cannot be written as a quadratic form with respect to
and minimized analytically, implying a nonlinear least squares problem. Therefore, the estimate
must be determined by numerical optimization using some gradient-free algorithm such as the Nelder-Mead [
57] or genetic algorithms [
58]. It suggests that the main drawback of this conventional approach is the computational complexity of the personalization itself, which is significant due to the need for numerical optimization involving repetitive evaluation of cost function given by (32) and numerical solution of the model differential equations.
This contrasts with the parametrization of the state responses and calculating the output of the MLP NN in the proposed approach. Therefore, it can be concluded that despite the significant burden involved in generating the training dataset and in the process of training itself, the proposed approach is superior in terms of the computational complexity of personalization.
2.6. Personalized Chemotherapy
In this section, the chemotherapy dosing protocol will be optimized considering the mathematical model given by (1) with its parameters estimated by the proposed personalization approach.
2.6.1. Target Steady State
The steady states of the personalized model must be analyzed to find an appropriate target state for the therapy. For and , the state variables will approach their steady values . Given the fact that we are dealing with a nonlinear system, it is typical to have multiple steady states, each with a specific region of attraction. Convergence of the states to one of the stable steady states depends on the initial state . Determining the steady states is crucial for the chemotherapy dosing design as we need to specify the target (reference) state representing the cancer in remission. The steady states can be determined as the solutions of a system of algebraic equations resulting from (1) under the assumption of .
Another important aspect is that since the chemotherapy dosing will terminate after completing the treatment protocol, it is necessary to ensure that the state variables remain stable after reaching the vicinity of the target steady state. Therefore, satisfying the criterion of local stability for the chosen target steady state is necessary for the success of therapy and inducing remission. Otherwise, the cancer would relapse, and the tumor would regrow. By studying the Jacobian matrix, we can determine the local stability of a steady state based on its eigenvalues. If all eigenvalues have negative real parts, the equilibrium point is locally stable.
The Jacobian matrix
is given by
There are four different clinical scenarios for the steady states. The first is the Dead state represented by all state variables being zero, which means that the tumor is gone, but the immune system is also depleted. This state is undesired from the clinical perspective. The ideal condition is the Cured state having zero tumor cells and enough natural killer cells. However, this state is typically unstable and hence cannot be maintained. The worst scenario is the Grown state manifesting by a large number of tumor cells, representing the final stage of cancer. Therefore, the so-called Coexisting state seems to be the best option as it corresponds to cancer in remission having a small number of cells and enough cells. If one of the Coexisting states satisfies the local stability criterion, it will be chosen as the target state for the treatment.
2.6.2. Personalization and Need for the Leading Dose
Before optimizing the chemotherapy dosing protocol tailored for the patient, the model needs to be personalized. However, to personalize the model, we need the sampled state responses from one dosing period (cycle), which means that the first dose has already been administered. This order of operations results in a causality paradox. As a consequence, the first administered dose , called the leading dose, cannot be personalized. After the first dosing period elapsed, the sampled state responses are available and thus the model personalization can be carried out. Finally, the remaining doses of the dosing protocol will be optimized based on the personalized model. It can be concluded that the personalization of chemotherapy cannot be achieved without the data including the subject’s response to a single chemotherapy administration.
2.6.3. Optimization Problem
The aim of cancer treatment is to minimize the tumor cell population , maintain a healthy population of natural killer cells while minimizing the total amount of chemotherapy administered during the treatment to reduce its adverse effect on healthy cells.
The objective of designing the chemotherapy dosing protocol is to ensure the transition of the state variables from their initial values
to the coexisting state
, which represents cancer remission [
59]. The therapy is implemented as a finite sequence of
individual (discrete) doses
, delivered periodically as short, uniformly spaced impulses (boluses) with a period of
days. Each dose is represented as a rectangular pulse of the administration rate signal
, lasting
and having an amplitude of
, ensuring that the pulse’s area under the curve equals
.
The aim of the design is to find vector
of
positive doses
minimizing the criterion
where
,
,
,
are the weighting coefficients. Since
represents positive quantities of chemotherapy to be administered, the additional constraints are imposed as
. Because the responses
,
,
are not linear with respect to the decision vector
, the traditional quadratic programming problem cannot be formulated. Therefore, given that our problem is finite-dimensional, we opt for a numerical optimization while
,
,
,
are obtained by the numerical solution of differential equations given by (1) using the 4th-order Runge–Kutta method.
To determine the vector
that minimizes (35), numerical optimization was performed using the fmincon function available in MATLAB Optimization Toolbox (R2023b,
https://www.mathworks.cn/products/optimization.html) [
60]. This function implements the interior-point method to solve constrained numerical optimization problems [
61,
62].
Interior-point methods represent a class of optimization algorithms that focus on exploring the interior of the feasible region. They use a barrier function (typically logarithmic) to penalize points near the boundary, ensuring iterates remain within the feasible region defined by inequality constraints. The barrier parameter is gradually reduced, allowing the solution to approach the boundary as it converges to the optimum. These methods leverage a central path, a trajectory defined by the solutions of the barrier-augmented optimization problem as the barrier parameter is reduced, to approach the optimal solution satisfying the primal and dual conditions (Karush–Kuhn–Tucker conditions). In the nonlinear least-squares optimization problem considered, where explicit gradients and Hessians are unavailable, the optimizer employs finite-difference approximations for gradients and approximates the Hessian matrix using a quasi-Newton method, specifically the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method.
Another aspect of personalized therapy lies in the patient’s state at the beginning of the treatment (initial state) . However, contrary to unknown parameters , the initial state can be determined by performing the corresponding medical diagnostic procedures (blood analysis, histology, medical imagining techniques).
The whole process of data acquisition, state response parametrization, model personalization, and personalized optimization of chemotherapy doses is visualized in
Figure 2.
4. Discussion
The first portion of the results dealt with the statistical assessment of the model personalization accuracy. This assessment was substantiated by the mean value and the covariance matrix of the parameter estimate residuals as given in
Table 2. Since the mean value of the residuals is relatively close to zero, it can be claimed that the personalization does not produce statistically biased estimates of the parameters. Another important result to highlight is that the diagonal entries of the covariance matrix of the estimate residuals satisfy the variance condition given by (25) with a large margin. This proves that personalization significantly lowers the uncertainties of the model parameters. For better interpretation, the relative interval uncertainty of the original parameter vector
obtained according to (12) and the relative interval uncertainty of the estimated (personalized) parameter vector
obtained according to (29) were visualized in
Figure 3, demonstrating the shrinking of the uncertainty interval after personalization.
It also implies that the trained MLP NN captures and sufficiently generalizes the relationships between the variations in the parameters, the initial state, dose size, and the state responses to a leading dose of chemotherapy.
Besides comparing the estimated parameters themselves, the personalized models were validated by comparing their responses with the reference state responses used for personalization. This can be seen as a validation of the personalized model in the time domain. Such a comparison is documented in
Figure 6 and
Figure 8. By studying these figures, it can be concluded that the personalized models provided an almost perfect match of the state responses.
To confirm this observation, the root mean squared error was computed by comparing state responses between the estimated and reference models for each subject in the validation dataset, separately for each state variable
,
,
as documented in
Table 3. These results revealed that for the sample time
the metrics
,
,
are sufficient, especially if taken relative to the typical magnitudes of the state responses, which have average values
,
and
. This indicates that the mean error of the state responses provided by the personalized model is only a few percent
,
,
and that the best accuracy was achieved for
and the worst for
.
In contrast, increasing the sample time to
day resulted in higher
,
and
indicating slightly deteriorated, yet acceptable accuracy of the state responses prediction and good overall model validity under reduced sampling frequency. Increasing the sampling interval to
day also led to marginally higher variances of the residuals of the estimated parameters and broader confidence intervals in
Figure 3. Nonetheless, this analysis remains crucial, as frequent data collection may pose practical challenges in clinical settings.
For the sake of comparison with other relevant parameter identification approaches, the conventional least squares minimization of the model response error with respect to the reference state responses was evaluated for the validation dataset and the obtained results were quantified by statistical metrics as summarized in
Table 2 and
Table 3. This quantitative comparison demonstrated that the proposed machine learning approach generally achieves much better accuracy of parameter estimation and slightly lower accuracy of state response prediction than obtained by the conventional methodologies that minimize the model response error. Therefore, it can be concluded that the proposed approach provides more accurate parameter estimation, which enhances the general validity of the resulting models, even if it does not achieve the closest fit to the observed state responses. In contrast, although the prediction error methods yield a good fit between the model response and the data, its general validity is limited due to inaccuracies in parameter estimation.
Another important section of the results addresses a detailed case study focused on experimenting with a single particular patient. First, an in silico subject was randomly generated following the considered statistical distribution of the model parameters within the population. Then, the state responses to the suboptimal leading dose were obtained for the duration of one dosing period by simulating the model defined by (1) for the given initial conditions.
A comparison of these state responses with their approximations obtained by the proposed parametrization technique is given in
Figure 4. It is evident that the state responses can be approximated very accurately by the function of an exponential nature and an absolute (constant) term. Nevertheless, it is important to remember that the aim of parametric approximation is not to achieve a perfect fit but rather to extract the features of the responses and thus describe them with a small set of parameters.
Another important result of the personalization accuracy analysis is the comparison between the actual instance of vector
given by (38) and its estimate
given by (41). This is also visualized in
Figure 5. for the parameter vector
. The results demonstrated that the parameters estimated from the finite sequence of state responses to the leading dose were of good accuracy, with all estimates falling within the confidence intervals. The relative (percentual) width of the confidence interval
is visibly different for each parameter depending on the diagonal entries of the covariance matrix
following (38).
The conventional least squares minimization of the model simulation error yielded the estimates given by (43) and visualized in
Figure 7. Based on these results, it can be claimed that the error of personalization is higher than, but still acceptable.
Another important part of the results analysis is the personalization of the steady states, which is ultimately required for the optimal chemotherapy design. Since the steady states are derived from the model parameters, their accuracy results from the accuracy of model personalization. The actual coexisting state is defined by (39), and the personalized models resulted in the estimated coexisting states as given by (42) and (44). We can claim that the most accurate estimate of the coexisting steady state was obtained by the proposed approach.
The remaining results addressed the problem of personalized optimization of the chemotherapy dosing protocol. Based on the model personalized from the patient’s response to the suboptimal leading dose, the sequence of six optimal chemotherapy doses was determined by numeric optimization, and this dosing protocol was applied to the virtual subject, resulting in the state responses plotted in
Figure 9. These state responses demonstrate effective and relatively fast stabilization of all three variables to the target coexisting state representing cancer in remission. However, significant fluctuations are present at the beginning of the treatment, which are difficult to manage due to the fixed suboptimal leading dose and adverse initial conditions. It is also important to notice that the tumor responses showed periodic behavior. The observed periodicity means that the tumor size stably oscillates around the target state, which corresponds to the so-called Jeff’s phenomenon [
7,
20] observed in clinical practice. The personalized optimal treatment was compared with the suboptimal dosing protocol obtained by optimizing the response of the nominal (mean-population) model without tailoring the treatment to the characteristics of the subject. The suboptimal treatment responses in
Figure 10 featured highly undesired fluctuations in the tumor cell population
and the cytotoxic cell population
, resulting in a higher value of criterion
. Besides that, the total amount of administered chemotherapy was significantly higher, implying that the personalized optimal treatment ensures the desired therapeutic outcomes with lower chemotherapy doses.
The paper presented a novel methodology for personalizing mathematical models of tumor growth dynamics, immune system interactions, and chemotherapy effects. By leveraging machine learning techniques and computer simulations, this study aimed to address the limitations of traditional approaches in chemotherapy dosing protocols, which often rely on mean-population models that fail to account for individual patient variability.
This study offers practical benefits for clinicians by providing a framework for guiding decision-making. By integrating machine learning, dynamic models, and optimization, clinicians can simply make more informed decisions. By incorporating patient-specific responses into the treatment protocol, it is possible to reduce the trial-and-error approach, ensuring that patients receive the most effective dose for their unique needs. Clinicians could also use this approach to assess how an individual patient is likely to respond to treatment and whether successful remission is even feasible, leading to a more proactive approach to cancer therapy.
The clinical workflow should be as follows:
Conventionally determine and administer the leading dose .
Determine the sampled patient responses , , via regular bloodwork and medical imagining tools during one dosing period (cycle).
Feed the obtained data to the computer implementing the proposed methodology.
The computer provides the following information to the clinician:
Estimated parameters of the tumor model.
Optimal sequence of chemotherapy doses.
Feasibility of cancer remission and the estimated final tumor size.
The clinician expertly assesses the results and finally determines the future steps and approves the treatment.
The methodology involved generating training and validation datasets by simulating a diverse population of virtual patients. The parameter vector for each patient was assumed to follow a multivariate normal distribution, capturing inter-subject variability. Sensitivity analysis showed that some parameters had negligible effects on the state variables, narrowing down the number of practically identifiable parameters to 9 out of 14. The trained neural network was then capable of estimating personalized model parameters based on clinical data from a single chemotherapy dose, facilitating more accurate predictions of tumor behavior and treatment outcomes.
Key innovations included the use of machine learning to personalize cancer model parameters and the application of parametric approximation of the state responses to simplify training data. The parametric approximation of sampled state responses by the sum of exponential functions reduced the complexity of the input data, ensuring feasible training of the MLP NN and better generalization capability. These advancements enable optimized chemotherapy dosing protocols tailored to individual patients, improving both treatment efficacy and safety. An important benefit of determining personalized steady states is the ability to predict cancer remission feasibility and estimate final tumor size.
The proposed approach has several limitations that should be acknowledged. First, the model personalization process requires administering an initial chemotherapy dose before estimating patient-specific parameters, which may introduce delays in optimizing the dosing protocol and pose risks to the patient. Additionally, once the neural network has been trained, the chemotherapy dosing period (cycle duration) remains fixed. Another challenge lies in the frequent sampling of tumor and immune cell populations, which may not always be feasible in clinical practice. Moreover, only parameters with a significant impact on tumor dynamics can be effectively personalized, potentially reducing the overall accuracy of the patient-specific model. Furthermore, a distinct neural network must be trained for each variation in the mathematical model, increasing the computational burden and limiting generalizability. The last drawback is that the computational complexity of generating the datasets and training the MLP NN is high. These limitations highlight the challenges that must be addressed for effective clinical implementation of the proposed approach.
Future research should focus on addressing the limitations of the proposed approach and enhancing its clinical applicability. One key direction is the development of adaptive dosing strategies that allow real-time therapy adjustments (optimization) based on continuous patient monitoring providing new data. Improving the feasibility of patient-specific data collection is another critical area, which could involve integrating non-invasive biomarkers or imaging techniques to estimate tumor and immune cell populations more efficiently. Further work is also needed to test different and more complex structures of cancer models. Clinical validation through retrospective and prospective trials will be essential to assess the effectiveness and safety of the personalized dosing protocol in real-world settings.