Next Article in Journal
Bridge Deflection Prediction Based on Cascaded Residual Smoothing and Multiscale Spatiotemporal Attention Network
Previous Article in Journal
Creating Effective Self-Adaptive Differential Evolution Algorithms to Solve the Discount-Guaranteed Ridesharing Problem Based on a Saying
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Personalization of Optimal Chemotherapy Dosing Based on Estimation of Uncertain Model Parameters Using Artificial Neural Network

Faculty of Electrical Engineering and Information Technology, Slovak University of Technology in Bratislava, 841 04 Bratislava, Slovakia
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(6), 3145; https://doi.org/10.3390/app15063145
Submission received: 19 February 2025 / Revised: 10 March 2025 / Accepted: 11 March 2025 / Published: 13 March 2025
(This article belongs to the Section Applied Biosciences and Bioengineering)

Abstract

:
Background/Objectives: The effectiveness of chemotherapy in cancer treatment is often compromised by inter-patient variability, leading to suboptimal outcomes. Traditional dosing protocols rely on population-based models that do not account for individual patient responses and the cancer phenotype. This study aims to develop a personalized chemotherapy dosing strategy by estimating uncertain model parameters using artificial neural networks, ensuring an optimal and individualized treatment approach. Methods: A dynamical model of tumor growth, immune response, and chemotherapy effects is used as the foundation for personalization. A training dataset is generated by simulating state responses across a diverse population of virtual patients, capturing inter-subject variability. The state responses are parameterized (approximated) using the sum of exponential functions to reduce dimensionality, and a multilayer perceptron artificial neural network is trained to estimate patient-specific model parameters based on response data from a single chemotherapy dose. Results: The proposed method effectively estimates patient-specific model parameters, significantly reducing uncertainty compared to conventional population-based models or the nonlinear least squares method. Numerical experiments demonstrate that personalized chemotherapy dosing, optimized using the estimated model parameters, achieves fast tumor remission while minimizing the total drug amount. Conclusions: By integrating the artificial neural network as the parameter estimator with model-based optimization, this study presents a novel approach to personalized chemotherapy dosing.

1. Introduction

Cancer is one of the most complex diseases, and as such, scientists are deeply involved in studying not only the biological nature and interactions of the processes involved in cancer but also in creating mathematical models of these processes. Chemotherapy remains a cornerstone of cancer treatment but is often associated with significant drawbacks, including severe toxicity, adverse side effects, and a lack of specificity in targeting tumor cells while sparing healthy tissues. Conventional chemotherapy protocols do not account for individual patient characteristics. As a result, patients may experience suboptimal treatment outcomes, either due to under-dosing, which fails to eradicate the tumor, or excessive dosing, which leads to unnecessary toxicity and immune suppression.
Therefore, the identical cancer treatment applied to a large group of patients can effectively cure only some of them [1,2]. Often, we are unable to fully explain the mechanisms why the treatment fails in some patients [3]. Due to a complex interplay of factors such as the type, aggressiveness, and stage of cancer, patient genetics, diet, immune system status, body weight, and gender, the efficiency of cancer therapies can vary significantly [4,5]. Another important factor affecting the response of the organism is the chemotherapy substance chosen for the treatment [6].
The design of chemotherapy dosing protocols for cancer patients often relies on empirical determination or systematic design based on population models [7,8,9,10]. These models typically derive their parameters from clinical data across large groups of patients. As a result, chemotherapy doses and treatment outcomes have a mean-population character, failing to account for individual patient variability [11]. Consequently, the application of such therapy carries risks, potentially hindering the success of chemotherapy [12]. To compensate for inter-subject variability by aligning the model parameters with the observed patient’s response to the treatment, advanced techniques are necessary [13].
These issues have given rise to the concept of personalized (precision) medicine (PMed), a new healthcare approach that seeks to address why patients with similar clinical symptoms often exhibit different responses to the same treatment [14]. Personalized medicine focuses on tailoring treatment to the individual characteristics of each patient [15]. This approach leverages a set of patient-specific information or characteristics to design customized therapeutic strategies [16]. Unlike the “one-size-fits-all approach”, which aims to develop treatments for the average person, precision medicine focuses on the differences between individuals [17]. This shift towards a patient-centered approach has the potential to improve survival rates, enhance quality of life, and reduce the burden of chemotherapy-related complications.
In the case of model-based personalization, an adequately parsimonious mathematical model is required [18,19]. Numerous mathematical models have been developed to describe tumor growth, its interactions with the immune system, and responses to various cancer therapies, including chemotherapy [9,20,21,22,23,24], immunotherapy [25,26,27], radiotherapy [28], anti-angiogenesis [29], and their combinations [8,30,31]. A comprehensive review of cancer modeling can be found in [9,32,33].
Current efforts in the field are focused on two distinct approaches to developing personalized therapies:
  • Iteratively and heuristically adjusting the therapy during treatment. This suggests an adaptive, trial-and-error approach where therapy is modified based on patient response. This is common in clinical practice but may lack a formal mathematical framework.
  • Rigorous approach using a mathematical model to guide therapy, followed by model-based optimization to refine treatment strategies.
The latter can be split into two steps [34]:
  • Personalization of the model: This step involves identifying the model parameters that correspond to the individual patient, transforming the population model into a personalized one.
  • Personalization of the treatment: This step involves determining the optimal chemotherapy dosing protocol tailored to the patient based on the personalized model.
Several studies have explored the integration of machine learning (ML) into oncological treatment, particularly in pharmacogenomics and chemotherapy personalization or by integrating medical imagining [35]. The studies [36,37] demonstrated the potential of ML in analyzing large-scale genomic, proteomic, and other omics datasets to identify genetic markers associated with drug responses. However, a drawback of these studies is that the methods used remain relatively simple, relying on heuristic or data-driven models without a strong mechanistic foundation.
The studies [38,39] focus on the widely used Gompertz tumor growth model, employing methods such as least squares minimization, maximum likelihood estimation, and the Extended Kalman Filter (EKF). While these methods offer robust estimation techniques, they require precise knowledge of process noise and measurement noise parameters to ensure optimal results. This presents a major limitation, as these noise parameters are often unknown in practical applications.
The studies [40,41] employ nonlinear function optimization techniques to estimate the parameters of a fourth-order tumor growth model. These methods rely on cost functions based on the least-squares criterion, where parameter values are chosen to minimize the sum of squared differences between simulated and measured tumor volumes. However, the issue is that fitting the model response to the data does not ensure that parameters are estimated accurately.
The studies [42,43] explore the combination of state and parameter estimation techniques, comparing the Extended Kalman Filter (EKF) with Moving Horizon Estimation (MHE). These approaches augment the state vector with the parameter vector and use optimization techniques to minimize the root-mean-square error between predicted and measured tumor volumes. Both methods require accurate noise modeling, which remains an open problem in practical applications.
The study in [44] applies Monte Carlo Approximate Bayesian Computation (ABC) to simultaneously perform model selection and parameter estimation for five different tumor growth models. This approach provides a probabilistic framework for quantifying uncertainty in both model choice and parameter values. However, a key drawback is the high computational cost, which may limit scalability.
Finally, the most relevant studies [45,46,47] employ artificial neural networks to estimate tumor model parameters using sparse measurement data and prior information about parameter intervals. These approaches leverage in silico-generated training data spanning large parameter intervals and incorporate randomized parameters to improve generalization. Two different neural networks were trained, to estimate the pharmacodynamic and the pharmacokinetic parameters. The most significant limitation is that the inputs of the neural networks are directly the measurements of the tumor volume, which hinders scalability and necessitates the use of a four-layered neural network structure. Besides that, these studies did not develop any personalized treatment strategies.
Therefore, to reflect the current gaps in the state of the art, the strategy proposed in this paper is to obtain personalized model parameters by evaluating the patient’s response to an initial dose of chemotherapy with a multilayer perceptron artificial neural network (MLP NN) with only one hidden layer. The proposed approach does not utilize an MLP NN as a replacement for the original biological dynamic model or as a digital twin [48]. Instead, it employs the neural network as a mapping function that directly relates the subject-specific parameters to their effects on the individual state responses. This method effectively bypasses the need for numerically solving the model differential equations when estimating model parameters and does not require making assumptions about the noise characteristics. Moreover, we shifted from pure model personalization addressed in the literature towards treatment personalization, thus completing the whole clinical scheme.
In the context of the state-of-the-art review presented, the most significant novel contributions of this paper are as follows:
  • A methodology to personalize (estimate) uncertain parameters of the dynamic model of tumor growth, chemotherapy and the immune system based on analyzing the clinical response of an oncology patient to a single chemotherapy dose using the pretrained MLP NN.
  • A parametrization technique to approximate the full sampled state responses with an analytical parametric function, resulting in a significant reduction in input data dimension and training complexity of the MLP NN, allowing for MLP NN with only one hidden layer.
  • Combination of the model personalization with the model-based optimization of chemotherapy dosing protocol ensuring personalized optimal cancer treatment.
In contrast to traditional parameter identification techniques, which typically rely on numerical optimization of a quadratic criterion to minimize residuals between the measured and simulated model responses, or a likelihood function [49,50], the proposed approach ensures faster parameter estimation through the evaluation of the MLP NN output.
Finally, the proposed approach focuses on fitting the parameter values themselves rather than fitting the state responses, which will ultimately result in better general validity of the estimated models than in traditional approaches.

2. Materials and Methods

2.1. Model of Cancer and Chemotherapy

In this section, a mathematical model of tumor growth dynamics, chemotherapy, and the interactions with the immune system will be presented. The model itself and the values of its nominal (mean-population) parameters will be adopted from [23].
The adopted model is not limited to a particular chemotherapeutic drug or cancer subtype. It focuses on the dynamics of a single solid tumor, without accounting for further proliferation or metastatic development. This standard phenomenological model describes the dynamics of four state variables representing the following quantities in a patient with cancer:
  • N ( t ) natural killer cell population
  • L t cytotoxic T cells population
  • T ( t ) tumor cell population
  • u ( t ) amount of anticancer drug in the tumor site
Natural killer (NK) cells are a key component of the innate immune system, responsible for detecting and eliminating abnormal cells, including tumor cells and virus-infected cells, without prior sensitization. Unlike adaptive immune cells, NK cells do not rely on antigen recognition but instead use a balance of activating and inhibitory receptors to distinguish between healthy and malignant cells. When NK cells recognize a target cell with reduced major histocompatibility complex (MHC) class I expression—a common feature of tumor cells—they release cytotoxic granules containing perforin and granzymes, leading to cell lysis.
Cytotoxic T lymphocytes (CTLs), also known as CD8+ T cells, are key players in the adaptive immune system and play a vital role in tumor immunosurveillance. Unlike natural killer (NK) cells, CTLs must be activated through antigen presentation by antigen-presenting cells (APCs) via MHC class I molecules. Once activated, they recognize and attach to tumor cells displaying tumor-specific or tumor-associated antigens. This leads to the induction of apoptosis in target cells, either through the release of perforin and granzymes or by engaging death receptor signaling pathways.
The considered mathematical model is defined by differential equations [23]
d N d t t = N t a 1 b N t α 1 T t k N u t , d L d t t = r N ( t ) T ( t ) L ( t ) μ + β 1 T t + k L u ( t ) , d T d t t = T ( t ) c 1 d T t α 2 N t β 2 L t k T u ( t ) , d u d t t = v t ω u t .
Input v t R represents the rate of chemotherapy administration.
The state variables form the state vector x t R 4 defined as
x t =   N ( t ) L ( t ) T ( t ) u t T .
The adopted model is based on the following assumptions [23]:
  • Tumor cell populations are reduced by both immune effector cells and chemotherapy.
  • Effector cell numbers decline due to natural degradation, their consumption in tumor cell destruction, and the impact of chemotherapy.
  • Chemotherapeutic agents influence both tumor cells and immune effector cells through a mass-action process.
  • Administering the drug at a higher constant rate may lead to a greater reduction in both tumor cells and immune effector cells.
The growth of natural killer cells N ( t ) follows a logistic model as a N ( t ) 1 b N t , while their suppression occurs through interactions with tumor cells, defined by α 1 T t N ( t ) . Cytotoxic T cells L ( t ) are activated only in response to tumor presence while being generated via a bilinear term r N ( t ) T ( t ) . Their natural decline follows a linear process μ L ( t ) , and they are further inactivated upon interaction with tumor cells, represented by β 1 T t L ( t ) . Tumor cell dynamics T ( t ) is modeled using a logistic growth function c T ( t ) 1 d T t . Tumor elimination occurs through the action of both NK cells and cytotoxic T cells, with respective terms α 2 N t T ( t ) and β 2 L t T ( t ) accounting for their effects.
The chemotherapy drug u ( t ) follows first-order pharmacokinetics and influences all three cell populations—NK cells, cytotoxic T cells, and tumor cells—via a mass-action mechanism. Each cell type exhibits a specific susceptibility to the drug, characterized by distinct mortality rates with coefficients k N , k L , and k T , respectively.
The parameters of the model defined by (1) can be gathered into the parameter vector θ R 14 as
θ = a b c d r μ α 1 α 2 β 1 β 2 ω k N k L k T T .
The nominal values of the model parameters and their biological interpretations are summarized in Table 1. Considering these nominal parameters, we define the nominal parameter vector θ ¯ R 14 as
θ ¯ = a ¯ b ¯ c ¯ d ¯ r ¯ μ ¯ α ¯ 1 α ¯ 2 β ¯ 1 β ¯ 2 ω ¯ k ¯ N k ¯ L k ¯ T T .
Each patient developing cancer is unique, hence the parameter vector θ typically varies significantly between the subjects within the population. This makes the evolution of state variables hard to predict without model personalization for the particular patient and their cancer phenotype.
Because model (1) is relatively broadly applicable to various chemotherapeutic agents and cancer subtypes, if the focus is on a single solid tumor, this motivates personalization by tailoring the model parameters to reflect the specific characteristics of the cancer type, the patient’s physiological conditions, and the properties of the administered chemotherapeutic drug. Key characteristics of the chemotherapeutic agent are encapsulated in the coefficients k N , k L , k T , which quantify its selectivity in targeting specific cells. The aggressiveness of the cancer is characterized by parameter c , representing the tumor growth rate. Meanwhile, the immune system’s effectiveness in combating the tumor is determined by a complex interplay of additional model parameters such as the growth rate of natural killer cells a , the activation rate of cytotoxic T cells r , capturing the dynamic interactions between the tumor and the immune response.
The roles of this mathematical model are the following:
  • To substitute for in vivo data by providing the simulated state responses for various parameters, dose sizes, and initial conditions when creating the training and validation datasets.
  • To provide the simulated state responses when evaluating the cost function of the chemotherapy dosing protocol optimization.
Since the model in (1) is nonlinear, the state responses N t ,   L t ,   T ( t ) cannot be obtained analytically. Because of this, the only way to obtain the state responses N t ,   L t ,   T ( t ) is by numerical solution of the differential equations given by (1), using Runge–Kutta methods [51] for numerical integration.

2.2. Statistical Properties of the Model Parameters

Consider that the parameter vector θ given by (3) is a random variable following a multivariate normal distribution, which describes the statistical properties related to the inter-subject parametric variability in the population of patients with cancer.
The mean value is given by vector θ ¯ as
θ ¯ = E θ .
The second statistical characteristic is the covariance matrix Q R 14 × 14 defined as
Q = c o v θ , θ = E θ θ ¯ θ θ ¯ T .
The diagonal elements of Q represent the variances of the individual parameters, hence defining the measure of parametric uncertainties. The off-diagonal elements represent the covariances between pairs of parameters, which quantify the degree to which these parameters change together.
To generalize the theoretical analysis, we will introduce a new dimensionless random vector ξ R m following the standard normal distribution, which can be noted as
E ξ = 0 ,     c o v ξ , ξ = E ξ ξ T = I .
Then, the parameter vector θ will be related to random vector ξ via matrix R R 14 × m and the nominal vector θ ¯ as
θ = d i a g θ ¯ R ξ + θ ¯ .
Dimension m N represents the number of random factors affecting the parameter vector θ . According to (8), the covariance matrix Q defined by (6) obtains
Q = E θ θ ¯ θ θ ¯ T = d i a g θ ¯ R R T d i a g θ ¯ .
Considering the confidence level α , the random vector ξ will lie in the symmetrical confidence interval
1 r 1 α 2 ξ 1 r 1 α 2 ,
with 1 α probability, where r 1 α 2 is the   1 α 2 quantile of the standard normal distribution and 1 = 1 1 1 T is the vector of ones.
Based on (8), we can claim that the parameter vector θ lies in the confidence interval
d i a g θ ¯ R 1 r 1 α 2 + θ ¯ θ d i a g θ ¯ R 1 r 1 α 2 + θ ¯ .
Then, the absolute interval parametric uncertainty obtains
θ = d i a g θ ¯ R 1 r 1 α 2 .
Equation (12) will be used to determine the entries of matrix R based on the assumptions about the relative interval parametric uncertainties given in percent by θ i θ ¯ i × 100 as
R i j = θ i θ ¯ i 1 r 1 α 2 ,
where i = 1 , 2 , , 14 is the index of parameter and j = 1 , 2 , , m , which increments if θ i θ ¯ i 0 .

2.3. Parametrization of the State Responses

It is an important fact that the sequences of sampled state responses N k T s , L k T s ,   T ( k T s ) for k = 1 , 2 , , n and the sample time T s cannot be effectively used as inputs to the MLP NN. Although theoretically possible, it is very unpractical due to the dimensions, i.e., the large number of inputs to the MLP NN. Besides that, it is very likely that the MLP NN would not be able to sufficiently generalize the information comprised in such raw data and thus provide a reliable parameter estimate.
Therefore, the proposed strategy will be to extract the features of the sampled state responses N k T s , L k T s ,   T ( k T s ) by parametrizing them. The parametrization technique will be based on an approximation of the state responses by a parametric analytical function while determining its parameters in terms of linear regression. After the parametrization, the sampled state responses are described by vectors p N R n p , p L R n p , p T R n p of n p parameters as
N k T s ,   k = 1 , 2 , , n p a r a m e t r i z a t i o n p N = p N 1 p N 2 p N n p T , L k T s ,   k = 1 , 2 , , n p a r a m e t r i z a t i o n p L = p L 1 p L 2 p L n p T , T k T s ,   k = 1 , 2 , , n p a r a m e t r i z a t i o n p T = p T 1 p T 2 p T n p T .
This approach will lead to a significant reduction in the training complexity and the memory footprint since n p n .
Consider a parametrizable weighted sum of exponential functions X ~ t : R R as
X ~ t = p 1 e a 1 t + p 2 e a 2 t + p 3 e a 3 t p n p 1 e a n p 1 t + p n p ,
where p 1 , p 2 , …, p n p are the function parameters to be fitted and a 1 , a 2 , , a n p 1 < 0   are appropriately chosen tuning coefficients.
This function will approximate signal X ( t ) , which substitutes for the state responses N t , L t , T t available in the form of finite sampled sequences N k T s , L k T s ,   T ( k T s ) for k = 1 , 2 , , n . The absolute term p n p in (15) is involved to reflect nonzero steady states of model (1), allowing for a nonzero finite limit lim t X ~ ( t ) = p n p .
Equation (15) can be noted in linear form
X ~ ( t ) = e a 1 t e a 2 t e a 3 t e a n p 1 t 1 p 1 p 2 p 3 p n p 1 p n p = h T p ,
where h R n p is the regression vector.
Based on (16), we can formulate the linear regression problem
e a 1 T s e a 2 T s e a 3 T s e a n p 1 T s 1 e a 1 2 T s e a 2 2 T s e a 3 2 T s e a n p 1 2 T s 1 e a 1 3 T s e a 2 3 T s e a 3 3 T s e a n p 1 3 T s 1 e a 1 n T s e a 2 n T s e a 3 n T s e a n p 1 n T s 1 p 1 p 2 p 3 p n p 1 p n p = X T s X 2 T s X 3 T s X n T s , H p = X ,
where H R n × n p is the regression matrix and X R n is the vector of samples.
The parameter vector p will be obtained in terms of the least squares method analytically as [52]
p = H T H 1 H T X .

2.4. Artificial Neural Network

2.4.1. Structure of the Network

We chose the structure of multilayer perceptron artificial neural network having the output layer with linear activation function and one hidden layer comprising n n neurons with hyperbolic tangent sigmoid activation function [53]. The structure of this MLP NN is depicted in Figure 1.
The learning algorithm for training this MLP NN was the classical Levenberg–Marquardt error backpropagation [54]. The functionality of the MLP NN, including the output calculation, and the weights and biases adaptation (training) are implemented by the functions of Matlab Deep Learning Toolbox (R2023b, https://www.mathworks.cn/products/deep-learning.html) [55].

2.4.2. Inputs and Outputs

The input vector U to the MLP NN will be formed by vectors of parameters p N , p L , p T obtained by the parametrization of the state responses N t , L t , T t respectively, the initial state vector x ( 0 ) and the administered leading dose size D as
U = p N p L p T x ( 0 ) D .
If n p is the dimension of p N , p L , p T according to (14), then U R 3 n p + 5 .
The reason for providing the information on x ( 0 ) is to ensure parameter estimation under the assumption of an arbitrary initial state. By taking the initial state into account explicitly, the MLP NN will be capable of separating its effect on the state responses from the effect of parametric variations.
It is important to mention that by providing the dose size D and the initial state x ( 0 ) , which involves the initial drug amount u ( 0 ) , the information on the chemotherapy administered within the current and previous dosing periods (cycles) also enters the personalization. A significant (nonzero) dose D is mandatory to properly capture the effects of chemotherapy. This enables the parameters k N , k L , k T to be estimated.
The model personalization itself consists of evaluating vector U by the MLP NN to provide vector ξ ^ R m representing the estimate of ξ . Subsequently, the estimate of model parameter vector θ ^ is determined from ξ ^ according to (8).
The MLP NN can be seen as a nonlinear multivariate vector function N : R 3 n p + 5 R m [53].
ξ ^ = N U = W o tanh W h U + b h + b o ,
where t a n h x :   R n n R n n   is a vectorized hyperbolic tangent function, W h R n n × 3 n p + 5 , W o R m × n n are the matrices of weights in the hidden and the output layer, respectively, and b h R n n , b o R m are the vectors of biases in the hidden and the output layers, respectively. The number of neurons in the hidden layer n n was determined based on the heuristic rule of thumb m < n n < 3 n p + 5 , which is satisfactory for n n = m + 3 n p + 5 2 .

2.4.3. Generating the Training Dataset

The dataset for training of the MLP NN was generated according to the procedure described as follows. The parameters given by (3) corresponding to virtual oncology subjects were generated randomly while considering their statistical distribution as defined before. The relative interval of parametric uncertainties of the individual model parameters was chosen as
θ θ ¯ = 0.8 0.05 0.05 0 0 0.4 0.8 0 0 0 0.25 0.85 0.25 0.25 T .
Notice that in (21), only m = 9 of a total of 14 parameters in the parameter vector θ are considered uncertain, while the others remain nominal, i.e., fixed across the population. In particular, parameters a , b , c , μ , α 1 , ω , k N , k L , k T are considered uncertain and vary among the subjects. This choice is strongly supported by analyzing the magnitudes and the linear dependencies of the corresponding sensitivity functions of the state variables with respect to the parameters as presented in Appendix A.
The sensitivity analysis allowed us to understand the percentual changes in which parameters have an observable effect on the state variables. Then, we could adjust the relative interval parametric uncertainties (21) according to the magnitudes of the sensitivities (see Figure A1, Figure A2 and Figure A3) to ensure that their effects are realistically balanced. It means that if the state sensitivity to the i -th parameter is relatively low, the corresponding parametric uncertainty should be higher than that of a parameter with much greater sensitivity.
However, it is important to acknowledge that the uncertainties given by (21) were chosen without the support of clinical results or data, as these are not available in the literature or in the public domain. Experimentally determining the actual statistical distribution of the parameters is beyond the scope of this paper, as it requires large studies with real subjects.
Considering the 1 α = 0.95 confidence level, the entries of matrix R were determined according to (13) as
R = 0.1531 × 0.8 0 0 0 0 0 0 0 0 0 0.05 0 0 0 0 0 0 0 0 0 0.05 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0.4 0 0 0 0 0 0 0 0 0 0.8 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0.25 0 0 0 0 0 0 0 0 0 0.85 0 0 0 0 0 0 0 0 0 0.25 0 0 0 0 0 0 0 0 0 0.25 .
The structure of matrix R implies that the covariance matrix Q defined by (9) will be diagonal, which means that the random variations in the model parameters are not mutually correlated within the population.
To create the training dataset, there will be generated a total of 1500 different virtual patients. More specifically, for each of these subjects in the virtual population, a random vector ξ will be generated considering standard normal distribution (zero mean and unit covariance matrix). The corresponding parameter vector θ will be calculated according to formula (8), considering the mean θ ¯ given by the values in Table 1 and matrix R given by (22). After that, the states of each virtual patient will be simulated by numerically solving the differential Equation (1) using the Runge–Kutta 4-th order numerical integration method [51] to obtain the sampled state responses N k T s , L k T s ,   T ( k T s ) for a given T s and the dosing period (cycle duration) T D . This simulation (numerical solution) will be repeated for 100 different initial conditions and leading dose sizes D . The initial conditions N 0 , L 0 ,   T ( 0 ) will also be randomly generated, assuming the normal distributions with the mean values equal to the corresponding stable equilibrium states N 0 , L 0 , T 0 and the standard deviations are given as N 0 2 , L 0 2 , T 0 2 , respectively. The leading dose size D for each entry of the dataset was generated as a uniformly distributed random number from the interval 0.1–1.5. The training dataset thus includes M = 150,000 entries. After obtaining the simulated state responses, their parametrization will be performed to obtain the individual input vectors U defined by (19).
The full training dataset was divided randomly into three subsets within the Matlab Deep Learning Toolbox: 70% for training, 15% for validation, and 15% for testing.

2.4.4. Validation by the Covariance Matrix of the Estimate Error

The aim of the validation is to assess the personalization performance of the trained MLP NN using new data. The validation dataset with M = 150,000 entries was generated randomly by following the same procedure as in the case of the training dataset but with a different seed for the random number generator.
By evaluating the output of the MLP NN for all entries of the validation dataset, the estimates ξ ^ i will be determined according to (20) for i = 1 , 2 , , M and the estimated residuals will be calculated as ξ ^ i ξ i .
The validation and performance assessment will be carried out in a statistical framework by analyzing the covariance matrix of the estimated residuals. The covariance matrix of the estimated residual P R m × m will be defined as
P = c o v ξ ^ ξ , ξ ^ ξ = E ξ ^ ξ ξ ^ ξ T .
From the statistical perspective, personalization should minimize the variances of the residuals ξ ^ ξ , while ensuring statistical unbiasedness.
In practice, P will be obtained as the sample covariance matrix P ^ R m × m considering M entries of the validation dataset as [56]
P ^ = c o v ξ ^ ξ , ξ ^ ξ = 1 M 1 i = 1 M ξ ^ i ξ i ξ ^ i ξ i T .
Since ξ follows the standard normal distribution with c o v ξ , ξ = I as defined by (7), the personalization is successful if the variances of the residuals are significantly lower than one. This can be noted as
d i a g P ^ 1 .
The above can be interpreted such that the aim of personalization is not to provide the exact values of the unknown model parameters but to lower their uncertainties. Another condition is that the mean value of the residuals must be close to zero to ensure the estimate unbiasedness, which can be noted as
E ξ ^ ξ 0 .

2.4.5. Confidence Interval of the Parameter Estimate

To be able to assess the accuracy of the estimated dimensionless vector ξ ^ or the estimated parameter vector θ ^ , which both represent the point estimates, it is useful to determine their confidence intervals.
Considering the confidence level α and the sample covariance matrix P ^ , the confidence interval of ξ obtains
ξ ^ + d i a g P ^ r α 2 ξ ξ ^ + d i a g P ^ r 1 α 2 . ξ ^ d i a g P ^ r 1 α 2 ξ ξ ^ + d i a g P ^ r 1 α 2 .
The confidence interval is symmetrical because r α 2 = r 1 α 2 for the normal distribution. Then, considering (8), we can find the confidence interval of the model parameter vector θ as
θ ¯ + d i a g θ ¯   R ξ ^ d i a g P ^ r 1 α 2 θ θ ¯ + d i a g θ ¯   R ξ ^ + d i a g P ^ r 1 α 2 θ ¯ + θ ^ d i a g θ ¯   R d i a g P ^ r 1 α 2 θ θ ¯ + θ ^ + d i a g θ ¯   R d i a g P ^ r 1 α 2
Based on the above, the interval uncertainty of the estimated parameter vector θ ^ obtains
θ ^ = d i a g θ ¯   R d i a g P ^ r 1 α 2 .
It can be claimed that the personalization should provide the parameter estimate with the uncertainty θ ^ satisfying
θ ^ θ .
Substituting θ according to (12) and θ ^ according to (29) yields
d i a g θ ¯   R d i a g P ^ r 1 α 2 d i a g θ ¯ R 1 r 1 α 2 ,
which confirms (25).

2.5. Nonlinear Least Squares Method to Minimize the Model Response Error

A more conservative approach to the personalization of model (1) is the least squares minimization of the model simulation error with respect to the data in the form of sampled state responses. This approach considers a quadratic cost function to minimize the sum of the squared errors/residuals between the provided sampled states N k T s ,   L k T s ,   T ( k T s ) and the fitted model responses N ^ k T s , ξ ^ ,   L ^ k T s , ξ ^ ,   T ^ ( k T s , ξ ^ ) obtained by numerical solution of (1). The corresponding cost function has the form
J L S ξ ^ = k = 1 n λ N N ^ k T s , ξ ^ N k T s , ξ 2 + λ L L ^ k T s , ξ ^ L k T s , ξ 2 + λ T T ^ k T s , ξ ^ T k T s , ξ 2 ,
where λ N > 0 , λ L > 0 , λ T > 0 are the weights.
It is important to remark that since the model responses N ^ k T s , ξ ^ , L ^ k T s , ξ ^ , T ^ k T s , ξ ^ are not linear with respect to the estimated vector ξ ^ , the linear regression cannot be used to find ξ ^ minimizing (32). This also means that the cost function given by (32) cannot be written as a quadratic form with respect to ξ ^ and minimized analytically, implying a nonlinear least squares problem. Therefore, the estimate ξ ^ must be determined by numerical optimization using some gradient-free algorithm such as the Nelder-Mead [57] or genetic algorithms [58]. It suggests that the main drawback of this conventional approach is the computational complexity of the personalization itself, which is significant due to the need for numerical optimization involving repetitive evaluation of cost function given by (32) and numerical solution of the model differential equations.
This contrasts with the parametrization of the state responses and calculating the output of the MLP NN in the proposed approach. Therefore, it can be concluded that despite the significant burden involved in generating the training dataset and in the process of training itself, the proposed approach is superior in terms of the computational complexity of personalization.

2.6. Personalized Chemotherapy

In this section, the chemotherapy dosing protocol will be optimized considering the mathematical model given by (1) with its parameters estimated by the proposed personalization approach.

2.6.1. Target Steady State

The steady states of the personalized model must be analyzed to find an appropriate target state for the therapy. For t and v t = 0 , the state variables will approach their steady values N 0 ,   L 0 ,   T 0 . Given the fact that we are dealing with a nonlinear system, it is typical to have multiple steady states, each with a specific region of attraction. Convergence of the states to one of the stable steady states depends on the initial state x ( 0 ) . Determining the steady states is crucial for the chemotherapy dosing design as we need to specify the target (reference) state representing the cancer in remission. The steady states can be determined as the solutions of a system of algebraic equations resulting from (1) under the assumption of d N d t = d L d t = d T d t = 0 .
Another important aspect is that since the chemotherapy dosing will terminate after completing the treatment protocol, it is necessary to ensure that the state variables remain stable after reaching the vicinity of the target steady state. Therefore, satisfying the criterion of local stability for the chosen target steady state is necessary for the success of therapy and inducing remission. Otherwise, the cancer would relapse, and the tumor would regrow. By studying the Jacobian matrix, we can determine the local stability of a steady state based on its eigenvalues. If all eigenvalues have negative real parts, the equilibrium point is locally stable.
The Jacobian matrix A ( x , θ ) R 4 × 4 is given by
A x , θ = a 1 2 b N α 1 T k N u 0 α 1 N k N N r T μ + β 1 T + k L u r N β 1 L k L L α 2 T β 2 T c 1 2 d T α 2 N β 2 L k T u k T T 0 0 0 ω .
There are four different clinical scenarios for the steady states. The first is the Dead state represented by all state variables being zero, which means that the tumor is gone, but the immune system is also depleted. This state is undesired from the clinical perspective. The ideal condition is the Cured state having zero tumor cells and enough natural killer cells. However, this state is typically unstable and hence cannot be maintained. The worst scenario is the Grown state manifesting by a large number of tumor cells, representing the final stage of cancer. Therefore, the so-called Coexisting state seems to be the best option as it corresponds to cancer in remission having a small number of T cells and enough N cells. If one of the Coexisting states satisfies the local stability criterion, it will be chosen as the target state for the treatment.

2.6.2. Personalization and Need for the Leading Dose

Before optimizing the chemotherapy dosing protocol tailored for the patient, the model needs to be personalized. However, to personalize the model, we need the sampled state responses from one dosing period (cycle), which means that the first dose has already been administered. This order of operations results in a causality paradox. As a consequence, the first administered dose D 0 , called the leading dose, cannot be personalized. After the first dosing period elapsed, the sampled state responses are available and thus the model personalization can be carried out. Finally, the remaining doses of the dosing protocol will be optimized based on the personalized model. It can be concluded that the personalization of chemotherapy cannot be achieved without the data including the subject’s response to a single chemotherapy administration.

2.6.3. Optimization Problem

The aim of cancer treatment is to minimize the tumor cell population T ( t ) , maintain a healthy population of natural killer cells N ( t ) while minimizing the total amount of chemotherapy administered during the treatment to reduce its adverse effect on healthy cells.
The objective of designing the chemotherapy dosing protocol is to ensure the transition of the state variables from their initial values N 0 ,   L 0 ,   T ( 0 ) to the coexisting state N 0 , L 0 , T 0 , which represents cancer remission [59]. The therapy is implemented as a finite sequence of n d individual (discrete) doses D i , delivered periodically as short, uniformly spaced impulses (boluses) with a period of T D days. Each dose is represented as a rectangular pulse of the administration rate signal v ( t ) , lasting 0.1 × T D and having an amplitude of D i × 10 T D , ensuring that the pulse’s area under the curve equals D i .
The aim of the design is to find vector D R n D of n D positive doses
D = D 1 D 2 D 3 D n D T ,
minimizing the criterion
J D D = k = 1 n D × n w N N k T s , D , θ ^ N 0 2 + w L L k T s , D , θ ^ L 0 2 + w T T k T s , D , θ ^ T 0 2 + w u u k T s , D , θ ^ 2 ,
where w N > 0 , w L > 0 , w T > 0 , w u > 0 are the weighting coefficients. Since D represents positive quantities of chemotherapy to be administered, the additional constraints are imposed as D > 0 . Because the responses N k T s , D , θ ^ , L k T s , D , θ ^ , T k T s , D , θ ^ are not linear with respect to the decision vector D , the traditional quadratic programming problem cannot be formulated. Therefore, given that our problem is finite-dimensional, we opt for a numerical optimization while N k T s , D , θ ^ , L k T s , D , θ ^ , T k T s , D , θ ^ , u k T s , D , θ ^ are obtained by the numerical solution of differential equations given by (1) using the 4th-order Runge–Kutta method.
To determine the vector D that minimizes (35), numerical optimization was performed using the fmincon function available in MATLAB Optimization Toolbox (R2023b, https://www.mathworks.cn/products/optimization.html) [60]. This function implements the interior-point method to solve constrained numerical optimization problems [61,62].
Interior-point methods represent a class of optimization algorithms that focus on exploring the interior of the feasible region. They use a barrier function (typically logarithmic) to penalize points near the boundary, ensuring iterates remain within the feasible region defined by inequality constraints. The barrier parameter is gradually reduced, allowing the solution to approach the boundary as it converges to the optimum. These methods leverage a central path, a trajectory defined by the solutions of the barrier-augmented optimization problem as the barrier parameter is reduced, to approach the optimal solution satisfying the primal and dual conditions (Karush–Kuhn–Tucker conditions). In the nonlinear least-squares optimization problem considered, where explicit gradients and Hessians are unavailable, the optimizer employs finite-difference approximations for gradients and approximates the Hessian matrix using a quasi-Newton method, specifically the Broyden-Fletcher-Goldfarb-Shanno (BFGS) method.
Another aspect of personalized therapy lies in the patient’s state at the beginning of the treatment (initial state) x 0 . However, contrary to unknown parameters θ , the initial state x 0 can be determined by performing the corresponding medical diagnostic procedures (blood analysis, histology, medical imagining techniques).
The whole process of data acquisition, state response parametrization, model personalization, and personalized optimization of chemotherapy doses is visualized in Figure 2.

3. Results

In this section, numerical experiments will be conducted to demonstrate the effectiveness of the proposed methodology while assessing the accuracy of the model personalization and the performance of the personalized optimal chemotherapy dosing protocol compared to one designed based on the nominal model.
Within this experiment, we will consider the dosing period (cycle duration) T D = 40 days and the sample time T s = 1 / 4 of the day or T s = 1 day, which implies n = 160 or n = 40 samples of the state vector. Evaluating the performance of the proposed methodology with an extended sampling interval reduces the number of data points available for personalization. This infrequent sampling represents a significant practical challenge in clinical applications.
It is also important to mention that changing the sample time T s does not require re-training of the MLP NN as it affects only the parametrization of the sampled state responses while the number of parameters n p describing them would remain the same.
The proposed methodology will be compared with the nonlinear least squares-based model response error minimization as described in Section 2.5. The weights in the cost function given by (32) were empirically tuned as
λ N = 5 × 10 4 ,   λ L = 10 4 ,   λ T = 10 2 .

3.1. Personalization of the Model Parameters

For the parametrization of the state responses, the number of parameters was chosen as n p = 8 , which implies U R 28 . The tuning coefficients (exponents) a i in (15) were chosen as a 1 a 2 a 3 a 4 a 5 a 6 a 7 = 100 30 10 5 1 0.5 0.1 .
Processing the validation dataset and evaluating the corresponding statistics yields the mean value of the residuals ξ ^ ξ and the diagonal entries of the covariance matrix P ^ of the residuals as given in Table 2.
For better interpretation, the relative interval uncertainty of the original parameter vector ± θ θ ¯ obtained according to (12) and the relative interval uncertainty of the estimated (personalized) parameter vector ± θ ^ θ ¯ obtained according to (29) are visualized in Figure 3.
To ensure comprehensive validation, the comparison between the state responses of the estimated model N ^ k T s , ξ ^ i , L ^ k T s , ξ ^ i , T ^ k T s , ξ ^ i and the reference model N k T s , ξ i , L k T s , ξ i , T k T s , ξ i was repeated for each entry (subject) in the validation dataset. Then, the root mean squared error (RMSE) metric has been evaluated independently for each state variable as
R M S E N = 1 M 1 n i = 1 M k = 1 n N ^ k T s , ξ ^ i N k T s , ξ i 2 , R M S E L = 1 M 1 n i = 1 M k = 1 n L ^ k T s , ξ ^ i L k T s , ξ i 2 , R M S E T = 1 M 1 n i = 1 M k = 1 n T ^ k T s , ξ ^ i T k T s , ξ i 2
where M is the number of entries in the validation dataset and n is the number of samples. The obtained results are summarized in Table 3.
As a demonstrative case study, consider a particular subject with vector ξ given as
ξ = 0.388 1.137 0.711 0.968 0.482 1.183 0.430 0.400 1.132 T .
For this subject, the actual coexisting stable steady state is
N 0 = 2.9035 × 10 5 , L 0 = 14.940 × 10 5 , T 0 = 7.6286 × 10 5 .
Considering the initial conditions and the dose size given by (40) for this subject will result in the state responses N k T s , L k T s ,   T ( k T s ) used for the model personalization.
N 0 = 1 × 10 5 , L 0 = 20 × 10 5 , T 0 = 50 × 10 5 ,   u 0 = 0 ,   D = 1
The next step is the parametrization of the state responses. The accuracy of the parametrization is visualized by comparing the original state responses N t , L t , T ( t ) with their parametric approximations N ~ t , L ~ t , T ~ ( t ) for T s = 1 / 4 in Figure 4.
Evaluating vector U obtained by parametrizing the state responses by the trained MLP NN according to (20) results in
ξ ^ = 0.5588 1.2151 0.5866 1.0861 0.3160 1.1409 0.1058 0.3675 1.0203 T .
The estimated model parameters θ ^ determined from ξ ^ according to (8) and their corresponding confidence interval θ ^ determined according to (29) are compared with the actual parameter vector θ in Figure 5.
The coordinates of the target coexisting state were estimated from the personalized model as
N ^ 0 = 2.9029 × 10 5 , L ^ 0 = 15.015 × 10 5 , T ^ 0 = 8.0630 × 10 5 .
The reliability of the personalized models was assessed by comparing their dynamic responses to the reference state responses used during the personalization process, providing validation within the time domain. A comparison of the reference state responses N t , L t , T ( t ) considering vector θ and the responses N ^ t , L ^ t , T ^ ( t ) obtained using the estimated vector θ ^ can be seen in Figure 6.
Considering the identical state responses as in the previous scenarios, numerical minimization of (32) yields
ξ ^ = 0.178 0.681 0.005 1.165 0.600 1.822 0.188 0.816 0.575 T .
A corresponding visual comparison of the estimated parameters can be seen in Figure 7.
The coordinates of the target coexisting state were estimated from the personalized model as
N ^ 0 = 2.9478 × 10 5 , L ^ 0 = 14.677 × 10 5 , T ^ 0 = 7.0081 × 10 5 .
A comparison of the reference state responses N t , L t , T ( t ) considering vector ξ and the simulated responses N ^ t , L ^ t , T ^ ( t ) obtained using the estimated vector ξ ^ can be seen in Figure 8.

3.2. Personalized Optimal Chemotherapy

Now, having the model personalized according to the responses obtained after the leading dose within the first dosing period, the optimal chemotherapy dosing protocol will be determined considering n D = 6 optimized doses. The weighting coefficients w N , w L , w T , w u were tuned to ensure that all four terms in (35) have comparable magnitudes. This is essential to avoid significant imbalances and overfitting, especially when one term disproportionately outweighs the others. Ensuring penalties of comparable magnitudes promotes a balanced optimization process, allowing each component’s role to be appropriately weighted in attaining the targeted therapeutic outcome.
w N = 10 5 ,   w L = 10 2 ,   w T = 1 ,   w u = 10 15
The state of the subject at the beginning of the optimized treatment is equivalent to the state after elapsing the first dosing period (cycle), which is
N T D = 3.0357 × 10 5 , L T D = 1.6863 × 10 5 , T T D = 1.3135 × 10 4 ,   u T D = 0.001 .
The optimization of chemotherapy will be performed considering the personalized model parameters (41) while the target steady state will be given by (42). By minimizing criterion (35), the sequence of chemotherapy doses was obtained as
D = 0.2263 0.1413 0.0302 0.1039 0.0921 0.0769 T .
To ensure objectiveness, the designed personalized treatment protocol will be simulated considering the actual model parameters given by (38) and not the estimated ones. The evolution of the states obtained by applying the designed chemotherapy dosing protocol is visualized in Figure 9 and the resulting value of criterion is J D D = 2.0653 × 10 16 .
Considering the nominal model parameters, by performing the optimization of the treatment by minimization of criterion (35), the sequence of chemotherapy doses was obtained as
D = 0.7620 0.5804 0.4187 0.4610 0.2235 0.2296 T .
Also in this scenario, the designed nominal treatment protocol will be simulated considering the actual model parameters given by (39). The evolution of the states obtained by applying the designed chemotherapy dosing protocol is visualized in Figure 10 while the resulting criterion is J D D = 3.6283 × 10 16 .

4. Discussion

The first portion of the results dealt with the statistical assessment of the model personalization accuracy. This assessment was substantiated by the mean value and the covariance matrix of the parameter estimate residuals as given in Table 2. Since the mean value of the residuals is relatively close to zero, it can be claimed that the personalization does not produce statistically biased estimates of the parameters. Another important result to highlight is that the diagonal entries of the covariance matrix of the estimate residuals satisfy the variance condition given by (25) with a large margin. This proves that personalization significantly lowers the uncertainties of the model parameters. For better interpretation, the relative interval uncertainty of the original parameter vector ± θ θ ¯ obtained according to (12) and the relative interval uncertainty of the estimated (personalized) parameter vector ± θ ^ θ ¯ obtained according to (29) were visualized in Figure 3, demonstrating the shrinking of the uncertainty interval after personalization.
It also implies that the trained MLP NN captures and sufficiently generalizes the relationships between the variations in the parameters, the initial state, dose size, and the state responses to a leading dose of chemotherapy.
Besides comparing the estimated parameters themselves, the personalized models were validated by comparing their responses with the reference state responses used for personalization. This can be seen as a validation of the personalized model in the time domain. Such a comparison is documented in Figure 6 and Figure 8. By studying these figures, it can be concluded that the personalized models provided an almost perfect match of the state responses.
To confirm this observation, the root mean squared error was computed by comparing state responses between the estimated and reference models for each subject in the validation dataset, separately for each state variable N ( t ) , L ( t ) , T ( t ) as documented in Table 3. These results revealed that for the sample time T s = 1 / 4 the metrics R M S E N = 2.2891 × 10 4 , R M S E L = 4.4498 × 10 4 , R M S E T = 5.4686 × 10 5 are sufficient, especially if taken relative to the typical magnitudes of the state responses, which have average values N ¯ t = 2.4965 × 10 5 , L ¯ t = 1.4522 × 10 6 and T ¯ t = 3.9565 × 10 6 . This indicates that the mean error of the state responses provided by the personalized model is only a few percent R M S E N N ¯ t = 0.0922 , R M S E L L ¯ t = 0.0307 , R M S E T T ¯ t = 0.1382 and that the best accuracy was achieved for L ( t ) and the worst for T ( t ) .
In contrast, increasing the sample time to T s = 1 day resulted in higher R M S E N , R M S E L and R M S E T indicating slightly deteriorated, yet acceptable accuracy of the state responses prediction and good overall model validity under reduced sampling frequency. Increasing the sampling interval to T s = 1 day also led to marginally higher variances of the residuals of the estimated parameters and broader confidence intervals in Figure 3. Nonetheless, this analysis remains crucial, as frequent data collection may pose practical challenges in clinical settings.
For the sake of comparison with other relevant parameter identification approaches, the conventional least squares minimization of the model response error with respect to the reference state responses was evaluated for the validation dataset and the obtained results were quantified by statistical metrics as summarized in Table 2 and Table 3. This quantitative comparison demonstrated that the proposed machine learning approach generally achieves much better accuracy of parameter estimation and slightly lower accuracy of state response prediction than obtained by the conventional methodologies that minimize the model response error. Therefore, it can be concluded that the proposed approach provides more accurate parameter estimation, which enhances the general validity of the resulting models, even if it does not achieve the closest fit to the observed state responses. In contrast, although the prediction error methods yield a good fit between the model response and the data, its general validity is limited due to inaccuracies in parameter estimation.
Another important section of the results addresses a detailed case study focused on experimenting with a single particular patient. First, an in silico subject was randomly generated following the considered statistical distribution of the model parameters within the population. Then, the state responses to the suboptimal leading dose were obtained for the duration of one dosing period by simulating the model defined by (1) for the given initial conditions.
A comparison of these state responses with their approximations obtained by the proposed parametrization technique is given in Figure 4. It is evident that the state responses can be approximated very accurately by the function of an exponential nature and an absolute (constant) term. Nevertheless, it is important to remember that the aim of parametric approximation is not to achieve a perfect fit but rather to extract the features of the responses and thus describe them with a small set of parameters.
Another important result of the personalization accuracy analysis is the comparison between the actual instance of vector ξ given by (38) and its estimate ξ ^ given by (41). This is also visualized in Figure 5. for the parameter vector θ ^ . The results demonstrated that the parameters estimated from the finite sequence of state responses to the leading dose were of good accuracy, with all estimates falling within the confidence intervals. The relative (percentual) width of the confidence interval ± θ ^ i θ ¯ i is visibly different for each parameter depending on the diagonal entries of the covariance matrix P ^ following (38).
The conventional least squares minimization of the model simulation error yielded the estimates given by (43) and visualized in Figure 7. Based on these results, it can be claimed that the error of personalization is higher than, but still acceptable.
Another important part of the results analysis is the personalization of the steady states, which is ultimately required for the optimal chemotherapy design. Since the steady states are derived from the model parameters, their accuracy results from the accuracy of model personalization. The actual coexisting state is defined by (39), and the personalized models resulted in the estimated coexisting states as given by (42) and (44). We can claim that the most accurate estimate of the coexisting steady state was obtained by the proposed approach.
The remaining results addressed the problem of personalized optimization of the chemotherapy dosing protocol. Based on the model personalized from the patient’s response to the suboptimal leading dose, the sequence of six optimal chemotherapy doses was determined by numeric optimization, and this dosing protocol was applied to the virtual subject, resulting in the state responses plotted in Figure 9. These state responses demonstrate effective and relatively fast stabilization of all three variables to the target coexisting state representing cancer in remission. However, significant fluctuations are present at the beginning of the treatment, which are difficult to manage due to the fixed suboptimal leading dose and adverse initial conditions. It is also important to notice that the tumor responses showed periodic behavior. The observed periodicity means that the tumor size stably oscillates around the target state, which corresponds to the so-called Jeff’s phenomenon [7,20] observed in clinical practice. The personalized optimal treatment was compared with the suboptimal dosing protocol obtained by optimizing the response of the nominal (mean-population) model without tailoring the treatment to the characteristics of the subject. The suboptimal treatment responses in Figure 10 featured highly undesired fluctuations in the tumor cell population T ( t ) and the cytotoxic cell population L ( t ) , resulting in a higher value of criterion J D D . Besides that, the total amount of administered chemotherapy was significantly higher, implying that the personalized optimal treatment ensures the desired therapeutic outcomes with lower chemotherapy doses.
The paper presented a novel methodology for personalizing mathematical models of tumor growth dynamics, immune system interactions, and chemotherapy effects. By leveraging machine learning techniques and computer simulations, this study aimed to address the limitations of traditional approaches in chemotherapy dosing protocols, which often rely on mean-population models that fail to account for individual patient variability.
This study offers practical benefits for clinicians by providing a framework for guiding decision-making. By integrating machine learning, dynamic models, and optimization, clinicians can simply make more informed decisions. By incorporating patient-specific responses into the treatment protocol, it is possible to reduce the trial-and-error approach, ensuring that patients receive the most effective dose for their unique needs. Clinicians could also use this approach to assess how an individual patient is likely to respond to treatment and whether successful remission is even feasible, leading to a more proactive approach to cancer therapy.
The clinical workflow should be as follows:
  • Conventionally determine and administer the leading dose D 0 .
  • Determine the sampled patient responses N ( t ) , L ( t ) , T ( t ) via regular bloodwork and medical imagining tools during one dosing period (cycle).
  • Feed the obtained data to the computer implementing the proposed methodology.
  • The computer provides the following information to the clinician:
    • Estimated parameters of the tumor model.
    • Optimal sequence of chemotherapy doses.
    • Feasibility of cancer remission and the estimated final tumor size.
  • The clinician expertly assesses the results and finally determines the future steps and approves the treatment.
The methodology involved generating training and validation datasets by simulating a diverse population of virtual patients. The parameter vector for each patient was assumed to follow a multivariate normal distribution, capturing inter-subject variability. Sensitivity analysis showed that some parameters had negligible effects on the state variables, narrowing down the number of practically identifiable parameters to 9 out of 14. The trained neural network was then capable of estimating personalized model parameters based on clinical data from a single chemotherapy dose, facilitating more accurate predictions of tumor behavior and treatment outcomes.
Key innovations included the use of machine learning to personalize cancer model parameters and the application of parametric approximation of the state responses to simplify training data. The parametric approximation of sampled state responses by the sum of exponential functions reduced the complexity of the input data, ensuring feasible training of the MLP NN and better generalization capability. These advancements enable optimized chemotherapy dosing protocols tailored to individual patients, improving both treatment efficacy and safety. An important benefit of determining personalized steady states is the ability to predict cancer remission feasibility and estimate final tumor size.
The proposed approach has several limitations that should be acknowledged. First, the model personalization process requires administering an initial chemotherapy dose before estimating patient-specific parameters, which may introduce delays in optimizing the dosing protocol and pose risks to the patient. Additionally, once the neural network has been trained, the chemotherapy dosing period (cycle duration) remains fixed. Another challenge lies in the frequent sampling of tumor and immune cell populations, which may not always be feasible in clinical practice. Moreover, only parameters with a significant impact on tumor dynamics can be effectively personalized, potentially reducing the overall accuracy of the patient-specific model. Furthermore, a distinct neural network must be trained for each variation in the mathematical model, increasing the computational burden and limiting generalizability. The last drawback is that the computational complexity of generating the datasets and training the MLP NN is high. These limitations highlight the challenges that must be addressed for effective clinical implementation of the proposed approach.
Future research should focus on addressing the limitations of the proposed approach and enhancing its clinical applicability. One key direction is the development of adaptive dosing strategies that allow real-time therapy adjustments (optimization) based on continuous patient monitoring providing new data. Improving the feasibility of patient-specific data collection is another critical area, which could involve integrating non-invasive biomarkers or imaging techniques to estimate tumor and immune cell populations more efficiently. Further work is also needed to test different and more complex structures of cancer models. Clinical validation through retrospective and prospective trials will be essential to assess the effectiveness and safety of the personalized dosing protocol in real-world settings.

5. Conclusions

This study represents an important advancement in precision medicine for cancer treatment. The methodology demonstrated that optimized chemotherapy dosing protocols, personalized using an MLP NN, can lead to more effective and safer treatment outcomes by accounting for individual patient variability. The results showed that personalized chemotherapy dosing could effectively stabilize tumor cell populations, achieving cancer remission while minimizing the risks associated with conventional, one-size-fits-all dosing protocols. Furthermore, the findings demonstrated that relying on a nominal, non-personalized model often leads to suboptimal dosing strategies characterized by excessive drug administration. The trained MLP NN successfully captured the complex relationships between the initial tumor stage, chemotherapy dose, and model parameter variations.
A critical implication of this work is its potential to shift the paradigm of cancer treatment toward truly individualized therapy. By integrating computational modeling and machine learning into clinical decision-making, oncologists could develop personalized treatment strategies that adapt to patient-specific tumor responses. This would mark a departure from the traditional trial-and-error approach, leading to more precise and predictable therapeutic outcomes.

Author Contributions

Conceptualization, Z.V. and A.V.; methodology, Z.V. and A.V.; software, M.D.; validation, M.D., Z.V. and A.V.; formal analysis, M.D. and A.V.; investigation, Z.V.; writing—original draft preparation, M.D., Z.V. and A.V.; writing—review and editing, J.P. and E.M.; visualization, M.D.; supervision, J.P. and E.M.; funding acquisition, J.P. and E.M. All authors have read and agreed to the published version of the manuscript.

Funding

The research is supported by the grant VEGA 1/0229/24 Advanced Methods of Modeling, identification, and Control of Biosystems, granted by the Ministry of Education, science, development, and Sport of the Slovak Republic.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All relevant data associated with the manuscript is available at http://datadryad.org/stash/share/I9tEW9zqUgG4rjqGa8ZrkOUdlPNJxMH2WPVf6vrArTc (accessed on 20 December 2024).

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of this study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Appendix A

In this appendix, the sensitivity analysis of the state variables of model (1) with respect to the model parameters will be carried out. Consider the general nonlinear n -th order model defined by
d x ( t , θ ) d t = f x , θ + B u ( t ) ,
where x ( t , θ ) R n , f x , θ : R n R n , B R n , u ( t ) R and θ R p is the parameter vector.
Now consider function S i j ( t , θ ) : R R
S i j ( t , θ ) = d x i t , θ d θ j ,
which defines the sensitivity of the i -th state variable x i t , θ to the variations in the j -th parameter θ j .
The time derivative d S i j t , θ d t of the sensitivity function S i j t , θ can be derived as
d S i j t , θ d t = d f i x , θ d θ j + k = 1 n A i k ( x , θ ) S k j t , θ ,
where A i k ( x , θ ) are the entries of the Jacobian matrix (37).
By numerically solving differential equations d S i j t , θ d t for i = 1,2 , , n , and j = 1 , 2 , , p assuming S i j 0 , θ = 0 simultaneously with the differential equations of model (A1), the sensitivity functions S i j t , θ can be found.
Considering f x , θ defined by (1) and θ defined by (3), we can find the nonzero partial derivatives d f i x , θ d θ j as
d f 1 x , θ d θ 1 = N ( t ) 1 b N t , d f 1 x , θ d θ 2 = a N 2 ( t ) ,   d f 3 x , θ d θ 3 = T ( t ) 1 d T t ,   d f 3 x , θ d θ 4 = c T 2 ( t ) , d f 2 x , θ d θ 5 = T ( t ) N ( t ) ,   d f 2 x , θ d θ 6 = L ( t ) ,   d f 1 x , θ d θ 7 = T t N ( t ) ,   d f 3 x , θ d θ 8 = T t N ( t ) , d f 2 x , θ d θ 9 = L ( t ) T t , d f 3 x , θ d θ 10 = L t T ( t ) ,   d f 4 x , θ d θ 11 = u t ,   d f 1 x , θ d θ 12 = N ( t ) u ( t ) , d f 2 x , θ d θ 13 = L ( t ) u ( t ) ,   d f 3 x , θ d θ 14 = T ( t ) u ( t ) .
Now, considering the initial conditions given by (47), the state vector of the nominal model was simulated and S i j ( t , θ ) were determined by solving (A3). The evolution of individual sensitivities S i j ( t , θ ) scaled with respect to the given nominal parameter value θ ¯ j to obtain the sensitivity S i j ( t , θ ) θ ¯ j to relative variation in a parameter θ j θ ¯ j is visualized in Figure A1, Figure A2 and Figure A3.
Figure A1. Evolution of the sensitivity functions S 1 j ( t ) θ ¯ j for the nominal model.
Figure A1. Evolution of the sensitivity functions S 1 j ( t ) θ ¯ j for the nominal model.
Applsci 15 03145 g0a1
Figure A2. Evolution of the sensitivity functions S 2 j ( t ) θ ¯ j for the nominal model.
Figure A2. Evolution of the sensitivity functions S 2 j ( t ) θ ¯ j for the nominal model.
Applsci 15 03145 g0a2
Figure A3. Evolution of the sensitivity functions S 3 j ( t ) θ ¯ j for the nominal model.
Figure A3. Evolution of the sensitivity functions S 3 j ( t ) θ ¯ j for the nominal model.
Applsci 15 03145 g0a3
By analyzing the sensitivities S i j ( t , θ ) visualized in Figure A1, Figure A2 and Figure A3, one can make the following conclusions concerning the identifiability of the individual parameters. Sensitivities of all three state variables N t , L t , T t to the variations in the parameters θ 4 (denoted d ), θ 8 (denoted α 2 ), θ 9 (denoted β 1 ) are consistently very low compared to typical magnitudes of the states. This observation implies that inter-subject variations in these parameters in the order of tens of percent have a negligible effect on the state variables, rendering these parameters practically unidentifiable unless changed drastically (unrealistically). Besides that, by simply looking at the corresponding curves in Figure A1, Figure A2 and Figure A3 or by performing the multivariate linear regression of the sensitivity functions S i j ( t , θ ) , we found that the sensitivities to parameters, θ 5 (denoted r ), θ 10 (denoted β 2 ) can be approximated as linear combinations of the sensitivity functions of the other parameters, which means that the variations in parameters r , β 2 affect the state responses ambiguously making these parameters unidentifiable.

References

  1. Schilsky, R.L. Personalized Medicine in Oncology: The Future Is Now. Nat. Rev. Drug Discov. 2010, 9, 363–366. [Google Scholar] [CrossRef] [PubMed]
  2. Ginsburg, G.S.; Willard, H.F. Genomic and Personalized Medicine: Foundations and Applications. Transl. Res. 2009, 154, 277–287. [Google Scholar] [CrossRef]
  3. Garraway, L.A.; Jänne, P.A. Circumventing Cancer Drug Resistance in the Era of Personalized Medicine. Cancer Discov. 2012, 2, 214–226. [Google Scholar] [CrossRef]
  4. Hood, L.; Friend, S.H. Predictive, Personalized, Preventive, Participatory (P4) Cancer Medicine. Nat. Rev. Clin. Oncol. 2011, 8, 184–187. [Google Scholar] [CrossRef]
  5. Dienstmann, R.; Vermeulen, L.; Guinney, J.; Kopetz, S.; Tejpar, S.; Tabernero, J. Consensus Molecular Subtypes and the Evolution of Precision Medicine in Colorectal Cancer. Nat. Rev. Cancer 2017, 17, 79–92. [Google Scholar] [CrossRef] [PubMed]
  6. Tannock, I.F.; Hickman, J.A. Limits to Personalized Cancer Medicine. N. Engl. J. Med. 2016, 375, 1289–1294. [Google Scholar] [CrossRef] [PubMed]
  7. De Pillis, L.G.; Radunskaya, A. A Mathematical Tumor Model with Immune Resistance and Drug Therapy: An Optimal Control Approach. J. Theor. Med. 2001, 3, 318436. [Google Scholar] [CrossRef]
  8. de Pillis, L.G.; Gu, W.; Radunskaya, A.E. Mixed Immunotherapy and Chemotherapy of Tumors: Modeling, Applications and Biological Interpretations. J. Theor. Biol. 2006, 238, 841–862. [Google Scholar] [CrossRef]
  9. de Pillis, L.G.; Gu, W.; Fister, K.R.; Head, T.; Maples, K.; Murugan, A.; Neal, T.; Yoshida, K. Chemotherapy for Tumors: An Analysis of the Dynamics and a Study of Quadratic and Linear Optimal Controls. Math. Biosci. 2007, 209, 292–315. [Google Scholar] [CrossRef]
  10. Sápi, J.; Drexler, D.A.; Kovács, L. Potential Benefits of Discrete-Time Controller-Based Treatments over Protocol-Based Cancer Therapies. Acta Polytech. Hung. 2017, 14, 11–23. [Google Scholar] [CrossRef]
  11. Jameson, J.L.; Longo, D.L. Precision Medicine—Personalized, Problematic, and Promising. N. Engl. J. Med. 2015, 372, 2229–2234. [Google Scholar] [CrossRef] [PubMed]
  12. de Bono, J.S.; Ashworth, A. Translating Cancer Research into Targeted Therapeutics. Nature 2010, 467, 543–549. [Google Scholar] [CrossRef] [PubMed]
  13. Hamburg, M.A.; Collins, F.S. The Path to Personalized Medicine. N. Engl. J. Med. 2010, 363, 301–304. [Google Scholar] [CrossRef] [PubMed]
  14. Ashley, E.A. Towards Precision Medicine. Nat. Rev. Genet. 2016, 17, 507–522. [Google Scholar] [CrossRef]
  15. Mirnezami, R.; Nicholson, J.; Darzi, A. Preparing for Precision Medicine. N. Engl. J. Med. 2012, 366, 489–491. [Google Scholar] [CrossRef] [PubMed]
  16. Hood, L.; Flores, M. A Personal View on Systems Medicine and the Emergence of Proactive P4 Medicine: Predictive, Preventive, Personalized and Participatory. N. Biotechnol. 2012, 29, 613–624. [Google Scholar] [CrossRef]
  17. Peck, R.W. Precision Medicine Is Not Just Genomics: The Right Dose for Every Patient. Annu. Rev. Pharmacol. Toxicol. 2018, 58, 105–122. [Google Scholar] [CrossRef]
  18. Barretina, J.; Caponigro, G.; Stransky, N.; Venkatesan, K.; Margolin, A.A.; Kim, S.; Wilson, C.J.; Lehár, J.; Kryukov, G.V.; Sonkin, D.; et al. The Cancer Cell Line Encyclopedia Enables Predictive Modelling of Anticancer Drug Sensitivity. Nature 2012, 483, 603–607. [Google Scholar] [CrossRef] [PubMed]
  19. Topol, E.J. Individualized Medicine from Prewomb to Tomb. Cell 2014, 157, 241–253. [Google Scholar] [CrossRef]
  20. Borges, F.S.; Iarosz, K.C.; Ren, H.P.; Batista, A.M.; Baptista, M.S.; Viana, R.L.; Lopes, S.R.; Grebogi, C. Model for Tumour Growth with Treatment by Continuous and Pulsed Chemotherapy. Biosystems 2014, 116, 43–48. [Google Scholar] [CrossRef]
  21. Wu, X.; Hou, Y.; Zhang, K. Switched System Optimal Control Approach for Drug Administration in Cancer Chemotherapy. Biomed. Signal Process Control 2022, 75, 103575. [Google Scholar] [CrossRef]
  22. Wu, X.; Liu, Q.; Zhang, K.; Cheng, M.; Xin, X. Optimal Switching Control for Drug Therapy Process in Cancer Chemotherapy. Eur. J. Control 2018, 42, 49–58. [Google Scholar] [CrossRef]
  23. Song, G.; Liang, G.; Tian, T.; Zhang, X. Mathematical Modeling and Analysis of Tumor Chemotherapy. Symmetry 2022, 14, 704. [Google Scholar] [CrossRef]
  24. Qods, P.; Arkat, J.; Batmani, Y. Optimal Administration Strategy in Chemotherapy Regimens Using Multi-Drug Cell-Cycle Specific Tumor Growth Models. Biomed. Signal Process Control 2023, 86, 105221. [Google Scholar] [CrossRef]
  25. Sharifi, N.; Ozgoli, S.; Ramezani, A. Multiple Model Predictive Control for Optimal Drug Administration of Mixed Immunotherapy and Chemotherapy of Tumours. Comput. Methods Programs Biomed. 2017, 144, 13–19. [Google Scholar] [CrossRef]
  26. Ledzewicz, U.; Schättler, H. On the Optimal Control Problem for a Model of the Synergy of Chemo- and Immunotherapy. Optim. Control Appl. Methods 2024, 45, 575–593. [Google Scholar] [CrossRef]
  27. Ledzewicz, U.; Maurer, H.; Schättler, H. Bang-Bang Optimal Controls for a Mathematical Model of Chemo- and Immunotherapy in Cancer. Discret. Contin. Dyn. Syst. B 2024, 29, 1481–1500. [Google Scholar] [CrossRef]
  28. Watanabe, Y.; Dahlman, E.L.; Leder, K.Z.; Hui, S.K. A Mathematical Model of Tumor Growth and Its Response to Single Irradiation. Theor. Biol. Med. Model. 2016, 13, 6. [Google Scholar] [CrossRef]
  29. Schättler, H.; Ledzewicz, U.; Cardwell, B. Robustness of Optimal Controls for a Class of Mathematical Models for Tumor Anti-Angiogenesis. Math. Biosci. Eng. 2011, 8, 355–369. [Google Scholar] [CrossRef]
  30. Ledzewicz, U.; Schättler, H. Combination of Antiangiogenic Treatment with Chemotherapy as a Multi-Input Optimal Control Problem. Math. Methods Appl. Sci. 2022, 45, 3058–3082. [Google Scholar] [CrossRef]
  31. Ledzewicz, U.; Maurer, H.; Schättler, H. Optimal and Suboptimal Protocols for a Mathematical Model for tumor Anti-Angiogenesis in Combination with Chemotherapy. Math. Biosci. Eng. 2011, 8, 307–323. [Google Scholar] [CrossRef] [PubMed]
  32. Lowengrub, J.S.; Frieboes, H.B.; Jin, F.; Chuang, Y.-L.; Li, X.; Macklin, P.; Wise, S.M.; Cristini, V. Nonlinear Modelling of Cancer: Bridging the Gap between Cells and Tumours. Nonlinearity 2009, 23, R1. [Google Scholar] [CrossRef] [PubMed]
  33. Yin, A.; Moes, D.J.A.R.; van Hasselt, J.G.C.; Swen, J.J.; Guchelaar, H.-J. A Review of Mathematical Models for Tumor Dynamics and Treatment Resistance Evolution of Solid Tumors. Pharmacomet. Syst. Pharmacol. 2019, 8, 720–737. [Google Scholar] [CrossRef]
  34. Zvia Agur Karin Halevi-Tobias, Y.K.; Shlagman, O. Employing Dynamical Computational Models for Personalizing Cancer Immunotherapy. Expert. Opin. Biol. Ther. 2016, 16, 1373–1385. [Google Scholar] [CrossRef]
  35. Chen, Q.; Ye, Q.; Zhang, W.; Li, H.; Zheng, X. TGM-Nets: A Deep Learning Framework for Enhanced Forecasting of Tumor Growth by Integrating Imaging and Modeling. Eng. Appl. Artif. Intell. 2023, 126, 106867. [Google Scholar] [CrossRef]
  36. Avci, C.B.; Bagca, B.G.; Shademan, B.; Takanlou, L.S.; Takanlou, M.S.; Nourazarian, A. Machine Learning in Oncological Pharmacogenomics: Advancing Personalized Chemotherapy. Funct. Integr. Genom. 2024, 24, 182. [Google Scholar] [CrossRef]
  37. Patel, S.; Yadav, D.; Kumar, D. Integrating Machine Learning to Customize Chemotherapy for Oral Cancer Patients. Oral. Oncol. Rep. 2025, 13, 100711. [Google Scholar] [CrossRef]
  38. Patmanidis, S.; Charalampidis, A.C.; Kordonis, I.; Mitsis, G.D.; Papavassilopoulos, G.P. Comparing Methods for Parameter Estimation of the Gompertz Tumor Growth Model. IFAC-PapersOnLine 2017, 50, 12203–12209. [Google Scholar] [CrossRef]
  39. Patmanidis, S.; Charalampidis, A.C.; Kordonis, I.; Mitsis, G.D.; Papavassilopoulos, G.P. Tumor Growth Modeling: Parameter Estimation with Maximum Likelihood Methods. Comput. Methods Programs Biomed. 2018, 160, 1–10. [Google Scholar] [CrossRef]
  40. Nagy, E.; Czakó, B.; Siket, M.; Gombos, B.; Füredi, A.; Szakács, G.; Kovács, L.; Drexler, D.A. Tracking Parameter Changes of an Impulsive Tumor Growth Model. In Proceedings of the 2022 IEEE 10th Jubilee International Conference on Computational Cybernetics and Cyber-Medical Systems (ICCC), Reykjavík, Iceland, 6–9 July 2022; pp. 179–184. [Google Scholar]
  41. Nagy, E.; Drexler, D.A. The Effect of the Choice of Initial Estimation for a Tumor Model Parameter Estimation Problem. In Proceedings of the 2022 IEEE 22nd International Symposium on Computational Intelligence and Informatics and 8th IEEE International Conference on Recent Achievements in Mechatronics, Automation, Computer Science and Robotics (CINTI-MACRo), Tirgu Mures, Romania, 21–22 November 2022; pp. 227–232. [Google Scholar]
  42. Siket, M.; Eigner, G.; Drexler, D.A.; Rudas, I.; Kovács, L. State and Parameter Estimation of a Mathematical Carcinoma Model under Chemotherapeutic Treatment. Appl. Sci. 2020, 10, 9046. [Google Scholar] [CrossRef]
  43. Czakó, B.; Drexler, D.A.; Kovács, L. Time-Varying Parameter Identification of a Tumor Growth Model Using Moving Horizon Estimation. In Proceedings of the 2022 IEEE 26th International Conference on Intelligent Engineering Systems (INES), Crete, Greece, 12–15 August 2022; pp. 73–78. [Google Scholar]
  44. da Costa, J.M.J.; Orlande, H.R.B.; da Silva, W.B. Model Selection and Parameter Estimation in Tumor Growth Models Using Approximate Bayesian Computation-ABC. Comput. Appl. Math. 2018, 37, 2795–2815. [Google Scholar] [CrossRef]
  45. Puskás, M.; Drexler, D.A. Tumor Model Parameter Estimation for Therapy Optimization Using Artificial Neural Networks. In Proceedings of the 2021 IEEE International Conference on Systems, Man, and Cybernetics (SMC), Melbourne, Australia, 17–20 October 2021; pp. 1254–1259. [Google Scholar]
  46. Puskás, M.; Drexler, D.A. Parameter Identification of a Tumor Model Using Artificial Neural Networks. In Proceedings of the 2021 IEEE 19th World Symposium on Applied Machine Intelligence and Informatics (SAMI), Herl’any, Slovakia, 21–23 January 2021; pp. 443–448. [Google Scholar]
  47. Nagy, E.; Puskás, M.; Drexler, D.A. Comparison of Artificial Neural Network and ANFIS for Parameter Estimation of a Tumor Model. In Proceedings of the 2022 IEEE 20th Jubilee World Symposium on Applied Machine Intelligence and Informatics (SAMI), Poprad, Slovakia, 2–5 March 2022; pp. 133–140. [Google Scholar]
  48. Rasheed, A.; San, O.; Kvamsdal, T. Digital Twin: Values, Challenges and Enablers From a Modeling Perspective. IEEE Access 2020, 8, 21980–22012. [Google Scholar] [CrossRef]
  49. Gill, P.E.; Murray, W.; Wright, M.H. Practical Optimization; Academic Press: Cambridge, MA, USA, 1981; ISBN 9780122839504. [Google Scholar]
  50. Nocedal, J.; Wright, S.J. Numerical Optimization; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 2006; ISBN 978-0-387-30303-1. [Google Scholar]
  51. Butcher, J.C. Numerical Differential Equation Methods. In Numerical Methods for Ordinary Differential Equations; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2016; pp. 55–142. ISBN 9781119121534. [Google Scholar]
  52. Ljung, L. System Identification: Theory for the User, 2nd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 1999; ISBN 978-0136566953. [Google Scholar]
  53. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 2009; ISBN 978-0131471399. [Google Scholar]
  54. Hagan, M.T.; Demuth, H.B.; Beale, M.H. Neural Network Design; PWS Publishing: Boston, MA, USA, 1996; ISBN 978-0534943325. [Google Scholar]
  55. MathWorks MATLAB Deep Learning Toolbox. Available online: https://www.mathworks.cn/products/deep-learning.html (accessed on 18 February 2025).
  56. Gubner, J.A. Probability and Random Processes for Electrical and Computer Engineers; Cambridge University Press: Cambridge, UK, 2006; ISBN 978-0521864701. [Google Scholar]
  57. Press, W.H.; Teukolsky, S.A.; Vetterling, W.T.; Flannery, B.P. Numerical Recipes: The Art of Scientific Computing, 3rd ed.; Cambridge University Press: Cambridge, UK, 2007; ISBN 978-0521880688. [Google Scholar]
  58. Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Reading, MA, USA, 1989; ISBN 978-0201157673. [Google Scholar]
  59. Jarrett, A.M.; Faghihi, D.; Hormuth, D.A.; Lima, E.A.B.F.; Virostko, J.; Biros, G.; Patt, D.; Yankeelov, T.E. Optimal Control Theory for Personalized Therapeutic Regimens in Oncology: Background, History, Challenges, and Opportunities. J. Clin. Med. 2020, 9, 1314. [Google Scholar] [CrossRef] [PubMed]
  60. MathWorks MATLAB Optimization Toolbox. Available online: https://www.mathworks.cn/products/optimization.html (accessed on 18 February 2025).
  61. Nesterov, Y.; Nemirovskii, A. Interior-Point Polynomial Algorithms in Convex Programming; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1994. [Google Scholar]
  62. Nocedal, J.; Wright, S.J. Interior-Point Methods for Nonlinear Programming. In Numerical Optimization; Springer: New York, NY, USA, 2006; pp. 563–597. ISBN 978-0-387-40065-5. [Google Scholar]
Figure 1. Structure of the considered multilayer perceptron artificial neural network.
Figure 1. Structure of the considered multilayer perceptron artificial neural network.
Applsci 15 03145 g001
Figure 2. Block diagram of the model personalization and personalized optimization of chemotherapy dosing.
Figure 2. Block diagram of the model personalization and personalized optimization of chemotherapy dosing.
Applsci 15 03145 g002
Figure 3. Comparison of the interval uncertainty of the original parameter vector ± θ θ ¯ and the interval uncertainty of the estimated (personalized) parameter vector ± θ ^ θ ¯ .
Figure 3. Comparison of the interval uncertainty of the original parameter vector ± θ θ ¯ and the interval uncertainty of the estimated (personalized) parameter vector ± θ ^ θ ¯ .
Applsci 15 03145 g003
Figure 4. State responses and their parametric approximations by the exponential function for T s = 1 / 4 .
Figure 4. State responses and their parametric approximations by the exponential function for T s = 1 / 4 .
Applsci 15 03145 g004
Figure 5. Elements of the actual vector θ and of the estimated vector θ ^ together with the corresponding confidence interval θ ^ obtained using the MLP NN with exponential function parametrization of the state responses for T s = 1 / 4 .
Figure 5. Elements of the actual vector θ and of the estimated vector θ ^ together with the corresponding confidence interval θ ^ obtained using the MLP NN with exponential function parametrization of the state responses for T s = 1 / 4 .
Applsci 15 03145 g005
Figure 6. Comparison of the reference state responses with the simulated responses considering the personalized model for T s = 1 / 4 .
Figure 6. Comparison of the reference state responses with the simulated responses considering the personalized model for T s = 1 / 4 .
Applsci 15 03145 g006
Figure 7. Elements of the actual vector θ and of the estimated vector θ ^ personalized using the least squares method for model response fitting.
Figure 7. Elements of the actual vector θ and of the estimated vector θ ^ personalized using the least squares method for model response fitting.
Applsci 15 03145 g007
Figure 8. Comparison of the reference state responses with the simulated responses considering the model personalized using the least squares method for model response fitting.
Figure 8. Comparison of the reference state responses with the simulated responses considering the model personalized using the least squares method for model response fitting.
Applsci 15 03145 g008
Figure 9. Simulation of the optimal chemotherapy designed based on the personalized model.
Figure 9. Simulation of the optimal chemotherapy designed based on the personalized model.
Applsci 15 03145 g009
Figure 10. Simulation of the suboptimal chemotherapy designed based on the nominal model.
Figure 10. Simulation of the suboptimal chemotherapy designed based on the nominal model.
Applsci 15 03145 g010
Table 1. Nominal parameters of the model and their biological interpretation [23].
Table 1. Nominal parameters of the model and their biological interpretation [23].
Parameter a ¯ b ¯ c ¯ d ¯ r ¯ μ ¯ α ¯ 1
Nominal value 1 3.17 × 10 6 5.14 × 10 1 1.02 × 10 9 1.1 × 10 7 2.0 × 10 2 1.0 × 10 7
MeaningGrowth rate of natural killer cellsInverse of natural killer cells’ capacityGrowth rate of tumorInverse of tumor capacityActivation rate of cytotoxic T cellsCytotoxic T-cell death rateNatural killer cell death rate
Parameter α ¯ 2 β ¯ 1 β ¯ 2 ω ¯ k ¯ N k ¯ L k ¯ T
Nominal value 6.41 × 10 11 3.42 × 10 10 3.5 × 10 7 9.0 × 10 1 6.0 × 10 1 6.0 × 10 1 8.0 × 10 1
MeaningTumor death rate of natural killer cellsCytotoxic T-cell death rateRate of cytotoxic T cells induced tumor deathDrug decay rateNatural killer cell death rate by drugCytotoxic T-cell death rate by drugTumor cell death rate by drug
Table 2. Mean value of the residuals ξ ^ ξ and the diagonal entries of the covariance matrix P ^ of the residuals evaluated across M entries of the validation dataset for different T s .
Table 2. Mean value of the residuals ξ ^ ξ and the diagonal entries of the covariance matrix P ^ of the residuals evaluated across M entries of the validation dataset for different T s .
T s
Proposed machine learning method m e a n ξ ^ ξ 1 4 0.0046 0.0113 0.0047 0.0009 0.0009 0.0013 0.0019 0.0009 0.0008 T
d i a g P ^ 0.0371 0.1350 0.0370 0.0080 0.1330 0.0399 0.1222 0.0426 0.0633 T
m e a n ξ ^ ξ 1 0.0123 0.0003 0.0022 0.0028 0.0031 0.0079 0.0010 0.0017 0.0068 T
d i a g P ^ 0.1009 0.1679 0.0624 0.0182 0.1661 0.0618 0.2205 0.0535 0.1017 T
Nonlinear least squares method m e a n ξ ^ ξ 1 4 0.1624 0.1257 0.0205 0.0689 0.0838 0.0260 0.0033 0.0460 0.0399 T
d i a g P ^ 0.6265 0.9521 0.8547 0.8774 0.7292 0.9043 0.9793 0.8476 1.0195 T
m e a n ξ ^ ξ 1   0.1940 0.0534 0.0025 0.0679 0.0779 0.0357 0.0001 0.0473 0.0056 T
d i a g P ^ 0.6280 0.9626 0.8560 0.8456 0.7436 0.8792 0.9462 0.8357 1.0461 T
Table 3. RMSE evaluated across M entries of the validation dataset for different T s .
Table 3. RMSE evaluated across M entries of the validation dataset for different T s .
Proposed Machine Learning MethodNonlinear Least Squares Method
T s = 1 / 4 T s = 1 T s = 1 / 4 T s = 1
R M S E N 2.2891 × 10 4 2.3747 × 10 4 1.6740 × 10 4 1.8361 × 10 4
R M S E L 4.4498 × 10 4 4.7333 × 10 4 5.9670 × 10 4 5.9306 × 10 4
R M S E T 5.4686 × 10 5 5.8619 × 10 5 4.2857 × 10 5 4.4625 × 10 5
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dodek, M.; Vitková, Z.; Vitko, A.; Pavlovičová, J.; Miklovičová, E. Personalization of Optimal Chemotherapy Dosing Based on Estimation of Uncertain Model Parameters Using Artificial Neural Network. Appl. Sci. 2025, 15, 3145. https://doi.org/10.3390/app15063145

AMA Style

Dodek M, Vitková Z, Vitko A, Pavlovičová J, Miklovičová E. Personalization of Optimal Chemotherapy Dosing Based on Estimation of Uncertain Model Parameters Using Artificial Neural Network. Applied Sciences. 2025; 15(6):3145. https://doi.org/10.3390/app15063145

Chicago/Turabian Style

Dodek, Martin, Zuzana Vitková, Anton Vitko, Jarmila Pavlovičová, and Eva Miklovičová. 2025. "Personalization of Optimal Chemotherapy Dosing Based on Estimation of Uncertain Model Parameters Using Artificial Neural Network" Applied Sciences 15, no. 6: 3145. https://doi.org/10.3390/app15063145

APA Style

Dodek, M., Vitková, Z., Vitko, A., Pavlovičová, J., & Miklovičová, E. (2025). Personalization of Optimal Chemotherapy Dosing Based on Estimation of Uncertain Model Parameters Using Artificial Neural Network. Applied Sciences, 15(6), 3145. https://doi.org/10.3390/app15063145

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop