Next Article in Journal / Special Issue
A Stochastic Characterization of the Capture Zone in Pursuit-Evasion Games
Previous Article in Journal / Special Issue
On Optimal Leader’s Investments Strategy in a Cyclic Model of Innovation Race with Random Inventions Times
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal CAR T-cell Immunotherapy Strategies for a Leukemia Treatment Model

1
Department of Optimal Control, Faculty of Computational Mathematics and Cybernetics, Lomonosov Moscow State University, 119992 Moscow, Russia
2
Department of Mathematics and Computer Science, Texas Woman’s University, Denton, TX 76204, USA
*
Author to whom correspondence should be addressed.
Games 2020, 11(4), 53; https://doi.org/10.3390/g11040053
Submission received: 18 September 2020 / Revised: 30 October 2020 / Accepted: 10 November 2020 / Published: 18 November 2020
(This article belongs to the Special Issue Optimal Control Theory)

Abstract

:
CAR T-cell immunotherapy is a new development in the treatment of leukemia, promising a new era in oncology. Although so far, this procedure only helps 50–90% of patients and, like other cancer treatments, has serious side effects. In this work, we have proposed a controlled model for leukemia treatment to explore possible ways to improve immunotherapy methodology. Our model is described by four nonlinear differential equations with two bounded controls, which are responsible for the rate of injection of chimeric cells, as well as for the dosage of the drug that suppresses the so-called “cytokine storm”. The optimal control problem of minimizing the cancer cells and the activity of the cytokine is stated and solved using the Pontryagin maximum principle. The five possible optimal control scenarios are predicted analytically using investigation of the behavior of the switching functions. The optimal solutions, obtained numerically using BOCOP-2.2.0, confirmed our analytical findings. Interesting results, explaining, why therapies with rest intervals (for example, stopping injections in the middle of the treatment interval) are more effective (within the model), rather than with continuous injections, are presented. Possible improvements to the mathematical model and method of immunotherapy are discussed.

1. Introduction

Immunotherapy is a method of cancer treatment, in which the immune system is used to fight the tumor. The immune system protects the body from infections and diseases. It attacks pathogens such as bacteria and viruses. The immune system also helps to get rid of the body’s diseased or damaged cells. The basic idea of immunotherapy is simple: to help the body protect itself from malicious “invaders”. However, cancer cells can be insidious. Often, they find a way to “hide” from the immune system or to disable its response. In general, immunotherapy aims to counteract some of these ways of evading immune system attacks: helps the immune system find cancer cells and to kill them. There are different types of immunotherapy, each of which has its own mechanism to strengthen the immune system’s response.
One of the encouraging ways to use the body’s capabilities in the fight against cancer is therapy with so-called T-lymphocytes with chimeric antigen (antigens—substances foreign to the body that cause its immune response) receptors, abbreviated CAR-T, (Chimeric Antigen Receptor T-cell) appeared relatively recently, at the beginning of the 21th century. In numerous publications this therapy is called “a breakthrough in the treatment of oncology”, “a new era in medicine” [1,2].
What is the idea behind such a molecule? One of its parts recognizes markers on the surface of the tumor cell, while other parts transmit an activating signal to the cell of the immune system to destroy the tumor cell. CAR T-cells are produced in the following way: T-lymphocytes are extracted from the patient’s blood, that is, cells that should normally protect us from cancer and virus-infected cells. This is done through apheresis, a technology that separates blood into components and produces a certain number of lymphocytes. Then, DNA encoding CAR is inserted into the chromosomes of T-lymphocytes, and the cell begins to produce on its surface those very chimeric, that is, artificially created receptors that do not exist in nature. They are designed so that T-lymphocytes find markers on the surface of cancer cells and receive a signal to attack them. The thus obtained CAR T-cells are expanded and injected back into the patient’s blood.
What happens next? If CAR T-cells in a patient’s body collide with a normal cell, they are supposed to simply float past each other; if they find a cancer cell, the chimeric antigen receptor recognizes a specific marker on it, to which it was tuned during creation. The T-lymphocyte kills the cancer cell and starts dividing very actively. One cell, equipped with a CAR T-receptor, produces hundreds and even thousands. Such a self-replicating medicine. It was a believe that as long as there are cancer cells in some corners of the body, CAR T-lymphocytes will destroy them. When all cancer cells are destroyed, most of the CAR T-cells will die, some will remain in the bone marrow to reappear, multiply and destroy the cancer if they recur. Why, then, even in the best case, the effectiveness of treatment is only 50–90%, not 100 % ? ([2], table 2) It was found that 90 % of cancer cells can carry the target that CAR T-lymphocytes are targeting, but 10 % cannot. And these 10 % remain intact in the body and multiply. In addition, some cancer cells can “escape” from CAR T-lymphocytes if it “removes” or modifies targets on the surface of its cells. There are other problems that complicate treatment with chimeric antigen receptors. For example, despite the fact that CAR T-cells by themselves can live a long time, in an organism depleted by multiple chemotherapy they will multiply rather sluggishly and ineffectively fight cancer cells. This is partly why this approach does not work in all cases.
At the same time, these antigens are not strictly cancer specific. When immunotherapy emerged as an alternative to radio and chemotherapy, many people thought that this method of treatment was not as harmful for the patients. However, CAR, although the most expensive, is still in the process of developing because modified lymphocytes also kill healthy cells [3]. For example, in the immune therapy used in treating blood cancers, CAR T-cells target CD19 antigen that is also found on normal B-lymphocytes [4]. So, anti-CD19 CAR therapy leads to a decrease in blood of B-lymphocytes (B-cell aplasia), which should be justified by the clinical anticancer effect in each particular case [5,6]. Thus, the current chimeric T-cell receptors are not strictly specific to cancer cells. In this regard, there may be a toxic effect on the principle of “organ-transplant against the host” [7,8,9]. In some cases, as is the case with B-cell aplasia in the treatment of anti-CD19 CAR T-lymphocytes of lympho-proliferative diseases, this is not too severe a complication and is stopped by the introduction of immunoglobulin [2]. However, sometimes the toxic effect of CAR T-lymphocytes is an obstacle to their clinical use. Everything is determined by the correctness of the chosen target and the acceptable level of observed complications [10].
The world has already conducted clinical trials on several thousand patients with blood cancers. The FDA (Food and Drug Administration) approved the first application of CD19 CAR immunotherapy for USA patients in 2007. Now this therapy shows the effectiveness of 50–90% in the treatment of patients even with terminal stage lymphoblastic leukemia, lymphoma and myeloma [2]. Side effects are associated with the fact that CAR T-cells are too powerful. By attacking cancer cells, they also affect other cells of the immune system, forcing them to release massively biologically active substances, cytokines. This leads to another major problem in the usage of CAR lymphocytes, the so-called “cytokine storm”.
The symptom of “a cytokine storm” is primarily an increase in temperature and hypotension, which can lead to multiple organ failure. The prevention of it consists of the use of steroids, vasopressors and supportive therapy in the conditions of the intensive care unit. Recently, the use of antibodies against TNF- α (infliximab) and antibodies against the interleukin-6 receptor (tocilizumab) or monotherapy with the use of tocilizumab showed good results (see, for example, [11]). Unlike many conventional drug-induced side effects, this toxicity cannot be managed by simply reducing the dosage of the drug, as the number of proliferating T-cells will increase in number and may eventually reach critical level. Prevention of the “cytokine storm” is seems to be manageable by the fractional introduction of T-lymphocytes or by the use of short-lived populations of lymphocytes [10]. It is gratifying that many doctors have already learned to predict and control the development of “a cytokine storm” in most patients.
Thus, besides the purely biomedical task of making immunotherapy more specific to certain types of cancer, there are two main questions with CAR T-cell immunotherapy:
  • With what frequency and at what intervals should chimeric leukocytes be injected and in what concentrations?
  • How to take a medicine that suppresses the “surge” of the immune system? If between injections the patient seems to be resting, is it necessary to simultaneously stop giving the immunosuppressant, or are these two independent procedures?
In order not to go through all the possible options in the treatment of cancer patients, one can try to formulate the situation in immunotherapy mathematically and create a mathematical model to answer these and other questions. Currently, many models have been created that help to understand and predict both the dynamics of tumor growth itself and those works whose authors propose controlled models and, together with optimization ideas, simultaneously try to find the optimal treatment protocol. In this sense, the survey of mathematical models for leukemia and lymphoma given in [12] is very useful. It shows that deterministic ODE models can serve as good approximations of the average behavior of a system when the populations are large and so such models can be suitable for describing the late stages of cancer.
The main goal of mathematical modeling of cancer is to improve treatment, either by optimizing the way existing therapies are being administered or by motivating novel therapies [12]. After consideration of many papers, we find especially interesting the works of [13], in which the authors present a competition model of cancer tumor growth that includes the immune system response and drug therapy, as well as a recent article [14], in which the authors model chimeric anti-gene receptor T-cell therapy (CAR therapy) with presence of interleukin-6, the cytokine responsible for “cytokine storm”. In both papers the authors use the optimal control theory to numerically find the optimal treatment schedule.
For various reasons, as well as depending on the individual characteristics of the patient undergoing immunotherapy treatment, chimeric cells can behave ambiguously or differently: they either become too active and divide uncontrollably, or vice versa, behave sluggishly, inactive, sometimes killing healthy cells. It seems logical to model the case of super-active CAR T-cells either with a predator-one prey model plus additional competition between healthy and cancer cells (this would be a case of ideal immunotherapy) or with a model with one predator and two preys (this is when active chimeras are nonspecific and might kill healthy cells as well). In turn, for an organism weakened by radio- or chemotherapy, CAR T-cells are not very active or specific and the competition model for all three types of cells seems more suitable. The model of the latter type is considered in this work, which is the continuation of our studies presented in [15,16,17].
We propose and investigate controlled model of CAR T-cell immunotherapy that employs and combines the best ideas presented in [13,14]. In our controlled model, the dynamics of CAR T-cells, normal or healthy B-cells, cancer cells, and the cytokines is described by four ordinary differential equations of Lotka-Volterra competition type with two bounded controls. The first control represents the concentration of the injected CAR immune T-cells and its influence on the variables of system. The second control reflects the effect of the immunosuppressant drug, such as tocilizumab and its role on preventing “cytokine storm”.
The paper is organized as follows. In Section 2 we define the model and state the optimal control problem of minimizing the tumor’s cells and concentration of the cytokines. In Section 3, we apply the Pontryagin maximum principle and find the type of optimal controls that depend on the switching functions. Here we find that the second optimal control reflecting the suppressive effect of possible “cytokine storm” is constant, taking the maximum value on the entire time interval of treatment period. In Section 4, we study the properties of the first switching function analytically and find that the corresponding optimal control takes its maximum value in the end of the time interval. In Section 5, Section 6 and Section 7, we reduce the matrix to an upper triangular form, split the system and analyze quadratic subsystems. In Section 8, we find the type of the switching function that determines the behavior of the first optimal control responsible for the CAR injection schedule. We prove that the switching function has not more than two zeros. In Section 9 we find analytically that the first optimal control has at most two switchings and that depending on model parameters, it can take one of the five optimal treatment scenarios. Section 10 shows computer simulation of the model at different model parameters and demonstrate the optimal solutions. Section 11 contains our conclusions.

2. Model and Optimal Control Problem

At a given time interval [ 0 , T ] , which is the treatment time, let us consider a controlled model for the spread and immunotherapy of leukemia. The model is described by the following nonlinear system of differential equations:
x ( t ) = r x ( t ) α 1 x ( t ) y ( t ) β 1 x ( t ) z ( t ) + m u ( t ) x ( t ) , y ( t ) = p y ( t ) η y 2 ( t ) α 2 y ( t ) x ( t ) β 2 y ( t ) z ( t ) , z ( t ) = q z ( t ) γ z 2 ( t ) α 3 z ( t ) x ( t ) β 3 z ( t ) y ( t ) , w ( t ) = λ ν w ( t ) + n ( 1 v ( t ) ) w ( t ) x ( t )
with given initial values:
x ( 0 ) = x 0 , y ( 0 ) = y 0 , z ( 0 ) = z 0 , w ( 0 ) = w 0 ; x 0 > 0 , y 0 > 0 , z 0 > 0 , w 0 > 0 .
In this model, x ( t ) denotes the population of CAR modified T-lymphocytes at time t, y ( t ) the population of B-leukemic or cancer cells, z ( t ) the population of healthy B-cells, and w ( t ) —inflammatory cytokines, such as Il-6, the population of which is elevated by the immune therapy.
Firstly, let us consider the second and the third equations of the control system. The first two terms of the both equations:
p y ( t ) η y 2 ( t ) = p y ( t ) 1 p 1 η y ( t ) , q z ( t ) γ z 2 ( t ) = q z ( t ) 1 q 1 γ z ( t )
reflect a logistic growth law for the cancer and healthy B-cells. The parameters p, q and p 1 η , q 1 γ represent the corresponding per capita growth rates and reciprocal carrying capacities of these types of cells. The last terms in the both equations β 2 y ( t ) z ( t ) and β 3 z ( t ) y ( t ) represent decrease in the populations of the cancer and healthy cells because of their competition; β 2 and β 3 are the corresponding competition coefficients. The third terms in the both equations α 2 y ( t ) x ( t ) and α 3 z ( t ) x ( t ) show a decrease in the populations of cancer and healthy cells as a result of their interaction with the immune modified T-cells, x ( t ) . Here, α 2 and α 3 are the corresponding relative compatibility coefficients.
Secondly, let us explain the first and the fourth equations of the model. In the first equation, the first term r x ( t ) involves proliferation of CAR T-cells due to stimulation by encounters with their target cells that contain CD19 antigen, and r is the per capita growth rate of this type of cells. We do not consider here their natural death since they do not undergo apoptosis after killing the target cells [3]. While the x ( t ) cells population does decrease after killing the target cells, y ( t ) , z ( t ) that is displayed by the corresponding second and third terms α 1 x ( t ) y ( t ) and β 1 x ( t ) z ( t ) of the first differential equation, this interaction with target cells somewhere how stimulates CAR cells proliferation. Here α 1 and β 1 are the corresponding killing efficiency of CAR T-cells in relation to cancer and healthy cells. The last term m u ( t ) x ( t ) of the first equation reflects an increase in the population of x ( t ) cells due to the controlled injection of the modified chimeric T-cells. Here u ( t ) is the first bounded control. This control represents a kind of intensity of CAR immunotherapy. The control itself changes from zero to the maximum value. If the control is equal to zero, then the injection of chimeric cells is not performed, and if it is maximum, then this corresponds to the injection with the maximum concentration of chimeric cells. The mathematical properties of control u ( t ) are explained below. Finally, m reflects the efficiency of the CAR T-cell therapy.
The fourth differential equation describes the dynamics of the Il-6 cytokines. The first term λ of it is the influx rate (secreting rate) of the cytokines. The second term ν w ( t ) indicates a negative feedback mechanism mentioned in [14], and ν is the cytokine decay rate. An elevating increase in the cytokine level caused by the CAR T-cell immunotherapy that appears as the “cytokine storm”, manifested as a systemic inflammatory process in all vital organs of patients receiving CAR immunotherapy, is reflected by the last term n ( 1 v ( t ) ) w ( t ) x ( t ) of the differential equation. This last term of the controlled model contains the second bounded control v ( t ) that plays the role of the intensity of the dosage of immunosuppressant drug intake. The second bounded control takes values from 0 (no medication is given) to 1 (the maximum dosage of anti-inflammatory drag is taken), and n reflects the efficiency of this dosage. Our goal is by using two controls, find such optimal treatment protocol that minimizes the cancer cells and prevent over activity of the cytokines.
All the parameters of the model are positive constants, the value of which depends on each specific case. We consider that the following assumption is true.
Assumption 1.
Let in system (1) the parameters: r, p, q, m, n, η, γ, λ, ν, α 1 , α 2 , α 3 , β 1 , β 2 , β 3 be positive, and for them the following inequalities hold:
p q 0 , α 3 α 2 0 ,
β 3 η 0 , β 2 β 3 γ η 0 ,
α 2 β 3 α 3 η 0 , α 2 γ α 3 β 2 0 ,
α 1 ( α 2 γ α 3 β 2 ) β 1 ( α 2 β 3 α 3 η ) 0 ,
and one of the inequalities in (3) and in (5) be strict.
It is easy to see that the fulfillment of inequalities (3)–(5) implies the validity of the following relationships:
α 2 ( β 2 β 3 γ η ) + ( β 3 η ) ( α 2 γ α 3 β 2 ) 0 , α 3 ( β 2 β 3 γ η ) + ( γ β 2 ) ( α 2 β 3 α 3 η ) 0 .
Next, in system (1), two control functions u ( t ) and v ( t ) are introduced. They subject to the restrictions:
0 u ( t ) u max , 0 v ( t ) 1 .
We consider that the set of all admissible controls Ω ( T ) is formed by all possible pairs of Lebesgue measurable functions ( u ( t ) , v ( t ) ) , which for almost all t [ 0 , T ] satisfy inequalities (8).
The positiveness, boundedness and continuation of the solutions for system (1) and (2) is established by the following lemma.
Lemma 1.
For any pair of admissible controls ( u ( t ) , v ( t ) ) the corresponding absolutely continuous solutions x ( t ) , y ( t ) , z ( t ) , w ( t ) for system (1) and (2) are defined on the entire interval [ 0 , T ] and satisfy the inequalities:
0 < x ( t ) x max , 0 < y ( t ) y max , 0 < z ( t ) z max , 0 < w ( t ) w max , t [ 0 , T ] ,
where
x max = x 0 e ( r + m u max ) T , y max = y 0 e p T , z max = z 0 e q T , w max = w 0 e n x max T + λ ( n x max ) 1 e n x max T 1 .
The proof of Lemma 1 is fairly straightforward and we omit it.
Now, let us consider for system (1) and (2) on the set of admissible controls Ω ( T ) the following minimization problem:
J ( u ( · ) , v ( · ) ) = ( y ( T ) μ z ( T ) + δ w ( T ) ) + 0 T ( y ( t ) μ z ( t ) + δ w ( t ) ) d t min ( u ( · ) , v ( · ) ) Ω ( T ) ,
where μ and δ are positive weighting factors. The objective function in (10) is a sum of terminal and integral parts. The terminal part reflects the patient’s state (the weighted combination of the populations of leukemia cells, healthy B-cells and cytokines) at the end of the treatment period, and the integral term sets this state throughout the entire treatment period. For convenience of notation, in the objective function we denote the terminal part by I.
The existence in the minimization problem (10) of the optimal controls ( u * ( t ) , v * ( t ) ) and the corresponding optimal solutions x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) for system (1) and (2) follows from Lemma 1 and Theorem 4 ([18], Chapter 4). Indeed, the key assumptions of this theorem are true. Namely, inequalities (9) from Lemma 1 give a uniform estimate for the solution ( x ( t ) , y ( t ) , z ( t ) , w ( t ) ) of system (1) and (2) corresponding to any pair of admissible controls ( u ( t ) , v ( t ) ) from Ω ( T ) for all t [ 0 , T ] . This is the first key assumption. Next, system (1) is linear in controls u ( t ) and v ( t ) , and the integrand of the objective function in (10) does not depend on these controls. These facts ensure the convexity of the set of generalized velocities, which is the second key assumption.

3. Pontryagin Maximum Principle

We apply the Pontryagin maximum principle [19], which is the necessary optimality condition, in order to analyze the optimal controls u * ( t ) , v * ( t ) and the corresponding optimal solutions x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) . Firstly, let us define the Hamiltonian:
H ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = ( r α 1 y β 1 z + m u ) x ψ 1 + ( p η y α 2 x β 2 z ) y ψ 2 + ( q γ z α 3 x β 3 y ) z ψ 3 + ( λ ν w + n ( 1 v ) w x ) ψ 4 ( y μ z + δ w ) ,
where ψ 1 , ψ 2 , ψ 3 , ψ 4 are the adjoint variables.
Secondly, we calculate the required partial derivatives:
H x ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = ( r α 1 y β 1 z + m u ) ψ 1 α 2 y ψ 2 α 3 z ψ 3 + n ( 1 v ) w ψ 4 , H y ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = ( p η y α 2 x β 2 z ) ψ 2 α 1 x ψ 1 η y ψ 2 β 3 z ψ 3 1 , H z ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = ( q γ z α 3 x β 3 y ) ψ 3 β 1 x ψ 1 β 2 y ψ 2 γ z ψ 3 + μ , H w ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = ( ν + n ( 1 v ) x ) ψ 4 δ , H u ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = m x ψ 1 , H v ( x , y , z , w , u , v , ψ 1 , ψ 2 , ψ 3 , ψ 4 ) = n w x ψ 4 .
Then, in accordance with the Pontryagin maximum principle, for the optimal controls u * ( t ) , v * ( t ) and the optimal solutions x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) there exists a vector-function ψ * ( t ) = ( ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) ) such that:
  • ψ * ( t ) is a nontrivial solution of the adjoint system:
    ψ 1 * ( t ) = H x ( x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) , u * ( t ) , v * ( t ) , ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) ) = ( r α 1 y * ( t ) β 1 z * ( t ) + m u * ( t ) ) ψ 1 * ( t ) + α 2 y * ( t ) ψ 2 * ( t ) + α 3 z * ( t ) ψ 3 * ( t ) n ( 1 v * ( t ) ) w * ( t ) ψ 4 * ( t ) , ψ 2 * ( t ) = H y ( x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) , u * ( t ) , v * ( t ) , ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) ) = ( p η y * ( t ) α 2 x * ( t ) β 2 z * ( t ) ) ψ 2 * ( t ) + α 1 x * ( t ) ψ 1 * ( t ) + η y * ( t ) ψ 2 * ( t ) + β 3 z * ( t ) ψ 3 * ( t ) + 1 , ψ 3 * ( t ) = H z ( x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) , u * ( t ) , v * ( t ) , ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) ) = ( q γ z * ( t ) α 3 x * ( t ) β 3 y * ( t ) ) ψ 3 * ( t ) + β 1 x * ( t ) ψ 1 * ( t ) + β 2 y * ( t ) ψ 2 * ( t ) + γ z * ( t ) ψ 3 * ( t ) μ , ψ 4 * ( t ) = H w ( x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) , u * ( t ) , v * ( t ) , ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) ) = ( ν n ( 1 v * ( t ) ) x * ( t ) ) ψ 4 * ( t ) + δ , ψ 1 * ( T ) = I x ( T ) = 0 , ψ 2 * ( T ) = I y ( T ) = 1 , ψ 3 * ( T ) = I z ( T ) = μ , ψ 4 * ( T ) = I w ( T ) = δ ;
  • the controls u * ( t ) and v * ( t ) maximize the Hamiltonian
    H ( x * ( t ) , y * ( t ) , z * ( t ) , w * ( t ) , u , v , ψ 1 * ( t ) , ψ 2 * ( t ) , ψ 3 * ( t ) , ψ 4 * ( t ) )
    with respect to variables u [ 0 , u max ] and v [ 0 , 1 ] for almost all t [ 0 , T ] , and therefore they satisfy the relationships:
    u * ( t ) = u max , if m x * ( t ) ψ 1 * ( t ) > 0 , any u [ 0 , u max ] , if m x * ( t ) ψ 1 * ( t ) = 0 , 0 , if m x * ( t ) ψ 1 * ( t ) < 0 ,
    v * ( t ) = 1 , if n w * ( t ) x * ( t ) ψ 4 * ( t ) > 0 , any v [ 0 , 1 ] , if n w * ( t ) x * ( t ) ψ 4 * ( t ) = 0 , 0 , if n w * ( t ) x * ( t ) ψ 4 * ( t ) < 0 .

4. Properties of the Switching Functions and Optimal Controls

We establish the validity of the following lemma.
Lemma 2.
The adjoint variable ψ 4 * ( t ) is negative on the interval [ 0 , T ] .
Proof of Lemma 2.
Let us select from the adjoint system (11) the Cauchy problem for the adjoint variable ψ 4 * ( t ) in the form
ψ 4 * ( t ) = ( ν n ( 1 v * ( t ) x * ( t ) ) ψ 4 * ( t ) + δ , t [ 0 , T ] , ψ 4 * ( T ) = δ ,
and then integrate it. As a result, we find the formula
ψ 4 * ( t ) = δ e ν ( T t ) + n t T ( 1 v * ( ξ ) ) x * ( ξ ) d ξ 1 + t T e ν ( T s ) n s T ( 1 v * ( ξ ) ) x * ( ξ ) d ξ d s ,
from which the statement of this lemma immediately follows.  □
The validity of the theorem below follows from Lemmas 1 and 2 and also Formula (13).
Theorem 1.
Optimal control v * ( t ) is a constant function of the type
v * ( t ) = 1 , t [ 0 , T ] .
This fact means that the drug intake, which is given by the control function v ( t ) , occurs with maximum intensity of 1 over the entire treatment period [ 0 , T ] .
Now we substitute the Formula (14) into the first equation of the adjoint system (11), and also exclude the last equation from it together with the corresponding initial condition. As a result, we obtain the simplified adjoint system:
ψ 1 * ( t ) = ( r α 1 y * ( t ) β 1 z * ( t ) + m u * ( t ) ) ψ 1 * ( t ) + α 2 y * ( t ) ψ 2 * ( t ) + α 3 z * ( t ) ψ 3 * ( t ) , ψ 2 * ( t ) = ( p η y * ( t ) α 2 x * ( t ) β 2 z * ( t ) ) ψ 2 * ( t ) + α 1 x * ( t ) ψ 1 * ( t ) + η y * ( t ) ψ 2 * ( t ) + β 3 z * ( t ) ψ 3 * ( t ) + 1 , ψ 3 * ( t ) = ( q γ z * ( t ) α 3 x * ( t ) β 3 y * ( t ) ) ψ 3 * ( t ) + β 1 x * ( t ) ψ 1 * ( t ) + β 2 y * ( t ) ψ 2 * ( t ) + γ z * ( t ) ψ 3 * ( t ) μ , ψ 1 * ( T ) = 0 , ψ 2 * ( T ) = 1 , ψ 3 * ( T ) = μ .
In this system, let us perform the change of adjoint variables in accordance with the formulas:
ϕ 1 * ( t ) = x * ( t ) ψ 1 * ( t ) , ϕ 2 * ( t ) = y * ( t ) ψ 2 * ( t ) , ϕ 3 * ( t ) = z * ( t ) ψ 3 * ( t ) .
Due to the corresponding equations of systems (1) and (15), we obtain appropriate equations for the new adjoint variables (16) as
ϕ 1 * ( t ) = α 2 x * ( t ) ϕ 2 * ( t ) + α 3 x * ( t ) ϕ 3 * ( t ) , ϕ 2 * ( t ) = α 1 y * ( t ) ϕ 1 * ( t ) + η y * ( t ) ϕ 2 * ( t ) + β 3 y * ( t ) ϕ 3 * ( t ) + y * ( t ) , ϕ 3 * ( t ) = β 1 z * ( t ) ϕ 1 * ( t ) + β 2 z * ( t ) ϕ 2 * ( t ) + γ z * ( t ) ϕ 3 * ( t ) μ z * ( t ) .
Adding to them following from (15) and (16) new initial conditions, we find the corresponding adjoint system:
ϕ 1 * ( t ) = α 2 x * ( t ) ϕ 2 * ( t ) + α 3 x * ( t ) ϕ 3 * ( t ) , t [ 0 , T ] , ϕ 2 * ( t ) = α 1 y * ( t ) ϕ 1 * ( t ) + η y * ( t ) ϕ 2 * ( t ) + β 3 y * ( t ) ϕ 3 * ( t ) + y * ( t ) , ϕ 3 * ( t ) = β 1 z * ( t ) ϕ 1 * ( t ) + β 2 z * ( t ) ϕ 2 * ( t ) + γ z * ( t ) ϕ 3 * ( t ) μ z * ( t ) , ϕ 1 * ( T ) = 0 , ϕ 2 * ( T ) = y * ( T ) , ϕ 3 * ( T ) = μ z * ( T ) .
Now, analyzing relationship (12) and the first formula in (16), we rewrite this relationship as follows
u * ( t ) = u max , if L ( t ) > 0 , any u [ 0 , u max ] , if L ( t ) = 0 , 0 , if L ( t ) < 0 ,
where L ( t ) = ϕ 1 * ( t ) is a switching function, the behavior of which defines the form of the optimal control u * ( t ) .
Let us obtain the system of differential equations for the function L ( t ) . We introduce the first function corresponding to the switching function as
G ( t ) = α 2 ϕ 2 * ( t ) + α 3 ϕ 3 * ( t ) .
Then, the first differential equation in (17) can be rewritten as
L ( t ) = b 1 ( t ) G ( t ) ,
where b 1 ( t ) = x * ( t ) .
By the inequalities (5) from Assumption 1, we introduce the second function corresponding to the switching function as
P ( t ) = ϕ 3 * ( t ) + α 2 ( α 2 y * ( t ) α 3 μ z * ( t ) ) α 2 ( α 2 β 3 α 3 η ) y * ( t ) + α 3 ( α 2 γ α 3 β 2 ) z * ( t ) .
Then, using the second and third equations of system (17), we find the equation:
G ( t ) = a 2 ( t ) L ( t ) + b 2 ( t ) G ( t ) + c 2 ( t ) P ( t ) ,
where the functions a 2 ( t ) , b 2 ( t ) , c 2 ( t ) are defined as
a 2 ( t ) = α 1 α 2 y * ( t ) + α 3 β 1 z * ( t ) , b 2 ( t ) = α 2 1 ( α 2 η y * ( t ) + α 3 β 2 z * ( t ) ) , c 2 ( t ) = α 2 1 ( α 2 ( α 2 β 3 α 3 η ) y * ( t ) + α 3 ( α 2 γ α 3 β 2 ) z * ( t ) ) .
Again using the third equation of system (17) and also the second and third equations of system (1), we obtain the equation:
P ( t ) = a 3 ( t ) L ( t ) + b 3 ( t ) G ( t ) + c 3 ( t ) P ( t ) + F ( t ) ,
where the functions a 3 ( t ) , b 3 ( t ) , c 3 ( t ) are defined by the formulas:
a 3 ( t ) = β 1 z * ( t ) , b 3 ( t ) = α 2 1 β 2 z * ( t ) , c 3 ( t ) = α 2 1 ( α 2 γ α 3 β 2 ) z * ( t ) ,
and the function F ( t ) is given by the relationship
F ( t ) = α 2 ( ( α 2 γ α 3 β 2 ) + μ ( α 2 β 3 α 3 η ) ) y * ( t ) z * ( t ) × α 2 α 3 ( p q ) + ( α 3 α 2 ) ( x * ( t ) + α 2 β 3 y * ( t ) + α 3 β 2 z * ( t ) ) ( α 2 ( α 2 β 3 α 3 η ) y * ( t ) + α 3 ( α 2 γ α 3 β 2 ) z * ( t ) ) 2 .
By inequalities (3) and (5), it is easy to see that this function takes positive values on [ 0 , T ] .
Remark 1.
An important property of the function F ( t ) is its sign-definiteness on the interval [ 0 , T ] . From the analysis of the formula (23), we can conclude that this property is preserved if, instead of inequalities (3), the inequalities:
p q 0 , α 3 α 2 0
hold, and only one of them is strict. Then, the function F ( t ) takes negative values on [ 0 , T ] .
Now, gathering Equations (19), (21) and (22) for functions L ( t ) , G ( t ) , P ( t ) in one system and then adding the appropriate initial conditions due to the definitions of these functions and the initial conditions from (17), we find the required system of differential equations for the switching function L ( t ) and the functions G ( t ) and P ( t ) that correspond to it:
L ( t ) = b 1 ( t ) G ( t ) , t [ 0 , T ] , G ( t ) = a 2 ( t ) L ( t ) + b 2 ( t ) G ( t ) + c 2 ( t ) P ( t ) , P ( t ) = a 3 ( t ) L ( t ) + b 3 ( t ) G ( t ) + c 3 ( t ) P ( t ) + F ( t )
with the initial conditions for the first two functions:
L ( T ) = 0 , G ( T ) = ( α 2 y * ( T ) α 3 μ z * ( T ) ) .
Here the function F ( t ) is defined by Formula (23).
Finally, let us analyze the behavior of the switching function L ( t ) in a neighborhood of t = T . The following lemma is true.
Lemma 3.
In the neighborhood adjacent to t = T , the switching function L ( t ) takes only positive values if α 2 y * ( T ) α 3 μ z * ( T ) 0 , and only negative values if α 2 y * ( T ) α 3 μ z * ( T ) < 0 .
Proof of Lemma 3.
First, let us consider the case, when
α 2 y * ( T ) α 3 μ z * ( T ) 0 .
The definition of the function G ( t ) implies its continuity on the interval [ 0 , T ] . Then, the neighborhood of t = T is defined, in which due to (27) and the second initial condition from (26), this function is negative if α 2 y * ( T ) α 3 μ z * ( T ) > 0 , and positive if α 2 y * ( T ) α 3 μ z * ( T ) < 0 . Then, integrating in this neighborhood the first equation of system (25) with the appropriate initial condition from (26), we find the formula
L ( t ) = t T b 1 ( ξ ) G ( ξ ) d ξ ,
from which the positivity or negativity of the switching function L ( t ) immediately follows.
Now, we consider the case, when
α 2 y * ( T ) α 3 μ z * ( T ) = 0 .
It is easy to see that then the Formula (20) implies the equality
P ( T ) = μ z * ( T ) ,
and the initial condition from (26) for the function G ( t ) is rewritten as G ( T ) = 0 . Let us integrate the second equation of system (25) with this initial condition. As a result, we have the formula
G ( t ) = t T e t s b 2 ( ξ ) d ξ a 2 ( s ) L ( s ) + c 2 ( s ) P ( s ) d s .
The definitions of the functions a 2 ( t ) , c 2 ( t ) , L ( t ) , P ( t ) ensure their continuity on the interval [ 0 , T ] . Then, the first initial condition from (26) and (29) imply the existence of the neighborhood of t = T in which the function a 2 ( t ) L ( t ) + c 2 ( t ) P ( t ) takes positive values. Considering the values of t in Formula (30) only from this neighborhood, we conclude that in it the function G ( t ) is negative. Due to (28), we see that in this neighborhood the switching function L ( t ) takes positive values.  □
The validity of the theorem below follows from Lemma 3 and Formula (18).
Theorem 2.
A neighborhood is defined adjacent to t = T , at which the optimal control u * ( t ) takes the maximum value u max if α 2 y * ( T ) α 3 μ z * ( T ) 0 , and the minimum value 0 if α 2 y * ( T ) α 3 μ z * ( T ) < 0 .
This fact means that the CAR T-cell therapy, which is expressed by the control function u ( t ) , at the end of the treatment period [ 0 , T ] can occur with both maximum intensity u max and minimum intensity 0.

5. Reducing the Matrix of System  to the Upper Triangular Form

We see that the system (25) is linear non-autonomous system of differential equations. Its matrix has the form:
0 b 1 ( t ) 0 a 2 ( t ) b 2 ( t ) c 2 ( t ) a 3 ( t ) b 3 ( t ) c 3 ( t ) .
For the convenience of analysis of such a system, we reduce this matrix to a special upper triangular form
* * 0 0 * * 0 0 * ,
using a change of variables:
L ˜ ( t ) = L ( t ) , G ˜ ( t ) = G ( t ) h 1 ( t ) L ( t ) , P ˜ ( t ) = P ( t ) h 2 ( t ) L ( t ) h 3 ( t ) G ( t ) ,
where the functions h i ( t ) , i = 1 , 2 , 3 will be determined later in a special way. In matrix (31), the symbol * denotes its nonzero elements.
Now, let us obtain the corresponding differential equations for the functions L ˜ ( t ) , G ˜ ( t ) and P ˜ ( t ) using the equations of system (25). Performing direct calculations, for the function L ˜ ( t ) we have the equation:
L ˜ ( t ) = b 1 ( t ) h 1 ( t ) L ˜ ( t ) + b 1 ( t ) G ˜ ( t ) .
Using similar calculations for the function G ˜ ( t ) , the following expression can be found
G ˜ ( t ) = h 1 ( t ) b 1 ( t ) h 1 2 ( t ) + c 2 ( t ) h 1 ( t ) h 3 ( t ) + b 2 ( t ) h 1 ( t ) + c 2 ( t ) h 2 ( t ) + a 2 ( t ) L ˜ ( t ) + ( b 2 ( t ) + c 2 ( t ) h 3 ( t ) b 1 ( t ) h 1 ( t ) ) G ˜ ( t ) + c 2 ( t ) P ˜ ( t ) .
Let us choose the functions h i ( t ) , i = 1 , 2 , 3 in such a way that the expression for L ˜ ( t ) vanishes identically. This implies that the function h 1 ( t ) satisfies the differential equation:
h 1 ( t ) = b 1 ( t ) h 1 2 ( t ) + c 2 ( t ) h 1 ( t ) h 3 ( t ) + b 2 ( t ) h 1 ( t ) + c 2 ( t ) h 2 ( t ) + a 2 ( t ) .
Then, expression (33) is transformed to the differential equation:
G ˜ ( t ) = ( b 2 ( t ) + c 2 ( t ) h 3 ( t ) b 1 ( t ) h 1 ( t ) ) G ˜ ( t ) + c 2 ( t ) P ˜ ( t ) .
Finally, we carry out similar direct calculations for the function P ˜ ( t ) . As a result, the following expression can be obtained
P ˜ ( t ) = [ ( h 2 ( t ) c 2 ( t ) h 2 ( t ) h 3 ( t ) + c 3 ( t ) h 2 ( t ) a 2 ( t ) h 3 ( t ) + a 3 ( t ) ) + h 1 ( t ) ( h 3 ( t ) c 2 ( t ) h 3 2 ( t ) b 1 ( t ) h 2 ( t ) ( b 2 ( t ) c 3 ( t ) ) h 3 ( t ) + b 3 ( t ) ) ] L ˜ ( t ) + h 3 ( t ) c 2 ( t ) h 3 2 ( t ) b 1 ( t ) h 2 ( t ) ( b 2 ( t ) c 3 ( t ) ) h 3 ( t ) + b 3 ( t ) G ˜ ( t ) + ( c 3 ( t ) c 2 ( t ) h 3 ( t ) ) P ˜ ( t ) + F ( t ) .
Let us choose the functions h i ( t ) , i = 2 , 3 in such a way that the expressions for L ˜ ( t ) and G ˜ ( t ) vanish identically. This implies that the functions h 2 ( t ) and h 3 ( t ) satisfy the differential equations:
h 2 ( t ) = c 2 ( t ) h 2 ( t ) h 3 ( t ) + c 3 ( t ) h 2 ( t ) a 2 ( t ) h 3 ( t ) + a 3 ( t ) , h 3 ( t ) = c 2 ( t ) h 3 2 ( t ) b 1 ( t ) h 2 ( t ) ( b 2 ( t ) c 3 ( t ) ) h 3 ( t ) + b 3 ( t ) .
Then, expression (36) is transformed to the differential equation:
P ˜ ( t ) = ( c 3 ( t ) c 2 ( t ) h 3 ( t ) ) P ˜ ( t ) + F ( t ) .
Gathering the Equations (32), (35) and (38) into one system, for the functions L ˜ ( t ) , G ˜ ( t ) and P ˜ ( t ) we obtain the linear non-autonomous system of differential equations:
L ˜ ( t ) = b 1 ( t ) h 1 ( t ) L ˜ ( t ) + b 1 ( t ) G ˜ ( t ) , G ˜ ( t ) = ( b 2 ( t ) + c 2 ( t ) h 3 ( t ) b 1 ( t ) h 1 ( t ) ) G ˜ ( t ) + c 2 ( t ) P ˜ ( t ) , P ˜ ( t ) = ( c 3 ( t ) c 2 ( t ) h 3 ( t ) ) P ˜ ( t ) + F ( t ) ,
the matrix of which has the required upper triangular form (31).
Gathering the Equations (34) and (37) also in one system, for the functions h i ( t ) , i = 1 , 2 , 3 realizing representation (39), we have the quadratic non-autonomous system of differential equations:
h 1 ( t ) = b 1 ( t ) h 1 2 ( t ) + c 2 ( t ) h 1 ( t ) h 3 ( t ) + b 2 ( t ) h 1 ( t ) + c 2 ( t ) h 2 ( t ) + a 2 ( t ) , h 2 ( t ) = c 2 ( t ) h 2 ( t ) h 3 ( t ) + c 3 ( t ) h 2 ( t ) a 2 ( t ) h 3 ( t ) + a 3 ( t ) , h 3 ( t ) = c 2 ( t ) h 3 2 ( t ) b 1 ( t ) h 2 ( t ) ( b 2 ( t ) c 3 ( t ) ) h 3 ( t ) + b 3 ( t ) .
Initial conditions
h 1 ( 0 ) = h 1 0 , h 2 ( 0 ) = h 2 0 , h 3 ( 0 ) = h 3 0
can be added to system (40), where h 1 0 , h 2 0 and h 3 0 are constants. As a result, we have the Cauchy problem (40) and (41). Then, as a consequence of the Existence and Uniqueness of Solution Theorem [20], solutions h 1 ( t ) , h 2 ( t ) , h 3 ( t ) to this problem can be defined only locally, in a neighborhood of t = 0 . However, for this problem it is important to have solutions of the Cauchy problem (40) and (41) defined on the entire interval [ 0 , T ] (see [21,22,23]). That is we have to find the restrictions on the functions a i ( t ) , c i ( t ) , i = 2 , 3 and b i ( t ) , i = 1 , 2 , 3 , or what is the same, on the parameters of system (1) such that under these restrictions solutions of the Cauchy problem (40) and (41) are defined on the entire interval [ 0 , T ] . Then system (39) would be also defined on the entire interval, and hence, it would be possible to use the Generalized Rolle’s Theorem [24] to estimate the number of zeros of the switching function L ( t ) = L ˜ ( t ) .

6. Splitting Quadratic System

The analysis of the three-dimensional quadratic system (40) is quite complex, but it can be significantly simplified by splitting this system into two simpler two-dimensional quadratic subsystems. Specifically, it is easy to see that equations for h 2 ( t ) and h 3 ( t ) do not contain the variable h 1 ( t ) , and hence, can be considered separately. These two equations form the first of these quadratic subsystems:
h 2 ( t ) = c 2 ( t ) h 2 ( t ) h 3 ( t ) + c 3 ( t ) h 2 ( t ) a 2 ( t ) h 3 ( t ) + a 3 ( t ) , h 3 ( t ) = c 2 ( t ) h 3 2 ( t ) b 1 ( t ) h 2 ( t ) ( b 2 ( t ) c 3 ( t ) ) h 3 ( t ) + b 3 ( t ) , h 2 ( 0 ) = h 2 0 , h 3 ( 0 ) = h 3 0 .
To formulate the second two-dimensional subsystem, we introduce variable
h 0 ( t ) = h 1 ( t ) h 3 ( t ) + h 2 ( t ) .
Substituting (43) into the fist equation of system (40), we obtain equation
h 1 ( t ) = b 1 ( t ) h 1 2 ( t ) + c 2 ( t ) h 0 ( t ) + b 2 ( t ) h 1 ( t ) + a 2 ( t ) .
Besides, function h 0 ( t ) satisfies the following differential equation:
h 0 ( t ) = b 1 ( t ) h 0 ( t ) h 1 ( t ) + c 3 ( t ) h 0 ( t ) + b 3 ( t ) h 1 ( t ) + a 3 ( t ) .
Initial condition h 0 ( 0 ) = h 0 0 for variable h 0 ( t ) immediately follows from (43):
h 0 0 = h 1 0 h 3 0 + h 2 0 .
Combining (44)–(46) and adding the initial condition for function h 1 ( t ) from (41), we find the second two-dimensional quadratic subsystem
h 0 ( t ) = Φ 0 ( t , h 0 ( t ) , h 1 ( t ) ) = b 1 ( t ) h 0 ( t ) h 1 ( t ) + c 3 ( t ) h 0 ( t ) + b 3 ( t ) h 1 ( t ) + a 3 ( t ) , h 1 ( t ) = Φ 1 ( t , h 0 ( t ) , h 1 ( t ) ) = b 1 ( t ) h 1 2 ( t ) + c 2 ( t ) h 0 ( t ) + b 2 ( t ) h 1 ( t ) + a 2 ( t ) , h 0 ( 0 ) = h 0 0 = h 1 0 h 3 0 + h 2 0 , h 1 ( 0 ) = h 1 0 .
We note that the three-dimensional quadratic system (40) is equivalent to two two-dimensional quadratic subsystems (42) and (47). As analysis of two-dimensional systems is usually simpler, further we will deal with the later. In particularly, we have to analyze these quadratic subsystems to find the restrictions on the functions a i ( t ) , c i ( t ) , i = 2 , 3 and b i ( t ) , i = 1 , 2 , 3 , which guarantee that there exist solutions of subsystems (42) and (47) defined on the entire interval [ 0 , T ] .
Previously, such an approach to facilitate the analysis of solutions to quadratic systems was applied in [25,26,27].

7. Analysis of Quadratic Subsystems

Let us analyze the quadratic subsystems (42) and (47). For this, we will assume that the solutions h 0 ( t ) , h 1 ( t ) of system (47) and the solutions h 2 ( t ) , h 3 ( t ) of system (42) are determined on the maximal intervals of existence of such solutions [18,20]: specifically, we assume that functions h 0 ( t ) and h 1 ( t ) are determined on the interval Δ 01 = [ 0 , t 01 ] [ 0 , T ] , and functions h 2 ( t ) and h 3 ( t ) are determined on the interval Δ 23 = [ 0 , t 23 ] [ 0 , T ] . Without loss of generality, we suppose that t 01 T and t 23 T .
First, we consider system (47) with nonnegative initial conditions
h 0 0 0 , h 1 0 0 .
Let us apply to it the necessary and sufficient condition for solutions h 0 ( t ) , h 1 ( t ) to be nonnegative for all t [ 0 , t 01 ) [28]. It consists in satisfying the quasi-positivity condition
Φ 0 ( t , 0 , h 1 ) 0 , for all h 1 0 , Φ 1 ( t , h 0 , 0 ) 0 , for all h 0 0 .
As a result, we obtain the inequalities:
a 2 ( t ) 0 , c 2 ( t ) 0 , a 3 ( t ) 0 , b 3 ( t ) 0 ,
which hold for all t [ 0 , T ] due to inequalities (5) from Assumption 1 and Lemma 1.
Secondly, we consider system (42). Performing in this system the change of variables
h ˜ 2 ( t ) = c 2 ( t ) h 2 ( t ) , h ˜ 3 ( t ) = c 2 ( t ) h 3 ( t ) c 3 ( t ) ,
and introducing values
h ˜ 2 0 = c 2 ( 0 ) h 2 0 , h ˜ 3 0 = c 2 ( 0 ) h 3 0 c 3 ( 0 ) ,
we obtain the following system of equations:
h ˜ 2 ( t ) = Φ 2 t , h ˜ 2 ( t ) , h ˜ 3 ( t ) = h ˜ 2 ( t ) h ˜ 3 ( t ) + c 2 1 ( t ) c 2 ( t ) h ˜ 2 ( t ) + a 2 ( t ) h ˜ 3 ( t ) + ( a 2 ( t ) c 3 ( t ) a 3 ( t ) c 2 ( t ) ) , h ˜ 3 ( t ) = Φ 3 t , h ˜ 2 ( t ) , h ˜ 3 ( t ) = h ˜ 3 2 ( t ) + b 1 ( t ) h ˜ 2 ( t ) + c 2 1 ( t ) c 2 ( t ) ( b 2 ( t ) + c 3 ( t ) ) h ˜ 3 ( t ) + c 2 1 ( t ) c 2 ( t ) ( b 3 ( t ) c 2 ( t ) b 2 ( t ) c 3 ( t ) ) + ( c 2 ( t ) c 3 ( t ) c 2 ( t ) c 3 ( t ) ) , h ˜ 2 ( 0 ) = h ˜ 2 0 , h ˜ 3 ( 0 ) = h ˜ 3 0 .
Let the initial conditions of this system be nonnegative:
h ˜ 2 0 0 , h ˜ 3 0 0 .
Then, we apply to system (48) the necessary and sufficient condition for solutions h ˜ 2 ( t ) , h ˜ 3 ( t ) to be nonnegative for all t [ 0 , t 23 ) as well. It consists in satisfying the quasi-positivity condition
Φ 2 t , 0 , h ˜ 3 0 , for all h ˜ 3 0 , Φ 3 t , h ˜ 2 , 0 0 , for all h ˜ 2 0 .
As a result, we obtain the inequalities:
a 2 ( t ) 0 , a 2 ( t ) c 3 ( t ) a 3 ( t ) c 2 ( t ) 0 , b 1 ( t ) 0 , c 2 ( t ) ( b 3 ( t ) c 2 ( t ) b 2 ( t ) c 3 ( t ) ) + ( c 2 ( t ) c 3 ( t ) c 2 ( t ) c 3 ( t ) ) 0 ,
which hold for all t [ 0 , T ] due to direct calculations using inequalities (3), (5)–(7) and Lemma 1.
Next, we have to verify the continuation of the nonnegative solutions h 0 ( t ) , h 1 ( t ) , h ˜ 2 ( t ) and h ˜ 3 ( t ) of systems (47) and (48) to the entire interval [ 0 , T ] (see Theorem 1.6 in [28]). To do this, we define Lyapunov functions:
V 01 ( h 0 , h 1 ) = 0.5 h 0 2 + h 1 2 , V 23 h ˜ 2 , h ˜ 3 = 0.5 h ˜ 2 2 + h ˜ 3 2 ,
and estimate the derivatives d V 01 / d t and d V 23 / d t of these functions along the trajectories of the systems of differential Equations (47) and (48), respectively. (The derivatives are the scalar products of the gradients of these functions and the right-hand sides of the corresponding systems.) The Lyapunov functions satisfy the relationships:
d V 01 d t ( h 0 ( t ) , h 1 ( t ) ) = b 1 ( t ) ( h 0 2 ( t ) + h 1 2 ( t ) ) h 1 ( t ) + c 3 ( t ) h 0 2 ( t ) + b 2 ( t ) h 1 2 ( t ) + ( b 3 ( t ) + c 2 ( t ) ) h 0 ( t ) h 1 ( t ) + ( a 3 ( t ) h 0 ( t ) + a 2 ( t ) h 1 ( t ) ) < ( b 2 ( t ) + c 3 ( t ) ) + 0.5 ( b 3 ( t ) + c 2 ( t ) ) ( h 0 2 ( t ) + h 1 2 ( t ) ) + a 2 2 ( t ) + a 3 2 ( t ) h 0 2 ( t ) + h 1 2 ( t ) K 01 V 01 ( h 0 ( t ) , h 1 ( t ) ) , d V 23 d t h ˜ 2 ( t ) , h ˜ 3 ( t ) = h ˜ 2 2 ( t ) + h ˜ 3 2 ( t ) h ˜ 3 ( t ) + c 2 1 ( t ) c 2 ( t ) h ˜ 2 2 ( t ) + c 2 1 ( t ) c 2 ( t ) ( b 2 ( t ) + c 3 ( t ) ) h ˜ 3 2 ( t ) + ( a 2 ( t ) + b 1 ( t ) ) h ˜ 2 ( t ) h ˜ 3 ( t ) + ( ( a 2 ( t ) c 3 ( t ) a 3 ( t ) c 2 ( t ) ) h ˜ 2 ( t ) + c 2 1 ( t ) c 2 ( t ) ( b 3 ( t ) c 2 ( t ) b 2 ( t ) c 3 ( t ) ) + ( c 2 ( t ) c 3 ( t ) c 2 ( t ) c 3 ( t ) ) h ˜ 3 ( t ) ) < c 2 1 ( t ) | c 2 ( t ) | + 0.5 ( a 2 ( t ) + b 1 ( t ) ) h ˜ 2 2 ( t ) + h ˜ 3 2 ( t ) + ( ( a 2 ( t ) c 3 ( t ) a 3 ( t ) c 2 ( t ) ) 2 + c 2 2 ( t ) ( c 2 ( t ) ( b 3 ( t ) c 2 ( t ) b 2 ( t ) c 3 ( t ) ) + ( c 2 ( t ) c 3 ( t ) c 2 ( t ) c 3 ( t ) ) ) 2 ) 1 / 2 × h ˜ 2 2 ( t ) + h ˜ 3 2 ( t ) K 23 V 23 h ˜ 2 ( t ) , h ˜ 3 ( t ) .
Here, K 01 and K 23 are constants defined as following:
K 01 = 2 max t [ 0 , T ] ( b 2 ( t ) + c 3 ( t ) ) + 0.5 ( b 3 ( t ) + c 2 ( t ) ) + a 2 2 ( t ) + a 3 2 ( t ) , K 23 = 2 max t [ 0 , T ] ( c 2 1 ( t ) | c 2 ( t ) | + 0.5 ( a 2 ( t ) + b 1 ( t ) ) + ( ( a 2 ( t ) c 3 ( t ) a 3 ( t ) c 2 ( t ) ) 2 + c 2 2 ( t ) ( c 2 ( t ) ( b 3 ( t ) c 2 ( t ) b 2 ( t ) c 3 ( t ) ) + ( c 2 ( t ) c 3 ( t ) c 2 ( t ) c 3 ( t ) ) ) 2 ) 1 / 2 ) ,
We have to note that these inequalities are obtained under constraints
h 0 2 ( t ) + h 1 2 ( t ) 1 , h ˜ 2 2 ( t ) + h ˜ 3 2 ( t ) 1 ,
which follow from the relationships
lim t t 01 0 h 0 2 ( t ) + h 1 2 ( t ) = + , lim t t 23 0 h ˜ 2 2 ( t ) + h ˜ 3 2 ( t ) = +
(see Lemma in Section 14, Chapter 4 in [29]). That is, the estimates
d V 01 d t ( h 0 ( t ) , h 1 ( t ) ) K 01 V 01 ( h 0 ( t ) , h 1 ( t ) ) , d V 23 d t h ˜ 2 ( t ) , h ˜ 3 ( t ) K 23 V 23 h ˜ 2 ( t ) , h ˜ 3 ( t )
imply the required continuation of the nonnegative solutions h 0 ( t ) and h 1 ( t ) of system (47) and the nonnegative solutions h ˜ 2 ( t ) and h ˜ 3 ( t ) of system (48) to the entire interval [ 0 , T ] . Then the functions h 2 ( t ) and h 3 ( t ) are also defined on this interval. Taking into consideration that the functions h 0 ( t ) , h 1 ( t ) , h 2 ( t ) and h 3 ( t ) satisfying systems (42) and (47) are defined on the entire interval [ 0 , T ] , we conclude that system (39) is defined on this interval as well.
We note that such an approach for analyzing the continuation of solutions to quadratic systems was previously presented in [27].

8. Estimating the Number of Zeros of Switching Function L ( t )

Let us estimate the number of zeros of the switching function L ( t ) = L ˜ ( t ) . We will recall that the function F ( t ) (see system (39)) takes positive values on the entire interval [ 0 , T ] . Taking into consideration that L ( T ) = 0 (see (26)), we will show that this function has at most two distinct zeros on the interval [ 0 , T ) .
Suppose the contrary, that is, the function L ( t ) = L ˜ ( t ) has at least three distinct zeroes on this interval. Without loss of generality, we assume that these zeros are closest to t = T , which is also the zero of the function L ˜ ( t ) . Then, by virtue of the Generalized Rolle’s Theorem [24] applied to the first equation of system (39), the function G ˜ ( t ) has at least three distinct zeros on the interval ( 0 , T ) . Indeed, denoting these three zeros by τ 1 * , τ 2 * , τ 3 * , we integrate the first equation of system (39) on each of the intervals [ τ i * , τ i + 1 * ] , i = 1 , 2 and [ τ 3 * , T ] . Then, due to the equalities L ˜ ( τ i * ) = 0 , i = 1 , 2 , 3 and L ˜ ( T ) = 0 , we have the formulas:
τ i * τ i + 1 * e τ i * s b 1 ( ξ ) h 1 ( ξ ) d ξ b 1 ( s ) G ˜ ( s ) d s = 0 , i = 1 , 2 , τ 3 * T e τ 3 * s b 1 ( ξ ) h 1 ( ξ ) d ξ b 1 ( s ) G ˜ ( s ) d s = 0 .
The continuity of all three integrand factors and the positiveness of the first two of them in (49) imply the existence of at least one zero for the third factor, the function G ˜ ( t ) , on each of the intervals ( τ i * , τ i + 1 * ) , i = 1 , 2 and ( τ 3 * , T ) . Otherwise, the function G ˜ ( t ) is sign-definite at least on one of the indicated intervals. Therefore, the integral over such an interval in (49) is not zero, which is contradictory. Thus, the above arguments confirm the earlier conclusion about the number of zeros of the function G ˜ ( t ) .
Applying in a similar way this theorem to the second equation of system (39), we have to conclude that the function P ˜ ( t ) has at least two distinct zeros on the interval ( 0 , T ) . Finally, using the theorem in the third equation of this system, we conclude that the function F ( t ) has at least one zero on the interval ( 0 , T ) , which is contradictory. Hence, the assumption is incorrect. Thus, the required estimate for the number of zeros of the switching function L ( t ) = L ˜ ( t ) is established.
Taking into consideration Lemma 3, we can see that depending on the value of ( α 2 y * ( T ) α 3 μ z * ( T ) ) , the switching function L ( t ) is described by one of the following two relationships, namely:
  • if α 2 y * ( T ) α 3 μ z * ( T ) 0 , then
    L ( t ) > 0 , if 0 t < τ 1 * , = 0 , if t = τ 1 * , < 0 , if τ 1 * < t < τ 2 * , = 0 , if t = τ 2 * , > 0 , if τ 2 * < t < T , = 0 , if t = T ;
  • if α 2 y * ( T ) α 3 μ z * ( T ) < 0 , then
    L ( t ) < 0 , if 0 t < τ 1 * , = 0 , if t = τ 1 * , > 0 , if τ 1 * < t < τ 2 * , = 0 , if t = τ 2 * , < 0 , if τ 2 * < t < T , = 0 , if t = T ,
    where τ 1 * , τ 2 * [ 0 , T ) are zeros of L ( t ) .

9. Types of Optimal Control u * ( t )

The relationships (50) and (51) for the switching functions L ( t ) combined with the Formula (18) for the optimal control u * ( t ) enable us to immediately find its possible types. Specifically, depending on the value of ( α 2 y * ( T ) α 3 μ z * ( T ) ) , the optimal control u * ( t ) is of one of the following types:
  • if α 2 y * ( T ) α 3 μ z * ( T ) 0 , then
    u * ( t ) = u max , if 0 t τ 1 * , 0 , if τ 1 * < t τ 2 * , u max , if τ 2 * < t T ;
  • if α 2 y * ( T ) α 3 μ z * ( T ) < 0 , then
    u * ( t ) = 0 , if 0 t τ 1 * , u max , if τ 1 * < t τ 2 * , 0 , if τ 2 * < t T ,
    where τ 1 * , τ 2 * [ 0 , T ) are moments of switching.
Analysis of the Formulas (52) and (53) shows that depending on the relationship between the populations of cancer cells and healthy B-cells at the end (T) of the treatment period [ 0 , T ] , the optimal control u * ( t ) either takes the maximum value u max on the entire interval [ 0 , T ] that corresponds to the carrying out of the CAR T-cell therapy with the greatest intensity throughout a given time interval (emergency CAR T-cell therapy); or it has one switching from the minimum value 0 to the maximum value u max that describes the situation when, first there is the period of no therapy, and then, at the time t = τ * the switching occurs to the period of this therapy with the greatest intensity (CAR T-cell therapy delayed at the beginning); or has one switching from the maximum value u max to the minimum value 0 that reflects the situation when, first there is the period of the therapy with the greatest intensity, and then, at the time t = τ * the switching occurs to the period of no therapy (CAR T-cell therapy stopped at the end). Finally, the optimal control u * ( t ) might have two switchings: first there is the switching from the maximum value u max to the minimum value 0, and then the switching is carried out again to the maximum value u max . This corresponds to the situation when, first the CAR T-cell therapy is provided with the greatest intensity, then, at the time t = τ 1 * the switching occurs, and this therapy is stopped (rest period), then, at the time t = τ 2 * the therapy is returned to the greatest intensity (CAR T-cell therapy with rest period). Also, another type of the optimal control u * ( t ) with two switchings is possible: first there is the switching from the minimum value 0 to the maximum value u max , and then the switching occurs again to the minimum value 0. This describes the situation when, first, the CAR T-cell therapy is absent, it is delayed until time t = τ 1 * , at which the switching is carrying out, and this therapy is provided with the greatest intensity, until another time t = τ 2 * , after which the therapy is stopped again (CAR T-cell therapy delayed at the beginning, maximal in the middle and stopped at the end of the treatment period). In this case, at the beginning of the treatment interval, no injection of the chimeric cells is given, while the immunosuppressant is taken throughout the treatment interval at the maximal dosage. Then the injections with maximum intensity are carried out on the interval [ τ 1 * , τ 2 * ] , and from the moment τ 2 * injections are stopped.
Therefore, five types of optimal strategies of the CAR T-cell therapy can be distinguished: the emergency CAR T-cell therapy, CAR T-cell therapy delayed at the beginning, CAR T-cell therapy stopped at the end, CAR T-cell therapy delayed at the beginning, maximal in the middle and stopped after that at the end, and the CAR T-cell therapy with rest period in the middle of the treatment. These therapies are consistent both with the actual data of clinical studies [30,31] and the results of the numerical calculations from [14,32].

10. Numerical Results

First, we will show that the set of parameters values given by inequalities (3)–(6) is not-empty. To do this, we will transform these inequalities by introducing new variables:
A = α 2 α 3 , B = β 2 γ , C = β 3 η , D = η γ .
Due to them, the second inequality in (3), as well as inequalities (4) and (5) are rewritten as
A 1 , C 1 , A B , A C 1 , B C 1 .
We note that the inequality (6) is rewritten as
α 1 β 1 ( A C 1 ) D A B ,
and therefore it can be considered separately, like the first inequality in (3).
Then, in the positive octant of the coordinate system ( A , B , C ) , inequalities (54) define the not-empty set, which is shown in Figure 1.
Figure 2 demonstrates a similar set, but for the case when, instead of inequalities (3), the inequalities (24) hold. Such a set is given by the inequalities:
A 1 , C 1 , A B , A C 1 , B C 1 .
Therefore, further we will choose such values of the model parameters p, q, η , γ , α 1 , α 2 , α 3 , β 1 , β 2 , β 3 so that they satisfy the following two cases.
Case 1. The inequalities (3)–(6), or, what is the same, the inequalities (54) and (55) are satisfied.
Case 2. The inequalities (4)–(6),(24) or equivalent inequalities (55) and (56) are valid.
Now, we will demonstrate the results of a numerical solution of the minimization problem (10). The values of the corresponding parameters were adopted from [13,14,32] and chosen so as to show the numerical solutions for both of the above cases. Thus, Figure 3, Figure 4 and Figure 5 presented below correspond to Case 1, and Figure 6 and Figure 7 relate to Case 2.
These numerical calculations were conducted using “BOCOP–2.2.0” [33]. It is an optimal control interface, implemented in MATLAB, for solving optimal control problems with general path and boundary constraints and free or fixed final time. By a time discretization, such problems are approximated by finite-dimensional optimization problems, which are then solved by well-known software IPOPT, using sparse exact derivatives computed by ADOL-C. IPOPT is the open-source software package for large-scale nonlinear optimization. In the computations, we set the number of time steps to 3000 and the tolerance to 10 14 and use the sixth-order Lobatto III C discretization rule [33]. In Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7 the blue lines represent the optimal solutions, whereas the green dashed lines represent the solutions in the absence of the control. The red lines show the continuations of the optimal solutions of the population of B-leukemic or cancer cells y ( t ) for a longer time interval (in Figure 3, Figure 4 and Figure 5 for [ T , T + 5 ] , and in Figure 6 and Figure 7 for [ T , T + 10 ] ). The vertical dashed lines represent the switching moments. The end of the interval [ 0 , T ] is shown by the vertical dot-dashed lines.
In Figure 3, the case of so-called emergency CAR T-cell therapy over 30 days treatment period is shown. Optimal control takes the maximum value u max on the entire interval of the therapy, which helps to slow down the rate of proliferation of cancer cells, but still such therapy is not able to completely suppress the cancer. Note that after the end of the therapy period, cancer cells begin to grow rapidly in the first 5 days. This suggests that cancer is resistant to this scenario of CAR T-cell immunotherapy. Such emergency CAR T-cell therapy can only slow down the development of the cancer, and is a temporary solution for developing another course of treatment, but is not suitable for the patient’s recovery strategy.
The Figure 4 and Figure 5 reflect more optimistic therapy schedules. In Figure 4 the case of CAR T-cell therapy delayed at the beginning is considered. For the first 10 days the optimal control u * ( t ) takes the minimum value 0, and then the therapy is carried out with maximum intensity u max . In this case, CAR T-cells begin to grow and suppress the development of cancer. After the end of such therapy, cancer cells decrease to zero in the first 5 days. Such therapy leads to the destruction of cancer cells, which is an indicator of the higher efficiency of this course of therapy for the treatment of this type of cancer.
In Figure 5 CAR T-cell therapy that is delayed at the beginning, of its maximal intensity in the middle and stopped at the end is shown. At the beginning of therapy, in the first 4 days, the injection of the chimeric cells is not performed, and then it is carried out with the maximum intensity u max , which contributes to the reduction of cancer cells. On 21st day the therapy is again stopped. After the end of this therapy, cancer cells die out in the first 5 days. Here, as in the previous case, the treatment leads to the complete destruction of cancer cells.
In Figure 6 and Figure 7 the case of CAR T-cell therapy stopped at the end is considered. For the first 4–6 days the optimal control u * ( t ) takes the maximum value u max , and then the therapy is carried out with minimum intensity 0. In this case, CAR T-cells (chimeric cells) begin to grow and suppress the development of cancer. Although both figures are characterized by the same type of the optimal control, and both relate to Case 2, when α 2 α 3 , they correspond to different situations. Thus, Figure 6 refers to a situation when chimeric cells are as active in recognizing CD19 markers on healthy cells as on cancer cells, with α 2 = 0.8 and α 3 = 0.7 .
Figure 6 demonstrates a therapy that helps to slow down the rate of proliferation of cancer cells, but still cannot completely suppress the cancer. Note that after the end of the therapy period, for some time healthy cells still continue to grow, however, cancer cells begin to grow slowly in the first 10 days as well. This suggests that such treatment scenario can only slow down the development of the cancer, and is a temporary solution for developing another course of treatment, but is not suitable for the patient’s recovery strategy.
Figure 7 reflects the situation when α 2 > α 3 and so chimeric cells are more active in destroying cancer cells than on healthy cells. It is like a case of complete recovery from cancer.
Finally, we note that in all numerical calculations, the behavior of the cytokine population is the same regardless of its initial state w 0 . Due to the constant optimal control v * ( t ) , taking its maximum value 1 over the entire treatment period, the size of this population quickly stabilizes to the value λ / ν .

11. Conclusions

Recently, optimal control problems have become unfairly considered routine, since many authors use ready-made computer programs to solve them, without conducting any analytical investigation at all or reducing the latter to a minimum. It should be noted right away that in our work we use a Bolza-type objective function, the integral part of which does not contain square of the controls. Usually, a purely numerical solution of optimal control problems with such objective functions, without a preliminary analytical study of possible solutions, can be associated with certain computational difficulties. Anyway, it is difficult to look for a needle in a haystack. Therefore, we would like to emphasize the most important results obtained by us analytically.
In this paper, we stated and solved the optimal control problem with two bounded controls using the Pontryagin maximum principle. It should be noted that our analytical study of the controlled model allowed finding the type of optimal controls using the behavior of the switching functions. Thus, Theorem 1 states that the optimal control v * ( t ) is constant taking its maximum value over the entire time interval. This means that in order to prevent “a cytokine storm”, immunosuppressant drug must be taken at its maximal dosage over the entire treatment period. Further, Lemma 3 together with Theorem 2 state that depending on the model parameters and the values of the state variables at moment T, the optimal control u * ( t ) can take either its maximum or minimum value. Our methods of analytical investigating are well described in Section 5 (reducing the matrix of the linear non-autonomous system in a upper triangular form and then applying the Generalized Rolle’s Theorem to this system), in Section 6, where we split the three-dimensional quadratic system into two simpler two-dimensional quadratic subsystems, in Section 7, in which we provide an analysis of the quadratic subsystems, and finally, in Section 8, where we estimate the number of zeros of the switching function L ( t ) . Using this switching function properties, in Section 9 we established that the optimal control u * ( t ) has at most two switchings that was also demonstrated by independent numerical results presented in Section 10.
In this work, creating a controlled model of CAR T-cell immunotherapy, and solving the problem of an optimal control, we tried to explain some of the disadvantages of this procedure for treating leukemia and wished to find possible ways to increase its effectiveness. Within the framework of the proposed model and as a result of an analytical study of the switching function, five possible optimal treatment scenarios were obtained. The numerical results of the optimal solutions using BOCOP-2.2.0, as well as solutions without control, are shown in Figure 3, Figure 4, Figure 5, Figure 6 and Figure 7. Note that the graphs of the optimal controls are in an excellent agreement with the corresponding Formulas (52) and (53), when fulfilled, they should be obtained. Thus, the optimal controls demonstrated in Figure 3 (no switching) and Figure 4 (one switching) end with their maximum value that is in an agreement with the Formula (52) ( α 2 y * ( T ) α 3 μ z * ( T ) = 0.8 > 0 , and 0.1359 , respectively), while for the graphs in Figure 5 (two switchings), Figure 6 (one switching) and Figure 7 (also one switching), the Formula (53) is satisfied ( α 2 y * ( T ) α 3 μ z * ( T ) = 0.3112 < 0 , 0.1745 , and 0.1 , respectively), and the optimal controls end with their minimum value.
Despite the fact that the optimal strategies shown in Figure 4 and Figure 5 lead to the destruction of cancer cells, also on these graphs, a sharp increase in chimeric leukocytes is noted, while the population of other cells is significantly reduced. Apparently, in an organism weakened by chemo- and radiotherapy, chimeric cells, instead of killing only cancer cells, begin to compete with both types of cells and continue to actively divide.
It should be noted that in all situations presented in Figure 3, Figure 4 and Figure 5, the number of healthy lymphocytes is steadily decreasing over the given treatment period. It follows from the clinical experiences that the B-cell aplasia induced by CD19 CAR immunotherapy is rapidly reversed after CAR T-cells are ablated [34]. In our model, Figure 6 also shows the same behavior, and one can see that as soon as the therapy stops, an effect of B-cells aplasia is vanishing. (See the red piece of the z * ( t ) curve on the 5-day post treatment interval.) It is interesting that Figure 7 demonstrates a situation where no aplasia occurs and shows that with the optimal control, healthy cells do not die at all.
In Figure 3, Figure 4 and Figure 5, we indeed do not observe rapid increase in the healthy cells five days after the treatment period. How can you explain such a “strange” manifestation of the model? This could be explained by the fact that in the first three cases the body receives too many chimeric cells, which themselves proliferate in the process of interaction with cancer and healthy cells and therefore their influence remains after the end of treatment. Thus, they are not completely ablated, and their nonzero presence continues influencing the model dynamics.
After all, it was more logical to assume that in the first case (Figure 3), cancer cells would certainly be destroyed more efficiently than in the other two cases. The reason is apparently in the properties of the model itself. It is known that in the treatment of blood cancer, chimeric leukocytes with the antigen to the CD19 marker, which is also found in healthy leukocytes, are used. In our model, this manifests itself in the third terms of the first and third equations of the system. According to this model, some of the chimeric cells spend their energy not on cancer cells, but on the destruction of healthy cells. Moreover, if maximum injections of chimeric cells are made, then this will immediately weaken healthy cells and will not launch a competitive mechanism between them and cancer cells, expressed by the last terms of the second and third equations of the model (1).
Note that Figure 3, Figure 4 and Figure 5 correspond to Case 1, when α 2 α 3 and so chimeric cells very actively target healthy B-lymphocytes. Case 2, when α 2 α 3 , presented by Figure 6 and Figure 7 seems to be more realistic. Thus, in Figure 6, the concentration of healthy cells does not decrease in the course of treatment, even under initial conditions, when both cancerous and healthy cells have the same concentration. It is like a situation of modeling immunotherapy of leukemia in its severe stage. Moreover, it was found that if the initial concentration of cancer cells is significantly higher than the concentration of healthy cells, then the treatment according to Figure 7 scenario becomes more effective. Figure 7 demonstrates a case of recovery from leukemia, when cancer cells are completely destroyed, and healthy B-lymphocytes grow, reaching their equilibrium state.
It should be noted that for our numerical results, we chose such values of the model parameters in order to demonstrate all possible types of optimal control predicted analytically. It is possible that some of the Formulas (52) or (53) are not implemented at all in practice, but this was impossible to verify due to the unavailability of real data from clinical experiments.
Well, in principle, we need more specific chimeras that would recognize and kill only foreign cancer cells. For example, the CAR T-cell therapy can be specifically modified to stimulate the lymphocytes not too much, but not too weakly. In order not to stimulate the lymphocyte in the absence of a cancer cell, to better find rare markers on the surface of cancer cells. Mathematically such a case can be modeled by predator-prey model with further competition between cancer and healthy cells, mentioned in Introduction. This model would be implemented in our future studies.

Author Contributions

Conceptualization, E.K. and E.G.; methodology, E.K. and E.G.; software, A.K.; validation, E.K., E.G. and A.K.; formal analysis, E.K.; investigation, E.K. and E.G.; resources, E.K. and E.G.; data curation, not applicable; writing–original draft preparation, E.K., E.G. and A.K.; writing–review and editing, E.K. and E.G.; visualization, E.K., E.G. and A.K.; supervision, E.K.; project administration, not applicable; funding acquisition, not applicable. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

A.K. was funded by the Moscow Center for Fundamental and Applied Mathematics (Russia).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Heymach, J.; Krilov, L.; Alberg, A.; Baxter, N.; Chang, S.M.; Corcoran, R.B.; Dale, W.; DeMichele, A.; Diefenbach, C.S.M.; Dreicer, R.; et al. Clinical cancer advances 2018: Annual report on progress against cancer from the American Society of Clinical Oncology. J. Clin. Oncol. 2018, 36, 1020–1044. [Google Scholar] [CrossRef] [Green Version]
  2. June, C.H.; Sadelain, M. Chimeric antigen receptor therapy. N. Engl. J. Med. 2018, 379, 64–73. [Google Scholar] [CrossRef]
  3. Davenport, A.J.; Jenkins, M.R.; Ritchie, D.S.; Prince, H.M.; Trapani, J.A.; Kershaw, M.H.; Darcy, P.K.; Neeson, P.J. CAR-T cells are serial killers. Oncoimmunology 2015, 4, e1053684. [Google Scholar] [CrossRef] [Green Version]
  4. Brentjens, R.; Yeh, R.; Bernal, Y.; Riviere, I.; Sadelain, M. Treatment of chronic lymphocytic leukemia with genetically targeted autologous T cells: Case report of an unforeseen adverse event in a phase I clinical trial. Mol. Ther. 2010, 18, 666–668. [Google Scholar] [CrossRef] [PubMed]
  5. Kalos, M.; Levine, B.L.; Porter, D.L.; Katz, S.; Grupp, S.A.; Bagg, A.; June, C.H. T cells with chimeric antigen receptors have potent antitumor effects and can establish memory in patients with advanced leukemia. Sci. Transl. Med. 2011, 3, 95ra73. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Kochenderfer, J.N.; Dudley, M.E.; Feldman, S.A.; Wilson, W.H.; Spaner, D.E.; Maric, I.; Stetler-Stevenson, M.; Phan, G.Q.; Hughes, M.S.; Sherry, R.M.; et al. B-cell depletion and remissions of malignancy along with cytokine-associated toxicity in a clinical trial of anti-CD19 chimeric-antigen-receptor transduced T cells. Blood 2012, 119, 2709–2720. [Google Scholar] [CrossRef] [PubMed]
  7. Byun, D.J.; Wolchok, J.D.; Rosenberg, L.M.; Girotra, M. Cancer immunotherapy—Immune checkpoint blockade and associated endocrinopathies. Nat. Rev. Endocrinol. 2017, 13, 195–207. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. June, C.H.; Warshauer, J.T.; Bluestone, J.A. Is autoimmunity the Achilles’ heel of cancer immunotherapy? Nat. Med. 2017, 23, 540–547. [Google Scholar] [CrossRef] [Green Version]
  9. Davenport, A.J.; Cross, R.S.; Watson, K.A.; Liao, Y.; Shi, W.; Prince, H.M.; Beavis, P.A.; Trapani, J.A.; Kershaw, M.H.; Ritchie, D.S.; et al. Chimeric antigen receptor T cells form nonclassical and potent immune synapses driving rapid cytotoxicity. Proc. Natl. Acad. Sci. USA 2018, 115, 2068–2076. [Google Scholar] [CrossRef] [Green Version]
  10. Sadelain, M.; Brentjens, R.; Rivière, I. The basic principles of chimeric antigen receptor design. Cancer Discov. 2013, 3, 388–398. [Google Scholar] [CrossRef] [Green Version]
  11. Dotti, G.; Gottschalk, S.; Savoldo, B.; Brenner, M.K. Design and development of therapies using chimeric antigen receptor-expressing T cells. Immunol. Rev. 2014, 257, 107–126. [Google Scholar] [CrossRef] [PubMed]
  12. Clapp, G.; Levy, D. A review of mathematical models for leukemia and lymphoma. Drug. Discov. Today Dis. Model. 2015, 16, 1–6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Pillis, L.G.; Radunskaya, A. A Mathematical tumor model with immune resistance and drug therapy: An optimal control approach. J. Theor. Med. 2001, 3, 79–100. [Google Scholar] [CrossRef] [Green Version]
  14. Mostolizadeh, R.; Afsharnezhad, Z.; Marciniak-Czochra, A. Mathematical model of chimeric antigen receptor (CAR) T cell therapy with presence of cytokine. Numer. Algebr. Control Optim. 2018, 8, 63–80. [Google Scholar] [CrossRef] [Green Version]
  15. Khailov, E.N.; Klimenkova, A.D.; Korobeinikov, A. Optimal Control for Anti-cancer Therapy. In Extended Abstracts Spring 2018. Trends in Mathematics; Korobeinikov, A., Caubergh, M., Lázaro, T., Sardanyés, J., Eds.; Birkhäuser: Basel, Switzerland, 2019; Volume 11, pp. 35–43. [Google Scholar]
  16. Grigorenko, N.L.; Khailov, E.N.; Klimenkova, A.D.; Korobeinikov, A. Program and positional control strategies for the Lotka-Volterra competition model. In Stability, Control and Differential Games, Proceedings of the International Conference “Stability, Control, Differential Games” (SCDG2019), Yekaterinburg, Russia, 16–20 September 2019; Tarasyev, A., Maksimov, V., Filippova, T., Eds.; Springer Nature: Cham, Switzerland, 2020; pp. 39–49. [Google Scholar]
  17. Grigorenko, N.L.; Khailov, E.N.; Grigorieva, E.V.; Klimenkova, A.D. Optimal strategies in the treatment of cancers in the Lotka-Volterra mathematical model of competition. Tr. Inst. Mat. Mekhaniki URO RAN 2020, 26, 71–88. [Google Scholar]
  18. Lee, E.B.; Marcus, L. Foundations of Optimal Control Theory; John Wiley & Sons: New York, NY, USA, 1967. [Google Scholar]
  19. Pontryagin, L.S.; Boltyanskii, V.G.; Gamkrelidze, R.V.; Mishchenko, E.F. Mathematical Theory of Optimal Processes; John Wiley & Sons: New York, NY, USA, 1962. [Google Scholar]
  20. Hartman, P. Ordinary Differential Equations; John Wiley & Sons: New York, NY, USA, 1964. [Google Scholar]
  21. Grigorieva, E.V.; Khailov, E.N.; Korobeinikov, A. Parametrization of the attainable set for a nonlinear control model of a biochemical process. Math. Biosci. Eng. 2013, 10, 1067–1094. [Google Scholar]
  22. Grigorieva, E.V.; Khailov, E.N.; Korobeinikov, A. Optimal control for a SIR epidemic model with nonlinear incidence rate. Math. Model. Nat. Phenom. 2016, 11, 90–105. [Google Scholar] [CrossRef]
  23. Grigorieva, E.V.; Khailov, E.N.; Korobeinikov, A. Optimal control for an SEIR epidemic model with nonlinear incidence rate. Stud. Appl. Math. 2018, 141, 353–398. [Google Scholar]
  24. Dmitruk, A.V. A generalized estimate on the number of zeros for solutions of a class of linear differential equations. SIAM J. Control Optim. 1992, 30, 1087–1091. [Google Scholar] [CrossRef]
  25. Grigorieva, E.; Khailov, E.; Deignan, P. Optimal treatment strategies for control model of psoriasis. In Proceedings of the SIAM Conference on Control and its Applications (CT17), Pittsburgh, PA, USA, 10–12 July 2017; pp. 86–93. [Google Scholar]
  26. Grigorieva, E.; Khailov, E. Determination of the optimal controls for an Ebola epidemic model. Discret. Cont. Dyn.-S 2018, 11, 1071–1101. [Google Scholar] [CrossRef] [Green Version]
  27. Grigorieva, E.V.; Khailov, E.N.; Korobeinikov, A. Optimal controls of the highly active antiretroviral therapy. Abstr. Appl. Anal. 2020, 2020, 8107106. [Google Scholar] [CrossRef] [Green Version]
  28. Krasnosel’skii, M.A. The Operator of Translation along the Trajectories of Differential Equations; AMS: Providence, RI, USA, 1968. [Google Scholar]
  29. Demidovich, B.P. Lectures on Stability Theory; Nauka: Moscow, Russia, 1967. [Google Scholar]
  30. Castiglione, F.; Piccoli, B. Cancer immunotherapy, mathematical modeling and optimal control. J. Theor. Biol. 2007, 247, 723–732. [Google Scholar] [CrossRef] [PubMed]
  31. Barros, L.R.; Rodrigues, B.J.; Almeida, R.C. CAR-T cell goes on a mathematical model. J. Cell. Immunol. 2020, 2, 31–37. [Google Scholar]
  32. Pillis, L.G.; Radunskaya, A. The dynamics of an optimally controlled tumor model: A case study. Math. Comput. Model. 2003, 37, 1221–1244. [Google Scholar] [CrossRef]
  33. Bonnans, F.; Martinon, P.; Giorgi, D.; Grélard, V.; Maindrault, S.; Tissot, O.; Liu, J. BOCOP 2.2.0—User Guide. 27 June 2019. Available online: http://bocop.org (accessed on 8 March 2020).
  34. Paszkiewicz, P.J.; Frasle, S.P.; Srivastava, S.; Sommermeyer, D.; Hudecek, M.; Drexler, I.; Sadelain, M.; Liu, L.; Jensen, M.C.; Riddell, S.R.; et al. Targeted antibody-mediated depletion of murine CD19 CAR T cells permanently reverses B cell aplasia. J. Clin. Investig. 2016, 126, 4262–4272. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The orange area is the set of parameters that satisfy inequalities (54).
Figure 1. The orange area is the set of parameters that satisfy inequalities (54).
Games 11 00053 g001
Figure 2. The orange area is the set of parameters that satisfy inequalities (56).
Figure 2. The orange area is the set of parameters that satisfy inequalities (56).
Games 11 00053 g002
Figure 3. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 0.9 , γ = 1.1 , α 1 = 1.0 , α 2 = 0.8 , α 3 = 0.9 , β 1 = 0.9 , β 2 = 0.9 , β 3 = 1.1 , m = 0.2 , μ = 100 , u max = 1 , T = 30 and the initial conditions x 0 = 0.55 , y 0 = 0.8 , z 0 = 0.6 .
Figure 3. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 0.9 , γ = 1.1 , α 1 = 1.0 , α 2 = 0.8 , α 3 = 0.9 , β 1 = 0.9 , β 2 = 0.9 , β 3 = 1.1 , m = 0.2 , μ = 100 , u max = 1 , T = 30 and the initial conditions x 0 = 0.55 , y 0 = 0.8 , z 0 = 0.6 .
Games 11 00053 g003
Figure 4. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 1.2 , γ = 0.8 , α 1 = 1.2 , α 2 = 0.9 , α 3 = 0.9 , β 1 = 0.6 , β 2 = 0.75 , β 3 = 1.3 , m = 0.4 , μ = 100 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.6 , z 0 = 0.6 .
Figure 4. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 1.2 , γ = 0.8 , α 1 = 1.2 , α 2 = 0.9 , α 3 = 0.9 , β 1 = 0.6 , β 2 = 0.75 , β 3 = 1.3 , m = 0.4 , μ = 100 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.6 , z 0 = 0.6 .
Games 11 00053 g004
Figure 5. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 0.9 , γ = 1.0 , α 1 = 1.0 , α 2 = 0.8 , α 3 = 0.8 , β 1 = 0.9 , β 2 = 0.9 , β 3 = 1.0 , m = 0.5 , μ = 50 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.8 , z 0 = 0.8 .
Figure 5. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.1 , q = 1.0 , η = 0.9 , γ = 1.0 , α 1 = 1.0 , α 2 = 0.8 , α 3 = 0.8 , β 1 = 0.9 , β 2 = 0.9 , β 3 = 1.0 , m = 0.5 , μ = 50 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.8 , z 0 = 0.8 .
Games 11 00053 g005
Figure 6. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.0 , q = 1.1 , η = 0.8 , γ = 1.0 , α 1 = 1.1 , α 2 = 0.8 , α 3 = 0.7 , β 1 = 0.9 , β 2 = 0.8 , β 3 = 1.0 , m = 0.3 , μ = 1 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.7 , z 0 = 0.7 .
Figure 6. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.0 , q = 1.1 , η = 0.8 , γ = 1.0 , α 1 = 1.1 , α 2 = 0.8 , α 3 = 0.7 , β 1 = 0.9 , β 2 = 0.8 , β 3 = 1.0 , m = 0.3 , μ = 1 , u max = 1 , T = 30 and the initial conditions x 0 = 0.6 , y 0 = 0.7 , z 0 = 0.7 .
Games 11 00053 g006
Figure 7. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.0 , q = 1.1 , η = 0.8 , γ = 1.0 , α 1 = 1.1 , α 2 = 0.8 , α 3 = 0.1 , β 1 = 0.9 , β 2 = 0.8 , β 3 = 1.0 , m = 0.3 , μ = 1 , u max = 1 , T = 30 and the initial conditions x 0 = 0.7 , y 0 = 0.7 , z 0 = 0.1 .
Figure 7. The optimal solutions x * ( t ) , y * ( t ) , z * ( t ) and optimal control u * ( t ) for the values of the parameters r = 0.6 , p = 1.0 , q = 1.1 , η = 0.8 , γ = 1.0 , α 1 = 1.1 , α 2 = 0.8 , α 3 = 0.1 , β 1 = 0.9 , β 2 = 0.8 , β 3 = 1.0 , m = 0.3 , μ = 1 , u max = 1 , T = 30 and the initial conditions x 0 = 0.7 , y 0 = 0.7 , z 0 = 0.1 .
Games 11 00053 g007
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Khailov, E.; Grigorieva, E.; Klimenkova, A. Optimal CAR T-cell Immunotherapy Strategies for a Leukemia Treatment Model. Games 2020, 11, 53. https://doi.org/10.3390/g11040053

AMA Style

Khailov E, Grigorieva E, Klimenkova A. Optimal CAR T-cell Immunotherapy Strategies for a Leukemia Treatment Model. Games. 2020; 11(4):53. https://doi.org/10.3390/g11040053

Chicago/Turabian Style

Khailov, Evgenii, Ellina Grigorieva, and Anna Klimenkova. 2020. "Optimal CAR T-cell Immunotherapy Strategies for a Leukemia Treatment Model" Games 11, no. 4: 53. https://doi.org/10.3390/g11040053

APA Style

Khailov, E., Grigorieva, E., & Klimenkova, A. (2020). Optimal CAR T-cell Immunotherapy Strategies for a Leukemia Treatment Model. Games, 11(4), 53. https://doi.org/10.3390/g11040053

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop