1. Introduction
Cancer is a type of disease that causes abnormal growth of cells in the body, leading to the formation of carcinomas, which can eventually turn into malignant tumours. In 2020, the International Agency for Research on Cancer reported 19.3 million new cancer cases and nearly 10 million cancer-related deaths [
1]. There are various methods for treating cancer, and the treatment choice largely depends on the specific type of cancer and its impact on the patient’s health.
Radiotherapy is a commonly used cancer treatment technique involving exposing patients to ionising radiation to target cancerous cells. There are various forms of radiotherapy, such as Volumetric Modulated Arc Therapy (VMAT), Stereotactic Body Radiation Therapy (SBRT), and Intensity Modulated Radiation Therapy (IMRT), among others. IMRT is one of the most widely used methods of radiation therapy, and is delivered using a linear accelerator (linac) machine [
2] (
Figure 1). IMRT aims to effectively deliver the prescribed radiation dose to the cancerous cells while minimising the exposure of healthy structures [
3]. This is achieved by modulating the radiation passing through the linac using a multi-leaf collimator (MLC) device.
The IMRT technique enables the delivery of an optimal radiation dose to the tumour while minimising exposure to surrounding healthy organs [
4]. However, finding a treatment plan that balances the desired dose to the tumour and minimal side effects on surrounding organs is highly complex. To address this, the IMRT planning process is typically split into three sequential sub-problems: beam angle optimisation (BAO), fluence map optimisation (FMO), and multi-leaf collimator sequencing [
5]. First, the BAO problem aims to identify the best possible combination of beam angles from which the radiation should be delivered, also known as the beam angle configuration (BAC). Once a BAC has been selected, the optimal intensities for that BAC must be found (Fluence Map Optimisation problem, FMO). Finally, in the MLC sequencing problem, we compute a set of deliverable aperture shapes and their corresponding intensities.
This sequential approach ends with a treatment plan consisting of a large set of aperture shapes (with corresponding intensity values). Unfortunately, having too many apertures and larger intensity values per aperture means longer treatment time. The total delivery time of a treatment plan is calculated considering both the
beam-on time and the decomposition times. The beam-on time (BoT) is the total time a patient is exposed to radiation. The decomposition time is the time the linear accelerator needs to move from one bean angle in a BAC to the next one and the time needed by the MLC to move from one aperture shape to the other [
6,
7].
As a general rule, prolonged treatment time is something we want to avoid, as it increases the attention time per patient and, thus, reduces the number of patients treated per day [
8]. Further, longer treatment plans are more likely to suffer from inaccuracies produced, for instance, by patient’s movements.
One strategy commonly used to minimise the total delivery time of treatment plans generated using the sequential approach described before is to reduce the number of apertures. This can be made by “rounding” the intensity values computed during the FMO phase. Unfortunately, such strategies can severely impair the final treatment plan quality.
One alternative to the sequential approach that does not require any “rounding” process is the direct aperture optimisation problem (DAO). The main idea in DAO is to solve the FMO problem considering a limited number of deliverable aperture shapes and the physical constraints associated with the MLC sequencing.
To solve the DAO problem, we must find a set of aperture shapes and their associated intensity values [
9]. Usually, aperture shapes are optimised using heuristic strategies [
10,
11] or looking for the best possible combination of aperture shapes from a pre-defined set of apertures [
12]. To optimise intensity values, gradient-based optimisation methods are usually implemented. Compared to the sequential approach, the treatment plan obtained using DAO is not only deliverable, but also better regarding the objective function value [
13].
In this paper, we implement a particle swarm optimisation algorithm (PSO) combined with a mathematical programming technique to solve the DAO problem. PSO is recognised for effectively solving large-scale nonlinear optimisation problems through a good balance between exploitation (local search) and exploration (global search) [
14,
15]. While the PSO algorithm finds the best aperture shapes at each beam angle for a given BAC, the mathematical programming algorithm optimises each aperture’s intensity value. Also, we present a reparation heuristic for those aperture shapes that have a negligible effect on the treatment plan. To analyse our algorithm results, we use a set of clinical cases of prostate cancer and compare the treatment plans obtained by our algorithm to those obtained by the traditional sequential approach. The results show that our algorithm can find deliverable treatment plans using fewer apertures and significantly reduce the beam-on time compared to the traditional sequential approach. Compared to deliverable treatment plans with a similar number of apertures, our algorithm outperforms them regarding objective function values.
The remainder of this paper is organised as follows:
Section 2 introduces the general concepts of IMRT and DAO and the mathematical models we will consider in this study. In
Section 3, the algorithms we implement in this paper are presented.
Section 4 presents the results obtained by our algorithm applied to a prostate case. A discussion of these results is also included in this section. Finally, in
Section 5, we draw the main conclusions of our work and outline future work.
3. Solution Method
This section introduces our hybrid PSO algorithm to solve the DAO problem. The main goal of our algorithm is to obtain a high-quality treatment plan for IMRT that consists of a set of deliverable set of aperture shapes and their corresponding intensity values.
In
Section 3.1, we explain the original PSO algorithm proposed in [
35] and how we adapt it to the DAO problem. Then, in
Section 3.2, we define a reparation heuristic that uses a mathematical programming algorithm to improve the solution found by our PSO algorithm.
3.1. Particle Swarm Optimisation
The PSO is a nature-inspired population-based metaheuristic algorithm that imitates the social behaviour of birds in nature. This swarm consists of particles that search the objective space intending to find different high-quality solutions. Each particle is, in turn, composed of two fitness-related elements. The first element is the current fitness value of the i-th particle, and the second element is the fitness value of the best position the i-th particle has ever found during the algorithm execution, . Finally, the algorithm also keeps track of the best fitness value found so far, .
The PSO starts with an initial population of particles whose positions have been randomly assigned. The
i-th particle’s position at iteration
t is represented by
. The direction of particles in each iteration is determined by a velocity variable denoted by
that obtains its value from Equation (
4).
where
t is the current iteration,
is the best position the
i-th particle has achieved, and
is the best position any particle in the swarm has achieved. Parameter
is the constriction factor used to adjust the velocity of each particle and obtain a balance between exploration and exploitation. The parameter
w is the algorithm’s inertia and controls the last velocity contribution. Parameter
and
are learning factors for managing the impact of
and
. Parameters
and
are random numbers between 0 and 1. The new position of each particle is updated by adding the current velocity to the function of the position of the particle, as shown in Equation (
5).
In the proposed algorithm, we represented the particles as shown in
Figure 6. As we can see, the particle is composed of three attributes, namely the current fitness value (a real-valued attribute), its best singular position (a treatment plan), and its current position (a treatment plan). Naturally, the current fitness value results from evaluating the current particle’s position in the objective function considered by the algorithm.
As in any other heuristic algorithm, solutions generated by the PSO algorithms are not (necessarily) optimal. One drawback of the PSO implemented here is that there is no relation between the intensity values associated with an aperture and the aperture itself. Unfortunately, as mentioned before, the aperture shape optimisation problem is an NP-hard problem that mathematical programming solvers cannot solve in a reasonable time. Unlike this, the apertures’ intensity optimisation problem (also known as aperture weight optimisation [
9] or segmentation weight optimisation) is a convex continuous problem that can quickly be solved for solvers such as Gurobi (see, for instance, [
24,
25,
26]). Then, we propose to implement a hybrid PSO with a mathematical programming algorithm to solve the DAO problem. We use the PSO algorithm to find a set of aperture shapes and their corresponding intensities, which the linear solver will then optimise.
To better understand the algorithm’s behaviour, we can see in
Figure 7 how an aperture shape and the associated intensities change in each step. Considering the representation of the treatment plan mentioned in
Section 2.2, we can represent the aperture shape and the intensities obtained by the PSO algorithm like a tuple
. The intensities
are optimised by the solver at the end of each iteration of the PSO algorithm. As a result, we obtain a new tuple
where, as mentioned before, some intensities in
are set to zero by the solver. To improve the aperture shapes that resulted in (near) zero intensity value after the solver optimisation, we use a reparation heuristic. This heuristic only modifies
, leading to a new tuple
. Finally, the reparation heuristic passes on the Solver the tuple
so we can obtain the optimal intensity values for the new set of apertures
, generating the tuple
. Finally, the treatment plan defined by the tuple
is passed onto the PSO algorithm for the next iteration. This process is repeated until the PSO algorithm meets some termination criterion (e.g., it reaches a predetermined number of iterations).
3.2. Reparation Heuristic
As mentioned in the previous paragraph, as a result of the solver usage, we obtain the optimal intensities for each aperture at each beam angle. Since the optimisation solver is conditioned to the aperture shapes obtained at each iteration by the PSO algorithm, it is not unusual that some of the intensities end up in the optimisation process with values close to zero.
In practice, apertures with associated intensities near to zero value are equivalent to having no aperture at all, i.e., an insignificant (or null) impact on the treatment plan. Further, improving the shapes of those apertures with intensity values close to zero is complex. To address this issue, we propose a reparation heuristic that allows us to avoid (as much as possible) those apertures with a negligible effect on the treatment plan.
Figure 8 shows a numerical example of the intensities optimisation process. On top of the image, we can see four aperture shapes with their associated intensities. We can see that all the intensities are modified on the bottom part of the same image.
The main idea of the reparation heuristic proposed here is to replace those apertures with intensity values closer to zero with apertures that (hopefully) can help after running the solver. Particularly, we aim to irradiate those parts of the aperture shape that are not irradiated from any other aperture of the beam angle.
To this end, we generate a new aperture that results from overlapping the apertures with an intensity value greater than 1. We call this new aperture the “overlapped aperture”, and the ones with intensity values greater than one “the original apertures”.
Figure 9 shows an example of the apertures overlapping process. Fields with a value of 1 correspond to beamlets radiation passes through. Fields with zero value correspond to the beamlets closed in the original apertures. Finally, −1 corresponds to the inactive beamlets (those that do not hit the tumour).
As shown in
Figure 9, the overlapped aperture corresponds to the original apertures’ aggregation, i.e., the overlapped aperture keeps open beamlets that are open in at least one original aperture and sets closed those beamlets that are closed in all the original apertures.
As a result of this aggregation process, we have a matrix showing all the beamlets currently open in at least one original aperture. As mentioned above, we want to diversify our search, and thus, we want to irradiate from those fields that are not currently in use.
To this end, the reparation heuristic generates the complementary matrix of the overlapped matrix, as shown in
Figure 10.
It is important to keep in mind some considerations about the application of our reparation heuristic. First, in some cases, the shape of the complementary matrix does not satisfy the MLC physical constraints and can not directly replace the original aperture. In that case, we can select part of the aperture that is actually deliverable and remove those parts that do not satisfy MLC physical constraints. As shown in
Figure 11, we divide the complementary matrix into two different apertures that satisfy the MLC physical constraints.
Second, suppose the number of original apertures with an intensity value close to zero is more than one. In that case, we must divide the complimentary matrix to generate as many new apertures as needed. Note that this situation can help us to solve our first consideration (undeliverable aperture shapes), as we can divide the complementary matrix in such a way that all (or most of) the open beamlets in the complementary matrix can be added to the new apertures (see, for instance,
Figure 11).
Finally, the reparation heuristic replaces those apertures with (near) zero intensity values by the aperture shapes obtained in the previous step. We need to note that, in some cases, one or more apertures still with (near) zero intensity values as the number of deliverable aperture shapes produced by the reparation heuristic is less than the number of apertures with (near) zero intensity values. Once we obtained the repaired aperture shapes, we optimised the intensities values, as shown in
Figure 12.
4. Computational Experiments
This Section introduces the experiments performed by our algorithm and analyses the obtained results. The Section is divided into three subsections. In
Section 4.1, we introduce the set of instances considered in our study and the parameters used by the PSO. In
Section 4.2, we obtain the best parameters for the PSO algorithm using the framework Irace [
36]. Finally, in
Section 4.3, we compare our PSO to two algorithms used in the literature. Comparison is made regarding the obtained objective function values, the required number of aperture shapes, and their beam-on time.
4.1. Experimental Setup
In this work, we perform a set of initial experiments on the prostate case instance from
CERR package [
37] and also examine a prostate case acquired from Clinica Alemana de Santiago, Chile. This particular patient is denoted as TRT001 [
19]. We use this prostate case to evaluate the performance of the PSO algorithm introduced in
Section 3.1. For the CERR and TRT001 cases, we consider three organs: the prostate, where the tumour is located, the bladder, and the rectum (see
Figure 13). We label the rectum and the bladder as organs at risk (OARs) and the prostate as planning target volume (PTV).
The number of voxels per region in the CERR case is 15,172 for the prostate, 22,936 for the bladder and 18,128 for the rectum. We consider 72 beam angles, all of which are on the same plane. Similarly, in the TRT001 case, the prostate comprises 13,081 voxels, the bladder holds 19,762 voxels, and the rectum encompasses 8500 voxels.
Like other works in the problem we consider a set of 14 equidistant BACs [
12,
17,
18,
21,
24,
25]. Each BAC consists of five beam angles for the CERR and TRT01 instances, as shown in
Table 1.
Table 2 details the prescribed doses,
, considered per each organ at all the instances and the weights for both under-dose
and overdose
.
4.2. Irace Parameter
To optimise the parameter used in the PSO algorithm implemented, we tried the package Irace [
36]. This package is an extension of the iterative F-race algorithm (I/F race) [
38,
39]. The principal use of this method is for the automatic configuration of optimisation algorithms. This is performed by finding the most appropriate configuration of parameters from a set of instances executed in the algorithm. This package has also been used for the parameters optimisation of the algorithm proposed by Caceres et al. [
21]. That said, using IRace aims to find suitable parameters for our PSO implementation. The parameters to optimise within the IRace package are shown in
Table 3.
Table 4 shows the results provided by the IRace package:
The number of iterations used by our algorithm is given by Equation (
6), where we set the evaluation to 40,000 (number obtained testing the algorithm) and an
of 518 (given in
Table 4) doing a total of 95 iterations, like limits for the algorithm.
4.3. Experiments on Test Instances
In our experiments, we measure the performance of the proposed PSO using the best-found parameter configuration, described in
Section 4.2. Note that we run our algorithm 30 times per BAC, as 30 is a widely accepted value for statistical analysis [
40].
Table 5,
Table 6,
Table 7 and
Table 8 report the results obtained by both the sequential and the PSO approaches when applied to the CERR and TRT001 cases. As mentioned in the previous section, the IMRT sequential approach obtains a fluence map, optimising the dose-volume model of the FMO problem. Next, the MLC sequencing problem is solved for the resulting fluence maps by using a well-known algorithm from [
7], which finds a set of apertures that minimise the BoT. In
Table 5 and
Table 6, column
corresponds to the cost of the optimal fluence map using the function in Equation (
2). Columns
,
and
correspond to the cost of the fluence maps with intensities rounded to the nearest integer, the nearest multiple of 2, and the nearest multiple of 4, respectively. For each rounding, we also report the number of apertures generated by the MLC sequencing algorithm (#ap) and the BoT.
Table 7 and
Table 8 report the results obtained by our PSO algorithm. Due to its stochastic nature, the strategy was run 30 times on each instance. We report the mean over the 14 instances of each set, the best value for each set, the mean number of apertures with intensity different to zero, and the mean BoT. We need to point out that apertures for which the intensity is set to zero by the mathematical programming solver in the last iteration are considered closed.
When comparing the objective function value reported by the PSO and the optimal (but not deliverable) fluence map, the difference is 29.41% and 30.49% for CERR and TRT001, respectively, with the PSO algorithm being the one with the higher objective value. This difference in the objective function value is reduced when the rounding process is applied to the optimal fluence map. For instance, when the optimal fluence map is rounded to the nearest multiple of 1 and 2, the difference is 25.39% and 13.71% for the CERR case and 26.98% and 17.98% for the TRT001 case, respectively. Further, rounding to the nearest multiple of 4 leads to an impairment in the quality of the rounded treatment plan that makes solutions provided by our PSO algorithm become better in all cases. Further, even though our algorithm is not better than the and treatment plans (with respect to the objective function value), the number of aperture shapes our solutions need is always smaller than the apertures needed by the solutions obtained by the sequential approach. Also, it is interesting to note that even though our approach is not directly focused on reducing the beam on time value, our approach reports better values in all cases compared to the sequential approach. This is mainly because of the fact that we use far fewer aperture shapes in our final treatment plans.
In addition, we report the dose-volume histogram (DVH) for the CERR and TRT001 in
Figure 14 and
Figure 15, respectively. DVH curves specify the received dose level by different volumes of structures. In the case of CERR, we can see that our algorithm obtains treatments that do not overdose the voxels in the PTV. Unlike this, the solutions obtained by the optimal fluence map overdose above 30% of PTV voxels. When observing the OARs, our algorithm overdoses more voxels than the optimal fluence map. However, the max overdose received for the voxels is less than the received by the optimal fluence map. In the case of TRT001, the PSO and the optimal fluence map have a similar curve, where both do not overdose the PTV. When observing the OARs, our algorithm overdoses more voxels than the optimal fluence map. It is necessary to remember that the optima fluence map is not a deliverable treatment and needs to pass for the MLC sequencing problem.
5. Conclusions
This paper introduces a hybrid heuristic based on PSO and mathematical programming to solve the DAO problem in radiation therapy for cancer treatment. The proposed PSO heuristic finds a set of deliverable aperture shapes and their corresponding intensities for each beam angle within a clinically acceptable time. Further, even though our heuristic algorithm was allowed to use only five aperture shapes per beam angle, they could find very competitive treatment plans.
Comparing our algorithm with the traditional sequential approach shows that the proposed algorithm can obtain competitive results regarding the objective function value. However, the difference with the optimal solution generated by the FMO is still significant. On the opposite, when evaluating the number of apertures generated by our algorithm, we can observe a substantial reduction compared to the traditional approach. This is very important as fewer aperture shapes mean, in general, shorter treatment times, which is something desirable from a clinical point of view.
In future work, we can see different research lines to improve the obtained results. First, we believe that improving the reparation heuristic to activate apertures that have intensities close to zero would allow us to find better-quality treatment plans. This is because the more apertures are used, the better the treatment plan quality. Note that, as mentioned before in the paper, this would be at the cost of longer treatment times. In addition, we seek to extend our single-objective PSO algorithm to a multi-objective one. This is because IMRT is an inherently multi-objective problem, since there is a compromise between tumour irradiation and avoiding damage to the organs at risk. Extending our approach to a multi-objective one is a challenging task from both computational and clinical points of view. However, we are sure that addressing the problem as a multi-objective one will help us better understand the underlying trade-offs between tumour control and OARs sparing.