1. Introduction
In radiotherapy, accurate modeling of uncertainties in dose is essential for attaining high-quality, robust treatment plans. This work explores different distributions to analytically model the most relevant dose-distribution plan-metric in radiotherapy treatment plans.
Radiotherapy is a cornerstone of cancer treatment, with nearly two-thirds of patients with loco-regional tumors in Western countries undergoing radiotherapy [
1]. Over recent decades, the field has seen rapid advancements including substantial improvements in radiation delivery precision [
1], and the incorporation of innovative technologies like heavy-ion beams [
2].
In radiotherapy treatment planning, simulation and optimization are conducted on patient imaging data to delineate an optimal treatment plan. This plan establishes the geometrical and physical parameters of the radiation beam(s) to ensure a uniform prescribed radiation dose within the target volume while minimizing the dose absorbed by surrounding healthy tissues [
3]. The absorbed dose, characterized as the absorbed energy per unit mass deposited by ionizing radiation, is measured in grays (Gy) [
4].
A paramount challenge in this domain arises from executional and preparational uncertainties impacting the absorbed dose. Traditional approaches to account for these uncertainties include the incorporation of margins by enlarging the target volume based on probabilistic recipes derived from population estimates [
5], or personalized quantification through explicit error scenarios [
6]. These error scenarios are either determined as worst-case estimates for robust optimization [
7,
8], or derived from random sampling using probability distributions to parameterize the uncertainty model (e.g., [
9]).
These methodologies, however, are constrained by both the inherent limitations of the underlying uncertainty model and the computational cost of simulations. A notable drawback is the implicit nature of the mathematical transformation from input (uncertainty model) to output (probability distribution over dose or plan metrics), which obscures an explicit understanding and estimation of the resultant probability distributions [
10]. This opacity has led to a scarcity of precise estimations and parameterizations of probability distributions over plan metrics [
10,
11,
12,
13]. Thus, an exploration into more accurate and computationally efficient statistical frameworks could significantly enhance the analysis of patient data and potentially lead to more reliable and optimized treatment plans.
An alternative approach to circumvent the limitations of conventional methods is to employ an analytical methodology to propagate dose uncertainties. The foundational method employed in this work is termed analytical probabilistic modeling (APM) [
14]. Distinct from sampling-based strategies, APM encapsulates a probabilistic dose calculation algorithm, furnishing closed-form expressions for the moments of the resultant probability density of absorbed dose, contingent on assorted assumptions embedded in the input uncertainty model [
10,
15,
16,
17]. This approach markedly enhances computational efficiency by leveraging the analytical formulation for both uncertainty quantification and probabilistic optimization. Although the runtimes are highly dependent on the used hardware (CPU or GPU, number or cores, etc.) and effort in computational optimization, uncertainty propagation with APM can be performed in seconds to minutes, while uncertainty propagation with Monte Carlo sampling would take minutes to hours and suffer from statistical noise (for detailed runtimes of various approaches, see [
10,
11,
18,
19]).
A quintessential plan metric in this domain is the dose–volume histogram (DVH) [
20]. A DVH condenses the information from the three-dimensional dose distribution into a cumulative histogram: For a specified volume, such as an organ at risk or the clinical target volume, the DVH denotes the fraction of the volume of interest that has received a dose value
or less. This is a cumulative summation of the nominal dose
over the voxels in
V. The nominal DVH point,
, is computed as
where
symbolizes the Heaviside step function (e.g., [
13]). In essence, only the voxels that have received a nominal dose
contribute with a weight of
to the sum. It is axiomatic that
as the DVH points represent a fraction.
Past research [
13] illustrated how APM could be used to analytically compute the moments of DVH points, i.e., the volume fraction enveloped at least with a specific dose
. In this investigation, a multivariate normal distribution of the dose over the voxels was presupposed. However, a closer inspection reveals that this Gaussian model may be physically contentious, given the volume fraction’s intrinsic constraint to positive values. Despite this, the Gaussian model was evaluated with patient data, and comparison to Monte Carlo sampling showed satisfactory results [
13].
In this work, our objective is to investigate potentially more adept choices of probability distributions to model dose and DVH points under uncertainty. We perform an evaluation of a variety of positive continuous distributions as the marginal probability density function of the dose over a voxel, juxtaposing them with the empirical distribution derived from Monte Carlo sampling. To encapsulate the spatial correlation in dose, we opt for Gaussian and Ali–Mikhail–Haq Copulas [
21,
22], as they are a suitable framework to amalgamate different marginal distributions over individual voxels. Our examination of dose-distribution assumptions bifurcates into two scenarios: (a) on simulated DVHs engendered using a one-dimensional prototype constructed atop an open-source “APM toolbox” [
23], and (b) on patient data for a lung case subjected to proton therapy.
2. Materials and Methods
In this section, we introduce the mathematical formulation of APM, accentuating the computation of the first two moments of DVHs under APM. This includes the introduction of the probability distributions and Copulas alongside a narrative of the quantitative analysis.
2.1. Analytical Probabilistic Modeling for Radiotherapy Treatment Planning
Analytical probabilistic modeling (APM) for radiotherapy treatment emerged in the medical physics community through the work of Bangert et al. [
14], and has since evolved through subsequent refinements aimed at enhancing computational performance [
10], adapting to biological-ion planning [
24], accommodating varying radiotherapy schedules [
16], and extending to encapsulate uncertainty in plan quality metrics [
13]. Central to the APM approach is the reconfiguration of radiotherapy treatment planning algorithms to enable a closed-form propagation of uncertainties. This transpires through the calculation of moments of the resultant probability distribution, employing the “law of the unconscious statistician” [
14,
25] as the mechanism for computing the expectation value of a function concerning a random variable. In this section, we provide a brief overview of the APM concept as it pertains to this work.
In depicting dose deposition through analytical probabilistic modeling (APM), we divide the patient into V voxels, serving as 3D pixels. To modulate the intensity of a radiation beam, it is itself discretized into B beamlets composing the radiation field. For a given beamlet intensity vector , the dose distribution can be represented as a vector and computed via the linear transformation with a dose–influence matrix containing the individual, normalized beamlet dose distributions.
When taking into account uncertainties such as positional shifts of the patient, we can represent these uncertainties as a random vector , whose realizations form a single fraction treatment scenario. Then, our dose–influence matrix D becomes a random variable depending on , propagating the uncertainties further to the dose vector . APM now assumes to follow a multivariate normal distribution and derives closed-form expressions for computing the elements of the matrix D. Since these expressions can be integrated against the Gaussian PDF over , one can compute moments such as the expected value and covariances of matrix elements in closed-form as well.
Consequently, APM does not propagate the entire probability distribution, but accurately captures its propagated moments, which is beneficial for radiotherapy treatment planning [
10,
14]. The computational complexity of APM on patient cases, however, restricts it to only the computation of the first and second moments. Thus, any further attempts to propagate uncertainties to quality metrics reliant on the dose
are confined to the accuracy and appropriateness of the selected matched distribution over
, given the first and second moments.
To understand the relevance of the dose–influence matrix
D it needs to be added that in any treatment planning system the values of the beamlet intensity vector
are optimized so that the dose meets the requirements of the treatment plan. A common, simple objective function could be formulated as penalized least squares to a prescribed dose
[
14],
where
is the nominal dose, and
is a diagonal matrix with user-chosen penalties for each voxel. Larger penalty values can be set in voxels closer to, or inside, organs at risk. A minimizer adjusts the beamlet intensity vector
, recomputes the dose vector
(using the dose–influence matrix), and reevaluates the objective function
until it reaches a minimum. The dose–influence matrix then allows for fast computation of the dose vector
during optimization, and with APM can also be translated to probabilistic formulations optimizing the expected value of the objective [
10,
14,
16].
2.2. Analytical Probabilistic Modeling of Dose–Volume Histograms
We summarize here only the main results of APM applied to the computation of dose–volume histograms relevant to this work. The full derivation can be found in Wahl et al. [
13].
Assuming a given distribution of the dose, the expected value of a DVH point can be computed as [
13]
where
is the marginal CDF of the dose at voxel
i.
Furthermore, the variance in a DVH point can be computed as [
13]
where the first term represents the second moment of a DVH point, with
being the marginal bivariate CDF of the dose at voxels
i and
l.
An alternative approach for calculating the moments of DVH points, instead of using APM, is to generate Monte Carlo simulations of . This method involves sampling the uncertainty vector times from a Gaussian distribution. For each sample s, the nominal dose vector is computed as the product , where is the dose–influence matrix specific to sample s. Through this Monte Carlo technique, the expected value of a DVH point is estimated using the sample mean, while the variance is determined from the sample variance. However, this approach can be computationally intensive, especially for large numbers of voxels and beamlets. In our study, we utilize this Monte Carlo method as a reference for comparison against the APM-derived moments of the DVH points.
2.3. Probability Distribution Models
As highlighted in
Section 1, previous studies using APM conceptualized the dose distribution as a multivariate Gaussian [
13], denoted by
. Theoretically, adopting a Gaussian model for dose distribution is somewhat challenging to justify, considering already the simple fact that dose values should be confined to
, whereas a Gaussian distribution encompasses the entire real line
. Nevertheless, empirical evidence, as demonstrated in Wahl et al. [
13], has shown that the Gaussian model is sufficiently accurate for the practical computation of probabilistic DVHs. In our current research, we aim to investigate alternative models for dose distribution to enhance our understanding and improve the accuracy of dose-distribution modeling.
In the computation of the expectation value of DVH points, Equation (3) necessitates a model for the marginal univariate CDFs. To this end, we examine various distribution types, including Gaussian, log-normal, four-parameter beta, gamma, and Gumbel distributions. The PDFs under consideration are continuous, which is a prerequisite for using them as the marginal PDFs of Copulas. As the common choice, the Gaussian normal distribution serves as the benchmark. The Gumbel distribution, as an extreme-value distribution, is tested due to its ability to model skewness in combination with a one-sided tail, possibly representing few extreme high dose values. Since those distributions have real support, the gamma and log-normal distributions could provide alternatives supported only on the positive real line . Finally, the four-parameter beta distribution is chosen for its versatility. The selected distributions, thus, allow the influence of the support of the distribution in modeling the dose to be evaluated, challenging the Gaussian assumption.
Moreover, for calculating the variance in DVH points, Equation (4) requires analytical expressions for bivariate CDFs. In addressing this, we employ Copulas [
22] to construct bivariate CDFs by merging different marginal PDFs. Consider the dose in voxel
i, denoted as
, with its marginal CDF
, and similarly, the dose in voxel
l,
, with its marginal CDF
. We generate uniformly distributed random variables
and
through the probability integral transform:
Subsequently, the bivariate CDF is constructed using the Copula
:
where
represents the Copula parameter, selected to match the correlation coefficient between
and
. Our exploration includes two types of coupling functions
: the Gaussian and the Ali–Mikhail–Haq [
21] Copulas. The Gaussian Copula is defined as
where
is the inverse CDF of a standard normal distribution, and
is the bivariate CDF of a normal distribution with a mean vector of zero and covariance matrix with elements
and
. Notably, setting
to the correlation coefficient between
and
renders the Gaussian Copula with Gaussian marginals equivalent to the bivariate CDF of a (multivariate) Gaussian distribution.
The Ali–Mikhail–Haq Copula, a member of the Archimedean Copula family, is defined as follows [
21]:
Our selection of these two Copula models is motivated by their clear relationship between the Copula parameter and the correlation between and . It is noteworthy that in both models, when the correlation coefficient equals zero, the equation simplifies to , which aligns with the expected behavior of independent variables.
Finally, we explore the use of beta distributions for modeling Monte Carlo-sampled distributions of DVH points, and compare them with the Gaussian distributions. The rationale for considering a beta distribution is compelling: let us focus on a DVH point at a specific dose . For a voxel within a VOI, if its dose d is greater than or equal to (or less than ), it is counted towards (or excluded from) the computation of the respective relative volume. This scenario can be likened to a success (or a failure) in a Bernoulli experiment. Ideally, if DVH points were independent, such a setup would naturally align with a Bernoulli process, making the beta distribution an ideal fit since it represents the probability distribution of success in this context. However, in practical terms, the voxel-based Bernoulli experiments tend to be interdependent.
2.4. Testing of the Models
Our initial evaluation of various dose-distribution assumptions is conducted using simulated DVHs on a one-dimensional prototype. This prototype is developed using the open-source “APM toolbox” [
23], which facilitates the creation of simple, low-dimensional geometrical configurations for experimental purposes. The prototype’s analyzed volume extends from −50 mm to 50 mm. It includes a cancerous volume of interest (VOI) spanning from −20 mm to 20 mm, and a healthy VOI extending from 20 mm to 40 mm. The setup consists of 100 voxels, equating to one voxel per millimeter. A prescribed dose of
is administered to the target VOI. In our analysis, we specifically consider the scenario where uncertainties in the pencil-beam positions are perfectly correlated, representing a realistic approximation for solid translations of the patient.
In the subsequent phase of our study, we assess the proposed models using a patient lung case treated with proton therapy. To streamline our analysis, namely, to minimize computation time and simplify the geometry, we employ a single anterior beam with a gantry angle of 0°. This approach is based on previously established reference dose distributions and corresponding error scenarios detailed in Wahl et al. [
26]. These reference data were computed using the open-source radiation treatment planning software matRad [
24], which allows for a comprehensive and precise evaluation within the context of proton therapy.
For the patient lung case, we additionally introduce and analyze an extra model, termed the “resampled Gaussian”. This model is derived by generating 100 random multivariate Gaussian samples of the dose, using the mean and standard deviation extracted from the Monte Carlo samples. This inclusion allows for a comprehensive comparison of the models, assessing their ability to replicate the statistical properties observed in the sampled data.
In both simulated scenarios and actual data assessments, we evaluate the accuracy of the dose uncertainty models by comparing them against Monte Carlo samples. The efficacy of the various marginal distributions in accurately representing the sampled marginal histograms is examined using Pearson’s chi-square goodness-of-fit test. The test statistic is built as
where
j runs over the histogram bins,
represents the sampled value of the histogram bin, and
represents the modeled value. The test statistic follows approximately a
distribution when the number of entries is large [
27]. While the chi-square test is a binned approach, and thus, has certain limitations, we opt for it due to the distribution independence of its test-statistic, unlike other unbinned goodness-of-fit tests such as the Kolmogorov–Smirnov or Anderson–Darling tests. Additionally, we assess the overall impact on the DVHs by comparing the mean and standard deviation of the DVH points derived from the Monte Carlo samples with those obtained using different uncertainty models (calculated via Equation (3) and Equation (4), respectively).
4. Discussion
The primary objective of this study was to explore realistic models describing cumulative histograms arising as so-called DVHs in radiotherapy. We based our work on previous approaches relying on APM [
13,
14], interpreting the computation of the DVH as a series of Bernoulli experiments on uncertain dose values. With this interpretation, uni- and bivariate CDFs are required for computation of mean and variance in the DVH bins.
The integration of Copulas in constructing bivariate CDFs offers the flexibility to fuse different marginal PDFs. This approach is particularly beneficial in accurately representing the joint distribution of voxels across various sections within a volume of interest (VOI), potentially enhancing the precision (compared to the standard multivariate Gaussian [
13]) and applicability of DVH models in medical physics.
Explicit modeling of uncertainty in (cumulative) histograms, including closed-form propagation from the underlying data, seems to remain an exotic task. In the search for comparable approaches, only a few published works in other fields for specialized cases could be identified. For example, Vermeesch [
29] derives a frequentist and Bayesian approach to quantify uncertainties in categorical histograms as well as histograms representing time series—approaches which do not directly translate to this work. Supposedly it is often straightforward to compute “error bars” by sampling from the underlying data, or the histograms count realizations of an independent random variable, where Poisson noise may be directly assumed. In the context of radiotherapy, however, independence is not given, and the creation of a treatment plan and corresponding uncertainty modeling may be time-critical, raising the need for explicit models.
In our analysis of the one-dimensional artificial geometry, as well as of the patient lung case, the four-parameter beta distribution could improve modeling of the uncertain dose distribution across voxels within both the cancerous and healthy VOIs (more successful in the latter). This distribution, unlike the Gaussian, is capable of capturing the potentially multi-modal and skewed characteristics. However, no distribution was capable of consistently passing a goodness-of-fit test at the
significance level (especially in the artificial case). The ability of the four-parameter beta distribution to describe more complex, skewed, often multi-modal dose distributions is noteworthy, but significant improvements are still needed to appropriately model the dose distributions. Considering that in the literature the only tested distribution for the dose-distributions underlying DVH computation are normal, triangular, or uniform distributions [
13,
30,
31]—and thus, neither skewed nor multi-modal—these results constitute a first step towards a more accurate description of the dose distribution.
As a minor technical remark, the minimum and maximum values for the four-parameter beta distribution were set based on the mean () and standard deviation () of the voxel dose distribution. The selection of these bounds for the distribution’s support was somewhat arbitrary and could be subjected to further scrutiny, yet we found that they aptly represented the simulated dose distributions.
For the patient lung case, to determine the most effective mix of distributions—that is, the combination of probability density functions (PDFs) that minimizes the chi-square test statistic at each voxel—we employed a chi-square goodness-of-fit test. However, this approach has its limitations. For instance, a distribution might exhibit a low chi-square value but still have very non-random residuals, indicating that the distribution, despite a favorable test statistic, may not be an accurate model. In other words, the process of identifying the best mix could be further refined for greater accuracy. Nevertheless, similar to our findings in the simulations, most tested distributions frequently failed the goodness-of-fit test. We suggest that even with a more optimized approach to determining the best mix, the overall impact on the results may not be substantial.
The application of Copulas allowed us to build the required bivariate CDFs from different (continuous) correlated marginal PDFs. Despite Copulas being a highly versatile approach, we found that the Copula models did not outperform the multivariate Gaussian model (or, equivalently, the model of a Gaussian Copula with Gaussian marginal PDFs). Specifically, the best mix of marginal PDFs combined with a Gaussian Copula did not demonstrate any significant advantage over the multivariate Gaussian model in accurately describing the DVH moments. DVHs derived from Gaussian resampling showed a similar pattern to those from the multivariate Gaussian model, with minor discrepancies mainly noted in the calculated standard deviation. The most effective models, exhibiting a bias within of the Monte Carlo mean for the DVH mean values, include the resampled Gaussian, and the Gaussian Copula with Gaussian, gamma, and four-parameter beta marginals, all of which approximate the standard deviation within a range. Notably, the most substantial deviations from the Monte Carlo-derived DVH mean or standard deviation were observed at the prescribed dose level. Additionally, our findings suggest that the Ali–Mikhail–Haq Copula tends to underperform relative to the Gaussian Copula, particularly in terms of underestimating the standard deviation.
In terms of generalization, the investigation of a single case in this work does not allow extrapolation to other cases. With the same irradiation setup and treatment site we may expect similar results, but the effect of changes to beam arrangements, the input uncertainty model, or anatomy cannot easily be predicted.
For example, the correlation between voxel doses is highly dependent on the correlation assumptions on the input uncertainties (i.e., patient displacement, anatomical variations, machine delivery uncertainty) [
10,
17]. In our simulations, we opted to model our uncertainty as patient shifts, which entails perfect correlation of all beamlets constituting a beam for irradiation. Future research may include additional sources of uncertainty, resulting in more complex correlation patterns in the input uncertainty model. Here, our approach may offer some guidance on how to represent the resulting voxel dose PDFs and CDFs and directly propagating them through the DVH calculation. Further, the APM approach allows for computation of higher moments [
14], which would be beneficial to determine the shape of the marginal PDFs or covariances more accurately. However, the increased computational cost required for these calculations is currently the limiting factor prohibiting such calculations.
In an attempt to accurately represent Monte Carlo-sampled distributions of DVH points, we explored the use of the physically motivated beta distributions and compared them with the standard assumption of Gaussian distributions. As expected, the voxel-based Bernoulli experiments tended to be interdependent, which complicates the modeling process and prevents the beta distribution from being an exact fit. Despite these challenges, and the fact that the beta distribution does not consistently clear the bar of a goodness-of-fit test at the significance level, our observations indicate that it provides a better description of the distribution of sampled DVH points compared to a Gaussian model. This suggests that, while not perfect, the beta distribution offers a more nuanced and potentially more accurate representation of DVH-point distributions in our analysis.
Finally, our findings suggest that while the four-parameter beta distribution may be useful to model uncertainty in the dose distribution, their fits have not yet proven accurate enough to be employed in practice. The tested Copula models, which in this work did not outperform the bivariate Gaussian benchmark, still can constitute a straightforward way to build bivariate CDFs under different marginal PDFs, allowing us to employ specific features of the marginal probability densities in implicit modeling. For the future, considering more sources of input uncertainty and other anatomies or even in other scientific field, our work may demonstrate various modeling tools for computation of cumulative histograms from uncertain data. Another advantage, not explored in this work, is that the analytical approach of modeling these uncertainties enables, for example, the computation of derivatives. In the case of radiotherapy treatment planning, this could allow efficient optimization of dose subject to a confidence interval for a DVH point, thus rendering the treatment plan quantifiably robust (compare [
32,
33,
34], who apply such methods using empirical DVH uncertainty estimates).
Consequently, we consider the approach itself valuable for general investigations into how uncertain data can be evaluated using probabilities in cumulative histograms.