A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction

Chien, Chia-Lung; Guo, Beibei; Zhang, Rui

doi:10.3390/jimaging12040166

Open AccessArticle

A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction

by

Chia-Lung Chien

¹,

Beibei Guo

² and

Rui Zhang

^3,4,*

¹

Department of Radiation Oncology, St. Jude Children’s Research Hospital, Memphis, TN 38105, USA

²

Department of Experimental Statistics, Louisiana State University, Baton Rouge, LA 70803, USA

³

Department of Radiation Oncology, Baylor College of Medicine, Houston, TX 77030, USA

⁴

Department of Physics and Astronomy, Louisiana State University, Baton Rouge, LA 70803, USA

^*

Author to whom correspondence should be addressed.

J. Imaging 2026, 12(4), 166; https://doi.org/10.3390/jimaging12040166

Submission received: 5 February 2026 / Revised: 30 March 2026 / Accepted: 2 April 2026 / Published: 10 April 2026

(This article belongs to the Section Medical Imaging)

Download

Browse Figures

Versions Notes

Abstract

Volumetric modulated arc therapy-computed tomography (VMAT-CT), which is the CT reconstructed using the portal images collected during VMAT, can potentially be an effective onsite imaging tool. The goal of this study was to propose an iterative reconstruction algorithm that can further improve the image quality of VMAT-CT and reduce the number of failed reconstructions. An iterative algorithm combining total variation (TV) with block-matching and 3D filtering (BM3D) was proposed, addressing the L1-L2 regularization problem using the split Bregman method. We collected portal images from 67 VMAT cases including 50 phantom and 17 real-patient cases. Both Feldkamp–Davis–Kress (FDK) and TV-BM3D iterative algorithms were used to reconstruct VMAT-CT using the collected images. The preprocessing methods developed by our group previously were also used in this study. A total of 48 out of 50 phantom cases and 15 out of 17 real-patient cases were successfully reconstructed using the iterative algorithm together with image preprocessing. In contrast, 39 phantom cases and 8 patient cases could be reconstructed using the original FDK algorithm, and 44 phantom cases and 11 patient cases could be reconstructed using the FDK algorithm together with preprocessing. Compared with the FDK algorithm, the TV-BM3D iterative algorithm significantly improved the image quality of VMAT-CT at all treatment sites. To the best of our knowledge, this study is the first to develop an iterative VMAT-CT reconstruction algorithm. It can be used to reconstruct CT images locally, and is superior to FDK-based algorithms in terms of the success rate and reconstructed image quality. This strongly supports the use of VMAT-CT as a promising imaging tool for treatment monitoring and adaptive radiotherapy.

Keywords:

volumetric modulated arc therapy; computed tomography; iterative reconstruction; compressed sensing; total variation; block-matching and 3D filtering

1. Introduction

Volumetric modulated arc therapy (VMAT) is a popular rotational radiotherapy (RT) technique due to its faster delivery, increased degree of freedom for dose optimization, and improved dose conformity [1,2]. To detect the intra-fractional anatomical changes without introducing extra imaging dose or cost, Poludniowski et al. [3] proposed to reconstruct megavoltage (MV) computed tomography (CT) using electronic portal imaging device (EPID) images collected during VMAT and named it VMAT-CT. The proposed reconstruction is a three-dimensional (3D) lambda tomography (LT) method based on the Feldkamp–Davis–Kress (FDK) algorithm and lambda filter [3]. However, the poor image quality of VMAT-CT due to data insufficiency, truncation and blurriness hindered its applications in clinic. To improve the image quality of VMAT-CT, our group proposed a new extrapolation scheme that extrapolates along collimator angles instead of horizontal direction to preserve most of the useful information in EPID images [4]. Furthermore, we proposed systematic methods to preprocess EPID images, including online region-based active contouring, multi-leaf collimator (MLC) motion modeling, and outlier filtering, and significantly improved the image quality of VMAT-CT for multiple treatment sites (head and neck, lung, and esophagus) [5].

However, our methods still failed to reconstruct VMAT-CT from certain VMAT plans that had extremely insufficient projection data. The failures resulted from the inherent limitation of the LT algorithm: the reconstruction quality degrades dramatically if the projection sampling is sparse or the projection angle range is less than 180° plus the fan angle [6,7]. Although the LT is non-quantitative in that it does not require an exact and unique reconstruction [8,9], the reconstruct may fail if the sampled angle range for certain voxels is fewer than the lowest acceptable cutoff threshold (1.57 radians or 90°) [3].

Iterative reconstruction algorithms for incomplete projection data have been proposed to overcome the limitations of the LT algorithm. The iterative algorithms can introduce image constraints [10,11,12,13], which can be prior known information or realistic assumptions of the missing data such as positivity of voxel values, bounds of image smoothness and voxel values, so the reconstruction can be protected from unrealistic artifacts and distortions coming from data deficiency.

The concept of compressed sensing (CS) was proposed in 2006 [14]. According to CS theory, a signal can be recovered from fewer samples than the number required by Nyquist sampling theorem if the signal is sparse. Consequently, if an image

f

can be sparsified by operations such as a discrete gradient operator [15,16], the image can be reconstructed from less sampling. Sidky et al. proposed an iterative algorithm for CT reconstruction based on CS theory and incorporated total variation (TV) minimization [17], and later developed the iterative algorithm named adaptive-steepest-descent-projection-onto-convex-sets (ASD-POCS) [18]. Since then, other studies have adopted TV minimization in the iterative reconstruction to solve the problem of insufficient projection data [19,20,21].

TV minimization has an assumption that the pixel values within the structures in a CT image are piecewise constant such that the non-zero signal concentrates on the boundaries of these structures in the TV domain of the CT image [22,23]. Under this assumption, TV minimization can exploit the gradient sparsity by the L1 regularization technique, protect the edges of internal structures within images, and smooth out noises within the anatomical structures, which is suitable for medical images that have a uniform intensity within a structure. However, if the image intensity fluctuates drastically because of complicated structures or large streaking artifacts, TV minimization might introduce staircase artifacts that degrade the image quality and fail to remove the streaking artifacts in the CT reconstruction [24]. Although many revisions of TV minimization have been proposed to solve this problem such as edge-preserving TV [25], anisotropic TV [26], and adaptive-weighted TV [27], they are still not effective for images with poor image contrast and require considerable tuning labors.

The block-matching and 3D filtering (BM3D) method [28,29] is an advanced image-denoising method that can also encourage data sparsity. In this method, 3D stacks, which are similar patches grouped by block-matching, can be sparsified by linear transform such as Fourier transform and hard-thresholding such as L1 regularization [28]. While TV minimization utilizes gradient sparsity in the spatial domain, the BM3D method achieves sparse representation in the transform domain but with the assumption that an anatomical structure recognized by blocking matching would have a similar appearance throughout a medical image. Unlike TV minimization, the BM3D method does not expect an image to have uniform intensity within a structure, thus avoiding the introduction of staircase artifacts when the image intensity fluctuates. Therefore, several groups have proposed iterative algorithms for CT reconstruction using BM3D filters [30,31,32].

The goal of this study was to develop a CS-based iterative algorithm for VMAT-CT reconstruction. This algorithm utilizes both TV minimization and BM3D denoising, and can further improve the image quality of VMAT. The preprocessing methods previously developed by our group [5] were also used in this study to achieve the best results. To the best of our knowledge, this study developed the first iterative VMAT-CT reconstruction algorithm that is superior to FDK-based algorithms in terms of success rate and reconstructed image quality.

2. Materials and Methods

2.1. TV-BM3D Iterative VMAT-CT Reconstruction

The TV of a 3D CT image

f

was defined as the sum of the L1 norms of its discrete gradient in the x, y, and z directions at every voxel:

{‖f‖}_{TV} = {‖\nabla_{x} f‖}_{1} + {‖\nabla_{y} f‖}_{1} + {‖\nabla_{z} f‖}_{1}

(1)

In the traditional algorithm, TV minimization was incorporated into the iterative algorithm to solve the following convex optimization problem:

f^{*} = \min_{f} {{‖f‖}_{TV} + {μ ‖R f - p‖}_{2}^{2}}

(2)

where

R

is the forward projection operator,

p

is the raw projection data, μ is the hyperparameter controlling the weight of the regularization, and

{||f||}_{T V}

is the TV regularization term shown above and represents the sparseness constraint of the CT image. The traditional algorithm treats the optimization as two phases in each iteration: the first phase is to enforce the projection data consistency, which is represented by the fidelity term (

{| | R f - p | |}_{2}^{2})

, using the simultaneous algebraic reconstruction technique (SART) [33] and the non-negativity of the reconstructed CT (

f)

; the second phase is to minimize TV with the adaptive steepest gradient descent algorithm [10].

However, because VMAT-CT reconstruction is performed within a local volume that is much smaller than the field of view of open-field conventional CT or cone beam CT (CBCT), the projection operation

R f

, which represents the Radon transform in discrete form, fails to describe the incomplete Radon transform situation of VMAT-CT. Instead, the projection operator should be modified with the local filtering operator, which would work effectively for projection data truncation [34,35,36]. Therefore, we modified the fidelity term to be

{||{(R f)}_{L} - {(p)}_{L}||}_{2}^{2}

to relate the projection operation of VMAT-CT to the raw truncated EPID images, where L represents the local filtering. Furthermore, we incorporated the BM3D regularization term besides TV regularization into our TV-BM3D iterative algorithm. The final optimization problem could be expressed as

f^{*} = \min_{f} {{‖f‖}_{TV} + {μ ‖(R {f)}_{L} - p_{L}‖}_{2}^{2} + δ B M 3 D (f)}

(3)

where

B M 3 D (f)

is the BM3D regularization term [37,38], δ is the hyperparameter controlling the weight of BM3D regularization and is set to 1, and the EPID image after local filtering operation can be represented as

p_{L} (u, v) = p (u, v) \otimes e_{R} (u)

(4)

where (u, v) is the generic EPID coordinate and

e_{R}

the convolution kernel defined by the local filter [3]. Figure 1 shows the flow chart of the iterative TV-BM3D reconstruction algorithm proposed in this study.

Considering it is tedious to iteratively solve the L2 fidelity term and the two L1 regularization terms of Equation (3), we adopted the split Bregman iteration method and a two-step iteration to transform the difficult L1–L2 problem into a sequence of subproblems and Bregman updates [38,39,40,41,42].

The first step is to minimize the fidelity term

f^{j} = \min_{\tilde{f}} {‖(R {\tilde{f})}_{L} - p_{L}‖}_{2}^{2}

(5)

After the raw EPID data were preprocessed [5], the projection difference between the filtered EPID images and the locally filtered forward projections from the current VMAT-CT (a blank input initially) was calculated as

Δ p_{L} = p_{L} - (R {f)}_{L}

. This projection difference

(Δ p_{L})

was then back-projected to the VMAT-CT domain to generate Δf_L using a modified SART [3,35,36,43], and the VMAT-CT

(f)

(a blank CT initially) was updated accordingly. The modified SART operates only within the localized volume of interest and is expressed as

Modified SART

x_{1} = f^{j}

;

for

i = 1 : N_{θ}

x_{i + 1} = x_{i} + ∆ f_{L}

= x_{i} + I \cdot λ V_{θ_{i}}^{- 1} R_{θ_{i}}^{T} W_{θ_{i}} {(Δ p_{L})}_{θ_{i}} such that {(Δ p_{L})}_{θ_{i}} \in M_{θ_{i}}

;

end

u p d a t e d f^{j} = x_{N_{θ} + 1}

;

Here,

f^{j}

and

u p d a t e d f^{j}

are VMAT-CT before and after the modified SART update, N_θ is the number of gantry angles, R is the forward projection operator, V is the diagonal matrix with nth diagonal element as

V_{n n} = \sum_{m \in M_{β}} |R_{m n}|

, W is the diagonal matrix with mth diagonal element as

W_{m m} = \frac{1}{\sum_{n \in I_{V O I}} |R_{m n}|}

, λ is the hyperparameter for iterations,

I

is the 3D masking function for the reconstruction volume based on planning target volume,

M_{θ}

is the two-dimensional masking function of each EPID image at gantry angle

θ

. We also enforced the non-negativity of each voxel in the VMAT-CT reconstruction after each modified SART update.

The second step is the TV-BM3D denoising based on the split Bregman method. After the first step of minimization of the fidelity term, the optimization problem of Equation (3) can be transformed and written as [38,39,40]

\begin{matrix} \underset{\tilde{f}}{m i n} \{{‖D_{x}‖}_{1} + {‖D_{y}‖}_{1} + {‖D_{z}‖}_{1} + μ {‖\tilde{f} - f^{it}‖}_{2}^{2} + δ |D_{w}|\} \\ such that D_{x} = \nabla_{x} \hat{f}, D_{y} = \nabla_{y} \hat{f}, D_{z} = \nabla_{z} \hat{f}, D_{w} = B M 3 D (\hat{f}) \end{matrix}

(6)

By applying the Bregman iteration with multiple penalty terms, the constrained problem can be fulfilled as

\begin{matrix} {\hat{f}}^{k + 1}, D_{x}^{k + 1}, D_{y}^{k + 1}, & D_{z}^{k}, D_{w}^{k + 1} \\ = \underset{\hat{f}, D_{x}, D_{y}, D_{z}, D_{w}}{m i n} {{‖D_{x}‖}_{1} + {‖D_{y}‖}_{1} + {‖D_{z}‖}_{1} + {μ ‖\hat{f} - f^{j}‖}_{2}^{2} \\ + δ {‖D_{w}‖}_{1} + α {‖D_{x}^{k} - \nabla_{x} \hat{f} - b_{x}^{k}‖}_{2}^{2} + α {‖D_{y}^{k} - \nabla_{y} \hat{f} - b_{y}^{k}‖}_{2}^{2} \\ + α {‖D_{z}^{k} - \nabla_{z} \hat{f} - b_{z}^{k}‖}_{2}^{2} + {β ‖D_{w}^{k} - B M 3 D (\hat{f}) - b_{w}^{k}‖}_{2}^{2}} \end{matrix}

(7)

where

k

represents the kth denoising loop; α, and β are denoising parameters to tune the accuracy of

D_{x}, D_{y}, D_{z},

and

D_{w}

, respectively;

b_{i}^{k}

is given by the split Bregman iteration.

The split Bregman method could solve the pluralistic problem by successively minimizing L1 and L2 components with respect to

\hat{f}

,

D_{x}, D_{y}, D_{z}, and D_{w}

:

\begin{matrix} {\hat{f}}^{k + 1} = \underset{\hat{f}}{m i n} {{μ ‖\hat{f} - f^{j}‖}_{2}^{2} + α {‖D_{x}^{k} - \nabla_{x} \hat{f} - b_{x}^{k}‖}_{2}^{2} + α {‖D_{y}^{k} - \nabla_{y} \hat{f} - b_{y}^{k}‖}_{2}^{2} \\ + α {‖D_{z}^{k} - \nabla_{z} \hat{f} - b_{z}^{k}‖}_{2}^{2} + {β ‖D_{w}^{k} - BM 3 D (\hat{f}) - b_{w}^{k}‖}_{2}^{2}} \end{matrix}

(8)

and

\{\begin{array}{l} {D_{x}^{k + 1} = \underset{D_{x}}{m i n} ‖D_{x}‖}_{1} + α {‖D_{x}^{k} - \nabla_{x} {\hat{f}}^{k + 1} - b_{x}^{k}‖}_{2}^{2} \\ \begin{array}{l} {D_{y}^{k + 1} = \underset{D_{y}}{m i n} ‖D_{y}‖}_{1} + α {‖D_{y}^{k} - \nabla_{y} {\hat{f}}^{k + 1} - b_{y}^{k}‖}_{2}^{2} \\ \begin{array}{l} {D_{z}^{k + 1} = \underset{D_{z}}{m i n} ‖D_{z}‖}_{1} + α {‖D_{z}^{k} - \nabla_{z} {\hat{f}}^{k + 1} - b_{z}^{k}‖}_{2}^{2} \\ {D_{w}^{k + 1} = \underset{D_{w}}{m i n} ‖D_{w}‖}_{1} + {β ‖D_{w}^{k} - BM 3 D ({\hat{f}}^{k + 1}) - b_{w}^{k}‖}_{2}^{2} \end{array} \end{array} \end{array}

(9)

where the values of

b_{x}^{k}, b_{y}^{k}, b_{z}^{k}, {and b}_{w}^{k}

can be solved as

\{\begin{array}{l} b_{x}^{k + 1} = b_{x}^{k} + \nabla_{x} {\hat{f}}^{k + 1} - D_{x}^{k + 1} \\ \begin{array}{l} b_{y}^{k + 1} = b_{y}^{k} + \nabla_{y} {\hat{f}}^{k + 1} - D_{y}^{k + 1} \\ \begin{array}{l} b_{z}^{k + 1} = b_{z}^{k} + \nabla_{z} {\hat{f}}^{k + 1} - D_{z}^{k + 1} \\ b_{w}^{k + 1} = b_{w}^{k} + B M 3 D ({\hat{f}}^{k + 1}) - D_{w}^{k + 1} \end{array} \end{array} \end{array}

(10)

when the variables

{\hat{f}}^{k + 1}, D_{x}^{k + 1}, D_{y}^{k + 1}, D_{z}^{k + 1}, D_{w}^{k + 1}

are fixed.

Since

{\hat{f}}^{k}

is decoupled from the L1 components of the problem, the solution of

{\hat{f}}^{k}

could be achieved by Fourier transform method and expressed as [39]

{\hat{f}}^{k + 1} = ifft 2 \{\frac{fft 2 (μ f^{j} + α (\nabla_{x}^{T} (D_{x}^{k} - b_{x}^{k}) + \nabla_{y}^{T} (D_{y}^{k} - b_{y}^{k}) + \nabla_{z}^{T} (D_{z}^{k} - b_{z}^{k})) + β (D_{w}^{k} - b_{w}^{k}))}{fft 2 (μ + α ∆ + β)}\}

(11)

where fft2 and ifft2 represent 2D Fourier transform and inverse Fourier transform; Δ is the Laplace operator;

\nabla_{x}^{T}, \nabla_{y}^{T}, and \nabla_{z}^{T}

are the transpose gradient operators [38].

Also, the solutions of Equation (9) are given using the shrinkage operator:

\{\begin{array}{l} D_{x}^{k + 1} = shrink (\nabla_{x} {\hat{f}}^{k + 1} + b_{x}^{k}, \frac{1}{α}) \\ D_{y}^{k + 1} = shrink (\nabla_{y} {\hat{f}}^{k + 1} + b_{y}^{k}, \frac{1}{α}) \\ D_{z}^{k + 1} = shrink (\nabla_{z} {\hat{f}}^{k + 1} + b_{z}^{k}, \frac{1}{α}) \\ D_{w}^{k + 1} = shrink (BM 3 D ({\hat{f}}^{k + 1}) + b_{w}^{k}, \frac{1}{β}) \end{array}

(12)

where the shrink function is defined as

shrink (x, σ) = \{\begin{matrix} x - σ, x \in (σ, \infty) \\ \begin{matrix} 0, x \in (- σ, σ) \\ x + σ, x \in (- \infty, - σ) \end{matrix} \end{matrix}

(13)

for

σ \geq 0

.

We set the number of split Bregman denoising loops as 10. After the reconstructed VMAT-CT

(f)

was denoised by the TV-BM3D, it would be checked with the stopping criteria: if the iteration reached the maximum iteration number (

N_{stop}

), or if the square difference of reconstructions between two successive iterations was below a predetermined threshold. We defined a normalized update parameter

r^{j}

as the quantitative value for stable stopping execution:

r^{j} = \frac{‖f^{j + 1} - f^{j}‖}{‖f^{j = 1}‖}, f o r 1 < j < N_{s t o p}

(14)

We set the maximum iteration number

N_{stop}

as 20 and the threshold value for the normalized image update parameter

r^{j}

as 0.005 based on our trials such that the change between two successive iterations becomes inappreciable. Finally, if none of the stopping criteria were met, VMAT-CT

f^{j}

would be sent back for the local-filtered Randon transform

{(R f^{j})}_{L}

of the next iteration loop.

In summary, the TV-BM3D algorithm can be described as follows (justification for the chosen parameter values and their impact on the performance of the framework can be found in Supplementary Materials):

TV-BM3D Algorithm

Set the values of parameters: $μ = 2, δ = 1, α = 1, β = 0.3, r = 0.005,$ $N_{s t o p} = 20$ .

Preprocess raw EPID images p to obtain processed images $p_{L}$ .

Initialization: Blank VMAT-CT input $f^{0}$ ; blank forward project input (R $f^{0}$ )_L.

Main iteration (j = 0, 1, 2, …).

1. Modified SART update.

1.1. Compute

{∆ p}_{L}

and update

f^{j}

using Equation (5).

1.2. Enforce non-negative constraint.

2. TV-BM3D denoising (split Bregman loop).

Initialization:

D_{x}^{0} = D_{y}^{0} = D_{z}^{0} = D_{w}^{0} = b_{x}^{0} = b_{y}^{0} = b_{z}^{0} = b_{w}^{0} = 0;

f⁰ = f^j.

For k = 0, 1, 2, … until convergence:

2.1. Update Bregman variables

D_{x}^{k + 1}, D_{y}^{k + 1}, D_{z}^{k + 1} {, D}_{w}^{k + 1}

using Equation (12);

2.2. Update Bregman variables

b_{x}^{k + 1}, b_{y}^{k + 1}, b_{z}^{k + 1} {, b}_{w}^{k + 1}

using Equation (10);

2.3. Update

f^{k + 1}

with the variables

D_{x}^{k}, D_{y}^{k}, D_{z}^{k} {, D}_{w}^{k} a n d b_{x}^{k}, b_{y}^{k}, b_{z}^{k} {, b}_{w}^{k}

using Equation (11);

End

f^{j + 1} = f^{k + 1}

.

3. Stopping criterion.

if

r^{j} = \frac{‖f^{j + 1} - f^{j}‖}{‖f^{1}‖}

< r, or

j \geq N_{s t o p}

.

4. Prepare for next iteration.

Compute local-filtered Randon transform

{(R f^{j + 1})}_{L}

, and return to step 1.

To accelerate the computation, we implemented the GPU-accelerated CUDA code of the forward and backward projection operators from the TIGRE toolbox version 3.1 [44] and the MEX code for the BM3D operator from the BM3D MATLAB package version 2.01 [28,45]. The computations were performed on a Dell workstation (Dell Technologies Inc., Round Rock, TX, USA) featuring an Intel Core i9-12900K 3.2 GHz CPU (Intel Corporation, Santa Clara, CA, USA), 128 GB of RAM, and a NVIDIA RTX A6000 GPU (NVIDIA Corporation, Santa Clara, CA, USA).

2.2. Image Quality (IQ) Analysis

We used the contrast-to-noise ratio (CNR) and structural similarity index measure (SSIM) to quantitatively evaluate the image quality of VMAT-CT.

CNR is defined as [37]

CNR = \frac{|{\bar{x}}_{V O I} - {\bar{x}}_{r e f}|}{σ_{r e f}}

(15)

where

{\bar{x}}_{V O I}

is the mean voxel value within volume of interest (VOI) in VMAT-CT,

σ_{r e f}

is the standard deviation of the voxel value within the reference volume, and

{\bar{x}}_{r e f}

is the mean voxel value in the reference volume. The reference volume was drawn as a box in the soft tissue area, and the VOI was drawn as a box in the air cavity or bony area if no air cavity was available. Both volumes were approximately 1 cm³. In this study, VMAT-CT reconstruction was considered successful if CNR was higher than 2.

SSIM is defined as [46]

S S I M = \frac{(2 μ_{i} μ_{pCT} + C_{1}) (2 σ_{i} σ_{pCT} + C_{2})}{(μ_{i}^{2} + μ_{pCT}^{2} + C_{1}) (σ_{i}^{2} + σ_{pCT}^{2} + C_{2})}

(16)

where μ_i and μ_pCT represent the local mean pixel values of VMAT-CT and reference CT respectively, σ_i and σ_pCT denote the local standard deviation of VMAT-CT and reference CT respectively, and C₁ and C₂ are regularization constants to prevent numerical instability in scenarios where μ or σ values approach zero, ensuring robust metric computation.

As we discussed in our previous study, VMAT-CT is applicable to cancer sites with sufficient density differences around the target region. For the phantom study, we used the same 50 cases based on clinical VMAT plans for multiple treatment sites (left lung (LL), right lung (RL), esophagus (ESO), and head and neck (H&N)) delivered to the Rando Chest phantom (LL, RL, ESO) or the Rando Head phantom (H&N), as explained in our previous study [5]. We also acquired 17 real-patient cases with treatment sites in the thoracic regions (RL, LL, ESO). All VMAT plans had two coplanar 6 MV arcs and were delivered using an Elekta Versa HD linac (Elekta Oncology Systems, Crawley, UK). The arc range, which is the angular span of gantry rotation—defined by the start and stop gantry angles—over which radiation is delivered continuously during a VMAT arc, and numbers of EPID images for all cases are listed in Table 1. All EPID data were recorded using the Elekta iVew system at 4 frames/second. The EPID panel has 0.8 mm × 0.8 mm pixel size and is at 160 cm source to detector distance. The number of reconstructions matches the number of cases, with each VMAT-CT dataset reconstructed into a 3D volume measuring 270 × 270 × 263 mm³ and an isotropic voxel size of 1 mm³

We used R programing language for the one-way ANOVA with split-plot design to analyze if the difference in CNR or SSIM is significant (p < 0.05) among VMAT-CTs reconstructed with the original method [4] (FDK-based algorithm; EPID images were processed with constant extrapolation, uniform edge erosion, and collimator angle correction) denoted as “FDK”, reconstructed with the FDK algorithm together with the systematic preprocessing methods developed by our group [5] denoted as “FDK + preprocessing”, and reconstructed with the TV-BM3D iterative algorithm together with EPID preprocessing denoted as “iterative + preprocessing”. More specifically, we compared FDK with FDK + preprocessing, FDK with iterative + preprocessing, and FDK + preprocessing with iterative + preprocessing. We used the pairwise post hoc Tukey test sequentially when ANOVA showed that the difference was significant.

3. Results

Figure 2 shows pretreatment CBCT and VMAT-CT images of three phantom cases. The red contour overlaid on each CBCT image corresponds to the prescription isodose line from the planning CT, transferred via rigid registration between the planning CT and CBCT. VMAT-CT images reconstructed with FDK are degraded by lots of artifacts and certain anatomy features in them are not recognizable. VMAT-CT images reconstructed with FDK + preprocessing have improved image quality, but the structures in the VMAT-CT are still not fully distinguishable because of artifacts and distortions. VMAT-CT images reconstructed with iterative + preprocessing have recognizable structures and the fewest artifacts.

Similarly, Figure 3 shows five real-patient cases. VMAT-CT images reconstructed with FDK suffer from significant streaking artifacts. VMAT-CT images reconstructed with FDK + preprocessing have limited improvements. The VMAT-CT images reconstructed with iterative + preprocessing have further improved image quality and discernable anatomy structures.

In both Figure 2 and Figure 3, some reconstructions were severely affected by the angular data insufficiency and have black holes in them, which could not be resolved with the iterative algorithm. Additional challenging cases are provided in the supplementary material, further illustrating that our iterative algorithm successfully reconstructs VMAT-CT while the FDK algorithms could not.

Figure 4 and Figure 5 present box-and-whisker plots (boxplots) of CNR and SSIM metrics for the phantom study. The central line within each box represents the median, while the lower and upper boundaries of the box correspond to the first (Q1) and third (Q3) quartiles, respectively, defining the interquartile range (IQR) that contains the middle 50% of the observations. The whiskers extend to the most extreme values within 1.5 × IQR from the quartiles, and values beyond this range are plotted as outliers in circles. These plots demonstrate how reconstruction algorithms influence VMAT-CT image quality, especially the effect of the iterative algorithm.

The post hoc Tukey tests (Table 2 and Table 3) show that combining the preprocessing method with the iterative algorithm produced statistically significant enhancements for both CNR (p < 0.0001) and SSIM (p < 0.0001).

Figure 6 and Figure 7 display boxplots of CNR and SSIM for real-patient cases. Both CNR and SSIM exhibit statistically significant differences (p < 0.0001) when different reconstruction algorithms were used.

The post hoc Tukey test (Table 4) further demonstrates that the preprocessing method and iterative algorithm each significantly enhance VMAT-CT image quality in patient cases.

4. Discussion

The concept of VMAT-CT was proposed a decade ago but did not gain popularity due to multiple limitations and technical challenges. Because the daily portal images during VMAT are highly blurred due to beam modulation, and commercial software cannot be used to reconstruct CT based on these images, most clinics in the US do not collect or utilize these images to our knowledge. A huge amount of image data that does not require any additional hardware, beam time or imaging dose could have been used for treatment monitoring and dose tracking purposes. There are some studies that investigated prostate localization during VMAT based on fiducial markers and portal images collected during VMAT [47,48], but this type of tracking cannot reveal patient anatomy or dose information.

In this study, we adopted the concept of CS theory, introduced TV and BM3D as the regularization constraints, and developed a TV-BM3D iterative reconstruction algorithm to improve the image quality of VMAT-CT. We succeeded in reconstructing 48 out of 50 phantom cases and 15 out of 17 patient cases using iterative + preprocessing. In contrast, only 39 phantom cases and eight patient cases could be reconstructed with FDK, and 44 phantom cases and 11 patient cases could be reconstructed with FDK + preprocessing. All phantom and patient cases show improvements in the image quality using the TV-BM3D iterative reconstruction algorithm. Our iterative algorithm can remove the irregular artifacts due to insufficient projection data and show the hidden structures in VMAT-CT that could not be revealed by the FDK-based algorithm.

The BM3D denoising algorithm characterizes pattern searching by extracting similar blocks within an image and grouping them into a few templates. With the collaborative filtering to enhance the similarity between blocks in each template, BM3D can reconstruct structures such as bones based on the assumption that these structures feature similar appearance in a medical image. On the other hand, TV minimization assumes that the voxel values within a structure in a CT image are nearly the same. Therefore, BM3D and TV exploit data sparsity with different assumptions, and both provide constraints for the iterative algorithm to solve the sparse data problem in CT reconstruction, making the TV-BM3D iterative algorithm more effective than TV minimization or BM3D alone. The proper choice of block size and noise level in the BM3D method is crucial for the denoising performance, and extra tuning efforts are required to balance the denoising power of TV and BM3D regularizations.

Reconstruction from incomplete projection data, such as limited-angle or truncated field-of-view CT, is an ill-posed problem in which analytical algorithms like the FDK often produce strong streak artifacts and noise amplification due to violation of the full-sampling assumption. Iterative reconstruction methods incorporating sparsity constraints have therefore become the state-of-the-art for sparse or truncated CT data. In particular, TV regularization has been widely used to suppress streak artifacts and stabilize reconstruction from limited projections, although it may introduce over-smoothing and loss of fine anatomical details, particularly under severe data incompleteness [18]. Several studies have proposed improved regularization models to overcome these limitations. For example, compressed-sensing-based CT reconstruction frameworks demonstrated that sparse regularization could significantly improve image quality with reduced projection data, forming the theoretical foundation for many modern iterative CT reconstruction algorithms. Subsequent developments introduced adaptive or relative TV models to better preserve edges and textures under limited-angle acquisition. More advanced methods incorporate additional priors such as non-local patch similarity or prior-image constraints to better preserve structural information [49]. For example, the prior-image-constrained compressed sensing framework and non-local regularization approaches have demonstrated improved reconstruction accuracy for undersampled CT data [50].

Building on these developments, the proposed algorithm integrates local sparsity constraints (TV) with non-local self-similarity priors (BM3D) within a unified optimization framework solved using the Split Bregman method [28,39]. This hybrid regularization strategy improves both noise suppression and structural preservation compared with analytical reconstruction and TV-only methods. Quantitatively, the proposed method achieved CNR values ranging from 3.61 to 19.57 (mean ≈ 9.3) and SSIM values ranging from 0.087 to 0.782 (mean ≈ 0.27) across all 64 cases. While the mean SSIM appears lower than some reports in the literature, meaningful comparison requires careful consideration of acquisition conditions, reference definitions, and task difficulty, particularly in sparse-view and limited-angle CT reconstruction.

TV-based reconstruction remains a standard baseline for sparse-view and limited-angle CT. Conventional TV-based baselines have demonstrated SSIM values of approximately 0.812–0.960 and CNR values of 1.97–7.27 under sparse-view conditions (30–90 projections) on the Shepp–Logan phantom. More advanced TV-based variants, such as reinforced TV (rTV), have pushed performance further with SSIM up to 0.984 and CNR up to 14.26 under 90-projection sparse-view acquisition [51]. In more demanding limited-angle configurations, single-energy TV regularization has yielded SSIM ≈ 0.88 and CNR ≈ 2.8 on anthropomorphic phantoms [52]. More recent work, such as Xi et al. [53], reports SSIM values of approximately 0.85–0.93 for standard TV and up to ~0.90–0.97 for advanced high-order TV formulations, when evaluated against matched full-view reference images under moderate sparse-view conditions. However, these high SSIM values are largely attributable to matched-reference evaluation and moderate undersampling regimes. Importantly, TV-based methods inherently impose piecewise-constant assumptions, which suppress noise but also attenuate low-contrast features and fine textures, leading to moderate CNR improvement but reduced structural fidelity, particularly in highly undersampled scenarios.

Recent studies have explored deep learning-based reconstruction or sinogram completion for limited-angle CT. Across deep learning-based methods, SSIM values are typically reported in the range of ~0.80–0.95 under moderate sparse-view or low-dose conditions, depending on the similarity between training and testing distributions [54]. However, these methods are typically evaluated on datasets with consistent geometries, full-angular sampling ranges, and reference reconstructions from dense-view filtered back-projection. Moreover, they often rely on large, well-matched training datasets, supervised learning with high-quality ground truth, and limited generalizability across imaging systems or treatment sites [55]. In contrast, the proposed approach operates directly on measured portal images without the need for training data, supporting the feasibility of VMAT-CT as a practical in-treatment imaging modality for treatment monitoring and adaptive radiotherapy.

What fundamentally distinguishes our study from the vast majority of the sparse-view and limited-angle CT reconstruction literature is the nature of the raw data. Nearly all benchmark studies—including those employing TV-based methods, advanced TV variants, and deep learning approaches—evaluate performance on kV CT datasets acquired under idealized or controlled conditions: well-calibrated geometries, consistent photon flux, and relatively predictable noise characteristics. In these studies, the primary ill-posedness stems solely from angular undersampling or restricted scan ranges, with the underlying projection data remaining otherwise coherent and physically well-behaved. In stark contrast, our data originate from MV portal imaging, which introduces a cascade of compounded degradations absent from conventional benchmarks: inherently poor image quality due to high-energy photon physics, severe irregularity from MLC modulation that creates highly non-uniform fluence patterns, substantial blurring from both MLC motion during delivery and scatter-dominated signals, and extreme angular incompleteness far beyond typical limited-angle scenarios. Consequently, where standard sparse-view studies address reconstruction under ideal acquisition models with controlled subsampling, we confront a regime in which the forward model itself is corrupted by time-varying modulation, mechanical motion, and physical degradations that violate nearly all conventional assumptions. This places our reconstruction task in a fundamentally more challenging class of problems, rendering direct quantitative comparisons of CNR and SSIM across studies inherently inequitable without careful contextualization of the underlying data fidelity and acquisition physics. That being said, the proposed method demonstrates substantial performance gains, with multiple cases achieving CNR > 14–19, exceeding the typical upper range reported for conventional iterative reconstruction. Similarly, the upper range of SSIM values (0.4–0.8) approaches or surpasses those observed in early deep learning-based reconstruction frameworks. To our knowledge, this work represents the first demonstration of an iterative reconstruction framework specifically designed for VMAT-CT, and it highlights the potential of advanced regularization methods to overcome the severe data incompleteness inherent in treatment-time imaging.

There are several limitations of this study. First, the stopping threshold value (0.005) is determined by our trials of VMAT-CT reconstruction. However, the convergence is affected by the strengths of BM3D denoising which is tuned by the noise levels in BM3D, as well as TV minimization which is tuned by steepest-descent step size and the number of steps within the inner-iteration. If the regularizations of BM3D denoising and TV minimization are adjusted unbalanced, the VMAT-CT may be overly smoothed at each iteration step such that the updated parameter

r^{j}

, which represents the change between successive VMAT-CT iterations, will be too large to converge. Second, some failed cases of VMAT-CT remain unsolved even with the iterative algorithm. For VMAT-CT with extremely poor quality, tuning the proper block size and finding the patterns for BM3D could be challenging and correct templates could not be represented by blocks. One feasible approach is that the regularization models in our algorithm could be decomposed and replaced by a convolutional neural network (CNN) such that the limitations of tuning regularization parameters of TV and BM3D can be relieved. For example, some groups introduced deep CNN into the alternating direction method of multipliers (ADMMs) iterative reconstruction algorithm as the regularization term to solve the distorted limited-angle CT images, and found better image recovery than the iterative algorithms with TV regularization [56,57]. Finally, the speed of the iterative algorithm is relatively slower. Table 5 shows the overall computational time of the whole 3D VMAT-CT reconstruction. Compared with FDK and FDK + preprocessing, iterative + preprocessing takes the longest time ranging between 6 and 10 min because of the varying convergence speed for different cases of VMAT-CT. The computational bottleneck of the iterative reconstruction is the BM3D denoising, which involves computationally demanding processes such as block-matching, grouping, and aggregation. There are several studies in the literature about GPU-based BM3D denoising, but they are limited to applications to a 2D image and require memory organization and thread cooperation for data exchange [58]. Future work on 3D GPU-based BM3D denoising in MATLAB could further accelerate the TV-BM3D iterative algorithm.

In the future, the framework of our TV-BM3D iterative algorithm could be revised to have faster convergence and require less tuning. Instead of optimizing in two alternative phases, the optimization problem may be solved using Barzilai–Borwein formulation in a single phase [7]. Because the tuning parameters, including regularization weighting factor, block size and noise levels of BM3D, step size and iteration number of TV minimization, are affected by some characteristics of VMAT plans such as MLC modulation complexity score [59], small aperture score for the aperture size [60], and the inherit CT contrasts at the locations of treatment sites [61], we can reduce the tuning labors in the clinical workflow by pre-setting these parameters as specific protocols for each treatment site, which is similar to the kV-CBCT protocols used in the clinic.

5. Conclusions

A TV-BM3D iterative reconstruction algorithm was proposed for VMAT-CT reconstruction. This algorithm significantly outperformed traditional methods: while at most 44/50 phantom cases and 11/17 real-patient cases could be reconstructed with the FDK algorithms, the proposed iterative method successfully reconstructed 48/50 phantom cases and 15/17 real-patient cases. This represents a substantial improvement in reconstruction success rate. Moreover, our algorithm significantly enhances the image quality of VMAT-CT across all treatment sites. To our knowledge, this is the first iterative reconstruction algorithm developed specifically for VMAT-CT. This study, together with our previous work [4,5,62], strongly supports VMAT-CT as a promising 3D and four-dimensional (4D) imaging tool for treatment monitoring and adaptive RT.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/jimaging12040166/s1, Figure S1: Our framework demonstrates successful reconstructions of VMAT-CT for challenging cases, in contrast to FDK algorithm that cannot perform such reconstructions. (First column) Pretreatment CBCT overlaid by the prescription isodose lines (red); (second column) VMAT-CT reconstructed with FDK; (third column) VMAT-CT reconstructed with FDK + preprocessing; (fourth column) VMAT-CT reconstructed with iterative + preprocessing. Refs. [63,64,65,66,67,68] are cited in Supplementary Materials.

Author Contributions

Conceptualization, R.Z.; Methodology, C.-L.C. and R.Z.; Data analysis and investigation, C.-L.C., R.Z. and B.G.; Supervision, R.Z.; Writing—original draft, C.-L.C. and R.Z.; Reviews and editing, C.-L.C., R.Z. and B.G. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the American Cancer Society (RSG-22-030-01-CTPS), Louisiana Board of Regents Proof-of-Concept/Prototyping Initiative Fund (LEQSF(2022-23)-RD-D-03), Louisiana State University LIFT² Fund (LSU-2022-LIFT-004), Louisiana State University Faculty Research Grant, and Kenneth R. Hogstrom Superior Graduate Student Scholarship.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Louisiana State University (protocol code 4035 and date of approval 17 April 2018).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The original contributions presented in this study are included in the article and Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Popple, R.A.; Balter, P.A.; Orton, C.G. Point/Counterpoint. Because of the advantages of rotational techniques, conventional IMRT will soon become obsolete. Med. Phys. 2014, 41, 100601. [Google Scholar] [CrossRef]
Teoh, M.; Clark, C.H.; Wood, K.; Whitaker, S.; Nisbet, A. Volumetric modulated arc therapy: A review of current literature and clinical use in practice. Br. J. Radiol. 2011, 84, 967–996. [Google Scholar] [CrossRef]
Poludniowski, G.; Thomas, M.D.; Evans, P.M.; Webb, S. CT reconstruction from portal images acquired during volumetric-modulated arc therapy. Phys. Med. Biol. 2010, 55, 5635–5651. [Google Scholar] [CrossRef]
Zhao, X.; Zhang, R. Feasibility of 3D tracking and adaptation of VMAT based on VMAT-CT. Radiother. Oncol. 2020, 149, 18–24. [Google Scholar] [CrossRef]
Chien, C.L.; Zhao, X.; Guo, B.; Zhang, R. Technical note: Preprocessing of portal images to improve image quality of VMAT-CT. Med. Phys. 2024, 51, 2119–2127. [Google Scholar] [CrossRef]
Noo, F.; Defrise, M.; Clackdoyle, R.; Kudo, H. Image reconstruction from fan-beam projections on less than a short scan. Phys. Med. Biol. 2002, 47, 2525. [Google Scholar] [CrossRef] [PubMed]
Park, J.C.; Song, B.; Kim, J.S.; Park, S.H.; Kim, H.K.; Liu, Z.; Suh, T.S.; Song, W.Y. Fast compressed sensing-based CBCT reconstruction using Barzilai-Borwein formulation for application to on-line IGRT. Med. Phys. 2012, 39, 1207–1217. [Google Scholar] [CrossRef]
Wang, G.; Yu, H. Can interior tomography outperform lambda tomography? Proc. Natl. Acad. Sci. USA 2010, 107, E92–E93. [Google Scholar] [CrossRef] [PubMed]
Quinto, E.T.; Ozan, O.; Skoglund, U. Reply to Wang and Yu: Both electron lambda tomography and interior tomography have their uses. Proc. Natl. Acad. Sci. USA 2010, 107, E94–E95. [Google Scholar] [CrossRef]
Sidky, E.Y.; Kao, C.-M.; Pan, X. Effect of the data constraint on few-view, fan-beam CT image reconstruction by TV minimization. In Proceedings of the 2006 IEEE Nuclear Science Symposium Conference Record, San Diego, CA, USA, 29 October–4 November 2006; pp. 2296–2298. [Google Scholar]
Li, B.; Deng, J.; Lonn, A.H.; Hsieh, J. An enhanced reconstruction algorithm to extend CT scan field-of-view with z-axis consistency constraint. Med. Phys. 2012, 39, 6028–6034. [Google Scholar] [CrossRef]
Xu, Q.; Mou, X. Interior tomography using the truncated Hilbert transform with the total variation constraint. In Proceedings of the 2013 6th International Conference on Biomedical Engineering and Informatics, Hangzhou, China, 16–18 December 2013; pp. 48–52. [Google Scholar]
Frikel, J.; Haltmeier, M. Efficient regularization with wavelet sparsity constraints in photoacoustic tomography. Inverse Probl. 2018, 34, 024006. [Google Scholar] [CrossRef]
Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
Krahmer, F.; Kruschel, C.; Sandbichler, M. Total variation minimization in compressed sensing. In Compressed Sensing and its Applications; Springer: Berlin/Heidelberg, Germany, 2017; pp. 333–358. [Google Scholar][Green Version]
Figueiredo, M.A.T.; Nowak, R.D.; Wright, S.J. Gradient Projection for Sparse Reconstruction: Application to Compressed Sensing and Other Inverse Problems. IEEE J. Sel. Top. Signal Process. 2007, 1, 586–597. [Google Scholar] [CrossRef]
Sidky, E.Y.; Kao, C.-M.; Pan, X. Accurate image reconstruction from few-views and limited-angle data in divergent-beam CT. J. X-Ray Sci. Technol. 2006, 14, 119–139. [Google Scholar] [CrossRef]
Sidky, E.Y.; Pan, X. Image reconstruction in circular cone-beam computed tomography by constrained, total-variation minimization. Phys. Med. Biol. 2008, 53, 4777–4807. [Google Scholar] [CrossRef] [PubMed]
Kudo, H.; Suzuki, T.; Rashed, E.A. Image reconstruction for sparse-view CT and interior CT-introduction to compressed sensing and differentiated backprojection. Quant. Imaging Med. Surg. 2013, 3, 147–161. [Google Scholar] [CrossRef]
Je, U.K.; Cho, H.M.; Cho, H.S.; Park, Y.O.; Park, C.K.; Lim, H.W.; Kim, K.S.; Kim, G.A.; Park, S.Y.; Woo, T.H.; et al. Feasibility study for application of the compressed-sensing framework to interior computed tomography (ICT) for low-dose, high-accurate dental x-ray imaging. Radiat. Phys. Chem. 2016, 119, 272–278. [Google Scholar] [CrossRef]
Matsutomo, N.; Fukaya, K.; Hashimoto, T.; Yamamoto, T.; Sato, E. Performance of compressed sensing-based iterative reconstruction for single-photon emission computed tomography from undersampled projection data: A simulation study in 123I-N-omega-fluoropropyl-2beta-carbomethoxy-3beta-(4-iodophenyl)nortropane imaging. Nucl. Med. Commun. 2019, 40, 106–114. [Google Scholar] [CrossRef]
Ward, J.P.; Lee, M.; Ye, J.C.; Unser, M. Interior Tomography Using 1D Generalized Total Variation. Part I: Mathematical Foundation. SIAM J. Imaging Sci. 2015, 8, 226–247. [Google Scholar] [CrossRef]
Lee, M.; Han, Y.; Ward, J.P.; Unser, M.; Ye, J.C. Interior tomography using 1D generalized total variation. Part II: Multiscale implementation. SIAM J. Imaging Sci. 2015, 8, 2452–2486. [Google Scholar] [CrossRef][Green Version]
Zeng, G.L. On few-view tomography and staircase artifacts. IEEE Trans. Nucl. Sci. 2015, 62, 851–858. [Google Scholar] [CrossRef]
Tian, Z.; Jia, X.; Yuan, K.; Pan, T.; Jiang, S.B. Low-dose CT reconstruction via edge-preserving total variation regularization. Phys. Med. Biol. 2011, 56, 5949–5967. [Google Scholar] [CrossRef]
Chen, Z.; Jin, X.; Li, L.; Wang, G. A limited-angle CT reconstruction method based on anisotropic TV minimization. Phys. Med. Biol. 2013, 58, 2119–2141. [Google Scholar] [CrossRef]
Liu, Y.; Liang, Z.; Ma, J.; Lu, H.; Wang, K.; Zhang, H.; Moore, W. Total variation-stokes strategy for sparse-view X-ray CT image reconstruction. IEEE Trans. Med. Imaging 2013, 33, 749–763. [Google Scholar]
Dabov, K.; Foi, A.; Katkovnik, V.; Egiazarian, K. Image denoising by sparse 3-D transform-domain collaborative filtering. IEEE Trans. Image Process. 2007, 16, 2080–2095. [Google Scholar] [CrossRef]
Dabov, K.; Foi, A.; Katkovnik, V.; Egiazarian, K. Image denoising with block-matching and 3D filtering. In Proceedings of the Image Processing: Algorithms and Systems, Neural Networks, and Machine Learning, San Jose, CA, USA, 16–18 January 2006; p. 606414. [Google Scholar]
Li, X.; Chen, Z.; Xing, Y. Multi-segment limited-angle CT reconstruction via a BM3D filter. In Proceedings of the 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC), Anaheim, CA, USA, 29 October–3 November 2012; pp. 2390–2394. [Google Scholar]
Lyu, Q.; Yang, C.; Gao, H.; Xue, Y.; O’Connor, D.; Niu, T.; Sheng, K. Technical Note: Iterative megavoltage CT (MVCT) reconstruction using block-matching 3D-transform (BM3D) regularization. Med. Phys. 2018, 45, 2603–2610. [Google Scholar] [CrossRef]
Chen, L.; Gou, S.; Yao, Y.; Bai, J.; Jiao, L.; Sheng, K. Denoising of low dose CT image with context-based BM3D. In Proceedings of the 2016 IEEE Region 10 Conference (TENCON), Marina Bay Sands, Singapore, 22–25 November 2016; pp. 682–685. [Google Scholar]
Andersen, A.H.; Kak, A.C. Simultaneous algebraic reconstruction technique (SART): A superior implementation of the art algorithm. Ultrason. Imaging 1984, 6, 81–94. [Google Scholar] [CrossRef] [PubMed]
Dennerlein, F.; Maier, A. Approximate truncation robust computed tomography—ATRACT. Phys. Med. Biol. 2013, 58, 6133. [Google Scholar] [CrossRef]
Sidky, E.Y.; Kraemer, D.N.; Roth, E.G.; Ullberg, C.; Reiser, I.S.; Pan, X. Analysis of iterative region-of-interest image reconstruction for x-ray computed tomography. J. Med. Imaging 2014, 1, 031007. [Google Scholar] [CrossRef] [PubMed][Green Version]
Zhang, H.; Li, L.; Yan, B.; Wang, L.; Cai, A.; Hu, G. A two-step filtering-based iterative image reconstruction method for interior tomography. J. X-Ray Sci. Technol. 2016, 24, 733–747. [Google Scholar] [CrossRef] [PubMed]
Sheng, K.; Gou, S.; Wu, J.; Qi, S.X. Denoised and texture enhanced MVCT to improve soft tissue conspicuity. Med. Phys. 2014, 41, 101916. [Google Scholar] [CrossRef] [PubMed]
Huang, S.; Tang, C.; Xu, M.; Qiu, Y.; Lei, Z. BM3D-based total variation algorithm for speckle removal with structure-preserving in OCT images. Appl. Opt. 2019, 58, 6233–6243. [Google Scholar] [CrossRef] [PubMed]
Goldstein, T.; Osher, S. The Split Bregman Method for L1-Regularized Problems. SIAM J. Imaging Sci. 2009, 2, 323–343. [Google Scholar] [CrossRef]
Chamorro-Servent, J.; Abascal, J.F.; Aguirre, J.; Arridge, S.; Correia, T.; Ripoll, J.; Desco, M.; Vaquero, J.J. Use of Split Bregman denoising for iterative reconstruction in fluorescence diffuse optical tomography. J. Biomed. Opt. 2013, 18, 076016. [Google Scholar] [CrossRef] [PubMed]
Hashemi, S.; Song, W.Y.; Sahgal, A.; Lee, Y.; Huynh, C.; Grouza, V.; Nordström, H.; Eriksson, M.; Dorenlot, A.; Régis, J.M. Simultaneous deblurring and iterative reconstruction of CBCT for image guided brain radiosurgery. Phys. Med. Biol. 2017, 62, 2521. [Google Scholar] [CrossRef]
Chen, C.; Xu, G. A new linearized split Bregman iterative algorithm for image reconstruction in sparse-view X-ray computed tomography. Comput. Math. Appl. 2016, 71, 1537–1559. [Google Scholar] [CrossRef]
Pelt, D.M.; Batenburg, K.J. A method for locally approximating regularized iterative tomographic reconstruction methods. arXiv 2016, arXiv:1604.02292. [Google Scholar] [CrossRef]
Biguri, A.; Dosanjh, M.; Hancock, S.; Soleimani, M. TIGRE: A MATLAB-GPU toolbox for CBCT image reconstruction. Biomed. Phys. Eng. Express 2016, 2, 055010. [Google Scholar] [CrossRef]
Mäkinen, Y.; Azzari, L.; Foi, A. Collaborative filtering of correlated noise: Exact transform-domain variance for improved shrinkage and patch matching. IEEE Trans. Image Process. 2020, 29, 8339–8354. [Google Scholar] [CrossRef]
Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef]
Xu, Q.; Tong, X.; Lin, M.; Chen, X.; ElDib, A.; Lin, T.; Chen, L.; Ma, C.C. Time and frequency to observe fiducial markers in MLC-modulated fields during prostate IMRT/VMAT beam delivery. Phys. Med. 2020, 76, 142–149. [Google Scholar] [CrossRef]
Azcona, J.D.; Li, R.; Mok, E.; Hancock, S.; Xing, L. Automatic prostate tracking and motion assessment in volumetric modulated arc therapy with an electronic portal imaging device. Int. J. Radiat. Oncol. Biol. Phys. 2013, 86, 762–768. [Google Scholar] [CrossRef][Green Version]
Zhang, H.; Zeng, D.; Zhang, H.; Wang, J.; Liang, Z.; Ma, J. Applications of nonlocal means algorithm in low-dose X-ray CT image processing and reconstruction: A review. Med. Phys. 2017, 44, 1168–1185. [Google Scholar] [CrossRef]
Chen, G.H.; Tang, J.; Leng, S. Prior image constrained compressed sensing (PICCS): A method to accurately reconstruct dynamic CT images from highly undersampled projection data sets. Med. Phys. 2008, 35, 660–663. [Google Scholar] [CrossRef]
Ertas, M. A nonlinear total variation based computed tomography (CT) image reconstruction method using gradient reinforcement. PeerJ 2024, 12, e16715. [Google Scholar] [CrossRef] [PubMed]
Schroder, L.; Stankovic, U.; Rit, S.; Sonke, J.J. Image quality of dual-energy cone-beam CT with total nuclear variation regularization. Biomed. Phys. Eng. Express 2022, 8, 025012. [Google Scholar] [CrossRef] [PubMed]
Xi, Y.; Zhou, P.; Yu, H.; Zhang, T.; Zhang, L.; Qiao, Z.; Liu, F. Adaptive-weighted high order TV algorithm for sparse-view CT reconstruction. Med. Phys. 2023, 50, 5568–5584. [Google Scholar] [CrossRef]
Xie, S.; Zheng, X.; Chen, Y.; Xie, L.; Liu, J.; Zhang, Y.; Yan, J.; Zhu, H.; Hu, Y. Artifact Removal using Improved GoogLeNet for Sparse-view CT Reconstruction. Sci. Rep. 2018, 8, 6700. [Google Scholar] [CrossRef]
Zhang, R.; Szczykutowicz, T.P.; Toia, G.V. Artificial Intelligence in Computed Tomography Image Reconstruction: A Review of Recent Advances. J. Comput. Assist. Tomogr. 2025, 49, 521–530. [Google Scholar] [CrossRef]
Wang, J.; Zeng, L.; Wang, C.; Guo, Y. ADMM-based deep reconstruction for limited-angle CT. Phys. Med. Biol. 2019, 64, 115011. [Google Scholar] [CrossRef] [PubMed]
Cheng, W.; Wang, Y.; Li, H.; Duan, Y. Learned full-sampling reconstruction from incomplete data. IEEE Trans. Comput. Imaging 2020, 6, 945–957. [Google Scholar] [CrossRef]
Honzatko, D.; Krulis, M. Accelerating block-matching and 3D filtering method for image denoising on GPUs. J. Real-Time Image Process. 2019, 16, 2273–2287. [Google Scholar] [CrossRef]
McNiven, A.L.; Sharpe, M.B.; Purdie, T.G. A new metric for assessing IMRT modulation complexity and plan deliverability. Med. Phys. 2010, 37, 505–515. [Google Scholar] [CrossRef] [PubMed]
Götstedt, J.; Karlsson Hauer, A.; Bäck, A. Development and evaluation of aperture-based complexity metrics using film and EPID measurements of static MLC openings. Med. Phys. 2015, 42, 3911–3921. [Google Scholar] [CrossRef]
Jaffray, D.A.; Siewerdsen, J.H.; Wong, J.W.; Martinez, A.A. Flat-panel cone-beam computed tomography for image-guided radiation therapy. Int. J. Radiat. Oncol. Biol. Phys. 2002, 53, 1337–1349. [Google Scholar] [CrossRef]
Zhao, X.; Zhang, R. Feasibility of 4D VMAT-CT. Biomed. Phys. Eng. Express 2022, 8, 065018. [Google Scholar] [CrossRef]
Bian, J.; Siewerdsen, J.H.; Han, X.; Sidky, E.Y.; Prince, J.L.; Pelizzari, C.A.; Pan, X. Evaluation of sparse-view reconstruction from flat-panel-detector cone-beam CT. Phys. Med. Biol. 2010, 55, 6575–6599. [Google Scholar] [CrossRef]
Dong, W.; Zhang, L.; Shi, G.; Li, X. Nonlocally centralized sparse representation for image restoration. IEEE Trans. Image Process. 2013, 22, 1620–1630. [Google Scholar] [CrossRef]
Setzer, S. Operator splittings, Bregman methods and frame shrinkage in image processing. Int. J. Comput. Vis. 2011, 92, 265–280. [Google Scholar] [CrossRef]
Ramani, S.; Fessler, J.A. A splitting-based iterative algorithm for accelerated statistical X-ray CT reconstruction. IEEE Trans. Med. Imaging 2012, 31, 677–688. [Google Scholar] [CrossRef]
Lebrun, M. An Analysis and Implementation of the BM3D Image Denoising Method. Image Process. On Line 2012, 2, 175–213. [Google Scholar] [CrossRef]
Wisselink, H.J.; Pelgrim, G.J.; Rook, M.; Dudurych, I.; van den Berge, M.; de Bock, G.H.; Vliegenthart, R. Improved precision of noise estimation in CT with a volume-based approach. Eur. Radiol. Exp. 2021, 5, 39. [Google Scholar] [CrossRef]

Figure 1. The flow chart of the iterative TV-BM3D reconstruction algorithm proposed in this study. ORACM: Online region-based active contouring; MLC: Multi-leaf collimator.

Figure 2. VMAT-CT reconstructions of a Rando phantom. (First column) Pretreatment CBCT overlaid by the prescription isodose lines (red); (second column) VMAT-CT reconstructed with FDK; (third column) VMAT-CT reconstructed with FDK + preprocessing; (fourth column) VMAT-CT reconstructed with iterative + preprocessing.

Figure 3. VMAT-CT reconstructions of real-patient cases. (First column) Pretreatment CBCT images overlaid by the prescription isodose lines (red); (second column) VMAT-CT reconstructed with FDK; (third column) VMAT-CT reconstructed with FDK + preprocessing; (fourth column) VMAT-CT reconstructed with iterative + preprocessing.

Figure 4. Boxplot of CNR of VMAT-CT in the phantom study.

Figure 5. Boxplot of SSIM of VMAT-CT in the phantom study.

Figure 6. Boxplot of CNR of VMAT-CT in the real-patient study.

Figure 7. Boxplot of SSIM of VMAT-CT in the real-patient study.

Table 1. Arc ranges and number of acquired EPID images per treatment site.

	ESO	LL	RL	H&N
Arc range	[−175, 175]	[−30, 175]	[−175, 30]	[−175, 175]
Number of EPID images	281 ± 127	213 ± 67	229 ± 48	248 ± 86

Table 2. The post hoc Tukey test of CNR of VMAT-CT in the phantom study for various combinations of reconstruction methods and treatment sites.

	CNR (p Value)
	ESO	LL	RL	H&N
FDK vs. FDK + preprocessing	<0.0001	0.0004	0.0117	0.0368
FDK vs. iterative + preprocessing	<0.0001	<0.0001	<0.0001	<0.0001
FDK + preprocessing vs. iterative + preprocessing	<0.0001	<0.0001	0.0001	0.0002

Table 3. The post hoc Tukey test of SSIM of VMAT-CT in the phantom study for various combinations of reconstruction methods and treatment sites.

	SSIM (p Value)
	ESO	LL	RL	H&N
FDK vs. FDK + preprocessing	0.0102	0.0356	0.0037	0.8084
FDK vs. iterative + preprocessing	0.0013	0.0002	<0.0001	0.0473
FDK + preprocessing vs. iterative + preprocessing	<0.0001	<0.0001	<0.0001	0.012

Table 4. The post hoc Tukey test in CNR and SSIM of VMAT-CT in the real-patient study for various combinations of reconstruction methods, using post hoc Tukey tests.

	CNR (p Value)	SSIM (p Value)
FDK vs. FDK + preprocessing	0.0088	0.0339
FDK vs. iterative + preprocessing	<0.0001	<0.0001
FDK + preprocessing vs. iterative + preprocessing	0.0002	0.0008

Table 5. Computational time of VMAT-CT reconstructions.

Algorithm	Time (s)
FDK	153 ± 61
FDK + preprocessing	307 ± 91
Iterative + preprocessing	471 ± 122

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chien, C.-L.; Guo, B.; Zhang, R. A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction. J. Imaging 2026, 12, 166. https://doi.org/10.3390/jimaging12040166

AMA Style

Chien C-L, Guo B, Zhang R. A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction. Journal of Imaging. 2026; 12(4):166. https://doi.org/10.3390/jimaging12040166

Chicago/Turabian Style

Chien, Chia-Lung, Beibei Guo, and Rui Zhang. 2026. "A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction" Journal of Imaging 12, no. 4: 166. https://doi.org/10.3390/jimaging12040166

APA Style

Chien, C.-L., Guo, B., & Zhang, R. (2026). A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction. Journal of Imaging, 12(4), 166. https://doi.org/10.3390/jimaging12040166

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A TV–BM3D Iterative Algorithm for VMAT-CT Reconstruction

Abstract

1. Introduction

2. Materials and Methods

2.1. TV-BM3D Iterative VMAT-CT Reconstruction

2.2. Image Quality (IQ) Analysis

3. Results

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI