1. Introduction
Tensor completion aims to recover missing entries of partially observed multi-dimensional data by exploiting structural regularity across different modes [
1,
2,
3,
4,
5,
6,
7,
8]. Because tensors preserve multi-way relationships that are often lost under matrix flattening, tensor-based models have been widely used in image restoration, video processing, recommendation systems, and other applications involving high-dimensional structured data [
1,
9,
10,
11,
12,
13,
14,
15,
16,
17]. In many practical scenarios, however, tensor observations are incomplete due to occlusion, transmission errors, sensing limitations, or data corruption [
1,
18,
19], making reliable recovery of missing entries a central problem.
A common modeling assumption is that the underlying tensor admits approximately low-rank structures across its mode unfoldings. Under this assumption, tensor completion can be viewed as a rank minimization problem constrained by the observed entries [
20]. Since direct tensor-rank minimization is NP-hard [
21], practical methods typically replace rank with tractable surrogates, often employing advanced optimization algorithms such as accelerated gradient or coordinate descent methods [
22,
23,
24]. One of the most widely used formulations minimizes a weighted sum of nuclear norms of the unfolding matrices [
11,
13,
25], leading to the convex combination model
where
and
. This formulation has shown good empirical performance and underlies several influential tensor completion methods, including high-accuracy low-rank tensor completion (HaLRTC) [
1].
Despite its practical success, the convex-combination formulation imposes a restrictive weighting rule: all mode weights must be non-negative. This requirement is natural from the convex-analysis viewpoint, but it may also limit the flexibility with which different tensor modes are balanced in reconstruction. In particular, when different modes exhibit heterogeneous structural roles, a purely non-negative weighting scheme may be too rigid to express mode-dependent emphasis or relaxation. Motivated by this observation, we study whether allowing signed weights can provide a useful extension of unfolding-based nuclear-norm regularization.
Based on this idea, we consider the following linear combination model:
where
are predetermined weights satisfying
. Relative to the standard convex combination, this model enlarges the admissible weighting space while retaining the same unfolding-based nuclear-norm structure. To solve it numerically, we build on the ADMM framework of HaLRTC and introduce an extended singular value thresholding-style update that handles both positive and negative weights.
Our main contributions are threefold:
Model formulation. We extend the standard convex-combination weighting rule to a signed linear-combination framework for unfolding-based tensor nuclear norms.
Algorithm design. We develop an ADMM-based algorithm, FlexHaLRTC, with an extended SVT-style update to implement the signed-weight model.
Empirical evaluation. We report competitive results on image inpainting and video completion, with more noticeable gains in several high-missing-rate settings.
The remainder of this paper is organized as follows.
Section 2 introduces the notation and preliminaries.
Section 3 presents our FlexHaLRTC (flexible high-accuracy low-rank tensor completion) algorithm, an ADMM-based method for solving the proposed linear combination model.
Section 4 provides experimental results on color image inpainting.
Section 5 extends the evaluation to video completion tasks.
Section 6 concludes with discussions and future directions.
2. Notation and Preliminaries
We follow standard mathematical conventions. Uppercase letters (e.g.,
X) denote matrices, lowercase letters (e.g.,
) denote scalars, and calligraphic letters (e.g.,
) denote tensors. For a matrix
, the Frobenius norm is defined as
, and the nuclear norm as
, where
denotes the
i-th largest singular value. The nuclear norm is the tightest convex relaxation of the matrix rank function [
26,
27]. The singular value decomposition (SVD) of
X is denoted as
, where
and
have orthonormal columns,
with
, and
. For an
n-th order tensor
, a mode-
i fiber is a vector obtained by fixing all indices except the
i-th one, and the mode-
i unfolding
is a matrix formed by arranging all mode-
i fibers as its columns. The Tucker rank of
is the tuple
where
. When we refer to
in optimization problems, it denotes the Tucker rank unless otherwise specified. The Frobenius norm of tensor
is defined as
, which satisfies
for all
i.
For the tensor completion problem, denotes the set of indices of observed entries, denotes the restriction of tensor to , and denotes the observed values. Given weight parameters , a convex combination requires and , while a linear combination allows with the normalization constraint . In the ADMM algorithm, denotes the auxiliary matrix variable for mode i at iteration k, denotes the Lagrange multiplier matrix, and is the penalty parameter.
The singular value thresholding (SVT) operator [
28] serves as the proximal operator of the nuclear norm. For a matrix
A with SVD
, where
and threshold
, the standard SVT is
, which solves
and promotes low-rank structure through soft thresholding. To accommodate negative weights, we introduce an extended SVT-style update for negative thresholds: for
, we define
. This extension should be viewed as an algorithmic rule that symmetrically expands singular values when the corresponding weight is negative, rather than as the proximal operator of a convex penalty. The unified definition is
In our ADMM framework, the threshold for mode i is . Consequently, when , induces rank reduction via shrinkage; when , it induces rank enhancement via expansion; when , no thresholding is applied.
3. Optimization Algorithm
As discussed in the previous section, the linear combination model is intended to relax the expressiveness limitations of the traditional convex combination framework by allowing the weights to take arbitrary real values, subject only to the normalization constraint . This added flexibility also introduces additional challenges in the optimization procedure.
To efficiently solve the resulting optimization problem, we adopt an Alternating Direction Method of Multipliers (ADMM) framework [
29], which is based on the successful HaLRTC algorithm. The ADMM framework is well suited to this problem because it decomposes the objective into simpler subproblems that can be handled efficiently. Our algorithm follows the overall structure of HaLRTC, with the main modification being the singular value thresholding (SVT) step, which is extended to accommodate both positive and negative weights.
3.1. HaLRTC ADMM Framework
The HaLRTC algorithm solves the convex combination tensor completion problem using ADMM. The optimization problem is reformulated by introducing auxiliary variables
for each mode
i, representing the low-rank approximations of the mode-unfolding matrices:
where
and
. The augmented Lagrangian function is
The ADMM algorithm iteratively updates the variables as follows:
This step involves minimizing the augmented Lagrangian with respect to
:
The solution is obtained via the singular value thresholding (SVT) operator:
where
. Since
in HaLRTC, the threshold
is non-negative, and the SVT operator shrinks singular values via soft thresholding.
This step minimizes the augmented Lagrangian with respect to
:
The solution is
where
denotes the tensor reconstructed by folding the matrix back along mode
i.
The Lagrange multipliers are updated using
3.2. Extension to Linear Combination
We now extend the HaLRTC framework to allow arbitrary real-valued weights subject to the constraint . We refer to this extended algorithm as FlexHaLRTC to emphasize its flexibility in handling both positive and negative weights. Because the signed weights relax the standard non-negativity constraint, the resulting objective should be viewed as a heuristic extension of the original convex-combination model rather than a standard convex optimization problem. The key modification lies in the singular value thresholding step, where the threshold can now be negative.
Following the unified SVT definition from
Section 2, the
update becomes
where the extended SVT operator is defined as
and
is the singular value decomposition of
A.
When
(positive weight), we have
, and the SVT operator performs soft thresholding. The element-wise transformation is
This shrinkage operation promotes low-rank structures by eliminating small singular values and reducing the magnitude of larger ones.
When
(negative weight), we have
, and the SVT operator expands the singular values. The element-wise transformation is
Since
, we have
; thus, this operation uniformly increases all singular values by
. In algorithmic terms, this expansion increases the influence of mode
i in the reconstruction and can counterbalance over-aggressive low-rank shrinkage when the data contains a richer structure than a purely low-rank model can capture.
The negative weights can therefore be interpreted as a flexible way to modulate the relative influence of different tensor modes, rather than enforcing low-rank regularization uniformly across all unfoldings. This interpretation is particularly reasonable for RGB data, where the color mode has only three channels and is structurally different from the spatial modes; using a slightly negative weight helps avoid over-constraining inter-channel variation while preserving the dominant spatial low-rank structure. A heuristic interpretation of this extended SVT update is provided in
Appendix A.
The mathematical formulation for our linear combination model is identical to (
4), except that the weights
are no longer constrained to be non-negative. The update rules for
and
remain the same as in HaLRTC, with only the
update step modified to handle negative thresholds through the extended SVT-style update.
3.3. Weight Parameter Selection
In the FlexHaLRTC framework, the weights are treated as hyperparameters. While theoretical guidelines for optimal weight selection remain an open research question, we adopt an empirical approach. The weights can be chosen based on prior knowledge about the data or tuned on a validation set. For instance, if one mode is known to be more corrupted or less structured, it could be assigned a negative or smaller positive weight. In practice, we found that a mild negative weight on the color mode, such as in the image experiments, is more stable than using a large negative value. This provides a mechanism for mode-dependent regularization.
In our experiments (
Section 4), we demonstrate this process by conducting a grid search on a validation image to identify an effective weight configuration, which is then applied to the entire test set. The complete procedure of our proposed method is summarized in Algorithm 1.
| Algorithm 1 FlexHaLRTC algorithm. |
| Require: Observed entries , observation index set , weights with |
| (may be negative), penalty parameter , maximum iterations K |
| Ensure: Completed tensor |
- 1:
Initialize , for - 2:
Initialize and - 3:
for to K do - 4:
for to n do - 5:
- 6:
Compute the SVD and let - 7:
- 8:
if then - 9:
- 10:
else - 11:
- 12:
end if - 13:
end for - 14:
- 15:
- 16:
for to n do - 17:
- 18:
end for - 19:
end for - 20:
return
|
FlexHaLRTC extends the traditional ADMM framework by allowing signed weights in the unfolding-based regularization term and by incorporating the corresponding extended SVT-style update. This generalization provides a more flexible mode-dependent regularization mechanism while retaining a simple implementation. In practice, the algorithm exhibits stable empirical behavior under moderate negative weights for the parameter ranges considered in our experiments.
4. Experimental Results
4.1. Experimental Configuration and Implementation Details
To ensure reproducibility and facilitate a fair comparison, we provide comprehensive implementation details for all experiments conducted in this study.
4.1.1. Hardware and Software Environment
All experiments were conducted on a laptop with an Intel Core i5-10200H CPU and 16 GB RAM running Windows 11 (64-bit). We used MATLAB R2021a and Python 3.8.10 (NumPy 1.21.2, SciPy 1.7.1, PyTorch 1.9.0, OpenCV 4.5.3, Matplotlib 3.4.3, scikit-image 0.18.3, Pillow 8.3.1). The seven test images (Airplane, Baboon, Cloudy, House, Panda, Peppers, and Parrot) and the Foreman/Akiyo video sequences are standard public benchmarks widely used in the image and video processing community (e.g., from the USC-SIPI Image Database and other open research repositories). These datasets are publicly available for academic research purposes.
4.1.2. Algorithm-Specific Hyperparameters
For a fair comparison, we used the parameter settings recommended in the original publications for all baselines.
For the proposed FlexHaLRTC and the baseline HaLRTC algorithms, we utilized a fixed-iteration stopping criterion. Specifically, we set the penalty parameter and fixed the maximum iterations at . HaLRTC utilized its default uniform weights . Conversely, for FlexHaLRTC, the weight configuration was determined via a grid search on validation data, where the negative weight on the color mode enables more flexible inter-channel correlation modeling.
For other competing methods, we strictly followed the default parameter settings optimized in their original publications. Specifically, SILRTC [
1] was restricted to
with
. TNN [
13] and WSTNN [
30] utilized adaptive penalty updates with their default tolerance
. Lp-TNN [
31] was set with truncation parameter
and truncation rank
. Both tensor ring methods (NTRC and FANTRC [
32]) employed regularization parameter
with
.
4.2. Color Image Restoration Under Random Missing Patterns
4.2.1. Test Dataset and Evaluation Metrics
To evaluate our algorithm under random pixel missing scenarios, we selected seven representative
RGB images (
Figure 1) covering natural scenes and complex textures. We simulated uniform random pixel loss from 30% to 90% and compared FlexHaLRTC with seven state-of-the-art tensor completion algorithms (HaLRTC [
1], SILRTC [
1], TNN [
13], WSTNN [
30], Lp-TNN [
31], NTRC [
32], and FANTRC [
32]).
The restoration performance was quantitatively evaluated using PSNR, SSIM, and RSE. The peak signal-to-noise ratio (PSNR) is defined as
where MAX is the maximum possible pixel value (e.g., 255 for 8-bit images) and
is the mean squared error between the original image
X and the reconstructed image
. The structural similarity index (SSIM) is defined as
where
and
are the mean values,
and
are the variances,
is the covariance, and
and
are small constants for numerical stability. The relative squared error (RSE) is defined as
where
and
denote the original and reconstructed tensors, respectively. Higher PSNR and SSIM values indicate better reconstruction quality, while lower RSE values are preferred.
4.2.2. Empirical Convergence Analysis
Due to the introduction of negative weights in the proposed linear-combination model, establishing a rigorous theoretical convergence proof is non-trivial. Therefore, we provide an empirical convergence study on two representative images, House and Airplane, under a 90% missing rate. We compare FlexHaLRTC with the standard HaLRTC using the same observation masks and report the relative squared error (RSE) along the iterations.
As shown in
Figure 2, both methods exhibit stable empirical convergence behavior across different image contents. The RSE curves for both images gradually flatten after approximately 120 iterations. Notably, FlexHaLRTC attains a lower final reconstruction error than HaLRTC in both cases, with final RSE values reaching steady states significantly earlier. These results provide empirical evidence that the proposed algorithm remains stable and robust in practice despite the use of moderate negative weights.
It is worth noting that while the introduction of negative weights provides significant flexibility and performance gains, it also poses a non-trivial challenge for a rigorous theoretical convergence proof, as the objective function may deviate from standard convexity. However, the comprehensive empirical studies conducted on various images consistently demonstrate that FlexHaLRTC maintains a stable and monotonic convergence behavior in practice. We regard the formal mathematical derivation of the convergence boundaries for this flexible weighting scheme as a significant direction for our future research.
4.2.3. Sensitivity Analysis of Weight Parameter
The weight parameters balance the contributions of different tensor modes in FlexHaLRTC. To investigate the impact of the negative weight parameter , we conducted sensitivity experiments on the Baboon image at a 90% missing rate, varying from to 0 while maintaining .
As shown in
Figure 3, moderate negative weights in the color mode (
) consistently outperform purely positive configurations, with a clear PSNR improvement peak. However, extreme negative weights (
) lead to a sharp degradation in performance and potential numerical instability. This empirical observation supports our choice of
as a robust default for natural image completion.
4.2.4. Sensitivity Analysis of Penalty Parameter
The penalty parameter in the ADMM framework plays a critical role in balancing the convergence speed and solution accuracy. To systematically analyze the impact of , we conducted sensitivity experiments on the Baboon image at a 90% missing rate, testing 35 different values spanning seven orders of magnitude from to .
Figure 4 illustrates the performance variation with respect to the penalty parameter
. The results suggest an effective plateau region roughly between
and
, where the model maintains good reconstruction accuracy and limited sensitivity to parameter variations. Outside this region, performance degrades due to either insufficient regularization or excessive penalty. Consequently, we set
as the default parameter, which lies within this effective range and provides a practical balance between reconstruction quality and numerical stability.
4.2.5. Algorithm Stability Analysis
To assess robustness, we repeated the experiment 10 times with different random seeds (42–51) at four representative values of (, , , and ).
Figure 5 shows low run-to-run variance in the effective
range (PSNR standard deviation below 0.1 dB; RSE standard deviation below 0.003), suggesting stable empirical behavior under random missing patterns. Since this trend is consistent with the
-sensitivity results, we treat stability as a supplementary robustness check rather than a separate core finding.
4.2.6. Visual Quality Assessment
To visually demonstrate the effectiveness of our method,
Figure 6 presents a comprehensive comparison of reconstruction results for seven test images at a 90% missing rate.
FlexHaLRTC generally provides competitive visual quality (
Figure 6), with improved edge preservation and artifact suppression on several images. For instance, it recovers the structural details of
House and the textures of
Baboon more clearly than SILRTC and TNN. However, the latest results also show that HaLRTC performs better on some highly smooth or strongly correlated cases (such as
Cloudy), while
Pepper remains an exception, where HaLRTC is clearly better. To support this observation, we computed RGB inter-channel correlations on the seven test images: the mean pairwise correlation of
Pepper is 0.5315 (R-G: 0.2944, R-B: 0.4341, G-B: 0.8661), lower than highly correlated images, such as
Cloudy (0.9699),
House (0.9567), and
Panda (0.9678). This weaker cross-channel coupling may partly explain the reduced benefit of a negative color-mode weight on
Pepper. Overall, the visual ranking is broadly consistent with the quantitative results.
4.2.7. Quantitative Performance Analysis
Table 1 reports the quantitative evaluation at a 90% missing rate. The results demonstrate that FlexHaLRTC achieves superior performance on the majority of the test set, attaining the highest PSNR or lowest RSE on five out of seven images. Compared to the baseline HaLRTC, FlexHaLRTC provides consistent improvements across most natural scenes, such as
House (21.49 dB vs. 20.92 dB) and
Baboon (18.72 dB vs. 17.89 dB).
While the average PSNR across all seven images shows a marginal decrease of 0.09 dB, this is primarily attributed to the significant performance gap observed in two specific cases:
Cloudy and
Pepper. As discussed in
Section 3.3, our weight configuration
leverages inter-channel correlations. In images like
Pepper, where the cross-channel coupling is relatively weak (mean correlation 0.5315), the benefit of negative color-mode weighting is diminished. However, the consistent gains on the other five images suggest that the proposed signed-weight strategy effectively enhances the expressive power of the unfolding-based model for general natural images. In terms of efficiency, both FlexHaLRTC and HaLRTC exhibit comparable computational costs, typically requiring 62–66 s per image.
Figure 7 illustrates performance trends from a 30% to 90% missing rate on the Cloudy image. The three panels report RSE, PSNR, and SSIM, respectively. FlexHaLRTC performs competitively across the full range and shows more noticeable advantages at higher missing rates. In particular, the gap to HaLRTC becomes more visible beyond 70%, suggesting that the linear-combination weighting is especially useful in high-sparsity settings.
4.3. Cross-Dataset Generalization on Hyperspectral Imaging
To demonstrate the cross-dataset generalization capability of the proposed algorithm, we evaluated its performance on the standard Pavia University hyperspectral image (HSI) dataset. The original data were cropped to a sub-volume to serve as a 3-way tensor. We simulated a severe missing scenario with an 80% random missing rate.
Using the weight configuration
, FlexHaLRTC successfully reconstructed the hyperspectral data, achieving a PSNR of 28.01 dB, an SSIM of 0.7674, and an RSE of 0.180. These results were compared with several baseline methods: SiLRTC (PSNR: 14.10 dB), HaLRTC (PSNR: 31.21 dB), and Matrix Completion (MC) (PSNR: 27.01 dB). These quantitative results confirm that the proposed linear combination model and its corresponding optimization framework maintain stable execution and robust recovery performance across diverse data modalities, extending beyond standard RGB natural images. The visual comparison is shown in
Figure 8.
4.4. Color Image Restoration Under Structured Block Missing
In practical applications, missing data often appears in contiguous structured blocks rather than random isolated pixels. To evaluate the robustness of FlexHaLRTC against such challenging scenarios, we conducted a structured block missing experiment on the standard House image (). A contiguous block of data was removed from the center of the image, severely disrupting the continuous structural information of the scene.
FlexHaLRTC was comprehensively compared against standard algorithms, including HaLRTC and SiLRTC, with Matrix Completion (MC) being applied on the unfolded matrix. The quantitative results demonstrate the significant advantage of the proposed method: FlexHaLRTC achieved a PSNR of 29.47 dB, SSIM of 0.9623, and RSE of 0.063. It outperformed not only HaLRTC (PSNR: 28.17 dB) and SiLRTC (PSNR: 18.32 dB) by a large margin, but also marginally surpassed the Matrix Completion (MC) baseline (PSNR: 29.35 dB). Visually, FlexHaLRTC better preserves the geometric continuity of the occluded regions, confirming that the flexible weight adjustment provides critical regularizations for structured missing data recovery. The visual results with zoom-in details are presented in
Figure 9.
5. Video Completion Experiments
5.1. Video Dataset and Experimental Configuration
To further assess the capabilities of FlexHaLRTC in temporal data completion, we evaluate its performance on video completion tasks using the standard Akiyo and Cricket video sequences, widely used benchmarks in video processing research. The original video sequences consist of frames at a CIF resolution ( pixels). For computational efficiency and a fair comparison with baseline methods, we resized all frames to pixels using bicubic interpolation while preserving the aspect ratio. We selected frames 1–100 for our experiments to ensure sufficient temporal correlation while maintaining reasonable computational cost.
5.1.1. Video-Specific Implementation Details
For video completion experiments, we employed two distinct tensor formulations. The first approach is single-frame completion, where each frame is treated independently as a 3D tensor with dimensions of height, width, and color channels. The second approach is multi-frame completion, where consecutive frames are stacked into a 4D tensor with dimensions of height, width, color channels, and time. The 4D formulation enables exploitation of temporal correlations across frames, leading to improved temporal consistency.
We simulated random pixel loss at four distinct missing rates of 20%, 40%, 60%, and 80%. For each missing rate, we generated 5 different random missing patterns and reported average performance metrics. The missing patterns were generated using uniform random sampling without replacement, consistent with the image completion experiments.
5.1.2. Algorithm Configuration for Video Completion
For video experiments, we compared FlexHaLRTC against three representative baselines (NTRC, FANTRC, and HaLRTC). We limited the baseline set to methods that are directly applicable to video tensors under our computational budget and implementation setting—to keep the comparison protocol consistent and tractable. To maintain consistency with the image experiments, we followed the same parameter-selection rationale rather than tuning separate settings for each missing rate or test case. Specifically, the penalty parameter and iteration budget were kept unchanged ( and for both FlexHaLRTC and HaLRTC). For FlexHaLRTC, the single-frame setting directly adopts the 3D image configuration . In the multi-frame setting, this design principle is extended to the temporal mode: positive weights are assigned to the two spatial modes and the temporal mode, while a mild negative weight is retained on the color mode to avoid over-regularizing inter-channel correlations, leading to .
HaLRTC: Uniform weights ( for 4D, and for 3D), penalty parameter , and fixed maximum iterations .
NTRC: Tensor ring rank for 4D tensors, learning rate , maximum iterations , convergence tolerance .
FANTRC: Adaptive tensor ring rank with maximum rank 12, learning rate , maximum iterations , convergence tolerance .
5.2. Single-Frame Completion Results
In the single-frame scenario, each video frame is treated independently as a 3D tensor with the following dimensions: height × width × color channels. This formulation allows for a direct application of the image completion methodology to video data. While this approach does not exploit temporal correlations between frames, it provides a useful reference setting and remains computationally simple.
Figure 10 presents a visual comparison of reconstruction results across four missing rates (20%, 40%, 60%, and 80%) for a representative frame from the
Akiyo sequence.
As shown in
Figure 10, visual differences become more pronounced as the missing rate increases. At a 20% missing rate, most methods produce acceptable results, while FlexHaLRTC and HaLRTC preserve finer details. At 40% and 60%, NTRC shows increasing blur and FANTRC degrades more noticeably; FlexHaLRTC remains visually competitive with clearer facial structures. At 80%, FlexHaLRTC and HaLRTC still recover recognizable facial content, with FlexHaLRTC showing fewer visible artifacts in this example.
Table 2 provides the quantitative evaluation and broadly agrees with the visual comparison.
FlexHaLRTC achieves the highest PSNR and lowest RSE in this table across all missing rates. For example, at a 40% missing rate, it improves over HaLRTC by 1.94 dB. At a 80% missing rate, it attains 25.38 dB PSNR and 0.771 SSIM, indicating that the reconstruction quality remains reasonable under severe sparsity.
While single-frame completion treats each frame independently, video data exhibits strong temporal correlations that can be exploited for improved reconstruction quality. In multi-frame completion, we stack consecutive frames into a 4D tensor with the following dimensions: height × width × color × time. This formulation allows the algorithm to leverage temporal redundancy, where similar content appears across adjacent frames.
The key challenge in multi-frame video completion is maintaining temporal consistency—ensuring smooth transitions between reconstructed frames without flickering artifacts. Traditional frame-independent methods often produce temporarily inconsistent results, where reconstruction quality varies significantly from frame to frame.
Table 3 summarizes the quantitative performance of FlexHaLRTC across two representative video sequences with different motion characteristics. The Akiyo sequence contains a relatively static background and slow head motion, while the Cricket sequence involves rapid athletic movements.
Figure 11 shows consecutive frames from the
Cricket (a) and
Akiyo (b) sequences at a 50% missing rate. The reconstructed sequences appear temporally smooth and visually consistent. For the
Akiyo sequence, FlexHaLRTC achieves an average PSNR of 30.12 dB and SSIM of 0.9116, as shown in
Table 3, demonstrating its high restoration quality even at high missing rates.
Across 100 frames of the Akiyo sequence, the average PSNR/SSIM values are 28.45 dB/0.876 versus 26.08 dB/0.831 for frame-independent processing. These results suggest that the linear-combination framework can be extended to 4D tensors and that it effectively improves temporal consistency in this setting.
6. Conclusions
This paper introduced FlexHaLRTC, a tensor completion method that replaces the standard convex combination of nuclear norms with a linear combination, allowing for negative weight parameters to be obtained. We proposed an extended singular value thresholding (SVT)-style update to handle both shrinkage (for positive weights) and expansion (for negative weights) of singular values within an ADMM-based algorithm.
Experiments on color image inpainting and video completion indicate that FlexHaLRTC performs competitively against the compared baselines, with more noticeable gains in several high-missing-rate settings. Negative weights appear to be useful for modeling mode interactions in many cases, yielding improvements in PSNR, SSIM, and RSE on most tested images. In addition, the video experiments follow the same parameter-selection rationale as the image experiments, suggesting that the proposed signed-weight strategy may transfer beyond a single data modality. However, the current weight selection relies on a grid search, and its generalization across broader data types still requires further study.
Future research directions include (1) developing adaptive or data-driven methods for automatic weight selection, (2) extending the linear combination framework to other tensor decomposition models such as tensor train and Tucker decomposition, and (3) providing a more rigorous theoretical analysis of the optimization dynamics induced by signed weights.