HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint

Singh, Dilbag; Regatte, Ravinder R.; Zibetti, Marcelo V. W.

doi:10.3390/bioengineering12010008

Open AccessArticle

HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint

by

Dilbag Singh

^*

,

Ravinder R. Regatte

and

Marcelo V. W. Zibetti

^*

Center of Biomedical Imaging, Department of Radiology, New York University Grossman School of Medicine, New York, NY 10016, USA

^*

Authors to whom correspondence should be addressed.

Bioengineering 2025, 12(1), 8; https://doi.org/10.3390/bioengineering12010008

Submission received: 29 October 2024 / Revised: 10 December 2024 / Accepted: 20 December 2024 / Published: 25 December 2024

(This article belongs to the Section Biomechanics and Sports Medicine)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Non-linear least squares (NLS) methods are commonly used for quantitative magnetic resonance imaging (MRI), especially for multi-exponential T1ρ mapping, which provides precise parameter estimation for different relaxation models in tissues, such as mono-exponential (ME), bi-exponential (BE), and stretched-exponential (SE) models. However, NLS may suffer from problems like sensitivity to initial guesses, slow convergence speed, and high computational cost. While deep learning (DL)-based T1ρ fitting methods offer faster alternatives, they often face challenges such as noise sensitivity and reliance on NLS-generated reference data for training. To address these limitations of both approaches, we propose the HDNLS, a hybrid model for fast multi-component parameter mapping, particularly targeted for T1ρ mapping in the knee joint. HDNLS combines voxel-wise DL, trained with synthetic data, with a few iterations of NLS to accelerate the fitting process, thus eliminating the need for reference MRI data for training. Due to the inverse-problem nature of the parameter mapping, certain parameters in a specific model may be more sensitive to noise, such as the short component in the BE model. To address this, the number of NLS iterations in HDNLS can act as a regularization, stabilizing the estimation to obtain meaningful solutions. Thus, in this work, we conducted a comprehensive analysis of the impact of NLS iterations on HDNLS performance and proposed four variants that balance estimation accuracy and computational speed. These variants are Ultrafast-NLS, Superfast-HDNLS, HDNLS, and Relaxed-HDNLS. These methods allow users to select a suitable configuration based on their specific speed and performance requirements. Among these, HDNLS emerges as the optimal trade-off between performance and fitting time. Extensive experiments on synthetic data demonstrate that HDNLS achieves comparable performance to NLS and regularized-NLS (RNLS) with a minimum of a 13-fold improvement in speed. HDNLS is just a little slower than DL-based methods; however, it significantly improves estimation quality, offering a solution for T1ρ fitting that is fast and reliable.

Keywords:

multi-component MRI fitting; non-linear least squares (NLS); T1ρ mapping; deep learning; knee joint

1. Introduction

Quantitative magnetic resonance imaging (qMRI) is essential for accurately evaluating tissue properties and detecting early pathological changes across various anatomical regions before the development of morphological changes [1,2,3]. Traditional qualitative MRI imaging, used in clinical practice, often lacks the sensitivity and specificity required for such early detection [4]. qMRI overcomes this problem by acquiring multi-contrast images with controlled variations in MRI pulse-sequence parameters [5]. It enables the estimation of tissue relaxation properties, and non-linear least squares (NLS) methods are usually the reference approaches to compute quantitative parameters [6,7,8].

NLS methods are fundamental tools in inverse problems and statistical estimations, particularly in parameter fitting. These methods extend linear regression to handle problems where the variables have a non-linear relationship. They aim to minimize the sum of squared differences between observed and predicted values [9,10]. The theoretical support and soundness of NLS methods, such as Jacobian-based approaches (like the Gauss–Newton and Levenberg–Marquardt approaches) and derivative-free algorithms (like the Nelder–Mead method) are strong, making them the preferred choice in multiple applications, but they are generally slow for large multi-dimensional datasets, among other problems [11,12,13].

Gauss–Newton methods are efficient for those problems where models are nearly linear. To achieve better convergence speed, it utilizes the Jacobian matrix to approximate the Hessian [9,10]. The Levenberg–Marquardt algorithm refines the Gauss–Newton method by using the concept of a damping factor. Thus, it enhances convergence stability and performance in more complex non-linear problems when the Hessian is ill-conditioned [11,12]. Powell’s Dog Leg method integrates Gauss–Newton and the steepest descent directions to effectively manage substantial curvature in the error surface. In contrast, hybrid approaches that combine the Levenberg–Marquardt algorithm with Quasi-Newton methods use adaptive step sizes and approximate Hessians to achieve robustness and efficiency in large-scale optimization problems [9,10,11]. Approximation-based versions of these methods, (i.e., adaptations of the Levenberg–Marquardt and Dog Leg approaches) replace exact derivative computations with approximations [12,13]. This substitution reduces computational complexity and improves efficiency [14,15]. Trust Region Conjugate Gradient (TRCG) [9] combines trust-region methods with iterative inversion of Hessian or its approximation, being a good choice for parameter fitting. Therefore, these approximation-based methods may perform better on high-dimensional and large-scale optimization problems.

However, these methods come with significant computational demands and the risk of converging to local minima, especially in complex parameter spaces. The accuracy of these methods is significantly dependent on the quality of initial parameter estimates. Thus, poor initialization potentially results in suboptimal solutions.

From the literature, it can be found that NLS methods are effective for T1ρ mapping but are hampered by slow computational speeds. As a result, NLS is typically applied only to specific regions of interest (ROIs), such as cartilage voxels in the knee joint. NLS can also be unstable when dealing with multi-component relaxation. To address these issues, researchers have developed variants like non-negative least squares (NNLS) [16,17,18] and regularized NLS (RNLS) [18,19].

NNLS adds a non-negativity constraint to stabilize estimations, ensuring real-valued, monotonically decreasing relaxations [16,17,18]. RNLS extends NLS by incorporating regularization penalties to improve stability and accommodate independent (in-voxel) or multi-voxel (spatial) regularization functions [18,19]. Thus, RNLS is more suitable for recovering all voxels in imaging applications. Improved stability makes RNLS somewhat faster than NLS, but it still requires many iterations.

1.1. Research Motivation

Considering an MR image slice of size 256 × 256 = 65,536 voxels, NLS fitting requires approximately 1 millisecond per voxel on high-performance computing (HPC) systems. Thus, fitting all 65,536 voxels would take about 65.54 s for a single slice. Extending this to 4D-MRI data or higher-dimensional scans, such as 512 × 512 or 1024 × 1024, results in exponential increases in computational time. Figure 1 illustrates the time required by our NLS fitting implementation for varying numbers of MRI slices on HPC systems and NVIDIA GPUs. For a single slice, the theoretical time on HPC systems is 1.09 min, while the NVIDIA GPU takes 2.24 min. Assuming a linear increase in time with the number of slices in a dataset, one can easily reach the HPC requirement of 1092.3 min and the NVIDIA GPU requirement of 2240 min for 1000 images. These data highlight the substantial computational demands of NLS fitting.

Due to the time-consuming nature of NLS methods, they are often limited to knee cartilage ROIs. However, recent research highlights the importance of the relaxation map of the full knee joint MRI for assessing osteoarthritis (OA) progression and detecting subtle changes in tissue properties beyond just the cartilage [20]. Therefore, researchers have started exploring artificial intelligence (AI) models as faster alternatives to NLS.

1.2. AI-Based Exponential Fitting

Recent advancements in AI offer suitable alternatives to traditional NLS methods. AI models, such as random forest regression, neural networks (NNs), and deep NNs have been extensively utilized to enhance exponential model fitting [21,22,23,24,25]. These models benefit from rapid training and deployment, thus providing stable and accurate curve fitting that is usually faster than existing NLS methods [26,27,28]. By utilizing advanced function approximation abilities, AI models can efficiently address the issues of slow convergence and suboptimal solutions associated with traditional NLS methods [23,24,25,26].

By incorporating AI into the fitting process, researchers have achieved significant improvements in parameter mapping [29,30,31,32]. Additionally, AI models can be trained in an end-to-end manner, i.e., learning optimal mappings from raw k-space or undersampled multi-contrast MRI data to T1ρ maps without the need for explicit MRI reconstruction and NLS fitting [33].

Therefore, researchers are actively exploring faster methods for obtaining quantitative MRI values of the knee joint, including deep learning (DL) approaches [34,35,36,37,38,39,40,41]. However, most supervised DL methods [34,35,36,37] rely on reference values produced by NLS methods. This requires applying NLS to thousands of voxels in each 3D image, and this process must be repeated for every image in a large dataset.

Moreover, the integration of AI-based models with traditional NLS methods has led to the development of hybrid approaches that utilize the strengths of both methods. For example, DL models can provide initial parameter estimates or refine results from NLS methods, thus enhancing accuracy and robustness in T1ρ estimation.

Table 1 presents a description of existing AI-based exponential fitting methods. The table highlights that the majority of AI-based solutions predominantly focus on ME fitting, leaving SE and BE relaxation models relatively underexplored. This limitation creates a significant gap in addressing the diverse relaxation dynamics present in different tissues. Additionally, most existing approaches are often sensitive to noise and prone to inaccuracies in parameter estimation, not providing similar goodness-of-fit properties to NLS-based techniques.

This gap underscores the importance of hybrid approaches, which integrate the strengths of DL and NLS. Such integration ensures improved parameter estimation, enhanced robustness to noise, and better handling of complex multi-component T1ρ mapping tasks. Thus, Table 1 further emphasizes the need for methods that go beyond ME fitting and utilize hybrid strategies to overcome the limitations of AI-based or traditional approaches.

1.3. Proposed Solution

Although AI-based exponential fitting models have demonstrated remarkable performance, most researchers have primarily focused on mono-exponential (ME) fitting, with limited exploration of bi-exponential (BE) and stretched-exponential (SE) fitting. Additionally, many AI-based exponential fitting methods require actual MRI data as a reference for training. To address these gaps, this paper proposes the HDNLS, a hybrid model for fast multi-component fitting, dedicated in this work to T1ρ mapping of the knee joint. In HDNLS, we initially designed a DL model that uses synthetic data for training, thus eliminating the need for reference MRI data. However, the estimated map may be sensitive to noise, particularly in the short component of the BE model. To address this, the results from the proposed DL model are further refined using NLS with an optimized number of iterations. To optimize the number of iterations, this paper presents an ablation study in which we analyze the trade-off between the number of NLS iterations and the mean normalized absolute difference (MNAD), as well as between the NLS iterations and normalized root-mean-squared residuals (NRMSRs).

1.4. Contributions

The key contributions of the paper are as follows:

(a): Initially, a DL-based multi-component T1ρ mapping method is proposed. This approach utilizes synthetic data for training, thereby eliminating the need for reference MRI data.
(b): The HDNLS method is proposed for fast multi-component T1ρ mapping in the knee joint. This method integrates DL and NLS. It effectively addresses key limitations of NLS, including sensitivity to initial guesses, poor convergence speed, and high computational cost.
(c): Since HDNLS’s performance depends on the number of NLS iterations, it becomes a Pareto optimization problem. To address this, a comprehensive analysis of the impact of NLS iterations on HDNLS performance is conducted.
(d): Four variants of HDNLS are suggested that balance accuracy and computational speed. These variants are Ultrafast-NLS, Superfast-HDNLS, HDNLS, and Relaxed-HDNLS. These methods enable users to select a suitable configuration based on their specific speed and performance requirements.

2. Proposed Methodology

The measured signal

s

of the relaxation of a voxel in position

x

at a TSL specified by

t

can be expressed as follows:

s (x, t) = f (x, t, θ) + v (x, t),

(1)

where

v

is the noise and

f (x, t, θ)

is the relaxation model with parameters

θ

. For simplicity, we will omit the spatial position

x

, assuming that it corresponds to the relaxation process in one voxel. Note that the parameters

θ

are also related to that specific voxel.

2.1. Multi-Component Relaxation Models

Generally, ME, BE, and SE models are commonly used for fitting a T1ρ map. Table 2 presents the mathematical formulations, associated parameters, descriptions, and applicable ranges for these three models. The ME model assumes a homogeneous tissue environment, while the BE model introduces two compartments with distinct relaxation times, and the SE model accounts for heterogeneous environments with distributed relaxation rates.

Table 3 highlights the key features, advantages, and disadvantages of each multi-component relaxation model. The choice of the specific model depends on the tissue characteristics and the requirements of the analysis. While the ME model is simple and suitable for homogeneous tissues, the BE model provides a more detailed representation of tissues with two relaxation components but is computationally more demanding. The SE model offers a balance by modeling complex tissue environments with fewer parameters and greater stability, though its physical interpretation can be less intuitive.

Figure 2 illustrates the working of multi-component relaxation models and their respective outputs. It includes T1ρ-weighted images with six TSLs and voxel-wise relaxation measurements. It presents the expected T1ρ maps obtained from ME, BE (the long component, the fraction of the long component, and the short component), and SE models (SE-T1ρ map and beta (β) parameter).

The NLS approach fits the parameters voxel-wise, minimizing the squared Euclidean norm of the residual, using the following:

\hat{θ} = \underset{θ}{arg min} \sum_{t = 1}^{T} {|s (t) - f (t, θ)|}^{2},

(2)

where

T

is the number of TSLs. This is solved using iterative algorithms such as the one described in [9].

2.2. Deep Learning-Based Multi-Component Relaxation Models

Figure 3 shows a diagram of the voxel-by-voxel working of the DL model for fitting multi-component T1ρ relaxation models. Typically, NNs are trained by minimizing the squared error of the target parameters. However, to achieve data consistency, as in NLS estimations, we used a customized loss function, designed as follows:

L o s s = \sum_{i} γ_{s} {‖s_{i} - {\hat{f}}_{i}‖}^{2} + γ_{θ} {‖θ_{i} - {\hat{θ}}_{i}‖}^{2},

(3)

where

i

is the index of the data elements in the training dataset,

{\hat{θ}}_{i} = F_{w} (s_{i})

,

s = {[s (1) \dots s (T)]}^{T}

being a column vector with the measured relaxation signal and

F_{w}

the fitting network with learning parameters

w

, and

{\hat{f}}_{i}

is the relaxation signal produced with the estimated parameters

{\hat{θ}}_{i}

, using any of the equations presented in Table 2. Target parameters

θ_{i}

are sampled from a known distribution,

Θ

, and

s_{i}

are synthetically generated using Equation (1) and equations presented in Table 2 and the sampled parameters.

The first term of the loss function tries to reduce the residue, as in NLS approaches, producing a relaxation curve consistent with the synthetically generated measured values at the voxel. The second term tries to reduce the error against the ground truth values. This term also acts as a regularization in the NLS sense. We used

γ_{s} = 10

and

γ_{θ} = 1

.

The NN architecture comprises 7 repeated blocks of fully connected layers, each containing 512 intermediate elements followed by a non-linear activation function and dropouts. Thereafter, we utilize customized layers to evaluate the relaxation values for the customized loss, as described in Equation (3). Finally, the regression layer is used to predict the T1ρ maps. This network estimates the relaxation parameters voxel-wise, like any of the NLS-based approaches.

2.3. Sensitivity Analysis of DL Model

We performed a sensitivity analysis of the proposed DL model to determine the optimal configuration of hyperparameters. Initially, multiple trial-and-error experiments were conducted using various hyperparameter values commonly reported in the literature. The final configuration comprised 7 fully connected layers, each containing 512 intermediate elements. Five different activation functions were also evaluated, such as the Gaussian Error Linear Unit (GELU), Leaky ReLU, the hyperbolic tangent (Tanh), Swish, and ReLU. The model was trained using the Adam optimizer, with a learning rate initialized at 1.0 × 10⁻³, over a maximum of 100 epochs. Each epoch processed mini-batches of 200 samples. We selected

γ_{s}

= 10 and

γ_{θ}

= 1 as the optimal weights for the proposed customized loss function. During the sensitivity analysis, we varied only the specific parameter under evaluation while keeping all other parameters fixed.

Figure 4 illustrates the impact of (a) optimizer selection (Adam, SGD, and RMSProp), (b) number of epochs (20 to 200), (c) batch size (50 to 1000), and (d) number of layers (4 to 11). From Figure 4a, it is evident that the RMSE shows a decreasing trend as the number of epochs increases. It eventually stabilized around 100 epochs. Adam outperformed SGD and RMSProp by achieving significantly lower RMSE values (see Figure 4b). Smaller batch sizes achieved lower RMSE values, with performance stabilizing beyond a batch size of 200 (see Figure 4c). Finally, seven fully connected layers achieved noticeably lower RMSE values compared to other configurations (see Figure 4d).

Figure 5 presents the sensitivity analysis of activation functions (ReLU, Leaky ReLU, GELU, Tanh, and Swish) for the DL model. The analysis evaluated these functions across three optimizers, namely, Adam, SGD, and RMSProp. It showed RMSE variations for ME, SE, and BE components under different activation function–optimizer combinations. Among these, the Adam optimizer combined with the ReLU activation function achieved significantly lower RMSE values across all ME, SE, and BE components.

Finally, the sensitivity analysis was conducted to evaluate the impact of varying

γ_{s}

and

γ_{θ}

in the customized loss function on validation data. For better analysis, we evaluated the average RMSE values across all ME, SE, and BE components. Figure 6a shows that increasing

γ_{θ}

from 1 to 10, with

γ_{s}

= 1, results in a rise in RMSE. This emphasizes the importance of keeping

γ_{θ}

relatively low. In contrast, Figure 6b demonstrates a decreasing trend in RMSE as

γ_{s}

increases from 1 to 15, with

γ_{θ}

= 1. This indicates the benefits of assigning a higher weight to

γ_{s}

. However, continually increasing

γ_{s}

in the loss function places an increasingly higher emphasis on minimizing the discrepancy between the measured relaxation signal

s_{i}

and the model-predicted signal

{\hat{f}}_{i}

. While this might improve the consistency of the relaxation curve with the synthetic measurements, it can increase the error in the estimated parameters. The model prioritizes signal fitting at the expense of accurately estimating the parameters

θ_{i}

, as the second term in the loss function becomes relatively low, reducing the impact of the regularization term in stabilizing the parameter estimation. As shown in Figure 6b, RMSE increases when

γ_{s}

exceeds 10, indicating a poor balance between signal consistency and parameter accuracy. Based on these observations, we selected

γ_{s}

= 10 and

γ_{θ}

= 1 as the optimal weights for our loss function.

2.4. Training and Validation Analysis

To ensure a robust evaluation of the proposed DL model, the synthetic dataset was divided into three subsets: training, validation, and test data, in an 80:10:10 ratio. The training subset (80% of the data) was used to train the DL model and optimize its parameters. To prevent overfitting, the validation subset (10% of the data) was employed to monitor the model’s performance during training and fine-tune hyperparameters.

Figure 7 illustrates the training and validation loss curves for 10 and 100 epochs, respectively. In Figure 7a, the difference between training and validation loss values is more apparent, reflecting the initial stages of model training. In contrast, Figure 7b shows that, with 100 epochs, it is difficult to distinguish between the loss values for both training and validation converge. This convergence indicates that the DL model does not suffer from overfitting.

This partitioning strategy also ensures efficient training of the DL model while providing a fair and unbiased assessment of its generalization capability. Data for each subset were randomly selected in a stratified manner. Therefore, the used data also preserved the distribution of key features across all subsets. This approach guarantees consistency and reliability in the evaluation process.

Finally, the test subset, comprising 10% of the data, was reserved for the final evaluation of the model’s performance on unseen data.

2.5. HDNLS

Figure 8 illustrates the proposed HDNLS-based fitting of multi-component T1ρ relaxation models. The estimated map obtained from the DL model may be sensitive to noise, particularly in the short component of BE. To mitigate this issue, the results from the DL model are further refined using NLS iterations. The DL results serve as an initial guess for NLS. This helps to address both the initial guess dependency and poor convergence speed associated with NLS.

In this paper, TRCG [9,51] is used to solve the NLS problem. At each iteration

k

, the parameters are estimated using the following:

θ^{(k + 1)} \leftarrow N L S_U p d a t e (s, θ^{(k)})

(4)

This update aims to minimize the residuals as follows:

r^{(k + 1)} \leftarrow {‖s - f (θ^{(k + 1)})‖}_{2}

(5)

where

s

represents the observed data and

f (θ^{(k + 1)})

denotes the model predictions based on the updated parameters. TRCG ensures that each step effectively refines the parameter estimates. Thus, it enhances the convergence of the optimization process to achieve an optimal fit to the data.

2.6. T1ρ-MRI Data Acquisition and Fitting Methods

All the experiments were performed on a 3T MRI scanner with magnetization-prepared angle-modulated partitioned k-space spoiled gradient echo sequence snapshots (MAPSS) [52], using six spin-lock time (TSL) values: 0.05, 0.6, 1.6, 6, 16, and 36 ms. We scanned a total of 12 healthy volunteers, with a mean age of 38 and SD = 12.4 years old (

n

= 7 males and

n

= 5 females). The MRI scans had a voxel size of 0.7 mm × 0.7 mm × 4 mm, with an FOV of 180 mm × 180 mm × 96 mm. This study was approved by the Institutional Review Board (IRB) of New York University Langone Health and was compliant with the Health Insurance Portability and Accountability Act (HIPAA). All volunteers provided their consent before MRI scanning.

For NLS-based fitting, we used 2000 iterations of the TRCG method [51]. The ME model assumes that the T1ρ values can be in the range of 1–300 ms. The SE model assumes that the T1ρ values can be in the range of 1–300 ms and that β can be in the range of 0.1–1. Because we observed no ME relaxation below 5 ms, we assume that the long component of the BE model can be in the range of 5–300 ms, the short component in the range of 0.5–4 ms, and the BE fraction in the range of 0.01–0.99.

The DL model is trained using the Adam optimizer with a maximum of 100 epochs, and each epoch processes mini-batches of 200 samples. The initial learning rate is set to 1.0 × 10⁻³. The dropout value is set to be 0.1. The proposed model was tested on 3D-T1ρ MRI data from five different healthy subjects. The DL model was trained on 5 healthy datasets.

2.7. Performance Metrics

For comparative analysis, four performance metrics were employed: the median of normalized absolute difference (MNAD), normalized root-mean-squared residuals (NRMSRs), fitting time, and normalized root-mean-squared error (NRMSE). Table 4 provides detailed descriptions and mathematical formulations of these metrics.

3. Optimal Selection of NLS Iterations for HDNLS

The main objective of this section is to evaluate and compare the performance metrics of NLS and HDNLS on synthetic data by varying the number of NLS iterations. Specifically, the focus is on assessing the convergence speed of both methods and determining whether HDNLS can achieve similar performance to NLS with improved speed.

Figure 9 shows the comparison between NLS and HDNLS on synthetic data in terms of MNAD across three different fitting models, namely, ME, SE, and BE. Row 1 shows the trade-off between fitting time (blue) and MNAD (red) for NLS as the number of NLS iterations increases. For each fitting model, MNAD decreases initially with more iterations but eventually shows a minor reduction, while the fitting time increases at a rapid rate. Row 2 shows the performance trade-off between fitting time and MNAD for the HDNLS approach. It also shows that as the number of NLS iterations increases, the HDNLS achieves significantly lesser MNAD values. However, after 500 iterations, the improvement in MNAD becomes marginal compared to the increase in fitting time. Row 3 provides direct comparisons between NLS and HDNLS. It shows that HDNLS takes significantly less time compared to NLS when the number of NLS iterations is less than 300. Thereafter, both NLS and HDNLS show almost similar MNAD values with no significant difference.

Similarly, Figure 10 presents the performance of NLS and HDNLS across ME, SE, and BE fitting models on synthetic data in terms of NRMSRs. The yellow line represents the NRMSR of noise only. The primary goal of NLS and HDNLS is to reduce the NRMSR to levels just below the noise level. It was found that in the case of HDNLS, there is little or no benefit in exceeding 500 iterations, except for the SE component. Please note that due to the significant variations in the scale of values for both NLS and HDNLS, it becomes difficult to observe changes in NLS when the number of iterations exceeds 100, 500, and 1000, respectively, for ME, BE, and SE components. To better illustrate this, we present the comparison of NRMSRs between NLS and HDNLS in row 3, clearly demonstrating the improved convergence speed of HDNLS over NLS. Additionally, the difference in NRMSRs for HDNLS after 500 iterations is minimal and can be disregarded.

From row 3 of both Figure 9 and Figure 10, it is evident that HDNLS converges significantly faster than standard NLS in terms of MNAD and NRMSRs, respectively. This makes HDNLS an efficient choice for determining the number of NLS iterations based on specific requirements. For scenarios where speed is the highest priority and small performance compromises are acceptable, 10 iterations, referred to as Ultrafast-NLS, provide quick results with minimal computational cost. Superfast-HDNLS, with 50 NLS iterations, strikes a balance between speed and performance, offering a faster alternative with acceptable accuracy. HDNLS with 200 iterations achieves an optimal middle ground, delivering both rapid convergence and higher performance. However, for SE, we observed that the results at 200 iterations are not as desirable as those for ME and BE; therefore, we selected HDNLS with 300 iterations for SE map fitting only.

For those who prioritize performance over speed, 500 iterations, termed Relaxed-HDNLS, provide the best results while tolerating slower computation times. Thus, HDNLS allows users to select a suitable configuration based on their specific speed and performance requirements.

To check the performance of these HDNLS variants, we further tested them on real MRI data (see Section 2.4 for dataset details). Due to the unavailability of ground truth T1ρ maps, we used the NLS-based estimated T1ρ map as the ground truth. The estimated ME and SE T1ρ maps of the whole knee from NLS and HDNLS variants are presented in Figure 11, along with their respective squared errors compared to the NLS-based ME T1ρ map. Similarly, Figure 12 shows the ME and SE T1ρ maps for the knee cartilage only. Both figures demonstrate that HDNLS and Relaxed-HDNLS achieve significantly better results, with notably lower errors, compared to Ultra-HDNLS and Super-HDNLS.

Although it is challenging to visually distinguish differences between the obtained T1ρ maps, the evaluated squared error differences provide a clear distinction. Higher squared error values indicate poorer performance. From Figure 11 and Figure 12, the squared error maps for HDNLS and Relaxed-HDNLS show significantly lower errors. This highlights their superior performance and closer alignment with the NLS-based T1ρ maps.

Figure 13 presents the estimated BE long T1ρ component, the fraction of the long T1ρ component, and short T1ρ maps of the whole knee from NLS and HDNLS variants, along with their respective squared errors compared to the ground truth, i.e., the NLS-based ME T1ρ map. Similarly, Figure 14 shows the estimated BE long T1ρ component, the fraction of the long T1ρ component, and short T1ρ maps for knee cartilage only. Both figures demonstrate that HDNLS and Relaxed-HDNLS achieve significantly better results with much lower errors compared to Ultra-HDNLS and Super-HDNLS.

Table 5 and Table 6 provide a quantitative analysis of HDNLS variants, i.e., Ultra-HDNLS, Super-HDNLS, HDNLS, and Relaxed-HDNLS, for estimating ME, SE, and BE T1ρ maps using real data for the full knee and knee cartilage, respectively. The Relaxed-HDNLS variant outperformed the others by achieving the minimum MNAD and NRMSR values. In contrast, Ultra-HDNLS provided the fastest fitting time for the estimation of all ME, SE, and BE T1ρ maps, but had higher MNAD and NRMSR values. Therefore, Table 5 and Table 6 highlight a trade-off in selecting the best approach. Faster fitting times come at the cost of performance. This necessitates careful consideration of whether to prioritize speed or accuracy when estimating T1ρ maps.

4. Performance Analysis

Based on the visual and quantitative analysis presented in Section 3, we identified a trade-off in selecting the best approach. Faster fitting times come at the cost of performance, necessitating careful consideration of whether to prioritize speed or accuracy when estimating T1ρ maps. Therefore, we selected an optimized HDNLS variant named HDNLS. For ME and BE, it will use 200 NLS iterations, while for SE, we chose 300 iterations. The idea was to select a solution that is around 30 times faster than NLS with significantly fewer errors on synthetic data. For ME and BE, when NLS iterations were 200, we achieved around a 34-times better fitting speed with comparable performance to NLS. However, with 200 iterations for SE fitting, we observed slightly more errors compared to when NLS iterations were set to 300. Due to this, for SE fitting, HDNLS (NLS iterations = 300) shows a speed improvement approximately 27 times better than NLS, with similar results compared to NLS.

The main objective of this section is to compare the performance of HDNLS against NLS, RNLS, and DL.

Figure 15 and Figure 16 present the estimated ME and SE T1ρ maps for the full MRI joint and knee cartilage, obtained using NLS, RNLS, DL, and HDNLS, respectively. The error was also computed between NLS and DL and between NLS and HDNLS for the full knee joint and knee cartilage ROIs. The analysis shows that the error between NLS and HDNLS is lower compared to that between NLS and DL. This indicates that, compared to DL, the proposed HDNLS method can more effectively serve as an alternative to NLS and RNLS for estimating ME T1ρ, SE T1ρ, and SE beta maps.

The higher error observed in the BE short component is due to its susceptibility to noise caused by low signal intensity and rapid decay dynamics. This fast decay can lead to insufficient signal sampling, negatively affecting accurate fitting. Furthermore, short components exhibit increased sensitivity to minor variations in model parameters, resulting in escalated errors during the fitting process.

Figure 17 and Figure 18 show the predicted BE long T1ρ maps, the fraction of the long BE T1ρ component, and the short T1ρ maps for the full knee joint and knee cartilage ROIs, generated using NLS, RNLS, DL, and HDNLS. The error between NLS and DL and between NLS and HDNLS was also calculated for both the full knee joint and knee cartilage ROIs. The analysis demonstrates that the error between NLS and HDNLS is lower than that between NLS and DL. Thus, it was revealed that, compared to DL, the HDNLS method is a more effective alternative to NLS and RNLS for estimating BE components.

Table 7 presents a quantitative analysis of NLS, RNLS, DL, and HDNLS methods for multi-component T1ρ mapping (ME, SE, and BE) on synthetic data in terms of MNAD, fitting time (in seconds), and NRMSR. It indicates that although NLS outperforms RNLS, DL, and HDNLS, the differences with respect to RNSL and HDNLS are not significant. Additionally, DL achieved better speed compared to the other approaches, but it also achieved the poorest performance. Overall, HDNLS achieved comparable performance to NLS and RNLS and significantly better performance than DL.

Table 8 presents a comparison of MNAD and NRMSE values for DL and HDNLS methods for multi-component T1ρ mapping (ME, SE, and BE) on real data for both the full knee joint and knee cartilage ROIs. The results show that HDNLS significantly outperforms DL by achieving lower MNAD and NRMSE values across all ME, SE, and BE categories. This indicates that HDNLS is a better alternative to NLS than DL for T1ρ mapping.

Table 9 presents a computational speed (in seconds) analysis of NLS, RNLS, DL, and HDNLS methods for multi-component T1ρ mapping (ME, SE, and BE) on real data for both the full knee and knee cartilage ROIs. The results indicate that HDNLS significantly outperforms the traditional methods (NLS and RNLS) in terms of computational speed, although it is slower than DL. Though DL demonstrates substantial speed improvements over NLS, RNLS, and HDNLS, it also results in greater error compared to HDNLS (see Table 8). As stated in Section 3, our objective was to find a suitable trade-off between performance and computational speed. Therefore, while HDNLS takes more time than DL, it also shows significant performance improvements over DL. Thus, HDNLS achieves comparable performance to NLS and RNLS. Compared to NLS, it provides approximately 13- to 14-times better speed for each ME, SE, and BE component for the full knee joint. Additionally, for the knee cartilage ROIs, it offers about 11-, 12-, and 11-times better speed than NLS, respectively.

5. Discussion, Limitations, and Future Directions

5.1. Discussion

The proposed HDNLS method effectively addresses the inherent limitations of traditional NLS [6,7,8,9,10,11,12,13,14,15], NNLS [16,17,18], RNLS [18,19], and AI-based [21,22,23,24,25,26] T1ρ fitting methods by combining their strengths. Since NLS, NNLS, and RNLS approaches are often slow and necessitate careful initialization, HDNLS employs a DL model to provide an initial guess for the NLS fitting process. Additionally, HDNLS eliminates the dependency on reference data, as it was trained on synthetic data. By utilizing DL predictions for NLS initialization, HDNLS achieves superior parameter estimation compared to AI-based models [21,22,23,24,25,26] while avoiding the high computational costs typically associated with conventional methods, such as NLS [6,7,8,9,10,11,12,13,14,15], NNLS [16,17,18], and RNLS [18,19]. Additionally, HDNLS extends its capabilities beyond end-to-end DL models [33,34,35,36,37], which are primarily designed for ME fitting, accommodating various multi-component fitting scenarios such as BE and SE.

As we can see in Table 7, HDNLS significantly outperforms DL by achieving lower MNAD and NRMSE values across all categories for ME, SE, and BE on real data from both the full knee joint and knee cartilage ROIs. Thus, HDNLS is a more robust and superior alternative to DL for accurate T1ρ mapping. However, HDNLS has poorer computational speed than DL due to the use of NLS iterations, as indicated in Table 8. Nonetheless, HDNLS markedly surpasses traditional methods like NLS and RNLS in speed. The rapid speed of DL comes at the cost of accuracy, as reflected in the higher MNAD and NRMSE values compared to HDNLS. Our aim, as outlined in Section 3, was to strike a balance between performance metrics and computational efficiency.

Therefore, we prefer HDNLS over DL, even though it requires more time than DL. HDNLS provides significant performance enhancements, achieving results comparable to those of NLS and RNLS. Specifically, HDNLS achieves approximately 13 to 14 times the speed of NLS across ME, SE, and BE components for full knee joint analysis and offers about 11- to 12-times better speed for knee cartilage ROIs. Therefore, HDNLS stands out as an efficient solution for fast multi-component T1ρ mapping, particularly in knee joint imaging, striking an optimal balance between efficiency and high precision.

5.2. Limitations of HDNLS

Although the proposed HDNLS model demonstrates promising performance in T1ρ mapping, it does have certain limitations, which include the following:

(a): The analysis of the BE short component reveals significant challenges due to its susceptibility to noise and rapid decay dynamics. These factors contribute to increased errors in fitting processes, making accurate estimation of BE short relaxation times particularly difficult. The sensitivity of these components to minor parameter variations further complicates the issue.
(b): Despite its computational efficiency compared to traditional NLS and RNLS methods, the iterative nature of HDNLS makes it more computationally demanding than alternative DL approaches.
(c): Although HDNLS demonstrates comparable performance in T1ρ mapping, its applicability to other quantitative MRI parameters, such as T2 mapping or diffusion tensor imaging (DTI), remains unexplored.
(d): The current implementation of HDNLS does not incorporate self-supervised learning techniques. As a result, it may not fully utilize undersampled multi-contrast MRI data.

5.3. Future Directions

Although HDNLS has demonstrated remarkable performance, several challenges remain that need to be addressed in the future.

(a): Further studies are essential to evaluate the performance of HDNLS across diverse patient populations and imaging protocols. Testing the model with data from various MRI scanners is crucial. This will help identify potential limitations and improve generalizability in clinical settings.
(b): While HDNLS primarily targets T1ρ mapping, future research could adapt its methodology for other quantitative MRI parameters, such as T2 mapping and diffusion tensor imaging (DTI). This broader application could provide valuable insights into tissue characterization and significantly enhance diagnostic capabilities.
(c): Self-supervised learning techniques could minimize reliance on labeled and synthetic data. This would enhance the training efficiency of HDNLS. Additionally, this approach may improve HDNLS’s ability to learn from raw or undersampled multi-contrast MRI data. Consequently, we can develop an end-to-end HDNLS model for parameter fitting.
(d): Optimizing the DL architecture or incorporating advanced techniques like attention mechanisms or transformer networks could enhance HDNLS’s ability to capture complex data patterns. This improvement would reduce the number of NLS iterations required. As a result, the overall process would become faster. Additionally, using advanced regularization techniques could make HDNLS more robust to noise and outliers. This would ensure more accurate parameter estimation in challenging scenarios.
(e): The BE short component is inherently challenging to estimate due to its rapid decay dynamics and low signal intensity, making it more susceptible to noise. We propose the following steps to address this issue in the near future: (i) Implement regularization techniques, such as spatial smoothness constraints and multi-voxel regularization, to stabilize parameter estimation across neighboring voxels. (ii) Integrate advanced noise-aware DL architectures, incorporating uncertainty quantification, into the DL component of HDNLS. These architectures will explicitly account for input data variability, improving robustness and reliability under noisy conditions. (iii) Explore adaptive weighting schemes in the loss function during training for BE fitting to assign appropriate emphasis to the short component. This approach aims to reduce the sensitivity of the short component to noise while maintaining overall model performance.
(f): HDNLS can be extended using a pre-trained model on similar domains, such as quantitative MRI or exponential model fitting tasks. This may accelerate convergence, improve generalization, and potentially expand the range of applicability for voxel-wise multi-component T1ρ fitting.
(g): As part of future studies, we will evaluate the HDNLS framework in clinical workflows to assess its utility and impact on patient diagnosis and treatment planning.

6. Conclusions

This paper introduced HDNLS, a hybrid model that combined DL and NLS for efficient multi-component T1ρ mapping in the knee joint. HDNLS effectively addressed key limitations of NLS, such as sensitivity to initial guesses, poor convergence speed, and high computational costs. By utilizing synthetic data for training of the DL model, we eliminated the need for reference MRI data and enhanced noise robustness through NLS refinements. We conducted a comprehensive analysis of NLS iterations on FastNL and proposed four variants. These variants were Ultrafast-NLS, Superfast-HDNLS, HDNLS, and Relaxed-HDNLS. Among these, HDNLS achieved an optimal balance between accuracy and speed. HDNLS achieved performance comparable to NLS and RNLS with a minimum 13-times increase in fitting speed. Although HDNLS is slower than DL, it significantly outperformed it in terms of MNAD and NRMSE. Therefore, HDNLS serves as a robust and faster alternative to NLS for multi-component T1ρ fitting.

Author Contributions

Conceptualization, methodology, and writing—original draft preparation: D.S.; visualization, supervision, project administration, and funding acquisition: M.V.W.Z. and R.R.R. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by NIH grants R01-AR076328-01A1, R01-AR076985-01A1, and R01-AR078308-01A1 and was performed under the rubric of the Center of Advanced Imaging Innovation and Research (CAI2R), an NIBIB Biomedical Technology Resource Center (NIH P41-EB017183).

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki and approved by the Institutional Review Board (or Ethics Committee) of NYU Langone Health (protocol code: i21-00710; date of approval: 13 March 2022).

Informed Consent Statement

This study was approved by the Institutional Review Board (IRB) of New York University Langone Health and was compliant with the Health Insurance Portability and Accountability Act (HIPAA). All volunteers provided their consent before MRI scanning.

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to privacy and ethical policy.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Vergeldt, F.J.; Prusova, A.; Fereidouni, F.; van Amerongen, H.; Van As, H.; Scheenen, T.W.J.; Bader, A.N. Multi-component quantitative magnetic resonance imaging by phasor representation. Sci. Rep. 2017, 7, 861. [Google Scholar] [CrossRef] [PubMed]
Rosenkrantz, A.B.; Mendiratta-Lala, M.; Bartholmai, B.J.; Ganeshan, D.; Abramson, R.G.; Burton, K.R.; John-Paul, J.Y.; Scalzetti, E.M.; Yankeelov, T.E.; Subramaniam, R.M.; et al. Clinical utility of quantitative imaging. Acad. Radiol. 2015, 22, 33–49. [Google Scholar] [CrossRef] [PubMed]
Keenan, K.E.; Biller, J.R.; Delfino, J.G.; Boss, M.A.; Does, M.D.; Evelhoch, J.L.; Griswold, M.A.; Gunter, J.L.; Hinks, R.S.; Hoffman, S.W.; et al. Recommendations towards standards for quantitative MRI (qMRI) and outstanding needs. J. Magn. Reson. Imaging JMRI 2019, 49, e26. [Google Scholar] [CrossRef] [PubMed]
Richter, R.H.; Byerly, D.; Schultz, D.; Mansfield, L.T. Challenges in the interpretation of MRI examinations without radiographic correlation: Pearls and pitfalls to avoid. Cureus 2021, 13, e16419. [Google Scholar] [CrossRef]
Cercignani, M.; Dowell, N.G.; Tofts, P.S. Quantitative MRI of the Brain: Principles of Physical Measurement; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
Brown, R.W.; Cheng, Y.-C.N.; Haacke, E.M.; Thompson, M.R.; Venkatesan, R. Magnetic Resonance Imaging: Physical Principles and Sequence Design; John Wiley & Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
Kuperman, V. Magnetic Resonance Imaging: Physical Principles and Applications; Elsevier: Amsterdam, The Netherlands, 2000. [Google Scholar]
Jelescu, I.O.; Veraart, J.; Fieremans, E.; Novikov, D.S. Degeneracy in model parameter estimation for multi-compartmental diffusion in neuronal tissue. NMR Biomed. 2016, 29, 33–47. [Google Scholar] [CrossRef]
Madsen, K.; Nielsen, H.B.; Tingleff, O. Methods for Non-Linear Least Squares Problems; Technical University of Denmark: Kongens Lyngby, Denmark, 2004. [Google Scholar]
Dong, G.; Flaschel, M.; Hintermüller, M.; Papafitsoros, K.; Sirotenko, C.; Tabelow, K. Data-driven methods for quantitative imaging. arXiv 2024, arXiv:2404.07886. [Google Scholar] [CrossRef]
Levenberg, K. A method for the solution of certain non-linear problems in least squares. Q. Appl. Math. 1944, 2, 164–168. [Google Scholar] [CrossRef]
Gavin, H.P. The Levenberg-Marquardt Algorithm for Nonlinear Least Squares Curve-Fitting Problems; Department of Civil and Environmental Engineering Duke University: Durham, NC, USA, 3 August 2019. [Google Scholar]
Shterenlikht, A.; Alexander, N.A. Levenberg–Marquardt vs Powell’s dogleg method for Gurson–Tvergaard–Needleman plasticity model. Comput. Methods Appl. Mech. Eng. 2012, 237, 1–9. [Google Scholar] [CrossRef]
Marquardt, D.W. An algorithm for least-squares estimation of nonlinear parameters. J. Soc. Ind. Appl. Math. 1963, 11, 431–441. [Google Scholar] [CrossRef]
Nelder, J.A.; Mead, R. A simplex method for function minimization. Comput. J. 1965, 7, 308–313. [Google Scholar] [CrossRef]
Hwang, D.; Du, Y.P. Improved myelin water quantification using spatially regularized non-negative least squares algorithm. J. Magn. Reson. Imaging 2009, 30, 203–208. [Google Scholar] [CrossRef] [PubMed]
Fenrich, F.R.E.; Beaulieu, C.; Allen, P.S. Relaxation times and microstructures. NMR Biomed. 2001, 14, 133–139. [Google Scholar] [CrossRef] [PubMed]
Zibetti, M.V.W.; Helou, E.S.; Sharafi, A.; Regatte, R.R. Fast multicomponent 3D-T1ρ relaxometry. NMR Biomed. 2020, 33, e4318. [Google Scholar] [CrossRef] [PubMed]
Eriksson, J. Optimization and Regularization of Nonlinear Least Squares Problems; Verlag nicht ermittelbar: Jerusalem, Israel, 1996. [Google Scholar]
Singh, D.; Regatte, R.R.; Zibetti, M.V.W. Self-Supervised Deep-Learning Networks for Mono and Bi-exponential T1ρ Fitting in the Knee Joint. In Proceedings of the ISMRM, Singapore, 4–9 May 2024; 2024. Available online: https://submissions.mirasmart.com/ISMRM2024/Itinerary/?Refresh=1&ses=D-162 (accessed on 3 October 2024).
Nedjati-Gilani, G.L.; Schneider, T.; Hall, M.G.; Cawley, N.; Hill, I.; Ciccarelli, O.; Drobnjak, I.; Gandini Wheeler-Kingshott, C.A.M.; Alexander, D.C. Machine learning based compartment models with permeability for white matter microstructure imaging. NeuroImage 2017, 150, 119–135. [Google Scholar] [CrossRef]
Barbieri, S.; Gurney-Champion, O.J.; Klaassen, R.; Thoeny, H.C. Deep learning how to fit an intravoxel incoherent motion model to diffusion-weighted MRI. Magn. Reson. Med. 2020, 83, 312–321. [Google Scholar] [CrossRef]
Sjölund, J.; Eklund, A.; Özarslan, E.; Herberthson, M.; Bänkestad, M.; Knutsson, H. Bayesian uncertainty quantification in linear models for diffusion MRI. NeuroImage 2018, 175, 272–285. [Google Scholar] [CrossRef]
Liu, H.; Li, D.K.B. Myelin water imaging data analysis in less than one minute. NeuroImage 2020, 210, 116551. [Google Scholar] [CrossRef]
Rafati, J.; Marcia, R.F. Improving L-BFGS initialization for trust-region methods in deep learning. In Proceedings of the 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 17–20 December 2018; IEEE: New York, NY, USA, 2018; pp. 501–508. [Google Scholar]
Bishop, C.M.; Roach, C.M. Fast curve fitting using neural networks. Rev. Sci. Instrum. 1992, 63, 4450–4456. [Google Scholar] [CrossRef]
Gözcü, B.; Mahabadi, R.K.; Li, Y.-H.; Ilıcak, E.; Cukur, T.; Scarlett, J.; Cevher, V. Learning-based compressive MRI. IEEE Trans. Med. Imaging 2018, 37, 1394–1406. [Google Scholar] [CrossRef]
Jung, S.; Lee, H.; Ryu, K.; Song, J.E.; Park, M.; Moon, W.-J.; Kim, D.-H. Artificial neural network for multi-echo gradient echo–based myelin water fraction estimation. Magn. Reson. Med. 2021, 85, 380–389. [Google Scholar] [CrossRef]
Zibetti, M.V.W.; Herman, G.T.; Regatte, R.R. Fast data-driven learning of parallel MRI sampling patterns for large scale problems. Sci. Rep. 2021, 11, 19312. [Google Scholar] [CrossRef] [PubMed]
Bertleff, M.; Domsch, S.; Weingärtner, S.; Zapp, J.; O’Brien, K.; Barth, M.; Schad, L.R. Diffusion parameter mapping with the combined intravoxel incoherent motion and kurtosis model using artificial neural networks at 3 T. NMR Biomed. 2017, 30, e3833. [Google Scholar] [CrossRef] [PubMed]
Domsch, S.; Mürle, B.; Weingärtner, S.; Zapp, J.; Wenz, F.; Schad, L.R. Oxygen extraction fraction mapping at 3 Tesla using an artificial neural network: A feasibility study. Magn. Reson. Med. 2018, 79, 890–899. [Google Scholar] [CrossRef] [PubMed]
Balasubramanyam, C.; Ajay, M.S.; Spandana, K.R.; Shetty, A.B.; Seetharamu, K.N. Curve fitting for coarse data using artificial neural network. WSEAS Trans. Math. 2014, 13, 406–415. [Google Scholar]
Liu, F.; Feng, L.; Kijowski, R. MANTIS: Model-augmented neural network with incoherent k-space sampling for efficient MR parameter mapping. Magn. Reson. Med. 2019, 82, 174–188. [Google Scholar] [CrossRef]
Fu, Z.; Mandava, S.; Keerthivasan, M.B.; Li, Z.; Johnson, K.; Martin, D.R.; Altbach, M.I.; Bilgin, A. A multi-scale residual network for accelerated radial MR parameter mapping. Magn. Reson. Imaging 2020, 73, 152–162. [Google Scholar] [CrossRef]
Li, H.; Yang, M.; Kim, J.H.; Zhang, C.; Liu, R.; Huang, P.; Liang, D.; Zhang, X.; Li, X.; Ying, L. SuperMAP: Deep ultra-fast MR relaxometry with joint spatiotemporal undersampling. Magn. Reson. Med. 2023, 89, 64–76. [Google Scholar] [CrossRef]
Liu, F.; Samsonov, A.; Chen, L.; Kijowski, R.; Feng, L. SANTIS: Sampling-augmented neural network with incoherent structure for MR image reconstruction. Magn. Reson. Med. 2019, 82, 1890–1904. [Google Scholar] [CrossRef]
Liu, F.; Kijowski, R.; El Fakhri, G.; Feng, L. Magnetic resonance parameter mapping using model-guided self-supervised deep learning. Magn. Reson. Med. 2021, 85, 3211–3226. [Google Scholar] [CrossRef]
Liu, F.; Kijowski, R.; Feng, L.; El Fakhri, G. High-performance rapid MR parameter mapping using model-based deep adversarial learning. Magn. Reson. Imaging 2020, 74, 152–160. [Google Scholar] [CrossRef]
Feng, R.; Zhao, J.; Wang, H.; Yang, B.; Feng, J.; Shi, Y.; Zhang, M.; Liu, C.; Zhang, Y.; Zhuang, J.; et al. MoDL-QSM: Model-based deep learning for quantitative susceptibility mapping. NeuroImage 2021, 240, 118376. [Google Scholar] [CrossRef] [PubMed]
Jun, Y.; Shin, H.; Eo, T.; Kim, T.; Hwang, D. Deep model-based magnetic resonance parameter mapping network (DOPAMINE) for fast T1 mapping using variable flip angle method. Med. Image Anal. 2021, 70, 102017. [Google Scholar] [CrossRef] [PubMed]
Bian, W.; Jang, A.; Liu, F. Improving quantitative MRI using self-supervised deep learning with model reinforcement: Demonstration for rapid T1 mapping. Magn. Reson. Med. 2024, 92, 98–111. [Google Scholar] [CrossRef] [PubMed]
Gabrielle Blumenkrantz, B.; Lozano, B.J.; Julio, C.-G.; Ries, M.; Majumdar, S. In vivo T1Rho and T2 mapping of articular cartilage in osteoarthritis of the knee using 3 tesla MRI. Osteoarthr. Cartil. 2007, 15, 789–797. [Google Scholar]
Milford, D.; Rosbach, N.; Bendszus, M.; Heiland, S. Mono-exponential fitting in T2-relaxometry: Relevance of offset and first echo. PLoS ONE 2015, 10, e0145255. [Google Scholar] [CrossRef]
Souza, R.B.; Feeley, B.T.; Zarins, Z.A.; Link, T.M.; Li, X.; Majumdar, S. T1rho MRI relaxation in knee OA subjects with varying sizes of cartilage lesions. Knee 2013, 20, 113–119. [Google Scholar] [CrossRef]
Shao, H.; Chang, E.; Pauli, C.; Zanganeh, S.; Bae, W.; Chung, C.; Tang, G.; Du, J. UTE bi-component analysis of T2* relaxation in articular cartilage. Osteoarthr. Cartil. 2016, 24, 364–373. [Google Scholar] [CrossRef]
Sharafi, A.; Xia, D.; Chang, G.; Regatte, R.R. Biexponential T1ρ relaxation mapping of human knee cartilage in vivo at 3 T. NMR Biomed. 2017, 30, e3760. [Google Scholar] [CrossRef]
Bakker, C.J.G.; Vriend, J. Multi-exponential water proton spin-lattice relaxation in biological tissues and its implications for quantitative NMR imaging. Phys. Med. Biol. 1984, 29, 509–518. [Google Scholar] [CrossRef]
Reiter, D.A.; Magin, R.L.; Li, W.; Trujillo, J.J.; Velasco, M.P.; Spencer, R.G. Anomalous T2 relaxation in normal and degraded cartilage. Magn. Reson. Med. 2016, 76, 953–962. [Google Scholar] [CrossRef]
Wilson, R.; Bowen, L.; Kim, W.; Reiter, D.; Neu, C. Stretched-Exponential Modeling of Anomalous T1ρ and T2 Relaxation in the Intervertebral Disc In Vivo. bioRxiv 2020, preprint. [Google Scholar] [CrossRef]
Johnston, D.C. Stretched exponential relaxation arising from a continuous sum of exponential decays. Phys. Rev. B 2006, 74, 184430. [Google Scholar] [CrossRef]
Steihaug, T. The conjugate gradient method and trust regions in large scale optimization. SIAM J. Numer. Anal. 1983, 20, 626–637. [Google Scholar] [CrossRef]
Li, X.; Han, E.T.; Busse, R.F.; Majumdar, S. In vivo T1ρ mapping in cartilage using 3D magnetization-prepared angle-modulated partitioned k-space spoiled gradient echo snapshots (3D MAPSS). Magn. Reson. Med. 2008, 59, 298–307. [Google Scholar] [CrossRef]

Figure 1. Computational time for NLS fitting across different MRI slice counts on HPC (theoretical) and NVIDIA GPU systems.

Figure 2. Representative visualization of multi-component T1ρ relaxation models in the knee joint.

Figure 3. Voxel-by-voxel framework of the DL model for fitting multi-component T1ρ relaxation models with a customized loss function.

Figure 4. Sensitivity analysis of the DL model: (a) optimizer selection, (b) epochs, (c) batch size, and (d) number of layers, on RMSE.

Figure 5. Sensitivity analysis of the combination of activation functions (ReLU, Leaky ReLU, GELU, Tanh, and Swish) with optimizers (Adam, SGD, and RMSProp) for the DL model: (a) ME component, (b) SE component, and (c) BE component.

Figure 6. Sensitivity analysis of the loss function with varying

γ_{s}

and

γ_{θ}

on validation data. (a) Impact of

γ_{θ}

on RMSE with

γ_{s}

fixed at 1, demonstrating an increase in RMSE as

γ_{θ}

rises from 1 to 10. (b) Influence of

γ_{s}

on RMSE with

γ_{θ}

fixed at 1, showing decreasing RMSE values as

γ_{s}

increases from 1 to 15. However, RMSE starts to increase beyond

γ_{s}

= 10.

Figure 6. Sensitivity analysis of the loss function with varying

γ_{s}

and

γ_{θ}

on validation data. (a) Impact of

γ_{θ}

on RMSE with

γ_{s}

fixed at 1, demonstrating an increase in RMSE as

γ_{θ}

rises from 1 to 10. (b) Influence of

γ_{s}

on RMSE with

γ_{θ}

fixed at 1, showing decreasing RMSE values as

γ_{s}

increases from 1 to 15. However, RMSE starts to increase beyond

γ_{s}

= 10.

Figure 7. Training and validation loss analysis: (a) for 10 epochs and (b) for 100 epochs.

Figure 8. Proposed HDNLS-based fitting of multi-component T1ρ relaxation models in the knee joint.

Figure 9. Comparison of NLS and HDNLS across mono-exponential, bi-exponential, and stretched-exponential fitting models on synthetic data. Row 1 illustrates the trade-off between fitting time and MNAD for NLS. Row 2 shows the trade-off between fitting time and MNAD for HDNLS. Row 3 shows the convergence of NLS and HDNLS in terms of MNAD.

Figure 10. Performance metrics comparison of NLS and HDNLS in terms of fitting time and normalized root-mean-squared residuals (NRMSRs) across ME, SE, and BE fitting models. Row 1 shows the trade-off between fitting time and NRMSR for NLS. Row 2 shows the trade-off between fitting time and NRMSR for HDNLS. Row 3 presents the convergence of both NLS and HDNLS with respect to log (NRMSR).

Figure 11. Estimated HDNLS variant-based ME and SE-T1ρ maps for the full knee joint.

Figure 12. Estimated HDNLS variant-based ME and SE T1ρ maps for knee cartilage ROIs only.

Figure 13. Estimated HDNLS variant-based BE long T1ρ, fraction of the long T1ρ, and short T1ρ maps for the full knee joint.

Figure 14. Estimated HDNLS variant-based BE long T1ρ, fraction of the long T1ρ, and short T1ρ maps for knee cartilage ROIs only.

Figure 15. Estimated NLS-, RNLS-, DL-, and HDNLS-based ME and SE T1ρ maps for the full knee joint.

Figure 16. Estimated NLS-, RNLS-, DL-, and HDNLS-based ME and SE T1ρ maps for the knee cartilage ROIs only.

Figure 17. Estimated NLS-, RNLS-, DL-, and HDNLS-based BE long T1ρ component, fraction of the long T1ρ component, and short T1ρ maps for the full knee joint.

Figure 18. Estimated NLS, RNLS, DL, and HDNLS-based BE long T1ρ component, fraction of the long T1ρ component, and short T1ρ maps for knee cartilage ROIs only.

Table 1. Comparative analysis of the existing AI-based exponential fitting methods.

Ref.	Year	Method	Features	Open Challenges
[21]	2017	Monte Carlo simulations and a random forest regressor	- Established a robust mapping from diffusion-weighted MR signal features to the underlying microstructure parameters of brain tissue - Estimated water residence time in brain white matter	- Did not estimate multi-component T1ρ map - Did not provide uncertainty estimates for predictions - Used a very basic representation of white matter
[22]	2019	Deep neural network (DNN)	- Employed non-supervised DNNs to fit the Intravoxel Incoherent Motion (IVIM) - Does not rely on priors - Parameter maps were smooth and consistent within homogeneous tissues	- DNN may require retraining for different acquisition protocols - Training and testing on same data - Did not improve Bayesian convergence in low SNR regions
[24]	2020	DNN	- Used a DNN to directly estimate myelin water fraction (MWF) - Can produce whole-brain MWF maps in approximately 33 s - Better voxel-wise agreement was observed between DNN-based MWF estimates and NNLS-based ground truths	- DNN relied on NNLS-generated ground truths, so it inherited the NNLS method’s vulnerability to noisy data - DNNs are sensitive to noise and artifacts in the input data
[33]	2019	MANTIS	- Integrated a model-augmented neural network with incoherent k-space sampling for efficient MR parameter mapping - Combined physics-based modeling with DL for accelerated image reconstruction - Reduced noise and artifacts in reconstructed parameter maps	- Generalization to diverse datasets may need further validation due to dependency on sampling patterns - Performance depends on the quality of its training data, which could lead to overfitting and limited generalizability
[38]	2020	Model-based deep adversarial learning	- Employed a generative adversarial network (GAN) for enhanced reconstruction quality - Achieved rapid parameter mapping with high fidelity and robustness - Effectively removed noise and artifacts	- Adversarial training is computationally demanding and sensitive to hyperparameter tuning - GANs may generate outputs that appear realistic but deviate from the true underlying signal
[39]	2021	MoDL-QSM	- Combined model-based reconstruction with DL for quantitative susceptibility mapping (QSM) - Used iterative model unrolling with DL modules to improve QSM accuracy - Produced high-quality susceptibility maps with reduced streaking artifacts	- Model unrolling introduces computational overhead - Requires careful hyperparameter optimization to ensure convergence
[40]	2021	DOPAMINE	- Integrated model-based and deep learning techniques for fast T1 mapping - Used the variable flip angle (VFA) method for efficient data acquisition - Reduced dependency on large training datasets by incorporating physics-based modeling - Handled noise and artifacts effectively	- Dependent on specific acquisition protocols (VFA), limiting flexibility for other methods - Computationally intensive due to the integration of model-based techniques
[37]	2021	RELAX	- Utilized two physical models, such as the MR imaging model and the quantitative imaging model, in network training to fit the parameters - Eliminated the need for fully sampled reference datasets - Directly reconstructs MR parameter maps from undersampled k-space data	- Tested only on simulated and coil-combined data - Only a residual U-Net was used - RELAX’s performance relies on the precision of its physical models, limiting its robustness to inaccuracies
[35]	2023	SuperMAP	- Achieved mapping with acceleration factors up to R = 32 - Generated relaxation maps from a single scan using optimized data acquisition - Directly converted undersampled parameter-weighted images into quantitative maps - Performed well on both retrospectively undersampled and prospective data, demonstrating adaptability	- Performance relies on the quality and diversity of retrospectively undersampled training data - Heavily reliant on the quality of the undersampling patterns; suboptimal patterns may degrade performance
[41]	2024	RELAX-MORE	- Used self-supervised learning, eliminating the need for large, labeled datasets - Unrolled an iterative model-based qMRI reconstruction into a DL framework - Produced accurate parameter maps that correct image artifacts, remove noise, and recover features even in regions with imperfect imaging conditions	- The unrolling of iterative processes and subject-specific training could still be computationally intensive - Subject-specific training might require more careful convergence monitoring, as the self-supervised framework relies on single-subject data

Table 2. Multi-component relaxation models: mono-exponential (ME), bi-exponential (BE), and stretched-exponential (SE) models.

Model	Equation	$Parameters θ$	Description	Range
ME [20,42,43,44]	$f (t, θ) = A \exp (- \frac{t}{T_{M E}})$	$\{A, T_{M E}\}$	Assumes tissue is homogeneous with a single relaxation time. $A$ is the complex-valued amplitude and $T_{M E}$ is the real-valued relaxation time.	$T_{M E} > 0 m s$ (for knee cartilage ≥ 5 ms)
BE [45,46,47]	$f (t, θ) = A ((1 - \emptyset_{f}) \exp (- \frac{t}{T_{s}}) + \emptyset_{f} \exp (- \frac{t}{T_{l}}))$	$\{A, \emptyset_{f}, T_{s}, T_{l}\}$	Accounts for two compartments with distinct relaxation times, $T_{s}$ (short) and $T_{l}$ (long). $\emptyset_{f}$ is the fraction of the long compartment.	$0 {\leq \emptyset}_{f} \leq 1$ $T_{s} < 5 m s$ $T_{l} \geq 5 m s$
SE [48,49,50]	$f (t, θ) = A \exp (- {(\frac{t}{T_{S E}})}^{β})$	$\{A, T_{S E}, β\}$	Models heterogeneous environments with distributed relaxation rates. $T_{S E}$ is the relaxation time, and $β$ is the stretching exponent.	$0 < β \leq 1$ $T_{S E} > 0 m s$

Table 3. Features, advantages, and disadvantages of different relaxation models (ME, BE, and SE).

Model	Features	Advantages	Disadvantages
ME	- Fundamental model for single-component relaxation	- Simple and easy to implement - Suitable for homogeneous tissues with a single relaxation component	- Limited in handling tissues with multiple relaxation components
BE	- Extends the ME model by introducing two compartments (short and long) - Represents two-component decay systems - Accounts for two non-exchanging compartments in tissues, each with distinct short and long relaxation times	- Better representation of tissues with two relaxation components, e.g., free water and water bonded to macromolecules	- Requires more parameters and is computationally expensive - May face convergence issues during fitting - The short component is usually sensitive to noise
SE	- Reduces to ME when β = 1 - Can be used as an alternative to BE for describing tissue heterogeneity - Can approximate a broad distribution of relaxation rates with fewer parameters, simplifying complex tissue modeling	- Suitable for heterogeneous tissues - More stable and requires fewer parameters than BE - Can describe multi-component relaxation	- Less straightforward physical interpretation compared to BE

Table 4. Mathematical formulations of performance metrics used for comparative analysis.

Metric	Description	Equation	Symbols
MNAD	Measures the median of the normalized absolute difference between estimated and reference T1ρ parameters across voxels.	$M N A D (Φ) = \underset{i \in I, p \in P}{median} (\frac{\|{\bar{θ}}_{i, p} - θ_{i, p}\|}{\|{\bar{θ}}_{i, p}\|})$	$θ_{i, p}$ and ${\bar{θ}}_{i, p}$ are the estimated and reference parameters p of the voxel indexed by i that belongs to the region of interest or set of voxels I, and $Φ = {\{θ_{i}\}}_{i \in I}$ .
NRMSR	Assesses the size of the residual between MR measurements and predicted MR signals.	$N R M S R (F) = \sqrt{\underset{i \in I}{m e a n} \frac{{‖s_{i} - {\hat{f}}_{i}‖}_{2}}{{‖s_{i}‖}_{2}}}$	$s_{i}$ is the measured MR signal at the voxel indexed by i, and ${\hat{f}}_{i}$ is the predicted MR signal at the same voxel with $θ_{i}$ $, and F = {\{{\hat{f}}_{i}\}}_{i \in I}$ .
NRMSE	Assesses the normalized value of the root-mean-squared error (RMSE) across the reference parameters.	$\begin{matrix} N R M S E (Φ) = \\ \sqrt{\frac{\underset{i \in I, p \in P}{m e a n} {\|{\bar{θ}}_{i, p} - θ_{i, p}\|}^{2}}{\underset{i \in I, p \in P}{m e a n} {\|{\bar{θ}}_{i, p}\|}^{2}}} # \end{matrix}$	$θ_{i, p}$ and ${\bar{θ}}_{i, p}$ are the estimated and reference parameters p of the voxel indexed by i that belongs to the region of interest or set of voxels I, and $Φ = {\{θ_{i}\}}_{i \in I}$ .
Fitting Time	Represents the time required by models (e.g., NLS, RNLS, HDNLS, and DL) to compute the T1ρ map for one MRI slice, measured on the same hardware.	$F i t t i n g T i m e = t o c - t i c$	$t i c :$ Start time of the fitting process. $t o c :$ End time of the fitting process.

Table 5. Quantitative analysis of HDNLS variants for ME, SE, and BE T1ρ maps on real data for full knee joints. Bold quantitative values indicate the outperforming parameter for the specific model.

Method	Metric	Ultra-HDNLS	Super-HDNLS	HDNLS	Relaxed-HDNLS
ME	MNAD (%)	4.68	4.22	4.03	3.82
	Fitting time (s)	0.99	1.76	5.13	10.96
	NRMSR	0.03	0.03	0.02	0.02
SE	MNAD (%)	13.49	8.92	7.18	6.75
	Fitting time (s)	1.12	1.85	3.42	7.12
	NRMSR	1.09	0.70	0.18	0.15
BE	MNAD (%)	21.38	20.26	19.87	19.24
	Fitting time (s)	1.22	2.14	3.96	7.73
	NRMSR	0.03	0.03	0.02	0.02

Table 6. Quantitative analysis of HDNLS variants for ME, SE, and BE T1ρ maps on real data for knee cartilage ROIs only. Bold quantitative values indicate the outperforming parameter for the specific model.

Method	Metric	Ultra-HDNLS	Super-HDNLS	HDNLS	Relaxed-HDNLS
ME	MNAD (%)	4.36	4.16	3.86	3.74
	Fitting time (s)	0.52	0.95	2.38	5.73
	NRMSR	0.03	0.02	0.02	0.02
SE	MNAD (%)	13.24	8.73	7.12	6.67
	Fitting time (s)	0.51	0.87	2.28	3.71
	NRMSR	1.09	0.70	0.18	0.14
BE	MNAD (%)	20.34	19.77	19.28	18.94
	Fitting time (s)	0.46	0.87	1.85	3.26
	NRMSR	0.03	0.02	0.02	0.02

Table 7. Quantitative analysis of NLS, RNLS, DL, and HDNLS for ME, SE, and BE T1ρ maps on synthetic data. Bold quantitative values indicate the outperforming parameter for the specific model.

Method	Metric	NLS	RNLS	DL	HDNLS
ME	MNAD (%)	3.65	3.69	4.29	3.70
	Fitting time (s)	37.48	32.83	0.79	4.94
	NRMSR	0.02	0.02	0.03	0.02
SE	MNAD (%)	6.18	6.27	7.61	6.32
	Fitting time (s)	41.9	37.75	0.61	4.17
	NRMSR	0.15	0.15	0.18	0.16
BE	MNAD (%)	16.19	16.36	18.37	16.47
	Fitting time (s)	11.73	11.16	0.52	2.48
	NRMSR	0.02	0.02	0.02	0.02

Table 8. Quantitative analysis of NLS, RNLS, DL, and HDNLS for ME, SE, and BE T1ρ maps on real data for both full knee joint and knee cartilage ROIs only. Bold quantitative values indicate the outperforming parameter for the specific model.

Method	Metric	Full Knee		Knee Cartilage
Method	Metric	DL	HDNLS	DL	HDNLS
ME	MNAD (%)	22.36	18.93	19.32	17.20
ME	NRMSE (%)	24.05	21.25	21.20	19.31
SE	MNAD (%)	26.38	23.94	23.38	21.64
SE	NRMSE (%)	25.81	22.94	22.46	19.85
BE	MNAD (%)	17.70	13.02	14.09	12.24
BE	NRMSE (%)	20.16	16.36	18.37	14.85

Table 9. Computational speed analysis of NLS, RNLS, DL, and HDNLS for ME, SE, and BE T1ρ maps on real data for both full knee joint and knee cartilage only. Bold quantitative values indicate the outperforming parameter for the specific model.

Method	Metric	NLS	RNLS	DL	HDNLS
ME	Full Knee	274.68	234.22	1.35	19.32
ME	Knee Cartilage	76.37	65.82	0.92	6.78
SE	Full Knee	293.49	262.92	1.78	22.50
SE	Knee Cartilage	91.12	88.39	1.67	7.51
BE	Full Knee	257.38	229.13	1.21	18.14
BE	Knee Cartilage	70.26	67.14	0.84	6.13

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Singh, D.; Regatte, R.R.; Zibetti, M.V.W. HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint. Bioengineering 2025, 12, 8. https://doi.org/10.3390/bioengineering12010008

AMA Style

Singh D, Regatte RR, Zibetti MVW. HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint. Bioengineering. 2025; 12(1):8. https://doi.org/10.3390/bioengineering12010008

Chicago/Turabian Style

Singh, Dilbag, Ravinder R. Regatte, and Marcelo V. W. Zibetti. 2025. "HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint" Bioengineering 12, no. 1: 8. https://doi.org/10.3390/bioengineering12010008

APA Style

Singh, D., Regatte, R. R., & Zibetti, M. V. W. (2025). HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint. Bioengineering, 12(1), 8. https://doi.org/10.3390/bioengineering12010008

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

HDNLS: Hybrid Deep-Learning and Non-Linear Least Squares-Based Method for Fast Multi-Component T1ρ Mapping in the Knee Joint

Abstract

1. Introduction

1.1. Research Motivation

1.2. AI-Based Exponential Fitting

1.3. Proposed Solution

1.4. Contributions

2. Proposed Methodology

2.1. Multi-Component Relaxation Models

2.2. Deep Learning-Based Multi-Component Relaxation Models

2.3. Sensitivity Analysis of DL Model

2.4. Training and Validation Analysis

2.5. HDNLS

2.6. T1ρ-MRI Data Acquisition and Fitting Methods

2.7. Performance Metrics

3. Optimal Selection of NLS Iterations for HDNLS

4. Performance Analysis

5. Discussion, Limitations, and Future Directions

5.1. Discussion

5.2. Limitations of HDNLS

5.3. Future Directions

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI