Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension

Qu, Xiaobo; Guo, Di; Cao, Xue; Cai, Shuhui; Chen, Zhong

doi:10.3390/s110908888

Open AccessArticle

Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension^†

¹

Department of Communication Engineering, Fujian Key Laboratory of Plasma and Magnetic Resonance, Xiamen University, Xiamen 361005, China

²

Department of Electronic Science, Fujian Key Laboratory of Plasma and Magnetic Resonance, Xiamen 361005, China

³

School of Software, Shanghai Jiao Tong University, Shanghai 200240, China

^*

Author to whom correspondence should be addressed.

^†

A one page abstract of this work was presented at the 18th Scientific Meeting of the International Society for Magnetic Resonance in Medicine, Stockholm, Sweden, 1–7 May 2010; p. 3371.

Sensors 2011, 11(9), 8888-8909; https://doi.org/10.3390/s110908888

Submission received: 31 July 2011 / Revised: 31 August 2011 / Accepted: 5 September 2011 / Published: 15 September 2011

(This article belongs to the Section Chemical Sensors)

Download

Browse Figures

Versions Notes

Abstract

: Reducing the acquisition time for two-dimensional nuclear magnetic resonance (2D NMR) spectra is important. One way to achieve this goal is reducing the acquired data. In this paper, within the framework of compressed sensing, we proposed to undersample the data in the indirect dimension for a type of self-sparse 2D NMR spectra, that is, only a few meaningful spectral peaks occupy partial locations, while the rest of locations have very small or even no peaks. The spectrum is reconstructed by enforcing its sparsity in an identity matrix domain with ℓ_p (p = 0.5) norm optimization algorithm. Both theoretical analysis and simulation results show that the proposed method can reduce the reconstruction errors compared with the wavelet-based ℓ₁ norm optimization.

Keywords:

NMR; spectral reconstruction; sparsity; undersampling; compressed sensing

Graphical Abstract

1. Introduction

Nuclear magnetic resonance (NMR) spectroscopy is widely utilized to analyze the structures of chemicals and proteins. Multidimensional NMR spectra can provide more information than one-dimensional (1D) NMR spectra. The acquisition time for a conventional two-dimensional (2D) NMR spectrum is mostly determined by the number of t₁ increments in the indirect dimension. One possible way is to reduce the acquisition time is to reduce the number of t₁ increments. However, this will result in aliasing of the spectrum in the indirect dimension [1,2], because the sampling rate is lower than the requirement of the Nyquist sampling rule.

Researchers have been seeking ways to suppress the aliasing from the aspects of sampling and reconstruction. Radial sampling presents relatively small leakage artifacts [3] and Poisson disk sampling is observed to provide a large low-artifact area in the signal vicinity [4]. The maximum sampling time for multi-dimensional NMR experiments was analyzed by Vosegaard and co-workers [5]. Besides the sampling patterns, some reconstruction algorithms have been employed to improve spectral quality, including maximum entropy [6,7], iterative CLEAN algorithm [8] and Bayesian reconstruction [9]. The sparse sampling was incorporated with intermolecular multiple-quantum coherences for high-resolution 2D NMR spectra in inhomogeneous fields [10].

Recently compressed sensing (CS) theory [11,12], for reconstructing signals from fewer numbers of measurements than the number that the Nyquist sampling rule requires has attracted lots of attention in medical imaging [13], single pixel imaging [14], and computer vision [15], etc. Under the assumption that the acquired data is sparse or compressible in a certain sparsifying transform domain, CS can successfully recover the original signal from a small number of linear projections with little or no loss of information. The choice of sparsifying transform is important in the CS. The sparsfying transform should be maximally incoherent with the measurement operator. Intuitively, the target signal should be sparsely represented in the transform domain, e.g., wavelet transform domain, and this spare representation should be spread out in the encoding scheme. Iddo introduced CS to reconstruct a 2D NMR spectrum from partial random measurements of its time domain signal under the assumption that the spectrum is sparse in the wavelet domain [16].

In this paper, we focus on the reconstruction of self-sparse NMR spectra, that is, a few meaningful spectral peaks occupy partial locations while the rest locations have very small or even no meaningful peaks. NMR spectra includes regions where no signals arise because of the discrete nature of chemical groups [17]. The reason we pay attention to self-sparse NMR spectra is that many NMR spectra of chemical substances fall in this type [3,10,16,17]. Based on the concept of sparsity and coherence in CS, we demonstrate that a wavelet transform is not necessary to sparsify the self-sparse NMR spectra or even worsens the reconstruction. We propose to reconstruct the NMR spectrum by enforcing its sparsity in an identity matrix domain with a ℓ_p (p = 0.5) norm optimization algorithm. Simulation results show that the proposed method can reduce the reconstruction errors compared with the wavelet-based ℓ₁ norm optimization.

Recently, Kazimierczuk and Orekhov [18] and Holland et al. [19] independently proposed to use CS in proton NMR and showed promising results in reducing acquired data. A combination of spatially encoding the indirect domain information and CS was proposed by Shrot and Frydman [20]. The spectra were considered to be sparse themselves [18–20], differing from the sparse representation using wavelets [16]. However, no comparison on the reconstructed spectra with and without wavelet transform was given and no theoretical analysis was presented. In this paper, we will analyze the performance of wavelet transform in the CS-NMR basing on the sparsity and coherence properties and simulated results.

The remainder of this paper is organized as follows. In Section 2, the reason to undersample the indirect dimension is given by calculating the acquisition time for a 2D NMR spectrum. In Section 3, the two key factors of CS, sparsity and coherence, are briefly summarized and their values are estimated for 2D spectra, followed by the proposed reconstruction method. In Section 4, reconstruction of self-sparse NMR spectra is simulated to show the shortcomings of the wavelet and the advantage of the identity matrix. The improvement of utilizing the ℓ_p norm is also demonstrated. Finally, discussions and conclusions are given in Section 5.

2. Undersampling in the Indirect Dimension of 2D NMR

In NMR spectroscopy, a typical sampled noiseless time domain signal can be described as a sum of exponentially decaying sinusoids:

y_{k} = \sum_{j = 1}^{J} (A_{j} e^{i ϕ_{j}}) e^{- \frac{k Δ t}{τ_{j}}} e^{2 π ik Δ t ω_{j}}

(1)

where J is the number of sinusoids, A_j, ∅_j, τ_j and ω_j are the amplitude, phase in radians, decay time and frequency, respectively, of the jth sinusoid [21]. Δt is the sampling interval and k (k = 0, 1, …, K) is an integer to denote the kth sample point. Such a signal will give rise to a spectrum that is the sum of Lorentizian peaks centered at different frequencies ω_j [21], where j corresponds to jth type of nuclear spins. A conventional 1D single pulse NMR experiment enforces an excitation pulse on a sample followed immediately by data acquisition. The signal eventually decays due to relaxation [22], thus it is called free induction decay (FID). Fourier transform (FT) is applied on the FID to obtain a frequency domain spectrum. Figure 1 shows the simulated FID signal and the corresponding 1D NMR spectrum obtained from FT.

The typical experimental time for a 1D NMR spectrum usually takes several seconds, thus it is not time consuming. However, for a 2D NMR spectrum, the time domain signal is generated based on two time variables t₁ and t₂. As shown in Figure 2, one scan of 2D NMR spectrum contains three steps: first, the sample is excited by one or more pulses in the preparation period. These pulses result in the evolution of magnetization with time t₁; then, the sample is further excited in the mixing period; finally, an FID signal is recorded as a function of t₂. Usually, t₁ is set as t₁ = Δt₁, 2Δt₁, ..., n₁Δt₁, N₁Δt₁ (The increment Δt₁ is usually at the order of milliseconds). The number of t₁ increments (N₁) is determined by:

N_{1} = \frac{{SW}_{1}}{{Δ f}_{1}}

(2)

where

{SW}_{1} = \frac{1}{{Δ t}_{1}}

is the desired spectral width and

{Δ f}_{1} = \frac{1}{N_{1} {Δ t}_{1}}

is the corresponding spectral resolution. The typical N₁ is from 50 to 500 [22]. Given a fixed t₁ = n₁Δt₁, one scan is performed and the FID signal is recorded and stored along the direct dimension. After the scan, the nuclear spins are allowed to return to their equilibrium states before the next scan for t₁ = (n₁ + 1)Δt₁ [22].

Finally, 2D FT is performed on the 2D FID data. If the time for performing all the pulses in one scan is t_p, the total scanning time for a 2D NMR spectrum will be:

T_{N_{1}} = \sum_{n_{1} = 1}^{N_{1}} (d_{1} + n_{1} {Δ t}_{1} + t_{m} + t_{2} + t_{p}) = N_{1} (d_{1} + \frac{(1 + N_{1}) {Δ t}_{1}}{2} + t_{m} + t_{2} + t_{p})

(3)

In order to obtain a good resolution in the indirection dimension, N₁ is usually several tens or hundreds or even more. This will cause the total scanning time for a 2D NMR spectrum to be tens of minutes or even several hours [22–26].

In this paper, we aim to reduce the scan number for the t₁ dimension. Rather than using the uniform increment in the indirect dimension (t₁ = Δt₁, 2Δt₁, ..., n₁Δt₁, N₁Δt₁), we randomly choose unduplicated Q numbers from n_q ∈ {1, 2, ..., N₁}, and let t₁ = n_qΔt₁. Let:

ρ = \frac{Q}{N_{1}}

(4)

be the sampling rate in this paper, the total time to scan a 2D NMR spectrum is approximately:

T_{Q} = \frac{Q}{N_{1}} T_{N_{1}} = ρ T_{N_{1}}

(5)

The approximation is made by ignoring the total evolution time ∑_{n_q∈ {1,2,...,N₁},q = 1,2,...,Q}n_qΔt₁ since this value is only in the order of seconds. Compared to the time to acquire a 2D spectrum with fully sampled FIDs in the indirection dimension, undersampling the FIDs in the indirect dimension can greatly reduce the acquisition time for a 2D NMR spectrum if ρ is small enough. Figure 3 shows an example where we randomly undersample the indirect dimension with sampling rate ρ = 5/11 = 0.45. It means we save nearly half of the acquisition time of the conventional scheme.

However, this undersampling will result in aliasing artifacts [1,6]. It would be of great value if we can minimize these artifacts and reconstruct the full 2D NMR spectrum from the limited data. Here we explore the undersampling and reconstruction methods under the framework of CS.

3. Reconstruction of 2D Self-Sparse NMR Spectra with Compressed Sensing

3.1. Basic Concepts in Compressed Sensing

The CS proposed by Candès et al. [11] and Donoho [12] is a new theory to do undersampling and reconstruct the signal of interest from limited physically acquired data. They build a theoretical foundation that one can exactly or approximately recover signals from highly incomplete measurements. The two basic tenets to guarantee the performance of CS are sparsity and incoherence.

(a) Sparsity. For the signal x ∈ R^N and a basis dictionary Ψ ∈ R^{S × N} (e.g., identity matrix, FT, discrete cosine transform or wavelet transform matrix), the sparsity is often interpreted as:

S = {‖ α ‖}_{0} = {‖ Ψ x ‖}_{0} ≪ N

(6)

where ‖ α ‖₀ denotes the ℓ₀ norm that counts the nonzero entries in α, and S is the number of nonzero entries. If x is sparse without transformation (namely sparse in identity matrix I ∈ R^{N × N}), it is called self-sparse since other complicated sparsifying transform, e.g., wavelet transform, is not required.

Candès et al. [11] and Donoho [12] proved that it is possible to recover the original signal x from O(NlogS) measurements. This means the required number of measurements is proportional to the number of nonzero entries in the basis Ψ. The smaller the S is, the less the number of measurements is required.

(b) Incoherence. When a signal x is sampled by a sensing matrix Φ_{M × N}, the measurements y ∈ R^M of x is:

f = Φ x

(7)

The coherence is defined as [27,28]:

μ (Φ, Ψ) = max_{k, j} | 〈 ϕ_{k}, ψ_{j} 〉 |

(8)

where ∅_k is the kth rows of Φ and Ψ_j is the jth column of Ψ. The coherence measures the largest correlation between any row of Φ and column of Ψ. The less the coherence between Φ and Ψ is, the smaller the μ is. The value range of μ is

[1, \sqrt{N}]

. The minimal coherence μ = 1 occurs when Φ and Ψ is a time-frequency pair [29]. CS requires the coherence to be as small as possible, which means each measurement vector ∅_k must be ‘spread out’ in the Ψ domain [28].

If the signal x satisfies [30]:

S = {‖ α ‖}_{0} < \frac{1}{2} (1 + \frac{1}{μ (Φ, Ψ)})

(9)

it can be perfectly recovered by solving:

\hat{α} = min_{α} {‖ α ‖}_{0}, s . t . y = Φ Ψ α

(10)

where ‖ α ‖₀ denotes the ℓ₀ norm that counts the nonzero entries in α.

The recovered signal is:

\hat{x} = Ψ \hat{α}

(11)

Equation (9) implies that if the coherence between Φ and Ψ is small, more non-zeros can be allowed in the sparse representation α. CS suggests Φ to be random enough to guarantee its incoherence with any Ψ. This is also observed that random sampling in time domain can improve the quality of reconstructed spectra [31].

However, ℓ₀ norm is known to be intractable and sensitive to noise [11,12], and ℓ₁ norm convex optimization is commonly used in CS to recover x by solving:

\hat{α} = min_{α} {‖ α ‖}_{1}, s . t . y = Φ Ψ α

(12)

The accuracy of CS reconstruction using Equation (12) can be guaranteed if ΦΨ satisfies the appropriate restricted isometry properties [32]. A restricted isometry constant σ_s [32] defined as the smallest number such that:

(1 - σ_{S}) {‖ α ‖}_{2}^{2} \leq ‖ Φ Ψ α ‖ \leq (1 + σ_{S}) {‖ α ‖}_{2}^{2}

(13)

holds for all vectors that have at most S nonzero entries. If

σ_{2 S} < \sqrt{2} - 1

, the solution to the ℓ₁ norm problem is that of the ℓ₀ problem [32].

The number of measurements M should satisfy:

M \geq C \cdot μ^{2} (Φ, Ψ) \cdot S \cdot log N

(14)

so that the signal x can be exactly recovered from measurements y in overwhelming majority of cases [28]. Equation (14) implies that the number of measurements is proportional to the number of nonzero entries S in α and the square of coherence μ. If both S and μ are small, the required number of measurements M could be small. This means that one can perform fewer measurements to save acquisition time while reconstruct original signal x very well.

Iddo [16] applied CS to remove the aliasing artifacts from incompletely acquired FID data by enforcing the sparsity of 2D NMR spectra in wavelet domain according to:

\hat{α} = min_{α} {‖ α ‖}_{1}, s . t . ‖ y - Θ F^{T} Ψ^{T} α ‖ \leq σ

(15)

where y is the measurements in time domain, Θ is a random sampling operator defining the FIDs acquired in the indirect dimension, F^T denotes the inverse 2D FT, and Ψ^T is the inverse 2D wavelet transform. According to Equation (11), the recovered spectrum is x̂ = Ψ^Tα̂.

In this paper, we focus on the reconstruction of self-sparse NMR spectra in which significant peaks take up partial locations of the full NMR spectra while the remaining locations have very small or even no peaks. Ideally, if the number of sinusoids J in Equation (1) is very small, and the meaningful peaks are narrow enough relative to the whole 2D frequency coverage, the spectra can be considered to be sparse since the number of non-zeros for the spectra is much smaller than the number of spectrum points in the 2D NMR spectra.

The sparsifying transform and the coherence between Ψ and Φ = ΘF^T play important roles in the CS, as we have discussed. In the following sections, we will demonstrate that wavelet is not necessary to sparsify or even worsens the self-sparse NMR spectra based on the concept of sparsity and coherence. We will then reconstruct the NMR spectrum by enforcing its sparsity in an identity matrix domain with ℓ_p (p = 0.5) norm optimization algorithm.

To represent the NMR spectra in conventional way [4–7,17], the X and Y coordinate axes are shown with unit of parts per million (ppm) [21] defined as:

δ = \frac{ω - ω_{ref}}{ω_{0}} \times 10^{6}

(16)

where δ is the chemical shift of a peak with frequency ω, ω_ref is the frequency of a reference peak and ω₀ is the spectrometer carrier frequency.

3.2. Sparsity of Self-Sparse NMR Spectra

Figure 4(a) shows a 2D ¹H-¹H correlation spectroscopy (COSY) spectrum where most of the peaks fill partial and very limited regions of the full spectrum. This leads to the sparsity of spectrum because the number of non zeros in the 2D spectrum is much smaller than the number of spectrum points. This phenomenon is also observed by Yoh Matsuki et al. [17].

To test the sparsity of NMR spectra, we can measure the decay of coefficients in a sparsifying transform domain and evaluate the approximation error by retaining the k-term largest coefficients, because the reconstruction error is proportional to the power law decay k^−r, where r is a constant implying the sparsity of signal [29]. Rapid decay of coefficients implies that one can use less non-zero coefficients to approximate a NMR spectrum. If we directly measure the decay of signal without complicated sparsifying transform, e.g., wavelets, it means measure the self-sparsity of signal. Mathematical saying is measuring its sparsity in the identity matrix.

As shown in Figure 4(b), both the spectra and its wavelet coefficients can achieve rapid decay. By retaining 3% largest magnitude coefficients, the spectra can be reconstructed well in Figure 4(c,d). However, the spectrum is sparser than its representation in the wavelet domain. This is demonstrated by the faster decay of spectrum than that of its wavelet coefficients in Figure 4(b). By retaining the 1% largest magnitude coefficients, the wavelet fails to represent some peaks while the spectrum itself can represent these peaks, as marked by the arrows in Figure 4(e,f).

For a 2D ¹H-¹³C COSY spectrum, the spectrum decays faster than its wavelet coefficients (Figure 5(b)). This implies that the identity matrix can provide a sparser representation of spectra than a wavelet does. Peaks are lost or distorted by using the wavelet transform to represent the spectrum (Figure 5(e)), but the spectrum is represented very well with the identity matrix (Figure 5(f)). This phenomenon is consistent with the observation on the 2D ¹H-¹H COSY spectrum discussed above.

As a result, this spectrum is self-sparse, which means spectrum is sparse in the identity matrix. Thus, according to Equations (9) and (14), it is better to use an identity matrix than to use a wavelet to reconstruct the self-sparse spectra from undersampled FIDs since the wavelet cannot provide a sparser representation of the spectrum. In fact, Stern et al. [33] proposed to do iterative soft thresholding on the spectrum directly, not on wavelet coefficients, to recover one dimensional NMR spectra from the truncated FIDs. Although the sparsity of NMR spectra is not explicitly expressed in that work [33], the recovered spectrum is obtained from minimizing ℓ₁ norm of spectrum, which implies enforcing the sparsity of the spectrum. The problem of their method is that truncation violates the random sampling scheme in CS and results in strong Gibbs ringing which is hard to suppress [29]. What is more, truncating the 1D FID is not necessary to save the time to scan a spectrum since scanning a 1D NMR spectrum is fast and only takes on the order of seconds.

3.3. Coherence Property of Wavelet-Based and Identity Matrix-Based CS-NMR Spectra

Besides the sparsity of signal, another key factor for CS is the coherence between Φ and Ψ According to Equations (9) and (14), fewer measurements are required for signal sampling system Φ if it is less coherent with Ψ and the signal has same sparsity for different Ψ.

Pioneering work on CS has pointed out that the coherence of a time-frequency pair is μ(Φ, I) = μ(ΘF^T, I) = 1 [28]. Thus, we only need to compute the coherence between undersampled Fourier operator Φ and wavelet basis Ψ^T.

The undersampling of Θ in the indirect dimension is carried out by choosing some of the FID points in this dimension. To make this undersampling intuitive, a binary mask which has the same size of 2D FID is shown as the undersampling pattern in Figure 6(a). If the value of mask at location (i, j) is equal to 1 shown as a white pixel, the FID at location (i, j) is acquired.

To avoid the influence of randomness on the coherence calculation, Θ is randomly generated 10 times and the coherence is averaged for each sampling rate. Figure 6(b) shows that the coherence between wavelet and undersampled Fourier operator Φ is larger than the coherence between identity matrix and Φ. So, from the aspect of coherence, it is also better to choose the identity matrix for self-sparse NMR spectra.

3.4. Reconstruction of Self-Sparse NMR Spectra with ℓ_p Norm Minimization

In this paper, we propose to reconstruct the self-sparse 2D NMR spectra with identity matrix I as follows:

\hat{x} = min_{x} {‖ x ‖}_{1}, s . t . y = Φ x

(17)

where Φ = ΘF^T.

To further improve the reconstruction, a ℓ_p (0 < p < 1) norm is incorporated which has been demonstrated to give better reconstruction of MR images with fewer measurements than ℓ₁ norm does [34–37]:

\hat{x} = min_{x} {‖ x ‖}_{p}^{p}, s . t . y = Φ x

(18)

where

{‖ x ‖}_{p}^{p} = {\sum_{n = 1}^{N} | x_{n} |}^{p}

and x_n is the nth entry of vector x. For the function f(x) = |x|^p, with p → 0, f(x) gets closer to the ℓ₀ norm of x, as shown in Figure 7.

Theoretically, the required number of measurements [38] by enforcing the sparsity with a ℓ_p (0 < p < 1) norm is:

M \geq C_{1} (p) K + p C_{2} (p) K log (N / K)

(19)

where C₁ and C₂ are determined explicitly and bounded in p and the recommend p is 0.5 [34].

In this paper, the ℓ_p norm minimization is solved via the p-shrinkage operator [39] with continuation algorithm [40] because of its fast computation. This algorithm is abbreviated as PSOCA and summarized in Algorithm 1.

Algorithm 1. Self-sparse NMR spectra reconstruction with undersampled data using PSOCA.

**Algorithm 1.** Self-sparse NMR spectra reconstruction with undersampled data using PSOCA.
Initialization:
Input the sampled FID data y, set the regularization parameter λ =10⁸ and tolerance of inner loop η = 5 × 10⁻³. Initialize x = FΘ^T y, x_last = x, β = 2⁶, and α = 0.
Main:
While β ≤ 2¹⁶
Inner loop:
1. Given x,
For j = 1 to J, solve Equation (20), the solution is α;
2. Given α, solve Equation (22), the solution is x;
3. If ‖Δx‖ = ‖x_last – x‖ > η, x_last ← x, go to step 1; Otherwise, go to step 4;
Outer loop:
4. x̂ ← x, β ← 2β, go to step 1.
End While
Output: x̂

For a given continuation parameter β, PSOCA is implemented to solve two sub-problems:

(1) p-shrinkage operator

α_{j} = S_{ɛ}^{p} (x_{j}) = max {x_{j} - ɛ {| x_{j} |}^{p - 1}, 0} \frac{x_{j}}{| x_{j} |}

(20)

where

ɛ = β^{\frac{1}{p - 2}}

and β is a parameter to be updated in the continuation scheme, x_j and α_j are the jth entry of column vectors x and α, respectively.

(2) solve the linear equation:

min_{x} \frac{β}{2} {‖ α - x ‖}_{2}^{2} + \frac{λ}{2} {‖ y - Φ x ‖}_{2}^{2}

(21)

which can be simplified to:

(β I + λ P) F^{T} x = β F^{T} α + λ Θ^{T} y

(22)

where the term P = Θ^TΘ is a diagonal matrix consisting of ones and zeros. The diagonal entries of P correspond to the location of FID data and the entry value is 1 if a corresponding FID data point is sampled, otherwise the entry value is 0. Equation (22) can be solved fast since only a discrete Fourier transform and entry-wise division are required.

4. Simulation Results and Analysis

In this section, we will show the advantages of the proposed method in two aspects: (1) identity matrix as the sparsifying transform is compared with wavelet transform; (2) ℓ_p norm minimization is compared with ℓ₁ norm minimization. The recommended value of p is 0.5 for stability from empirical experiments [34]. The notation ℓ_0.5 is short for ℓ_p with p = 0.5. The typical ℓ₁ norm minimization algorithms compared in this paper include iterative soft thresholding (IST) algorithm [16,41–43], alternating and continuation algorithm (ACA) [40]. The ACA is just p = 1 in PSOCA.

Because regions of small spectrum values usually contain no peaks for practical analysis, we set magnitude smaller than a constant T to be zero according to:

x_{T} (j) = {\begin{array}{l} x (j), & x (j) \geq T \\ 0, & x (j) < T \end{array}

(23)

where x denotes the absolute value of spectra and x_T denotes the absolute value of post processed NMR spectra. For evaluation, T is set to two values. First, T is set to zero, which means a spectrum with small absolute values, possibly noise, are not suppressed. Second, T is set to the lowest value of contour when plotting the 2D spectrum. This is reasonable because peaks with absolute values smaller than T are not seen in the contour plot.

Suppose x̂ denotes the reconstructed spectrum from undersampled FID, relative ℓ₂ norm error (RLNE) is defined to measure the reconstruction error as:

RLNE = \frac{{‖ {\hat{x}}_{T} - {\tilde{x}}_{T} ‖}_{2}}{{‖ {\hat{x}}_{T} ‖}_{2}}

(24)

where x̃ is the reconstructed spectrum from fully sampled FID and 0 ≤ x̃, x̂_T ≤ 1. RLNE evaluates the normalized error presented in the reconstructed spectrum from undersampled FID. The lower the RLNE is, the better the reconstructed spectrum is consistent to the fully sampled spectrum.

4.1. Reconstruction of the spectra

The improvement by using the proposed method is verified from the less crowed ¹H-¹H COSY spectrum and more crowded ¹H-¹³C COSY spectrum. The sampling patterns of the two spectra are shown in Figure 8.

Figure 9(c–h) show the reconstructed ¹H-¹H COSY spectra corresponding to the sampling pattern in Figure 9(a) with a sampling rate of 0.20. With the ℓ₁ norm minimization, all the peaks are recovered successfully by using identity matrix (Figure 9(d,f)), while some peaks are lost by using wavelets (Figure 9(c,e)).

Since the contours for the marked peaks look faint, we also plot the 1D slices along the indirect dimension in Figure 10. The height of one peak in the wavelet-based reconstruction in Figure 10(a,b) are much lower than those in the fully sampled spectrum, leading to the peak lost in the contour plots in Figure 9(c,e).

Furthermore, the nonlinear operation on wavelet coefficients induces the artifacts labeled in Figure 9(c,e). This phenomenon is also observed in the 1D slices shown in Figure 10(a,b), where wavelet reconstruction generates illusive peaks. With the ℓ_0.5 norm minimization, the errors caused from wavelet and identity matrix reconstruction are reduced, as shown in Table 1. One can still observe the reduced peak height and artifacts in wavelet-based reconstruction, but identity matrix performs very well (Figure 10(d)). The advantage of ℓ_0.5 norm over ℓ₁ norm is obvious in the crowded ¹H-¹³C COSY spectra, as will be shown in the following discussion.

Figure 11 shows the reconstructed ¹H-¹³C COSY spectra corresponding to the sampling pattern in Figure 8(b) with a sampling rate of 0.25. Some peaks are obviously lost in the reconstructed spectra using wavelets with both ℓ₁ norm and ℓ_0.5 norm minimization (Figure 11(c,e,g)). These lost peaks are found in the identity matrix-based reconstruction spectra (Figure 11(d,f,h)). With the ℓ_0.5 norm minimization, the intensities of the peaks marked with arrow in Figure 11(h) are more consistent to the fully sampled spectra in Figure 11(b) than those in the reconstructed spectra with the ℓ₁ norm minimization (Figure 11(d,f)). The smallest reconstruction error is achieved with the proposed identity matrix-based ℓ_0.5 norm minimization method (Table 2).

All above simulation results demonstrate that wavelet-based reconstruction obviously induces the loss of some peaks in the crowded ¹H-¹³C COSY spectrum and loss of some weak peaks in the less crowded ¹H-¹H COSY spectrum. The wavelet may even worsen the reconstructed spectra. Thus, it is not a good choice to use wavelets for the self-sparse spectra discussed in this paper.

4.2. Discussion on the Computation

Our simulation is run on a dual core 2.2 GHz CPU laptop with 3 GB RAM. The computational time for the algorithms using wavelet is two times that using the identity matrix, as shown in Table 3.

In the simulation, with the gradual increase of continuation parameter β, the previous solution was used as a ‘warm start’ for the next alternating optimization in the PSOCA. For a given β, with the increase of iterations in inner loop, the difference between reconstructed spectra decreases (see Figure 12(a)), so does the error between the reconstructed spectrum and the fully sampled spectrum (see Figure 12(b)). The reconstruction error decreases when β becomes large in the outer loop. The computational time of ℓ_0.5 norm minimization in PSOCA is nearly four times as that of ℓ₁ norm minimization, as shown in Table 3.

5. Conclusions and Future Work

Random sampling in the indirect dimension is introduced to reconstruct 2D self-sparse NMR spectra within the CS framework. Based on the assumption of sparsity of NMR spectra, one may remove the aliasing by penalizing the ℓ₁ norm on the coefficients of the sparse representation of NMR spectra. Considering the sparsity and the coherence property, we demonstrate that wavelet transform may reduce the peak height and result in loss of peaks. Thus, a wavelet is not necessary and even worsens the reconstruction of self-sparse NMR spectra. With the ℓ_p (p = 0.5) norm minimization, the quality of reconstructed spectra can be further improved.

However, how to define the meaningless peaks depends on applications and a qualitative analysis of self-sparse NMR spectra is needed in order to satisfy the requirement of CS. By defining regularity of ideal Lorentizian peaks with aspect to typical vanishing moment wavelet basis, it is possible to give a boundary for the approximation error of Lorentizian peaks in wavelet representation. Thus, one may quantify the sparsity of spectra composed of ideal Lorentizian peaks using wavelets. Another way is to set up a database and analyze the sparsity of the meaningful peaks based on the prior knowledge of chemists. Since the peak height may be reduced in the wavelet-based reconstruction and this reduction depends on the crowd of peaks, it is expected to give a quantitative analysis on the effect of using/skipping wavelet transform by setting up a simulated spectrum or spectrum from real chemical substance, in which the crowd of peaks and the fixed relative height of peaks are pre-defined in the spectrum. Besides, based on the coherence property in CS, the analysis of the performance of different random sampling schemes, e.g., Poisson disk sampling, may lead to further reduction of sampling rate and reconstruction error. Extension of the proposed method on higher dimensional NMR spectra is worth investigating.

Acknowledgments

This work was partially supported by the NNSF of China under Grant 10974164, and the Research Fund for the Doctoral Program of Higher Education of China under Grant 200803840019. Xiaobo Qu and Di Guo would like to acknowledge the fellowship of Postgraduates’ Oversea Study Program for Building High-Level Universities from the China Scholarship Council. The authors also thank the reviewers for their thorough review and highly appreciate the comments and suggestions, which significantly contributed to improving the quality of this article.

References

Bretthorst, GL. Nonuniform sampling: Bandwidth and aliasing. Concept Magn. Reson. A 2008, 32A, 417–435. [Google Scholar]
Maciejewski, MW; Qui, HZ; Rujan, I; Mobli, M; Hoch, JC. Nonuniform sampling and spectral aliasing. J. Magn. Reson 2009, 199, 88–93. [Google Scholar]
Kazimierczuk, K; Kozminski, W; Zhukov, I. Two-dimensional fourier transform of arbitrarily sampled NMR data sets. J. Magn. Reson 2006, 179, 323–328. [Google Scholar]
Kazimierczuk, K; Zawadzka, A; Kozminski, W. Optimization of random time domain sampling in multidimensional NMR. J. Magn. Reson 2008, 192, 123–130. [Google Scholar]
Vosegaard, T; Nielsen, NC. Defining the sampling space in multidimensional NMR experiments: What should the maximum sampling time be? J. Magn. Reson 2009, 199, 146–158. [Google Scholar]
Mobli, M; Hoch, JC. Maximum entropy spectral reconstruction of nonuniformly sampled data. Concept Magn. Reson. A 2008, 32A, 436–448. [Google Scholar]
Jee, JG. Real-time acquisition of three dimensional NMR spectra by non-uniform sampling and maximum entropy processing. Bull. Korean Chem. Soc 2008, 29, 2017–2022. [Google Scholar]
Coggins, BE; Zhou, P. High resolution 4-D spectroscopy with sparse concentric shell sampling and FFT-CLEAN. J. Biomol. NMR 2008, 42, 225–239. [Google Scholar]
Yoon, JW; Godsill, SJ. Bayesian inference for multidimensional NMR image reconstruction. Proceedings of the European Signal Processing Conference (EUSIPCO), Florence, Italy, 4–8 September 2006.
Lin, MJ; Huang, YQ; Chen, X; Cai, SH; Chen, Z. High-resolution 2D NMR spectra in inhomogeneous fields based on intermolecular multiple-quantum coherences with efficient acquisition schemes. J. Magn. Reson 2011, 208, 87–94. [Google Scholar]
Candes, EJ; Romberg, J; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inform. Theory 2006, 52, 489–509. [Google Scholar]
Donoho, DL. Compressed sensing. IEEE Trans. Inform. Theory 2006, 52, 1289–1306. [Google Scholar]
Lustig, M; Donoho, D; Pauly, JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magn. Reson. Med 2007, 58, 1182–1195. [Google Scholar]
Duarte, MF; Davenport, MA; Takhar, D; Laska, JN; Sun, T; Kelly, KF; Baraniuk, RG. Single-pixel imaging via compressive sampling. IEEE Signal Proc. Mag 2008, 25, 83–91. [Google Scholar]
Wright, J; Yang, AY; Ganesh, A; Sastry, SS; Ma, Y. Robust face recognition via sparse representation. IEEE Trans. Pattern Anal 2009, 31, 210–227. [Google Scholar]
Drori, I. Fast l₁ minimization by iterative thresholding for multidimensional NMR spectroscopy. EURASIP J Adv Sig Proc 2007. [Google Scholar] [CrossRef]
Matsuki, Y; Eddy, MT; Herzfeld, J. Spectroscopy by integration of frequency and time domain information for fast acquisition of high-resolution dark spectra. J. Am. Chem. Soc 2009, 131, 4648–4656. [Google Scholar]
Kazimierczuk, K; Orekhov, VY. Accelerated NMR spectroscopy by using compressed sensing. Angew. Chem. Int. Ed 2011, 50, 5556–5559. [Google Scholar]
Holland, DJ; Bostock, MJ; Gladden, LF; Nietlispach, D. Fast multidimensional NMR spectroscopy using compressed sensing. Angew. Chem. Int. Ed 2011, 50, 6548–6551. [Google Scholar]
Shrot, Y; Frydman, L. Compressed sensing and the reconstruction of ultrafast 2D NMR data: Principles and biomolecular applications. J. Magn. Reson 2011, 209, 352–358. [Google Scholar]
Hoch, JC; Stern, AS. NMR Data Processing; Wiley-Liss: New York, NY, USA, 1996; p. 38. [Google Scholar]
Keeler, J. Understanding NMR Spectroscopy; Wiley: New York, NY, USA, 2005; Chapter 7,; pp. 1–30. [Google Scholar]
Aue, WP; Bartholdi, E; Ernst, RR. 2-Dimensional spectroscopy: Application to nuclear magnetic-resonance. J. Chem. Phys 1976, 64, 2229–2246. [Google Scholar]
Ernst, RR; Bodenhausen, G; Wokaun, A. Principles of Nuclear Magnetic Resonance in One and Two dimensions; Oxford University Press: New York, NY, USA, 1990. [Google Scholar]
Frydman, L; Scherf, T; Lupulescu, A. The acquisition of multidimensional NMR spectra within a single scan. Proc. Natl. Acad. Sci. USA 2002, 99, 15858–15862. [Google Scholar]
De Graaf, RA. In Vivo NMR Spectroscopy Principles and Techniques, 3rd ed; John Wiley & Sons: Hoboken, NJ, USA, 2007; pp. 389–444. [Google Scholar]
Donoho, DL; Huo, XM. Uncertainty principles and ideal atomic decomposition. IEEE Trans. Inform. Theory 2001, 47, 2845–2862. [Google Scholar]
Candes, E; Romberg, J. Sparsity and incoherence in compressive sampling. Inverse Probl 2007, 23, 969–985. [Google Scholar]
Candès, EJ; Romberg, J. Practical signal recovery from random projections. Proceedings of the Wavelet Applications in Signal and Image Processing XI, San Diego, CA, USA, 31 July–4 August 2005; p. 5914.
Elad, M. Optimized projections for compressed sensing. IEEE Trans. Signal Process 2007, 55, 5695–5702. [Google Scholar]
Hoch, JC; Maciejewski, MW; Filipovic, B. Randomization improves sparse sampling in multidimensional NMR. J. Magn. Reson 2008, 193, 317–320. [Google Scholar]
Candes, EJ. The restricted isometry property and its implications for compressed sensing. Compt. Rendus Math 2008, 346, 589–592. [Google Scholar]
Stern, AS; Donoho, DL; Hoch, JC. NMR data processing using iterative thresholding and minimum l₁-norm reconstruction. J. Magn. Reson 2007, 188, 295–300. [Google Scholar]
Chartrand, R. Exact reconstruction of sparse signals via nonconvex minimization. IEEE Signal Proc. Lett 2007, 14, 707–710. [Google Scholar]
Trzasko, J; Manduca, A. Highly undersampled magnetic resonance image reconstruction via homotopic l₀-minimization. IEEE Trans. Med. Imaging 2009, 28, 106–121. [Google Scholar]
Qu, X; Cao, X; Guo, D; Hu, C; Chen, Z. Compressed sensing MRI with combined sparsifying transforms and smoothed l₀ norm minimization. Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing—ICASSP’10, Dallas, TX, USA, 14–19 March 2010; pp. 626–629.
Majumdar, A; Ward, R. Under-determined non-cartesian MR reconstruction with non-convex sparsity promoting analysis prior. Proceedings of the Medical Image Computing and Computer-Assisted Intervention—MICCAI’10, Beijing, China, 20–24 September 2010; pp. 513–520.
Chartrand, R; Staneva, V. Restricted isometry properties and nonconvex compressive sensing. Inverse Probl 2008, 24, 1–14. [Google Scholar]
Chartrand, R. Fast algorithms for nonconvex compressive sensing: MRI reconstruction from very few data. Proceedings of the 2009 IEEE International Symposium on Biomedical Imaging: From Nano to Macro—ISBI’09, Boston, MA, USA, 28 June–1 July 2009; pp. 262–265.
Yang, JF; Zhang, Y; Yin, WT. A fast alternating direction method for TV L1-L2 signal reconstruction from partial fourier data. IEEE J. Sel. Top. Signal Process 2010, 4, 288–297. [Google Scholar]
Qu, XB; Zhang, WR; Guo, D; Cai, CB; Cai, SH; Chen, Z. Iterative thresholding compressed sensing MRI based on contourlet transform. Inverse Probl. Sci. En 2010, 18, 737–758. [Google Scholar]
Guo, D; Qu, XB; Huang, LF; Yao, Y. Sparsity-based spatial interpolation in wireless sensor networks. Sensors 2011, 11, 2385–2407. [Google Scholar]
Zibulevsky, M; Elad, M. L1-L2 optimization in signal and image processing. IEEE Signal Proc. Mag 2010, 27, 76–88. [Google Scholar]

Figure 1. Simulated FID data in time domain (a) and its corresponding 1D NMR spectrum (b). Note: the FID is simulated according to Equation (1) with J = 2, A₁ = 0.5, A₂ = 1, Δt = 0.01 s, τ₁ = τ₂ = 800, ∅₁ = ∅₂ = 0, and ω₁ = 70 Hz, ω₂ = 20 Hz.

Figure 2. General scheme for 2D NMR spectra.

Figure 3. An example of random undersampling in the indirect dimension. The symbol ⇐ denotes the acquired FIDs.

Figure 4. Sparsity of a ¹H-¹H COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 3% and 1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 3% and 1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).

Figure 5. Sparsity of a ¹H-¹³C COSY spectrum and its wavelet (symmlet wavelet with four decomposition levels and eight vanishing moments) representation. (a) The fully sampled NMR spectrum; (b) decay of real part of spectrum and its wavelet coefficients; (c,e) reconstructed spectra from 1% and 0.1% largest coefficients in wavelet domain; (d,f) reconstructed spectra from 1% and 0.1% largest coefficients in identity matrix domain. Note: the wavelet fails to represent peaks marked with arrows in (e) and these peaks are successfully represented in (f).

Figure 6. Coherence of wavelet and FT. (a) One sampling pattern in the indirect dimension with sampling rate ρ = 0.30 (fully sampled points in the indirect dimension is N₁ = 64); (b) coherences for different sampling rates. The symmlet wavelet with four decomposition levels and eight vanishing moments is chosen as a typical wavelet for test, which is also the typical wavelet in [16]. Error bar stands for the standard deviation when repeating 10 times at each sampling rate.

Figure 7. The value of f(x) = |x|^p versus the value of p.

Figure 8. Sampling pattern used in simulation. (a) Cartesian sampling pattern with sampling rate 0.20 for the 2D ¹H-¹H COSY spectrum (N₁ = 256 points) in Figure 4(a); and (b) Cartesian sampling pattern with sampling rate 0.25 for the 2D ¹H-¹³C COSY spectrum (N₁ = 128 points) in Figure 5(a).

Figure 9. CS reconstruction of a 2D ¹H-¹H COSY spectrum using wavelet and identity matrix. (a,b) reconstructed spectra using fully sampled FID and undersampled FID with zero filling, respectively; (c,d) reconstructed spectra using wavelets and identity matrix with IST-based ℓ₁ norm, respectively; (e,f) reconstructed spectra using wavelets and identity matrix with PSOCA-based ℓ₁ norm, respectively; (g,h) reconstructed spectra using wavelets and identity matrix with PSOCA-based ℓ_p norm, respectively.

Figure 10. 1D slices along the indirect dimension for the chemical shift of 8.2 ppm (a–c) or 7.2 ppm (d) in the direct dimension. (a) Spectra reconstructed with IST-based ℓ₁ norm; (b) spectra reconstructed with PSOCA-based ℓ₁ norm; (c) spectra reconstructed with PSOCA-based ℓ_0.5 norm; (d) spectra reconstructed with PSOCA-based ℓ_0.5 norm.

Figure 11. CS reconstruction of a 2D ¹H-¹³C COSY spectrum using wavelet and identity matrix. (a,b) spectra reconstructed using fully sampled FID (N₁ = 128 points) and undersampled FID with zero filling, respectively; (c,d) spectra reconstructed using wavelets and identity matrix with IST-based ℓ₁ norm, respectively; (e,f) spectra reconstructed using wavelets and identity matrix with PSOCA-based ℓ₁ norm, respectively; (g,h) spectra reconstructed using wavelets and identity matrix with PSOCA-based ℓ_0.5 norm, respectively.

Figure 12. Numerical performance of PSOCA. (a) The ℓ₂ norm of difference between reconstructed spectra in the current and previous iteration when β = 2¹² in inner loop; (b) the reconstruction error RLNE of the reconstructed spectra when β = 2¹² in inner loop; and (c) the reconstruction error RLNE versus the iterations in outer loop in PSOCA.

Table 1. Reconstruction error of a ¹H-¹H COSY spectrum.

**Table 1.** Reconstruction error of a ¹H-¹H COSY spectrum.
Methods		Zero-filling	IST ℓ₁	PSOCA ℓ₁	PSOCA ℓ_0.5
Wavelet	RLNE (T = 0)	2.054	0.415	0.393	0.430
Wavelet	RLNE (T = 0.1)	0.059	0.012	0.010	0.007
Identity matrix	RLNE (T = 0)	2.054	0.282	0.273	0.245
Identity matrix	RLNE (T = 0.1)	0.059	0.010	0.007	0.022

Table 2. Reconstruction error of a ¹H-¹³C COSY spectrum.

**Table 2.** Reconstruction error of a ¹H-¹³C COSY spectrum.
Methods		Zero-filling	IST ℓ₁	PSOCA ℓ₁	PSOCA ℓ_0.5
Wavelet	RLNE (T = 0)	1.687	0.547	0.533	0.541
Wavelet	RLNE (T = 0.1)	0.098	0.044	0.042	0.042
Identity matrix	RLNE (T = 0)	1.687	0.422	0.405	0.343
Identity matrix	RLNE (T = 0.1)	0.098	0.033	0.031	0.027

Table 3. Running time for reconstruction of a NMR spectrum (unit: second).

**Table 3.** Running time for reconstruction of a NMR spectrum (unit: second).
Methods	Zero-filling		IST ℓ₁		PSOCA ℓ₁		PSOCA ℓ_0.5

	¹H-¹H	¹H-¹³C	¹H-¹H	¹H-¹³C	¹H-¹H	¹H-¹³C	¹H-¹H	¹H-¹³C
Wavelet	0.1	0.1	11.1	56.8	8.5	70.4	29.1	221.2
Identity matrix	0.1	0.1	5.9	27.5	5.7	31.8	16.0	105.6

© 2011 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Qu, X.; Guo, D.; Cao, X.; Cai, S.; Chen, Z. Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension. Sensors 2011, 11, 8888-8909. https://doi.org/10.3390/s110908888

AMA Style

Qu X, Guo D, Cao X, Cai S, Chen Z. Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension. Sensors. 2011; 11(9):8888-8909. https://doi.org/10.3390/s110908888

Chicago/Turabian Style

Qu, Xiaobo, Di Guo, Xue Cao, Shuhui Cai, and Zhong Chen. 2011. "Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension" Sensors 11, no. 9: 8888-8909. https://doi.org/10.3390/s110908888

Article Menu

Reconstruction of Self-Sparse 2D NMR Spectra from Undersampled Data in the Indirect Dimension^†

Abstract

1. Introduction

2. Undersampling in the Indirect Dimension of 2D NMR