Next Article in Journal
Reduction of Species Identification Errors in Surveys of Marine Wildlife Abundance Utilising Unoccupied Aerial Vehicles (UAVs)
Next Article in Special Issue
Rapid Target Detection of Fruit Trees Using UAV Imaging and Improved Light YOLOv4 Algorithm
Previous Article in Journal
Investigating the Effects of Snow Cover and Vegetation on Soil Temperature Using Remote Sensing Indicators in the Three River Source Region, China
Previous Article in Special Issue
A Novel Multispectral Line Segment Matching Method Based on Phase Congruency and Multiple Local Homographies
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dictionary Learning- and Total Variation-Based High-Light-Efficiency Snapshot Multi-Aperture Spectral Imaging

School of Mechanical Engineering and Automation, Fuzhou University, Fuzhou 350108, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(16), 4115; https://doi.org/10.3390/rs14164115
Submission received: 11 July 2022 / Revised: 10 August 2022 / Accepted: 17 August 2022 / Published: 22 August 2022
(This article belongs to the Special Issue Machine Vision and Advanced Image Processing in Remote Sensing)

Abstract

:
Conventional multispectral imaging systems based on bandpass filters struggle to record multispectral videos with high spatial resolutions because of their limited light efficiencies. This paper proposes a multi-aperture multispectral imaging system based on notch filters that overcomes this limitation by allowing light from most of the spectrum to pass through. Based on this imaging principle, a prototype multi-aperture multispectral imaging system comprising notch filters was built and demonstrated. Further, a dictionary learning- and total variation-based spectral super-resolution algorithm was developed to reconstruct spectral images. The simulation results obtained using public multispectral datasets showed that, compared to the dictionary learning-based spectral super-resolution algorithm, the proposed algorithm reconstructed the spectral information with a higher accuracy and removed noise, and the verification experiments confirmed the performance efficiency of the prototype system. The experimental results showed that the proposed imaging system can capture images with high spatial and spectral resolutions under low illumination conditions. The proposed algorithm improved the spectral resolution of the acquired data from 9 to 31 bands, and the average peak signal-to-noise ratio remained above 43 dB, which is 13 dB higher than those of the state-of-the-art coded aperture snapshot spectral imaging methods. Simultaneously, the frame rate of the imaging system was up to 5000 frames/s under natural daylight.

Graphical Abstract

1. Introduction

Multispectral images (MSIs) contain several to dozens of spectral bands of the target scene and are widely used in agriculture [1], medical diagnosis [2,3], remote sensing [4], and other applications. However, conventional complementary metal-oxide semiconductor (CMOS) or charge-coupled device imaging sensors cannot directly acquire the three-dimensional (3D) spectral data of a target scene. Therefore, for existing multispectral imaging systems, scarification of the temporal or spatial resolution has been introduced to obtain multispectral data cubes [5,6,7]. Imaging systems that directly capture multispectral data cubes can be categorized into three types: spatial/spectral-scanning spectrometers, per-pixel filter mosaic (micro/nanostructure) snapshot multispectral cameras, and multi-aperture multispectral systems. Spatial-scanning spectrometers capture all spectral information from a point or spatial line. By contrast, spectral-scanning spectrometers record a single-wavelength image each time, and other wavelengths can be captured by scanning filters or by changing the central wavelength of the filters. Therefore, spatial/spectral-scanning spectrometers require a long time to collect MSIs [8,9], making them unsuitable for capturing MSIs of moving scenes. Snapshot multispectral cameras, which are based on the traditional Bayer color imaging concept, directly deposit 4 × 4 or 5 × 5 pixel-wise filter units on a CMOS image sensor. Although a monolithic deposition results in compactness, the spatial resolution is degraded [10]. Compared with snapshot multispectral cameras, multi-aperture spectral imaging systems, with different bandpass filters [11] or continuous variable filters [6] in front of each aperture, exhibit significantly improved spatial resolutions. However, owing to the narrow bandwidth of each filter, the use of bandpass filters or continuous variable filters results in low light efficiency, and the number of apertures is limited for low volumes and small weights. Therefore, conventional multispectral imaging systems cannot simultaneously achieve high spatial, spectral, and frame-rate spectral imaging [5]. Computational spectral imaging technology overcomes the disadvantages of conventional spectral imaging systems, utilizing an imaging prior (e.g., smoothness or sparsity) to recover the original high-dimensional spectral information from the low-dimensional observation data by modulating the spectral and spatial information of the target scene with a coded aperture. A typical computational spectral imaging method is the coded aperture snapshot spectral imaging (CASSI) [12], which enables the acquisition of MSI snapshots using a single imaging sensor; however, its optical system is complicated and requires precise calibration. In practice, the reconstructed spectral images obtained from CASSI usually exhibit limited spatial resolution and severe spectral distortion because the required conditions for high-fidelity compressive sensing reconstruction are difficult to verify through a coded aperture design [13,14]. The shortcomings of CASSI include light-intensity attenuation in long optical paths and coding apertures, and the highest frame rate reported in the literature is 30 fps [15,16]. Hence, a CASSI-based dual-camera compressive hyperspectral imaging (DCCHI) system [15], including a panchromatic camera, was proposed to reduce the complexity of underdetermined calculations and improve the image quality. As demonstrated in previous studies, the combination of a multi-aperture system design and compressive sensing algorithms enables high-spectral-resolution imaging and acquisition of almost gigapixel hyperspectral data cubes with hundreds of spectral bands [17,18]. However, multi-aperture systems exhibit low light efficiencies caused by bandpass filters. To solve the issue of light efficiency affecting the speed of multispectral imaging, Wang et al. [5] proposed a notch filter and compressive sensing theory-based multispectral imaging system, which enables high light efficiency as well as a high imaging frame rate.
For computational spectral imaging systems, the signals can be recovered from acquired data using compressive sensing theory, and the required minimum frequency of the measurement data is much lower than that required by Nyquist’s sampling law. In 2006, Donoho et al. [19,20] proposed compressive sensing theory, which is widely used in image processing [21], biomedicine [22], and wireless communication [23]. Compressive sensing theory assumes that a signal is sparse, and the observation matrix and basis matrix are incoherent [24]. Although most signals are not sparse in the temporal or spatial domain, they are on a sparse basis (e.g., discrete wavelet transform, discrete cosine waveform, and discrete Fourier transform), and the original signal can still be recovered using compressive sensing theory. Because compressive sensing only requires a small amount of observational data to recover the original high-dimensional data with a high probability, the application of compressive sensing theory in the field of computational imaging enables compressed sampling of the spatial [25,26], spectral [12], or time dimensions [27] of the target scene. For instance, magnetic resonance imaging utilizes the image signal sparseness in the Fourier domain to recover the original image with small sampling data, reduce the image acquisition time, and improve the image quality [28]. The typical imaging algorithms developed for CASSI systems include generalized alternating projection with total variation (GAP-TV) [29], two-step iterative shrinkage/thresholding (TwIST) [30] based on the total variation (TV) prior [31], and dictionary-based reconstruction [15]. The TV-based methods can be used for denoising or deblurring by assuming image smoothing, whereas the dictionary learning methods can be used to train a dictionary with high sparsity to improve the imaging quality; however, they are heavily influenced by the noise in the measured data. Thus, these two types of algorithms cannot simultaneously satisfy the requirements of high image quality and strong robustness. For multispectral imaging, an imaging algorithm with good imaging quality and strong noise robustness is particularly important for real-world applications because of the noise introduced by environmental light or imaging sensors.
The main contributions of this paper are as follows:
  • A multispectral imaging system that combines a notch-filter array and multiple apertures is proposed. The use of notch filters enables the development of a high-light-efficiency imaging system that overcomes the drawbacks of conventional bandpass-filter-based multispectral imaging systems (e.g., low spatial resolutions and imaging speeds). Compared with CASSI or DCCHI systems, the proposed multi-aperture multispectral imaging system enables more spectral information from the target scene to be captured, significantly reducing the complexity of the underdetermined reconstruction problem. Compared with those of other bandpass-filter-based multispectral imaging systems, the higher light efficiency yielded by the notch-filter array significantly improves the imaging quality and temporal resolution of the multispectral imaging system.
  • A dictionary learning- and TV-based spectral super-resolution algorithm (DL-TV) is proposed; it can train sparse dictionaries to achieve a high imaging quality as well as reduce noise with TV. Because the proposed method introduces more imaging priors, it can provide better imaging performance than the alternative direction multiplier method (ADMM) [32] with the dictionary learning algorithm (DL) [33].
  • The effectiveness of the proposed system and algorithm is demonstrated through simulations using various datasets.
  • A snapshot multispectral-imaging prototype system is built to verify real-world imaging performance via indoor experiments and field tests. The experimental results demonstrate that the combination of the proposed imaging system and compressive-sensing-based super-resolution spectral algorithm can obtain high-quality as well as high-spatial-spectral-resolution images.

2. Methods

2.1. Notch Filter Imaging Model

An MSI is a 3D data cube represented by C M × N × Q , where M and N are the spatial dimensions of the image and Q is the spectral dimension. Owing to the difficulty of mathematically modeling and constructing an optimization function for a 3D data block, C is transformed into a 2D matrix: X Q × M N , where M N represents the product of two spatial dimensions.
Figure 1 shows the spectral transmittance of the eight notch filters used in this study. The notch images, Y K × M N , captured by the notch filters can be expressed as:
Y = T X ,
where T K × Q denotes the transmittance measurement matrix of the proposed system and is determined by the selected notch filters. For the simulation and experiments, k = 9 was used. T ( i , λ ) denotes the transmittance of the i t h notch filter at the λ t h band. Compared with a bandpass filter, which transmits only a narrow spectrum of light, resulting in a low light efficiency, the notch filter enables most of the spectrum to pass through while blocking only a specific portion of the spectrum. Consequently, the light efficiency of a notch-filter-based imaging system is close to that of a panchromatic imaging system. Therefore, the exposure time of the imaging process can be decreased, and the signal-to-noise ratio and imaging speed can be improved. For a notch-filtered spectral image, the corresponding bandpass spectral image was obtained by subtracting the panchromatic image from the notch-filtered spectral image. As shown in Figure 1, in the proposed multi-aperture multispectral imaging system, eight of the nine apertures were covered with a notch filter, and the central aperture was used to capture a panchromatic image.
In an ideal situation, the transmittances of the transmission and cutoff bands are 100% and 0%, respectively. Therefore, the transmittance in the ideal case can be approximately written as
T ( i , λ ) = { 1 , o t h e r w i s e 0 , | λ a i | b i ,
where a i denotes the central wavelength of the i t h notch filter, and b i is its corresponding full bandwidth at half maximum. However, owing to the optical materials and manufacturing errors, the actual transmittance of the transmission band is between 90% and 100%, whereas the cutoff transmittance is less than 10% [5].

2.2. Spectral Super-Resolution Algorithm

Reportedly, the spectral information of one pixel can be sparsely represented by a trained dictionary [34], and the calculated results outperform the linear combination of a few spectral bases obtained via principal component analysis [35]. For a point ( m , n ) on a spectral image, the spectral vector x j Q × 1 can be expressed as
x j = D θ j ,   j = { 1 , , M N } ,
where D Q × P denotes an overcomplete dictionary trained using MSIs. P is the number of dictionary atoms. Here, Q = 31 and P = 2 Q = 62 . The dictionary D was trained using the K-SVD algorithm by generalizing the K-means clustering process [36]. We let θ j P × 1 denote the sparse vector of x j with a much smaller number (<Q) of nonzero elements. To speed up the calculation, all the pixels in the spatial domain were simultaneously calculated. Substituting Equation (3) into Equation (1) yields
Y = T D Θ ,
where the sparse matrix, Θ P × M N , must be solved, and its nonzero elements must be minimized as follows:
arg min Θ 1 2 Y T D Θ F 2 + η Θ 1 .
The first term in Equation (5) is the fidelity term, the second term is the sparse term, and η is the regularization factor that balances the fidelity and sparse terms. Because the MSIs are smooth in the spatial dimension, the TV constraint term [37] is added to the optimization objective of Equation (5) and can be expressed as:
arg min Θ 1 2 Y T D Θ F 2 + η Θ 1 + η TV X TV ,
where η TV is the regularization factor used to adjust the smoothness of the target, and X TV is the TV constraint term, which can be expressed as:
X TV = λ = 1 Q m = 1 M n = 1 N X v ( λ ,   ( m 1 ) × N + n ) 2 + X h ( λ ,   ( m 1 ) × N + n ) 2 [ X v ( λ ,   ( m 1 ) × N + n ) X h ( λ ,   ( m 1 ) × N + n ) ] = [ C ( m + 1 , n , λ ) C ( m , n , λ ) C ( m , n + 1 , λ ) C ( m ,   n ,   λ ) ] ,
where X v ( λ ,   ( m 1 ) × N + n ) and X h ( λ ,   ( m 1 ) × N + n ) are the vertical and horizontal differences between the pixels of X , respectively. Substituting Equation (3) into Equation (6) yields
arg min Θ 1 2 Y T D Θ F 2 + η Θ 1 + η TV D Θ TV .
By adding auxiliary variables Z 1 = Θ and Z 2 = D Θ to Equation (8), the above problem can be written as
arg min Θ 1 2 Y T D Θ F 2 + η Z 1 1 + η TV Z 2 TV , s . t .   Z 1 = Θ ,   Z 2 = D Θ .
The corresponding augmented Lagrangian function is as follows:
( Θ , Z 1 , Z 2 , V 1 , V 2 ) = 1 2 Y T D Θ F 2 + η Z 1 1 + η TV Z 2 TV + ρ 1 2 Θ Z 1 + V 1 ρ 1 F 2 + ρ 1 2 D Θ Z 2 + V 2 ρ 2 F 2 .
where V 1 and V 2 are the Lagrange multipliers, and ρ 1 and ρ 2 and are the coefficients of the regular terms. ADMM theory [32] states that minimizing (6) is equivalent to iterating over the following steps until convergence:
Θ k + 1 = arg min Θ ( Θ , Z 1 k , Z 2 k , V 1 k , V 2 k ) ,
Z 1 k + 1 = arg min Z 1 ( Θ 1 k + 1 , Z 1 , V 1 k ) ,
Z 2 k + 1 = arg min Z 2 ( Θ 1 k + 1 , Z 2 , V 2 k ) ,
The Lagrange multipliers are updated by
V 1 k + 1 = V 1 k + ρ 1 ( Θ k + 1 Z 1 k + 1 ) ,
V 2 k + 1 = V 2 k + ρ 2 ( D Θ k + 1 Z 2 k + 1 ) .
where k denotes the number of iterations. Equation (11) can be solved as follows:
Θ k + 1 = arg min Θ 1 2 Y T D Θ F 2 + ρ 1 2 Θ Z 1 k + V 1 k ρ 1 F 2 + ρ 2 2 D Θ Z 2 k + V 2 k ρ 2 F 2 ,
Θ k + 1 = ( D T T T T D + ρ 1 I + ρ 2 D T D ) 1 × [ D T T T Y + ρ 1 ( Z 1 k V 1 k ρ 1 ) + ρ 2 ( Z 2 k V 2 k ρ 2 ) ] ,
where I is the identity matrix, and the superscript T denotes the matrix transpose. Equation (12) is a typical lasso problem and can be solved as follows:
Z 1 k + 1 = arg min Z 1 ρ 1 2 Θ k + 1 Z 1 + V 1 k ρ 1 F 2 + η Z 1 1 ,
Z 1 k + 1 = s o f t ( Θ k + 1 + V 1 k ρ 1 , η ρ 1 ) .
Here, s o f t (   ) is the well-known soft-thresholding operator [38]
s o f t ( a , b ) = max ( | a | b , 0 ) s ( a ) ,
where s is the signum function [39,40]. Equation (13) is equivalent to
Z 2 k + 1 = arg min Z 2 ρ 2 2 Θ k + 1 Z 2 + V 2 k ρ 2 F 2 + η TV Z 2 TV ,
Equation (21) is a simple TV denoising problem that can be solved using any TV algorithm; however, the fast gradient projection algorithms are generally recommended for solving this equation because of their quick convergence [41]. When η TV = 0 , the algorithm is the DL algorithm (Algorithm 1).
Algorithm 1: DL-TV for MSI Reconstruction
1: Input: T , D , ρ 1 > 0 , ρ 2 > 0
2: Initialization: Z 1 , Z 2 , V 1 , V 2 , Θ , I t e r max , k = 1 ;
3: while k I t e r max do
  4: Update Θ k + 1 via Equation (17)
  5: Update Z 1 k + 1 via Equation (19)
  6: Update Z 2 k + 1 via Equation (21)
  7: Update V 1 k + 1 via Equation (14)
  8: Update V 2 k + 1 via Equation. (15)
9: end while
10: Compute Θ k + 2 via Equation (17)
Output: MSI X = D × Θ k + 2

2.3. Prototype System

The proposed multi-aperture snapshot multispectral imaging system is shown in Figure 2. The system consists of an array of eight notch filters (Edmund OD 4.0 Notch), an array of lenses (Edmund FL 35 mm, f/1.8–f/16), and a monochromatic camera array (HIKVISION, MV-CA013-20GMGCGN). To record panchromatic images, no filter was added to the central lens, as shown in Figure 1b and Figure 2b. The eight surrounding lenses were equipped with notch filters of varying central wavelengths, as shown in Figure 1. All the cameras were synchronized to capture eight notch images and one panchromatic image simultaneously. Because the field of view of the images captured by each camera varies, image registration is required. The traditional bandpass multi-aperture system only captures a particular band of the spectrum of the target scenes. With most of the spectral bands missing, some areas of the spectral image appear dark, making the image registration of the multi-aperture image system difficult. The notch filter collects an image that is almost the same as a panchromatic image, because only a small portion of the spectrum is rejected, and the speeded up robust features algorithm can be used to achieve high-precision image registration [42].

3. Simulations Using Public Datasets

The CAVE [43] and ICVL [44] datasets were used to verify the effectiveness of the proposed algorithm (Section 2). These datasets include MSIs with 31 spectral bands ranging from 400 to 700 nm, and the spectral resolution of the MSIs was 10 nm. The CAVE dataset includes 31 indoor MSIs with a spatial resolution of 512 × 512 pixels. The ICVL dataset includes 201 MSIs with a spatial resolution of 1392 × 1300 pixels, and downsampling to 512 × 512 pixels during simulation. In this study, 16,000 pixel vectors were selected from 16 MSIs of the CAVE dataset to form the training dataset, as suggested in [44], and the remaining 15 MSIs were used as the test dataset, as listed in Table 1. When choosing the training set, the central region of each image was sampled to avoid the black background area. For the trained dictionaries, different dictionaries resulted in different imaging qualities for each MSI in the test set. Therefore, all test sets shown in Section 3 as well as the field test data presented in Section 4 were implemented using the same dictionary trained in this study. In the simulation, eight notch images and one panchromatic image were created to simulate the images obtained using the proposed system. Because of the fast convergence of the ADMM algorithm, the number of DL-TV and DL iterations was set to k = 40 in this study. When performing calculations using the CAVE and ICVL datasets, the parameters used in DL and DL-TV were set as follows: DL: η = 0.01 , ρ = 0.001 ; DL-TV: η = 0.01 , η TV = 0.00001 , ρ 1 = 0.001 , ρ 2 = 0.001 .
For comparison, CASSI simulations were conducted using TwIST with the TV algorithm, GAP-TV, and the plug-and-play (PnP) approach [45], and the DCCHI simulations were performed using TwIST with the TV algorithm. The coded aperture mask reported in [45] was used for simulation. Zhang et al. [46] mentioned that TwIST would converge after 80 iterations; therefore, the iteration number in TwIST was set to 150. According to [14], the regularization parameter used in TwIST was set to τ = 0.1. The iterations and parameters of PnP and GAP-TV were set as reported in [45].
The test results are presented in Table 1, Table 2, Table 3 and Table 4. Table 1 and Table 2 show the calculation results of the test dataset of 15 MSIs selected from the CAVE dataset, and Table 3 and Table 4 provide the calculation results for the ICVL dataset. For both the datasets and under all four metrics, DL outperformed TwIST, GAP-TV, PnP, and DCCHI, whereas DL-TV outperformed DL, demonstrating that DL-TV can effectively preserve the spectral information of the scene.
Figure 3 shows the simulation results generated from three different MSIs of the CAVE dataset. The spectral images reconstructed using DL-TV and DL were compared with those generated using the CASSI and DCCHI systems, and the corresponding error images were obtained from the reconstructed spectral image and ground-truth image for comparison. Spectral curves of selected regions were also plotted and compared with the ground truth. Additionally, the Pearson correlation coefficient (corr) was used to assess the fidelity of the recovered spectra. For the CASSI system, TwIST exhibited the poorest imaging quality, whereas GAP-TV and PnP provided slightly improved quality. Table 1, Table 2, Table 3 and Table 4 show that DCCHI can provide relatively better results than CASSI. The error images and spectral curves show that the combination of the proposed system and DL-TV significantly outperforms CASSI and DCCHI. It is clear that the imaging quality achievable by DL-TV is the best, so this approach yields accurate and detailed spectral textures of the target scene. Figure 4 shows the reconstructed spectra of “toy” MSI data. It can be clearly seen that DL-TV provides better reconstructed spectra than the other methods. Figure 5 plots three selected spectral frames from two ICVL scenes. Again, it is clear that DL-TV yields the best results.
In field tests, imaging noise is often introduced due to illumination conditions and imaging sensor noise. Hence, Gaussian noise with standard deviations of σ = 10 ,   20 ,   and   40 was added during the simulation process for verification. Owing to the different noise levels, the regularization parameters must be tuned according to the noise level; in general, the larger the noise, the larger the regularization parameters. Table 5 shows the simulation results for different levels of noise. It can be seen that irrespective of the noise level, the PSNR and SSIM of the spectral images generated by DL-TV are significantly higher than those of the images produced by DL, implying that DL-TV is more robust against noise. The algorithms were run on a Windows 10 64-bit system with AMD R5-4600H and 16 GB RAM, and the calculation time is shown in Table 6.

4. Experiments Using Actual Captured Data

4.1. Indoor Experiments

First, the notch-filter-based multi-aperture multispectral imaging system was built using nine monochromatic cameras, each with a resolution of 1024 × 1280 pixels, and this prototype system was used to capture the ColorChecker card and target image (ISO 12233 2014 eSFR) in an indoor environment. Eight notch-filtered images and one panchromatic image were collected in a single shot. The captured images were then processed using the proposed algorithms to reconstruct 31 bandpass spectral images. The illumination was provided by a spectrum-tunable light source (Thouslite-LEDCube) and was limited to the spectral range of 420–690 nm; therefore, the spectra of the MSIs in the experimental part also range from 420 to 690 nm, with the spectral bands of 28, as also reported in [47]. Figure 6 shows the reconstructed spectrum for 18 color blocks and the reconstructed MSIs for all bands. The exposure time for acquiring the notch-filtered and panchromatic images was 4 ms (frame rate of up to 250 fps). Simultaneously, the bandpass spectral images that were captured by the bandpass filters (THORLABS-FB) were obtained with the same exposure time. Figure 6 reveals that the calculated spectrum of the color block is almost the same as the ground truth captured by the hyperspectral camera (GaiaField_V10E), implying that the proposed system can restore the spectral information of the target scene. Figure 7 compares the hyperspectral images reconstructed by the proposed system with the DL and DL-TV algorithms. For demonstration, an ISO 12233 target was used along with the reconstructed spectral images with central wavelengths of 450, 530, 560, 610, and 660 nm, as shown in Figure 7. Figure 7 demonstrates that in the calculated spectral images, the detailed features and texture information of the ISO 12233 target are well preserved, indicating that the proposed system and algorithm do not reduce the spatial resolution of the target scene. In both scenes, it can be observed that compared to DL, DL-TV can effectively remove the noise from the actual captured data and can restore the target scene with higher accuracy. This experiment demonstrated the effectiveness of the prototype system and proposed algorithm for super-resolution spectral imaging.

4.2. Field Tests

For the field tests, the exposure time for capturing the notch-filtered and panchromatic images was set to 0.2 ms, and the theoretical frame rate of the imaging system was up to 5000 fps. For each single shot, the captured eight notch-filtered images and one panchromatic image were processed with DL-TV to generate 31 multispectral bandpass spectral images with central wavelengths in the range of 400–700 nm and a spectral resolution of 10 nm (i.e., at wavelengths of 400, 410, 420, 430, … 700 nm). Figure 8 shows 5 of the 31 reconstructed spectral images with central wavelengths of 450, 530, 560, 610, and 660 nm, along with a captured panchromatic image.
The field test results demonstrate that the proposed algorithm functioned efficiently for the prototype multispectral imaging system and could provide MSIs with a high frame rate.

5. Discussion

From Table 1, Table 2, Table 3 and Table 4, TwIST, GAP-TV, and PnP based on CASSI had lower PSNR, SSIM, and RMSE values for the CAVE and ICVL data sets than the DCHHI and the notch-multi-aperture system. On the MSIs “beads”, “toy”, and “cloth”, the CASSI reconstruction results had the worst PSNR, SSIM, and RMSE. As shown in Figure 3 and Figure 4, both “beads” and “cloth” had repetitive and complex texture structures. For “clay” with simple texture structure, it can be well reconstructed, as shown in rows 5 and 6 of Figure 3. The coded aperture broke the spatial structure of the scene (as shown in Figure 4); it therefore had difficulty retaining the texture information. DCCHI captured the panchromatic image and coded frame at the same time, obtaining more spatial-texture information than CASSI, so it achieved better PSNR, SSIM, and RMSE; however, its reconstruction–evaluation measurements on “beads” and “cloth” were not very good. The proposed notch-multi-aperture system obtained eight notch images and a panchromatic image at the same time, so it had more information about the target scene than CASSI and DCCHI; as a result, it achieved good evaluation indicators on all tested MSIs.

6. Conclusions

A multispectral imaging system that combines notch filters and multiple apertures is proposed. By taking advantage of the high light efficiency of the notch filters, the exposure time can be reduced to <0.2 ms under natural daylight. The proposed system was evaluated via simulations on the CAVE and ICVL datasets, and the corresponding results were compared with those obtained using the CASSI and DCCHI systems. Evidently, the proposed system can effectively improve imaging quality, and the proposed super-resolution spectral imaging methods provide better reconstruction quality and robustness against noise than those facilitated by the DL algorithm.
The proposed system can be used in medical contexts to capture the morphological and biochemical characteristics required for accurate disease diagnosis or functional analysis during a surgical operation. It is also suitable for dynamic remote-sensing classification, mineral analysis, and monitoring of agricultural produce.
Although the proposed system exhibits a high spatial quality and spectral fidelity, some of its aspects require further improvements. First, when capturing scenes with large depth differences, the registration accuracy of the images captured by different apertures is reduced, which could affect the quality of the reconstructed MSIs. Therefore, a better registration algorithm is required to expand the depth of field of the proposed system. Second, as the DL and DL-TV calculations for 512 × 512 × 31 MSI reconstruction are time-consuming, future work will be focused on developing a faster algorithm for real-time multispectral imaging. A deep learning algorithm could be used to replace the iterative process [48], which would greatly reduce the calculation time.

Author Contributions

Conceptualization, F.H. and P.L.; methodology, P.L. and X.W.; software, P.L.; validation, P.L. and X.W.; formal analysis, F.H and R.C.; investigation, F.H. and P.L.; resources, R.C., B.Z. and X.W.; data curation, F.H.; writing—original draft preparation, P.L.; writing—review and editing, F.H. and X.W.; visualization, F.H., P.L. and X.W.; supervision, R.C., B.Z. and X.W.; project administration, F.H. and X.W.; funding acquisition, F.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Fuzhou University (2019T009, GXRC-18066); Department of Education, Fujian Province (JAT190005).

Data Availability Statement

The dataset, CAVE, is available at: http://www1.cs.columbia.edu/CAVE/databases/multispectral/, accessed on 11 July 2022. The dataset, ICVL, is available at: http://icvl.cs.bgu.ac.il/hyperspectral/, accessed on 11 July 2022.

Acknowledgments

The authors would like to express their gratitude to the anonymous reviewers and editors who worked selflessly to improve our manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Landgrebe, D. The evolution of Landsat data analysis. Photogramm. Eng. Remote Sens. 1997, 63, 859–867. [Google Scholar]
  2. Lu, G.; Fei, B. Medical hyperspectral imaging: A review. J. Biomed. Opt. 2014, 19, 010901. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, L.V.; Wu, H.-I. Biomedical Optics: Principles and Imaging; John Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar]
  4. Gat, N.; Subramanian, S.; Barhen, J.; Toomarian, N. Spectral imaging applications: Remote sensing, environmental monitoring, medicine, military operations, factory automation, and manufacturing. In Proceedings of the 25th AIPR Workshop: Emerging Applications of Computer Vision, Washington, DC, USA, 16–18 October 1996; pp. 63–77. [Google Scholar]
  5. Zhang, M.; Wang, L.; Zhang, L.; Huang, H. High light efficiency snapshot spectral imaging via spatial multiplexing and spectral mixing. Opt. Express 2020, 28, 19837–19850. [Google Scholar] [CrossRef] [PubMed]
  6. Mu, T.; Han, F.; Bao, D.; Zhang, C.; Liang, R. Compact snapshot optically replicating and remapping imaging spectrometer (ORRIS) using a focal plane continuous variable filter. Opt. Lett. 2019, 44, 1281–1284. [Google Scholar] [CrossRef] [PubMed]
  7. Liang, J.; Tian, X.; Ju, H.; Wang, D.; Wu, H.; Ren, L.; Liang, R. Reconfigurable snapshot polarimetric imaging technique through spectral-polarization filtering. Opt. Lett. 2019, 44, 4574–4577. [Google Scholar] [CrossRef] [PubMed]
  8. Goetz, A.F.; Vane, G.; Solomon, J.E.; Rock, B.N. Imaging spectrometry for earth remote sensing. Science 1985, 228, 1147–1153. [Google Scholar] [CrossRef] [PubMed]
  9. Sellar, R.G.; Boreman, G.D. Comparison of relative signal-to-noise ratios of different classes of imaging spectrometer. Appl. Opt. 2005, 44, 1614–1624. [Google Scholar] [CrossRef]
  10. He, Q.; Wang, R. Hyperspectral imaging enabled by an unmodified smartphone for analyzing skin morphological features and monitoring hemodynamics. Biomed. Opt. Express 2020, 11, 895–910. [Google Scholar] [CrossRef]
  11. Genser, N.; Seiler, J.; Kaup, A. Camera array for multi-spectral imaging. IEEE Trans. Image Process. 2020, 29, 9234–9249. [Google Scholar] [CrossRef]
  12. Gehm, M.E.; John, R.; Brady, D.J.; Willett, R.M.; Schulz, T.J. Single-shot compressive spectral imaging with a dual-disperser architecture. Opt. Express 2007, 15, 14013–14027. [Google Scholar] [CrossRef]
  13. Tao, C.; Zhu, H.; Sun, P.; Wu, R.; Zheng, Z. Hyperspectral image recovery based on fusion of coded aperture snapshot spectral imaging and RGB images by guided filtering. Opt. Commun. 2020, 458, 124804. [Google Scholar] [CrossRef]
  14. Wang, L.; Xiong, Z.; Gao, D.; Shi, G.; Wu, F. Dual-camera design for coded aperture snapshot spectral imaging. Appl. Opt. 2015, 54, 848–858. [Google Scholar] [CrossRef] [Green Version]
  15. Wang, L.; Xiong, Z.; Gao, D.; Shi, G.; Zeng, W.; Wu, F. High-speed hyperspectral video acquisition with a dual-camera architecture. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 4942–4950. [Google Scholar]
  16. Wagadarikar, A.A.; Pitsianis, N.P.; Sun, X.; Brady, D.J. Video rate spectral imaging using a coded aperture snapshot spectral imager. Opt. Express 2009, 17, 6368–6388. [Google Scholar] [CrossRef] [PubMed]
  17. Oiknine, Y.; August, I.; Stern, A. Multi-aperture snapshot compressive hyperspectral camera. Opt. Lett. 2018, 43, 5042. [Google Scholar] [CrossRef] [PubMed]
  18. Carles, G.; Chen, S.; Bustin, N.; Downing, J.; McCall, D.; Wood, A.; Harvey, A.R. Multi-aperture foveated imaging. Opt. Lett. 2016, 41, 1869–1872. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  19. Candès, E.J.; Romberg, J.; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly incomplete frequency information. IEEE Trans. Inf. Theory 2006, 52, 489–509. [Google Scholar] [CrossRef] [Green Version]
  20. Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
  21. Gan, L. Block compressed sensing of natural images. In Proceedings of the 15th International Conference on Digital Signal Processing, Cardiff, UK, 1–4 July 2007; pp. 403–406. [Google Scholar]
  22. Fauvel, S.; Ward, R.K. An energy efficient compressed sensing framework for the compression of electroencephalogram signals. Sensors 2014, 14, 1474–1496. [Google Scholar] [CrossRef] [Green Version]
  23. Leinonen, M.; Codreanu, M.; Juntti, M. Sequential compressed sensing with progressive signal reconstruction in wireless sensor networks. IEEE Trans. Wirel. Commun. 2014, 14, 1622–1635. [Google Scholar] [CrossRef]
  24. Candes, E.; Romberg, J. Sparsity and incoherence in compressive sampling. Inverse Probl. 2007, 23, 969. [Google Scholar] [CrossRef] [Green Version]
  25. Duarte, M.F.; Davenport, M.A.; Takhar, D.; Laska, J.N.; Sun, T.; Kelly, K.F.; Baraniuk, R.G. Single-pixel imaging via compressive sampling. IEEE Signal Process. Mag. 2008, 25, 83–91. [Google Scholar] [CrossRef] [Green Version]
  26. Xue, J.; Zhao, Y.Q.; Bu, Y.; Liao, W.; Chan, J.C.W.; Philips, W. Spatial-spectral structured sparse low-rank representation for hyperspectral image super-resolution. IEEE Trans. Image Process. 2021, 30, 3084–3097. [Google Scholar] [CrossRef] [PubMed]
  27. Llull, P.; Liao, X.; Yuan, X.; Yang, J.; Kittle, D.; Carin, L.; Sapiro, G.; Brady, D.J. Coded aperture compressive temporal imaging. Opt. Express 2013, 21, 10526–10545. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Lustig, M.; Donoho, D.L.; Santos, J.M.; Pauly, J.M. Compressed Sensing MRI. IEEE Signal Process. Mag. 2008, 25, 72–82. [Google Scholar] [CrossRef]
  29. Yuan, X. Generalized alternating projection based total variation minimization for compressive sensing. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Phoenix, AZ, USA, 25–28 September 2016; pp. 2539–2543. [Google Scholar]
  30. Bioucas-Dias, J.M.; Figueiredo, M.A. A new TwIST: Two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Trans. Image Process. 2007, 16, 2992–3004. [Google Scholar] [CrossRef] [Green Version]
  31. Chambolle, A. An algorithm for total variation minimization and applications. J. Math. Imaging Vis. 2004, 20, 89–97. [Google Scholar]
  32. Boyd, S.; Parikh, N.; Chu, E.; Peleato, B.; Eckstein, J. Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers. Found. Trends® Mach. Learn. 2011, 3, 1–122. [Google Scholar]
  33. Huang, F.; Lin, P.; Wu, X.; Cao, R.; Zhou, B. Compressive Sensing-Based Super-Resolution Multispectral Imaging System. Available online: https://ui.adsabs.harvard.edu/abs/2022SPIE12169E..34H/abstract (accessed on 27 March 2022).
  34. Lansel, S.; Parmar, M.; Wandell, B.A. Dictionaries for sparse representation and recovery of reflectances. In Proceedings of the IS&T-SPIE Electronic Imaging Symposium, San Jose, CA, USA, 19–20 January 2009. [Google Scholar]
  35. Parkkinen, J.P.S.; Hallikainen, J.; Jaaskelainen, T. Characteristic spectra of Munsell colors. J. Opt. Soc. Am. A 1989, 6, 318–322. [Google Scholar] [CrossRef]
  36. Aharon, M.; Elad, M.; Bruckstein, A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 2006, 54, 4311–4322. [Google Scholar] [CrossRef]
  37. Rudin, L.I.; Osher, S.; Fatemi, E. Nonlinear total variation based noise removal algorithms. Phys. D Nonlinear Phenom. 1992, 60, 259–268. [Google Scholar] [CrossRef]
  38. Donoho, D.L. De-noising by soft-thresholding. IEEE Trans. Inf. Theory 1995, 41, 613–627. [Google Scholar] [CrossRef] [Green Version]
  39. Yun, B.I.; Petković, M.S. Iterative methods based on the signum function approach for solving nonlinear equations. Numer. Algorithms 2009, 52, 649–662. [Google Scholar] [CrossRef]
  40. Singh, D.; Kaur, M.; Jabarulla, M.Y.; Kumar, V.; Lee, H.-N. Evolving fusion-based visibility restoration model for hazy remote sensing images using dynamic differential evolution. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1002214. [Google Scholar] [CrossRef]
  41. Beck, A.; Teboulle, M. Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems. IEEE Trans. Image Process. 2009, 18, 2419–2434. [Google Scholar] [CrossRef] [Green Version]
  42. Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speeded-up robust features (SURF). Comput. Vis. Image Underst. 2008, 110, 346–359. [Google Scholar] [CrossRef]
  43. Yasuma, F.; Mitsunaga, T. Generalized assorted pixel camera: Postcapture control of resolution, dynamic range, and spectrum. IEEE Trans. Image Process. 2010, 19, 2241–2253. [Google Scholar] [CrossRef] [Green Version]
  44. Arad, B.; Ben-Shahar, O. Sparse recovery of hyperspectral signal from natural RGB images. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 8–16 October 2016; pp. 19–34. [Google Scholar]
  45. Zheng, S.; Liu, Y.; Meng, Z.; Qiao, M.; Tong, Z.; Yang, X.; Han, S.; Yuan, X. Deep plug-and-play priors for spectral snapshot compressive imaging. Photonics Res. 2021, 9, B18–B29. [Google Scholar] [CrossRef]
  46. Zhang, S.; Huang, H.; Fu, Y. Fast parallel implementation of dual-camera compressive hyperspectral imaging system. IEEE Trans. Circuits Syst. Video Technol. 2018, 29, 3404–3414. [Google Scholar] [CrossRef]
  47. Tao, C.; Zhu, H.; Wang, X.; Zheng, S.; Xie, Q.; Wang, C.; Wu, R.; Zheng, Z. Compressive single-pixel hyperspectral imaging using RGB sensors. Opt. Express 2021, 29, 11207–11220. [Google Scholar] [CrossRef]
  48. Gregor, K.; LeCun, Y. Learning fast approximations of sparse coding. In Proceedings of the 27th International Conference on Machine Learning, Haifa, Israel, 21–25 June 2010; pp. 399–406. [Google Scholar]
Figure 1. (a) Transmittance of notch filters and (b) their locations in the multi-aperture imaging system. P indicates the aperture used for capturing panchromatic images. The central wavelengths of the notch filters are 405, 457, 488, 514, 532, 561, 632, and 685 nm.
Figure 1. (a) Transmittance of notch filters and (b) their locations in the multi-aperture imaging system. P indicates the aperture used for capturing panchromatic images. The central wavelengths of the notch filters are 405, 457, 488, 514, 532, 561, 632, and 685 nm.
Remotesensing 14 04115 g001
Figure 2. (a) Proposed multi-aperture multisystem, (b) lens array covered by the notch filter array shown in Figure 1b, and (c) images captured by the camera in (b) in a single shot.
Figure 2. (a) Proposed multi-aperture multisystem, (b) lens array covered by the notch filter array shown in Figure 1b, and (c) images captured by the camera in (b) in a single shot.
Remotesensing 14 04115 g002
Figure 3. Simulation results calculated using three different MSIs of the CAVE dataset. The first, third, and fifth columns show the simulation results of “beads”, “cloth”, and “clay” MSIs. Error maps are shown in columns 2, 4, and 6 for comparison. To compare with the ground truth, the spectral curves of selected regions are shown in (af).
Figure 3. Simulation results calculated using three different MSIs of the CAVE dataset. The first, third, and fifth columns show the simulation results of “beads”, “cloth”, and “clay” MSIs. Error maps are shown in columns 2, 4, and 6 for comparison. To compare with the ground truth, the spectral curves of selected regions are shown in (af).
Remotesensing 14 04115 g003
Figure 4. Reconstruction spectra of “toy” MSI of the CAVE dataset. The original RGB image and the coded frame of the scene are shown at the top. The spectra of four color regions are shown on the left. The three reconstructed frames at wavelengths 510, 560, and 640 nm are shown on the right.
Figure 4. Reconstruction spectra of “toy” MSI of the CAVE dataset. The original RGB image and the coded frame of the scene are shown at the top. The spectra of four color regions are shown on the left. The three reconstructed frames at wavelengths 510, 560, and 640 nm are shown on the right.
Remotesensing 14 04115 g004
Figure 5. Simulation results of “BGU1113” (top) and “Labtest1502” (bottom) MSIs of ICVL dataset. To compare with the ground truth, the spectral curves of selected regions are shown in (a,b).
Figure 5. Simulation results of “BGU1113” (top) and “Labtest1502” (bottom) MSIs of ICVL dataset. To compare with the ground truth, the spectral curves of selected regions are shown in (a,b).
Remotesensing 14 04115 g005
Figure 6. Comparison of the spectra of the 18 color blocks between the reconstructed results and ground truth. The reconstructed MSIs are displayed at the bottom. The spectral range is 420–700 nm. The zoomed-in regions for the 500 nm band are displayed in the right part. BP is the bandpass image.
Figure 6. Comparison of the spectra of the 18 color blocks between the reconstructed results and ground truth. The reconstructed MSIs are displayed at the bottom. The spectral range is 420–700 nm. The zoomed-in regions for the 500 nm band are displayed in the right part. BP is the bandpass image.
Remotesensing 14 04115 g006
Figure 7. Comparison between the hyperspectral images reconstructed using the DL and DL-TV algorithms of the proposed system for the target image in the 450, 530, 560, 610, and 660 nm bands. Zoomed-in images for the 660 nm band are displayed as well.
Figure 7. Comparison between the hyperspectral images reconstructed using the DL and DL-TV algorithms of the proposed system for the target image in the 450, 530, 560, 610, and 660 nm bands. Zoomed-in images for the 660 nm band are displayed as well.
Remotesensing 14 04115 g007
Figure 8. Field test results obtained by the proposed multispectral imaging system. The reconstructed MSIs were calculated using the DL and DL-TV algorithms. A total of 5 of the 31 reconstructed spectral images, with central wavelengths of 450, 530, 560, 610, and 660 nm, were selected and are shown. The zoomed-in regions from three selected spectral images are displayed, along with a captured monochromatic reference image.
Figure 8. Field test results obtained by the proposed multispectral imaging system. The reconstructed MSIs were calculated using the DL and DL-TV algorithms. A total of 5 of the 31 reconstructed spectral images, with central wavelengths of 450, 530, 560, 610, and 660 nm, were selected and are shown. The zoomed-in regions from three selected spectral images are displayed, along with a captured monochromatic reference image.
Remotesensing 14 04115 g008
Table 1. Average peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) of the 15 reconstructed MSIs corresponding to the CAVE dataset.
Table 1. Average peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) of the 15 reconstructed MSIs corresponding to the CAVE dataset.
MSIPSNRSSIM
TwIST
[30]
GAP-TV
[29]
PnP
[45]
DCCHI
[15]
DL
[33]
DL-TVTwISTGAP-TVPnPDCCHIDLDL-TV
Balloons31.94133.53034.21538.97042.03942.6300.9500.9430.9550.9900.9900.993
Beads20.78821.87321.87025.18936.58637.0300.5680.5580.5590.8250.9660.969
CD29.61631.97531.52234.48332.68632.7850.9270.9120.9070.9710.9730.977
Toy23.95624.77424.99734.04442.61843.1030.8050.7910.8220.9640.9910.995
Clay31.95432.97932.96036.75444.93846.7200.9070.8930.8970.9520.9760.984
Cloth23.29223.51723.94224.82541.49242.5230.4900.4860.5000.8000.9820.985
Egyptian32.75332.86732.45343.50348.82050.0890.9040.9060.9210.9920.9900.996
Face32.46732.44932.46340.81444.15344.9750.9280.9150.9080.9880.9850.994
Beers30.19032.12734.26539.19442.03742.4560.9400.9280.9520.9840.9910.995
Food30.28231.18632.11436.64344.68845.7060.8720.8530.9040.9520.9870.991
Lemon28.85329.73330.69539.83347.35748.4580.8710.8410.8740.9750.9910.994
Lemons33.48533.28233.19641.85147.56248.6620.9380.9240.9270.9830.9920.996
Peppers28.50130.18730.62136.30744.85445.7440.8930.8930.9080.9500.9890.993
Strawberries32.08931.35330.83241.73847.01948.0650.8950.8700.8870.9710.9910.994
Sushi31.63732.30132.89340.86346.03946.8210.9550.9480.9580.9890.9900.993
Average29.45430.27530.60337.00143.52644.3840.8560.8440.8590.9520.9860.990
Table 2. Average root-mean-square error (RMSE) and spectral angle mapper (SAM) of the 15 reconstructed MSIs corresponding to the CAVE dataset.
Table 2. Average root-mean-square error (RMSE) and spectral angle mapper (SAM) of the 15 reconstructed MSIs corresponding to the CAVE dataset.
MSIRMSESAM
TwISTGAP-TVPnPDCCHIDLDL-TVTwISTGAP-TVPnPDCCHIDLDL-TV
Balloons6.4835.3874.9742.9142.6442.5600.0910.0850.1190.0700.0880.085
Beads23.51820.70220.70914.3664.8774.7380.3040.3110.3100.3130.1090.104
CD8.4986.4556.8074.8487.2867.2040.1210.1340.1930.0940.1300.127
Toy16.24014.78114.3995.5542.2592.1750.1820.1870.2100.1250.0850.075
Clay6.6735.9175.8643.8591.7121.5620.1900.2280.3160.1560.1480.124
Cloth17.63117.11816.40515.9643.0322.8780.1710.1720.1710.3180.0810.080
Egyptian5.9345.8326.1031.7161.1721.0840.2640.2550.3470.1130.1720.140
Face6.1136.1076.0862.4001.9081.8230.1310.1440.2440.0870.1080.095
Beers7.9176.3294.9522.9152.5012.4130.0440.0410.0420.0330.0380.038
Food7.8937.0806.3313.8121.9791.8880.1540.1810.2130.1370.1200.113
Lemon9.2598.3447.4602.6561.3641.2640.1770.2290.2540.1430.1080.099
Lemons5.4215.5395.6052.0901.3151.2240.1070.1170.2170.0870.0820.073
Peppers9.6097.9177.5203.9231.7841.6910.1560.1650.2410.1520.1140.103
Strawberries6.3756.9217.3912.1321.4161.3250.1490.1770.2480.1050.0960.087
Sushi6.7016.2165.7962.3501.7151.6440.1060.1260.1840.0800.1380.130
Average9.6188.7108.4274.7672.4642.365 0.1560.1700.2210.1340.1080.098
Table 3. Average PSNR and SSIM of the 15 reconstructed MSIs for the ICVL dataset.
Table 3. Average PSNR and SSIM of the 15 reconstructed MSIs for the ICVL dataset.
MSIPSNRSSIM
TwISTGAP-TVPnPDCCHIDLDL-TVTwISTGAP-TVPnPDCCHIDLDL-TV
4cam164033.015 33.094 33.632 35.419 43.257 43.817 0.857 0.842 0.845 0.963 0.992 0.994
BGU111325.628 26.814 27.912 33.257 40.845 41.251 0.785 0.770 0.796 0.960 0.990 0.992
BGU113627.366 28.047 28.765 32.470 42.085 42.230 0.801 0.790 0.838 0.963 0.994 0.995
Flower133624.876 26.054 27.235 27.001 40.061 40.459 0.676 0.677 0.714 0.905 0.987 0.988
Labtest150231.674 30.418 29.901 40.858 48.608 49.425 0.879 0.841 0.849 0.982 0.996 0.997
Labtest150437.060 37.019 37.915 42.547 52.464 53.778 0.940 0.931 0.938 0.988 0.998 0.999
CAMP165927.165 28.585 29.365 35.412 39.474 39.738 0.863 0.853 0.850 0.977 0.993 0.995
bgu145931.275 31.042 31.788 37.235 45.490 46.376 0.831 0.819 0.833 0.968 0.989 0.990
bgu152325.717 26.944 27.325 28.526 39.883 40.109 0.763 0.754 0.798 0.930 0.988 0.989
eve154933.527 34.585 35.221 38.622 44.409 44.654 0.898 0.888 0.890 0.977 0.995 0.996
eve160228.396 28.515 28.957 31.834 41.213 41.325 0.823 0.802 0.833 0.951 0.991 0.993
gavyam093032.781 31.870 31.867 40.023 45.644 45.953 0.860 0.831 0.831 0.971 0.994 0.995
grf094927.403 27.865 28.849 30.452 42.080 42.655 0.746 0.737 0.761 0.934 0.989 0.992
hill121926.995 28.057 28.656 27.617 40.198 40.699 0.735 0.729 0.740 0.913 0.987 0.989
hill123528.370 29.203 29.651 28.408 39.655 40.144 0.771 0.767 0.769 0.932 0.988 0.992
Average29.417 29.874 30.469 33.979 43.024 43.508 0.815 0.802 0.819 0.954 0.991 0.993
Table 4. Average RMSE and SAM of the 15 reconstructed MSIs for the ICVL dataset.
Table 4. Average RMSE and SAM of the 15 reconstructed MSIs for the ICVL dataset.
MSIRMSESAM
TwISTGAP-TVPnPDCCHIDLDL-TVTwISTGAP-TVPnPDCCHIDLDL-TV
4cam16405.859 5.725 5.438 4.667 2.206 2.145 0.036 0.040 0.044 0.054 0.037 0.036
BGU111313.823 11.905 10.668 5.954 2.939 2.878 0.074 0.079 0.077 0.084 0.059 0.059
BGU113611.181 10.245 9.362 6.758 2.787 2.808 0.072 0.081 0.077 0.079 0.038 0.038
Flower133614.867 12.976 11.551 12.881 3.782 3.808 0.078 0.073 0.065 0.156 0.049 0.050
Labtest15026.810 7.726 8.181 2.365 1.137 1.072 0.047 0.067 0.066 0.034 0.032 0.031
Labtest15043.696 3.625 3.298 2.037 0.689 0.622 0.045 0.053 0.055 0.060 0.030 0.029
CAMP165911.262 9.612 8.931 4.793 3.874 3.888 0.056 0.053 0.054 0.050 0.040 0.041
bgu14597.307 7.280 6.788 3.712 1.992 1.930 0.075 0.092 0.092 0.078 0.068 0.067
bgu152313.244 11.537 11.000 10.660 3.832 3.875 0.069 0.061 0.064 0.130 0.055 0.056
eve15495.529 4.839 4.536 3.127 2.102 2.107 0.030 0.031 0.034 0.038 0.036 0.036
eve16029.775 9.594 9.108 7.201 3.095 3.109 0.040 0.048 0.054 0.074 0.042 0.042
gavyam09306.061 6.564 6.566 2.567 1.695 1.676 0.065 0.086 0.086 0.052 0.049 0.049
grf094911.263 10.514 9.548 8.421 2.752 2.692 0.062 0.063 0.061 0.112 0.046 0.045
hill121911.601 10.321 9.772 12.098 3.695 3.657 0.053 0.049 0.053 0.122 0.060 0.059
hill12359.963 9.050 8.695 11.065 3.583 3.500 0.040 0.037 0.043 0.100 0.050 0.050
Average9.483 8.768 8.229 6.554 2.677 2.651 0.056 0.061 0.062 0.082 0.0461 0.0459
Table 5. Average PSNR and SSIM of 15 reconstructed MSIs under different levels of noise, obtained using the CAVE dataset.
Table 5. Average PSNR and SSIM of 15 reconstructed MSIs under different levels of noise, obtained using the CAVE dataset.
MSIσ = 10σ = 20σ = 40
PSNRSSIMPSNRSSIMPSNRSSIM
DLDL-TVDLDL-TVDLDL-TVDLDL-TVDLDL-TVDLDL-TV
Balloons32.63736.6470.7420.91929.30934.0790.5430.86325.48030.9810.3540.683
Beads30.32831.1140.8240.87827.93228.6130.6950.81624.67027.2620.5440.720
CD29.45331.1150.7310.88827.65829.7120.5570.81925.49628.3270.4030.651
Toy33.53035.6050.7980.90930.20633.2090.6570.85926.82230.8670.4970.708
Clay34.72136.6940.7070.80131.09733.8680.4860.70827.23431.1170.3020.484
Cloth32.94034.6690.7830.87228.88731.5740.6140.81024.80229.1300.4510.683
Egyptian36.57140.1080.8210.89832.93237.5680.6380.81329.33233.5290.4260.593
Face33.95137.1190.6780.85530.70934.5590.5010.78527.38531.4650.3360.571
Beers32.08435.1470.6990.91428.08332.3880.4640.85724.15629.7260.2710.673
Food34.61435.5350.7900.87930.72732.9650.6140.82226.90930.4460.4420.661
Lemon35.02437.9470.7870.90831.47535.4220.6410.85728.02632.2040.4800.699
Lemons34.90537.7660.7560.88631.03235.0760.5710.82327.30131.8420.4060.649
Peppers34.54135.9010.7790.88530.71433.1350.5990.82526.68130.5530.4210.655
Strawberries35.07638.3410.7600.87831.33635.8640.5890.82027.90732.5470.4230.648
Sushi36.02438.3280.7860.89932.27436.0610.6210.84329.24232.7310.4660.668
Average33.76036.1360.7630.88530.29233.6060.5860.82126.76330.8490.4150.650
Table 6. Comparison of reconstruction time in seconds between different methods.
Table 6. Comparison of reconstruction time in seconds between different methods.
MethodTwISTGPA-TVPnPDCCHIDLDL-TV
Time500.5569.8540.8525.66.464.1
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Huang, F.; Lin, P.; Cao, R.; Zhou, B.; Wu, X. Dictionary Learning- and Total Variation-Based High-Light-Efficiency Snapshot Multi-Aperture Spectral Imaging. Remote Sens. 2022, 14, 4115. https://doi.org/10.3390/rs14164115

AMA Style

Huang F, Lin P, Cao R, Zhou B, Wu X. Dictionary Learning- and Total Variation-Based High-Light-Efficiency Snapshot Multi-Aperture Spectral Imaging. Remote Sensing. 2022; 14(16):4115. https://doi.org/10.3390/rs14164115

Chicago/Turabian Style

Huang, Feng, Peng Lin, Rongjin Cao, Bin Zhou, and Xianyu Wu. 2022. "Dictionary Learning- and Total Variation-Based High-Light-Efficiency Snapshot Multi-Aperture Spectral Imaging" Remote Sensing 14, no. 16: 4115. https://doi.org/10.3390/rs14164115

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop