Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder

Heiser, Yaron; Stern, Adrian

doi:10.3390/rs14153766

Open AccessArticle

Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder

by

Yaron Heiser

^* and

Adrian Stern

Electro-Optics and Photonics Department, School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva 8410501, Israel

^*

Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(15), 3766; https://doi.org/10.3390/rs14153766

Submission received: 8 July 2022 / Revised: 31 July 2022 / Accepted: 2 August 2022 / Published: 5 August 2022

(This article belongs to the Special Issue Scalable and Credible Artificial Intelligence for Remote Sensing Imagery Understanding)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Designing and optimizing systems by end-to-end deep learning is a recently emerging field. We present a novel physics-constrained autoencoder (PyCAE) for the design and optimization of a physically realizable sensing model. As a case study, we design a compressive hyperspectral imaging system for remote sensing based on this approach, which allows capturing hundreds of spectral bands with as few as four compressed measurements. We demonstrate our deep learning approach to design spectral compression with a spectral light modulator (SpLM) encoder and a reconstruction neural network decoder. The SpLM consists of a set of modified Fabry–Pérot resonator (mFPR) etalons that are designed to have a staircase-shaped geometry. Each stair occupies a few pixel columns of a push-broom-like spectral imager. The mFPR’s stairs can sample the earth terrain in along-track scanning from an airborne or spaceborne moving platform. The SpLM is jointly designed with an autoencoder by a data-driven approach, while spectra from remote sensing databases are used to train the system. The SpLM’s parameters are optimized by integrating its physically realizable sensing model in the encoder part of the PyCAE. The decoder part of the PyCAE implements the spectral reconstruction.

Keywords:

design learning; deep learning; compressive hyperspectral imaging; remote sensing

1. Introduction

In recent years, deep learning has been playing a major role in the field of hyperspectral imaging (HSI) [1,2,3,4,5,6,7,8,9,10,11]. Various deep neural networks (DNNs) have been proposed to classify spectra features and segmentation [3,5] and use band selection for better classification [10,11]. Recently, DNNs have been used for the reconstruction of high-dimensional spectra from a few bands of the spectrum. These spectra are acquired by compressive sensing HSI (CS HSI) techniques, which reduce the storage capacity occupied by the huge HS cubes [4,6,7] and reduce the effort associated with their acquisition.

Over the last decades, spectral imaging [12,13,14,15] has become progressively utilized in airborne and spaceborne remote sensing tasks [16,17], among many others. Today, it is used in many fields, such as vegetation science [18], urban mapping [19], geology [20], mineralogy [21], mine detection [22], and more. Most such spectral imagers take advantage of the platform’s motion and may acquire the spectral data from the visible to the infrared wavelength ranges, by performing along-track spatial scanning of the imaged object [14,23]. The spectral information can then be measured directly, that is, by first splitting it into spectral components using dispersive or diffractive optical elements, followed by direct measurement of each component. Another approach is to acquire the spectral information indirectly, incorporating multiplexed and coded spectral measurements. This allows the benefit from Fellgett’s multiplex advantage [24,25] and, therefore, provides a significant gain in optical throughput, at the cost of less intuitive system design and the need for post-processing. A well-known example of indirect spectral acquisition is Fourier transform spectroscopy, which is used both for spectrometry [26] and spectral imaging [27,28]. As conventional Fourier transform spectroscopy systems are based on mechanically scanned Michelson of Mach–Zehnder interferometers, they demand strict stability and precision requirements. Thus, in harsh aerial and space environments, significant sophistication in their mechanical construction is required, and high-cost translational stages are incorporated to preserve the interferometric stability [29,30]. The indirect spectral acquisition can also be achieved by utilizing voltage-driven liquid crystal phase retarders [31,32,33,34,35,36], evading the need for incorporating moving parts, which results in these strict stability requirements. One such system is the compressive sensing miniature ultra-spectral imaging (CS-MUSI) system presented in [32,33,35]. It performs spectral multiplexing by applying a variable voltage to a thick liquid crystal cell (LCC), thus achieving distinctive wideband spectral encodings.

Our group took the idea of CS-MUSI a step further to the remote sensing area by introducing a design that can be used with push-broom type scanning. In [37], we presented a wedge-shaped LCC that exploits a satellite’s motion along an orbit. Pixels of columns of the sensor behind the LCC capture the cross-track strips of the ground, each with a specific spectral modulation according to the height of the nematic liquid in front of the columns. The height of the nematic liquid changes linearly with the columns due to the geometric shape of the wedge (Figure 1b). Unlike CS-MUSI, wedge-shaped LCC passively changes the modulation of the light according to its geometrical shape; therefore, there is no need to apply different voltages to generate spectral modulations. Among the distinct properties of the wedge-shaped LCC-based spectral imager in [37] are its outstanding optical throughput (about two orders of magnitude higher than conventional HSI) and the ability to switch to panchromatic imaging mode.

With the wedge-shaped LCC (Figure 1b), the different modulations obtained by the continuously changed nematic liquid depth in front of various sensor’s columns determine the spectral linear sensing process that produces the spectral cube according to Equation (1).

g = Φ f,

(1)

where

g \in ℝ^{N_{c}}

is the compressed spectrum,

N_{c}

is the number of measurements,

Φ \in ℝ^{N_{c} \times N_{λ}}

is the spectral sensing matrix,

N_{λ}

is the number of spectral bands in the incoming signal, and

f \in ℝ^{N_{λ}}

is the observed spectrum. On the other hand, the wedge-shaped LCC spectral modulation has a huge disadvantage, because the captured spectrum measurements are redundant, and thus, its acquired HS cube requires large memory storage and huge bandwidth capacity.

Figure 1a depicts the acquisition of the HS cube from the satellite. It describes the scanning along-track of an airborne or a spaceborne moving platform (in this case, a satellite camera equipped with an SpLM holding a staircase set of mFPR etalons [38]) scanning the earth terrain with a field of view (FoV) consisting of a sequence of the cross-track slices from the various stairs of the SpLM. The imager’s optics projects the scene on the mFPR etalons, and each etalon is placed in front of a sensor column, thus capturing the spectrally encoded spectrum of a cross-track strip on the ground. The stair may occupy multiple sensor columns so that upon appropriate processing (e.g., registration and averaging) it improves the robustness of the measurements. Figure 1b,d illustrates the difference between scanning the earth terrain with a wedge-shaped LCC and scanning the earth terrain with a staircase-shaped SpLM. In Figure 1b a continuous slant of the wedge-shaped LCC in front of the sensor’s columns dictates that each column should participate in the sampling with its own modulation and the resulting sensing matrix is overcomplete (e.g., taking 1250 measurements for 224 spectral bands).

In this paper, we keep evolving from the wedge-shaped LCC concept (based on CS-MUSI) [32,35], by using a set of mFPR etalons for SpLM (Figure 1d). In the staircase design, the width of the stairs is equal among all the stairs, so the image can be acquired continuously with the motion of the satellite in its orbit. The staircase design yields a novel CS HSI that:

(1): Compresses the spectral information of the scene from $N_{λ}$ spectral bands to $N_{s}$ measurements respective to $N_{s}$ stairs in the SpLM,

(2): Considers mFPR technology for the SpLM, which is applicable in a much broader spectral regime than the wedge-shaped LCC. Thus, instead of the wedge-shaped LCC (Figure 1b), here, we use a staircase-shaped set of mFPR etalons (see Figure 1d). The advantage of mFPR is that it can be used for a wider spectral range, including visible light and the SWIR ( $0.4 μ m - 2.5 μ m$ ).

The staircase shape of the SpLM allows it to sample the terrain with an undercomplete spectral sensing matrix; that is, to compress the signal to a few measurements only, while each stair occupies a few pixel columns. In this way, each measurement can be averaged from a few pixels in its corresponding row for the robustness of the measurement. Together, they determine the compressive spectral sensing matrix for each row

x

under each stair

y

, which is used to compress the input signal. Similar to Equation (1), the sensing model is now described by Equation (2):

g_{c s} = Φ_{c s} f,

(2)

where

g_{c s} \in ℝ^{N_{s}}

is the compressed spectrum,

N_{s}

is the number of stairs (measurements), and

Φ_{c s} \in ℝ^{N_{s} \times N_{λ}}

is the compressive spectral sensing matrix.

In this paper, the optimal set of stairs height is learned by the PyCAE in an end-to-end fashion, from samples of remote-sensing HS cubes (i.e., Pavia center and Salinas valley [39]). A detailed description of the PyCAE will be given in the next section. After optimizing the height of the stairs, the encoder of the PyCAE can be replaced by a staircase-shaped set of mFPRs, and measurements from this SpLM are reconstructed by the PyCAE decoder.

Our contribution is as follows:

We introduce a novel approach for learning the design and optimization of physical parameters of a system, by using a physics-constrained autoencoder (PyCAE).
We introduce a new concept of a compressive sensing hyperspectral imager for remote sensing with high throughput and an extremely high compressive ratio of the spectral dimension.

2. Materials and Methods

2.1. Physics-Constrained Autoencoder (PyCAE)

The proposed system consists of an encoder that emulates hardware with relay optics that images the light from the scene through the SpLM onto the sensor and software, which is a reconstruction DNN. The hardware can be modeled as a “sensing layer” and serves as the encoder of the PyCAE, while the reconstruction DNN serves as a decoder (Figure 2). The coupling between them holds the compressed measurements while serving as the bottleneck of the autoencoder. Unlike encoders of conventional deep learning autoencoders, in our model, the encoder is constrained to comply with the physics of our optical sensing hardware. Figure 2 depicts the general scheme of the system, including the deep learning model and respective sensing and reconstruction system.

PyCAE jointly optimizes the physical parameters that determine the values of the sensing matrix with the weights of the reconstruction DNN. Through its training process, we solve the following optimization problem:

{\hat{θ}, \hat{W}} = \underset{θ, W}{argmin} E_{f} ℒ (N_{θ, W} (f), f) s . t . \in S_{p}

(3)

where

N_{θ, W} = D_{W} \circ E_{θ}

is the PyCAE DNN consisting of the encoder

E_{θ}

, which models the sensing matrix

Φ_{θ}

, which depends on the physical parameters

θ

, and of the decoder DNN

D_{W}

with the trained weights W.

ℒ

denotes the loss function (Section 2.6), and

E_{f}

denotes the expectation operator, practically evaluated as an average over the training set of spectra

f

.

S_{p}

denotes a constrained set, in our case, dictated by the physics of the spectral modulation (see Section 2.2).

After training the autoencoder, the encoder “sensing layer” is detached from the decoder and replaced by the real physical device, which feeds its measurements into the reconstruction DNN,

D_{W} (g)

, where

g

holds the measurements in the bottleneck of the PyCAE. In our case, the sensing layer is mathematically modeled by

Φ_{c s}

, which represents the transmission of the SpLM, depending on the parameters of the mFPR (see Section 2.2).

For training purposes, we feed a high-dimensional spectrum

f \in ℝ^{N_{λ}}

with

N_{λ}

spectral dimensions as an input of the encoder and feed the same spectrum to the autoencoder’s output, while the bottleneck holds

N_{s} ≪ N_{λ}

compressed measurements,

g \in ℝ^{N_{s}}

.

2.2. The SpLM and Its Model

In [38], we introduced the modified Fabry–Pérot Resonator (mFPR) for compressive hyperspectral imaging [36]. To obtain different spectral modulations for different measurements, refs. [38,40] modified the classical etalons in two fundamental ways. First, a wide range of gaps between the mirrors of the etalons is used to obtain more than a single spectral transmission peak, as in the case of classical FP-etalon. Second, the reflectivity of the mirrors is reduced by about

20 % - 30 %

to obtain multiple and wider spectral peaks. The array of the stairs set of the staircase-shaped mFPR is particularly adequate for this task, because it has good modulation, high throughput, and supports the visible and SWIR spectrum.

Here, we implement these etalons in the shape of stairs, so they can be used for compressive scanning imaging in the vertical direction of the stairs. We also model a BK7 glass as a substance to fill the etalons gaps, because it has good transparency to all the spectrum ranges we work on.

The spectral transmission of an ideal FP resonator is given by Equations (4) and (5) [38,40]:

T (λ) = \frac{1}{1 + F s i n^{2} (\frac{δ (λ)}{2})},

(4)

where

δ

is the optical thickness, and

F

is the finesse coefficients that are equal to

δ (λ) = \frac{2 π}{λ} 2 n d c o s (α); F = \frac{4 R}{{(1 - R)}^{2}}

(5)

where

λ

is the wavelength,

n

is the refractive index of the material between the two FP mirrors, and

α

is the incident angle of light.

R

is the reflectivity of both mirrors that defines the full width at half-maximum (FWHM) of the spectral peaks.

d

is the gap between the mirrors, which is responsible for the spectral modulation shape. As the gap grows, the number of the orders grows as well (i.e., the number of the spectral peaks). The gaps between the mirrors

d

are the parameters that need to be optimized by the PyCAE to give a series of optimized modulations to construct an efficient sensing matrix. The spectral transmission

T (λ)

is then the modulation of each stair of the SpLM, which is one line of the spectral sensing matrix

Φ_{θ}

.

2.3. Encoder—Sensing Layer

Central to the PyCAE is the encoder, which models the sensing layer and reflects the spectral sensing matrix

Φ_{θ}

. It represents a set of mFPR etalons forming the staircase shape (Figure 1d). Each stair in the staircase is an mFPR etalon whose height determines the gap

{d_{i}}_{i = 1}^{N_{s}}

between the mirrors of the etalon. By establishing each etalon’s gap

d

, the respective modulation of the spectral transmittance

T (λ)

(given by Equation (4)) determines a row of the sensing matrix

Φ_{θ}

. The bottleneck of the PyCAE,

g

, is the output of the encoder, and it holds the measurements according to Equation (6):

g_{i} = \sum_{j \in λ} T_{i j} f_{j}, i = 1, \dots, N_{s},

(6)

where

T_{i j}

is the

j^{t h}

wavelength transmittance of the mFPR at the

i^{t h}

stair (i.e.,

T_{i j} = T (λ_{j}; d_{i})

), and

f_{j}

is the intensity of the

j^{t h}

element of the spectrum of the incident light.

The encoder is used to optimize the SpLM’s parameters

θ

(e.g., the height of the stair-shaped mFPR’s etalons,

{d_{i}}_{i = 1}^{N_{s}}

). It implements the physical model (Equation (4)) as its activation function, while the layer’s weights which reflect the height of the stairs are optimized parameters that are improved together with the reconstruction network’s weights at each iteration.

The optimized PyCAE’s encoder reflects the SpLM’s set of mFPR etalons (Figure 2); hence, the optimization process must consider the stairs’ physical constraints and the production limitations. To do so, we use the straight-through estimator (STE) technique, which was originally proposed by [41] for binary neural networks.

2.4. Straight-Through Estimator (STE)

To constrain the weights that represent the height of the stairs of the SpLM that are parametrized in the encoder, such as to be in a certain range of height and to have a minimum step of height between consecutive stairs, we use the STE algorithm.

According to the STE, at the feedforward phase, the input of the layer flows through the layer’s weights to the activation function of the layer, as if it had been the identity function [42]. But in the backpropagation phase, the flow of the gradients does not directly update the weights that represent the stairs’ heights

{d_{i}}_{i = 1}^{N_{s}}

. Instead, the gradients update the weights according to Equations (7)–(9):

0.1 μ m \geq θ_{i} \geq 10 μ m,

(7)

θ_{i} = {\begin{matrix} \frac{⌊ θ_{i} \cdot 10 ⌋}{10} + 0.05, & \frac{⌊ θ_{i} \cdot 10 ⌋}{10} + 0.05 < θ_{i} < \frac{⌊ θ_{i} \cdot 10 ⌋}{10} + 0.1 \\ \frac{⌊ θ_{i} \cdot 10 ⌋}{10}, & \frac{⌊ θ_{i} \cdot 10 ⌋}{10} < θ_{i} \leq \frac{⌊ θ_{i} \cdot 10 ⌋}{10} + 0.05 \end{matrix}

(8)

θ_{i} ≔ θ_{i} - η Δ L (θ_{i}), i = 1, \dots, N_{s}

(9)

Equation (7) determines the range of depths for all the mFPR stairs between

0.1 μ m

and

10 μ m

, and Equation (8) constrains the smallest step between consecutive stairs to be

50 nm

. In Equation (9),

θ_{i}

of the encoder is the

i^{t h}

weight, which represents a gap

d_{i}

of the

i^{t h}

mFPR etalon to be updated;

η

is the learning rate; and

\partial L (θ_{i})

is the gradient, which backpropagates through the set of constraint functions as if it had been the identity of the activation function

T (d)

(the function that represents the action of the mFPR etalons with respect to

d_{i}

).

\partial L (θ)

is calculated by the chain rule from the loss function at the output of the decoder, as shown in Equation (10):

\frac{\partial L (θ)}{\partial θ} = \frac{\partial L}{\partial w_{n}} \cdot \frac{\partial w_{n}}{\partial a_{n - 1}} \dots \frac{\partial a_{i}}{\partial w_{i}} \cdot \frac{\partial w_{i}}{\partial a_{i - 1}} \dots \frac{\partial a_{1}}{\partial w_{1}} \cdot \frac{\partial w_{1}}{\partial T (θ)} \cdot \frac{\partial T (θ)}{\partial θ},

(10)

where

n

is the number of layers in the decoder,

w_{i}

are the weights of the

i^{t h}

layer of the decoder, and

a_{i}

is the activation of the

i^{t h}

layer of the decoder.

Equation (11) shows the gradient of the mFPR etalons transmittance (Equations (4) and (5)) with respect to the depth

d_{i}

of the etalons for an incident angle of light

α = 0^{°}

:

\frac{\partial L (θ)}{\partial θ} \equiv \frac{\partial T (d)}{\partial d} = \frac{- 4 π n F s i n (\frac{2 π n}{λ} d) \cos (\frac{2 π n}{λ} d)}{λ {(1 + F s i n^{2} (\frac{2 π n}{λ} d))}^{2}},

(11)

Figure 3 and Algorithm 1 depict the STE mechanism:

Algorithm 1. Straight-Through Estimator (STE)

Initialization: Initialize weights due to model design constraints. (e.g., all weights value are between the minimum and maximum boundaries)
Feedforward: Use current weights to flow information straight through the physical formula of the model.
Backpropagation:

Update weights by gradients flow through all layers from the output layer to the input layer (sensing layer).
Deploy constraints on the updated weights to fit model requirements. (e.g., clip values between minimum and maximum boundaries).

In the case of a set of mFPR stairs of our SpLM, we constrain the range of the stair heights in the range of

0.1 μ m - 10 μ m

by clipping the weights in each iteration and limiting the minimum step between two stairs to

50 nm

, by binning the weights in bins of

50 nm

.

2.5. Decoder—Reconstruction DNN

The decoder of PyCAE serves as a reconstruction DNN. We designed it as a residual DNN following the general scheme of the DR2-net [43]. DR2-net was developed initially to reconstruct 2D images from compressed images; therefore, we adapted it to accept 1-dimensional (1D) spectra, as we successfully did in [6]. We modified the kernel sizes for each layer in a residual block (as they are hyperparameters of the network) to a 1D kernel.

Figure 4a schemes the basic structure of the reconstruction DNN. The first layer of the reconstruction DNN is a fully connected layer that has

N_{λ}

nodes as the

N_{λ}

spectral bands in the label of the spectrum. This layer is followed by batch normalization and a ‘Relu’ activation function. The layer provides a first approximation of the spectrum. The first approximation is followed by a train of residual blocks, where each block consists of 3 convolutional layers and a skip connection, as shown in Figure 4a. The first layer has 64 kernels of size 9, the second layer has 32 kernels of size 1, and the third layer has 1 filter with a kernel consisting of 7 elements. Each layer is followed by batch normalization and a ‘Relu’ activation function. The last layer of the reconstruction DNN is a convolutional layer with 1 filter, which has 1 element and no activation function. Because the task of the reconstruction network is to reconstruct a high-dimensional space from a very small dimensional space, this task is challenging. The transformation of a compressed spectrum

g

to a fully high-dimensional spectrum

\hat{f}

is hard because it is an ill-posed problem, where the compressed signal

g

is transformed from a narrow subspace to a wider subspace. To mitigate this, we employ a progressive reconstruction approach inspired by the progressive upsampling super-resolution algorithms [44]. Our network upsamples the signal in 4 steps to intermediate sizes and upsamples it again to the final size of the original spectrum, where each step has a train of 5 residual blocks. The number of subnetworks is a hyperparameter of the system. Figure 4b shows 2 steps of a progressive upsampling network.

2.6. Loss Function and Metrics for the PyCAE

We use the mean-square-error (MSE) as the loss function of the PyCAE to reduce the differences between sample pairs (input–label) from the training set in the training phase. The MSE is defined as follows:

M S E (\hat{f}, f) = \frac{1}{n} \sum_{i = 1}^{n} ∥ {\hat{f}}_{i} - f_{i} ∥^{2}

(12)

where

n

is the number of spectral bands in the signal

f \in ℝ^{N_{λ}}

, and

\hat{f} \in ℝ^{N_{λ}}

is the estimation of signal

f

.

To evaluate the reconstruction performance, we use two widely used metrics:

(1): Structural similarity index measure (SSIM) [45] to assess the similarity between the structure of the estimated signal to the label as follows:

$S S I M (\hat{f}, f) = \frac{(2 μ_{\hat{f}} μ_{f} + c_{1}) (2 σ_{\hat{f} f} + c_{2})}{(μ_{\hat{f}}^{2} + μ_{f}^{2} + c_{1}) (σ_{\hat{f}}^{2} + σ_{f}^{2} + c_{2})},$

(13)

where $\hat{f} a n d f$ are the estimated and original spectrum, respectively; $μ_{\hat{f}} a n d μ_{f}$ are the averages of the estimated and original spectrum, respectively; $σ_{\hat{f}}^{2} a n d σ_{f}^{2}$ are the variances of the estimated and original spectrum, respectively; and $σ_{\hat{f} f}$ is the covariance of the estimated and original spectrum.

c_{1} = {(k_{1} L)}^{2} a n d c_{2} = {(k_{2} L)}^{2}

are two variables to stabilize the division with a weak denominator,

L

is the dynamic range of the spectral bands, and

k_{1} a n d k_{2}

are constants where

k_{1} = 0.01

and

k_{2} = 0.03

by default.

(2): Spectral angle mapper (SAM) [46] evaluates the spectral information preservation degree between the label to the reconstructed spectrum:

$S A M (\hat{f}, f) = c o s^{- 1} (\frac{〈 \hat{f}, f 〉}{{∥ \hat{f} ∥}_{2} {∥ f ∥}_{2}}),$

(14)

where $\hat{f} a n d f$ are the estimated and original spectrum, respectively.

3. Results

3.1. Training Details

To train the PyCAE for spectra reconstruction, we need to initialize the etalon gaps parameters for the activation function according to Equations (4) and (5) The reflectivity of the mirrors is

R ~ 0.8

; hence, the finesse is

F ~ 80

. We take the incident angle

α

to be

0^{°}

, and the refraction index

n

is taken to be the BK7 refractive index for each band, depending on the HS cube dataset under analysis.

We used the ADAM optimizer with a learning rate of 0.001 and a learning rate scheduler that reduces the learning rate by a factor of

e^{- 0.1}

after the 3 first epochs.

We trained the PyCAE on two common remote sensing HS cube datasets, Pavia center, and Salinas valley from [39], and evaluated the model on their validation set. We also evaluated its generalization on another HS cube; we evaluated the performance on the Pavia University dataset with the model trained on the Pavia center dataset.

3.1.1. Datasets—Pavia Center and Salinas Valley

The Pavia center HS cube from [39], consists of two concatenated images of scenes that were acquired by the ROSIS sensor over Pavia center, Italy, with 102 spectral bands from

430 nm

to

850 nm

and with

1096 \times 1096

spatial pixels as shown in Figure 5. The spectra intensity was normalized to be between 0 and 1. The dataset was augmented by injecting Gaussian noise with standard deviation (STD)

σ = 0.004

. Then, the HS cube was divided into a training set, which includes the spectra at 70% of the pixels, and a test set, which includes 30% of the pixels.

The Salinas scene from [39], was acquired by the AVIRIS sensor over Salinas Valley, California, with 224 spectral bands from

400 nm

to

2500 nm

and with

512 \times 217

spatial pixels. The spectra intensity was normalized to be between 0 to 1. The dataset was augmented by injecting Gaussian noise with standard deviation (STD)

σ = 0.001

. Then, the HS cube was divided into a training set, which includes the spectra of 70% of the spatial pixels, and a test set, which includes 30% of the pixels.

The data of the Pavia center dataset has a wider variance than that of the Salinas valley dataset. In Section 3.3, Section 3.4 and Section 3.5, it can be seen that very few spectra in the dataset are needed to reconstruct the Salinas valley dataset (only 111,104 spectra in the HS cube) with better reconstructions than in the Pavia center and Pavia University (Section 3.2.) datasets (Pavia center includes 1,201,216 spectra in its HS cube), even though its compression ratio is twice smaller. This is due to the tight variance of its data compared to the variance of spectra in the Pavia center and Pavia University datasets.

3.1.2. Pavia University—Generalization Evaluation

In Section 3.1.1, we trained the PyCAE on 70% spectra of one dataset (Pavia center or Salinas valley) and validated and tested it on the remaining 30% spectra. Because many of the spectra are from the same materials under evaluation, they are almost the same, with small deviations inside each group of materials. For this reason, they show a particularly good reconstruction performance.

To evaluate the generalization of the PyCAE on a complete unseen dataset, we used the Pavia University dataset from [39] (Figure 6) and consists of a spatial resolution of

610 \times 610

pixels as a test set that didn’t participate in the training phase. Since the Pavia University HS cube was acquired in the same spectral range

(430 nm - 850 nm)

as the Pavia center HS cube, we evaluated the PyCAE that was trained on the Pavia center dataset with the Pavia University dataset.

3.2. The Optimized SpLMs

During the training, the SpLM’s geometries were optimized for each dataset and specifically for each version of the PyCAE, where each version has a different number of mFPR stairs. Accordingly, the compatible sensing matrices were optimized. After the optimization, the order of the stairs of the SpLM can be sorted in an increasing order of heights to mitigate the manufacturing process. Then, the resulting measures from the sensor must be registered to the decoder (reconstruction DNN) in the same order it was optimized. As an example, Figure 7 shows the height profile of the learned SpLMs and their respective sensing matrices of the Pavia center for each configuration. The learned profiles of the SpLMs with eight, six, and four staircases are shown on the left-hand side. The SpLMs with the sorted staircase are shown in the center. On the right-hand side of Figure 7, the respective sensing matrix,

ϕ_{C S}

, is shown. Each row in the sensing matrix represents the spectral transmission of the mFPR realized in each stair. It can be seen that shallow stairs have low-frequency modulation, while deep stairs have high-frequency modulation. The number of stairs determines the number of rows in the sensing matrix, M, and thus the number of different modulated measurements.

3.3. The Reconstruction DNN

The decoder of the PyCAE (Figure 4) is the software part used for HSI reconstruction, and as such, we report here its running speed on a GPU and its number of parameters. The runtime of the reconstruction DNN implemented by the Tensorflow 2 framework on Nvidia^® RTX 3090 TI GPU is 32 msec., both for PyCAE that was trained on the Pavia center dataset and for the PyCAE that was trained on the Salinas valley dataset. It can be seen that the run time is much shorter than that of common iterative algorithms [7].

The reconstruction DNN (i.e., the decoder of PyCAE), which was trained on the Pavia center dataset, has 672,201 trainable parameters and 4508 nontrainable parameters (total of 676,709 parameters) for PyCAE with 4 measurement configurations, 689,519 trainable parameters and 4516 nontrainable parameters (total of 694,035 parameters) for PyCAE with 6 measurement configurations, and 706,827 trainable parameters and 4524 nontrainable parameters (total of 711,351 parameters) for PyCAE with 8 measurement configurations.

The decoder of PyCAE, which was trained on the Salinas valley dataset, has 1,402,809 trainable parameters and 5240 nontrainable parameters (total of 1,408,049 parameters) for PyCAE with 4 measurement configurations, 1,420,635 trainable parameters and 5248 nontrainable parameters (total of 1,425,883 parameters) for PyCAE with 6 measurement configurations, and 1,438,425 trainable parameters and 5256 nontrainable parameters (total of 1,443,681 parameters) for PyCAE with 8 measurement configurations.

3.4. Salinas Valley

Figure 8 shows the results of spectra reconstruction with 224 spectral bands in the visible-light and SWIR domains evaluated with 8, 6, and 4 measurements. The reconstructions of the Salinas valley show excellent performance. Table 1 shows the SSIM and SAM metrics evaluated over the entire test set.

3.5. Pavia Center

Figure 9 shows representative results of spectra reconstruction with 102 spectral bands, evaluated with 8, 6, and 4 measurements. Table 2 summarizes the results of the mean values for SSIM and SAM over the validation/test set.

3.6. Generalization—Pavia University

The Pavia center dataset was augmented by adding noise to let the PyCAE evade overfitting. Also, it was divided into 70% for the training set and 30% for the validation/test set. In this section, we evaluate the PyCAE trained on the Pavia center dataset to predict spectra from the Pavia University dataset, which was acquired by the same spectral imaging system on a different scene (Figure 6). Figure 10 shows the results of spectra reconstruction with 103 spectral bands, evaluated with 8, 6, and 4 measurements. Table 3 summarizes the results of the mean values for SSIM and SAM over the Pavia University test set. This test set was not introduced to the PyCAE during the training process, which was performed on the Pavia center dataset. It can be seen in Figure 10 and Table 3 that the reconstructions of unseen spectra from the Pavia University test set exhibit similar performance obtained with the validation set from the Pavia center.

3.7. Robustness to Quantization Noise

The SpLM modulates the incident light reaching the sensor, but the sensor adds various kinds of noise. Quantization noise is inherently part of the sensing process, and it depends on the number of bits per pixel. It can distort the compressed measurements, which the reconstruction of the spectrum relies on. Here, we evaluate the impact of quantization noise that cannot be avoided in any digital sampling process of a continuous signal.

We normalized the intensity of spectra in both the Pavia center and Salinas valley datasets in the range of

R = [0, 1]

. To evaluate the impact of quantization noise on the reconstruction, we evaluate the quantization impact on sensors that have pixels with

b = 16, 12, 10

, and 8 bits, so the number of quantization levels

L = 2^{b}

, is 65,536, 4096, 2048, and 256, respectively.

The quantization noise (QN) of the compressed spectrum

g

is:

Q N ~ U [- \frac{Δ}{2}, \frac{Δ}{2}], Δ = \frac{\max (g)}{L}

(15)

where

b

is the number of bits in a pixel,

g

are the compressed measurements and

U

is the uniform distribution [47].

Figure 11 demonstrates the robustness of the PyCAE to these levels of quantization noise and Table 4 summarizes the mean SSIM and the mean of SAM over the Salinas Valley dataset.

4. Conclusions

The PyCAE was trained on the Pavia center dataset with

\frac{8}{102} = 7.8 %, \frac{6}{102} = 5.9 %

, and

\frac{4}{102} = 3.9 %

compression ratios. It was also trained on the Salinas valley dataset with

\frac{8}{224} = 3.6 %, \frac{6}{224} = 2.7 %

, and

\frac{4}{224} = 1.8 %

compression ratios. The PyCAE generalization was evaluated on 100% of the Pavia University dataset, by a PyCAE that was trained on 70% of the Pavia center dataset. We found that even at such high compression ratios, our method exhibited excellent reconstructions with an average SSIM above 0.99 and average SAM below 0.07. A similar performance was also obtained on the generalization test on the Pavia University dataset.

Augmenting the datasets by adding exceedingly small normal noise with variation

σ = 4 \times 10^{- 4}

mitigates overfitting and gives much better results.

The robustness to quantization noise of PyCAE was evaluated on the Salinas valley dataset. The evaluation was performed on typical sensor quantization depths (16 bits, 12 bits, 10 bits, and 8 bits), yet we see excellent results with an average SSIM greater than 0.98 and SAM smaller than 0.071.

PyCAE has proved to be a very efficient optimizer for the design and optimization of SpLM made of a set of mFPRs with different depths for a compressive HSI platform. Moreover, PyCAE can be used as a concept for the optimization of many other signal sensors when the physics model of the sensor is mathematically well defined.

We also presented a new compressive HSI for airborne and spaceborne platforms that benefit from high optical throughput together with reduced bandwidth and storage requirements. This HSI platform, optimized by the PyCAE, is superior to the HSI with wedge-shaped LCC as its SpLM, because it compresses the HS cube by a factor of up to 56:1.

Author Contributions

Conceptualization, Y.H. and A.S.; methodology, Y.H.; software, Y.H.; validation, Y.H., and A.S.; investigation, Y.H and A.S.; writing—original draft preparation Y.H.; writing—review and editing, Y.H and A.S.; supervision, A.S.; funding acquisition, A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Ministry of Science, Technology and Space, Israel, grant number 3-18410 and 3-13351.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ghamisi, P.; Yokoya, N.; Jun, L.; Liao, W.; Liu, S.; Plaza, J.; Rasti, B.; Plaza, A. Advances in Hyperspectral Image and Signal Processing: A Comprehensive Overview of the State of the Art. GRSM 2017, 5, 37–78. [Google Scholar] [CrossRef] [Green Version]
Li, Z.; Huang, L.; He, J. A Multiscale Deep Middle-level Feature Fusion Network for Hyperspectral Classification. Remote Sens. 2019, 11, 695. [Google Scholar] [CrossRef] [Green Version]
Li, S.; Song, W.; Fang, L.; Chen, Y.; Ghamisi, P.; Benediktsson, J.A. Deep Learning for Hyperspectral Image Classification: An Overview. IEEE Trans. 2019, 57, 6690–6709. [Google Scholar] [CrossRef] [Green Version]
Huang, L.; Luo, R.; Liu, X.; Hao, X. Spectral imaging with deep learning. Light Sci. Appl. 2022, 11, 2047. [Google Scholar] [CrossRef]
Jia, S.; Jiang, S.; Lin, Z.; Li, N.; Xu, M.; Yu, S. A survey: Deep learning for hyperspectral image classification with few labeled samples. Neurocomputing 2021, 448, 179–204. [Google Scholar] [CrossRef]
Heiser, Y.; Oiknine, Y.; Stern, A. Compressive Hyperspectral Image Reconstruction with Deep Neural Networks; SPIE: Bellingham, WA, USA, 2019; p. 109890M. [Google Scholar]
Gedalin, D.; Oiknine, Y.; Stern, A. DeepCubeNet: Reconstruction of spectrally compressive sensed hyperspectral images with deep neural networks. Opt. Express 2019, 27, 35811–35822. [Google Scholar] [CrossRef]
Cohen, N.; Shmilovich, S.; Oiknine, Y.; Stern, A. Deep neural network classification in the compressively sensed spectral image domain. J. Electron. Imaging 2021, 30, 041406. [Google Scholar] [CrossRef]
Roy, S.K.; Das, S.; Song, T.; Chanda, B. DARecNet-BS: Unsupervised Dual-Attention Reconstruction Network for Hyperspectral Band Selection. LGRS 2021, 18, 2152–2156. [Google Scholar] [CrossRef]
Sellami, A.; Farah, M.; Riadh Farah, I.; Solaiman, B. Hyperspectral imagery classification based on semi-supervised 3-D deep neural network and adaptive band selection. Expert Syst. Appl. 2019, 129, 246–259. [Google Scholar] [CrossRef]
Sawant, S.S.; Manoharan, P.; Loganathan, A. Band selection strategies for hyperspectral image classification based on machine learning and artificial intelligent techniques–Survey. Arab. J. Geosci. 2021, 14, 646. [Google Scholar] [CrossRef]
Shaw, G.A.; Burke, H.K. Spectral Imaging for Remote Sensing. Linc. Lab. J. 2003, 14, 3–28. [Google Scholar]
Hagen, N.; Kudenov, M.W. Review of snapshot spectral imaging technologies. Opt. Eng. 2013, 52, 090901. [Google Scholar] [CrossRef] [Green Version]
Eismann, M.T. Hyperspectral Remote Sensing; SPIE: Bellingham, WA, USA, 2012. [Google Scholar]
Manolakis, D.G.; Lockwood, R.B.; Cooley, T.W. Hyperspectral Imaging Remote Sensing: Physics, Sensors, and Algorithms; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
Vane, G.; Green, R.O.; Chrien, T.G.; Enmark, H.T.; Hansen, E.G.; Porter, W.M. The airborne visible/infrared imaging spectrometer (AVIRIS). Remote Sens. Environ. 1993, 44, 127–143. [Google Scholar] [CrossRef]
Knight, E.J.; Kvaran, G. Landsat-8 operational land imager design, characterization, and performance. Remote Sens. 2014, 6, 10286–10305. [Google Scholar] [CrossRef] [Green Version]
Thenkabail, P.S.; Lyon, J.G. Hyperspectral Remote Sensing of Vegetation; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
Gaitani, N.; Burud, I.; Thiis, T.; Santamouris, M. High-resolution spectral mapping of urban thermal properties with Unmanned Aerial Vehicles. Build. Environ. 2017, 121, 215–224. [Google Scholar] [CrossRef]
Gupta, R.P. Remote Sensing Geology; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Galvão, L.S.; Formaggio, A.R.; Couto, E.G.; Roberts, D.A. Relationships between the mineralogical and chemical composition of tropical soils and topography from hyperspectral remote sensing data. ISPRS J. Photogramm. Remote Sens. 2008, 63, 259–271. [Google Scholar] [CrossRef]
Maathuis, B.H.P.; Genderen, J.L.V. A review of satellite and airborne sensors for remote sensing based detection of minefields and landmines. Int. J. Remote Sens. 2004, 25, 5201–5245. [Google Scholar] [CrossRef]
Schott, J.R. Remote Sensing: The Image Chain Approach; Oxford University Press on Demand: Oxford, UK, 2007. [Google Scholar]
Fellgett, P. Conclusions on multiplex methods. J. Phys. Colloq. 1967, 28, C2-171. [Google Scholar] [CrossRef] [Green Version]
Gao, L.; Wang, L.V. A review of snapshot multidimensional optical imaging: Measuring photon tags in parallel. Phys. Rep. 2016, 616, 1–37. [Google Scholar] [CrossRef] [Green Version]
Griffiths, P.R.; De Haseth, J.A. Fourier Transform Infrared Spectrometry; John Wiley & Sons: Hoboken, NJ, USA, 2007. [Google Scholar]
Foken, T. Springer Handbook of Atmospheric Measurements; Springer: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
Sabbah, S.; Harig, R.; Rusch, P.; Eichmann, J.; Keens, A.; Gerhard, J.; Sabbah, S.; Harig, R.; Rusch, P.; Eichmann, J.; et al. Remote sensing of gases by hyperspectral imaging: System performance and measurements Remote sensing of gases by hyperspectral imaging: System performance and measurements. Opt. Eng. 2012, 51, 111717. [Google Scholar] [CrossRef]
Persky, M.J. A review of spaceborne infrared Fourier transform spectrometers for remote sensing. Rev. Sci. Instrum. 1995, 66, 4763–4797. [Google Scholar] [CrossRef]
Ferrec, Y.; Taboury, J.; Sauer, H.; Chavel, P.; Fournet, P.; Coudrain, C.; Deschamps, J.; Primot, J. Experimental results from an airborne static Fourier transform imaging spectrometer. Appl. Opt. 2011, 50, 5894–5904. [Google Scholar] [CrossRef] [PubMed]
Itoh, K.; Inoue, T.; Ohta, T.; Ichioka, Y. Liquid-crystal imaging Fourier-spectrometer array. Opt. Lett. 1990, 15, 652–654. [Google Scholar] [CrossRef]
August, I.; Oiknine, Y.; AbuLeil, M.; Abdulhalim, I.; Stern, A. Miniature Compressive Ultra-spectral Imaging System Utilizing a Single Liquid Crystal Phase Retarder. Sci. Rep. 2016, 6, 23524. [Google Scholar] [CrossRef]
Oiknine, Y.; August, I.; Stern, A. Along-track scanning using a liquid crystal compressive hyperspectral imager. Opt. Express 2016, 24, 8446–8457. [Google Scholar] [CrossRef]
Jullien, A.; Pascal, R.; Bortolozzo, U.; Forget, N.; Residori, S. High-resolution hyperspectral imaging with cascaded liquid crystal cells. Optica 2017, 4, 400. [Google Scholar] [CrossRef]
Oiknine, Y.; August, I.; Farber, V.; Gedalin, D.; Stern, A. Compressive Sensing Hyperspectral Imaging by Spectral Multiplexing with Liquid Crystal. J. Imaging 2018, 5, 3. [Google Scholar] [CrossRef] [Green Version]
Stern, A. Optical Compressive Imaging; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
Shmilovich, S.; Oiknine, Y.; Abuleil, M.; Abdulhalim, I.; Blumberg, D.G.; Stern, A. Dual-camera design for hyperspectral and panchromatic imaging, using a wedge shaped liquid crystal as a spectral multiplexer. Sci. Rep. 2020, 10, 3455. [Google Scholar] [CrossRef] [Green Version]
Oiknine, Y.; August, I.; Stern, A. Multi-aperture snapshot compressive hyperspectral camera. Opt. Lett. 2018, 43, 5042–5045. [Google Scholar] [CrossRef]
Hyperspectral Remote Sensing Scenes—Grupo de Inteligencia Computacional (GIC). Available online: https://www.ehu.eus/ccwintco/index.php/Hyperspectral_Remote_Sensing_Scenes (accessed on 7 July 2022).
Oiknine, Y.; August, I.; Blumberg, D.G.; Stern, A. Compressive sensing resonator spectroscopy. Opt. Lett. 2017, 42, 25–28. [Google Scholar] [CrossRef] [PubMed]
Hinton, G.; Srivastava, N.; Swersky, K. Neural Networks for Machine Learning; Coursera: Mountain View, CA, USA, 2012. [Google Scholar]
Bengio, Y.; L’eonard, N.; Courville, A. Estimating or Propagating Gradients through Stochastic Neurons for Conditional Computation. Neural Comput. 1999, 11, 1199–1209. [Google Scholar] [CrossRef]
Yao, H.; Dai, F.; Zhang, S.; Zhang, Y.; Tian, Q.; Xu, C. DR2-Net: Deep Residual Reconstruction Network for image compressive sensing. Neurocomputing 2019, 359, 483–493. [Google Scholar] [CrossRef] [Green Version]
Wang, Z.; Chen, J.; Hoi, S.C. Deep learning for image super-resolution: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 43, 3365–3387. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [Green Version]
Zhao, J.; Kechasov, D.; Rewald, B.; Bodner, G.; Verheul, M.; Clarke, N.; Clarke, J.L. Deep Learning in Hyperspectral Image Reconstruction from Single RGB images—A Case Study on Tomato Quality Parameters. Remote Sens. 2020, 12, 3258. [Google Scholar] [CrossRef]
Proakis, J.G.; Manolakis, D.K. Digital Signal Processing: Principles, Algorithms, and Applications, 4th ed.; Pearson: London, UK, 2007; pp. 403–408. [Google Scholar]

Figure 1. A description of the scanning along-track of a satellite camera equipped with SpLM. (a) A satellite camera equipped with SpLM holding a staircase set of mFPR etalons scanning the earth terrain in orbit. Each stair has its own field of view (FoV) capturing a strip of the terrain. (b) The wedge-shaped spectral modulator (e.g., the LCC [37]) has a continuous slant, thus, each column of pixels has its own modulation. (c) An example of an overcomplete sensing matrix map

(1250 \times 224)

respective to the wedge-shaped LCC [37]. (d) The staircase-shaped mFPR has a discrete slant with a staircase design, where all stairs have equal width, but each stair has a different height, resulting in different spectral modulation. Here, each step has its own FoV and occupies a few pixel columns. (e) An undercomplete sensing matrix map

(4 \times 224)

determined by the number of stairs and their heights (4 stairs only in this example).

Figure 1. A description of the scanning along-track of a satellite camera equipped with SpLM. (a) A satellite camera equipped with SpLM holding a staircase set of mFPR etalons scanning the earth terrain in orbit. Each stair has its own field of view (FoV) capturing a strip of the terrain. (b) The wedge-shaped spectral modulator (e.g., the LCC [37]) has a continuous slant, thus, each column of pixels has its own modulation. (c) An example of an overcomplete sensing matrix map

(1250 \times 224)

respective to the wedge-shaped LCC [37]. (d) The staircase-shaped mFPR has a discrete slant with a staircase design, where all stairs have equal width, but each stair has a different height, resulting in different spectral modulation. Here, each step has its own FoV and occupies a few pixel columns. (e) An undercomplete sensing matrix map

(4 \times 224)

determined by the number of stairs and their heights (4 stairs only in this example).

Figure 2. (a) DNN encoder, which models our SpLM, is used to optimize the gaps of the mFPR stair-shaped etalons by learning from pairs of inputs

f

and the labels

\hat{f}

. The outputs of the encoder,

g

, at the bottleneck of the autoencoder, represent the measurements. (b) After the optimal gaps of the mFPR’s stair-shaped etalons are learned by the encoder, the staircase-shaped SpLM can be produced and replace the software encoder.

Figure 2. (a) DNN encoder, which models our SpLM, is used to optimize the gaps of the mFPR stair-shaped etalons by learning from pairs of inputs

f

and the labels

\hat{f}

. The outputs of the encoder,

g

, at the bottleneck of the autoencoder, represent the measurements. (b) After the optimal gaps of the mFPR’s stair-shaped etalons are learned by the encoder, the staircase-shaped SpLM can be produced and replace the software encoder.

Figure 3. Illustration of the STE algorithm for the physical constraints and limitations that must be imposed on the gaps of the etalons. At the feedforward phase, the input signal flows from the encoder outputs

g

(the measurements) to the decoder and is reconstructed at the output of the decoder. At the backpropagation phase, gradients flow backward from the output layer of the decoder through the bottleneck and update the etalons depths

d

. Then, a set of constraints and limitations are imposed on

d

by the STE, and

\tilde{d}

, the constrained etalons depths, are assigned as the new weights of the sensing layer.

Figure 3. Illustration of the STE algorithm for the physical constraints and limitations that must be imposed on the gaps of the etalons. At the feedforward phase, the input signal flows from the encoder outputs

g

(the measurements) to the decoder and is reconstructed at the output of the decoder. At the backpropagation phase, gradients flow backward from the output layer of the decoder through the bottleneck and update the etalons depths

d

. Then, a set of constraints and limitations are imposed on

d

by the STE, and

\tilde{d}

, the constrained etalons depths, are assigned as the new weights of the sensing layer.

Figure 4. (a) A compressed input

g

is obtained by the multiplication of the high-dimensional spectrum

f

with the sensing matrix

Φ

. The first step in the reconstruction network takes the compressed signal

g

and expands it to the high-dimensional size by a fully connected layer, which learns its coarse approximation. Then, it is refined by a train of residual convolutional blocks, where each block contains 3 convolutional layers. The first layer has 64 kernels of size 9, the second has 32 kernels of size 1, and the third has 1 kernel of size 7. The number of residual blocks is a hyperparameter of the system. (b) A progressive reconstruction network upsamples the signal to intermediate size and upsamples it again using a sequence of reconstruction network blocks as shown in (a). The number of subnetworks is a hyperparameter of the system.

Figure 4. (a) A compressed input

g

is obtained by the multiplication of the high-dimensional spectrum

f

with the sensing matrix

Φ

. The first step in the reconstruction network takes the compressed signal

g

and expands it to the high-dimensional size by a fully connected layer, which learns its coarse approximation. Then, it is refined by a train of residual convolutional blocks, where each block contains 3 convolutional layers. The first layer has 64 kernels of size 9, the second has 32 kernels of size 1, and the third has 1 kernel of size 7. The number of residual blocks is a hyperparameter of the system. (b) A progressive reconstruction network upsamples the signal to intermediate size and upsamples it again using a sequence of reconstruction network blocks as shown in (a). The number of subnetworks is a hyperparameter of the system.

Figure 5. (a) Two concatenated images of the Pavia center in Italy. (b) Salinas scene of the Salinas valley, California.

Figure 6. Pavia University is used for the evaluation of PyCAE performance, as it is trained on the Pavia center dataset.

Figure 7. Three configurations of SpLMs were obtained by training the PyCAE on “Pavia center” data: (a) SpLM with 8 stairs, (b) SpLM with 6 stairs, and (c) SpLM with 4 stairs. The stairs height of the SpLMs as obtained from the learning process are shown on the left-hand side. The respective sensing matrices are shown on the right-hand map. The center image shows the preferable reordering of the stairs to obtain a staircase as in Figure 1d.

Figure 8. Figure shows 6 examples for comparison of spectra reconstruction from 3 SpLMs configurations: 8 measurements vs. 6 measurements vs. 4 measurements. The spectra have 224 bands in the range of

(400 nm - 2500 nm)

.

Figure 8. Figure shows 6 examples for comparison of spectra reconstruction from 3 SpLMs configurations: 8 measurements vs. 6 measurements vs. 4 measurements. The spectra have 224 bands in the range of

(400 nm - 2500 nm)

.

Figure 9. Figure shows 6 spectra reconstruction from the validation/test set. Each spectrum is reconstructed from 3 different SpLMs (yielding 8 measurements, 6 measurements, and 4 measurements). The spectra have 102 bands in the range of

(430 nm - 850 nm)

.

Figure 9. Figure shows 6 spectra reconstruction from the validation/test set. Each spectrum is reconstructed from 3 different SpLMs (yielding 8 measurements, 6 measurements, and 4 measurements). The spectra have 102 bands in the range of

(430 nm - 850 nm)

.

Figure 10. Figure shows 6 representative spectra reconstruction from the dataset of Pavia University. Each spectrum is reconstructed from 3 different SpLMs (8 measurements, 6 measurements, and 4 measurements). The spectra have 103 spectral bands in the range of

(430 nm - 850 nm)

.

Figure 10. Figure shows 6 representative spectra reconstruction from the dataset of Pavia University. Each spectrum is reconstructed from 3 different SpLMs (8 measurements, 6 measurements, and 4 measurements). The spectra have 103 spectral bands in the range of

(430 nm - 850 nm)

.

Figure 11. Quantization noise effect on SpLM with 4 mFPR stairs, 16 bits, 12 bits, 10 bits, and 8 bits, on 2 examples of spectra reconstruction.

Table 1. The metrics of SSIM and SAM on the test set for configuration of 8 measurements, 6 measurements, and 4 measurements.

	8 Measurements	6 Measurements	4 Measurements
SSIM	0.9996	0.9913	0.9995
SAM	0.0106	0.0755	0.0138

Table 2. The mean SSIM and mean SAM over the validation/test set of Pavia center for 3 configurations (8 measurements, 6 measurements, and 4 measurements).

	8 Measurements	6 Measurements	4 Measurements
SSIM	0.9941	0.991	0.9909
SAM	0.05515	0.06891	0.07346

Table 3. The mean SSIM and mean SAM over the dataset of Pavia University for 3 configurations (8 measurements, 6 measurements, and 4 measurements).

	8 Measurements	6 Measurements	4 Measurements
SSIM	0.99210	0.9921	0.9823
SAM	0.04746	0.0484	0.07484

Table 4. The metrics of SSIM and SAM on the test set for quantization noise of 4 measurements with 16 bits, 12 bits, 10 bits, and 8 bits.

	16 Bits	12 Bits	10 Bits	8 Bits
SSIM	0.99922	0.9922	0.9892	0.9873
SAM	0.03934	0.04092	0.04541	0.05196

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Heiser, Y.; Stern, A. Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder. Remote Sens. 2022, 14, 3766. https://doi.org/10.3390/rs14153766

AMA Style

Heiser Y, Stern A. Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder. Remote Sensing. 2022; 14(15):3766. https://doi.org/10.3390/rs14153766

Chicago/Turabian Style

Heiser, Yaron, and Adrian Stern. 2022. "Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder" Remote Sensing 14, no. 15: 3766. https://doi.org/10.3390/rs14153766

APA Style

Heiser, Y., & Stern, A. (2022). Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder. Remote Sensing, 14(15), 3766. https://doi.org/10.3390/rs14153766

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Learned Design of a Compressive Hyperspectral Imager for Remote Sensing by a Physics-Constrained Autoencoder

Abstract

1. Introduction

2. Materials and Methods

2.1. Physics-Constrained Autoencoder (PyCAE)

2.2. The SpLM and Its Model

2.3. Encoder—Sensing Layer

2.4. Straight-Through Estimator (STE)

2.5. Decoder—Reconstruction DNN

2.6. Loss Function and Metrics for the PyCAE

3. Results

3.1. Training Details

3.1.1. Datasets—Pavia Center and Salinas Valley

3.1.2. Pavia University—Generalization Evaluation

3.2. The Optimized SpLMs

3.3. The Reconstruction DNN

3.4. Salinas Valley

3.5. Pavia Center

3.6. Generalization—Pavia University

3.7. Robustness to Quantization Noise

4. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI