Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns

Þórðarson, Andri Freyr; Baum, Andreas; García, Mónica; Vicente-Serrano, Sergio M.; Stockmarr, Anders

doi:10.3390/rs13194007

Open AccessArticle

Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns

by

Andri Freyr Þórðarson

¹

,

Andreas Baum

^1,*

,

Mónica García

²

,

Sergio M. Vicente-Serrano

³ and

Anders Stockmarr

¹

Department of Applied Mathematics and Computer Science, Technical University of Denmark, 2800 Lyngby, Denmark

²

Department of Environmental Engineering, Technical University of Denmark, 2800 Lyngby, Denmark

³

IPE-CSIC, Intituto Pirenaico de Ecologia, Consejo Superior de Investigaciones Cientificas, 50059 Zaragoza, Spain

^*

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(19), 4007; https://doi.org/10.3390/rs13194007

Submission received: 25 August 2021 / Revised: 29 September 2021 / Accepted: 29 September 2021 / Published: 6 October 2021

(This article belongs to the Topic High-Resolution Earth Observation Systems, Technologies, and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

Remote sensing satellite images in the optical domain often contain missing or misleading data due to overcast conditions or sensor malfunctioning, concealing potentially important information. In this paper, we apply expectation maximization (EM) Tucker to NDVI satellite data from the Iberian Peninsula in order to gap-fill missing information. EM Tucker belongs to a family of tensor decomposition methods that are known to offer a number of interesting properties, including the ability to directly analyze data stored in multidimensional arrays and to explicitly exploit their multiway structure, which is lost when traditional spatial-, temporal- and spectral-based methods are used. In order to evaluate the gap-filling accuracy of EM Tucker for NDVI images, we used three data sets based on advanced very-high resolution radiometer (AVHRR) imagery over the Iberian Peninsula with artificially added missing data as well as a data set originating from the Iberian Peninsula with natural missing data. The performance of EM Tucker was compared to a simple mean imputation, a spatio-temporal hybrid method, and an iterative method based on principal component analysis (PCA). In comparison, imputation of the missing data using EM Tucker consistently yielded the most accurate results across the three simulated data sets, with levels of missing data ranging from 10 to 90%.

Keywords:

missing data; imputation; data completion; remote sensing; tensor decomposition; Tucker; multiway analysis; machine learning

Graphical Abstract

1. Introduction

The presence of missing data caused by clouds or artifacts in optical satellite data is something that needs to be dealt with in order to obtain a complete time series describing land–surface dynamics.

State of the art algorithms to complete missing information in remote sensing data can be divided into four main categories: Spatial-based, temporal-based, spectral-based, and hybrid methods [1]. Spatial-based methods make use of spatial information to estimate missing data, for example spatial interpolation [2,3], which uses the weighted average of pixels surrounding the missing region [1]. Despite generally being computationally efficient and easy to implement, these methods tend to not perform well when the missing areas cover large heterogeneous regions [4]. Temporal methods utilize data from the same region at different time points. Different temporal replacement methods exist, which can be implemented on a pixel-by-pixel basis [5,6], patch-by-patch [7,8], or on a whole missing region [9,10,11]: Temporal filter methods include sliding window filter methods, which are commonly used to reconstruct normalized difference vegetation index (NDVI) time-series data [12,13,14,15,16] according to some criteria; function-based curve-fitting methods [17,18,19,20]; frequency domain methods [21,22]; and temporal learning based methods [23,24]. Temporal methods tend to perform poorly when the landscape or vegetation changes substantially through dynamical effects due to cloudy conditions or when clouds persist in time and space.

Spectral-based methods are typically multivariate, making up for missing data in one channel by gathering information from the same spatio-temporal unit by using all of the available channels. However, for the optical and thermal range of 400–12,000 nanometers, any missing data in a single channel will result in missing data in all of the channels because clouds are not transparent to radiation at these wavelengths. Wang et al. [25] applied polynomial regression to predict the reflectance of channel 6 of the aqua moderate resolution imaging spectroradiometer (MODIS) sensor, where 15 of the 20 detectors are nonfunctional or noisy channels. This can be useful where one of the channels is malfunctioning due to technical reasons. However, this method cannot be applied to gap-fill for cloudy conditions because clouds are opaque in the optical thermal range, meaning that missing information in one channel will be missing in other channels as well.

Lastly, hybrid methods blend two or more of the above-mentioned categories, combining the strengths of each method with existing examples of implementations including joint spatio-temporal [26,27] and joint spatio-spectral methods [28]. The simplest form of a joint method is to implement two or more of the above-mentioned methods successively, feeding the results of one method into the next algorithm. Sarafanov et al. demonstrate a successful implementation of a successive spatio-temporal method using a machine learning approach [29].

Successive utilization methods do not, however, exploit the existing correlation between different dimensions of the data. Attempts to remedy this include a novel method proposed by Cheng et al. [27] that merges concepts from the spatial and temporal categories, imputing missing data from different spatial locations at different times, assuming that similar groups of neighboring pixels in a given image will have similar dynamics in multitemporal images. Further exploiting the multiway structure of the data by including all modes (dimensions of a tensor, meaning spatial, temporal, and spectral) is however possible with tensor decomposition, a widely used group of algorithms in data mining that is rapidly becoming more relevant and useful with the increasing computational power and storage capabilities of modern computers [30]. A major advantage of tensor decomposition methods is the ability to explicitly take into account the multiway structure of data [30], thereby taking advantage of relationships between pixels in the spatial, spectral, and temporal dimensions. It is well documented that tensor decomposition methods can be used to fill in missing data accurately, even when a large proportion of the data is missing [31,32,33]. The amount of missing data that can be handled while still obtaining accurate results will depend on the tensor decomposition method that is used as well as the structure of the tensor that is decomposed. It has been shown that applications of Tucker decomposition, one of the most widely used methods for gap-filling data in domains other than remote sensing, such as chemometrics [34] or big data for traffic applications [35], can successfully reconstruct noisy tensors with up to 95% missing data [36], but they have not, to our knowledge, been applied to gap-fill time series data from remote sensing data sets at present.

The Iberian Peninsula is a perfect natural laboratory to test gap-filling methods for the remote sensing indices of vegetation greenness due to its large bioclimatic gradients and high seasonal and interannual climatic variations that create dynamic patterns. In addition, land use changes [2,37,38] such as afforestation [39], intensification, aridification, and more frequent forest fires [3,40,41] are prevalent. Given the uncertainty in projected climate change for future decades [42,43], it is crucial to monitor also monitor vegetation changes at a regional scale. These can be accomplished in a cost-effective way by using satellite data. NDVI is a good indicator of changes in vegetation growth and senescence associated with climate and human impacts [44,45,46,47]. The NDVI high-resolution data set from the Iberian Peninsula from Vicente-Serrano et al. [48], which spans 34 years, has been used to assess drought impacts or forest resilience using biweekly composites [49,50,51]. Increasing the temporal resolution to daily time scale can improve biological forecasting by detecting early warning signals of abrupt transitions or ecological thresholds [52,53].

The goal in this study is to assess the performance of expectation maximization (EM) Tucker, a Tucker implementation that iteratively imputes missing data to fill in gaps caused by clouds or low viewing angles in NDVI images from the AVHRR sensor. We used both simulated and real data sets and benchmark the results through comparisons with state-of-the-art methods for gap-filling NDVI time series.

The paper is structured as follows: First, two study regions are introduced followed by an introduction of different gap-filling methods and the performance metrics that we used to compare those methods. Three data sets based on the first study region were established, and missing data were added artificially in order to benchmark the methods. A fourth data set was established based on the second, larger study region. This data set contained natural missing data. We applied all of the methods discussed in an earlier section to this fourth data set in order to demonstrate the application’s performance in a comparative fashion in a real-world situation.

2. Study Region and Data Set

2.1. Study Region

We investigated two separate regions on the peninsula, both of which are within the borders of Spain. The first region (study region 1) is bound by 39°10′N–38°51′N and −2°58′E–2°36′E and primarily consists of forested land, pastures, and cropland. There are no large bodies of water within the region. The second region (study region 2) is located further south and is bound by 36°56′N and 36°45′N and −3°58′E and −2°37′N. This is a larger and more diverse region and includes the greenhouse covered region in the plains of Campo de Dalías in the south-east, the Sierra Nevada mountain range, a few bodies of water as well as shrublands, small forests, and croplands to the north and west. The two study regions were identified by selecting spatial subsets from the Iberian Peninsula after a screening process looking for sites with enough spatial variability driven by different land cover types. An additional criterion for study region 1 was that it needed to contain as little missing data as possible. Observations of the Iberian Peninsula over the course of the chosen time frame are shown in Figure 1. Study regions 1 and 2 are indicated by black rectangles. In addition, Figure 2 shows enlarged plots of the two rectangles.

2.2. Data Sets and Pre-Processing

The Iberian data set consists of daily AVHRR images taken between 1981 and 2015 and encompasses Portugal, Spain, Gibraltar, Andorra, and a small part of Southern France.

The data were acquired from the satellites NOAA-7, -9, -11, -14, -16, -18, and -19 at a spatial resolution of 1.2 km. Daily images were subject to processing that included calibration and cross-calibration, geographical matching, cloud cover removal, top-of-the-atmosphere reflectance calculation, and topographical correction. Details of the processing procedure can be found in [48]. Daily red and infrared reflectance were used to calculate the NDVI index.

Daily NDVI data are available for the entire time period between 1989 and 2015. However, we only used data from a single year (2008), as we assumed that the temporal patterns would be redundant across several years and because we wanted to demonstrate that the method is also applicable to real-world scenarios where less data are available.

The reduced data set (2008) consists of 366 images, each of which contains 1115 × 834 pixels. A large part of the frames is empty, and the remaining ones indicate some degree of missing data. The missing data are the result of prior artifact removal, which was applied to cope with heavy cloud cover, high view angles, or because the images were damaged in some way. Overall, missing data constitute 90% of the data set. There are 215 time frames that contain no data at all. Disregarding these empty frames as well as the ocean (see the estimated shoreline in Figure 1f), the amount of missing data drops to 56%. After removing frames that contain very little or no data, 66 frames remained, resulting in a multiway array of 1115 horizontal × 834 vertical × 66 temporal pixels. Examples of five chosen time frames with various levels of missing data can be seen in Figure 1.

To quantitatively assess the performance of the gap-filling methods, it was necessary to have ground truth values. This is not possible when data are missing due to clouds unless ground sensors are available. To overcome this, we used study region 1, which contained little missing data in order to create three different data sets, namely SIM1, SIM2, and SPAIN1. These were used for model evaluation. Study region 2 contained natural missing data and was used to create a fourth data set, namely SPAIN2, which was used to demonstrate how the gap-filling methods function when applied to a larger area. An overview of all of the data sets is shown in Table 1.

The purpose of the simulated time series data sets (SIM1-2) was to conduct a proof-of-concept study under well controlled conditions. The real time series data sets (SPAIN1-2), on the other hand, were used to demonstrate performance with real imagery. SIM1, SIM2, and SPAIN1 were used to evaluate the algorithms with different levels of artificially added missing data. The fourth data set, SPAIN2, was used to demonstrate the application of the algorithms to a larger area (11,660 km²) of the Iberian Peninsula. Thus, for SPAIN2, the actual ground truth values were not known.

The simulated data sets were constructed using the methods described as follows: For SIM1, a single time frame (23 January) was selected from study region 1, as indicated by the small black rectangle in Figure 1. This time frame was replicated 30 times, as shown in Figure 3. All 30 frames in SIM1 were completely identical. To construct the data set, SIM2 gaussian noise with a mean µ = 0 and standard deviation σ = 0.02 was added to all of the SIM1 pixels.

For SPAIN1, study region 1, which contained an overall low amount of missing data, was selected (upper rectangle in Figure 1). All of the time frames indicating missing data were removed. The dimensions for all of the aforementioned data sets are shown in Table 1. A selected number of time frames from SPAIN1 are shown in Figure 4.

Missing data were added to SIM1, SIM2, and SPAIN1 in two different ways. First, pixels were removed completely at random (MCAR) until the missing data reached a desired percentage of elements. Secondly, pixels were randomly removed in 5 × 5 blocks at a time, resulting in data that were missing at random (MAR). The levels of missing data that were analyzed were 0%, 1%, 2.5%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, and 99%. Examples of three levels of missing data are shown in Figure 5.

In the Iberian data set, the distribution of the missing data appears to be MAR due to the fact that clouds would typically cover patches of land, leading to block-wise missing data. The addition of missing data was conducted a number of times in order to generate data sets that could indicate different levels of missing data in order to evaluate how this aspect would affect the gap-filling performance of the algorithms.

The geographic area of the SPAIN2 data set is shown in Figure 1 and is indicated by the lower rectangle (study region 2). The SPAIN2 dimensions were 90 × 90 × 66 pixels. As the ground truth values were not known, SPAIN2 was solely used to demonstrate the application of EM Tucker for the gap-filling of a larger area. Results are presented and compared to four reference methods by visual means only.

3. Methods

Throughout this paper, tensors will be denoted using upper-case calligraphic letters, matrices and vectors will be denoted using bold, scalars will be denoted using lower-case letters, and matrix or tensor dimensions will be denoted using upper-case letters.

3.1. Tucker Decomposition

Tensors, or multiway arrays, are higher-order generalizations of vectors and matrices. Each dimension of a tensor is called a mode. A matrix is a second order (or two-way) tensor indicating two modes, referring to rows and columns. A third order tensor has three modes, referring to rows, columns, and so-called tubes. As a consequence, matrices store data in “tables”, while third order tensors store data in “boxes”. We will denote third order tensors as

X^{I \times J \times K}

, where I corresponds to the number of rows or horizontal pixels, J corresponds to the number of columns or vertical pixels, and K corresponds to the number of tubes or frames. Using this notation, the data sets that were previously introduced can be denoted as

X_{S I M 1}^{30 \times 30 \times 30}

for SIM1,

X_{S I M 2}^{30 \times 30 \times 30}

for SIM2,

X_{S P A I N 1}^{30 \times 30 \times 54}

for SPAIN1, and

X_{S P A I N 2}^{90 \times 90 \times 66}

for SPAIN2 in tensor form.

Formally, Tucker decomposes a three-way tensor

X^{I \times J \times K}

into the three loading matrices

A^{I \times P}

,

B^{J \times Q}

,

C^{K \times R}

and a core tensor

G^{P \times Q \times R}

, where P, Q, and R denote the ranks in the respective modes (see Figure 6) [54,55]. The rank of a given mode can be understood as the number of independent loading vectors that are necessary in that mode to reconstruct the systematic variation in

X

. The decomposition is described element-wise in Equation (1).

Figure 6. A visual representation of the Tucker decomposition (a). The third order tensor

X

is decomposed into a core tensor

G

and the loading matrices A, B, and C. The residuals are represented with

E

, a tensor that is the same size as

X

. The Tucker decomposition allows for different ranks along the tensor modes, which are denoted as P, Q, and R. The imputation process of EM Tucker is described in (b).

Figure 6. A visual representation of the Tucker decomposition (a). The third order tensor

X

is decomposed into a core tensor

G

and the loading matrices A, B, and C. The residuals are represented with

E

, a tensor that is the same size as

X

. The Tucker decomposition allows for different ranks along the tensor modes, which are denoted as P, Q, and R. The imputation process of EM Tucker is described in (b).

x_{i j k} = \sum_{p = 1}^{P} \sum_{q = 1}^{Q} \sum_{r = 1}^{R} a_{i p} b_{j q} c_{k r} g_{p q r} + e_{i j k}

(1)

where

x_{i j k}

represents a pixel at the i-th row, the j-th column, and the k-th tube of the tensor

X

. Alternatively, the Tucker decomposition can be formulated using matrix notation, as shown in Equation (2):

X_{(1)} = A G_{(1)} {(C \otimes B)}^{T} + E_{(1)}

(2)

where ⊗ denotes the Kronecker product, and

X_{(1)}

is the matricized tensor

X

across the first mode, as defined by Equation (3).

X^{I \times J \times K} \to X_{(1)}^{I \times J K}

(3)

The objective of the Tucker model is to minimize the reconstruction error determined via the following loss function equation, Equation (4), where

∥ \cdot ∥_{F}^{2}

represents the squared Frobenius norm.

L (A, B, C) = ‖ X_{(1)} - A G_{(1)} {(C \otimes B)}^{T} ‖_{F}^{2} = ‖ E_{(1)} ‖_{F}^{2}

(4)

Here,

E_{(1)}

represents the residual tensor matricized across its first mode. The objective is most commonly achieved through the use of the alternating least squares algorithm (ALS), which updates the estimates of the loading matrices A, B, and C and the core tensor

G

in an iterative fashion, as shown in Equations (5)–(8).

A \leftarrow X_{(1)} {(G_{(1)} {(C \otimes B)}^{T})}^{†}

(5)

B \leftarrow X_{(2)} {(G_{(2)} {(C \otimes A)}^{T})}^{†}

(6)

C \leftarrow X_{(3)} {(G_{(3)} {(B \otimes A)}^{T})}^{†}

(7)

G \leftarrow X \times_{1} A^{†} \times_{2} B^{†} \times_{3} C^{†}

(8)

where

{(\cdot)}^{†}

denotes the Moore–Penrose inverse, and

\times_{n}

is the so-called n-mode product indicating the following property, see Equation (9):

X \times_{1} A = Z s u c h t h a t Z_{(1)} = A X_{(1)}

(9)

In the present context, the intuition behind the Tucker decomposition can be understood as the desire to find a linear combination of outer vector products between the spatial loading vectors in B and C in order to form “factor landscapes”, which are then scaled by scores in A to reconstruct the original data as accurately as possible. Because of this, the spatial loading vectors in B and C can interact or “cross-talk”. The interaction pattern of these loading vectors is encoded in the core tensor

G

. An illustrative overview of the EM Tucker decomposition as well as a flowchart describing the imputation process is provided in Figure 6.

3.2. EM Tucker for Imputation of Missing Values

The loading matrices A, B, and C are typically initialized using higher order singular value decomposition (HOSVD) [30]. However, HOSVD cannot be used directly in the presence of missing data, as the loss cannot be determined; therefore, prior imputation or marginalization is required. In this study, the imputation approach is chosen, which entails replacing missing data with sensible values before the Tucker decomposition can be applied. One example of a comprehensive imputation approach includes the expectation maximization (EM) algorithm [56], which assumes normal distribution, equal variance, and independence of the residuals.

To gap-fill missing data, we applied EM Tucker, which can be understood as a modification of the Tucker algorithm, such that the iterative imputation of missing values is possible during the ALS fitting procedure [31]. In particular, Equations (5)–(8) are applied to a tensor

\tilde{X}

, which is defined such that

{\tilde{X}}^{(s)} = X * Y + M^{(s)} * (1 - Y)

(10)

where ∗ denotes the element-wise product, 1 denotes a tensor containing ones,

M^{(s)}

denotes the so-called interim model reconstruction at iteration s, and

Y

is a tensor containing elements

y_{i j k}

such that

y_{i j k} = \{\begin{array}{l} 0 i f | x_{i j k} m i s s i n g \\ 1 i f | x_{i j k} n o t m i s s i n g \end{array}

(11)

Using this procedure, the imputed values are updated at each ALS iteration by using the interim model

M^{(s)}

. This resembles the maximization step. To incorporate the updated tensor

{\tilde{X}}^{(s)}

, the loss function is modified with respect to Equation (10), such that

L (A^{(s)}, B^{(s)}, C^{(s)}) = ‖ {\tilde{X}}_{(1)}^{(s)} - A^{(s)} G_{(1)}^{(s)} {(C^{(s)} \otimes B^{(s)})}^{T} ‖_{F}^{2}

(12)

The loss calculation hereby represents the expectation step of the EM algorithm. Initial imputations for

{\tilde{X}}^{(0)}

are obtained via a combination of row and column means from the matricized tensor. This is completed prior to the initialization of the EM Tucker algorithm [57].

3.3. Model Selection

Decisions needed to be made in order to select the appropriate ranks—or number of components—across the three different tensor modes for the Tucker decomposition. Generally speaking, the reconstruction error will decrease as the number of components increases. However, this will eventually lead to the overfitting of the data, which could result in the potential loss of accuracy when gap-filling any missing data. Strategies for choosing the correct number of components include cross validation strategies as well as the application of so-called information criteria, such as the akaike information criterion (AIC) or the Bayesian information criterion (BIC). Atkinson et al. use these methods to estimate the performance of different models that were applied to NDVI time series data [58].

However, due to the fact that ground truth data are not available for the missing data, AIC and BIC were deemed inadequate to select the optimal number of components across the tensor modes. Instead, we have chosen an alternative model selection strategy that relies on the knowledge of the land-use types of the site. In the following we support our decision through some initial investigations.

We repeatedly decomposed SIM1 (without missing data) using EM Tucker using different ranks across the spatial modes, P and Q, while assuming the rank across the time mode to be one (R = 1). The model reconstructions for a single time frame obtained for the different spatial ranks can be seen in Figure 7. It appears that the model reconstruction was not accurate when choosing the low ranks across the spatial modes, i.e., the spatial details were not fully recovered. It is noteworthy that the SIM1 tensor did not contain any noise. Hence, we expected to obtain a perfect reconstruction given that our model should capture the systematic variation in the data. We could further see that the reconstruction improved when increasing the spatial ranks. We concluded that the maximal rank was necessary across the spatial modes to obtain accurate data reconstructions. Therefore, P and Q were set to 30 for all Tucker decompositions.

The time mode rank (R), on the other hand, can be understood as the number of independent temporal profiles in the spatial data. For SIM1 and SIM2, we know that all of the time frames contained exactly the same spatial information. Therefore, we chose R = 1 to accurately reconstruct the systematic variation in these tensors. Stated in other words, there is only one independent temporal profile, i.e., all of the pixel values change jointly across the different frames.

However, when looking at SPAIN1, we could find two independent temporal patterns in the data. This is because the region consists of two different land cover types that varied from each other independently of time. We therefore applied a time mode rank R = 2 to model the systematic variations referring to the two sub-regions within the SPAIN1 data set.

In conclusion, we noted that more temporal components would be required if more independent temporal changes were present in the data. This would be the case when trying to gap-fill many very large areas at once using Tucker. However, this is not advisable due to the fact that decomposing large tensors would require considerably high amounts of computational resources, such as computer memory. To mitigate this problem, we limited the geographic size of the tensors and suggested that larger areas, such as SPAIN2, be gap-filled in an iterative fashion. If subsets are chosen to be sufficiently small enough, a low rank can be assumed in the time mode.

To demonstrate this iterative gap-filling procedure, we divided SPAIN2 into nine sub-regions yielding nine tensors of reduced dimensionality, each indicating 10 horizontal pixels × 10 vertical pixels × 52 time points. EM Tucker decompositions were conducted separately for all of the nine tensors.

The selected number of time mode components, R, for the nine EM Tucker decompositions of the SPAIN2 data set were 5, 5, 2, 2, 4, 4, 5, 2, and 4 when counting from top-left to bottom-right in a column-wise fashion. The reasoning behind choosing these ranks were the following: For the EM Tucker decomposition of SPAIN1, we applied R = 2 because we knew about the occurrence of two independent temporal patterns. The subsequent reconstruction of SPAIN1 (with 15% of the missing data) using the obtained Tucker model resulted in 75% of the explained variance among the non-missing pixels. Given the results from this real-world data set, we defined a threshold of 75%, meaning that for the application of the method to new data sets, then number of time mode components that are necessary to explain at least 75% of the variance among the non-missing pixels must be selected.

3.4. Metrics

To evaluate the gap-filling accuracy, the difference between the reconstructions and the original tensors needed to be estimated for SIM1, SIM2, and SPAIN1. This was completed using the relative root mean square error (RRMSE), which is the root mean squared error (RMSE) where data is missing, as shown in Equation (13), divided by the mean of the original tensor, see Equation (14). The overall sum of the elements in the tensor

X

,

\sum_{i}^{I} \sum_{j}^{J} \sum_{k}^{K} x_{i j k}

, was re-written as

\sum^{X} x_{i j k}

.

R M S E = \sqrt{\frac{1}{\sum^{Y} (1 - y_{i j k})} \sum^{X} {(x_{i j k}^{P r e d} - x_{i j k})}^{2} (1 - y_{i j k})}

(13)

R R M S E = \frac{R M S E}{\bar{x}}

(14)

As a second metric, the correlation coefficient between the original tensor and the reconstructed tensor was used. A correlation coefficient close to 1 suggests accurate reconstructions/imputation of the missing data. Only the tensor elements where data were missing were taken into account. The correlation coefficient

r^{3 D}

was calculated as an average across all time frames, as seen in Equation (16). The average predicted value per k-th frame,

{\bar{x}}_{k}^{P r e d}

, was calculated as described in Equation (15).

{\bar{x}}_{k}^{P r e d} = \frac{1}{\sum_{i} \sum_{j} (1 - y_{i j k})} \sum_{i} \sum_{j} x_{i j k}^{P r e d} (1 - y_{i j k})

(15)

r^{3 D} = \frac{1}{K} \sum_{k} \frac{\sum_{i} \sum_{j} (1 - y_{i j k}) (x_{i j k} - {\bar{x}}_{k}) (x_{i j k}^{P r e d} - {\bar{x}}_{k}^{P r e d})}{\sqrt{(\sum_{i} \sum_{j} (1 - y_{i j k}) {(x_{i j k} - {\bar{x}}_{k})}^{2}) (\sum_{i} \sum_{j} (1 - y_{i j k}) {(x_{i j k}^{P r e d} - {\bar{x}}_{k}^{P r e d})}^{2})}}

(16)

The structural similarity index (SSIM) [59] is a third metric that was utilized in this paper to better account for spatial differences between the original tensors and the reconstructed ones. A SSIM of 1 indicates the highest and 0 represents lowest possible structural similarity between images. Unlike previous metrics, SSIM takes into account both elements where missing data were present and where they were not. The tensor mean

\bar{x}

and standard deviation

σ_{x}

are defined as shown in Equations (17) and (18).

\bar{x} = \frac{\sum^{X} x_{i j k}}{N}

(17)

σ_{x} = \sqrt{\frac{\sum^{X} (x_{i j k} - \bar{x})}{N}}

(18)

where N represents the number of elements in tensor

X .

SSIM was then calculated in the following way:

σ_{x x^{P r e d}} = \frac{\sum^{X} (x_{i j k} - \bar{x}) \sum^{x^{P r e d}} (x_{i j k}^{P r e d} - {\bar{x}}^{P r e d})}{N}

(19)

3.5. Reference Methods

Table 2 provides an overview of all of the imputation methods that were applied to gap-fill the four data sets. In particular, four different reference methods were chosen to benchmark the gap-filling performance of EM Tucker, namely a simple mean imputation, single imputation Tucker, EM PCA, and a hybrid method. Hereby, the simple mean imputation method served as a baseline for the comparison, as it simply replaced all of the missing elements with the tensor mean

{\bar{x}}^{*}

. As such, the tensor mean was calculated by only taking into account elements where data were not missing, as denoted in Equation (20).

{\bar{x}}^{*} = \frac{1}{\sum^{Y} y_{i j k}} \sum^{X} x_{i j k} y_{i j k}

(20)

For single imputation Tucker (SI Tucker from here on), all missing elements were initially replaced using the tensor mean, as seen in Equation (20). Subsequently, a Tucker decomposition was performed on the dense tensor. In contrast to EM Tucker, the initial imputations of the missing elements were not updated during the ALS fitting procedure. The optimal number of components were chosen in a similar manner as the one described for EM Tucker (Section 3.3).

EM PCA is a two-way matrix decomposition. To facilitate the decomposition, the tensor was matricized, and PCA was applied to the resulting I × JK matrix using an EM algorithm [60]. The optimal number of PCA components varied between one and two components for SIM1, SIM2, and SPAIN1. For SPAIN2, the optimal component numbers were determined in a similar fashion as the one described for EM Tucker (Section 3.3).

Finally, performance of EM Tucker was compared to a hybrid spatio-temporal imputation method using a sliding time window. Its components are a temporal-based method that estimates missing pixels by calculating the average pixels in the same spatial location from the closest time frames where that information was available. For this analysis, a window size of six was used. When none of the six nearest time frames contained data in the same spatial location of the missing pixel, the missing data were filled using a spatial method, namely K Nearest Neighbors (KNN). This gradually happened more frequently, as the percentage of missing data increased. The KNN algorithm imputed the missing data using the corresponding value from the nearest neighbor column of the IJ x K matricized tensor. This procedure failed when all of the pixels in a time frame were missing.

We used Matlab (version 2017b) [61] and R (version 4.0.3) [62] during this study. The specific packages used for the imputations are stated in Table 2.

4. Results

Gap-filling accuracies for the SIM1 data set are shown in Figure 8, with the X-axis showing the percentage of missing data. For this simulation, low RRMSE’s were expected because the data simply consisted of identical frames. If a pixel was missing in one frame, the same information was available in at least one of the other 29 frames for reasonably low levels of missing data. The results reflect this expectation. All of the models indicated no error when 0% of the data were missing due to the fact that only missing pixels were considered during the RRMSE calculation. EM Tucker could reconstruct the tensor close to perfectly in the MCAR case with up to 80% data missing. The EM PCA algorithm failed when the missing data exceeded 70% in the MCAR case and 50% in the MAR case. This can be explained by the fact that no column means could be determined when entire columns of the matricized array were missing. For the EM PCA algorithm, this step was important in order to impute missing elements before iterative imputation was conducted.

The RRMSE for the SI Tucker model appeared to grow approximately linearly as the amount of missing data was increased and approached the level of the EM Tucker for both conditions, MCAR and MAR, as the missing data reached high levels of around 90%. EM PCA consistently performed worse than both EM Tucker and single imputation Tucker. All of the models outperformed the simple mean imputation baseline method, significantly except for EM Tucker when 95% of the or more was missing, and the hybrid method when the level was above 70%. The performance of the hybrid method was approximately on par with EM Tucker in the MCAR case for low levels of missing data but was outperformed by EM Tucker for levels of missing data above 40%. In the MAR case, the hybrid method also performed on par with EM Tucker for the same low levels of missing data, and it even outperformed EM Tucker slightly for MAR at 1–10% and MCAR 1–20%.

In the case of SIM2, Gaussian noise was added to the homogenous tensor, and all models performed significantly worse, except for EM PCA, which returned similar error levels as it did prior to adding noise. This puts the EM PCA imputation performance approximately on par with EM Tucker for the MAR case, and it outperformed EM Tucker in the MCAR case for all levels of missing data. These results are shown in Figure 9. The hybrid method was outperformed by single imputation Tucker at 50% levels of missing data for both cases, and it was outperformed by the simple mean imputation method, where the missing data exceeded 60%. Interestingly, EM Tucker was outperformed by both the mean imputation method and single imputation Tucker for levels of missing data higher than 80%.

The results for the same models applied to SPAIN1, the subsection of the Iberian data set, are shown in Figure 10. As expected, the resulting error rate was higher for all data completion methods in comparison to SIM1. This was expected, especially when considering the fact that SPAIN1 had higher variability in time and space than SIM1. On the other hand, the error rates were comparable to the ones obtained from SIM2. Notably, for SPAIN1, EM Tucker yielded the lowest RRMSE for all levels of missing data up to 90% for MCAR and up to 95% for MAR. The EM PCA imputation performed significantly worse compared to the other methods. However, it outperformed the hybrid method, where more than 40% of the data were missing, but the algorithm broke when the missing data reached 70% and 40% for MCAR and MAR, respectively. Single imputation Tucker outperformed the hybrid method when more than 50% of the data were missing, but it never matched the performance of EM Tucker.

Evidently, EM Tucker was capable of reconstructing the tensor accurately, even when only fractions of the data remained, with a RRMSE of 12.6% (MCAR) and 16.8% (MAR) for 90% missing data.

To further investigate the performance of the methods when applied to SPAIN1, the average correlation between the ground truth and the gap-filled data was calculated as shown in Equation (16) at different levels of missing data. These results are displayed in Figure 11. For the same purpose, the SSIM was calculated and is shown in Figure 12.

The SSIM and correlation analysis generally revealed the same model hierarchy as the one that was stated previously. The SSIM scores were generally higher than the correlation scores due to the fact that SSIM was calculated on a frame-by-frame basis, while the correlation was calculated on a pixel-by-pixel basis, only taking missing pixels into account. The results from the mean imputation approach had zero correlation to the original tensor. This is a model that predicted the same value

{\bar{x}}^{*}

, for every pixel that was missing, and it is therefore immediately clear that the correlation is zero. The single imputation Tucker model performed noticeably worse than EM Tucker did. EM Tucker displayed the best performance out of all of the algorithms, especially when the missing data exceeded 40%. Remarkably, EM Tucker maintained a high average correlation at very high levels of missing data. The EM Tucker reconstructions maintained the highest level of SSIM throughout all levels of missing data up to 90%, where it was outperformed by mean imputation for both MCAR and MAR. The main difference between the correlation analysis and the SSIM analysis is that EM PCA scored slightly higher on the SSIM metric in relation to the other methods.

Concluding with respect to SPAIN1, we found a clear model hierarchy in terms of gap-filling performance. EM Tucker significantly outperformed all of the methods that were tested, especially at high levels of missing data, only being outperformed by the mean imputation approach when the missing data reached 95% (for both MCAR and MAR). Single imputation Tucker generally performed better than the hybrid method, which, in turn, generally performed better than the EM PCA algorithm. The hybrid method performed slightly better in the MCAR simulated homogenous case when the frames were all equal (SIM1), but EM Tucker performed significantly better for all other cases. However, it is important to note here that the real missing data in the Iberian data set are not MCAR—they more closely resemble the MAR case.

Lastly, naturally missing data from the fourth data set, namely SPAIN2, was gap-filled using the five methods. The results for the selected time frames are shown in Figure 13, where the first column shows the original frames prior to imputation. Missing values are represented in white.

The individual frames indicate a grid, which specifies the nine sub-regions, that was used to gap-fill the data in an iterative fashion as outlined in Section 3.3. The time frames that were chosen to be displayed contained different levels of missing data and were sampled throughout the year to cover all seasons, with lower NDVI values in the Sierra Nevada mountains (blue color in the lower center grid). The areas with higher NDVI during the summer correspond to forested areas in mountain ranges, and the largest seasonality in NDVI corresponds to agricultural crops showing low NDVI during the summer (water limited) and winter (temperature limited). It is noteworthy that only the values that were originally missing change in the reconstructed frames. Therefore, the reconstructed frames may look somewhat similar between the models at first glance.

It is generally clear that the mean imputation method is heavily biased, as clear homogeneous patches were used to gap-fill the missing data. EM PCA, on the other hand, results in somewhat blurry reconstructions or gap-filled information. In contrast, SI Tucker, the hybrid method, and EM Tucker lead to well-balanced imputations. Nonetheless, the gap-filled information differs for these three algorithms. 23 July clearly indicates that SI Tucker results in different reconstructions, i.e., when looking at the middle right sub-region. For the purposes of interpretation, we have included the results for the previous day, namely 22 July. Given the assumption that we would not expect regions to change drastically from one day to another, one can conclude that SI Tucker fails to predict higher NDVI regions in the middle-right sub-region of 23 July. EM Tucker as well as the hybrid method results support that conclusion, as both methods predict higher NDVIs in that region.

The computation times for all of the methods applied to SPAIN2 are shown in Table 3. This is the combined time that it took to the impute missing data for all nine sub-tensors within SPAIN2. EM Tucker was noticeably slower than the other methods, with a total imputation time of just over 6 min. The analysis was conducted on an Intel(R) Core i5-8250U with 4.00 GB RAM.

5. Discussion

Looking at the results, it appears that both the hybrid method as well as EM Tucker are good choices for gap-filling missing NDVI image data. While the application of the hybrid method is easily scalable to larger geographical areas due to the simple nature of the algorithm, i.e., each pixel can be processed individually, the major advantage of EM Tucker becomes evident when large fractions of the data are missing. Our results show that EM Tucker can beat the performance of the hybrid method in such cases. This can be explained by the fact that EM Tucker utilizes temporal as well as spatial information simultaneously, while the algorithm of the hybrid method uses either one or the other to gap-fill the missing elements. In addition, we showed that EM Tucker performs better than the SI Tucker algorithm due to the fact that the former updates the imputed values during the alternating least squares fitting procedure, as seen in Equation (10). This makes the EM Tucker algorithm less prone to being negatively affected by initial incorrect imputations, i.e., the initial replacement of missing elements with column, row, or tensor means or combinations thereof.

The results obtained through the simulation studies were used to evaluate and compare the gap-filling performance of the algorithms. Furthermore, these results were utilized to define model selection strategies, i.e., to choose to correct number of components for the Tucker decomposition. We have compared methods under two scenarios, namely when data were missing completely at random (MCAR) and when data were missing at random (MAR), i.e., block-wise. Although we stated that natural missing data would match the MAR scenario, one must underline the fact that certain geographical regions, e.g., mountainous areas, might violate the randomness assumption, meaning that cloud patches might not be randomly distributed, both with respect to temporal as well as to the spatial domain. To avoid problems and to ensure that our results from SIM1, SIM2, and SPAIN1 generalize well to real-world applications, we suggest gap-filling larger areas iteratively using sub-regions of sufficiently small geographical size. This will reduce model complexity and will enable the method to be applied to very large areas, as the decompositions of the individual sub-regions can be distributed across several computational workers.

Besides the imputation of missing data, tensor decompositions such as Tucker can be used to detect anomalies if the data contain spatial or temporal indications of them. These anomalies can be caused by sudden phenological changes from, e.g., fire, insect attacks, deforestation, or drought. As discussed in Section 3.3, this would lead to additional independent temporal profiles and, therefore, could only be explained by the Tucker model if the time mode components were increased accordingly. For monitoring purposes, scores in the loading matrix A together with the abnormal residuals in

E

could be used to identify outlying behavior (see Figure 6a). However, it was not within the scope of this study to evaluate the methods towards their ability to detect anomalies or outliers. We refer the reader to other studies that highlight the potential of tensor decompositions for anomaly detection in satellite data [64].

6. Conclusions

In this study, we presented a proof-of-concept that EM Tucker can be used to gap-fill missing data in NDVI satellite images. We benchmarked the method against four reference methods using three data sets with simulated clouds as well as one real-world data set. While missing data were added artificially to the simulated data sets, a sub-region of the Iberian Peninsula was used to gap-fill naturally occurring missing data. Benchmark results indicated that EM Tucker offers superior performance in terms of reconstruction accuracy, especially when larger fractions of up to 95% of the data were missing. This could be well explained by the fact that the algorithm utilizes temporal as well as spatial information in a simultaneous fitting procedure, allowing to all of the available systematic variation to be incorporated in order to reconstruct the missing elements.

To underline the applicability of the method to real-world scenarios, we applied EM Tucker to a larger area in the south of the Iberian Peninsula, which contained 15% missing data overall. The results were presented visually and were compared to the four reference methods. In order to process larger regions, we propose dividing the area into smaller geographical sub-regions to be gap-filled individually. This approach lowers the requirements for computational resources, while offering the ability to process sub-regions in parallel at the same time. This can be understood as processing a given area using a “moving box” imputation.

The results suggest that EM Tucker is a viable method to correct measuring errors and to fill in missing data for satellite imagery. Furthermore, the high accuracy of the algorithm could enable improved satellite imagery pre-processing, especially when a big proportion of the data are corrupted or missing.

Author Contributions

A.F.Þ. and A.B. conducted the data analysis and drafted the manuscript. A.B. planned and designed the thesis project. A.B., M.G. and A.S. supervised the study. S.M.V.-S. and M.G. provided the data and provided domain knowledge and feedback. All authors contributed to the drafting of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This study did not receive any external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

AIC	Akaike information criterion
ALS	Alternating least squares
AVHRR	Advanced very-high-resolution radiometer
BIC	Bayesian information criterion
EM	Expectation maximization
HOSVD	Higher order singular value decomposition
KNN	K-nearest neighbors
MAR	Missing at random
MCAR	Missing completely at random
MODIS	Moderate resolution imaging spectroradiometer
NDVI	Normalized difference vegetation index
PCA	Principal component analysis
RMSE	Root mean square error
RRMSE	Relative root mean square error
SI	Single imputation
SSIM	Structural similarity index

References

Shen, H.; Li, X.; Cheng, Q.; Zeng, C.; Yang, G.; Li, H.; Zhang, L. Missing Information Reconstruction of Remote Sensing Data: A Technical Review. IEEE Geosci. Remote Sens. Mag. 2015, 3, 61–85. [Google Scholar] [CrossRef]
Wang, Q.; Wang, L.; Wei, C.; Jin, Y.; Li, Z.; Tong, X.; Atkinson, P.M. Filling gaps in Landsat ETM+ SLC-off images with Sentinel-2 MSI images. Int. J. Appl. Earth Obs. Geoinf. 2021, 101, 102365. [Google Scholar] [CrossRef]
Zhang, L.; Wu, X. An edge-guided image interpolation algorithm via directional filtering and data fusion. IEEE Trans. Image Process. 2006, 15, 2226–2238. [Google Scholar] [CrossRef] [Green Version]
Zhang, Q.; Yuan, Q.; Zeng, C.; Li, X.; Wei, Y. Missing Data Reconstruction in Remote Sensing Image With a Unified Spatial–Temporal–Spectral Deep Convolutional Neural Network. IEEE Trans. Geosci. Remote Sens. 2018, 56, 4274–4288. [Google Scholar] [CrossRef] [Green Version]
Holben, B.N. Characteristics of maximum-value composite images from temporal AVHRR data. Int. J. Remote Sens. 1986, 7, 1417–1434. [Google Scholar] [CrossRef]
Zhu, Z.; Woodcock, C.E.; Holden, C.; Yang, Z. Generating synthetic Landsat images based on all available Landsat data: Predicting Landsat surface reflectance at any given time. Remote Sens. Environ. 2015, 162, 67–83. [Google Scholar] [CrossRef]
Lin, C.-H.; Lai, K.-H.; Chen, Z.-B.; Chen, J.-Y. Patch-Based Information Reconstruction of Cloud-Contaminated Multitemporal Images. IEEE Trans. Geosci. Remote Sens. 2013, 52, 163–174. [Google Scholar] [CrossRef]
Lin, C.-H.; Tsai, P.-H.; Lai, K.-H.; Chen, J.-Y. Cloud Removal From Multitemporal Satellite Images Using Information Cloning. IEEE Trans. Geosci. Remote Sens. 2012, 51, 232–241. [Google Scholar] [CrossRef]
Zhang, X.; Qin, F.; Qin, Y. Study on the Thick Cloud Removal Method Based on Multi-Temporal Remote Sensing Images. In Proceedings of the 2010 International Conference on Multimedia Technology, Ningbo, China, 29–31 October 2010; IEEE: Ningbo, China, 2010. [Google Scholar]
Li, M.; Liew, S.C.; Kwoh, L.K. Producing Cloud Free and Cloud-Shadow Free Mosaic from Cloudy IKONOS Images. In Proceedings of the IGARSS 2003. 2003 IEEE International Geoscience and Remote Sensing Symposium, Toulouse, France, 21–25 July 2003; IEEE: Toulouse, France, 2003. Proceedings (IEEE Cat. No.03CH37477). [Google Scholar]
Helmer, E.H.; Ruefenacht, B. Cloud-Free Satellite Image Mosaics with Regression Trees and Histogram Matching. Photogramm. Eng. Remote Sens. 2005, 71, 1079–1089. [Google Scholar] [CrossRef] [Green Version]
Chen, J.; Jönsson, P.; Tamura, M.; Gu, Z.; Matsushita, B.; Eklundh, L. A simple method for reconstructing a high-quality NDVI time-series data set based on the Savitzky–Golay filter. Remote Sens. Environ. 2004, 91, 332–344. [Google Scholar] [CrossRef]
Julien, Y.; Sobrino, J.A. Comparison of cloud-reconstruction methods for time series of composite NDVI data. Remote Sens. Environ. 2010, 114, 618–625. [Google Scholar] [CrossRef]
Viovy, N.; Arino, O.; Belward, A.S. The Best Index Slope Extraction (BISE): A method for reducing noise in NDVI time-series. Int. J. Remote Sens. 1992, 13, 1585–1590. [Google Scholar] [CrossRef]
Ma, M.; Veroustraete, F. Reconstructing pathfinder AVHRR land NDVI time-series data for the Northwest of China. Adv. Space Res. 2005, 37, 835–840. [Google Scholar] [CrossRef]
Zhu, W.; Pan, Y.; He, H.; Wang, L.; Mou, M.; Liu, J. A Changing-Weight Filter Method for Reconstructing a High-Quality NDVI Time Series to Preserve the Integrity of Vegetation Phenology. IEEE Trans. Geosci. Remote Sens. 2011, 50, 1085–1094. [Google Scholar] [CrossRef]
Song, C.; Huang, B.; You, S. Comparison of Three Time-Series NDVI Reconstruction Methods Based on TIMESAT. In Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, Munich, Germany, 22–27 July 2012. [Google Scholar]
Julien, Y.; Sobrino, J.A. Global land surface phenology trends from GIMMS database. Int. J. Remote Sens. 2009, 30, 3495–3513. [Google Scholar] [CrossRef] [Green Version]
Jönsson, P.; Eklundh, L. Seasonality extraction by function fitting to time-series of satellite sensor data. IEEE Trans. Geosci. Remote Sens. 2002, 40, 1824–1832. [Google Scholar] [CrossRef]
Beck, P.S.A.; Atzberger, C.; Høgda, K.A.; Johansen, B.; Skidmore, A.K. Improved Monitoring of Vegetation Dynamics at Very High Latitudes: A New Method Using MODIS NDVI. Remote Sens. Environ. 2006, 100, 321–334. [Google Scholar] [CrossRef]
Sellers, P.J.; Tucker, C.J.; Collatz, G.J.; Los, S.O.; Justice, C.O.; Dazlich, D.A.; Randall, D.A. A global 1° by 1° NDVI data set for climate studies. Part 2: The generation of global fields of terrestrial biophysical parameters from the NDVI. Int. J. Remote Sens. 1994, 15, 3519–3545. [Google Scholar] [CrossRef]
Ghaderpour, E.; Vujadinovic, T. Change Detection within Remotely Sensed Satellite Image Time Series via Spectral Analysis. Remote Sens. 2020, 12, 4001. [Google Scholar] [CrossRef]
Lorenzi, L.; Melgani, F.; Mercier, G. Missing-Area Reconstruction in Multispectral Images Under a Compressive Sensing Perspective. IEEE Trans. Geosci. Remote Sens. 2013, 51, 3998–4008. [Google Scholar] [CrossRef]
Li, X.; Shen, H.; Zhang, L.; Zhang, H.; Yuan, Q.; Yang, G. Recovering Quantitative Remote Sensing Products Contaminated by Thick Clouds and Shadows Using Multitemporal Dictionary Learning. IEEE Trans. Geosci. Remote Sens. 2014, 52, 7086–7098. [Google Scholar] [CrossRef]
Wang, L.; Qu, J.J.; Xiong, X.; Hao, X.; Xie, Y.; Che, N. A New Method for Retrieving Band 6 of Aqua MODIS. IEEE Geosci. Remote Sens. Lett. 2006, 3, 267–270. [Google Scholar] [CrossRef]
Zeng, C.; Shen, H.; Zhang, L. Recovering missing pixels for Landsat ETM+ SLC-off imagery using multi-temporal regression analysis and a regularization method. Remote Sens. Environ. 2013, 131, 182–194. [Google Scholar] [CrossRef]
Cheng, Q.; Shen, H.; Zhang, L.; Yuan, Q.; Zeng, C. Cloud removal for remotely sensed images by similar pixel replacement guided with a spatio-temporal MRF model. ISPRS J. Photogramm. Remote Sens. 2014, 92, 54–68. [Google Scholar] [CrossRef]
Benabdelkader, S.; Melgani, F. Contextual Spatiospectral Postreconstruction of Cloud-Contaminated Images. IEEE Geosci. Remote Sens. Lett. 2008, 5, 204–208. [Google Scholar] [CrossRef]
Sarafanov, M.; Kazakov, E.; Nikitin, N.O.; Kalyuzhnaya, A.V. A Machine Learning Approach for Remote Sensing Data Gap-Filling with Open-Source Implementation: An Example Regarding Land Surface Temperature, Surface Albedo and NDVI. Remote Sens. 2020, 12, 3865. [Google Scholar] [CrossRef]
Mørup, M. Applications of tensor (multiway array) factorizations and decompositions in data mining. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2011, 1, 24–40. [Google Scholar] [CrossRef]
Tomasi, G.; Bro, R. PARAFAC and missing values. Chemom. Intell. Lab. Syst. 2005, 75, 163–180. [Google Scholar] [CrossRef]
Liu, J.; Musialski, P.; Wonka, P.; Ye, J. Tensor Completion for Estimating Missing Values in Visual Data. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 35, 208–220. [Google Scholar] [CrossRef]
Asif, M.T.; Mitrovic, N.; Dauwels, J.; Jaillet, P. Matrix and Tensor Based Methods for Missing Data Estimation in Large Traffic Networks. IEEE Trans. Intell. Transp. Syst. 2016, 17, 1816–1825. [Google Scholar] [CrossRef]
Bro, R. Review on Multiway Analysis in Chemistry—2000–2005. Crit. Rev. Anal. Chem. 2006, 36, 279–293. [Google Scholar] [CrossRef]
Carroll, J.D.; Chang, J.-J. Analysis of individual differences in multidimensional scaling via an n-way generalization of “Eckart-Young” decomposition. Psychometrika 1970, 35, 283–319. [Google Scholar] [CrossRef]
Tan, H.; Feng, J.; Chen, Z.; Yang, F.; Wang, W. Low Multilinear Rank Approximation of Tensors and Application in Missing Traffic Data. Adv. Mech. Eng. 2014, 6, 157597. [Google Scholar] [CrossRef] [Green Version]
Lasanta, T.; Vicente-Serrano, S.M. Complex land cover change processes in semiarid Mediterranean regions: An approach using Landsat images in northeast Spain. Remote Sens. Environ. 2012, 124, 1–14. [Google Scholar] [CrossRef]
Stellmes, M.; Röder, A.; Udelhoven, T.; Hill, J. Mapping syndromes of land change in Spain with remote sensing time series, demographic and climatic data. Land Use Policy 2013, 30, 685–702. [Google Scholar] [CrossRef]
Kuemmerle, T.; Levers, C.; Erb, K.; Estel, S.; Jepsen, M.R.; Müller, D.; Plutzar, C.; Stürck, J.; Verkerk, P.J.; Verburg, P.; et al. Hotspots of land use change in Europe. Environ. Res. Lett. 2016, 11, 064020. [Google Scholar] [CrossRef]
Hill, J.; Stellmes, M.; Udelhoven, T.; Röder, A.; Sommer, S. Mediterranean desertification and land degradation: Mapping related land use change syndromes based on satellite observations. Glob. Planet. Chang. 2008, 64, 146–157. [Google Scholar] [CrossRef]
Gouveia, C.M.; Páscoa, P.; Russo, A.; Trigo, R.M. Land Degradation Trend Assessment over Iberia during 1982–2012. Cuad. Investig. Geogr. 2016, 42, 89. [Google Scholar] [CrossRef] [Green Version]
Spinoni, J.; Barbosa, P.; Bucchignani, E.; Cassano, J.; Cavazos, T.; Christensen, J.H.; Christensen, O.B.; Coppola, E.; Evans, J.; Geyer, B.; et al. Future Global Meteorological Drought Hot Spots: A Study Based on CORDEX Data. J. Clim. 2020, 33, 3635–3661. [Google Scholar] [CrossRef]
Noguera, I.; Domínguez-Castro, F.; Vicente-Serrano, S.M. Flash Drought Response to Precipitation and Atmospheric Evaporative Demand in Spain. Atmosphere 2021, 12, 165. [Google Scholar] [CrossRef]
Del Barrio, G.; Puigdefábregas, J.; Sanjuán, M.E.; Stellmes, M.; Ruiz, A. Assessment and monitoring of land condition in the Iberian Peninsula, 1989–2000. Remote Sens. Environ. 2010, 114, 1817–1832. [Google Scholar] [CrossRef]
Lanfredi, M.; Coppola, R.; Simoniello, T.; Coluzzi, R.; D’Emilio, M.; Imbrenda, V.; Macchiato, M. Early Identification of Land Degradation Hotspots in Complex Bio-Geographic Regions. Remote Sens. 2015, 7, 8154–8179. [Google Scholar] [CrossRef] [Green Version]
Dardel, C.; Kergoat, L.; Hiernaux, P.; Mougin, E.; Grippa, M.; Tucker, C.J. Re-greening Sahel: 30years of remote sensing data and field observations (Mali, Niger). Remote Sens. Environ. 2014, 140, 350–364. [Google Scholar] [CrossRef]
Vicente-Serrano, S.; Cabello, D.; Tomás-Burguera, M.; Martín-Hernández, N.; Beguería, S.; Azorin-Molina, C.; Kenawy, A. Drought variability and land degradation in semiarid regions: Assessment using remote sensing data and drought indices (1982–2011). Remote Sens. 2015, 7, 4391–4423. [Google Scholar] [CrossRef] [Green Version]
Vicente-Serrano, S.M.; Martín-Hernández, N.; Reig, F.; Azorin-Molina, C.; Zabalza, J.; Beguería, S.; Domínguez-Castro, F.; El Kenawy, A.; Peña-Gallardo, M.; Noguera, I.; et al. Vegetation greening in spain detected from long term data (1981–2015). Int. J. Remote Sens. 2019, 41, 1709–1740. [Google Scholar] [CrossRef]
Gazol, A.; Camarero, J.J.; Vicente-Serrano, S.M.; Sánchez-Salguero, R.; Gutierrez, E.; de Luis, M.; Sangüesa-Barreda, G.; Novak, K.; Rozas, V.; Tíscar, P.A.; et al. Forest resilience to drought varies across biomes. Glob. Chang. Biol. 2018, 24, 2143–2158. [Google Scholar] [CrossRef]
Vicente-Serrano, S.M.; Azorin-Molina, C.; Peña-Gallardo, M.; Tomas-Burguera, M.; Domínguez-Castro, F.; Martín-Hernández, N.; Beguería, S.; El Kenawy, A.; Noguera, I.; García, M. A high-resolution spatial assessment of the impacts of drought variability on vegetation activity in Spain from 1981 to 2015. Nat. Hazards Earth Syst. Sci. 2019, 19, 1189–1213. [Google Scholar] [CrossRef] [Green Version]
Vicente-Serrano, S.M.; Martín-Hernández, N.; Camarero, J.J.; Gazol, A.; Sánchez-Salguero, R.; Peña-Gallardo, M.; El Kenawy, A.; Domínguez-Castro, F.; Tomas-Burguera, M.; Gutiérrez, E.; et al. Linking tree-ring growth and satellite-derived gross primary growth in multiple forest biomes. Temporal-scale matters. Ecol. Indic. 2019, 108, 105753. [Google Scholar] [CrossRef]
D’Odorico, P.; Bhattachan, A. Hydrologic variability in dryland regions: Impacts on ecosystem dynamics and food security. Philos. Trans. R. Soc. B. Biol. Sci. 2012, 367, 3145–3157. [Google Scholar] [CrossRef] [Green Version]
Scheffer, M. Foreseeing tipping points. Nature 2010, 467, 411–412. [Google Scholar] [CrossRef]
Afanador, N. Expectation Maximization (EM) For Imputation of Missing Values. mvdalab v1.4. Available online: https://rdrr.io/cran/mvdalab/man/imputeEM.html (accessed on 3 October 2021).
Walczak, B.; Massart, D.L. Dealing with missing data: Part I. Chemom. Intell. Lab. Syst. 2001, 58, 15–27. [Google Scholar] [CrossRef]
Moon, T.K. The expectation-maximization algorithm. IEEE Signal Process. Mag. 1996, 13, 47–60. [Google Scholar] [CrossRef]
Andersson, C.A.; Bro, R. The N-Way Toolbox for MATLAB. Chemom. Intell. Lab. Syst. 2000, 52, 1–4. [Google Scholar] [CrossRef]
Atkinson, P.M.; Jeganathan, C.; Dash, J.; Atzberger, C. Inter-comparison of four models for smoothing satellite sensor time-series data to estimate vegetation phenology. Remote Sens. Environ. 2012, 123, 400–417. [Google Scholar] [CrossRef]
Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef] [Green Version]
Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data Via The EM Algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 1977, 39, 1–22. [Google Scholar] [CrossRef]
MATLAB. 1.8.0121 (R2017b), The MathWorks Inc.: Natick, MA, USA, 2017.
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2020. [Google Scholar]
Henson, R.; Cetto, L. The MATLAB bioinformatics toolbox. In Encyclopedia of Genetics, Genomics, Proteomics and Bioinformatics; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2005. [Google Scholar]
Shin, Y.; Lee, S.; Tariq, S.; Lee, M.S.; Jung, O.; Chung, D.; Woo, S.S. ITAD. In Proceedings of the 29th ACM International Conference on Information and Knowledge Management, New York, NY, USA, 19 October 2020; ACM: New York, NY, USA, 2020. [Google Scholar]

Figure 1. Seasonal NDVI differences on the Iberian Peninsula in 2008: 23 January (a), 2 April (b), 18 June (c), 24 August (d), and 15 November (e). The smaller black rectangle near the middle of the peninsula shows study region 1, and the larger black rectangle in the south shows study region 2. The last subfigure (f) depicts a mask, which was used to separate ocean from land, with white representing land, and black representing ocean.

Figure 2. Enlarged study regions as indicated by the black rectangles in Figure 1 referring to 23 January. (a) shows study region 1, and (b) shows study region 2. Note that the latter is three times larger than the former. Study region 2 contains natural missing data (white), unlike study region 1.

Figure 3. A single time frame from study region 1 corresponding to 23 January is shown in (a). This particular day was selected because it contained no missing data. The chosen frame was replicated 30 times (b) to form a 30 × 30 × 30 data set (SIM1). Subsequently, missing data were added artificially.

Figure 4. Selected time frames in the SPAIN1 subsection from the Iberian data set (study region 1), which contained no missing data. The important thing to notice here is that the frames change over time.

Figure 5. Three different levels of missing data for one frame in SIM1 are displayed as examples. Upper row indicates data missing completely at random (MCAR). Lower row indicates data missing at random (MAR).

Figure 7. Tucker model reconstructions for a single time frame shown for different ranks in the spatial modes (P, Q). The tensor that was decomposed here consisted of 30 identical 30 × 30 frames (SIM1). The reconstruction becomes more accurate as the ranks in both spatial modes are increased.

Figure 8. RRMSE for different models using the homogeneous tensor, SIM1, with artificially added MCAR (a) and MAR (b) elements. RRMSE values were calculated at positions indicated by the light grey vertical lines.

Figure 9. RRMSE for different models using the noisy tensor, SIM2, with artificially added MCAR (a) and MAR (b) elements.

Figure 10. RRMSE for different models applied to SPAIN1. The tensor contained artificially added MCAR (a) and MAR (b) elements.

Figure 11. Average correlation between the ground truth and the gap-filled data for SPAIN1 based on each of the methods at different levels of artificially added missing data, both for MCAR (a) and MAR (b). The correlation for the mean imputation method was always zero for all levels of missing data (the black line).

Figure 12. Structural similarity indices (SSIM) for SPAIN1 for all gap-filling methods at different levels of artificially added missing data, both for MCAR (a) and MAR (b).

Figure 13. Selected time frames from SPAIN2 (study region 2) are shown across rows (Sierra Nevada mountains appear in light blue (NDVI = 0) in winter in the low-center cell). The first column shows the SPAIN2 frames prior to imputation, where missing values are represented in white. The different imputation methods are shown across the remaining columns. Imputing the missing values, the SPAIN2 tensor was divided into nine equally large 30 × 30 × 66 sub-tensors, and the imputation was conducted on each sub-tensor individually. The blue grid, drawn on top of the each of the frames, shows this division.

Table 1. An overview of all data sets used in this study. The resulting data sets will be referred to by the corresponding aliases.

Alias	Description	Dimension
SIM1	Constructed by repeating a single time frame from study region 1 with no missing data. Missing data were added artificially. Used for model evaluation.	30 × 30 × 30
SIM2	Constructed by adding noise to SIM1. Missing data were added artificially. Used for model evaluation.	30 × 30 × 30
SPAIN1	All time frames from study region 1 with no missing data. Missing data were added artificially. Used for model evaluation.	30 × 30 × 54
SPAIN2	Study region 2 with natural missing data. No ground truth data available. Used to demonstrate the performance of the models visually.	90 × 90 × 66

Table 2. An overview of all imputation methods applied in this project.

Alias	Description	Software
Single mean imputation	Tensor mean imputed for missing values	No external code used
Single imputation Tucker (SI Tucker)	Tensor mean was imputed for missing values prior to decomposition	“tucker” function, N-Way Toolbox, Matlab [54]
Hybrid method	Running-window temporal imputation. Remaining missing data then imputed with KNN	“knnimpute” function, Bioinformatics toolbox, Matlab [63]
EM PCA	Column mean was imputed prior to iterative PCA decomposition	“imputeEM” function, mvdlab package, R [55]
EM Tucker	A combination of row and column mean was imputed prior to iterative decomposition	“tucker” function, N-Way Toolbox, Matlab [54]

Table 3. Total computation time for all methods applied to SPAIN2.

Method	Total Computation Time [s]
Simple mean imputation	0.03
Single imputation Tucker	5.11
Hybrid method	1.26
EM PCA	6.06
EM Tucker	363.94

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Þórðarson, A.F.; Baum, A.; García, M.; Vicente-Serrano, S.M.; Stockmarr, A. Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns. Remote Sens. 2021, 13, 4007. https://doi.org/10.3390/rs13194007

AMA Style

Þórðarson AF, Baum A, García M, Vicente-Serrano SM, Stockmarr A. Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns. Remote Sensing. 2021; 13(19):4007. https://doi.org/10.3390/rs13194007

Chicago/Turabian Style

Þórðarson, Andri Freyr, Andreas Baum, Mónica García, Sergio M. Vicente-Serrano, and Anders Stockmarr. 2021. "Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns" Remote Sensing 13, no. 19: 4007. https://doi.org/10.3390/rs13194007

APA Style

Þórðarson, A. F., Baum, A., García, M., Vicente-Serrano, S. M., & Stockmarr, A. (2021). Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns. Remote Sensing, 13(19), 4007. https://doi.org/10.3390/rs13194007

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Gap-Filling of NDVI Satellite Data Using Tucker Decomposition: Exploiting Spatio-Temporal Patterns

Abstract

1. Introduction

2. Study Region and Data Set

2.1. Study Region

2.2. Data Sets and Pre-Processing

3. Methods

3.1. Tucker Decomposition

3.2. EM Tucker for Imputation of Missing Values

3.3. Model Selection

3.4. Metrics

3.5. Reference Methods

4. Results

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI