Sustainable Development: Evaluating Optimal Technique for Spatial Data Forecast †

This paper presents an approach to forecasting spatial data in the time series domain. In GIS (Geographical Information System) applications, raster forecasting is often needed. Moving average, exponential smoothing, and linear regression are forecasting methods normally applied to one-dimensional data. The present work applies these methods to satellite images, pixel by pixel, across historical temporal satellite data. An example set of satellite images from the years 2011 to 2015 has been used to forecast the image for the year 2016. GIS tools have been developed in ArcGIS 10.1 using Python to implement the forecasting methods. The forecasted and actual images for the year 2016 have been compared by calculating the Normalized Difference Vegetation Index (NDVI) and applying change detection to identify the best method.


Introduction
GIS has been used worldwide for the presentation and analysis of spatial data, providing an effective platform for gathering different types of information from various sources into one system [1]. It represents a useful solution for managing vast, naturally spatially variable datasets [2]. Forecasting should not only visualize the data in GIS but also estimate its evolution [3]. Scenario development is a method of identifying the different possible circumstances for a resource under consideration with respect to the several factors affecting it. One such scenario is business as usual (BAU), which follows the same trend as the historical data. Various forecasting methods have been used for evaluating BAU scenarios, among them exponential smoothing, moving averages, and regressions [4]. This paper is an attempt to forecast variables with these three methods over a historical dataset of five raster images.

Moving Average Method
The moving average method of forecasting is a time series technique that uses only recent history and averages multiple observations. It places more weight on the most recent observations [5,6]. The steps performed in the method are as follows:
1. Calculate the Simple Moving Average (SMA) with N = 2, 3, 4 and 5.
3. Forecast using the forecasting relation.
To average over the maximum number of historical data, we have taken N = 3.
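The pixel-by-pixel use of a moving average can be sketched in Python as follows. This is a minimal illustration and not the ArcGIS tool described in this paper: it assumes the plain one-step simple moving average forecast (the mean of the last N observations), and the function names sma_forecast and forecast_raster, as well as the nested-list raster layout, are hypothetical.

```python
def sma_forecast(series, n=3):
    """One-step-ahead forecast: mean of the last n observations.

    Assumption: plain simple moving average; the paper's own
    forecasting relation is not reproduced here.
    """
    if not 1 <= n <= len(series):
        raise ValueError("n must be between 1 and len(series)")
    window = series[-n:]
    return sum(window) / n


def forecast_raster(stack, n=3):
    """Apply sma_forecast to every pixel of a time-ordered stack.

    stack[t][row][col] is the intensity at time step t (hypothetical
    layout); all grids are assumed to have the same shape.
    """
    rows, cols = len(stack[0]), len(stack[0][0])
    return [
        [sma_forecast([grid[r][c] for grid in stack], n) for c in range(cols)]
        for r in range(rows)
    ]
```

For a single pixel with yearly values 10, 12, 14, 16, 18 and N = 3, the sketch averages the last three values and returns 16.0.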

Exponential Smoothing Method
In exponential smoothing, the forecast is a weighted sum of the last observation and the previous forecast, i.e., it gives more weight to the last observation and decreasing weights to earlier observations [5,6]. The steps performed in the method are as follows:
1. Calculate Simple Exponential Smoothing estimates for different values (0.4…0.8) of α.
2. Calculate Double Exponential Smoothing estimates for all values of α.
3. Forecast using the forecasting relation.
To suit our quantity of historical data, we have taken α = 0.45, since α = 1/√d, where d = number of data [4].
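The simple and double smoothing steps can be sketched for one pixel's time series as follows. This is a hedged illustration only: it assumes Brown's linear (double) exponential smoothing, initialises both smoothed series with the first observation, and reads the rule for α as 1/√d (which gives about 0.45 for five images). The names default_alpha and brown_des_forecast are hypothetical, and the forecasting relation actually used in this paper may differ.

```python
import math


def default_alpha(d):
    """Smoothing factor from the number of data points d.

    Assumption: alpha = 1 / sqrt(d), which is about 0.45 for d = 5.
    """
    return 1 / math.sqrt(d)


def brown_des_forecast(series, alpha=None, steps_ahead=1):
    """Forecast with Brown's double exponential smoothing (assumed form)."""
    if len(series) < 2:
        raise ValueError("need at least two observations")
    if alpha is None:
        alpha = default_alpha(len(series))
    if not 0 < alpha < 1:
        raise ValueError("alpha must lie strictly between 0 and 1")

    s1 = s2 = series[0]  # assumed initialisation of both smoothed series
    for x in series[1:]:
        s1 = alpha * x + (1 - alpha) * s1   # simple exponential smoothing
        s2 = alpha * s1 + (1 - alpha) * s2  # double exponential smoothing

    level = 2 * s1 - s2
    trend = alpha / (1 - alpha) * (s1 - s2)
    return level + steps_ahead * trend
```

Calling brown_des_forecast with a five-value series and steps_ahead=1 corresponds to forecasting one time step beyond the last image.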

Linear Regression Method
This is a causal model, in which a mathematical relationship is set up between the variable of interest and the causal factors, and the forecast is obtained by inserting the values of the causal factors into the mathematical model [5,6]. For n historical data points (x1, y1), (x2, y2), …, (xn, yn), the forecasted value of y for the desired xi is obtained from a linear relation whose coefficients, including a, are estimated from these points.
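A linear regression forecast for one pixel can be sketched as below. The sketch assumes an ordinary least-squares line fitted to the historical points; the names linear_forecast, slope, and intercept are hypothetical and are not meant to match the coefficient symbols of the relation used in this paper.

```python
def linear_forecast(xs, ys, x_new):
    """Fit y = intercept + slope * x by least squares and evaluate at x_new.

    Assumption: ordinary least squares on one pixel's history, with xs
    as time indices and ys as the pixel intensities.
    """
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must have equal length of at least 2")

    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("xs must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept + slope * x_new
```

With five images indexed 1 to 5, evaluating at x_new = 6 gives the value for the next time step; for intensities 10, 12, 14, 16, 18 the sketch returns 20.0.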

Methodology
Satellite data are collected for consecutive historical time steps (years/months/days), depending upon availability. Such data have an intensity value at every coordinate or pixel (x, y, z), where x, y, and z denote the band, row, and column respectively. All the historical intensity values corresponding to a pixel are used to forecast its intensity for the future time step. Three methods of forecasting, i.e., moving average (MA), exponential smoothing (ES), and linear regression (LR), are applied one by one to the historical dataset to determine the forecasted satellite data. NDVI images are then generated for each forecasted dataset and for the original data. The generated NDVI images are classified into two classes, covering vegetation and non-vegetation respectively. Change detection is applied to the original NDVI image and the forecasted NDVI images to identify the best of the three methods for forecasting the spatial data.
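The NDVI generation and two-class classification steps can be sketched as follows. NDVI is computed with its standard definition, (NIR − Red) / (NIR + Red). The vegetation threshold of 0.2, the handling of a zero denominator, and the names ndvi and classify_vegetation are assumptions made for illustration; the threshold used in this study is not stated here.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index for one pixel.

    Assumption: pixels with nir + red == 0 are given an NDVI of 0.0.
    """
    total = nir + red
    if total == 0:
        return 0.0
    return (nir - red) / total


def classify_vegetation(nir_grid, red_grid, threshold=0.2):
    """Return a grid of booleans: True = vegetation, False = non-vegetation.

    The threshold value is a hypothetical choice for this sketch.
    """
    return [
        [ndvi(n, r) > threshold for n, r in zip(nir_row, red_row)]
        for nir_row, red_row in zip(nir_grid, red_grid)
    ]
```

The same classification would be applied to the actual image and to each forecasted image before comparing them.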

Implementation
The methodology has been implemented over a region comprising nine villages in the Chirawa region of Jhunjhunu district in the state of Rajasthan, India. The study area has been clipped from temporal Resourcesat 2 LISS 4 satellite images with a spatial resolution of 5 m, for the years 2011 to 2016, procured from NRSA (National Remote Sensing Agency), India. Images from the years 2011 to 2015 have been used for implementing the methodology, while the image for the year 2016 is used for validation. Python tools have been developed to forecast as per the three methods. The time step required in the tools is given as 1, 1, and 6 for MA, ES, and LR respectively to forecast the satellite image for the year 2016.

Results and Discussion
The actual and forecasted images for the year 2016, classified into vegetation and non-vegetation parts as per the generated NDVI [7], are shown in Figure 1a-d. Classification and change detection statistics are presented in Tables 1 and 2. Among the three forecasting methods applied to the satellite images over the selected region, exponential smoothing gives better results than the moving average and linear regression methods. It shows about a 78.5% match of the vegetation class with the original image and about a 36.5% change of pixels, which is far better than the other two methods.
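Statistics of this kind can be sketched from two classified grids as shown below. The definitions are assumptions, since the exact formulas behind Tables 1 and 2 are not given here: vegetation match is taken as the share of actual vegetation pixels that are also vegetation in the forecast, and change as the share of all pixels whose class differs. The function names are hypothetical.

```python
def _pixel_pairs(actual, forecast):
    """Flatten two equally shaped boolean grids into (actual, forecast) pairs."""
    return [
        (a, f)
        for actual_row, forecast_row in zip(actual, forecast)
        for a, f in zip(actual_row, forecast_row)
    ]


def vegetation_match_percent(actual, forecast):
    """Percentage of actual vegetation pixels also forecast as vegetation."""
    hits = [f for a, f in _pixel_pairs(actual, forecast) if a]
    if not hits:
        raise ValueError("actual image contains no vegetation pixels")
    return 100.0 * sum(hits) / len(hits)


def change_percent(actual, forecast):
    """Percentage of all pixels whose class differs between the two grids."""
    pairs = _pixel_pairs(actual, forecast)
    if not pairs:
        raise ValueError("grids are empty")
    changed = sum(1 for a, f in pairs if a != f)
    return 100.0 * changed / len(pairs)
```

Under these assumed definitions, a higher vegetation match and a lower change percentage indicate a forecast closer to the actual image.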

Conclusion
GIS tools developed for forecasting spatial data using various extrapolation methods can help in generating various possible future outcomes. Such tools can be used in scenario planning activities for data having geographical relevance. Among the three selected methods of forecasting, exponential smoothing has given the best results over the chosen dataset. These statistics would vary depending upon the quantity of historical data. The very high significance and variance of vegetation in the images, compared with the negligible variance of other features, required the generation of NDVI images and change detection for validation. The approach may work for predicting other natural features as well, but not man-made or artificial features. The process forecasts z-values only and does not predict the spread of features. It will be suitable for raster data such as interpolated rainfall maps, interpolated temperature maps, etc. for various historical years.
In the exponential smoothing relations, S²ₜ denotes the Double Exponential Smoothing estimate, alongside the Simple Exponential Smoothing estimate. Xₜ₋₁ is the term preceding the forecast term, and α is called the smoothing factor.

Figure 1. Vegetation-based classified images for the year 2016: (a) Actual image; (b) Moving average forecasted image; (c) Exponential smoothing forecasted image; (d) Linear regression forecasted image.

Table 1. Vegetation match in percentage of actual image with forecasted images of the year 2016.

Table 2. Change detection in percentage of actual image with forecasted images of the year 2016.