Spatiotemporal Mapping and Monitoring of Whiting in the Semi-Enclosed Gulf Using Moderate Resolution Imaging Spectroradiometer (MODIS) Time Series Images and a Generic Ensemble Tree-Based Model

: Whiting events in seas and lakes are a natural phenomenon caused by suspended calcium carbonate (CaCO 3 ) particles. The Arabian Gulf, which is a semi-enclosed sea, is prone to extensive whiting that covers tens of thousands of square kilometres. Despite the extent and frequency of whiting events in the Gulf, studies documenting the whiting phenomenon are lacking. Therefore, the primary objective of this study was to detect, map and document the spatial and temporal distributions of whiting events in the Gulf using daily


Background
Whiting is a short-lived phenomenon of milky parcels of water or bright in-water features, which has been reported globally in lacustrine, marine and freshwater environments and in semi-enclosed areas [1][2][3][4][5][6][7]. The ephemeral patches in whiting are turbid water with high levels of suspended fine-grained calcium carbonate mineral particles [8][9][10][11]. Whiting events last from days to weeks and can be visualised via satellite images as extended milky-white water varying from a few meters to square kilometres long [1,12].
Although whiting has been studied for decades to determine valid explanations for its occurrences and causes, the event remains controversial [10]. Various assumptions have been made to explain the causes of whiting. These assumptions include (1) the resuspension of fine-grained sediments caused by fish activities, microturbulent bursts and wind [8,[12][13][14][15]; (2) bio-induced precipitation from the removal of CO2 by photosynthesis [11,[16][17][18][19] and (3) abiotic precipitation initiated by fluctuations in water temperature and ion activities related to climate change [8,20]. Carbonates, which are produced by the physical and biological disintegration of animal and algal bioclasts, blooms of microscopic algae during photosynthesis and abiotic precipitation or calcification of suspended picoplankton and organic matter, may be the possible sources of suspended carbonate minerals, such as aragonite and high and low magnesium calcite [8].
The occurrence of whiting has prompted researchers to establish a scientific description of the phenomena and to determine its association with climate change and oil deposition [21][22][23]. Whiting events are ephemeral, thus sample collection using traditional field measurements can be challenging, time consuming and costly. Remote sensing with ocean colour satellite instruments provides a set of high temporal-resolution data with various scales and records of satellite images and derivatives. This method enables the spatiotemporal mapping of whiting events; however, limited studies have adopted remote sensing technology to study these occurrences in marine environments [10]. The majority of the studies used satellite data to map whiting in the Bahama Banks [12,24], in the Ten Thousand Islands in southwest Florida [10,25], in the Great Lakes in North America [26] and in the Feldberg Lake District, Klocksin Lake Chain and Rheinsberg Lake regions in Germany [27]. Whiting events based on general properties are varyingly recognised with remote sensing data from visual identification and manual delineation [10,24,[28][29][30][31]. Dierssen et al. [12] studied the spectral behaviour of whiting to identify the spatial extent of whiting patches in the Bahama Banks. Considering the shallowness of the southwest Florida coast and the spectral similarity between whiting occurrences and bright shallow bottom sediments, Long et al. [25] delineated whiting patches manually by relying on visual inspection and spatial contrast. Recently, Long et al. [10] used contrast enhancement and floating algae index (FAI) images to differentiate in-water whiting features from clouds to map the spatiotemporal variability of the southwest Florida whiting events from 2003 to 2015. Other studies focused on detecting and estimating the concentration of particulate inorganic carbon (PIC), or calcium carbonate particles, in the surface layer of the water column from the water-leaving radiance and reflectance differences concept through the computation of chlorophyll-a concentration [32][33][34].
The whiting phenomenon in the Gulf was initially reported in the 1960s. In 1962, Wells and Illing [35] observed whiting in numerous places in the Gulf such as in the eastern part of the Qatar Peninsula towards the coast of Abu Dhabi and off the coast of Saudi Arabia between Ras Tanura and Ras Safaniya. Although more than 50 years have passed since whiting events were initially reported in the Gulf, this phenomenon remains unclear [36,37]. To the best of the authors' knowledge, limited

Study Region
The Arabian Gulf is located in the Middle East and surrounded by the coasts of eight countries, namely, United Arab Emirates (UAE), Saudi Arabia, Oman, Kuwait, Bahrain, Iraq, Qatar and Iran ( Figure 1). The Gulf is a semi-enclosed marginal sea positioned in a subtropical hyperarid region (between the latitudes of 24 • to 30 • N and the latitudes of 48 • to 57 • E), with an average annual rainfall of less than 5 cm in the coastal areas [33,34]. The Gulf is nearly 990 km long and 56-370 km wide. It has an average depth of 36 m and a maximum depth of nearly 100 m and occupies a surface area of 239 km 2 . The deepest region of the Gulf (more than 40 m deep) is near the Iranian coast and continues into the Strait of Hormuz. Meanwhile, the shallowest regions (less than 20 m deep) are located along the coasts of the UAE, Qatar, Bahrain and around the head of the Gulf. Owing to its shallowness, sea surface temperatures fluctuate significantly and the Gulf is considered as the hottest sea in the world during summer [35,38]. The seawater temperature ranges from less than 20 • C in winter to over 34 • C in summer. As a result of high evaporation rates during the hot and long summers in the region and the lack of precipitation, water in the Gulf is characterised by high salinity greater than 39 psu [33,39,40]. The Gulf is subjected to strong winds and often associated with dust storms, with the most extreme occurring in summer and late spring and moderate dust storms occurring in winter.
Remote Sens. 2019, 11, x FOR PEER REVIEW 3 of 23 in the Gulf, this phenomenon remains unclear [36,37]. To the best of the authors' knowledge, limited effort has been exerted to map whiting in the Gulf using remote sensing techniques. The objectives of the present study were: (1) to adopt the correlation-based feature selection (CFS) to identify the most significant features for whiting extraction, (2) to develop a semi-automated framework to detect the whiting coverage in the Gulf from the Moderate Resolution Imaging Spectroradiometer (MODIS) images, using adaptive boosting (AdaBoost) and rule-based classification approach, (3) to compare and assess the performance of various applied tree-based machine learning methods, namely, the single decision tree (DT), random forest and the gradient boosted decision tree (GBDT)and (4) to document the frequency, duration, seasonality, spatial coverage and distribution of whiting occurrences in the Gulf between 2002 and 2018 using satellite observations.

Study Region
The Arabian Gulf is located in the Middle East and surrounded by the coasts of eight countries, namely, United Arab Emirates (UAE), Saudi Arabia, Oman, Kuwait, Bahrain, Iraq, Qatar and Iran ( Figure 1). The Gulf is a semi-enclosed marginal sea positioned in a subtropical hyperarid region (between the latitudes of 24° to 30°N and the latitudes of 48° to 57 °E), with an average annual rainfall of less than 5 cm in the coastal areas [33,34]. The Gulf is nearly 990 km long and 56-370 km wide. It has an average depth of 36 m and a maximum depth of nearly 100 m and occupies a surface area of 239 km 2 . The deepest region of the Gulf (more than 40 m deep) is near the Iranian coast and continues into the Strait of Hormuz. Meanwhile, the shallowest regions (less than 20 m deep) are located along the coasts of the UAE, Qatar, Bahrain and around the head of the Gulf. Owing to its shallowness, sea surface temperatures fluctuate significantly and the Gulf is considered as the hottest sea in the world during summer [35,38]. The seawater temperature ranges from less than 20 °C in winter to over 34 °C in summer. As a result of high evaporation rates during the hot and long summers in the region and the lack of precipitation, water in the Gulf is characterised by high salinity greater than 39 psu [33,39,40]. The Gulf is subjected to strong winds and often associated with dust storms, with the most extreme occurring in summer and late spring and moderate dust storms occurring in winter.

Overview
Daily satellite data from MODIS for the period of 2002 to 2018 were obtained and visually inspected for whiting. The dates of whiting events were then classified in accordance with the month of the occurrence and the number of consecutive days during which whiting persisted. The data were then used to analyse frequency, seasonality and duration of whiting events in the Gulf.
The identification, mapping and classification of whiting in the Gulf required the extensive analysis of the acquired MODIS data. The analysis framework, which is illustrated in Figure 2, is summarised as follows: (1) preprocessing of the MODIS satellite data, (2) multiresolution image segmentation and parameter optimisation, (3) selection of the significant attributes using CFS, (4) classification of the daily MODIS time series images using AdaBoost DT and rule-based classification and (5) identification of the whiting spatiotemporal pattern (time series frequency, duration and seasonality, spatial coverage and distribution) in the Gulf. The aforementioned steps are further discussed in the following subsections.

Overview
Daily satellite data from MODIS for the period of 2002 to 2018 were obtained and visually inspected for whiting. The dates of whiting events were then classified in accordance with the month of the occurrence and the number of consecutive days during which whiting persisted. The data were then used to analyse frequency, seasonality and duration of whiting events in the Gulf.
The identification, mapping and classification of whiting in the Gulf required the extensive analysis of the acquired MODIS data. The analysis framework, which is illustrated in Figure 2, is summarised as follows: (1) preprocessing of the MODIS satellite data, (2) multiresolution image segmentation and parameter optimisation, (3) selection of the significant attributes using CFS, (4) classification of the daily MODIS time series images using AdaBoost DT and rule-based classification and (5) identification of the whiting spatiotemporal pattern (time series frequency, duration and seasonality, spatial coverage and distribution) in the Gulf. The aforementioned steps are further discussed in the following subsections.

MODIS Datasets for Whiting Exploration
The exploration of whiting events in the Gulf as a short-lived, repetitive phenomenon by fieldbased studies is challenging. Therefore, high temporal resolution satellite images offered excellent sources of data for mapping and monitoring whiting in the Gulf. The daily high temporal-resolution Terra/Aqua MODIS surface reflectance products (MOD09GA and MYD09GA) with a coarse spatial resolution of 500 m were used in this study. The products, which covered the entire Gulf in one scene for the period of 2002-2018, were downloaded from NASA's Earthdata website (https://search.earthdata.nasa.gov/search). These products were atmospherically corrected by the MODIS Land Science Team for aerosols, thin cirrus clouds and gases. Figure 3 illustrates the initiation and disappearance of a whiting event from February to March 2003. The consecutive MODIS images show the dramatic changes during the 6 consecutive days of observation. High-resolution satellite images, such as from the Landsat, with a 30 m spatial resolution and 16-day temporal resolutions, or from the Sentinel-2, with a 5-day revisit time and 10 m resolution, may be insufficient to capture such short-term whiting events given their limited temporal resolution.
The MODIS products (MOD09GA and MYD09GA) contained seven spectral bands with a spatial resolution of 0.

MODIS Datasets for Whiting Exploration
The exploration of whiting events in the Gulf as a short-lived, repetitive phenomenon by field-based studies is challenging. Therefore, high temporal resolution satellite images offered excellent sources of data for mapping and monitoring whiting in the Gulf. The daily high temporal-resolution Terra/Aqua MODIS surface reflectance products (MOD09GA and MYD09GA) with a coarse spatial resolution of 500 m were used in this study. The products, which covered the entire Gulf in one scene for the period of 2002-2018, were downloaded from NASA's Earthdata website (https: //search.earthdata.nasa.gov/search). These products were atmospherically corrected by the MODIS Land Science Team for aerosols, thin cirrus clouds and gases. Figure 3 illustrates the initiation and disappearance of a whiting event from February to March 2003. The consecutive MODIS images show the dramatic changes during the 6 consecutive days of observation. High-resolution satellite images, such as from the Landsat, with a 30 m spatial resolution and 16-day temporal resolutions, or from the Sentinel-2, with a 5-day revisit time and 10 m resolution, may be insufficient to capture such short-term whiting events given their limited temporal resolution.
The MODIS products (MOD09GA and MYD09GA) contained seven spectral bands with a spatial resolution of 0.5 km. Their reflectance bands were band 1 (red: 0.620-0.  The whiting events appeared in the MODIS satellite images as turbid and milky features. The spectral response of various surface water features at different locations in the Gulf appeared in a sample MOD09GA product during a whiting event and shown in Figure 4a-h. Figure 4f-h show whiting events with high intensities of bands 3 (blue region of the spectra) and 4 (green region of the spectra). Figure 4e shows a clear water sample with a low intensities of all bands. Shallow areas at the northwest section of the gulf show high reflections in bands 1 (red region of the spectra) and 2 (NIR region of the spectra), as shown in Figure 4a. Overall, the images show variations in terms of relative intensities of bands 3 and 4 and bands 1 and 2. Meanwhile, the intensities of bands 5, 6 and 7 did not exhibit significant variations. The whiting events appeared in the MODIS satellite images as turbid and milky features. The spectral response of various surface water features at different locations in the Gulf appeared in a sample MOD09GA product during a whiting event and shown in Figure 4a-h. Figure 4f-h show whiting events with high intensities of bands 3 (blue region of the spectra) and 4 (green region of the spectra). Figure 4e shows a clear water sample with a low intensities of all bands. Shallow areas at the northwest section of the gulf show high reflections in bands 1 (red region of the spectra) and 2 (NIR region of the spectra), as shown in Figure 4a. Overall, the images show variations in terms of relative intensities of bands 3 and 4 and bands 1 and 2. Meanwhile, the intensities of bands 5, 6 and 7 did not exhibit significant variations.

Object-Based Analysis and Image Segmentation Optimisation
A traditional per-pixel classification approach is useful for feature extraction when the targets of interest are smaller than the spatial resolution of the remotely sensed data [41]. This approach only considers the spectral properties of each pixel and disregards any spatial or contextual information related to the classified pixel [42]. Geographic object-based image analysis (GEOBIA) has been extensively used in classifying very-high spatial resolution data as an alternative to a pixel-based approach. GEOBIA works by assessing spatially neighbouring groups of pixels rather than individual pixels [43]. GEOBIA is not only limited to high resolution images because the approach is not spatial-resolution dependent; therefore, it can be applied to different resolutions if the sizes of the intended objects are compatible with the spatial resolution of the images [44,45]. Thus, GEOBIA has been successfully adopted and implemented to classify MODIS time series data in different applications [44][45][46][47][48][49][50].
The generic GEOBIA framework can be divided into (1) image segmentation, which is the process of generating homogenous and nonoverlapping image objects/segments from image pixels and (2) image object/segments classification [51]. The multiresolution image segmentation algorithm (MRS) [52], which is one of the most used algorithms, is applied to the time series MODIS data. The MRS is a bottom-up region-growing algorithm that commences with pixels as individual segments.
MRS is goverened by three main parameters, namely, (a) scale, (b) shape/color weight and (c) compactness/smoothness weight. This algorithm merges neighbouring pixels in each successive step on the basis of homogeneity (shape and compactness), which describes the similarity of contiguous objects. The degree of fitting, which is a value determined by the scale parameter defined by an analyst, is measured in each merging procedure. Moreover, the merge is performed if the degree of fitting is less than the minimum degree of fitting [53]. The scale parameter is one of the most critical parameters in the segmentation process. It profoundly influences resultant image objects and subsequent classification steps because it controls the size of image-generated objects [54,55]. Selecting high-scale values generate large image objects (undersegmentation), whereas selecting small-scale values yield small image objects (oversegmentation). Thus, the utilisation of an optimisation technique to find the optimum scale parameters for delineating whiting is vital to avoid the subjectivity of using a trial and error visual approach.

Object-Based Analysis and Image Segmentation Optimisation
A traditional per-pixel classification approach is useful for feature extraction when the targets of interest are smaller than the spatial resolution of the remotely sensed data [41]. This approach only considers the spectral properties of each pixel and disregards any spatial or contextual information related to the classified pixel [42]. Geographic object-based image analysis (GEOBIA) has been extensively used in classifying very-high spatial resolution data as an alternative to a pixel-based approach. GEOBIA works by assessing spatially neighbouring groups of pixels rather than individual pixels [43]. GEOBIA is not only limited to high resolution images because the approach is not spatial-resolution dependent; therefore, it can be applied to different resolutions if the sizes of the intended objects are compatible with the spatial resolution of the images [44,45]. Thus, GEOBIA has been successfully adopted and implemented to classify MODIS time series data in different applications [44][45][46][47][48][49][50].
The generic GEOBIA framework can be divided into (1) image segmentation, which is the process of generating homogenous and nonoverlapping image objects/segments from image pixels and (2) image object/segments classification [51]. The multiresolution image segmentation algorithm (MRS) [52], which is one of the most used algorithms, is applied to the time series MODIS data. The MRS is a bottom-up region-growing algorithm that commences with pixels as individual segments. Several unsupervised segmentation quality measures have been utilised in the literature to identify the best scale parameters [56][57][58][59]. In the current study, the performance of two unsupervised segmentation quality measures, namely, the objective function (OF) [60] and the F-measure [57] were compared to find an optimum scale value that accurately delineates whiting. Both measures adopt oversegmentation and undersegmentation metrics by using the values of weighted variance and spatial autocorrelations (Moran's I). They are expressed in Equations (1)- (5).
Remote Sens. 2019, 11, 1193 where MI norm and WV norm are the normalised Moran's I and weighted variance, respectively. In Equation (3), WV denotes the weighted variance and a i and v i are the area and variance of image object/segment (generated by MRS) i, respectively. In Equation (4), n symbolises the total number of objects, z i and z j are the means of the spectral value of image objects i (Oi) and j (Oj), respectively, z is the mean spectral value of the total objects in a specific band and w i,j is a spatial proximity measure between image objects i (Oi) and j (Oj) in which nearby image objects are defined as 1 and other objects are considered as 0. The normalised function for WV and MI can be expressed using the following equation where X, X max and X min are the original, maximum and minimum values of the weighted variance or Moran's I for a spectral band, respectively. The highest values of the OF or F-measure indicate high segmentation quality and the optimum scale is identified as the scale that achieves image objects with the highest OF or F-measure values. After the selection of an appropriate scale to delineate whiting, various spectral features and indices were computed and investigated to find the most relevant features to be used in the classification phase.

Feature Selection
One of the strongest characteristics of GEOBIA is it enables the generation of hundreds of features or attributes for analysis (e.g., spectral bands and indices, textural, geometric and contextual attributes). The extraction and utilisation of a considerable number of features (e.g., variables or attributes) in the analysis are computationally intensive; hence, they can negatively affect classification accuracy [61]. Therefore, FS, which involves the selection of an essential feature subset from an enormous amount of generated features, is a decisive step used in image analysis procedures. In addition, FS can achieve an equivalent or higher classification accuracy than the original feature space and improve the efficiency of GEOBIA [62][63][64][65][66]. FS methods may be generally grouped into three classes, namely, filter, wrapper and embedded methods [64,65]. The filter method is considered the simplest and fastest method among the three classes. It utilises certain statistical measures (e.g., correlation coefficients, variance, chi-square test measures and ANOVA F-values) to rank and select relevant features without using any learning algorithms [64,[67][68][69]. By contrast, the wrapper method adopts a classification algorithm as part of the evaluation process to classify the training data and assess the results. The wrapper method selects the most significant feature subset that produces the highest classification accuracy [70,71]. Finally, the embedded method exhibits a trade-off between the filter and the wrapper methods. This method is considered feature ranking because features are selected during the construction of a classification model without further evaluating the selected feature subset [65]. Comparisons of various FS techniques are available in the literature [64,[72][73][74].
A wrapper method that combines CFS and the naïve Bayes classifier was used in this study to find the most relevant features for extracting whiting features. CFS has recently been successfully applied to FS and has outperformed various FS techniques with GEOBIA [63,75].

CFS
CFS is a popular approach that uses a search algorithm with a heuristic evaluation function to assess the merit of feature subsets [76,77]. It measures the worth of each feature to predict the class label along with the intercorrelation level among features [78]. A heuristic evaluation function is designed on the basis of the hypothesis that superior feature subsets encompass correlated features with classes though they remain uncorrelated with one another [76,79]. Merits (heuristics) can be formalised using the following formula where k denotes the number of features, f indicates the feature, r cf symbolises the mean feature correlation with a class and r ff is the average intercorrelation among subset features.

Feature Acquisition and Computation
In general, various spectral indices and bands may be used to map water surface features and water bodies. In this study, numerous attributes, such as mean spectral reflectance, standard deviation and spectral indices, which have been previously reported in the literature, were computed for the image objects of several images for FS. Table 1 lists the 32 attributes that were examined using CFS.

B3, B4 -
The present study used CFS and the naïve Bayes classifier to determine the most relevant feature subset for extracting whiting events from the multitemporal MODIS data. Then, training and testing samples were prepared from the image objects of various whiting events. Table 2 shows the dates and number of samples that were selected for FS. These samples were normalised to a scale from 0 to 1 and then split 70% for training and 30% for testing. The optimised image objects with the selected significant features via CFS were ultimately used to classify whiting using various tree-based classification algorithms.

Boosting Decision Tree Classification
Boosting is an ensemble machine learning algorithm that is used to improve the accuracy of a classifier by decreasing the classification algorithm's sensitivity to noise and labelling errors in the training datasets [92,93]. The performance of numerous classifiers (weak learners) that are learned by resampled versions of the training samples is combined to improve classification accuracy. Boosting DTs are ensemble methods that use multiple iterations of DT classifiers. A boosted DT called the AdaBoost algorithm was adopted in this study.
AdaBoost [92], also known as adaptive boosting, is a generic iterative supervised learning algorithm that combines multiple classifiers (weak classifiers) to obtain high accuracy. The AdaBoost algorithm chooses training samples on the basis of adaptive resampling by selecting misclassified datasets produced by a previous classifier. The erroneously classified samples in a prior iteration are selected more often than correctly classified samples. Furthermore, new DT models are forced to focus on the misclassified samples and minimise the errors of the former trees [94,95]. Misclassified training samples are given increased weights in each iteration; thus, the classifier can improve its performance in new datasets. Ultimately, all trees (not only a final tree or trees) are incorporated because the additive model is designed such that combinations of all trees can give the optimum solution [96].
Given surface water features with diverse spectral values, training image objects were selected to represent five classes with different spectral responses, namely, whiting, low whiting, clear water, interlayer (a class with subtle differences between low whiting and clear water), apparent sediment (shallow areas next to the shore) and apparent green (potential true green-coloured algae). The prepared samples with the most significant features selected by CFS were split 70% for training and 30% for assessing the performance of the AdaBoost classification model. Using the verified prediction model developed by the AdaBoost algorithm, the multitemporal image objects were classified into the aforementioned five classes by rule-based classification. The performance of the AdaBoost classification technique was compared with that of various tree-based classification algorithms, such as random forest, the single DT and the gradient boosted decision tree. Classification results were assessed in this study by computing the overall accuracy (OA), the Kappa coefficient (KC) from the confusion matrix. While the OA of the classification indicates the percentage of correctly classified image objects, the KC statistically measures and analyses the degree of agreement between the classified and reference image objects [97].

Results & Discussion
The results of the spatiotemporal mapping of whiting in the Gulf using the MODIS time series images and a generic ensemble tree-based model are presented in this section. Furthermore, the frequency, seasonality, duration and geographic distribution and extent of whiting events in the Gulf are also documented and analysed.

Whiting Temporal Pattern in the Gulf
The primary objective of this study was to map the spatial and temporal extents of whiting in the Gulf for 16 years by using satellite images. Therefore, a total of 5800 image scenes of the Gulf were inspected prior to the analysis to detect the existence of whiting. Cloud-free image sets obtained for the period of 2002 to 2018 were used to generate generic statistics on the seasonality, frequency and duration of whiting events in the Gulf, as shown in Table 3. The results showed that whiting events reoccurred in the region exclusively during the winter season (November to March). Figure 5 enumerates the number of events per month/year and the number of days per month/year where whiting events occurred during the study period (2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010)(2011)(2012)(2013)(2014)(2015)(2016)(2017)(2018). The frequency of whiting events during the study period ranged from one to two events per winter month to two to eight events per year. The highest frequency of whiting events was observed in February. The total number of whiting days ranged from 2 to 11 days per month to 8 to 34 days per year. Furthermore, the duration of individual events ranged from 2 to 8 days. The total number whiting days during the past 16 years was approximately 289 days or approximately 7% of the total number of days or 16% of the total days (November to March).
February. The total number of whiting days ranged from 2 to 11 days per month to 8 to 34 days per year. Furthermore, the duration of individual events ranged from 2 to 8 days. The total number whiting days during the past 16 years was approximately 289 days or approximately 7% of the total number of days or 16% of the total days (November to March).

Results of Image Segmentation
Unsupervised image segmentation quality measures based on the OF and the F-measure were utilised to find the optimal scale for whiting event delineation by varying the scale parameter. The mean values of NIR image objects are frequently used in image segmentation assessments to compute undersegmentation and oversegmentation metrics. However, the NIR reflectance band was unsuitable for assessing the segmentation of images with whiting occurrences because the whiting phenomenon does not exist on this band. Therefore, the mean values and standard deviation of the mean blue spectral image objects were used to compute the OF and the F-score values (Equations (1)-(5)). Table 4

Results of Image Segmentation
Unsupervised image segmentation quality measures based on the OF and the F-measure were utilised to find the optimal scale for whiting event delineation by varying the scale parameter.
The mean values of NIR image objects are frequently used in image segmentation assessments to compute undersegmentation and oversegmentation metrics. However, the NIR reflectance band was unsuitable for assessing the segmentation of images with whiting occurrences because the whiting phenomenon does not exist on this band. Therefore, the mean values and standard deviation of the mean blue spectral image objects were used to compute the OF and the F-score values (Equations (1)-(5)). Table 4

Results of FS and Analysis
FS reduces the dimensionality of data, diminishes the complexity of classification models, minimises overfitting and accelerates the process. CFS was selected among various FS methods because of its successful implementation in several remote sensing applications. As listed in Table 1, various spectral bands and indices were examined to determine the most significant features for detecting whiting event occurrences from a large number of MODIS images. Given the considerable variation between the examined features in terms of range and numerical value, all the data were normalised to a scale ranging from 0 to 1. The CFS algorithm with the best-first search strategy [98] was implemented along with the naïve Bayes classifier to evaluate the worth of the selected features. Accuracy was assessed via tenfold stratified cross-validation on the selected training data. According to the results, the best-first eight significant features were the green (G), NDGB, FAI, CI, CI2, CI869, BRI and SD-R. Therefore, combinations of each pair of the best-first selected features were selected for further examination. Figure 6 shows the possible combinations of the selected features and their corresponding overall accuracy and kappa coefficient. These tests reveal that the utilisation of the NDGB and the mean green attribute values or the NDGB and CI of the image objects were similarly excellent features for accurately mapping whiting using the MODIS data. This finding was typically consistent with the spectral analysis of various surface features shown in Figure 4. In this study, the NDGB and the green band were selected for further analysis.

Classification Results
Given the lack of historical records of whiting events in the Gulf, the AdaBoost model was trained and tested with samples selected on the basis of satellite image inspection and the recommendation of Wells and Illing (1962). Figure 7 depicts the generated AdaBoost DT model from the most significant features, namely, the mean green and the NDGB, to extract whiting event features from satellite images with moderate spatial resolution. Optimised time series image objects were eventually classified into six classes (namely, clear water, interlayer, low whiting, whiting, apparent green and apparent sediments) by the developed AdaBoost model and GEOBIA rule-based classification. Whiting water with a high intensity can be simply identified with the generic developed model by rule-based classification when the slope (NDGB) is less than 0.1 and the intensity of the green band is greater than 1500 in the MODIS scene. Whiting with a low intensity can be recognised when the slope (NDGB) is less than 0.1 and the mean of the green band range is between 750-1500.

Classification Results
Given the lack of historical records of whiting events in the Gulf, the AdaBoost model was trained and tested with samples selected on the basis of satellite image inspection and the recommendation of Wells and Illing (1962). Figure 7 depicts the generated AdaBoost DT model from the most significant features, namely, the mean green and the NDGB, to extract whiting event features from satellite images with moderate spatial resolution. Optimised time series image objects were eventually classified into six classes (namely, clear water, interlayer, low whiting, whiting, apparent green and apparent sediments) by the developed AdaBoost model and GEOBIA rule-based classification. Whiting water with a high intensity can be simply identified with the generic developed model by rule-based classification when the slope (NDGB) is less than 0.1 and the intensity of the green band is greater than 1500 in the MODIS scene. Whiting with a low intensity can be recognised when the slope (NDGB) is less than 0.1 and the mean of the green band range is between 750-1500.  The performance of the developed AdaBoost classification model was compared with that of the three tree-based classification algorithms. The AdaBoost classification model gave superior classification outcomes compared to the other models, as shown in Table 5.    The performance of the developed AdaBoost classification model was compared with that of the three tree-based classification algorithms. The AdaBoost classification model gave superior classification outcomes compared to the other models, as shown in Table 5.

Spatial Distribution of Whiting in the Gulf
Generalisation of the occurrence pattern of whiting in the Gulf can be challenging because of the existence of clouds in the MODIS satellite images, especially during winter. Cloud-free images representing the highest concentration of whiting events from each year were selected to compute the peak/maximum area covered by whiting and to determine the spatial distribution of whiting in the region. A total of 17 MODIS images (one image per year) were classified with the developed model. The extent of the areas covered by whiting were then computed, as shown in Table 6. The maximum coverage of whiting occurred in March 2012, followed by March 2003, whereas the minimum coverage of assessed whiting events was recorded in December 2010. The classification results were used to identify the common areas subjected to whiting. To statistically identify significant hot spots (High spatial frequency of whiting events overs 17 years), statistical analysis of spatial clustering was carried out using Optimized Hot Spot Analysis (Getis-Ord Gi*) [99,100]. This tool automatically aggregates whiting events and identifies statistically significant spatial clusters where the focus is on presence or absence of each whiting event rather than a measured attribute associated with whiting events. Figure 9 shows significant whiting spatiotemporal clusters, in the semi-enclosed gulf, with various levels of confidence, areas that are statistically significant at the 99 percent confidence level showed that whiting events were the frequent in the southwest sections of the Gulf and along the coasts of UAE, Qatar, Bahrain and opposite the coast of Al Jubail in Saudi Arabia.

Conclusions
Previous studies mapping whiting events from satellite images, specifically in the Bahama Banks and in the coast of Southwest Florida, relied on the manual delineation of whiting on the basis of spatial contrast, spectral behaviour of whiting and derived spectral indices. Considering the limited studies on whiting event mapping in the Gulf, the present study aimed to document the spatial extent and the seasonal variability of whiting events in the Gulf between 2002 and 2018 using MODIS data. This study acquired and analysed extensive daily data for mapping and documenting, documented spatiotemporal distribution and presented an effective model (integrated CFS, Adaboost and rulebased classification) for detecting and classifying whiting in the Gulf using the MODIS data. The results of FS showed that the combination of the mean of the green band and the NDGB or the combination of the NDGB and CI were the most significant feature for detecting the brightness of inwater features compared with all the examined features in the classification. This study used various tree-based machine learning classifiers, namely, rule-based classification based on a single DT, GBDT, RF and AdaBoost, to classify the optimised multitemporal image objects. The results showed that the rule-based classification based on AdaBoost DT outperformed the supervised tree-based GEOBIA classifiers. Therefore, this study adopted the AdaBoost classification model to find a generic model for distinguishing objects of whiting water directly and classifying time series image objects by rulebased classification.
The adopted model showed an outstanding and expeditious approach to extracting and characterising whiting events quantitatively from time series images. Whiting events in the Gulf occurred during the winter season (November to March) and were extensively located in the southwestern section of the Gulf, mainly along the UAE coast. During the study period (2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010)(2011)(2012)(2013)(2014)(2015)(2016)(2017)(2018), the whiting events occurred exclusively for 5 to 34 days per year and covered areas ranging from 12,000 km 2 to 60,000 km 2 . These events require further investigations for in-situ measurements and laboratory analysis, on the basis of the common spatial distribution of whiting. Therefore, whiting in

Conclusions
Previous studies mapping whiting events from satellite images, specifically in the Bahama Banks and in the coast of Southwest Florida, relied on the manual delineation of whiting on the basis of spatial contrast, spectral behaviour of whiting and derived spectral indices. Considering the limited studies on whiting event mapping in the Gulf, the present study aimed to document the spatial extent and the seasonal variability of whiting events in the Gulf between 2002 and 2018 using MODIS data. This study acquired and analysed extensive daily data for mapping and documenting, documented spatiotemporal distribution and presented an effective model (integrated CFS, Adaboost and rule-based classification) for detecting and classifying whiting in the Gulf using the MODIS data. The results of FS showed that the combination of the mean of the green band and the NDGB or the combination of the NDGB and CI were the most significant feature for detecting the brightness of in-water features compared with all the examined features in the classification. This study used various tree-based machine learning classifiers, namely, rule-based classification based on a single DT, GBDT, RF and AdaBoost, to classify the optimised multitemporal image objects. The results showed that the rule-based classification based on AdaBoost DT outperformed the supervised tree-based GEOBIA classifiers. Therefore, this study adopted the AdaBoost classification model to find a generic model for distinguishing objects of whiting water directly and classifying time series image objects by rule-based classification.
The adopted model showed an outstanding and expeditious approach to extracting and characterising whiting events quantitatively from time series images. Whiting events in the Gulf occurred during the winter season (November to March) and were extensively located in the southwestern section of the Gulf, mainly along the UAE coast. During the study period (2002)(2003)(2004)(2005)(2006)(2007)(2008)(2009)(2010)(2011)(2012)(2013)(2014)(2015)(2016)(2017)(2018), the whiting events occurred exclusively for 5 to 34 days per year and covered areas ranging from 12,000 km 2 to 60,000 km 2 . These events require further investigations for in-situ measurements and laboratory analysis, on the basis of the common spatial distribution of whiting. Therefore, whiting in the Arabian Gulf merits further attention from the scientific community to examine biophysical, biogeochemical and environmental factors that may reveal the causes of the whiting occurrences. Funding: This research received no external funding.

Conflicts of Interest:
The authors declare no conflicts of interest.