Artificial Neural Networks for Mapping Coastal Lagoon of Chilika Lake, India, Using Earth Observation Data

: This study presents the environmental mapping of the Chilika Lake coastal lagoon, India, using satellite images Landsat 8-9 OLI/TIRS processed using machine learning (ML) methods. The largest brackish water coastal lagoon in Asia, Chilika Lake, is a wetland of international importance included in the Ramsar site due to its rich biodiversity, productivity, and precious habitat for migrating birds and rare species. The vulnerable ecosystems of the Chilika Lagoon are subject to climate effects (monsoon effects) and anthropogenic activities (overexploitation through fishing and pollution by microplastics). Such environmental pressure results in the eutrophication of the lake, coastal erosion, fluctuations in size, and changes in land cover types in the surrounding landscapes. The habitat monitoring of the coastal lagoons is complex and difficult to implement with conventional Geographic Information System (GIS) methods. In particular, landscape variability, patch fragmentation, and landscape dynamics play a crucial role in environmental dynamics along the eastern coasts of the Bay of Bengal, which is strongly affected by the Indian monsoon system, which controls the precipitation pattern and ecosystem structure. To improve methods of environmental monitoring of coastal areas, this study employs the methods of ML and Artificial Neural Networks (ANNs), which present a powerful tool for computer vision, image classification, and analysis of Earth Observation (EO) data. Multispectral satellite data were processed by several ML image classification methods, including Random Forest (RF), Support Vector Machine (SVM), and the ANN-based MultiLayer Perceptron (MLP) Classifier. The results are compared and discussed. The ANN-based approach outperformed the other methods in terms of accuracy and precision of mapping. Ten land cover classes around the Chilika coastal lagoon were identified via spatio-temporal variations in land cover types from 2019 until 2024. This study provides ML-based maps implemented using Geographic Resources Analysis Support System (GRASS) GIS image analysis software and aims to support ML-based mapping approach of environmental processes over the Chilika Lake coastal lagoon, India.


Introduction 1.Background
Coastal lagoons play a key role in the hydrological and ecological processes in the zones between land and sea.They distribute and diversify riverine sediments [1], reduce turbulence of tidal flow [2], regulate seasonal current water circulation [3], and enrich shelf waters with nutrients [4].Worldwide, coastal lagoons are among the most productive and biodiverse systems, providing essential habitat for a wide variety of aquatic species and threatened marine fauna [5][6][7].Coastal and brackish water lagoons that form continuities in the terrestrial and shelf ecosystems present transitional aquatic ecosystems located in the zones between land and sea.They are often associated with dynamic environmental conditions and high biodiversity due to their connections to the marine and terrestrial communities [8].
Formed as a result of the marine transgression, coastal lagoons vary with depth and geomorphic features, tidal circulation patterns, salinity, and wind forcing [9].The formation of the barriers between the land and ocean is driven by the controversial forces of erosion and sediment deposition within the coastal lagoons [10].The accumulation of river sediments depends on the speed of the river that enters the coastal lagoon and transports sediments in large quantities.Furthermore, complex hydrological processes such as winds, oceanic current,s and coastal waves, as well as groundwater discharge [11], intensify the mixing of sediments and nutrients within the coastal lagoons [12].Moreover, different slope and substrate types and density-driven currents with diverse morphodynamic tidal regimes affect littoral zones and create stratified conditions of sediment resuspension [13,14].
Topographically isolated and hydrologically distinct from the surrounding landscapes, coastal lagoons include unique species of wild flora and fauna (e.g., bird species).The transitional location of the coastal lagoons determines their high levels of biodiversity through the intense physical-chemical gradients.A particular feature of the coastal lagoons that ensures their high productivity as aquatic systems is their shallow bathymetry [15].The water mass of the lagoon is well mixed because the sunlight reaches all levels of the shallow systems of the coastal lagoons until the lowest bottom layer of water due to the active waves and currents.Such dynamics activate the recycling of nutrients and increase biological productivity [16].They have been declared as marine protected areas worldwide due to their rich bioproductivity and valuable environment.
However, recent monitoring has indicated their decline, habitat loss, and environmental vulnerability [17][18][19].These declines have been attributed to a variety of indirect and direct causes, including climate change [20] and anthropogenic activities with their associated pollution [21].High levels of stress result in environmental threats to these treasured ecosystems [22].Among recent environmental problems of coastal lagoons are affected biodiversity patterns and loss of habitats and rare species [23,24], as well as disrupted land cover patterns due to climate effects.Examples of pollution include organic, chemical, and biological types such as microplastic [25,26] or heavy metals [27].
The transitional nature of coastal lagoons makes them vulnerable to the cumulative effects from climate fluctuations, specifically rising sea levels [28], flooding [29], hydrological disturbances, nutrient availability [30], and human impacts [31].The richness in natural resources of coastal lagoons attracts the local population and urges them to actively use the resources of the aquatic environment.Hence, coastal lagoons serve as a valuable source of food and natural resources and support economic development and sustainability for the local population.In turn, this leads to the overexploitation of these unique areas and increases anthropogenic pressure [32].

Objective and Goal
The main goal of this research is to map and analyze changes in the land cover types surrounding the coastal lagoon on the lake using machine learning (ML) algorithms using Geographic Resources Analysis Support System (GRASS) GIS and Earth Observation (EO) data.We used Landsat 8-9 OLI/TIRS satellite images from recent six years (2019, 2020, 2021, 2022, 2023 and 2024) and processed them using ANN and ML methods to analyze the spatio-temporal distribution of land cover types in Chilika coastal lagoon, Bay of Bengal, India (Figure 1).To set up the advanced practical background for the environmental analysis of Chilika Lake, this study presents an ML approach for the automation of EO data processing, classification, and visualization.To achieve this goal, the objective is to use the ANN methods from Python's library Scikit-Learn version 1.4.2[33], which are embedded in the GRASS GIS version 8.3 [34] through ML modules designed for data partition and satellite image processing.
The ML approach was selected since it creates a plausible paradigm to map the environmental variability of the coastal landscapes surrounding the lagoon of Chilika Lake.Among the existing methods, this study employs the MultiLayer Perceptron (MLP) algorithm of ANN, which presents an effective solution to image analysis.

Research Gap
There is little research analyzing and contrasting landscapes of the Odisha coasts using the ML approach.Existing studies draw generalizations regarding the links between the ecosystems of the coastal lagoon of Chilika Lake and the adjacent habitat communities [35][36][37][38].However, they utilize the conventional tools of cartographic software, which applies traditional mapping methods.To the best of our knowledge, no reported research has been carried out using ANN techniques to study the environmental variability of Chilika Lake with a spatial extent with the coordinates of 19°28 ′ -19°54 ′ N; 85°06 ′ -85°35 ′ E; see Figure 2.
At the same time, ANN methods and scripting libraries are promising tools for cartographic tasks and image processing for mapping areas of coastal lagoons, which are notable for the high complexity of land cover patterns and the heterogeneity of landscapes [39][40][41][42][43][44].In this regard, GRASS GIS presents a powerful cartographic toolset that includes diverse modules that can be used for satellite image processing [45].Hence, besides the traditional general-purpose programming languages, GRASS GIS is also used for creation.Hence, ML applications in cartography provide insight into its spatio-temporal variability of landscapes and environmental processes through the classification of the EO data [46][47][48][49].

Theoretical Framework and Motivation
Monitoring coastal landscapes and variations in land cover types around coastal lagoons is essential for land management and conservation activities of the Chilika Lake.Such activities are carried out and reported in previous studies based on conventional mapping.Nevertheless, monitoring land cover types in a lake using traditional methods is often time-consuming and labor-intensive and includes considerable manual work, which is prone to errors.Though mapping using Geographic Information System (GIS) presents a reliable solution to Earth Observation (EO) data processing, estimation accuracy is still a notable challenge in mapping coastal areas with high heterogeneity of land cover types.Due to the logical straightforwardness of the classification algorithms, their application for thematic GIS-based mapping and analysis of landscape dynamics presents a wellknown approach to cartographic workflow with existing case studies on Chilika Lake, the Mahanadi Delta, and the Odisha coastal area [50][51][52][53].
However, the spectral complexity of the multispectral satellite images makes recognizing land cover types in coastal areas a challenging and less accurate task using k-means clustering or "MaxLike" classification.For instance, specifically for lagoons, the optical properties of coastal waters and shelf areas are significantly affected by the suspended sediment from the colored dissolved organic matter (CDOM), which can create noise on the EO data.On the other hand, the classification of the EO data using machine learn-ing (ML) methods has been fairly successful.For example, to solve these issues, ML algorithms present automation of image classification [54,55], which is achieved through computer vision algorithms of pattern recognition and analysis that enables the recognition of geometrical complexity [48].
The application of ML methods based on the Artificial Neural Networks (ANNs) applied to Remote Sensing (RS) data processing considerably increases the effectiveness of mapping [56,57].Advanced ML methods enable the landscape dynamics of spatial and temporal trends to be automatically revealed through computer-based algorithms of pattern recognition and data analysis, as reported in existing studies [58][59][60][61][62]. Several ML and AI algorithms exist to analyze and quantify spatial data using analytical and empirical approaches.Their main approach includes neural networks that teach computers to process data in an analytical way that simulates the human brain in pattern recognition [63,64].Among the advanced ML algorithms are the Random Forest (RF) [65], Support Vector Machines (SVMs) [66], and Naive Bayes [67], to mention a few.In this study, we use such algorithms for processing RS data.The goal of this approach is to perform satellite image classification with a case study of Chilika Lake coastal lagoon, East India.

Study Area
The study area covers a spatial extent with the coordinates of 19°28 ′ -19°54 ′ N; 85°06 ′ -85°35 ′ E, [68] located in the state of Odisha, East India, Figure 3.The largest brackish water lagoon in Asia [69,70], Chilika Lake covers a total area of over 1100 km 2 with existing fluctuations of the lake surface reported between 1165 km 2 and 906 km 2 [71].The lake is located at the junction of two different water masses-riverine freshwater and oceanic salt waters from the tidal influx of the Bay of Bengal.Different water fronts interact with the bathymetry of the lagoon and generate local hydrographic settings, which causes local variations in salinity, current turbulence, and circular flow patterns of waves [72].The coastal lagoon formed by the Chilika Lake lies at the estuary of the Daya River, which enters the Bay of Bengal on the east coast of India; see Figure 3.
A recent geological study has shown that the shallow limnological system of Chilika Lake was a part of the Bay of Bengal during the later stages of the Pleistocene period [73], and underwent marked geomorphic evolution during the late Holocene [74,75].This in-cluded significant denudation and weathering of the surrounding coasts.Such processes are mostly caused by climate variability and accelerated by the effects of monsoon cycles, which influence the distribution of mangroves [76] and other vegetation types around the lake and estuarine environment [77].Currently, the dynamics of the surface area in the coastal lagoon of Chilika Lake present a response to the cumulative effects of tidal morphodynamics, winds, and morphometry.The integrating forces of these processes resulted in fluctuating surface and area, which affect the ecosystem of the surrounding wetlands.
The coastal lagoon of the Chilika Lake has important conservation features.These include, for instance, rare aquatic and sub-aquatic plants, endemic species, mangrove associations, and plants of horticultural importance.The significant biodiversity and ecosystem value of the Chilika Lake can be illustrated by the impressive number of species, which exceeds 300 fish species [78] and 726 species of flowering plants [79].Moreover, Chilika Lake is the largest habitat for migratory waterbirds across India and a home to multiple threatened and rare species, including both plants and animals [80].
Such rich natural resources and the unique environment of Chilika Lake have attracted humans to this area since the ancient period.Archaeological records prove that human settlements existed in the Chilika Lake area since at least the Neolithic period when this area served as a marine harbor and port [81].The attractiveness of Chilika Lake is explained by its favorable climate, beneficial topographic setting, and strategic location, which have given access to the Indian Ocean and ensure safe maritime trade and international commercial connections in India since the ancient period.The importance of the Chilika Lake both ecologically and historically in the development of Indian civilization resulted in its official designation a as UNESCO World Heritage site [82] and a Ramsar site [83].

Data
Spatial analysis was limited to RS data using multispectral satellite images Landsat.A time series of satellite images collected at regular time intervals and covering the study area is a key instrument for environmental analysis [84].To this end, six satellite images were collected during the spring period (February-March) and covering the time interval of 2019 to 2024.All but two datasets (early March 2020 and early March 2023) were acquired within the period of February (that is, images on 2019, 2021, 2022, and 2024), when the aquatic and coastal vegetation around the lagoon is typically well developed prior to the pre-monsoon decline during the period from April to June and monsoon rains, which last in India from June to September.Hence, the images were taken on the following dates: 13 February 2019, 3 March 2020, 2 February 2021, 13 February 2022, 4 March 2023 and 11 February 2024.
Water turbidity is relatively low during the late winter to early spring period due to seasonally reduced rainfall in the "no monsoon" period.This allowed for the identification of land cover types through the satellite image analysis.Finally, the spring period enables the detection of algae blooms in the coastal lagoon, which usually occur in India from February to May (subject to climate fluctuations).Algae bloom in the coastal lagoon of Chilika Lake is caused by several factors such as the effects of monsoon cycles, riverine discharge, and seasonal upwelling.Hence, the images were selected for spring period with low cloudiness (below 10%) to increase the quality of image analysis.
The satellite data were obtained from the Landsat 8-9 OLI/TIRS mission and downloaded from the NASA EarthExplorer website (https://earthexplorer.usgs.gov/,accessed 3 March 2024).The original images are shown in Figure 4. Image frames acquired from EarthExplorer were imported to the GRASS GIS individually using extent and resolution corresponding to the multispectral bands, then pre-processed for top-of-atmosphere reflectance using the "i.landsat.toar"module.The major technical characteristics of the satellite images common for all the scenes are as follows.The images were obtained during the daytime in the nadir from sensor OLI/TIRS of Landsat Collection Category T1, Nr.Other geospatial data include the cartographic datasets: the raster topographic grid of the General Bathymetric Chart of the Oceans (GEBCO) and vector layers of the administrative division of India.These data were used to visualize the location of the region within the country at the state level, as well as the terrain of the study area.

Methodological Workflow
These data were processed using the GRASS GIS software version 8.3.1,Generic Mapping Tools (GMT) cartographic scripting toolset version 6.4.0 [85] and QGIS software version 3.34.The methodology used for mapping is derived from previous works [86][87][88].The workflow of this study included several processes and approaches to image analysis and multi-source data, as summarized in the methodological scheme in Figure 5.
As such, combining RS data and machine learning (ML) techniques presents the integration of the two technologies that complement each other in the programming approach of GRASS GIS and help overcome the limitation of using just one.Moreover, the current focus on employing the ML methods for monitoring the coastal lagoon of the Chilika Lake underscores its important contribution to the modeling of the geospatial data.This approach to data processing supports the computer-based modeling of landscape dynamics in the coastal regions of the Bay of Bengal and helps analyze how these regions are affected by the environmental and climate variability in the monsoon climate of the Indian Ocean.

Image Processing
The images were processed using the GRASS GIS (v.8.3) image processing software using methods explained in the following subsections.First, the images were imported and preprocessed.Then, the images were classified using the unsupervised clustering of the maximum-likelihood discriminant analysis classifier (MaxLik).During the clustering steps, the signature file was generated and reported using the k-means algorithm.The aim of this step is to perform cluster maps and to obtain a training dataset.The classification was performed by the 'i.maxlik' module of GRASS GIS.The code for these steps is presented in Listing 1 using GRASS GIS syntax: Listing 1. GRASS GIS code for clustering method using k-means algorithm.The visualization of the maps was performed using cartographic tools of GRASS GIS as follows in Listing 2: Listing 2. GRASS GIS code for mapping and cartographic display.The next steps included machine learning (ML) algorithms for image processing and analysis by GRASS GIS.

Machine Learning 3.4.1. Random Forest
The mathematical foundation of the Random Forest (RF) classification consists of the following steps of the workflow initially developed by [65].For b = 1 to B, a bootstrap sample Z * of size N from the training data has been drawn.Afterward, the random-forest tree T b is increased to the bootstrapped data.This is carried out iteratively by recursively repeating the logical steps for each terminal node of the tree that represents the individual class in land cover classification until the minimum node size min is reached.
The m variables are randomly selected using the data obtained in the "r.random" module of GRASS GIS obtained from the pixel variables "p".The best variable and split point are then selected among the m, and the model splits each node into two "daughter" nodes.The output model of the ensemble of trees is received as {T b } B 1 .Then, to make a prediction of the assignment of each pixel within the matrix of the raster image to the specific land cover class, the model evaluates each new point x (that is, a cell on the satellite image) using the following logical expression.Let Ĉb (x) be the class prediction of the b-th random-forest tree.Then, the RF classification is performed by running Equation (1) derived from [89]: The criteria for the RF model include the number of pixels forming the class (the spectral reflectance variability of vegetation and land categories).The optimal parameter was defined as 10 classes using existing similar studies.Second, the extent of the area in the pixel's surrounding was set up according to the resolution of the Landsat images as 30 m per pixel around the target land cover class, forming the complete landscape pattern, plus including the pixels themselves evaluated for spectral reflectance using the ANN and ML methods.Hence, the RF classification obtains a class vote from each tree and then classifies it using majority vote and analysis of each pixel within the raster image.In GRASS GIS, the RF-based image classification is carried out using the code presented in Listing 1.
Here, the training pixels were first generated to train from an earlier land cover classification.Then, they were used as training datasets to perform a classification on recent Landsat images (in the example of code below, for the image of 2023).Afterwards, the model was trained using the "RandomForestClassifier" embedded algorithm using "r.learn.train"module.The prediction of the model's performance was carried out using the "r.learn.predict"module.The shaded relief was added as a background to the image, and the isolines were derived using the "r.contour" module.The code is presented in Listing 3.

MultiLayer Perceptron
The process of optimal categorization of the image scene into the land categories was performed iteratively using the ML tools defining the key parameters of spectral reflectance.The MultiLayer Perceptron (MLP) algorithms are a class of Artificial Neural Network (ANN) methods.The general methodological scheme for the ANN is presented in Figure 6.
In GRASS GIS, the MLPClassifier is derived from the Scikit-Learn Library of Python and is based on fundamentals of predictive learning [64].Its performance differs from other classifiers since it uses the principle of the feedforward ANN.For data analysis, ANN uses three layers that are used as structures of network topology in the flow of information for data partition.Pixels of the raster layer present the nodes of the input, hidden, and output layers (see Figure 6), which train the model using supervised learning.The principal approach of this process consists of the connection between each node in one layer to those in the following layer through a certain weight (w ij ).The MLPClassifier iteratively evaluates the training data using these connections in weights of pixels.The algorithm changes weights repetitively using estimated error until the output image approaches the expected result and the error is minimized.This sequential operation is formulated in Equation ( 2): where e j (n) is the degree of error in an output node j in the n-th pixel of the raster dataset and ε(n) is the node weights, which are iteratively adjusted using the minimization of the errors in the classified raster image for the n-th pixel of the raster matrix.Using optimization, each weight w ij is estimated and changed accordingly, as in Equation ( 3): where y i (n) is the result of the previous step of classification, and η is the tuning parameter of optimization, which aims at the quick convergence of the weights of pixels during the iterative process of image classification.Hence, the MLPC algorithm is sensitive to feature scaling, which is related to the resolution of the original raster image.The essential approach of this algorithm consists of the random selection of the hidden nodes and analytical determination of the output weights of neural networks [63].As a result, the MLPClassifier algorithm ensures higher generalization output at a faster learning speed.Using ML modules of GRASS GIS, the image classification using the MLPC algorithm was implemented using the combination of the modules "r.learn.train"used for extracting training data, supervised machine learning, and cross-validation using the Python package Scikit-Learn, and the module "r.learn.predict"for estimating prediction of pixels' classification.The technical implementation was performed using the code in Listing 4:

Support Vector Machine
The Support Vector Machine (SVM) Classifier uses supervised learning methods [66] and classifies the data into classes using decisions on the largest separation between the classes.Hence, it discriminates the values of the pixels constituting the images and identifies the largest distance to the nearest training sample.When SVC uses training vectors of sample pixels that are located within the margin of classes using the following algorithm approach-x i ∈ R p , where i = 1 . . .n as two classes and a vector y ∈ {1, −1} n -it aims to find the w ∈ R p and b ∈ R so that the assignment of pixels to correct land cover class is true for most samples using Equation (4): which depends on the definitions of y i (ω T φ(x i ) + b) ≥ 1 − ζ i , which should be greater than one for the optimal prediction of the correctly classified pixels on a raster scene, and The SVC analyzes Digital Numbers (DNs) of pixels on the image to maximize the margins of the classes using iterative analysis.The estimated decision function in a classified matrix of the image consists of cells for a sample of x pixels, as shown in Equation (5): The result of the classification assigns the predicted classes and the support vectors, which are summarized using attributes of the classification.The effectiveness of the SVM method is that it is a memory-efficient approach that optimizes the use of the computational capacities of the machine.The flexibility of this algorithm is ensured by different Kernel functions defined, which include both common and custom kernels.The practical implementation of this approach in GRASS GIS is presented in Listing 5 below.First, the SVC model is trained using the "r.learn.train"module.Then, the prediction of pixel assignments is carried out using the "r.learn.predict"module.Afterward, the raster categories thatare automatically applied to the classification output are checked using "r.category" module.The visualization is performed using the modules "r.colors", "d.rast", and "d.legend".An example of the code used for classification of the image for 2019 is presented below.
Listing 5. GRASS GIS code for SVC method for supervised image classification.model_name = SVC n_estimators =500 save_model = svc_model .gz --overwrite r .learn .predict group = L_2019 load_model = svc_model .gz output = s v c _ c l a s s i f i c a t i o n r .category s v c _ c l a s s i f i c a t i o n r .colors s v c _ c l a s s i f i c a t i o n color = bcyr -e d .rast s v c _ c l a s s i f i c a t i o n d .legend raster = s v c _ c l a s s i f i c a t i o n title = " SVM 2019 " title_ fontsi ze =19 font = " Helvetica " fontsize =17 bgcolor = white border_color = white The GitHub repository is created to summarize the methodology and the results of all the models and classification outputs.The programming scripts used for plotting the data, confusion matrices, and maps are also included in this repository for a comparative analysis of the statistical outputs and quantitative estimations on land cover types.
The overall performance of the tested ML classifiers and ANN were evaluated for accuracy, F1 score, Cohen's kappa coefficient, and other parameters.Here, Cohen's kappa is a quantitative measure that evaluates the reliability of rating coefficients that assess the accuracy of pixels's classification and assignment to diverse land cover types [90].The applicability of Cohen's Kappa techniques for data evaluation is proven by their use in various studies [91][92][93].The reliability was evaluated for three classifiers-(1) Random Forest Classifier, (2) Support Vector Machine (SVM) Classifier, and (3) Multilayer Perceptron Classifiers (MLPClassifier)-to evaluate the accuracy of these approaches to satellite image processing.Cohen's Kappa is an important factor in the interpretation of test findings regarding raster image classification.In this study, Cohen's kappa was defined in weighted form using the formula in Equation ( 6): where the P 0 indicates the probability of agreement and P e is the probability of random agreement.The advantage of Cohen's kappa is that it presents a robust measure of estimating the statistical probability of evaluation compared to the simple percent agreement calculation [94].This is possible since Cohen's kappa takes into account the possibility of the random agreement of pixels' assignment to diverse land cover classes.Hence, it is an appropriate measure of the reliability of pixels assigned to diverse land cover classes.In this way, Cohen's kappa estimates the degree to which these classifiers produce similar results under consistent environmental conditions of landscapes, that is, the same date, sunlight, and azimuth angle of the satellite images.
In contrast to Cohen's Kappa, the F-score evaluates the predictive performance and classification performance.Generally, it is calculated from the precision of the classification using the number of correctly classified pixels divided by the number of all samples that are predicted to be correct.Precision indicates the positive predictive value in sensitive classification [95,96].According to previous studies on image classification [97], the estimation of the F score is performed using the following formula in Equation ( 7): Widely used in diverse cases of image processing, such as classification, partition, or segmentation [98][99][100], the F1 score presents the harmonic mean of the precision and recall of image classification model.These two metrics contribute equally to the estimation, which enables the F1 score metric to correctly indicate the reliability of image classification.

Results and Discussion
The identified categories include the following land cover types, defined as classes corresponding to the following land cover patterns: (1) salt water bodies (ocean); (2) brackish water (lake); (3) wetlands and coastal lagoon; (4) dense deciduous forest; (5) agricultural land, croplands; (6) trees and vegetated areas; (7) built-up and urban areas; (8) grassland and shrubland; (9) rural areas; and (10) freshwater (river).The spatio-temporal change in land cover types around Chilika Lake between the years 2019 and 2024 is shown in maps presented in Figures 7-10 and compared with existing similar research [35,101].The hydrological effects can be explained by the increase in water turbidity during the spring period, which affects the visibility of the lake surface through the decreased transparency of the water column affected by the increased photosynthetic activity during this period.The distribution of mangroves that are mostly situated along the tidal water ways are distinguished due to their spectral separability on the EO data.Thus, salt marsh grasses are generally located along the tidal flats due to the specific environmental setting of these physiographic units.
The areas of wetlands and coastal lagoon as well as water bodies increased due to new mouth opening of the Chilika Lake.The scrubland and grassland decreased, while the forest areas increased.This numerical computations of changes are summarized in Table 1, and the statistical report tables are provided in the GitHub repository.The dynamics in land cover types in the coastal area of Chilika imply that that scrubland and grassland were converted into forest due to higher dense vegetation cover.Notably, the agriculture fallow land does not show much difference between the estimated years of 2019 and 2024.The comparative analysis of the maps shows that the surroundings of the Chilika Lake underwent changes in land cover types.This was caused by the cumulative effects from both the anthropogenic and natural events and results in variations of biological productivity, water eutrophication, and the extent of mangrove forests which strongly depend on the variations in the salinity as discussed above.In turn, the salinity level in the lake is regulated by the monsoon processes and the changes in oceanic seawater and river inflow and counterparts.The effects from the human activities in the watershed of Chilika lagoon result in increased siltation and an increase in nutrients.The land cover types were identified on the satellite images in the study area using information on classification adopted from existing studies [71]; see Figure 7.
During the evaluated period of 2019-2024, notable changes were observed in the Chilika Lake surroundings, including settlements and populated built-up areas, agriculture lands, croplands and plantations, and barren or wasteland areas.At the same time, other classes did not show many differences in the five-year time span.Between 2019 and 2024 (Table 1), the area of agricultural plantations as well as barren land decreased, while urban population areas and built-up areas occupied by settlements increased.The increase in the settlement and built-up areas is identified between 2019 and 2022 and between 2023 and 2024, which might refer to the natural increase in urban population caused by socio-economic drivers; see Figure 8.
Besides the anthropogenic issues, the variations in the color and salinity of water within the Chilika Lake visible on the images are related to the monsoon rains, which strongly influence the hydrography of the coastal lagoon; see Figure 9. Thus, the inflow of sediment and water from the catchments of rivers and tributaries upstream are at a maximum during monsoon months.This is also increased by the oceanic sediment transport.Consequently, these months are notable for intensive floods and turbulence of waters in the coastal lagoon.Accordingly, the impact of both oceanic tides and freshwater inflow from the rivers regulate the salinity of Chilika Lake and change it depending on the dominating force.Spatial variability of the salinity detected in the images classified using the ANN-based MLPClassifier demonstrates the decrease during the monsoon period due to the influx of freshwater, which especially concerns the northern and central segments of the lake, while the southern sector is the least affected even during monsoon and maintains its brackish-water conditions; see Figure 10.Such hydrological patterns are well reflected in the satellite images, which show various colors of water in the lake.During spring periods, when the images were taken, the water level of the lagoon gradually decreases and reaches its lowest level during summer.Finally, the effects of winds also creates an input into the balance of salinity of the Chilika Lake by water turbidity in upper layers.Such climate effects facilitate the influx of saline water from the ocean and increase the salinity of the lake accordingly.In turn, the variability in salinity and geochemical characteristics related to such complex processes affect aquatic vegetation and algae bloom during favorable periods, which can be identified in the satellite images.The siltation, increase in nutrient enrichment, and reduced salinity favored the growth of weeds and eutrophication within the basin of lake.Such changes can be detected by computer vision algorithms using the Random Forest method due to different levels of spectral reflectance of the lacustrine surface; see Figure 8.Hence, spring algal bloom in the inland waters of Chilika Lake is a major issue in the spring period, when the imagery was taken, which in turn affects water quality, especially in the shallow estuarine segments of the lake.Furthermore, the effects of eutrophication lead to the degradation of the aquatic habitats.For instance, the level of eutrophication increased in the lake from 2019 to 2020 and 2021, after which the situation stabilised.The results of image classification using SVM method are presented in Figure 9. Image analysis showed that the algal growth is increased in 2021 and 2024, which was best detected using the ANN approach of MLPClassifier, Figure 10.The evaluation of accuracy was performed using chi-square test and visualized in Figure 11.Accordingly, the stability of points assigned to land cover classes were evaluated and reported as follows for each year: for 2019, 98.08% points stable; for 2020, 98.23%; for 2021, 98.19%; for 2022, 98.25%; for 2023, 98.37%; and for 2024, 98.08% points stable, respectively.Finally, class means and standard deviations computed for each band and each year, respectively, are presented in the Table A2 in Appendix B for the years 2019 to 2024.The maps of accuracy analysis assessment of image classification has been performed using rejection-threshold probability techniques using chi square test, the results of which are presented in Figure 11.The final results of the computations included the convergence for iterations of pixels assigned to the land cover classes.The convergence was computed for each image and demonstrated the following results: for the year 2019, 98.1%; for the year 2020, 98.2%; for the year 2021, 98.2%; for the year 2022, 98.2%; for the year 2023, 98.4%; and for the year 2024, 98.1%.This shows high precision and accuracy of the calculations using GRASS GIS algorithms of image processing.The class separability matrices computed for 10 land cover types identified in the Chilika lagoon for the years 2019, 2020, 2021, 2022, 2023, and 2024 are reported in the tables placed in Appendix C. The GitHub repository with GRASS GIS scripts and the results of the ANN and ML image processing is available online at https://github.com/paulinelemenkova/India_Chilika_Lake_GRASS_GIS_ANN_ML_Image_Processing (accessed on 4 April 2024) and contains the results of the image processing.Diverse types of lacustrine vegetation such as weeds and grasses distributed along the sheltered lagoon margins include dense algae meadows on the flats and shallow water areas, which are mostly occupied by brackish weeds and other marsh grasses.Such vegetation largely depends on the specific lacustrine environment and is naturally distributed along the inner margin of the Chilika coasts.Marine influences on the Chilika coastal lagoon are directed through the barrier spits and estuaries of the adjacent rivers.Among the environmental problems that affected the variations in the land cover types are the strong eutrophication, which was visible on the images due to the heavy nutrient upload.
Change-detection analysis in the land cover types from 2013 to 2023 demonstrated shifts in land cover types within the basin of the Chilika Lake coastal lagoon.The computed areas occupied by various land cover classes for each of the category are summarized in Table 1.Such variability of landscape patches around the Chilika Lake proved the existence of fluctuations in the processes of siltation and eutrophication in the coastal lagoon over the studied period (years 2019-2024).Major causes of such environmental changes are related to sedimentation processes such as low river flow speed, which cause stagnation of water and eutrophication.Second, the increased sediment budgetary balance and flushing of sediments into the Bay of Bengal also contribute to the changes in the lake level.Since water fluctuations in various parts of the Chilika Lake are subject to monsoon effects in dry and wet seasons, wetland types can be discriminated on the satellite images along the margin areas of the lagoon.These variations are also caused by the different hydrological effects (e.g., currents and water level) and geomorphic settings of the surrounding relief.
Table 2 compares the hyper parameters of the GRASS GIS used for classification indices in for ANN and various models of ML.Cohen's Kappa was computed to compare the results of raster image classification, such as landscapes viewed by RS Landsat sensors and identified using ANN and ML algorithms of GRASS GIS.In this way, Cohen's Kappa presents a straightforward statistical approach that computes the confusion matrix by considering each pixel of the comparable raster images as a single rating made by two raters.In this regard, the results were computed and summarized below.
Table 2 summarizes and compares such parameters as accuracy, F1 score, Cohen's kappa coefficient, and related parameters.Based on the computed values of Cohen's Kappa coefficient and the F1 method for the evaluation of classified data, the results suggest that the 0.785 Cohen's Kappa coefficient and F1 score of 0.89 are rated as the good strength of agreement.Hence, the ANN technique can be interpreted as a reliable approach for satellite image classification in terms of accuracy, followed by the SVM and RF.The interpretation of the F score is as follows.The highest possible value of 1.0 indicates the perfect precision and recall of variables, while the lowest theoretically possible value of 0 indicates the lowest precision or recall which are zero in this case.Hence, higher values generally mean better results of satellite image classification.Likewise, as other correlation statistics, the kappa can range from −1 to +1.The interpretation of values is as follows.Values of the levels of 0.60 to 0.79 have a moderate level of agreement, those in the interval of 0.8-0.9(that is, over 80% below 90%) are acceptable as having strong level of agreement, and those above 0.9 are almost perfect [102].The short-term evolution of the physiographic habitats and land cover types around the Chilika coastal lagoon is strongly affected by the humid monsoon climate of the Bay of Bengal and human activities.The analysis of satellite images performed using ML and ANN algorithms enabled us to recognize fluctuations over the areas of Chilika lagoon, which depend on the water level, tidal level, monsoon effects, and inflow of rivers of Daya and Bhargavi into the lagoon.In terms of technical capabilities, the ANN model performed the best in terms of learning capabilities of the model in GRASS GIS and the effectiveness in capturing anomalies and outliers in classified pixels assigned to various land cover classes.Although the ANN model required high computational resources and was demonstrated to be a highly time-consuming model, its performance was excellent with regard to the other models.Furthermore, the lagoon depends on the inflow from the sea waters through the small stream inlets.Such variations in hydrology strongly affect the lacustrine environments, as demonstrated on the satellite images processed using GRASS GIS.
Wetlands, swamps, and mangroves situated along the coasts of the Chilika lagoon are modified by the increase in sediments, which is in turn caused by the effects from monsoon storms and repetitive local floods in the estuaries.Thus, spatial variations in the lake's surface revealed the topographic differences between various regions of the lagoon.For instance, its western part is more vulnerable to geomorphic erosion due to higher topographic elevations.This is exaggerated by the more intense fishing in this part of the lagoon, which is caused by the location of settlements and villages placed in this region.Finally, high rainfall triggered runoff, which affected the watershed of the Chilika Lake.
High biodiversity in the physiographic setting of the Chilika lagoon affected the structure and intensity of the habitat dynamics in its ecosystems.Thus, the complex mosaic of the vegetation types detected on the multispectral satellite images point at different geomorphic and landscape units of the lagoon system.Local variations in the vegetation distribution along the fringes of the Chilika lagoon are caused by the periodic monsoon effects on the hydrological system and interruptions in the sedimentation process, which depend on the riverine inflow as discussed above.These processes are intensified by local soil erosion, monsoon storms, and fishery activities, which have affected the distribution of vegetation on tidal flats and the surrounding landscapes.
Social and ecological implications of the presented findings include mapping lacustrine landscapes of Chilika Lake in East India for spatio-temporal analysis of changes in land cover types quality and extent.Specifically, the evaluated changes enable the detection of eutrophication and dynamics in the nearby landscapes, which is essential for the environmental monitoring of the coastal lagoons.

Conclusions
Mapping coastal lagoons using RS data over time is important for distinguishing the effects of climate and anthropogenic impacts causing major natural disturbances in the surrounding landscapes.This study demonstrated the application of ML/ANN techniques for satellite image processing, aiming to map and visualize changes over the coastal landscape of Chilika Lake in the 2019-2024 time period.The mosaic of ten land cover types was identified around the Chilika Lake and recognized at a resolution of 30 m using GRASS GIS techniques of ML applied to satellite image processing, allowing an assessment of individual landscape patches.Technically, we analyzed the difference between the effects of diverse ML algorithms (Maximal Likelihood discriminant analysis, Random Forest, Support Vector Machine, and MLPClassifier of ANN) on Landsat image analysis and the effects of these approaches on image classification.
Owing to technological limitations of existing GIS tools (traditional methods of classification), some previous mapping efforts in the Chilika Lake relied on a relatively coarse discrimination of land/water coverage and the detection variability of land cover types, including mangroves.These were referred to as potential aquatic habitatsto analyze all coastal areas where land cover changes would likely occur.In contrast, this study revealed the effectiveness of the ML/ANN algorithms for RS data classification, which present a powerful alternative to the traditional cartographic tasks through the automation of image classification.
The results of this research show the impact of climate variability on changes in land cover types during short-term time gap and landscape dynamics around the lagoon of Chilika Lake.Recent boundaries of coastal land categories (2023) closely followed the contour of previous patches detected on earlier images (e.g., 2020, 2019), largely because the extent of the lake limits the existing land cover types, as well as distribution of aquatic vegetation and restrict mangroves to shallow depths with brackish water.The implications of this study are useful for environmental decision-making and can support in policymaking in the sustainable development of Ramsar marine sites of India.The paper also demonstrated the value of EO data for environmental analysis.Landsat 8-9 OLI/TIRS satellite images provide an added value to the conventional environmental analysis through land cover classification using ML methods.
Machine learning (ML) and ANN techniques were evaluated for satellite image processing and proved their effectiveness and usefulness for the environmental analysis performed using GRASS GIS.The use of such data and methods can improve cartographic results in coastal areas that observe some structural variability in landscape patches.A landsat sensor with moderate spectral and high temporal resolution has proven useful in estimating the spatio-temporal variations in land cover types in the coastal lagoon of Chilika Lake in early spring period since 2019 to 2024.Specifically, Random Forest, Support Vector Machine, and MultiLayer Perceptron classifier algorithms were successful in capturing the trend of landscape dynamics in the lacustrine surroundings using ML methods of the automatic analysis of spectral reflectance of pixels.Similarly, the scripting algorithm of GRASS GIS enabled the cartographic workflow using different modules for raster data processing.
Time series maps of land cover types derived from the classified Landsat images explained the overall relationship between climate effects, environmental patterns, and hydrological processes through sedimentation, algal bloom detected in the lake during the spring period, and variations in land cover types of the surrounding landscapes in eastern India, as evaluated during a short-term period of the past six years.

Figure 1 .
Figure 1.Topographic map of India with indicated study area showing the location of the Landsat data within the country.Software: GMT.Map source: author.

Figure 2 .
Figure 2. Enlarged fragment of the topographic map of India with indicates study area showing the location of the Landsat data over the coastal lagoon of Chilika Lake.Software: GMT.Map source: author.

Figure 3 .
Figure 3. Administrative map of India showing the location of the Odisha state on the eastern coast of the Indian subcontinent.Software: QGIS.Map source: author.
2. The Worldwide Reference System (WRS) Path/Row is 140/46.The images were projected into the Universal Transverse Mercator (UTM) projection Zone 45, Datum and Ellipsoid is World Geodetic System 84 (WGS84), Ground Control Points Version 5; Station Identifier LGN.The remaining technical characteristics of the EO data are summarized in Appendix A.

Listing 3 .
GRASS GIS code for RF method for supervised image classification.r .random input = L _ 20 19 _ cl us t er s seed =100 npoints =1000 raster = t ra in i ng _ pi xe l s --overwrite r .learn .train group = L_2023 training_map = tr a in in g _p ix e ls \ model_name = R a n d o m F o r e s t C l a s s i f i e r n_estimators =500 save_model = rf_model .gz r .learn .predict group = L_2023 load_model = rf_model .gz output = r f _ c l a s s i f i c a t i o n r .category r f _ c l a s s i f i c a t i o n r .import input =/ path / Chilika / gebco2023 .tif output = shaded \ _relief1 extent = region r .contour shaded \ _relief1 out = isolines step =200 --overwrite

Figure 6 .
Figure 6.General methodological scheme for the Artificial Neural Network (ANN) used for image classification.Software: R version 4.3.3,DiagrammeR library version 1.0.11.Diagram source: author.

Listing 4 .
GRASS GIS code for RF method for supervised image classification.r .learn .train group = L_2019 training_map = tr a in in g _p ix e ls \ model_name = MLPClassifier n_estimators =500 save_model = mlpc_model .gz --overwrite r .learn .predict group = L_2019 load_model = mlpc_model .gz output = m l p c _ c l a s s i f i c a t io n -overwrite r .category m l p c _ c l a s s i f i c a t i o n r .colors m l p c _ c l a s s i f i c a t i o n color = plasma -e # data mapping : d .mon wx1 d .rast shad ed_rel ief1 d .vect isolines color = ' 100:93:134 ' width =0 d .rast m l p c _ c l a s s i f i c a t i o n d .grid -g size =00:30:00 color = white width =0.1 fontsize =16 text_color = white d .legend raster = m l p c _ c l a s s i f i c a t i o n title = " MLPC 2019 " tit le_fon tsize =19 font = " Helvetica " fontsize =17 bgcolor = white border_color = white d .legend raster = sh aded_r elief1 title = " Relief , m " title _fonts ize =19 font = " Helvetica " fontsize =17 bgcolor = white border_color = white -f d .out .file output = MLPC_2019 format = jpg --overwrite

Figure 11 .
Figure 11.Accuracy analysis of image classification using algorithm of reject threshold probability by chi square test.Background relief: GEBCO.Software: GRASS GIS.Mapping source: author.

Table 1 .
Estimated classes of land cover types for 2019-2024 in the Chilika Lake coastal lagoon.