Next Article in Journal
A Robust and Reliable Positioning Method for Complex Environments Based on Quality-Controlled Multi-Sensor Fusion of GNSS, INS, and LiDAR
Previous Article in Journal
Next-Generation C-Band SAR Mission: Design Concept for Earth Observation Service Continuity
Previous Article in Special Issue
Ocean Colour Estimates of Phytoplankton Diversity in the Mediterranean Sea: Update of the Operational Regional Algorithms Within the Copernicus Marine Service
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Machine Learning Approaches to Phytoplankton Identification and Classification Using GCOM-C/SGLI Imagery

by
Danang Surya Candra
1 and
Eko Siswanto
2,3,*
1
Research Center for Geoinformatics, Research Organization for Electronics and Informatics, National Research and Innovation Agency of Indonesia (BRIN), Bandung 40135, Indonesia
2
Research Institute for Global Change (RIGC), Japan Agency for Marine-Earth Science and Technology (JAMSTEC), Yokohama 236-0001, Japan
3
Advanced Institute for Marine Ecosystem Change (WPI-AIMEC), Yokohama 236-0001, Japan
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(22), 3759; https://doi.org/10.3390/rs17223759
Submission received: 30 September 2025 / Revised: 16 November 2025 / Accepted: 17 November 2025 / Published: 19 November 2025

Highlights

What are the main findings?
  • Machine learning approaches were developed to classify phytoplankton types, including coccolithophores, diatoms, and dinoflagellates, using GCOM-C/SGLI satellite imagery.
  • Random Forest (RF) and Gradient Tree Boosting (GTB) models outperformed other tested algorithms, achieving high accuracy of classification results.
What are the implications of the main findings?
  • The developed machine learning models enable scalable monitoring of phytoplankton blooms, supporting both regional and global ocean observation.
  • Combining remote sensing with artificial intelligence can be used as an alternative approach to monitor the marine ecosystem.

Abstract

Phytoplankton are fundamental to sustaining marine ecosystems and significantly influence the global carbon cycle. However, identifying their types accurately from satellite imagery remains a challenge. This study presents machine learning approaches for classifying phytoplankton types, including coccolithophores, diatoms, and dinoflagellates, using Second-generation Global Imager (SGLI) imagery aboard the GCOM-C satellite. Several algorithms were evaluated, with Random Forest (RF) and Gradient Tree Boosting (GTB) achieving the highest classification performance in classifying coccolitophores and diatoms. On the other hand, both RF and Classification and Regression Trees (CARTs) are effective for distinguishing dinoflagellates from surrounding water types. To assess model transferability, the developed machine learning models were applied in another sub-regions and on a different date of acquisition. The validation confirmed the ability of the model to generalize across sub-region and temporal variations in SGLI imagery. As a result, the potential of combined machine learning and SGLI imagery can improve phytoplankton detection, enabling large-scale monitoring at both regional and global levels. This paper highlights the importance of combining artificial intelligence with satellite-derived ocean color data to improve the monitoring of marine ecosystems.

1. Introduction

Phytoplankton, key primary producers in the ocean, generate about 50% of global net primary production and thereby shape nutrient cycling, carbon dynamics, and climate regulation [1,2,3]. Coccolithophores, diatoms, and dinoflagellates are important phytoplankton functional groups, each with distinct ecological and biogeochemical roles [4,5].
Coccolithophores are unicellular eukaryotic phytoplankton that contribute significantly to the global carbon cycle through particulate inorganic carbon (PIC) production and calcification. Coccolithophores produce calcium carbonate (CaCO3) plates (coccoliths) that regulate seawater optics and sustain the oceanic carbonate pump via calcification flux [6,7,8]. During blooms, detached coccoliths in suspension intensify backscattering and reflectance at times forming striking turquoise patches visible in satellite images [9,10]. These features complicate discrimination from non-calcifying taxa and suspended particulates [4,11]. Diatoms, by contrast, build siliceous frustules and dominate in high-nutrient, upwelling, or high-latitude regimes; they are among the most efficient drivers of organic carbon export (the biological pump) in many ocean regions [12,13,14]. Dinoflagellates are prominent drivers of harmful algal blooms (HABs) that threaten marine resources and ecosystem stability, with trophic flexibility spanning autotrophy, heterotrophy, and mixotrophy [15,16,17]. Due to this diversity in physiological and optical traits, distinguishing these groups remotely is crucial for understanding ecosystem dynamics and carbon cycling.
Satellite ocean color remote sensing has been widely used to monitor phytoplankton distributions since the 1990s, via sensors such as SeaWiFS, MODIS, MERIS, and OLCI [18,19,20]. These platforms measure water-leaving reflectance, which is related to inherent optical properties (absorption and scattering) of water constituents. However, reliably discriminating phytoplankton functional types (PFTs) via conventional empirical or semi-analytical algorithms remains elusive. Coccolithophore optical signals often overlap with those from non-alkalinity variance and suspended sediments; traditional band-ratio or threshold-based indices yield ambiguous results in optically complex waters, leading to misclassification [4,8]. On the other hand, diatoms and dinoflagellates share overlapping pigment absorption features and scattering characteristics with other taxa, further confounding discrimination [21]. In coastal or turbid waters, high CDOM or suspended particulate load further degrades classification performance [22,23,24].
To address these limitations, machine learning (ML) has emerged as a promising path forward. Machine learning (ML) methods can be used to capture complex nonlinear relationships between spectral signatures and phytoplankton groups, which offers high performance on high-dimensional and noisy datasets [25,26]. Recent reviews highlight the growing integration of ML in ocean color retrieval, classification, and fusion tasks [27]. Machine Learning can be used to rapidly retrieve ocean color information from hyperspectral satellite observations under the influence of clouds, aerosols, and sunglint [28]. In addition, machine learning has proven to outperform traditional physics- and statistics-based algorithms in image information extraction across various domains, and its potential is increasingly being recognized in ocean remote sensing applications [29]. On the other hand, the most effective artificial neural network architectures for estimating chlorophyll concentrations in coastal waters using data from the MODIS Aqua ocean color sensor have been identified [30]. Several algorithms have been developed to estimate the large-scale distribution of phytoplankton cell sizes, referred to as phytoplankton size classes (PSCs), in surface ocean waters using remotely sensed data. Among the tested methods, the Random Forest approach outperformed PLS, ANN, and SVM in calibrating PSC retrieval models [31].
The GCOM-C (Shikisai) satellite mission, operated by JAXA, carries the Second-generation Global Imager (SGLI) sensor, offering 19 spectral bands across visible to shortwave infrared wavelengths and spatial resolutions from 250 m to 1000 m [32,33,34]. To maximize the potential of SGLI, a shift from conventional thresholds to a data-driven classification framework is required.
This study aims to develop a machine learning model to identify and classify coccolithophores, diatoms, and dinoflagellates using GCOM-C/SGLI imagery. Several machine learning algorithms will be evaluated and compared with particular emphasis on ensemble classifiers that have demonstrated high predictive performance. This study will also assess the developed machine learning models for another sub-region and on a different date of acquisition. The aim is to demonstrate the ability of the models to generalize across sub-regions and temporal variations in SGLI imagery. The implications of these findings are for operational monitoring of marine ecosystems.

2. Materials and Methods

2.1. Site Selection and Rationale

To demonstrate the reliability of the proposed method and ensure broader applicability, SGLI data from several sub-regions were utilised in this study, as illustrated in Figure 1. We also used NOAA (National Oceanic and Atmospheric Administration) ETOPO1 which provides a 1-arc-minute resolution representation of Earth’s relief, integrating both land elevations and ocean-floor bathymetry. Several existing studies have identified or confirmed the presence of optical water and phytoplankton types in different sub-regions, as described in Table 1, which were used to verify the presence of phytoplankton in these areas. This is to assist in creating datasets for a machine learning model.

2.2. GCOM-C/SGLI Data Acquisition and Extraction

The GCOM-C/SGLI provides 19 spectral bands at spatial resolutions ranging from 250 m to 1000 m, with a global revisit time of approximately two days. Its unique bands, such as 380 nm UV and shortwave infrared, enhance its capacity to resolve atmospheric and oceanic processes. Level-2 SGLI products underwent atmospheric correction using JAXA’s standard algorithms, followed by cloud masking, radiometric calibration, and sunglint removal. Atmospheric correction was conducted using standard SGLI algorithms, ensuring removal of aerosols and Rayleigh scattering. Cloud and glint pixels were masked. The reflectance spectra were normalised to minimise illumination differences.
This study used the SGLI Level-2 dataset with the latest Version 3 atmospheric correction that includes remote sensing reflectance (Rrs, sr−1) and Chl-a at a spatial resolution of 250 m [38]. The Second-generation Global Imager (SGLI) measures remote-sensing reflectance (Rrs) at seven discrete wavelengths (380, 412, 443, 490, 530, 565, and 670 nm) spanning the ultraviolet to visible spectral range. Chl is usually used to identify the presence of phytoplankton, but it cannot be used to accurately determine the type of phytoplankton. Therefore, both Rrs and Chl were used to identify the type of phytoplankton for machine learning datasets.

2.3. Machine Learning Approach to Phytoplankton Identification and Classification

Phytoplankton classification using GCOM-C/SGLI imagery and machine learning is presented in Figure 1. All image preprocessing, classification and analysis were conducted using the Google Earth Engine platform [39]. A crucial aspect of implementing machine learning is the construction of an appropriate dataset, as it enables the model to learn and effectively distinguish the target classes. In this study, we employed remote sensing reflectance (Rrs) at discrete wavelengths (380, 412, 443, 490, 530, 565, and 670 nm), hereafter denoted as b1–b7. Chlorophyll concentration values were incorporated to guide class definition, while published phytoplankton data were additionally utilized to ensure the dataset accurately represented each category. The dataset was divided into two parts: a training dataset for the machine learning process (70%) and a testing dataset to evaluate the reliability of the machine learning model (30%). We also used bathymetry data (NOAA ETOPO1) to enhance classification accuracy, particularly in distinguishing between classes such as turbid waters and coccolithophore-dominated waters. The final model can be used to identify and classify phytoplankton in other areas or other acquisition dates of the GCOM-C/SGLI imagery.
This study used several machine learning methods to identify and classify phytoplankton. We used Random Forest (RF), Classification, Regression Tree (CART), and Gradient Tree Boosting (GTB). Tree-based algorithms especially RF have consistently demonstrated superior performance in remote sensing applications compared to traditional statistical and classical machine learning classifiers, such as Maximum Likelihood and Support Vector Machines (SVMs) [40,41,42,43]. They have consistently demonstrated superior performance in remote sensing classification tasks. Unlike parametric classifiers, these methods do not rely on distributional assumptions of spectral features and are therefore more robust when applied to heterogeneous environments. RF is widely recognized for its robustness and computational efficiency, while GTB has shown even higher accuracies in distinguishing spectrally overlapping classes, such as turbid waters and coccolithophore blooms. Compared to a classical machine learning algorithm (e.g., ANN, SVM, or kNN), these algorithms not only provide improved accuracy but also enhanced generalization capacity, making them particularly suitable for complex ocean-color and phytoplankton classification problems.
The RF algorithm was employed in this study as the primary machine learning method due to its robustness and suitability for both classification and regression tasks [44]. RF is an ensemble learning technique that combines the predictions of multiple decision trees to improve generalization performance and reduce overfitting. This method is based on the principle of bootstrap aggregating, whereby multiple subsets of the training dataset are generated through random sampling with replacement. Each subset is used to train an individual decision tree, thereby introducing diversity among the base learners.
During the construction of each tree, a random subset of features is selected at every decision node rather than considering the entire feature set. This randomization ensures a low correlation among trees, further enhancing the ensemble’s predictive power. Each decision tree is grown to its maximum extent without pruning, which allows the trees to capture complex patterns in the data. For classification tasks, the final prediction is determined through majority voting across all trees. Mathematically, the Random Forest prediction for classification can be expressed as:
y = m o d e { h 1 x , h 2 x , h 3 x , , h B x , }
where h B x denotes the prediction of the b t h decision tree and B represents the total number of trees in the forest. Several hyperparameters govern the performance of the Random Forest model. The number of trees ( n_estimators ) influences the stability and accuracy of the predictions, while the maximum number of features considered at each split ( max_features ) controls the diversity among trees. Tree complexity is managed through parameters such as maximum depth ( max_depth ) and the minimum number of samples per split or leaf, which serve as stopping criteria during tree construction.
The RF algorithm offers several advantages, including resilience to noisy data and outliers, the capacity to handle high-dimensional feature spaces, and the provision of internal estimates of feature importance, which enhance interpretability.
CART represents a non-parametric supervised machine learning approach that partitions input data into homogeneous subsets [45]. The final model is structured as a binary tree in which terminal nodes correspond to predicted classes or continuous values. CART offers simplicity, interpretability, and direct feature selection, but its predictive accuracy is often constrained by high variance and sensitivity to noise, particularly in heterogeneous remote sensing environments.
On the other hand, GTB is an advanced ensemble learning approach that constructs predictive models by sequentially combining multiple decision trees, where each tree is trained to correct the residual errors of the preceding ones [46]. GTB has become a widely adopted technique due to its ability to handle high-dimensional, heterogeneous, and often noisy spectral and spatial features. Therefore, this classifier is suitable to address complex ocean-color and phytoplankton classification issues.

2.4. Assessment of the Results

To ensure the robustness and generalizability of the model, the performance of the classification model was assessed using a confusion matrix, which provides a comprehensive comparison between predicted and actual class labels. The matrix summarizes classification outcomes in terms of correctly and incorrectly classified samples, structured as true positives, true negatives, false positives, and false negatives. From this matrix, several evaluation metrics were derived, namely Producer’s Accuracy, User’s Accuracy, Overall Accuracy, Kappa Coefficient, Correlation Coefficient and R2 which together provide a rigorous assessment of classification reliability [47].
Producer’s Accuracy ( P A ) represents the probability that a reference sample of a given class is correctly classified. It is calculated as the ratio of correctly classified samples in a particular class to the total number of reference samples for that class. P A is analogous to recall or sensitivity, and it reflects how well the classifier can recognize members of a given class:
P A i = n i i j = 1 k n i j
User’s Accuracy ( U A ) measures the probability that a sample classified into a given class actually belongs to that class. It is calculated as the ratio of correctly classified samples in a class to the total number of samples assigned to that class. U A is equivalent to precision, and it indicates the reliability of the classification from the user’s perspective:
U A i = n i i j = 1 k n j i
Overall Accuracy ( O A ) provides a general measure of classification performance by dividing the total number of correctly classified samples by the total number of samples in the dataset:
O A = j = 1 k n i i N
where N is the total number of samples across all classes. While O A is widely used, it may not fully capture classification reliability in the presence of class imbalance. To address this limitation, the K a p p a   C o e f f i c i e n t   ( κ ) was calculated. Kappa measures the agreement between the classification results and the reference data, adjusted for the agreement that could occur by chance. It is expressed as:
κ = O A P e 1 P e
where P e is the hypothetical probability of chance agreement, calculated from the row and column totals of the confusion matrix.
In this study, we used Pearson correlation that compares actual class and predicted class. It shows whether the classifier’s predictions increase or decrease in a pattern similar to the actual labels. To classify the correlation coefficients as indicating weak, moderate, or strong associations, we applied the interpretive thresholds outlined in Table 2 [48].

3. Results

The identification and classification of phytoplankton using GCOM-C/SGLI imagery are divided by phytoplankton types, i.e., coccolithophores, diatoms, and dinoflagellates. Each type of phytoplankton in this study will be identified using several machine learning methods in this section and analysed to find the best machine learning model. In our study, we adopted a simpler 70/30 train–test split approach to maintain computational efficiency and ensure a clear separation between the training and independent testing datasets. Given the large size and representativeness of our dataset, this split was sufficient to achieve stable and reliable model performance without significant overfitting.
We conducted various scenarios with different inputs for these ML models and tested them to determine the optimal scenario. The scenarios are utilising: (1) seven bands of SGLI data using RF, (2) six bands of SGLI data excluding Rrs380 using RF, and (3) seven bands of SGLI data using CART, and (4) seven bands of SGLI data using GTB.

3.1. Coccolithophore Blooms Identification and Classification

Three classifiers, i.e., RF, CART, and GBT, were tested to identify and classify coccolithophore blooms in Sagami Bay, Japan. The following are the results of coccolithophore blooms identification and classification using various scenarios with three different ML classifiers. We used a 70/30 train–test split approach with 203 samples for the coccolithophore class, 57 samples for the turbid class, 86 samples for the oligotrophic class, 26 samples for the mesotrophic class, and 29 samples for the mix water class. The spatial distribution of all samples were spread evenly across each class and split for 70/30 train–test randomly.
Figure 2a,b show the Rrs values of each class provided from the datasets and the average Rrs values for each class, with Rrs plotted against different spectral bands (B1 to B7), respectively. The turbid water demonstrates the strongest reflectance in almost all bands. It reaches a peak around B5 and B6, which indicates a high concentration of suspended particles that scatter light. The presence of coccolithophores can also be identified by their high reflectance in the visible band.
We can visually inspect the classification results in Figure 3, where all classifiers accurately classify each class. All classifiers obtained high accuracy results, with their accuracy varying across scenarios (Table 3). In the seven-bands scenario, the GBT classifier achieved the best result, with an overall accuracy of 0.984 (Kappa = 0.975), followed by the RF, which achieved an accuracy of 0.976 (Kappa = 0.962). CART also produced competitive results (0.967, Kappa = 0.950), although its classification threshold was less precise.
The results showed that excluding band 1 (Rrs380) made a significant decrease in accuracy, as tested in RF (Accuracy = 0.939, Kappa = 0.911), highlighting the importance of the Rrs380 for detecting coccolithophores. This finding is consistent with previous research highlighting how coccolithophore blooms, which are rich in calcite plates, exhibit strong backscatter signals at shorter wavelengths.
To validate the proposed method, we tested the ML model obtained in the previous process in another sub-region in the south of Plymouth City in the English Channel area. As shown visually in Figure 4, the classification result demonstrates high accuracy. However, there is an oddity in the detection of coccolithophores in deep-sea regions due to the detected turbidity, as turbidity is typically caused by suspended particles, such as mud, clay, and organic matter, in the water and is more commonly found in shallow waters. This anomaly was verified using chlorophyll data from GCOM-C/SGLI, which showed that the chlorophyll concentration in the area was extremely high. This indicates that the area is a coccolithophore because the phytoplankton were also detected in the surrounding area. To address this issue, the proposed method utilised bathymetry data from ETOPO1 to separate the turbid and coccolithophore classes, thereby achieving more accurate classification results. It shows promising results because the deep-sea area that was initially classified as turbid has been reclassified as the correct class, which is what it should be.

3.2. Diatom Blooms Identification and Classification

The following are the results of diatom blooms identification and classification using various scenarios with three different ML classifiers. Figure 5 presents the average remote sensing reflectance (Rrs) spectra across different air classes—Diatom, Low Diatom, Turbid, and Clear—plotted against b1 to b7, the spectral bands. Across all classes, Rrs values are lowest at b1 (shorter wavelengths), gradually increasing towards the b2 to b4 visible spectrum, before decreasing again at longer wavelengths (b6 to b7). Diatom and Low-Diatom waters show enhanced reflectance in b3 to b6 due to phytoplankton pigment absorption, dominated by chlorophyll-a near b4 to b5. Turbid waters show the highest reflectance across all bands, which reflects the strong backscattering caused by suspended particles that is characteristic of turbid conditions.
Diatom classification using GCOM-C SGLI images and machine learning algorithms showed high performance across all tested scenarios (Table 4, Figure 6). The RF and GTB model with seven spectral bands achieved the highest accuracy at 0.978 (Kappa = 0.969). The RF model trained with six bands produced slightly lower but still robust results (Overall Accuracy = 0.964, Kappa = 0.950). The CART algorithm using seven bands performed comparably, with an overall accuracy of 0.966 (Kappa = 0.953).
These findings indicate that including all seven spectral bands yields high accuracy; on the other hand, removing b1 results in a measurable increase in performance. Among the classifiers, the RF and GTB have outperformed the single tree approach (CART) in handling the spectral variability of diatoms. Classification results show that diatom identification using GCOM-C SGLI is feasible with high reliability, particularly when machine learning methods are employed and complete spectral information is used.
We validated the developed RF model with seven bands of SGLI data from a different acquisition date to assess its reliability. The classification result in Figure 7 shows that it was successful in identifying diatoms, aligning with the original training dataset. While minor discrepancies were found, the validation confirmed the ability of the model to generalize across temporal variations in SGLI imagery.

3.3. Dinoflagellate Blooms Identification and Classification

The following are the results of dinoflagellate blooms identification and classification using various scenarios with three different ML classifiers. Figure 8 shows the average Rrs spectra across seven spectral bands for six distinct water classes, including harmful algal bloom (HAB) cases (Karenia and Karenia-low), mixed phytoplankton assemblages (Mix-low and Mix-high), turbid waters, and clear waters. The Karenia and Karenia-low classes have low reflectance across all bands, particularly in the b1 to b4 visible range. This spectral depression reflects the high chlorophyll-a and accessory pigment concentrations typical of dense Karenia blooms, which dominate light absorption and suppress water-leaving radiance.
The RF, CART, and GTB classifiers with seven bands generated bloom detections that closely matched the SGLI baseline, with minimal misclassification and clear delineation of bloom boundaries. Dinoflagellate blooms were identified and classified using GCOM-C SGLI data combined with RF, CHART, and GTB (Figure 9). Overall accuracies and Kappa coefficients for the tested scenarios are presented in Table 5. The RF and CART classifiers using seven spectral bands produced the highest accuracies, with an overall accuracy of 0.988 and corresponding Kappa coefficients of 0.982. In contrast, reducing the RF input to six bands resulted in a decline in performance, with an overall accuracy of 0.922 and a Kappa coefficient of 0.888. GTB with seven bands also demonstrated strong performance but was slightly less accurate, yielding an overall accuracy of 0.976 and a Kappa coefficient of 0.963.

4. Discussion

Our findings reinforce the growing evidence that machine learning (ML) methods, particularly ensemble approaches, are effective for classifying coccolithophore, diatom, and dinoflagellate blooms from SGLI images. Consistent with previous studies, RF and GTB achieved superior performance relative to a classical machine learning algorithm, such as SVM [49,50]. This finding also aligns with a prior study that RF outperformed other ML approaches such as ANN, MLR, SVR and XGBoost [31,51]. The comparatively weak performance of CART is likely attributable to its limited capacity to generalize from a relatively small training dataset, as it is a single decision tree model.
Model transferability tests highlight the importance of incorporating environmental context alongside spatial and spectral information. When the RF coccolithophore model was applied to a different sub-region in the English Channel, the inclusion of bathymetric information improved classification accuracy, suggesting that coupling SGLI reflectance with geophysical parameters enhances robustness in heterogeneous coastal and open-ocean environments. This finding aligns with prior work emphasizing the spatial and spectral transferability of machine learning models [52,53,54].
Temporal validation further demonstrated that the RF model performed consistently across different acquisition dates, indicating relative stability in diatom spectral signatures despite seasonal or environmental variability. Dinoflagellate results demonstrate that SGLI’s spectral resolution is sufficient for operational HAB monitoring. Both RF and CART achieved strong classification accuracy, reinforcing earlier findings that SGLI can reliably distinguish dinoflagellates from co-occurring water types [36,55,56]. This capability supports applications in early-warning systems and routine ecological assessments.
Overall, these results position our work within a growing body of literature demonstrating the advantages of ensemble ML approaches using SGLI images for phytoplankton classification, reinforcing the prior works that ML can identify phytoplankton using ocean colour data [57,58]. By evaluating three phytoplankton groups and explicitly testing spatial and temporal transferability, our study extends the current literature on operational marine-ecosystem monitoring and provides evidence for the broader applicability of ML-based classification frameworks.

5. Conclusions

This study demonstrated the effectiveness of machine learning models, particularly Random Forest and Gradient Tree Boosting, in classifying phytoplankton, including coccolithophores, diatoms, and dinoflagellates, using GCOM-C/SGLI imagery. These findings could inform an alternative approach to operational monitoring of phytoplankton on regional and global scales.
Despite encouraging results, some limitations warrant consideration. First, classification accuracies may vary under conditions of optically complex waters, where non-algal particles and dissolved organic matter confound spectral signals. Validation with in situ datasets across diverse aquatic environments is needed to confirm model generalizability. Second, while GCOM-C SGLI provides sufficient spectral coverage for diatom detection, hyperspectral sensors such as PACE OCI could further improve discrimination of phytoplankton functional types. Future work should also explore temporal dynamics, including seasonal blooms, to assess the potential of SGLI for long-term ecosystem monitoring. A comparative analysis with existing PFT algorithms is essential to identify the most effective approach, which will be addressed in future work.

Author Contributions

Conceptualization: D.S.C. and E.S.; methodology: D.S.C. and E.S.; software: D.S.C.; validation: D.S.C. and E.S.; formal analysis: D.S.C. and E.S.; investigation: D.S.C. and E.S.; resources: D.S.C. and E.S.; data curation: D.S.C. and E.S.; writing—original draft preparation: D.S.C. and E.S.; writing—review and editing: D.S.C. and E.S.; visualization: D.S.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Japan Aerospace Exploration Agency–4th Research Announcement on the Earth Observations [JAXA–4th EORA, 25RT000201], the Asia-Pacific Network for Global Change Research [APN, CRRP2024-05MY-Siswanto], and Grants-in-Aid for Scientific Research [KAKENHI JP21H05317] from the Ministry of Education, Culture, Sports, Science, and Technology-Japan (MEXT).

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to extend thanks to Research Center for Geoinformatics, Research Organization for Electronics and Informatics, National Research and Innovation Agency of Indonesia (BRIN).

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Ryan-Keogh, T.J.; Thomalla, S.J.; Chang, N.; Moalusi, T. A new global oceanic multi-model net primary productivity data product. Earth Syst. Sci. Data 2023, 15, 4829–4848. [Google Scholar] [CrossRef]
  2. Silsbe, G.M.; Fox, J.; Westberry, T.K.; Halsey, K.H. Global declines in net primary production in the ocean color era. Nat. Commun. 2025, 16, 5821. [Google Scholar] [CrossRef]
  3. Litchman, E.; de Tezanos Pinto, P.; Edwards, K.F.; Klausmeier, C.A.; Kremer, C.T.; Thomas, M.K. Global biogeochemical impacts of phytoplankton: A trait-based perspective. J. Ecol. 2015, 103, 1384–1396. [Google Scholar] [CrossRef]
  4. Moore, T.; Dowell, M.; Franz, B. Detection of coccolithophore blooms in ocean color satellite imagery: A generalized approach for use with multiple sensors. Remote Sens. Environ. 2011, 117, 249–263. [Google Scholar] [CrossRef]
  5. Anderson, S.I.; Barton, A.D.; Clayton, S.; Dutkiewicz, S.; Rynearson, T.A. Marine phytoplankton functional types exhibit diverse responses to thermal change. Nat. Commun. 2021, 12, 6413. [Google Scholar] [CrossRef] [PubMed]
  6. El Hourany, R.; Karlusich, J.P.; Zinger, L.; Loisel, H.; Levy, M.; Bowler, C. Linking satellites to genes with machine learning to estimate phytoplankton community structure from space. Ocean Sci. 2024, 20, 217–239. [Google Scholar] [CrossRef]
  7. Balch, W.M.; Bowler, B.C.; Drapeau, D.T.; Lubelczyk, L.C.; Lyczkowski, E. Vertical Distributions of Coccolithophores, PIC, POC, Biogenic Silica, and Chlorophyll a Throughout the Global Ocean. Global Biogeochem. Cycles 2018, 32, 2–17. [Google Scholar] [CrossRef] [PubMed]
  8. Balch, W.M.; Mitchell, C. Remote sensing algorithms for particulate inorganic carbon (PIC) and the global cycle of PIC. Earth Sci. Rev. 2023, 239, 104363. [Google Scholar] [CrossRef]
  9. Grubb, A.; Johns, C.; Hayden, M.; Subhas, A.; Thamatrakoln, K.; Bidle, K. Calcification increases carbon supply, photosynthesis, and growth in a globally distributed coccolithophore. Limnol. Oceanogr. 2024, 69, 2152–2166. [Google Scholar] [CrossRef]
  10. Neukermans, G.; Fournier, G. Optical Modeling of Spectral Backscattering and Remote Sensing Reflectance from Emiliania huxleyi Blooms. Front. Mar. Sci. 2018, 5, 146. [Google Scholar] [CrossRef]
  11. Cetinić, I.; Rousseaux, C.S.; Carroll, I.T.; Chase, A.P.; Kramer, S.J.; Werdell, P.J.; Siegel, D.A.; Dierssen, H.M.; Catlett, D.; Neeley, A.; et al. Phytoplankton composition from sPACE: Requirements, opportunities, and challenges. Remote Sens. Environ. 2024, 302, 113964. [Google Scholar] [CrossRef]
  12. Williams, J.R.; Giering, S.L.C.; Baker, C.A.; Pabortsava, K.; Briggs, N.; East, H.; Espinola, B.; Blackbird, S.; Le Moigne, F.A.C.; Villa-Alfageme, M.; et al. Inefficient transfer of diatoms through the subpolar Southern Ocean twilight zone. Nat. Geosci. 2025, 18, 72–77. [Google Scholar] [CrossRef] [PubMed]
  13. Abrantes, F.; Cermeno, P.; Lopes, C.; Romero, O.; Matos, L.; Van Iperen, J.; Rufino, M.; Magalhães, V. Diatoms Si uptake capacity drives carbon export in coastal upwelling systems. Biogeosciences 2016, 13, 4099–4109. [Google Scholar] [CrossRef]
  14. Zúñiga, D.; Sanchez-Vidal, A.; Flexas, M.M.; Carroll, D.; Rufino, M.M.; Spreen, G.; Calafat, A.; Abrantes, F. Sinking Diatom Assemblages as a Key Driver for Deep Carbon and Silicon Export in the Scotia Sea (Southern Ocean). Front. Earth Sci. 2021, 9, 579198. [Google Scholar] [CrossRef]
  15. Larsson, M.E.; Bramucci, A.R.; Collins, S.; Hallegraeff, G.; Kahlke, T.; Raina, J.-B.; Seymour, J.R.; Doblin, M.A. Mucospheres produced by a mixotrophic protist impact ocean carbon cycling. Nat. Commun. 2022, 13, 1301. [Google Scholar] [CrossRef]
  16. Möller, K.; Thoms, S.; Tillmann, U.; Krock, B.; Koch, F.; Peeken, I.; Meunier, C.L. Effects of bottom-up factors on growth and toxin content of a harmful algae bloom dinoflagellate. Limnol. Oceanogr. 2024, 69, 1335–1349. [Google Scholar] [CrossRef]
  17. Mulholland, M.R.; Morse, R.; Egerton, T.; Bernhardt, P.W.; Filippino, K.C. Blooms of Dinoflagellate Mixotrophs in a Lower Chesapeake Bay Tributary: Carbon and Nitrogen Uptake over Diurnal, Seasonal, and Interannual Timescales. Estuaries Coasts 2018, 41, 1744–1765. [Google Scholar] [CrossRef]
  18. Odermatt, D.; Gitelson, A.; Brando, V.E.; Schaepman, M. Review of constituent retrieval in optically deep and complex waters from satellite imagery. Remote Sens. Environ. 2012, 118, 116–126. [Google Scholar] [CrossRef]
  19. Blondeau-Patissier, D.; Gower, J.F.R.; Dekker, A.G.; Phinn, S.R.; Brando, V.E. A review of ocean color remote sensing methods and statistical techniques for the detection, mapping and analysis of phytoplankton blooms in coastal and open oceans. Prog. Oceanogr. 2014, 123, 123–144. [Google Scholar] [CrossRef]
  20. Groom, S.; Sathyendranath, S.; Ban, Y.; Bernard, S.; Brewin, R.; Brotas, V.; Brockmann, C.; Chauhan, P.; Choi, J.-K.; Chuprin, A.; et al. Satellite Ocean Colour: Current Status and Future Perspective. Front. Mar. Sci. 2019, 6, 485. [Google Scholar] [CrossRef]
  21. Zhang, Y.; Shen, F.; Zhao, H.; Sun, X.; Zhu, Q.; Li, M. Optical distinguishability of phytoplankton species and its implications for hyperspectral remote sensing discrimination potential. J. Sea Res. 2024, 202, 102540. [Google Scholar] [CrossRef]
  22. Ansper-Toomsalu, A.; Uusõue, M.; Kangro, K.; Hieronymi, M.; Alikas, K. Suitability of different in-water algorithms for eutrophic and absorbing waters applied to Sentinel-2 MSI and Sentinel-3 OLCI data. Front. Remote Sens. 2024, 5, 1423332. [Google Scholar] [CrossRef]
  23. Bi, S.; Hieronymi, M. Holistic optical water type classification for ocean, coastal, and inland waters. Limnol. Oceanogr. 2024, 69, 1547–1561. [Google Scholar] [CrossRef]
  24. Wu, Z.; Pang, J.; Li, J.; Wang, Y.; Ruan, J.; Zhang, X.; Yang, L.; Pang, Y.; Gao, Y. A review of remote sensing-based water quality monitoring in turbid coastal waters. Intell. Mar. Technol. Syst. 2025, 3, 24. [Google Scholar] [CrossRef]
  25. Bachimanchi, H.; Pinder, M.I.M.; Robert, C.; De Wit, P.; Havenhand, J.; Kinnby, A.; Midtvedt, D.; Selander, E.; Volpe, G. Deep-learning-powered data analysis in plankton ecology. Limnol. Oceanogr. Lett. 2024, 9, 324–339. [Google Scholar] [CrossRef]
  26. Li, Z.; Yang, W.; Matsushita, B.; Kondoh, A. Remote estimation of phytoplankton primary production in clear to turbid waters by integrating a semi-analytical model with a machine learning algorithm. Remote Sens. Environ. 2022, 275, 113027. [Google Scholar] [CrossRef]
  27. Zhang, Z.; Chen, P.; Zhang, S.; Huang, H.; Pan, Y.; Pan, D. A Review of Machine Learning Applications in Ocean Color Remote Sensing. Remote Sens. 2025, 17, 1776. [Google Scholar] [CrossRef]
  28. Fasnacht, Z.; Joiner, J.; Haffner, D.; Qin, W.; Vasilkov, A.; Castellanos, P.; Krotkov, N. Using Machine Learning for Timely Estimates of Ocean Color Information from Hyperspectral Satellite Measurements in the Presence of Clouds, Aerosols, and Sunglint. Front. Remote Sens. 2022, 3, 846174. [Google Scholar] [CrossRef]
  29. Li, X.; Liu, B.; Zheng, G.; Ren, Y.; Zhang, S.; Liu, Y.; Gao, L.; Liu, Y.; Zhang, B.; Wang, F. Deep learning-based information mining from ocean remote sensing imagery. Natl. Sci. Rev. 2020, 7, 1584–1605. [Google Scholar] [CrossRef]
  30. Hadjal, M.; Medina-Lopez, E.; Ren, J.; Gallego, A.; McKee, D. An Artificial Neural Network Algorithm to Retrieve Chlorophyll a for Northwest European Shelf Seas from Top of Atmosphere Ocean Colour Reflectance. Remote Sens. 2022, 14, 3353. [Google Scholar] [CrossRef]
  31. Hu, S.; Liu, H.; Zhao, W.; Shi, T.; Hu, Z.; Li, Q.; Wu, G. Comparison of Machine Learning Techniques in Inferring Phytoplankton Size Classes. Remote Sens. 2018, 10, 191. [Google Scholar] [CrossRef]
  32. Nakajima, T.Y.; Ishida, H.; Nagao, T.M.; Hori, M.; Letu, H.; Higuchi, R.; Tamaru, N.; Imoto, N.; Yamazaki, A. Theoretical basis of the algorithms and early phase results of the GCOM-C (Shikisai) SGLI cloud products. Prog. Earth Planet. Sci. 2019, 6, 52. [Google Scholar] [CrossRef]
  33. Zheng, L.; Lee, Z.; Wang, Y.; Yu, X.; Lai, W.; Shang, S. Evaluation of near-blue UV remote sensing reflectance over the global ocean from SNPP VIIRS, PACE OCI, and GCOM-C SGLI. Opt. Express 2025, 33, 40465–40488. [Google Scholar] [CrossRef]
  34. Salem, S.I.; Toratani, M.; Higa, H.; Son, S.; Siswanto, E.; Ishizaka, J. Long-Term Evaluation of GCOM-C/SGLI Reflectance and Water Quality Products: Variability Among JAXA G-Portal and JASMES. Remote Sens. 2025, 17, 221. [Google Scholar] [CrossRef]
  35. Yano, K.; Takayama, Y.; Shimode, S.; Toratani, M.; Murakami, H.; Kuwahara, V.S. Observation of a coccolithophore Gephyrocapsa oceanica bloom in the temperate coastal waters of Sagami Bay, Japan. Plankton Benthos Res. 2024, 19, 37–50. [Google Scholar] [CrossRef]
  36. Siswanto, E.; Luang-on, J.; Ogata, K.; Higa, H.; Toratani, M. Observations of water optical properties during red tide outbreaks off southeast Hokkaido by GCOM-C/SGLI: Implications for the development of red tide algorithms. Remote Sens. Lett. 2024, 15, 121–132. [Google Scholar] [CrossRef]
  37. Kuroda, H.; Taniuchi, Y.; Watanabe, T.; Azumaya, T.; Hasegawa, N. Distribution of Harmful Algae (Karenia spp.) in October 2021 Off Southeast Hokkaido, Japan. Front. Mar. Sci. 2022, 9, 841364. [Google Scholar] [CrossRef]
  38. Murakami, H.; Antoine, D.; Vellucci, V.; Frouin, R. System vicarious calibration of GCOM-C/SGLI visible and near-infrared channels. J. Oceanogr. 2022, 78, 245–261. [Google Scholar] [CrossRef]
  39. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  40. Belgiu, M.; Draguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
  41. Rodriguez-Galiano, V.; Ghimire, B.; Rogan, J.; Chica, M.; Rigol-Sanchez, J. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
  42. Pal, M. Random forest classifier for remote sensing classification. Int. J. Remote Sens. 2005, 26, 217–222. [Google Scholar] [CrossRef]
  43. Fernandez-Delgado, M.; Cernadas, E.; Barro, S.; Amorim, D. Do we Need Hundreds of Classifiers to Solve Real World Classification Problems? J. Mach. Learn. Res. 2014, 15, 3133–3181. [Google Scholar]
  44. Breimanvol, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  45. Loh, W.-Y. Classification and Regression Trees. WIREs Data Min. Knowl. Discov. 2011, 1, 14–23. [Google Scholar] [CrossRef]
  46. Friedman, J. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2000, 29, 1189–1232. [Google Scholar] [CrossRef]
  47. Congalton, R.G. A review of assessing the accuracy of classifications of remotely sensed data. Remote Sens. Environ. 1991, 37, 35–46. [Google Scholar] [CrossRef]
  48. Schober, P.; Boer, C.; Schwarte, L. Correlation Coefficients: Appropriate Use and Interpretation. Anesth. Analg. 2018, 126, 1763–1768. [Google Scholar] [CrossRef] [PubMed]
  49. Song, W.; Dolan, J.M.; Cline, D.; Xiong, G. Learning-Based Algal Bloom Event Recognition for Oceanographic Decision Support System Using Remote Sensing Data. Remote Sens. 2015, 7, 13564–13585. [Google Scholar] [CrossRef]
  50. Joshi, N.; Park, J.; Zhao, K.; Londo, A.; Khanal, S. Monitoring Harmful Algal Blooms and Water Quality Using Sentinel-3 OLCI Satellite Imagery with Machine Learning. Remote Sens. 2024, 16, 2444. [Google Scholar] [CrossRef]
  51. Xu, Y.; Zhang, D.; Lin, J.; Peng, Q.; Lei, X.; Jin, T.; Wang, J.; Yuan, R. Prediction of phytoplankton biomass and identification of key influencing factors using interpretable machine learning models. Ecol. Indic. 2024, 158, 111320. [Google Scholar] [CrossRef]
  52. Orynbaikyzy, A.; Gessner, U.; Conrad, C. Spatial Transferability of Random Forest Models for Crop Type Classification Using Sentinel-1 and Sentinel-2. Remote Sens. 2022, 14, 1493. [Google Scholar] [CrossRef]
  53. Rubin, H.J.; Lutz, D.A.; Steele, B.G.; Cottingham, K.L.; Weathers, K.C.; Ducey, M.J.; Palace, M.; Johnson, K.M.; Chipman, J.W. Remote Sensing of Lake Water Clarity: Performance and Transferability of Both Historical Algorithms and Machine Learning. Remote Sens. 2021, 13, 1434. [Google Scholar] [CrossRef]
  54. Zhao, Z.; Wang, W.; Chen, J.; Yao, J.; Liao, Y.; Liu, J. Heterogeneous transfer learning considering feature representation and environmental consistency for landslide spatial prediction. GISci. Remote Sens. 2024, 61, 2349343. [Google Scholar] [CrossRef]
  55. Feng, C.; Zhu, Y.; Shen, A.; Li, C.; Song, Q.; Tao, B.; Zeng, J. Assessment of GCOM-C Satellite Imagery in Bloom Detection: A Case Study in the East China Sea. Remote Sens. 2023, 15, 691. [Google Scholar] [CrossRef]
  56. Luang-On, J.; Siswanto, E.; Ogata, K.; Toratani, M.; Buranapratheprat, A.; Leenawarat, D.; Ishizaka, J. Enhancing the reliability of GCOM-C/SGLI data for red tide detection in the upper Gulf of Thailand. Remote Sens. Lett. 2024, 15, 1096–1106. [Google Scholar] [CrossRef]
  57. Fan, Y.; Li, W.; Chen, N.; Ahn, J.-H.; Park, Y.-J.; Kratzer, S.; Schroeder, T.; Ishizaka, J.; Chang, R.; Stamnes, K. OC-SMART: A machine learning based data analysis platform for satellite ocean color sensors. Remote Sens. Environ. 2021, 253, 112236. [Google Scholar] [CrossRef]
  58. Adhikary, S.; Tiwari, S.P.; Banerjee, S.; Dwivedi, A.; Rahman, S.M. Global marine phytoplankton dynamics analysis with machine learning and reanalyzed remote sensing. PeerJ 2024, 12, e17361. [Google Scholar] [CrossRef]
Figure 1. Flowchart of Chlorophyll Classification using GCOM-C/SGLI imagery and Machine Learning.
Figure 1. Flowchart of Chlorophyll Classification using GCOM-C/SGLI imagery and Machine Learning.
Remotesensing 17 03759 g001
Figure 2. (a) The Rrs values of coccolitophore datasets. (b) Average Rrs value for each class.
Figure 2. (a) The Rrs values of coccolitophore datasets. (b) Average Rrs value for each class.
Remotesensing 17 03759 g002aRemotesensing 17 03759 g002b
Figure 3. Coccolithophore classification results for some scenarios in Sagami Bay, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Figure 3. Coccolithophore classification results for some scenarios in Sagami Bay, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Remotesensing 17 03759 g003aRemotesensing 17 03759 g003b
Figure 4. Coccolithophore classification results for some scenarios in another sub-region in the south of Plymouth City in the English Channel area. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs without bathymetry data, (c) Classification results using RF with seven bands as the inputs with bathymetry data.
Figure 4. Coccolithophore classification results for some scenarios in another sub-region in the south of Plymouth City in the English Channel area. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs without bathymetry data, (c) Classification results using RF with seven bands as the inputs with bathymetry data.
Remotesensing 17 03759 g004
Figure 5. (a) The Rrs values of diatom datasets. (b) Average Rrs value for each class.
Figure 5. (a) The Rrs values of diatom datasets. (b) Average Rrs value for each class.
Remotesensing 17 03759 g005aRemotesensing 17 03759 g005b
Figure 6. Diatom classification results with some scenarios in the Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Figure 6. Diatom classification results with some scenarios in the Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Remotesensing 17 03759 g006aRemotesensing 17 03759 g006b
Figure 7. Diatom classification results from a different acquisition date in Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs.
Figure 7. Diatom classification results from a different acquisition date in Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs.
Remotesensing 17 03759 g007
Figure 8. (a) The Rrs values of dinoflagellate (Karenia) datasets. (b) Average Rrs value for each class.
Figure 8. (a) The Rrs values of dinoflagellate (Karenia) datasets. (b) Average Rrs value for each class.
Remotesensing 17 03759 g008aRemotesensing 17 03759 g008b
Figure 9. Dinoflagellate classification results with some scenarios in the Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Figure 9. Dinoflagellate classification results with some scenarios in the Waters of Southeast Hokkaido, Japan. (a) RGB B643 of SGLI imagery, (b) Classification results using RF with seven bands as the inputs, (c) Classification results using RF with six bands as the inputs, (d) Classification results using CART with seven bands as the inputs, and (e) Classification results using GTB with seven bands as the inputs.
Remotesensing 17 03759 g009aRemotesensing 17 03759 g009b
Table 1. Descriptions of the expected or confirmed optical water and phytoplankton types in different sub-regions.
Table 1. Descriptions of the expected or confirmed optical water and phytoplankton types in different sub-regions.
Sub-RegionObservation DateConsideration/Confirmation
Sagami Bay17 May 2020Confirmed phytoplankton coccolithophore bloom [35]
Waters of Southeast Hokkaido13 October 2021Confirmed red tide caused by phytoplankton from the dinoflagellate group [36,37]
8 May 2022Expected to be seasonal blooms of phytoplankton diatoms
7 May 2022Expected to be seasonal blooms of phytoplankton diatoms
Table 2. Interpretation of Correlation Coefficient [48].
Table 2. Interpretation of Correlation Coefficient [48].
Absolute Magnitude of
Correlation Coefficient
Interpretation
0.00–0.10Negligible correlation
0.10–0.39Weak correlation
0.40–0.69Moderate correlation
0.70–0.89Strong correlation
0.90–1.00Very strong correlation
Table 3. Accuracies of the coccolithophore classification results.
Table 3. Accuracies of the coccolithophore classification results.
ScenarioOverall
Accuracy
Kappa
Coefficient
Pearson Correlation CoefficientDetermination Coefficient (R2)
RF with seven bands0.9760.9620.9770.954
RF with six bands0.9390.9110.9310.861
CART with seven bands0.9670.9500.9560.908
GTB with seven bands0.9840.9750.9870.974
Table 4. Accuracies of the diatom classification results.
Table 4. Accuracies of the diatom classification results.
ScenarioOverall
Accuracy
Kappa
Coefficient
Pearson Correlation CoefficientDetermination Coefficient (R2)
RF with seven bands0.9780.9690.9700.940
RF with six bands0.9880.9830.9960.991
CART with seven bands0.9660.9530.9660.933
GTB with seven bands0.9780.9690.9700.940
Table 5. Accuracies of the dinoflagellate classification results.
Table 5. Accuracies of the dinoflagellate classification results.
ScenarioOverall
Accuracy
Kappa
Coefficient
Pearson Correlation CoefficientDetermination Coefficient (R2)
RF with seven bands0.9880.9820.9970.994
RF with six bands0.9220.8880.9280.846
CART with seven bands0.9880.9820.9970.994
GTB with seven bands0.9760.9630.9940.987
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Candra, D.S.; Siswanto, E. Machine Learning Approaches to Phytoplankton Identification and Classification Using GCOM-C/SGLI Imagery. Remote Sens. 2025, 17, 3759. https://doi.org/10.3390/rs17223759

AMA Style

Candra DS, Siswanto E. Machine Learning Approaches to Phytoplankton Identification and Classification Using GCOM-C/SGLI Imagery. Remote Sensing. 2025; 17(22):3759. https://doi.org/10.3390/rs17223759

Chicago/Turabian Style

Candra, Danang Surya, and Eko Siswanto. 2025. "Machine Learning Approaches to Phytoplankton Identification and Classification Using GCOM-C/SGLI Imagery" Remote Sensing 17, no. 22: 3759. https://doi.org/10.3390/rs17223759

APA Style

Candra, D. S., & Siswanto, E. (2025). Machine Learning Approaches to Phytoplankton Identification and Classification Using GCOM-C/SGLI Imagery. Remote Sensing, 17(22), 3759. https://doi.org/10.3390/rs17223759

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop