Next Article in Journal
A Group-Wise Feature Enhancement-and-Fusion Network with Dual-Polarization Feature Enrichment for SAR Ship Detection
Next Article in Special Issue
High-Resolution Drone Images Show That the Distribution of Mussels Depends on Microhabitat Features of Intertidal Rocky Shores
Previous Article in Journal
An Overview of Vegetation Dynamics Revealed by Remote Sensing and Its Feedback to Regional and Global Climate
Previous Article in Special Issue
Species Distribution Models at Regional Scale: Cymodocea nodosa Seagrasses
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Ensemble Three-Dimensional Habitat Modeling of Indian Ocean Immature Albacore Tuna (Thunnus alalunga) Using Remote Sensing Data

1
Department of Environmental Biology Fisheries Science, National Taiwan Ocean University, No. 2, Beining Rd., Zhongzheng Dist., Keelung City 20224, Taiwan
2
Center of Excellence for Ocean Engineering, National Taiwan Ocean University, Keelung City 20224, Taiwan
3
Doctoral Degree Program in Ocean Resource and Environmental Changes, National Taiwan Ocean University, No. 2, Beining Rd., Zhongzheng Dist., Keelung City 20224, Taiwan
4
Center of Excellence for Oceans, National Taiwan Ocean University, No. 2, Beining Rd., Zhongzheng Dist., Keelung City 20224, Taiwan
5
Coastal and Offshore Resources Research Center of Fisheries, Research Institute, Kaoshiung City 80672, Taiwan
6
Department of Geography, School of Sciences, Netaji Subhas Open University, Kolkata 700064, West Bengal, India
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(20), 5278; https://doi.org/10.3390/rs14205278
Submission received: 15 August 2022 / Revised: 28 September 2022 / Accepted: 19 October 2022 / Published: 21 October 2022
(This article belongs to the Special Issue Remote Sensing Applied to Marine Species Distribution)

Abstract

:
This study evaluated the vertical distribution of immature albacore tuna (Thunnus alalunga) in the Indian Ocean as a function of various environmental parameters. Albacore tuna fishing data were gathered from the logbooks of large-sized Taiwanese longline vessels. Fishery and environmental data for the period from 1998 to 2016 were collected. In addition to the surface variable, the most influential vertical temperature, dissolved oxygen (OXY), chlorophyll, and salinity layers were found at various depths (i.e., 5, 26, and 53 m for SST; 200, 244, and 147 m for OXY; 508, 628, and 411 for SSCI; and 411, 508, and 773 m for SSS) among 20 vertical layers based on Akaike criterion information value of generalized linear model. Relative to the 20 vertical layers base models, these layers had the lowest Akaike information criteria. For the correlation between the standardized and predicted catch per unit effort (CPUE), the correlation values for the generalized linear model (GLM), generalized additive model (GAM), boosted regression tree (BRT), and random forest (RF) model were 0.798, 0.832, 0.841, and 0.856, respectively. The GAM-, BRT-, and RF-derived full models were selected, whereas the GLM-derived full model was excluded because its correlation value was the lowest among the four models. From March to September, a higher immature albacore standardized CPUE was mainly observed from 30°S to 40°S. A northward shift was observed after September, and the standardized CPUE was mainly concentrated at the south coast of Madagascar from November to January.

1. Introduction

Albacore tuna is a highly migratory and carnivorous species belonging to the Scombridae family [1]. This species is found in the temperate waters of all the three major oceans (Indian, Pacific, and Atlantic) and is cosmopolitan in nature [2,3,4]. Albacore tuna is a key commercial species, accounting for up to 6% of total global tuna catches by weight [5]. Taiwanese longline vessels have been fishing for Indian Ocean albacores since 1950. Taiwanese drifting longline vessels account for the majority of albacore catches in the Indian Ocean (90% of total catches). The output of albacore tuna has increased in response to its growing demand. The albacore harvest achieved by Taiwanese longline vessels increased from 10,000 tons in 1950 to 102,594 tons in 2018 [6,7]. However, high exploitation levels ca lead to overfishing, which can cause the collapse of albacore stocks (recovery may require several decades) and threaten livelihoods, communities, and food security. Therefore, the spatial distribution of this species in the Indian Ocean must be comprehensively evaluated to identify potential higher or lower habitat areas on the basis of marine habitat data. Sustainable fishing- and ecosystem-based fishery management can be implemented by regulating fishing efforts in higher or lower habitat regions.
Species distribution modeling (SDM; also known as habitat modeling, ecological niche modeling, bioclimatic envelope modeling, or resource selection function modeling [8,9,10]) is the most frequently used technique for evaluating the habitat pattern of a species. The basic premise of habitat modeling is to estimate species distribution by using computer algorithms based on mathematical representations of the known distribution of a species in environmental regions [11]. Reality, generality, and precision are the three main objectives of habitat modeling [12]. Numerous statistical relationships between existing species distributions and environmental variables are used in the prediction analysis.
To examine the potential effect of environmental variability on the distribution of a species, high-resolution spatiotemporal oceanographic data are required. This type of information can be found in data assimilation model products provided by several satellites, including the Moderate Resolution Imaging Spectroradiometer (MODIS), Copernicus (COP), and the Advanced Very High-Resolution Radiometer (AVHRR) etc., which provide information on vertical oceanographic structures and how they affect species distribution. The basic difference between different satellites is their spatial resolution. Multisatellite detection has been performed to collect data on various oceanographic parameters since 1978. These data are particularly beneficial in the fields of oceanography and fishery management [13,14,15]. Because of the availability of large-scale data, a further analysis can enable obtaining useful information on fisheries misuse and management [16,17,18,19]. Multisatellite detection has increased our understanding of factors that affect the living habitats of fish species and other related species [20,21,22,23,24]. In addition, data obtained through precise detection can help academicians to develop cost-effective fishery management models, modelers to generate statistics, and fishermen to become more fuel-efficient when attempting to locate fishing sites [25].
Model comparison studies have revealed that various modeling methods differ in terms of their statistical and predictive performance [26,27,28,29]. This might be due to basic differences in model complexity. Researchers have examined multimodel frameworks for obtaining robust forecasts by combining the capabilities of multiple model algorithms that apply an ensemble model forecast technique [30,31]. To produce forecasts ensembles, several simulations spanning multiple sets of initial conditions, model classes, parameters, and boundary conditions are used [32]. A combination of forecasts produces a lower mean error than any of its constituent single projections, when thees projection contains information on single parameters [33]. Several recent studies have applied the ensemble model technique for maritime habitat mapping and reported that this technique allows the merging of habitat projections from multiple model algorithms with decreased bias and high predictive accuracy [30,31,34].
By using the ensemble model technique and three-dimensional oceanographic data, we evaluated the probable habitats of immature albacore tuna in the Indian Ocean. In our previous study, only a single-algorithm model and surface oceanographic factors were used to analyze immature albacore (average weight of <14 kg) habitat in the study region [35]. Although other habitat model studies have produced limited immature albacore habitat predictions, we could not evaluate preferable vertical immature albacore habitat characteristics because of the availability of numerous modeling platforms. Thus, the present study examined the geographical and temporal patterns of immature albacore habitat by using weighted mean ensemble projections (obtained from species distribution models) and three-dimensional ocean data to fill potential information gaps. Thus, the present study explored robust three-dimensional immature albacore habitat characteristics derived from an ensemble of model forecasts and investigated the mechanistic linkage between the spatial and temporal variability of potential immature albacore habitats under changing environmental conditions. Figure 1 illustrates the experimental flow of the present study. For all the acronyms, refer to Abbreviations Section.

2. Materials and Methods

2.1. Data Collection

2.1.1. Albacore Tuna Fishery Data

Weight at first maturity of Indian Ocean albacore is 14 kg [5]. Following this, an average weight of 14 kg was taken as the threshold to separate mature and immature albacore in the present study. The present study divided CPUE data into two categories, which represent different life history stages of albacore tuna: immature (average weight <14 kg), mature (average weight >14 kg). Only immature albacore data were used in the present study (as present study is the continuation of Mondal et al., 2021).
Immature albacore tuna fishery data for the period from 1998 to 2016 were collected from the fishing logbooks of large-sized longline vessels (deep-water fishing vessels with a registered tonnage of >100 tons and a length of >24 m) of the Overseas Fisheries Development Council of Taiwan. Data from small-sized fishing vessels (mostly coastal water fishing vessels with a gross registered tonnage of <100 tons and a length of <24 m) were not used in the present study because of the lack of data pertaining to the study period. The data have a spatial coverage from 0°S to 45°S and from 20°E to 120°E with a spatial resolution of 1° × 1°. The logbooks recorded the year, month, latitude, longitude, number of catches, number of hooks used, hooks per basket (data were not available for specific years), and weight (whether the weight was dry or wet weight was not specified). Data related to soaking time, hook depth, and operation time were not found in the data set.

2.1.2. Oceanographic Data

For our analysis, data on 10 oceanographic parameters were acquired from Copernicus (The GLORYS12V1 product is the CMEMS global ocean eddy-resolving with 1/12° horizontal resolution from ERA5 reanalysis covering the altimetry (Table 1). The model component is the NEMO. Along with track altimeter data, satellite data of vertical profiles of different environmental parameters are jointly assimilated with a processing level of L4. The data covered a range of spatial resolutions from 0°S to 45°S (albacore fishing is only conducted in this region during specific months) and from 20°E to 120°E. To match the large-scale albacore tuna fisheries statistics from 1998 to 2016, data from all months (i.e., January to December) for the period from 1998 to 2016 were collected. Environmental data did not have a spatial coverage of 1° × 1°; these data were interpolated to a 1° × 1° spatial grid (also the daily data were converted to monthly data) using MATLAB version 2019a (Kriging method—Kriging is a geostatistics method that predicts the value in a geographic area given a set of measurements) because the spatial resolution of the fisheries data was 1° × 1°. As of February 2, 2022, all data were accessible. Data with daily temporal resolution were converted into a monthly base using MATLAB version 2019a because the temporal resolution of fisheries data was monthly. An earlier study [4] mentioned that a phytoplankton patch must mature into a foraging ground after a minimum of 5–7 days, and it was suggested that 1-month lag chlorophyll data be used as a potential predictor. Additionally, it could take some time to find the predating zone by searching, therefore a larger concentration of SSC might not always indicate a larger albacore biomass at a given moment. In order to determine whether there was any meaningful justification for employing SSC lag data, the authors attempted to use the lag data of SSC as well.

2.2. Standardization of Nominal Catch Per Unit Effort

The relative abundance of immature albacore was indexed as catch per unit effort (CPUE, nominal). Nominal CPUE (N.CPUE; per 1000 hooks) was calculated using the following formula:
N.CPUE = (No. of albacore catch)/(No. of hooks used)
In fisheries, CPUE is used as a reliable proxy for relative abundance. However, using raw CPUE as an index of abundance can be problematic sometime. This is because CPUE can be “hyper stable” or less sensitive to the rapid changes of abundance. Changes in the fishing location, strategy, season, and fishing pattern can cause changes to CPUE that are independent of relative abundance. To reduce the dominance of several spatial (latitude and longitude) and temporal (year and month) factors, N.CPUE was standardized using the common method of generalized linear modeling (GLM) to obtain a bias-filtered data set (standardized CPUE, S.CPUE). The mgcv package [11,19,22] was used to build a stepwise GLM model (Gaussian distribution) with five factors (Year, month, latitude, longitude, and interactions) in R-studio version 3.6.0. Three interactions i.e., Year*Lat, Year*Lon, and Lat*Lon were all included together. A total of five models were examined, and the optimal model for standardization was selected on the basis of the lowest Akaike information criterion (AIC) [4], the most deviance explained (percentage), and highest correlation (R2) values. The Akaike information criterion (AIC) is an estimator of prediction error, and thereby relative quality of statistical models, for a given set of data. Given a collection of models for the data, AIC estimates the quality of each model, relative to each of the other models. The GLM models were built using the following formula:
GLMn: Log (N.CPUE + c) ~ a1 + a2 + a3 + … + an + µ + €
where c is the constant value of 0.1, n is the number of variables, GLMn is the model with n factors, µ is the interaction (Year*Lat, Year*Lon, and Lat*Lon), and € is a variable with normal distribution and a mean value of zero.

2.3. Selection of Oceanographic Parameters and Vertical LAYER

The generalized additive model (GAM) is often used to select suitable environmental parameters prior to the development of a habitat model [35,36]. Because of complex relationships between an angle community and the environment, determining whether the relationship is linear or nonlinear is a difficult task. Therefore, models such as GAM (Gaussian distribution with smoothing spline regression), which allow for nonlinear reactions, are more suitable than other models for examining relationships between angle communities and conditions [37,38,39,40,41]). The use of boosted regression trees (BRTs) (no. of regression trees used was 531 with interaction depth of 7 and bag fraction of 0.6) and random forests (RFs) (no. of regression trees used was 325 with interaction depth of 4 and bag fraction of 0.8) can aid parameter selection. Nonparametric random forest models can model highly nonlinear relationships, resulting in improved classification performance. A BRT is a type of a nonlinear model that divides data into regions on the basis of if–then questions. The three aforementioned methods were employed in the present study to select appropriate parameters, and each parameter was rated using each method on the basis of their correlation value. Only parameters with a Pearson correlation value of >0.3 for at least two of the three methods (i.e., GAM, BRT, and RF) were selected for our subsequent analysis. Under the shaky linear rule, a correlation value of <0.3 indicates a weak positive (negative) linear relationship [42]. R studio version 3.6.0 was used to test selected parameters for collinearity. Collinearity between parameter pairs was indicated by a Pearson correlation value of >0.7 [23] and variance inflation factor value of >5.
The most influential oceanographic layers of the selected parameters for immature albacore S.CPUE were identified using a GAM and the mgcv package version 1.8–2 [43]; this method is widely used in the literature [31]. Twenty vertical layers (5, 26, 53, 77, 97, 147, 200, 244, 300, 411, 508, 628, 773, 856, 947, and 1045 m) were initially used, and the vertical profiles for the selected oceanographic variables were subsequently selected on the basis of the three most influential vertical layers [31] that had the lowest AICs [44].

2.4. Construction and Evaluation of the Single-Algorithm Habitat Model

Following the finalization of the oceanographic variables, four single-algorithm models were built (full models that had all selected parameters and layers and were validated using BIOMOD2) [45,46]. The four models were the GLM (Gaussian distribution), GAM (Gaussian distribution with smoothing spline regression), BRT (no. of regression trees used were 531 with interaction depth of 7 and bag fraction of 0.6), and RF (no. of regression trees used were 325 with interaction depth of 4 and bag fraction of 0.8). The package’s species distribution models also use a set of statistical techniques, which include conventional regression-based models and current machine-learning platforms. Thus, the proposed ranges of the single-algorithm models were all used to investigate and exploit their specific prediction capabilities.
All single-algorithm models were initially assessed using four validation techniques (i.e., random splitting [RS], leave-one-out cross-validation [LOOCV], 10-fold cross-validation [10-fold], and repeated (3) 10-fold cross-validation [repeated 10-fold]. During the validation process, the fishing data set with sample no. of 67,548 was divided into two random parts at a 70:30 (training:testing) ratio. Three coefficients (i.e., Pearson correlation coefficient [R], root-mean-square error [RMSE], and mean absolute error [MAE]) were calculated for both the training and testing data by applying all the validation techniques in each single-algorithm model. We used the training data set to train the model, while the testing data were used to compare the model performances. 70% of the data were used to train a model. This same model was used to test the rest of the 30% data in order to check the selected model’s biasedness for prediction. This biasedness was checked using three coefficients (R, RMSE and MAE). Lesser difference between the coefficient values of training and testing data sets indicated least biasedness. Thus, we applied these methodologies in the present study. Validations were performed using the tidyverse and caret R-packages (version 3.6.0). Smaller differences in R, RMSE, and MAE values between the two data sets (70:30) indicate a more favorable model performance with less bias. After the successful validation, Akaike information criterion [AIC], root-mean-square error [RMSE], and mean absolute error [MAE] were calculated for all single-algorithm final models using 100% data to evaluate the predictive performances of each single-algorithm model. Least AIC, RMSE, and MAE value indicate the best performing model. The single-algorithm model with higher AIC, RMSE, and MAE values was eliminated from the further analysis.

2.5. Ensemble Model Development, Evaluation, and Prediction

Following the performance evaluation of the single-algorithm models, we used R, RMSE, and MAE to establish a weighted mean ensemble model for immature albacore habitats by applying the full selected single-algorithm models. The ensemble model was validated using four validation techniques (i.e., RS, LOOCV, 10-fold, and repeated (3) 10-fold). For the validation process, the fishing data set was randomly divided into 2 parts at a 70:30 (training: testing) ratio. Three coefficients (i.e., Pearson correlation coefficient [R], root-mean-square error [RMSE], and mean absolute error [MAE]) were calculated for both the training and testing data by applying all the validation techniques in the ensemble model. Validation was performed using the tidyverse and caret R-packages (version 3.6.0). Lesser differences in R, RMSE, and MAE values between the two data sets (70:30) indicate a more favorable model performance with less bias.
The ensemble model was used to perform the final prediction if the validation method indicated a more favorable model performance with no significant biasedness. The predicted values for each point of the study area from the final model were mapped to a 1° × 1° spatial grid by using the ArcGIS software (version 10.2). The ensemble model was used to predict CPUE (P.CPUE), which was then used as a proxy for relative abundance.

3. Results

3.1. Standardization of Nominal CPUE Data

The full GLM model (with all the factors) had the lowest AIC value (142,977) and the highest R2 value (0.78) after stepwise GLM model generation was performed (Table 2). The standardization model’s quantile–quantile plot and histogram (Figure 2) indicate an almost normal distribution. Therefore, a selected model was employed to standardize the N.CPUE of immature albacore. N.CPUE (monthly summed CPUE) ranged from 0.1 to 4000 (Figure 3). The monthly total CPUE decreased to a range of 0.1 to 2200 after standardization. For the subsequent study of juvenile albacore tuna, S.CPUE was applied.

3.2. Oceanographic Parameters and Vertical Layer Selection

Table 3 presents the significance of various environmental characteristics when several selection procedures were applied. The environmental characteristics of dissolved oxygen (OXY), temperature (SST), salinity (SSS), mixed layer depth (MLD), and chlorophyll level with a 1-month lag (SSC1) all had correlation values of >0.3 with S.CPUE when the GAM technique was used. OXY (0.855) had the highest correlation with S.CPUE. The generalized cross-validation (GCV) index for OXY was the lowest among the metrics. For correlation and GCV index values, SSC1 outperformed the chlorophyll level with no lag (SSC0) and chlorophyll level with a 2-month lag (SSC2). OXY, SST, SSS, MLD, SSC1, and eddy kinetic energy (EKE) all had correlation values of >0.3 with S.CPUE when the BRT method was used. OXY (0.857) had the highest correlation with S.CPUE. OXY had the lowest RMSE and MAE index values among all the parameters. In the correlation of RMSE and MAE index values, SSC1 outperformed SSC0 and SSC2. OXY, SST, SSS, MLD, SSC0, SSC1, and SSC2 had correlation values of >0.3 with S.CPUE when the RF technique was used. OXY (0.836) had the highest correlation with S.CPUE. OXY had the lowest RMSE and MAE index values among all the parameters. Although all SSC delays had correlation values of >0.3, SSC1 outperformed SSC0 and SSC2 in terms of correlation, RMSE, and MAE index values. Consequently, only SSC1 was selected. The aforementioned data indicate that for all the selection methods, OXY, SST, SSS, MLD, and SSC1 had correlation values of >0.3. EKE only had a correlation value of >0.3 when the BRT method was used. Therefore, the final model was constructed using only OXY, SST, SSS, MLD, and SSC1. A multicollinearity test revealed that the selected parameters did not exhibit any collinearity (Table 4).
A GAM analysis of the effects of vertical SST, OXY, SSC1, and SSS on immature albacore S.CPUE was performed to identify the subsurface habitat characteristics of immature albacore. In addition to the surface variable (MLD), the most influential vertical SST, OXY, SSC1 and SSS layers were found at various depths (i.e., 5, 26, and 53 m for SST; 200, 244, and 147 m for OXY; 508, 628, and 411 for SSC1; and 411, 508, and 773 m for SSS). Relative to the upper-surface base models, these layers had lower AICs. Table 5 evaluated the different selected environmental layers, with respect to S.CPUE, based on Pearson correlation analysis and VIP score ranking. Results showed that temperature at 5 m depth was more important than other selected layers, followed by temperature at 26 m depth. Temperature at 53 m depth did not show any significant importance, with a VIP rank of 7. The third most important was oxygen at 147 m depth, followed by 200 m and 244 m. For both the SSS and SSC1, 508 m depth showed more importance than 411 m and 628 m depth. MLD showed the lowest correlation with S.CPUE, with a VIP ranking of 13.

3.3. Relation between Selected Environmental Layers & S.CPUE

Table 6 showed the optimal ranges of different environmental parameters in different vertical layers with respect to the highest S.CPUE. The optimal range of OXY for both_ 200, and 147 m depth was 240–260 mmol/L with respect to the highest S.CPUE, whereas for the 244 m depth this range was a little lesser (235–255). The optimal range of TEM at 5, 26, and 53 m depth was 13–15 °C, 12–14 °C, and 14–16 °C, respectively, for the highest S.CPUE. The optimal range of SSS for both_508, and 628 m depth was 34.3–34.4 psu, whereas this range was a little higher at 411 m depth (34.4–34.5 psu). For MLD, the optimal range was between 240–260 m for the highest S.CPUE.

3.4. Predictive Performance of Single-Algorithm Habitat Models

All the single-algorithm full models exhibited favorable predictive performance. The S.CPUE–P.CPUE Pearson correlation values obtained using the GLM, GAM, BRT, and RF methods were 0.798, 0.832, 0.841, and 0.856, respectively, from Pearson correlation analysis. Table 7 presents the prediction performance (as measured by RMSE, MAE, and R2) achieved using the full single-algorithm models for the selected parameters when each of the three validation techniques were used. The RMSE, MAE, and R2 values obtained from the randomly split data sets exhibited no significant bias in predictive performance. All the single-algorithm models were revealed to be suitable for making predictions.

3.5. Ensemble Model Development and Prediction

Table 8 showed the predictive performances of all the single-algorithm models based on AIC, RMSE, and MAE using 100% of data. All the four single-algorithm full models exhibited proximal and favorable performance. The GAM-, BRT-, and RF-derived full models were selected and the GLM-derived full model was excluded because P.CPUE obtained through this model exhibited the highest AIC, RMSE, and MAE value of all 4 single-algorithm models with S.CPUE. Hence, final ensemble model was constructed using only GAM, BRT, and RF single-algorithm models. Model validation showed no significant differences in R, RMSE, and MAE values between training and test data for ensemble modeling all the validation techniques applied. Figure 4 reveals the findings as follows. From March to September, a high immature albacore S.CPUE was mainly observed between 30°S and 40°S; after September, a northward shift occurred; and from November to January, S.CPUE was mainly concentrated at the south coast of Madagascar. After January, an eastward shift occurred. The P.CPUE obtained from the ensemble model exhibited a high correlation with S.CPUE (0.867). A random splitting evaluation did not reveal any significant bias.

4. Discussion

The present study proposed a new method for inferring the potential habitats of immature albacore tuna, a commercially and ecologically crucial resource, in the Indian Ocean, on the basis of vertical oceanographic preferences. We used a combination of state-of-the-art species habitat algorithms and three-dimensional oceanographic data obtained from a high-resolution numerical model to develop high-quality habitat projections for immature albacore tuna. We investigated vertical immature albacore habitat characteristics, the potential link between environmental circumstances and potential albacore habitats by applying the aforementioned method. Consequently, we examined the ensemble’s habitat models quantitatively for operational albacore resource management applications.

4.1. Evaluation of Ensemble Model

Although the single-algorithm models exhibited consistent oceanographic parameter rankings, they differed in their intermodel statistical performance. These findings reflect an inherent intermodel uncertainty, indicating that the predictive power of statistical methods varies significantly [9]. Furthermore, compared with the top-performing single-algorithm predictions, combining immature albacore habitat forecasts by using the weighted means of three of the four top-performing models resulted in improved performance with respect to the accuracy of immature albacore habitat predictions (Figure 4). This finding corresponds to those of other studies, which reported that using the ensemble model method instead of a single-algorithm platform to forecast marine habitat affinities produces more favorable results [34,47]. Therefore, the ensemble model technique can be a useful tool for operational fisheries (e.g., mapping probable fishing areas) and resource management (e.g., identifying predictable foraging habitats that may include key zones for fish). Three possible reasons can be given in order to describe the good prediction failure by ensemble model in the panel a-j-k-l. Firstly, noise, bias and variance: the combination of decisions from multiple models can help to improve the overall performance. Hence, one of the key reasons to use ensemble models is overcoming noise, bias and variance. However, ensemble models in machine learning might have any noise, bias and variance from different models. This can be one possible reason for few prediction failures in the present study. Authors planned to use more advanced models for ensemble prediction in order to rectify areas of lack in the present study. The second reason was the existence of selected ideal environmental ranges. Furthermore, the area with a high S.CPUE of immature albacore tuna from April to September is where the fish feed. These two details can be used to map out where young albacore tuna are found in the Indian Ocean. Since October through January are not prime feeding months, and because it is summer, young albacore may vertically dive to reach the best SST. These reasons might be the causes of the decline in high CPUE values observed in the southern region between October and January.

4.2. Immature Albacore Habitat

In water temperatures between 11.5 °C and 18 °C, albacore tuna use physiological thermoregulation to maintain a stable body temperature of approximately 20 °C [48]. The body temperature of albacore tuna has been reported to decrease to <11.5 °C, which prompts individual tuna fish to change their behavior by migrating vertically into warmer waters to restore their body temperature. During their active growth phase, albacore tuna require a considerable amount of energy, which may explain their presence in areas where prey congregate at the water surface. Few studies have explored the high-energy intake of immature albacore tuna [49,50]. The high-energy requirements of such fish are expected to affect their distribution; specifically, individual fish tend to congregate in high-productivity locations where food is abundant. Because of the abundance of prey species in surface waters and the minimal need to forage in deeper waters, the vertical distribution of albacore tuna may be restricted to surface waters. However, with their thermoregulation capabilities [48] and swim bladder development [51], albacore tuna can dive into deep waters [52]. Our findings suggest that immature albacore tuna engage in vertical diving behavior only when necessary (e.g., for forging, hiding, thermoregulation, and osmoregulation). Moreover, our results indicate that the vertical distribution of immature albacore is affected by thermal preferences (as indicated by their diet); the vertical distribution of prey species, which is influenced by the ocean’s thermal structure; or a combination of both factors.
Results indicated that, of the examined layers, the temperature at 5 m depth was the most significant, followed by that at 26 m depth. With a VIP score of 7, temperature at 53 m of depth did not demonstrate any major importance. At 147 m, oxygen ranked third in importance, ahead of 200 and 244 m. This occurred because 508 m depth was more significant than 411 m and 628 m depth for both the SSS and SSC1, respectively. With a VIP ranking of 13, MLD displayed the lowest correlation with S.CPUE. In terms of the maximum S.CPUE, the best OXY range for both the 200 and 147 m depths was 240–260 mmol/L, whereas it was slightly lower for the 244 m depth (235–255). For the highest S.CPUE, the best TEM temperature ranges at 5, 26, and 53 m deep were 13–15 °C, 12–14 °C, and 14–16 °C, respectively. The ideal SSS range was 34.3–34.4 psu at both 508 and 628 m depth, but this range was slightly greater at 411 m depth (34.4–34.5 psu). For MLD, a range of between 240 and 260 m produced the highest S.CPUE. Changes in SST will control the muscle contraction. Slower muscle contraction will reduce the tail beat frequency and eventually will affect the swimming speed of fish. With the increase in SST, maximum swimming speed increases. However, an increase in SST greater than the preferred range will increment the energy costs for albacore, resulting in diminished execution and push, and it will influence the growth of albacore. Ambient SST affects the maximum swimming speed of fish. Being a carnivorous species [1], albacore has an indirect relationship with chlorophyll. SSC is the primary producer in the oceanic ecosystem, with secondary producers such as fish [4,6], shrimp, squid, and octopus feeding on it. Immature albacore then feed on these secondary producers. Thus, SSC is a critical factor to consider when describing the abundance of immature albacore tuna. SSS can be a crucial predictor. If the SSS is decreased from the preferred range of albacore then water density will also decrease, eventually affecting the swimming behavior (swimming is difficult in less dense water, thus swimming will cost higher energy loss) of albacore tuna. If the SSS is higher than the preferred range of immature albacore, then it will affect the osmoregulatory cost (use of extra energy) and begin affecting the growth of albacore tuna. MLD and SSH are related to each other and to SST. MLD is primarily determined by the action of turbulent mixing of the water mass due to wind stress and heat exchange at the air–sea interface. Cooling in SST can induce convection, which enlarges the MLD and decreases SSH.
Two possible reasons can affect the higher S.CPUE (used as the proxy of relative abundance) of immature albacore tuna. Presence of preferred optimal environmental ranges can be the first reason. Moreover, the location with high S.CPUE of immature albacore during April to September is the feeding zone of immature albacore tuna. These two facts can describe the spatial distribution of immature albacore tuna in the Indian Ocean. October to January is not the feeding time, and because of summer, an increase in SST might cause immature albacore to vertically dive in order to obtain the optimal SST. These might be the reasons behind the disappearance of high CPUE values in the southern sector from October–January.

4.3. Potential Implications for Albacore Fisheries

Understanding the species’ regional dynamics and interactions at the population scale in all oceanic regions is a key aspect of efforts to improve albacore management. Because albacore movement is affected by seasonal (ideal temperature) and food supply considerations, population structures found between and within oceans are challenging to explain [53]. Furthermore, albacore tuna shift vertically and horizontally in water columns throughout their life cycle, a behavior which increases the difficulty of determining their distribution. Multiple species distribution models, stratified by space and time for each ocean basin and albacore tuna fishery, are required to clarify the albacore’s complex distribution and spatial dynamics and the effect of changing environmental conditions.

5. Conclusions

In addition to the surface variable, the most influential vertical SST, OXY, SSC1 and SSS layers were found at various depths (i.e., 5, 26, and 53 m for SST; 200, 244, and 147 m for OXY; 508, 628, and 411 for SSCI; and 411, 508, and 773 m for SSS). Relative to the upper-surface base models, these layers had the lowest AICs. The GLM, GAM, BRT, and RF methods produced S.CPUE–P.CPUE correlation values of 0.798, 0.832, 0.841, and 0.856, respectively. The anticipated CPUE obtained through the GLM-derived full model exhibited the lowest correlation with S.CPUE; thus, only the GAM-, BRT-, and RF-derived full models were selected. From March to September, a high level immature albacore S.CPUE was mainly observed between 30°S and 40°S; after September, a northward shift occurred; and from November to January, S.CPUE was mainly concentrated at the south coast of Madagascar. Our findings indicate that the vertical distribution of immature albacore is affected by thermal preferences (as indicated by their diet); the vertical distribution of prey species, which is affected by the ocean’s thermal structure; or a combination of both factors.

Author Contributions

Conceptualization, S.M. and M.-A.L.; methodology, M.-A.L. and S.M.; software, S.M.; validation, M.-A.L.; formal analysis, S.M.; investigation, Y.-C.W. and J.-S.W.; resources, Y.-C.W. and J.-S.W.; writing—original draft preparation, S.M.; writing—review and editing, M.-A.L. and B.K.M. Mondal, visualization, Y.-C.W. All authors have read and agreed to the published version of the manuscript.

Funding

The Council of Agriculture (106AS-10.1.5-FA-F1 (4); 107AS-9.1.5-FA-F1 (4)) and Ministry of Science and Technology of Taiwan (MOST105-2611-M-019-011) financed this research.

Data Availability Statement

Not applicable.

Acknowledgments

We thank the three anonymous reviewers and the editor for their valuable comments and suggestions as well as the team members of the Fisheries Agency and Overseas Fisheries Development Council of Taiwan for their assistance in data preparation. We are grateful to the Council of Agriculture, Taiwan, for the grant.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

SDMSpecies distribution model
MODISModerate Resolution Imaging Spectroradiometer
COPCopernicus
AVHRRAdvanced Very High Resolution Radiometer
SSTTemperature
OXYDissolved oxygen
SSC (0–2)Chlorophyll (0–2 months lag)
SSSSalinity
UU-velocity
VV-velocity
EKEEddie kinetic energy
NPPNet primary productivity
MLDMixed layer depth
SSHSea surface height above geoid
N.CPUENominal catch per unit effort
S.CPUEStandardized catch per unit effort
GLMGeneralized linear modeling
AICAkaike information criterion
R2Correlation
GAMGeneralized additive model
BRTBoosted regression trees
RFRandom Forest
RSRandom splitting
LOOCVLeave-one-out cross-validation
RPearson correlation coefficient
RMSERoot mean square error
MAEMean absolute error
P.CPUEPredicted catch per unit effort

References

  1. Nikolic, N.; Morandeau, G.; Hoarau, L.; West, W.; Arrizabalaga, H.; Hoyle, S.; Nicol, S.J.; Bourjea, J.; Puech, A.; Farley, J.H.; et al. Review of albacore tuna, Thunnus alalunga, biology, fisheries and management. Rev. Fish Biol. Fish. 2017, 27, 775–810. [Google Scholar] [CrossRef]
  2. Khedkar, G.D.; Jadhao, B.V. Tuna and Tuna-like Fish of Tropical Climates; Elsevier Science Ltd.: Amsterdam, The Netherlands, 2003. [Google Scholar]
  3. Fernandez-Polanco, J.; Llorente, I. Tuna economics and markets. In Advances in Tuna Aquaculture; Academic Press: Cambridge, MA, USA, 2016; pp. 333–350. [Google Scholar]
  4. Mondal, S.; Vayghan, A.H.; Lee, M.A.; Wang, Y.C.; Semedi, B. Habitat suitability modeling for the feeding ground of immature albacore in the southern Indian Ocean using satellite-derived sea surface temperature and chlorophyll data. Remote Sens. 2021, 13, 2669. [Google Scholar] [CrossRef]
  5. Chen, I.C.; Lee, P.F.; Tzeng, W.N. Distribution of albacore (Thunnus alalunga) in the Indian Ocean and its relation to environmental factors. Fish. Oceanogr. 2005, 14, 71–80. [Google Scholar] [CrossRef]
  6. Wang, S.P. CPUE standardization of blue marlin caught by Taiwanese large-scale longline fishery in the Indian Ocean. In Proceedings of the 17th Working Party on Billfish, La Reunion, France, 9–12 September 2019; IOTC-2019-WPB17, 18. p. 41. [Google Scholar]
  7. Taiwan National Report to the Scientific Committee of the Indian Ocean Tuna Commission, 2019.
  8. Li, X.; Wang, Y. Applying various algorithms for species distribution modelling. Integr. Zool. 2013, 8, 124–135. [Google Scholar] [CrossRef] [PubMed]
  9. Elith, J.; Graham, C.H. Do they? How do they? WHY do they differ? On finding reasons for differing performances of species distribution models. Echography 2009, 32, 66–77. [Google Scholar] [CrossRef]
  10. Zimmermann, N.E.; Edwards, T.C., Jr.; Graham, C.H.; Pearman, P.B.; Svenning, J.C. New trends in species distribution modelling. Echography 2010, 33, 985–989. [Google Scholar] [CrossRef]
  11. Austin, M. Species distribution models and ecological theory: A critical assessment and some possible new approaches. Ecol. Model. 2007, 200, 1–19. [Google Scholar] [CrossRef]
  12. Levins, R. The strategy of model building in population ecology. Am. Sci. 1966, 54, 421–431. [Google Scholar]
  13. Rajeesh, R.; Dwarakish, G.S. Satellite oceanography—A review. Aquat. Procedia 2015, 4, 165–172. [Google Scholar] [CrossRef]
  14. Marra, J. Introduction to satellite oceanography. Limnol. Oceanogr. e-Lect. 2014, 4, 1–35. [Google Scholar] [CrossRef] [Green Version]
  15. Zhou, C.; He, P.; Xu, L.; Bach, P.; Wang, X.; Wan, R.; Tang, H.; Zhang, Y. The effects of mesoscale oceanographic structures and ambient conditions on the catch of albacore tuna in the South Pacific longline fishery. Fish. Oceanogr. 2020, 29, 238–251. [Google Scholar] [CrossRef]
  16. Chen, X.; Tian, S.; Chen, Y.; Liu, B. A modeling approach to identify optimal habitat and suitable fishing grounds for neon flying squid (Ommostrephes bartramii) in the northwest Pacific Ocean. Fish. Bull. 2010, 108, 1–14. [Google Scholar]
  17. Zainuddin, M.; Saitoh, S. Detection of potential fishing ground for albacore tuna using synoptic measurements of ocean color and thermal remote sensing in the northwestern North Pacific. Geophys. Res. Lett. 2004, 31, 31. [Google Scholar] [CrossRef]
  18. Zainuddin, M.; Saitoh, K.; Saitoh, S.-I. Albacore (Thunnus alalunga) fishing ground in relation to oceanographic conditions in the western North Pacific Ocean using remotely sensed satellite data. Fish. Oceanogr. 2008, 17, 61–73. [Google Scholar] [CrossRef] [Green Version]
  19. Lan, K.-W.; Lee, M.-A.; Chou, C.-P.; Vayghan, A.H. Association between the interannual variation in the oceanic environment and catch rates of bigeye tuna (Thunnus obesus) in the Atlantic Ocean. Fish. Oceanogr. 2018, 27, 395–407. [Google Scholar] [CrossRef]
  20. Xu, Y.; Nieto, K.; Teo, S.L.; McClatchie, S.; Holmes, J. Influence of fronts on the spatial distribution of albacore tuna (Thunnus alalunga) in the Northeast Pacific over the past 30 years (1982–2011). Prog. Oceanogr. 2017, 150, 72–78. [Google Scholar] [CrossRef]
  21. Yen, K.-W.; Chen, C.-H. Research Gap Analysis of Remote Sensing Application in Fisheries: Prospects for Achieving the Sustainable Development Goals. Remote Sens. 2021, 13, 1013. [Google Scholar] [CrossRef]
  22. Lan, K.W.; Nishida, T.; Lee, M.A.; Lu, H.J.; Huang, H.W.; Chang, S.K.; Lan, Y.C. Influence of the marine environment variability on the yellowfin tuna (Thunnus albacares) catch rate by the Taiwanese longline fishery in the Arabian sea, with special reference to the high catch in 2004. J. Mar. Sci. Technol. 2012, 20, 514–524. [Google Scholar]
  23. Chang, Y.J.; Hsu, J.; Lai, P.K.; Lan, K.W.; Tsai, W.P. Evaluation of the Impacts of Climate Change on Albacore Distribution in the South Pacific Ocean by Using Ensemble Forecast. Front. Mar. Sci. 2021, 1404. [Google Scholar] [CrossRef]
  24. Lan, K.W.; Chang, Y.J.; Wu, Y.L. Influence of oceanographic and climatic variability on the catch rate of yellowfin tuna (Thunnus albacares) cohorts in the Indian Ocean. In Deep Sea Research Part II: Topical Studies in Oceanography; Elsevier: Amsterdam, The Netherlands, 2020; Volume 175, p. 104681. [Google Scholar]
  25. Klemas, V. Remote sensing of environmental indicators of potential fish aggregation: An overview. Baltica 2012, 25, 99–112. [Google Scholar] [CrossRef] [Green Version]
  26. Pearce, J.; Ferrier, S. Evaluating the predictive performance of habitat models developed using logistic regression. Ecol. Model. 2000, 133, 225–245. [Google Scholar] [CrossRef] [Green Version]
  27. Elith, J.H.; Graham, C.P.H.; Anderson, R.P.; Dudík, M.; Ferrier, S.; Guisan, A.; Hijmans, R.J.; Huettmann, F.; Leathwick, J.R.; Lehmann, A.; et al. Novel methods improve prediction of species’ distributions from occurrence data. Echography 2006, 29, 129–151. [Google Scholar] [CrossRef] [Green Version]
  28. Tsoar, A.; Allouche, O.; Steinitz, O.; Rotem, D.; Kadmon, R. A comparative evaluation of presence-only methods for modelling species distribution. Divers. Distrib. 2007, 13, 397–405. [Google Scholar] [CrossRef]
  29. McKinney, J.; Hoffmayer, E.; Wu, W.; Fulford, R.; Hendon, J. Feeding habitat of the whale shark Rhincodon typus in the northern Gulf of Mexico determined using species distribution modelling. Mar. Ecol. Prog. Ser. 2012, 458, 199–211. [Google Scholar] [CrossRef] [Green Version]
  30. Scales, K.L.; Miller, P.I.; Ingram, S.N.; Hazen, E.L.; Bograd, S.J.; Phillips, R.A. Identifying predictable foraging habitats for a wide-ranging marine predator using ensemble ecological niche models. Divers. Distrib. 2016, 22, 212–224. [Google Scholar] [CrossRef] [Green Version]
  31. Alabia, I.D.; Saitoh, S.I.; Igarashi, H.; Ishikawa, Y.; Usui, N.; Kamachi, M.; Awaji, T.; Seito, M. Ensemble squid habitat model using three-dimensional ocean data. ICES J. Mar. Sci. 2016, 73, 1863–1874. [Google Scholar] [CrossRef] [Green Version]
  32. Araújo, M.B.; New, M. Ensemble forecasting of species distributions. Trends Ecol. Evol. 2007, 22, 42–47. [Google Scholar] [CrossRef]
  33. Bates, J.M.; Granger, C.W.J. The combination of forecasts. J. Oper. Res. Soc. 1969, 20, 451–468. [Google Scholar] [CrossRef]
  34. Oppel, S.; Meirinho, A.; Ramírez, I.; Gardner, B.; O’Connell, A.F.; Miller, P.I.; Louzao, M. Comparison of five modelling techniques to predict the spatial distribution and abundance of seabirds. Biol. Conserv. 2012, 156, 94–104. [Google Scholar] [CrossRef] [Green Version]
  35. Zhao, J.; Cao, J.; Tian, S.; Chen, Y.; Zhang, S.; Wang, Z.; Zhou, X. A comparison between two GAM models in quantifying relationships of environmental variables with fish richness and diversity indices. Aquat. Ecol. 2014, 48, 297–312. [Google Scholar] [CrossRef]
  36. Wood, S.N.; Augustin, N.H. GAMs with integrated model selection using penalized regression splines and applications to environmental modelling. Ecol. Model. 2002, 157, 157–177. [Google Scholar] [CrossRef] [Green Version]
  37. Hastie, T.J.; Tibshirani, R.J. Generalized Additive Models; Chapman & Hall: London, UK, 1990; p. 335. [Google Scholar]
  38. Leathwick, J.R.; Elith, J.; Hastie, T. Comparative performance of generalized additive models and multivariate adaptive regression splines for statistical modelling of species distributions. Ecol. Model. 2006, 199, 188–196. [Google Scholar] [CrossRef]
  39. Ptacnik, R.; Lepistö, L.; Willén, E.; Brettum, P.; Andersen, T.; Rekolainen, S.; Solheim, A.L.; Carvalho, L. Quantitative responses of lake phytoplankton to eutrophication in Northern Europe. Aquat. Ecol. 2008, 42, 227–236. [Google Scholar] [CrossRef]
  40. Chang, J.H.; Chen, Y.; Holland, D.; Grabowski, J. Estimating spatial distribution of American lobster Homarus americanus using habitat variables. Mar. Ecol. Prog. Ser. 2010, 420, 145–156. [Google Scholar] [CrossRef]
  41. Schmiing, M.; Afonso, P.; Tempera, F.; Santos, R.S. Predictive habitat modeling of reef fishes with contrasting trophic ecologies. Mar. Ecol. Prog. Ser. 2013, 474, 201–216. [Google Scholar] [CrossRef] [Green Version]
  42. Ratner, B. The correlation coefficient: Its values range between +1/−1, or do they? J. Target. Meas. Anal. Mark. 2009, 17, 139–142. [Google Scholar] [CrossRef] [Green Version]
  43. Wood, S.N. Generalized Additive Models: An Introduction with R; Chapman and Hall/CRC: Boca Raton, FL, USA, 2006. [Google Scholar]
  44. Akaike, H. A new look at the statistical model identification. In Selected Papers of Hirotugu Akaike; Parzen, E., Tanabe, K., Kitagawa, G., Eds.; Springer: New York, NY, USA, 1998; pp. 215–222. [Google Scholar]
  45. Thuiller, W.; Lafourcade, B.; Engler, R.; Araújo, M.B. BIOMOD—A platform for ensemble forecasting of species distributions. Ecography 2009, 32, 369–373. [Google Scholar] [CrossRef]
  46. Thuiller, W.; Georges, D.; Robin, E. R Package, version 3.1-64. In biomod2: Ensemble Platform for Species Distribution Modeling; Crane Holdings: Stamford, CT, USA, 2014. [Google Scholar]
  47. Perez, J.; Menendez, M.; Camus, P.; Mendez, F.J.; Losada, I.J. Statistical multi-model climate projections of surface ocean waves in Europe. Ocean. Model. 2015, 96, 161–170. [Google Scholar] [CrossRef]
  48. Quinci, E.M.; Torri, M.; Cuttitta, A.; Patti, B. Predicting Potential Spawning Habitat by Ensemble Species Distribution Models: The Case Study of European Anchovy (Engraulis encrasicolus) in the Strait of Sicily. Water 2022, 14, 1400. [Google Scholar] [CrossRef]
  49. Farley, J.H.; Hoyle, S.D.; Eveson, J.P.; Williams, A.J.; Davies, C.R.; Nicol, S.J. Maturity ogives for South Pacific albacore tuna (Thunnus alalunga) that account for spatial and seasonal variation in the distributions of mature and immature fish. PLoS ONE 2014, 9, e83017. [Google Scholar]
  50. Farley, J.H.; Williams, A.J.; Davies, C.R.; Clear, N.P.; Eveson, J.P.; Hoyle, S.D.; Nicol, S.J. Population Biology of Albacore Tuna in the Australian Region; Castray Esplanade: Hobart, Australia, 2012. [Google Scholar]
  51. Dotson, R.C. Minimum swimming speed of albacore, Thunnus alalunga. Fish. Bull 1976, 74, 955–960. [Google Scholar]
  52. Childers, J.; Snyder, S.; Kohin, S. Migration and behavior of juvenile North Pacific albacore (Thunnus alalunga). Fish. Oceanogr. 2011, 20, 157–173. [Google Scholar] [CrossRef] [Green Version]
  53. Sagarminaga, Y.; Arrizabalaga, H. Spatio-temporal distribution of albacore (Thunnus alalunga) catches in the northeastern Atlantic: Relationship with the thermal environment. Fish. Oceanogr. 2010, 19, 121–134. [Google Scholar] [CrossRef]
Figure 1. Experimental flowchart.
Figure 1. Experimental flowchart.
Remotesensing 14 05278 g001
Figure 2. Residual distribution and quantile–quantile plot for selected generalized linear models for immature albacore tuna.
Figure 2. Residual distribution and quantile–quantile plot for selected generalized linear models for immature albacore tuna.
Remotesensing 14 05278 g002
Figure 3. Comparison of N.CPUE and S.CPUE using selected generalized linear model for immature albacore tuna.
Figure 3. Comparison of N.CPUE and S.CPUE using selected generalized linear model for immature albacore tuna.
Remotesensing 14 05278 g003
Figure 4. Monthly three-dimensional spatial distribution of immature albacore tuna in Indian Ocean from 1998 to 2016. (a) January, (b) February, (c) March, (d) April, (e) May, (f) June, (g) July, (h) August, (i) September, (j) October, (k) November, (l) December.
Figure 4. Monthly three-dimensional spatial distribution of immature albacore tuna in Indian Ocean from 1998 to 2016. (a) January, (b) February, (c) March, (d) April, (e) May, (f) June, (g) July, (h) August, (i) September, (j) October, (k) November, (l) December.
Remotesensing 14 05278 g004
Table 1. Sources of various oceanographic variables derived from satellite source.
Table 1. Sources of various oceanographic variables derived from satellite source.
Environmental DataAbb.UnitSourceTime
Period
Spatial
Resolution
Temporal
Resolution
TemperatureSST°CCOP1998–20160.08° × 0.08°Monthly
Dissolved oxygenOXYmmol/LCOP1998–20160.08° × 0.08°Monthly
Chlorophyll (0–2 months lag)SSC (0–2)mgm−3COP1998–20160.25°× 0.25°Monthly
SalinitySSSpsuCOP1998–20160.08° × 0.08°Monthly
U-velocityUms−1COP1998–20160.08° × 0.08°Monthly
V-velocityVms−1COP1998–20160.08° × 0.08°Monthly
Eddie kinetic energyEKEm2s−2COP1998–20160.08° × 0.08°Monthly
Net primary productivityNPPmgm−3day−1COP1998–20160.25° × 0.25°Monthly
Mixed layer depthMLDmeterCOP1998–20160.08° × 0.08°Monthly
Sea surface height above geoidSSHmeterCOP1998–20160.08° × 0.08°Daily
COP—Copernicus. (https://resources.marine.copernicus.eu/products, accessed on 14 August 2022) EKE = 0.5 (U2 + V2). Unit for EKE is m2s−2.
Table 2. Performance of various combinations of generalized linear models for the standardization of immature albacore nominal CPUE.
Table 2. Performance of various combinations of generalized linear models for the standardization of immature albacore nominal CPUE.
No.ModelsAICNullResidualR2P(f)
1Year223,937135,722133,0060.002<0.001
2Year + Month216,436135,722117,8640.131<0.001
3Year + Month + Lat163,405135,72250,2030.63<0.001
4Year + Month + Lat + Lon147,018135,72238,4710.716<0.001
5Year + Month + Lat + Lon + Interactions142,977135,72235,2490.781<0.001
Lat = Latitude; Lon = Longitude; Interactions = Year*Lat, Year*Lon, Lat*Lon
Table 3. Environment parameters selected using GAM, BRT, and RF for immature albacore tuna model construction.
Table 3. Environment parameters selected using GAM, BRT, and RF for immature albacore tuna model construction.
GAMParameterAICDEV EXPAdj. R squ.GCV
OXY86,498.1785.50.8550.235
SST86,563.9685.40.8540.232
SSS150,07359.60.5960.652
MLD174,173.640.50.4050.961
SSC1179,620.936.10.3611.049
SSC0185,294.528.90.2891.149
SSC2195,414.921.30.2131.912
EKE186,516.827.50.2741.721
U187,736.4260.261.195
V199,962.59.960.0991.454
SSH200,105.29.750.0971.458
NPP205,861.61.010.0091.599
BRTParameterRMSEDEV EXPAdj. R squ.MAE
OXY0.4885.70.8570.255
SST0.48385.50.8550.264
SSS0.79760.50.6050.462
MLD0.97940.50.4050.662
SSC10.99239.40.3940.678
SSC01.12327.90.2790.84
SSC21.14524.50.2450.856
EKE1.00232.80.3280.691
U1.07728.40.2840.789
V1.19910.90.1090.942
SSH1.2069.80.0980.904
NPP1.2424.70.0470.991
RFParameterRMSEDEV EXPAdj. R squ.MAE
OXY0.44583.60.8360.251
SST0.47883.10.8310.259
SSS0.71762.10.6210.465
MLD0.95342.40.4240.657
SSC10.98737.10.3710.673
SSC01.00532.70.3270.684
SSC21.02130.90.3090.721
EKE1.13825.10.2510.87
U1.17923.30.2330.859
V1.08721.20.2120.752
SSH1.23412.30.1230.935
NPP1.2419.70.0970.898
Table 4. (a) Multicollinearity test results for parameters selected for the immature albacore tuna model based on Pearson correlation analysis value. (b) Multicollinearity test results for parameters selected for the immature albacore tuna model based on VIF value.
Table 4. (a) Multicollinearity test results for parameters selected for the immature albacore tuna model based on Pearson correlation analysis value. (b) Multicollinearity test results for parameters selected for the immature albacore tuna model based on VIF value.
(a)
OXYSSTSSSMLDSSC1
OXY1
SST−0.581
SSS0.48−0.531
MLD0.55−0.560.41
SSC10.3−0.420.160.31
(b)
OXYSSTSSSMLD
OXY
SST4.3
SSS3.84.0
MLD4.14.33.7
SSC12.7−3.51.12.9
Table 5. Evaluation of different selected environmental layers with respect to S.CPUE.
Table 5. Evaluation of different selected environmental layers with respect to S.CPUE.
ParametersR-squ.VIP
OXY_2000.714
OXY_2440.705
OXY_1470.733
TEM_50.761
TEM_260.752
TEM_530.697
SAL_5080.716
SAL_6280.688
SAL_4110.6211
SSC1_5080.669
SSC1_6280.6710
SSC1_4110.6012
MLD0.4913
Table 6. Optimal ranges of different environmental parameters in different vertical layers with respect to the highest S.CPUE.
Table 6. Optimal ranges of different environmental parameters in different vertical layers with respect to the highest S.CPUE.
ParametersDepthOptimal RangeUnitsS.CPUE
OXY200240–260 12.17
244235–255mmol/L11.38
147240–260 11.99
TEM513–15 12.24
2612–14°C11.91
5314–16 10.63
SSS50834.3–34.4 10.52
62834.3–34.4psu10.14
41134.4–34.5 8.33
SSC15080.012–0.013 15.34
6280.005–0.006Mgm−37.86
4110.02–0.021 19.466
MLD 250–260meter19.14
Table 7. Prediction performance of single-algorithm models when three validation techniques were applied.
Table 7. Prediction performance of single-algorithm models when three validation techniques were applied.
Validation TechniquesMethodsRMSER2MAE
703070307030
10 foldGLM0.5160.5070.8190.8090.3880.381
GAM0.5210.5140.8170.8070.3840.378
BRT0.5150.5010.8180.8110.3850.382
RF0.5140.5020.8180.8130.3870.383
LOOCVGLM0.5140.5070.8150.8030.3860.379
GAM0.5190.5120.8180.8120.3880.381
BRT0.5180.5090.8110.8030.3860.383
RF0.5120.5030.8120.8030.3860.381
CVGLM0.5170.5050.8150.8070.3790.371
GAM0.5140.5080.8140.8050.3780.377
BRT0.5130.5040.8150.8110.3810.371
RF0.5210.5160.8180.8130.3830.376
Random
Splitting
GLM0.5550.5430.8110.8040.3870.382
GAM0.5410.5320.8160.8110.3850.378
BRT0.5430.5280.8130.8020.3860.382
RF0.5480.5330.8130.8030.3840.378
Table 8. Evaluation of single-algorithm models with full data set for ensemble model construction.
Table 8. Evaluation of single-algorithm models with full data set for ensemble model construction.
Single-Algorithm ModelAICRMSEMAE
GLM13,254.230.7710.456
GAM11,354.150.6230.402
BRT10,999.870.5980.376
RF10,785.350.5950.354
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Mondal, S.; Wang, Y.-C.; Lee, M.-A.; Weng, J.-S.; Mondal, B.K. Ensemble Three-Dimensional Habitat Modeling of Indian Ocean Immature Albacore Tuna (Thunnus alalunga) Using Remote Sensing Data. Remote Sens. 2022, 14, 5278. https://doi.org/10.3390/rs14205278

AMA Style

Mondal S, Wang Y-C, Lee M-A, Weng J-S, Mondal BK. Ensemble Three-Dimensional Habitat Modeling of Indian Ocean Immature Albacore Tuna (Thunnus alalunga) Using Remote Sensing Data. Remote Sensing. 2022; 14(20):5278. https://doi.org/10.3390/rs14205278

Chicago/Turabian Style

Mondal, Sandipan, Yi-Chen Wang, Ming-An Lee, Jinn-Shing Weng, and Biraj Kanti Mondal. 2022. "Ensemble Three-Dimensional Habitat Modeling of Indian Ocean Immature Albacore Tuna (Thunnus alalunga) Using Remote Sensing Data" Remote Sensing 14, no. 20: 5278. https://doi.org/10.3390/rs14205278

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop