1. Introduction
Conservation of fishery resources requires accurate estimates of total catch, by species, to inform stock assessment models. Total species catch is typically obtained from either a sample of fishing units, raised through a statistical approach to the fleet level, or from a census of all fishing units. For large-scale tropical tuna purse-seine fisheries, census data sources include logbook data and, in some cases, onboard observer data. However, concerns about the reliability of species composition estimates of these data sources have been raised, in particular, as regards species misidentification and/or estimation error that occurs because species quantities must be estimated rapidly, by eye [
1,
2,
3]. Development of methods for obtaining more accurate estimates of species composition continues to be a topic of research [
1,
4,
5]. For the tropical tuna purse-seine fishery of the Eastern Pacific Ocean (EPO; 50° S to 50° N, from the coast of the Americas to 150° W), which is managed by the Inter-American Tropical Tuna Commission (IATTC), concern over estimated species catch amounts from logbook, observer, and cannery (processors) data [
3], especially misidentification of small individuals of bigeye tuna (BET,
Thunnus obesus) and yellowfin tuna (YFT,
Thunnus albacares) [
6], resulted in modifications of the IATTC port-sampling program in 2000 [
3]. The intent of those modifications was that species composition could be estimated solely from port-sampling data [
3,
7,
8].
Although sample data collected by scientific technicians of port-sampling programs or by observers at sea are considered more reliable for species identification, for large-scale fisheries, such as tropical tuna purse-seine fisheries, it is both costly and logistically challenging to obtain adequate sampling coverage of all areas, time periods, and operational factors (e.g., gear characteristics) required for stock assessment modeling [
1,
4,
9]. For the current EPO stock assessments for BET, YFT, and skipjack (SKJ,
Katsuwonus pelamis) tunas, fisheries are each defined by quarter (but catch is estimated by month), up to five areas, and three purse-seine set types [
10,
11,
12], for a total of up to 180 ‘cells’ for which species catch composition estimates are required. With the coverage of the IATTC port-sampling data, each year there are a number of cells with tropical tuna catch but little to no port-sampling data, and sample data from ‘neighboring’ cells are used to obtain estimates [
3]. There is concern that this approach may lead to estimates with low precision and/or bias for some fisheries defined in stock assessments [
9], and increasing sampling coverage is unlikely due to cost.
Given these data collection and estimation challenges, several model-based estimation approaches have been developed that draw on multiple data sources to mitigate the shortcomings of each. For the European purse-seine fleet [
1], port-sampling and logbook/landing data are used to estimate catch composition, where the port-sampling data are used to correct the species composition of sets associated with sampled wells. A random forest algorithm is then used to model the relationship between set-level corrected species catch and environmental and operational information and then to predict the species composition for unsampled sets. For Western and Central Pacific (WCPO) purse-seine fisheries, models have also been developed to predict species composition at the set level from environmental and operational information, but the model is built using species composition estimates obtained from onboard observer sampling [
4,
5]. For the EPO tuna purse-seine fishery, with its 100% observer coverage of large purse-seine vessels (those with a fish-carrying capacity of at least 364 metric tons), a model-based approach focusing on the use of observer and port-sampling data is most practical.
In 2023, a special purse-seine port-sampling program, the Enhanced Monitoring Program (EMP), was initiated by the IATTC to support additional management measures for BET [
13]. Those additional management measures established an individual vessel threshold system (IVT) for BET purse-seine catches to encourage vessels to reduce their annual catches of BET. The purpose of the EMP was to collect data for trip-level estimation of BET catch, focusing on large vessels with historically high BET catches, as an aid to those vessels and their CPCs in their efforts to comply with the IVT. Given that most of the EPO BET purse-seine catch is generated by sets on tunas associated with floating objects (OBJ) [
14], the EMP focused on sampling wells with catches derived from OBJ sets. The OBJ-set fishery in the EPO is a multispecies fishery, dominated by SKJ catch but also producing catches of BET and YFT, and it is the primary set type generating BET catch for the purse-seine fleet [
14]. Although BET was the focus of the IVT and EMP, the EMP sampling protocol was not species specific and thus generated data for any tropical tuna species in the catch of sampled wells. Given the large number of wells sampled per trip by the EMP and the extensive sampling of the well catch [
15], the EMP data represent a unique data set for in-depth study of the relationship between observer and port-sampling estimates of species composition for the most species-diverse purse-seine set type.
In this paper, we present an analysis of the well-level relationship between observer and EMP port-sampling estimates of species composition for each of the three target tuna species. We explore a different approach to that taken by other studies, whereby we attempt to take advantage of the paired EMP port-sampling and observer estimates at the well level to develop an approach for bias correction of unsampled wells. That is, we assume that set-level environmental and operational factors that would affect species composition of the catch in the well for both data sources are sufficiently controlled for in the analysis by pairing the data at the well level. With this assumption, what is to be estimated with the paired data set is, therefore, the overall relationship between the two types of estimates and any covariate effects related to operational factors that could negatively impact an observer’s ability to make an accurate estimate of species composition. The analysis is based on mixed-effects modeling of observer and EMP data from OBJ-set wells of large purse-seine vessels operating in the EPO during 2023–2024. We discuss the use of these results in the context of a catch estimation for fisheries defined in stock assessments and the implications of the model results for sampling design development.
4. Discussion
In this study, mixed-effects models were fitted to paired port-sampling and observer well-level estimates of species composition as part of the development of a model-based approach for bias-adjusting observers’ species composition estimates for unsampled wells. Overall, it was found that there was a general tendency for the OBS proportions of BET and YFT in the well to be greater than the model-estimated proportions, with the opposite occurring for SKJ. However, these tendencies were sometimes modified by vessel effects. Model complexity, as regards covariate effects other than vessel and trip, was greatest for BET, the species associated with additional management measures during the study period, and least complex for SKJ, the species least likely to be misidentified as BET [
35]. The only two covariates significantly affecting the port-sampling–observer well-level relationships for BET, beyond vessel and trip effects, were presence/absence of a hopper and year. The overall effect of a hopper (intercept and slope effects combined) would be consistent with the use of the hopper by the vessel crew providing the observer with better visibility of catch composition, which led to more similar OBS proportions and model-estimated proportions of BET. The year effect on the overall intercept rotated the fitted relationship to yield a greater difference between OBS proportions and model-estimated proportions of BET in 2024, as compared to 2023, which may be related to evolving dynamics in response to the new management measures, as discussed below. Because the data used in this study were collected from a subset of the EPO purse-seine fleet, repeating this study with data collected for the entire fleet would provide useful information on the extent to which the port-sampling–observer relationship might differ by fleet component. The new port-sampling protocol proposed for the EPO purse-seine fishery [
9] would produce port-sampling data appropriate for such a study.
It is useful to consider aspects of the purse-seine fleet, beyond the EMP data set, to try to interpret the significant year effect of the lowest-AIC BET model. The fleet-level estimate of BET catch for 2024 was about 20% lower than the 2023 estimate, and both the 2023 and 2024 estimates were the lowest since the IATTC adopted a new port-sampling protocol for fleet-level catch estimation in 2000 [
14,
36]. Extra effort on the part of the purse-seine fleet to reduce BET catches in response to the IVT and the EMP, as opposed to a change in BET abundance, has been credited with the decrease in BET catches in 2023 and 2024 [
37]. From fitting the lowest-AIC BET model to each year’s data separately (without the year effect), it can be seen that fitted relationships for the pooled data set are similar, except for the no-hopper intercept (compare the AIC = 458 model coefficients in
Table 6 to those in
Table 9). The slope of the EMP-OBS relationship, and the hopper effect on that slope, were similar for 2023, 2024, and the pooled data. In addition, the intercept when a hopper was present is also similar for the two years, about −0.17 (= −0.2615 + 0.0902) in 2023 and about −0.15 (= −0.5585 + 0.4115) in 2024 (
Table 9). On the other hand, the intercept when a hopper was not present, for 2024, was considerably more negative than that for 2023 (
Table 9), −0.56 versus −0.26 on the Box–Cox scale (
t-test
p-value = 0.052), Thus, while the actual mechanism behind this result is not presently clear, it could reflect evolving dynamics of the fleet and observers as the fishery adapts to the IVT and EMP. Regardless of the underlying mechanism, a significant year effect (
Table 6) has implications for future data collection (see below). We note that the significant year effect is unlikely to be due to a sampling artifact because the EMP sampling protocol and the EMP sampling technicians remained the same during 2023 and 2024, and many of the EMP sampling technicians that began sampling in March 2023 were already experienced because they were part of the EMP pilot study from September 2022 to February 2023.
Regarding future development of this modeling approach, there are several ways in which the mixed-effects models presented might be improved, beyond expanding data collection to a broader section of the purse-seine fleet. First, several of the covariates used in this study were vessel-level or trip-level, yet with more information, they could be revised to be well-level covariates. For example, if hopper use were recorded on a set-by-set basis, then a summary of hopper use related to each well could be computed, such as the proportion of the catch in the well that was from sets for which the hopper was used. This would not only provide hopper-use information on a better scale, matching the well-level scale of the response variable, but it would also reduce any potential for confounding with the vessel effect. The catch area and the trimester, which although not significant in these models, possibly as a result of the paired nature of the data, might similarly be revised using observer data to, for example, the centroid of positions of sets in the well and the month associated with the majority of the catch in the well. Moreover, other operational covariates might be considered to determine if they improved prediction for wells of unsampled vessels.
Second, in this study, incorporating an observer random effect was problematic because of the imbalance of the data with respect to observers. The OBS data were collected by 99 observers, of whom 57 were represented by one trip and an additional 31 observers by two trips. The trip effect used in the mixed-effects model, therefore, likely captured any differences among observers related to their skill at species catch estimation. Developing covariates that capture observer experience and skill, beyond days at sea, and that do not require balanced data, would clearly be beneficial. Such covariates might include, for example, measures of an observer’s ‘exposure’ to different catch loading practices and gear, such as brailer capacity and construction, brailing speed, and hopper use. It is noted that differences among field technicians in their tendencies to underestimate/overestimate quantitative information have been demonstrated in other data collection settings, such as among marine mammal technicians participating in abundance surveys [
38].
Finally, future research might address some of the challenges related to fitting models to weight-based proportion data. In this analysis, the use of data transformation, necessary to meet the Gaussian assumption of the mixed-effects models, led to some nonlinearity on back-transformation. Because the weight-based proportion data do not conform to the distributional assumptions of generalized linear models for Bernoulli and binomial observations (e.g., ref. [
39]), such techniques were not used. Using generalized linear models would have been advantageous because the distributional aspects are handled separately from the modeled relationship between the response variable and covariates. In principle, OBS proportion data based on numbers of fish could be obtained. However, in tropical tuna purse-seine fisheries, catch quantities are generally recorded in terms of weight, not numbers. The weight of tunas recorded by observers or in logbooks could be converted to numbers of fish by species, but this requires length–weight data for all three species and would introduce error due to the conversion.
Results from the species models suggest several aspects worth consideration when designing a port-sampling protocol to collect data for estimation of bias adjustments. First, the importance of the vessel effects for prediction of well-level species composition, particularly for BET, suggests that the sampling protocol should generate data appropriate for estimation of vessel effects. This would suggest sampling a minimum of two trips per vessel (assuming observers are not frequently placed on the same vessel) and three wells per trip of as many vessels as is logistically feasible to be able to estimate vessel effects independent of trip and observer effects. The optimal sample sizes would be best determined using a simulation, which was beyond the scope of this paper. In addition, for all vessels, obtaining operational characteristics that may affect the observer’s ability to clearly view the catch composition would also be important. Second, the importance of the year effect in the lowest-AIC BET model suggests that, rather than assume bias adjustments are temporally invariant, a prudent approach to data collection would be to collect data annually so that bias adjustments can evolve over time, if necessary, with changes in management measures. An analysis of EMP pilot study data [
18] found trends in the proportion of BET in the catch during unloading of individual OBJ-set wells, and as a result, obtaining a sample of the entire well catch (as was performed by the EMP, e.g.,
Figure 1), as opposed to a sample from only a section of the well, was important for reducing variance and bias on the estimated well-level proportion of BET from sample data. Increased variance and bias on the sample estimates would likely lead to increased prediction error on the estimates of species composition for unsampled wells.
The port-sampling–observer mixed-effects models for estimating species proportions of unsampled wells can be viewed as one of two components for estimating fleet-level species catch for the fisheries defined in stock assessments. For fisheries with adequate sample data, a design-based approach could be used for species catch estimation, while for those fisheries with little or no sample data, the mixed-effects species models could be used to estimate the species composition of unsampled wells. This is the general approach adopted for species catch estimation for the purse-seine fleets in the WCPO, although that modeling used environmental and operational covariates to predict set-level catch composition [
5]. Given that our port-sampling data came from a specific subset of the large-vessel purse-seine fleet, we do not present a worked example of the model-based component but rather provide a conceptual outline of the steps that would be involved.
- (1)
Predict species proportions, , for unsampled wells
Wells with OBS > 0: develop models, such as those presented in this paper, with OBS species proportions as a covariate, as well as operational covariates that affect the observer’s view of the catch loading process. Spatial, temporal, and operational covariates related to the fishing process could be included to evaluate whether the port-sampling–observer relationship varies in space and/or time or by gear (e.g., set type), which is an important part of evaluating how well the fitted model may generalize to other fleet components. If there are many zero-valued port-sampling proportions, a mixture model, such as a hurdle model [
39], might be appropriate.
Wells with OBS = 0: develop spatial–temporal models, which may include additional/different covariates than those used in (i). In the case of zero-valued port-sampling proportions, a hurdle model or other type of mixture model might be appropriate.
The data for BET were used to illustrate a simple two-part hurdle model, using covariates shown in
Table 1 that are related to operational aspects of the fishing process. The presence/absence of any amount of BET in the well was modeled with a logistic regression with logit link function [
39]; from
Table 3, there were 67 absences (zeros) and 143 presences (ones). For the EMP BET proportions greater than zero, those proportion values were Box–Cox transformed and modeled with a linear regression model, assuming a Gaussian distribution. Because the data set is relatively small (210 wells), with more than half the trips only represented by one or two wells, only main effects were fitted, without nested random effects for trips within vessels. The importance of area in the logistic component of the hurdle model (
Table 10) is consistent with large-scale spatial patterns in BET habitat preferences and catches in the EPO [
11,
24], suggesting that with a larger data set, spatial–temporal models might yield more accurate predicted species composition than just using the overall mean.
When modeling transformed data, the predicted species proportions would need to be back-transformed using a bias correction. In the case of a hurdle model, the estimated species proportion on the [0, 1] scale would be the product of the estimated probability of presence of the species from the logistic component and the back-transformed, bias-corrected estimate of the species proportion from the positive component.
- (2)
Normalize well-level predicted species proportions
If separate models are fitted for each species, the estimated species proportions would need to be rescaled (normalized) to sum to 1 (by well) over the three species of tropical tunas before estimating species catch. That is, for well
k, the normalized species proportion,
, could be computed as follows:
Differences in model covariates and within-group error, by species, could complicate fitting a single model to the data of all three species, and thus fitting separate models to each species might be preferred in some cases.
- (3)
Estimate species catch
The estimated catch of a species in an unsampled well would be the product of the normalized species proportion for the well and the total well weight of tropical tunas. The estimate of the species catch for unsampled wells of a fishery defined by the stock assessment would be the sum of those well-level estimates. If the port-sampling–observer relationship differs by purse-seine set type, then species estimates for a well with catch from multiple set types would need to be obtained as the sum of estimates for each set type, using the observer estimates of the tropical tuna catch amount that was loaded into the well from each set.
- (4)
Compute variance on the estimated species catch
Because the fleet-level estimate for unsampled wells would be based on the sum of back-transformed species proportions multiplied by their respective well catches, variance could be obtained using a simulation with the estimated model parameters and their corresponding variance–covariance information. Such a posterior simulation approach is discussed in [
40]. As suggested by a reviewer, uncertainty associated with the data transformation could be incorporated into the variance estimation procedure.
The modeling approach presented in this paper could potentially be expanded to leverage spatio-temporal correlations and other operational covariate information for predicting the species composition, as has been done using integrated analysis in other applications (e.g., [
41]). Our approach predicts the species composition based solely on the observer species proportions and the estimated bias in that relationship, relative to the true proportion (as represented by the port-sampling species proportions). If the well-level species composition could be accurately predicted using an integrated analysis approach, this may improve the estimates when the observer data are particularly variable (i.e., the bias is variable and unpredictable) and for wells of unsampled vessels; with the approach presented in this paper, estimates for wells of unsampled vessels would be based on the population-level predictions. Formulation of such a model might have three components. There would be an assumed model for the true species proportion, p_true = f(environmental, spatiotemporal, operational covariates) + error_true, with separate models for observer and port-sampling data: p_observer = g(p_true) + error_observer; and p_port-sampling = h(p_true) + error_port-sampling. The function g would be formulated to address bias in the observer estimates, and modeling error_observer > error_port-sampling would upweight the port-sampling data to reflect its assumed greater accuracy for species composition. Such a model would be able to fit all observer data, not just the observer data of wells sampled by a port-sampling program.