You are currently viewing a new version of our website. To view the old version click .
Sensors
  • Article
  • Open Access

19 December 2025

Multivariate Gas Sensor E-Nose System with PARAFAC and Machine Learning Modeling for Quantifying and Classifying the Impact of Fishing Gears

and
Instituto de Investigación para la Gestión Integrada de Zonas Costeras, Universitat Politècnica de València, Carrer del Paranimf 1, 46730 Grao de Gandia, Valencia, Spain
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Nature Inspired Engineering: Biomimetic Sensors (2nd Edition)

Highlights

This research addresses critical gaps in the real-time quality assurance of the hydrobiological industry by developing a novel monitoring methodology. Accordingly, our findings are structured around the following key questions:
What are the main findings?
  • A novel methodology integrating MOSR sensors, PARAFAC, and Machine Learning was implemented for real-time quality monitoring.
  • The L2 Norm metric successfully quantified metabolic stress deviation, establishing a data-driven distinction between various fishing gears.
What are the implications of the main findings?
  • The methodology provides an objective tool for seafood traceability and quality differentiation.
  • Generating gear-specific stress profiles can be utilized as a robust standard to support modern sustainability and food quality certification schemes.

Abstract

The quality of seafood is intrinsically linked to the accumulated history of stress, feeding, handling, and physical damage imposed by the fishing gear employed. This study proposes an innovative methodology using an E-nose sensor. The study species was Sparus aurata. Eight fishing gears were studied. The methodology integrates Parallel Factor Analysis (PARAFAC) for impact quantification and Machine Learning (ML) for classifying the fishing gear of origin. Longline was established as the method with the lowest deviation. The impact hierarchy, from highest to lowest deviation, is as follows: Aquaculture 50.61% (95% CI: 34%, 68%), Purse seine 37.92% (95% CI: 22%, 54%), Trawl 35.92% (95% CI: 21%, 51%), Gillnet (three panels) 27.69% (95% CI: 14%, 41%), Gillnet (single panel) 24.63% (95% CI: 9%, 40%), Gillnet (two panels) 18.12% (95% CI: 4%, 31%) and Hook and line 1.36% (95% CI: −10%, 13%). For the classification task, 33 ML models were evaluated. Subspace KNN model yielded the best results with an accuracy of 97.14% in the validation and 98.08% in the testing, using 35 variables. Using 10, 15, 20, 25, and 30 variables, an accuracy higher than 85% was achieved. These results demonstrate the high precision in fish traceability by exploiting the sensor response profile left by each fishing gear.

1. Introduction

The fishing industry, in its diverse modalities such as artisanal, industrial, or aquaculture, plays a fundamental role in global food supply, economic development, and the culture of numerous communities. However, the type of fishing gear or aquaculture system used not only determines the sustainability of the catch but also its dynamic quality, which is intrinsically linked to the accumulated history of stress, physical damage, and handling [1,2,3]. Each method implies different degrees of manipulation [4], exposure to contaminants [5], stress levels [6], and post-capture handling times [7], factors that trigger physiological responses which manifest as a “chemical signature” of the fish, making it a silent carrier of its history [6,8].
Fishing gears such as Hook and line, Gillnet, Purse seine, Longline or Trawl influence the selectivity, degree of stress, handling, exposure to contaminants, and physical damage suffered by the fish [4,6]. In parallel, aquaculture introduces other variables that affect the fish composition. The use of antibiotics, antiparasitics, growth promoters, and industrial feeds can alter their metabolism, leaving internal chemical residues that reflect their artificial environment [9,10].
Given that both capture and rearing methods lead to chemical changes in the fish’s composition [5,11], effective monitoring tools are required to verify quality and provenance. However, the current tools available to monitor these effects are generally costly and inaccessible. Techniques such as mass spectrometry [12] or chromatography [13] require specialized laboratories, trained personnel, and prolonged analysis times—resources that are not always available in fishing communities, local markets, or small processing plants [14]. Therefore, it is necessary to develop a solution that is fast, portable, and accessible.
In order to address this need for accessible and portable analytical tools, a highly promising approach lies in electronic noses (E-noses), devices that capture the volatile profile released by fish. This profile directly mirrors metabolic stress and the early stages of spoilage, providing objective parameters for assessing fish quality [15]. However, the scientific literature has not yet clearly established to what extent different fishing gears (trawls, longlines, purse seines, etc.) influence the final quality of seafood products [16]. For this reason, it is essential to combine chemometric analysis, sensor technologies, and direct comparisons between fishing methods [17,18]. This integration would enable much stronger traceability, more reliable quality control, and truly meaningful eco-certifications [18,19,20,21]. Moreover, by applying Machine Learning (ML) algorithms [22] we can accurately classify the capture method, the freshness status of the fish, and even detect potential contaminants [23]. This represents a tangible, readily implementable opportunity to enhance quality control and ensure credible sustainable certification in a sector that currently lacks accessible, objective tools [15].
In response to these challenges, the primary objective of this study is two-fold: first, to quantify the impact of fishing gears [15,23] and classify them correctly through an objective metric, and second, to generate a highly accurate classification model capable of determining the capture method of a fish based on the sensor response profile with respect to the intrinsic basal profile. The species studied is Sparus aurata [24]. We achieve this by integrating an Electronic Nose (E-nose) system with Parallel Factor Analysis (PARAFAC) for data decomposition, allowing us to isolate and quantify the chemical stress signature of each fishing method. The resulting classification model demonstrates high accuracy in distinguishing between the capture methods, providing a robust, cost-effective, and scalable tool for traceability and sustainable quality control in the seafood value chain.
The rest of the study is divided into five sections. Section 2 details the most relevant reported studies, whereas Section 3 describes Materials and Methods. Results are presented in Section 4, followed by Discussion in Section 5 and Conclusions and future perspectives in Section 6.

3. Materials and Methods

This section presents a detailed explanation of the sensor employed, the data analysis and processing to evaluate the impact of fishing gear on the intrinsic basal profile of Sparus aurata, and the parameters established for the classification of fishing gears.

3.1. Sensor

A gas sensor setup based on MOSR technology, widely used for VOC detection, was employed. The 35 variables extracted from the MQ sensor modules are detailed in Table 1. The algorithm governing the processes is shown in Figure 1.
Table 1. MQ sensor modules utilized and their sensitivities.
Figure 1. Proposed node operation. The light blue shaded area is the Data acquisition and preprocessing stage. The orange shaded area is the Chemometric and Machine Learning Analysis stage. The purple shaded area is the Communication and Transfer Data stage. Green dots indicate flow connectors linking different processes in the diagram.

3.2. Data Collection

The sampling was performed individually for each fishing gear. Sparus aurata samples were obtained directly from fishermen upon their arrival at the port. The fishing areas are located in the Western Mediterranean Sea. The port of landing was Gandía. The time elapsed from sample capture until landing was considered an intrinsic part of the accumulated impact of the fishing gear. This approach was based on the fact that each distinct gear type inherently dictates its own operational protocols, handling complexities, and retrieval times, making this duration an inseparable, characteristic variable of the classification factor. On the other hand, the subsequent time, from port reception until the moment of analysis, was standardized and uniform for all samples, thus eliminating this variable as a differentiating or confounding factor. To mitigate sensor drift and environmental effects, the sensor response was normalized and recorded as the dynamic ratio response ( R 0 R S / R 0 ), where R 0 is the stable baseline resistance recorded in a controlled clean air environment [37]. Additionally, new MQ sensor modules were employed for each sampling session to eliminate long-term drift accumulation.
The eviscerated sample was placed (as seen in Figure 2) for 10 min before measurement to condition the chamber. The sample was eviscerated to eliminate the variability of the microbial load. The chamber was hermetically sealed with film. The temperature inside the chamber was maintained at 10 °C throughout all sampling. The sensor was configured to record readings at a frequency of 0.65 min, covering a total period of 1.5 days (36 h). This extended, high-frequency measurements was specifically designed to capture the kinetics of release and the detailed evolutionary profile of post-capture VOCs. For each fishing gear category, 10 biological replicates were collected. Sample weights were 279 ± 20 g with a 95% CI; this controlled biometric range was implemented to minimize the physiological variability related to mass, thereby supporting the assumption that weight variations had no significant impact on the sensor responses.
Figure 2. Top view of the measurement chamber with the MQ sensor modules.

3.3. Preprocessing

Each dataset corresponded to a single fishing gear. It consisted of the measurement of 35 sensor variables along 2600 time instances. For the analysis, these data were structured into a three-dimensional tensor described in Equation (1).
X R I × J × K
where I represents the time instances, J represents the sensor variables, and K represents the fishing gears. Preprocessing was carried out using Python 3.9 with the pandas library. A cleaning and standardization process was performed for each data file, consisting of:
  • Reading: CSV files were imported, and non-numeric values were converted to NaN.
  • Missing Value Imputation: Missing values were imputed using a temporal window algorithm. For each missing value, up to gamma previous valid observations and up to omega subsequent valid observations are considered. If both windows exist, the mean of each window ( μ p r e v , μ n e x t ) and the relative difference ( δ ) are calculated as explained by Equation (2).
δ = μ p r e v μ n e x t m a x μ p r e v , μ n e x t , ε
If δ τ (tolerancia relativa) (relative tolerance), the imputed value is obtained as the weighted mean of the two windows; if δ > τ linear interpolation is applied between the last previous observation and the first subsequent one. If neighbors from only one side exist, the mean of the available window is used; if the series completely lacks valid observations, the column median is used as a fallback value. In this work, we assume γ = 10 , τ = 0.5 and ε = 10 12 which is a numerical tolerance constant used to prevent division-by-zero errors. Additionally, a binary mask (0/1) was generated to identify the imputed positions, which
Outlier detection and treatment: Extreme values were identified using the interquartile range (IQR). The interval was defined according to Equation (3).
I = Q 1 α · I Q R , Q 3 + α · I Q R
where α = 1.5 is a sensitivity factor. Outliers were marked as atypical, and a missing value imputation process, as previously described, was performed; this filtering step is crucial to avoid overfitting to sensor noise and transient artifacts.
The data from each sensor were standardized (StandardScale) so that each series had a mean of 0 and a standard deviation of 1.

3.4. Data Processing

The software employed were Python 3.9 and MATLAB R2025a. 35 variables obtained from the MQ sensors were processed. To determine the impact of the fishing gear, 50 different ranks ( r ) of PARAFAC were studied with 100 distinct initializations, creating a total of 5000 models. 33 ML models were trained to classify the fishing gears, with 65% of the readings assigned for training, 25% for validation, and the remaining 10% for testing the models. The models obtained with the training dataset were evaluated using both the validation dataset and the testing dataset.

3.4.1. Tensorial Analysis (PARAFAC) for Impact Assessment

PARAFAC was employed to decompose the data tensor X into a set of latent ranks representing the underlying patterns of the chemical signature. PARAFAC allows the tensor to be modeled as the sum of vector products and enables the isolation of contributions from each dimension independently. The model is expressed mathematically in Equation (4).
X i j k r = 1 R ω r . α i r . b j r . c k r + E i j k
where X i j k is the element of the tensor at position (i, j, k). ω r is the weight of the r-rank. α i r , b j r and c k r are the elements of the normalized factor matrices A t i m e R I × R , B s e n s o r s R J × R and C f i s h i n g g e a r R K × R , respectively. E i j k is the residual or error unexplained by the model.
The model was executed using the tensorly 0.8.1 library. The number of ranks r was chosen from 1 to 10. 100 distinct seeds were employed for initialization. These values were considered to study the general trend of the models at different numbers of ranks and different initializations. normalize_factors = True was set in the PARAFAC function; the magnitude of each rank was stored in the vector ω r , esto permite interpretar la importancia de cada factor

3.4.2. Determination of the Number of Ranks (r)

The Cumulative Explained Variance ( VE C ) was used as the main criterion and is expressed in Equation (5). A range of r = 1 to r = 50 was evaluated across 100 random initializations (seeds) for each r . This procedure was performed to identify the elbow in the explained variance curve. The inflection point will be selected as the r value that maximizes the explained variance while minimizing the inclusion of residual noise.
V E C = 100 × 1 X X ^ F 2 X F 2
where X represents the original data tensor X R I × J × K . X ^ is the reconstructed tensor estimated by the PARAFAC model with r ranks. · F 2 is the squared Frobenius norm.

3.4.3. Quantification of the Fishing Gear Impact

To quantify the impact of each fishing gear, the magnitude of the deviation of its response profile for each sensor variable was evaluated with respect to the intrinsic basal profile of Sparus aurata. This deviation was quantified using the Euclidean norm ( L 2 norm) of the contributions of the rank ( r ) that determines the optimal dimensionality.
The impact metric for each fishing gear ( k ) is defined as the total magnitude of its accumulated contribution across all selected optimal latent ranks ( r ). To this end, the contribution of rank ( r ) to a specific fishing gear ( k ) is modeled as the product between the rank weight ( ω r ) and the fishing gear factor ( c k r ). Thus, the point impact ( I k s ) of a fishing gear k for a seed index ( s ) is calculated as the L 2 norm of the contribution of the r ranks, according to Equation (6).
I k s = r = 1 R ω r . c k r 2
Subsequently, to reduce the variability associated with different initial conditions, the impact of each fishing gear was averaged over 100 distinct seeds for each r value.

3.4.4. Classification of Fishing Gears

The models used to classify the fishing gears are detailed in Table 2. Each method offers different benefits, especially in terms of adapting to distinct patterns and complexity levels in the data. The metric employed to measure classification performance is accuracy, given its frequent use by numerous authors.
Table 2. Comparison of Classification Models.

4. Results

This section presents the results. First, a summary of the general results from all tests is provided. Subsequently, the performance of the developed models is discussed.

4.1. Fishing Gear Impact

With PARAFAC, 5000 models were studied, comprising 50 different rank (r) values, each with 100 distinct initializations, to estimate the appropriate number of ranks for the impact calculation.
The calculation covered r = 1 up to r = 50 ranks, across 100 distinct seeds (initializations). The results of the average Cumulative Explained Variance are presented in Figure 3, illustrating the progressive variance gain upon incorporating each rank.
Figure 3. Average cumulative explained variance (%) as a function of the number of ranks (r).
The analysis revealed that the model with r = 1 explains approximately 37.6% of the variance. The variance increases significantly, reaching 78.79% with r = 10 ranks. Although the explained variance climbs to a maximum of 96.0% with r = 50 ranks, the small percentage gain after r = 10 suggests that the model begins to capture low-level variability and noise. Consequently, the number of ranks at the inflection point was selected for the calculation of the impact metric, optimizing the balance between explained variance and model stability.
For each number of ranks ( r ) up to 10, the average impact over the 100 initializations (seeds) was calculated. The result of this first stage is the impact metric used to plot the stability in Figure 4. The initial results for r = 1 and r = 2 are low and unstable, as the model still fails to adequately separate the basal component from the variation components. Starting from r = 6 up to 10, the trend of the impact values for each fishing gear stabilizes. This inflection point indicates that this number of ranks is sufficient to isolate the main patterns (basal profile, feeding, stress, handling, damage, and metabolic alteration). The hierarchical order of impact from this point does not vary significantly, which would explain why they do not depend on the exact choice of r in the upper range.
Figure 4. Stability of the Impact Metric as a Function of Model Complexity ( r ).

4.2. Quantification of the Global Average Fishing Gear Impact

To obtain the definitive impact metric r = 10 was chosen because it corresponds to the ‘elbow’ inflection point of the explained variance, capturing 78.79% of the total system variation. The global average impact ( I ¯ k ) over the 100 initializations was calculated.
Figure 5 presents the global average impact (average L 2 norm) of the eight investigated fishing gears with a 95% confidence interval. Samples obtained from aquaculture showed the highest average impact, with 247 ± 28 a.u. This reflects a significant deviation in the intrinsic basal profile of Sparus aurata, which is attributed to the artificial diet and confined environment rather than acute capture stress. Purse seine and trawl fishing occupy the second and third places for highest impact, respectively, consistent with the high physical stress associated with crowding and abrasion [1] showing a mean L 2 norm of 226 ± 26 a.u. and 223 ± 25 a.u., respectively. Gillnet (three panels), Gillnet (single panel) and Gillnet (two panels) reached a mean L 2 norm of 209 ± 22 a.u. 204 ± 25 a.u. and 193 ± 22 a.u., respectively. Longline fishing was the gear with the lowest recorded impact, followed by Hook and line, with a mean L 2 norm of 164 ± 19 a.u. and 166 ± 19 a.u., respectively. This may be due to these methods being individual capture techniques that minimize striking and stress due to crowding.
Figure 5. Global Average Impact ( I ¯ k ) for Fishing Gear. Arbitrary units (a.u).
Longline, with an L 2 norm of 164 ± 19 a.u., was established as the basal profile of minimum impact. It is assumed that it is not possible to find a sample with a lower impact than those obtained by this fishing gear for Sparus aurata. Based on this value, the relative percentage difference was calculated for each gear. A severity scale is established and presented in Table 3 which validates that the impact severity is grouped into physiologically coherent categories [3,26,38].
Table 3. Relative impact increase (95% CI range) by fishing gear.

4.3. Classification Learning of Fishing Gear

Features derived from the MQ sensor responses were used for the classification of the fishing gears. The accuracy of the models was evaluated across subsets of 10, 15, 20, 25, 30, and 35 variables.
Using 10 variables (MQ-4 and MQ-7 sensors), the Kernel Naive Bayes model achieved an accuracy of 86.97% in the validation phase and 88.65% in the test phase (see Figure 6a). Increasing to 15 variables (MQ-4, 7, and 8), the Subspace Discriminant model reached an accuracy of 90.35% in validation and 90.29% in test (see Figure 6b). With 20 variables (MQ-4, 6, 7, and 8), the Subspace Discriminant model remained first, obtaining 92.79% in validation and 92.93% in test (see Figure 6c). With 25 variables (MQ-2, 4, 6, 7, and 8), the Kernel Naive Bayes model reached an accuracy of up to 96.60% in validation and 97.36% in test (see Figure 6d). Subsequently, the set of 30 variables (MQ-2, 4, 5, 6, 7, and 8) with the Kernel Naive Bayes model showed a slight saturation, with accuracies of 96.36% in validation and 96.59% in test (see Figure 6e). Finally, by employing the entirety of the 35 MQ sensor variables (from 2 to 8), the Subspace KNN model achieved the maximum discrimination capacity, with 97.14% in the validation phase and 98.08% in the test phase (see Figure 6f). These results confirm that the use of the complete sensor array allows for the most precise identification of the fishing gears. In Figure 6a–e, the achieved accuracy is presented using various sensor arrays. The validation phase is presented in blue and the test phase in orange. The models are grouped by Model ID and organized according to Table 2.
Figure 6. Comparative Performance of Machine Learning Models in Fishing Gear Classification. (a) 10 variables (MQ-4 and 7). (b) 15 variables (MQ-4, 7 and 8). (c) 20 variables (MQ-4, 6, 7 and 8). (d) 25 variables (MQ-2, 4, 6, 7 and 8). (e) 30 variables (MQ-2, 4, 5, 6, 7 and 8). (f) 35 variables (MQ-2, 3, 4, 5, 6, 7 and 8).

Subspace KNN Classification Model

When all 35 variables are used, the Subspace KNN model achieved the best performance with an accuracy of 97.14% in the validation phase and 98.08% in the test phase. The comprehensive performance profile, including operational efficiency and detailed metrics, is presented in Figure 7. The model demonstrated high operational efficiency: Prediction Speed 174.6 obs/sec, Training Time 205.3 s, Coder Model Size 83.61 MB.
Figure 7. Subspace KNN Classification Model Performance: Macro Metrics in Percentage Scale Visualization using Spider Plots for (a) Validation and (b) Test Datasets.
The errors in the confusion matrices in Figure 8 are concentrated in systematic patterns that reflect the similarity in response profiles induced by the fishing operations. The most significant confusion occurs between aquaculture and high-volume net methods (gillnet and purse seine). This error is due to the fact that both the chronic stress from confinement in aquaculture and the crowding, striking, and physical handling in nets generate comparable Volatile Organic Compound (VOC) signatures. Other errors correlate with the degree of physical damage, such as the confusion between Gillnet (three panels) and trawl, methods that induce a similar acute stress profile. In contrast, the most selective and lowest-impact methods, such as hook and line and longline, exhibit the lowest error rates, as observed in Figure 8a,b, confirming that less aggressive handling produces more distinctive and separable chemical signatures. In summary, the model successfully classifies by isolating fundamental chemical differences. Crucially, the residual errors validate that the similarity in the VOC profile is a direct consequence of the operation and the physical impact of the fishing gear, confirming the physiological clustering previously quantified by the L2 Norm impact metric.
Figure 8. Confusion Matrices of the Subspace KNN Model with 35 Variables. (a) Validation Set. (b) Testing Set.
Figure 9 presents the average influence of the predictive variables in the Subspace KNN model, quantified by the mean of the absolute Shapley values. This Explainable Artificial Intelligence (XAI) metric indicates the average impact of each sensor variable on the magnitude of the prediction [39] to the fishing gear classification. The high stability of this predictive hierarchy was consistently verified, as the relative ranking of feature importance remained invariant across all cross-validation folds.
Figure 9. Shapley importance for Subspace KNN model with 35 variables.
The Shapley analysis reveals a clear chemical hierarchy in the model’s decision-making, which validates that the classification is based on evidence of physiological impact and post-capture handling. The most influential predictors are consistently the readings for Carbon Monoxide (MQ8_CO, MQ4_CO) and Alcohols (MQ4_Alcohol, MQ6_Alcohol). This dependence is crucial: different fishing gears impose distinct levels of physical damage and stress, which affects enzymatic release and initial pH [3]. The stress and struggle induced by certain methods (such as confinement in aquaculture or trawl) force the muscle to perform anaerobic metabolism, leading to rapid chemical changes and the release of specific Volatile Organic Compounds (VOCs). Alcohols are direct markers of this anaerobic fermentation and incipient spoilage, while CO and other gases are byproducts of metabolic stress and accelerated cellular breakdown. The model exploits this combined signature of stress/damage and the initial spoilage rate to create a robust discriminatory profile, confirming that the classification of origin is, in essence, an impact classification.

5. Discussion

Various studies conclude that there is a significant difference in stress levels for fish captured by distinct fishing gears [3,16,28,38,40]. However, these investigations are based on chemical analysis, mortality rates, percentage of damaged individuals, among others. The L2 Norm applied to the PARAFAC scores allowed us to group the impact severity into physiologically coherent categories. The fishing gear with the highest impact is aquaculture with 50.61% (95% CI: 34%, 68%). The significant deviation in aquaculture is primarily due to a chronic metabolic alteration inherent to the production system, influenced by the artificial diet and potential medication, which fundamentally modifies the composition of basal VOCs in the tissue [29]. Purse seine and Trawl reached 37.92% (95% CI: 22%, 54%) and 35.92% (95% CI: 21%, 51%), respectively, indicating a moderate to high impact. The deviation in purse seine and trawl is due to acute and severe stress from crushing and confinement [1,4,26,40], which forces anaerobic metabolism and rapid enzymatic release. This results in the accelerated production of alcohols and carbon monoxide (CO), creating VOC signatures of physical trauma and metabolic alteration similar in magnitude. Finally, the moderate impact methods are Gillnet (three panels), Gillnet (single panel), and Gillnet (two panels) with 27.69% (95% CI: 14%, 41%), 24.63% (95% CI: 9%, 40%), and 18.12% (95% CI: 4%, 31%), respectively. Their deviation correlates with stress from hypoxia and immobilization [16,28], which generates a more attenuated and different VOC signature compared to mechanical trauma. Finally, Hook and line with 1.36% (95% CI: −10%, 13%) and Longline with 0% (95% CI: −12%, 12%) show minimal deviations [27]. This is consistent with faster handling and insignificant tissue damage, which minimizes anaerobic metabolism and post-capture VOC release. Furthermore, both fishing gears are not statistically distinguishable.
In the field of classification, there are various studies utilizing E-nose on marine products. Wijaya et al. [23] achieved 99% accuracy in freshness and an R2 of 0.995 to predict microbiological development. Silva et al. [24] achieved 99% accuracy to classify species and an R2 of 0.99 to estimate the time without refrigeration. Other approaches, such as that by Balim et al. [35] achieved an accuracy of 88.44% to classify freshness. However, these investigations focus on classifying freshness or distinct species. In contrast, works focused on differentiating populations within the same species were reviewed. For example, Lai et al. with two Principal Components (PC), explained 50.2% of the total variance in the distinction of different Sparus aurata populations. Bodin et al. [31] managed to classify geographical origin with a predictive power Q2 of 0.83 and storage conditions with a Q2 of 0.82. The described models achieve an accuracy inferior to the proposed classification model considering the same fish species. With the proposed Subspace KNN classification model, an accuracy of 97.14% was achieved in the validation phase and 98.08% in the test phase.
The performance of the proposed system validates the efficacy of the sensors and the analysis methodology. Furthermore, the coherence of the erroneous classifications between aquaculture and high-volume fishing gears confirms that the system is measuring functional similarity in physiological stress profiles and not random errors. The system is cost-effective and does not require specialized personnel, offering a practical and affordable solution for the value chain.

6. Conclusions

The objective quantification of fishing gear impact is a methodological challenge that requires techniques capable of isolating subtle variation signals. Our approach based on PARAFAC decomposition and Machine Learning (ML) classification of the Volatile Organic Compound (VOC) profile, offers a robust and verifiable solution.
The results demonstrate that the procurement method that deviates the most from the intrinsic basal profile of Sparus aurata is aquaculture, with 50.61% (IC 95%: 34%, 68%). Those with the lowest impact were Longline with 0% (IC 95%: −12%, 12%) and Hook and line with 1.36% (IC 95%: −10%, 13%). For fishing gear classification, the Subspace KNN model achieved an accuracy of 97.14% for validation and 98.08% for testing. Using the mean of the absolute Shapley values the most influential sensors were determined to be the MQ4 and MQ8.
Despite the robustness of the proposed methodological approach, involving the PARAFAC and ML models, it is imperative to acknowledge that further research is still required to ensure the precision of the obtained results.
Future work will focus on two key areas: the scalability and validation of the methodology. Additionally, it will systematically address the influence of critical intrinsic variables, such as capture-to-port time and sample weight. First, to confirm generalizability, dedicated external validation studies are required. This involves expanding the database to include samples collected across diverse geographical basins, multiple ports and various fishing seasons to assure the model’s robustness under varying real-world conditions, as well as incorporating more species of commercial interest. It is also crucial to expand the database to include more species of commercial interest and to evaluate how water quality [21] influences the results. Second, we plan to systematically address and quantify the influence of key variables currently treated as intrinsic factors, specifically: the capture-to-port time (to decouple the fishing gear impact from temporal autolysis/spoilage effects) and weight variations, as these are critical variables for practical application. To improve robustness, drift compensation techniques and advanced preprocessing [41] will be applied to isolate the sensor response from the environmental effect. In parallel, the fusion of multimodal sensors will be explored to maintain accuracy in dynamic and volatile production environments (visible/NIR optics [42]).

Author Contributions

Conceptualization, V.L.S.-A. and J.L.; methodology, V.L.S.-A. and J.L.; software, V.L.S.-A.; validation, V.L.S.-A. and J.L.; formal analysis, V.L.S.-A.; investigation, V.L.S.-A.; resources, J.L.; data curation, V.L.S.-A.; writing—original draft preparation, V.L.S.-A.; writing—review and editing, J.L.; visualization, V.L.S.-A. and J.L.; supervision, J.L.; project administration, J.L.; funding acquisition, V.L.S.-A. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been funded by the “Ministerio de Economía y Competitividad” through the Project TED2021-131040B-C31. This study also forms part of the ThinkInAzul programme and was supported by MCIN with funding from European Union NextGenerationEU (PRTR-C17.I1) and by Generalitat Valenciana (THINKINAZUL/2021/002). The author V.L.S.-A. is supported by a contract from the Universitat Politècnica de València through the “Programa de Ayudas de Investigación y Desarrollo (PAID-01-24)”.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data provided can be found within the article. The original contributions made in this study are included in the document; any additional inquiries can be directed to the author or authors responsible.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PARAFACParallel factor analysis
MLMachine learning
CIConfidence interval

References

  1. Rasul, M.G.; Yuan, C.; Yu, K.; Takaki, K.; Shah, A.K.M.A. Factors Influencing the Nutritional Composition, Quality and Safety of Dried Fishery Products. Food Res. 2022, 6, 444–466. [Google Scholar] [CrossRef] [PubMed]
  2. Abraha, B.; Admassu, H.; Mahmud, A.; Tsighe, N.; Shui, X.W.; Fang, Y. Effect of Processing Methods on Nutritional and Physico-Chemical Composition of Fish: A Review. MOJ Food Process. Technol. 2018, 6, 376–382. [Google Scholar] [CrossRef]
  3. Erikson, U.; Uglem, S.; Greiff, K. Freeze-Chilling of Whitefish: Effects of Capture, On-Board Processing, Freezing, Frozen Storage, Thawing, and Subsequent Chilled Storage—A Review. Foods 2021, 10, 2661. [Google Scholar] [CrossRef] [PubMed]
  4. Janči, T.; Gauta, T.; Putnik, P.; Kanski, D.; Lovrinov, M. Influence of Fish Handling Practices Onboard Purse Seiners on Quality Parameters of Sardines (Sardina pilchardus) during Cold Storage. Biomolecules 2023, 13, 192. [Google Scholar] [CrossRef] [PubMed]
  5. Khushbu; Gulati, R.; Sushma; Kour, A.; Sharma, P. Ecological Impact of Heavy Metals on Aquatic Environment with Reference to Fish and Human Health. J. Appl. Nat. Sci. 2022, 14, 1471–1484. [Google Scholar] [CrossRef]
  6. Czorlich, Y.; Aykanat, T.; Erkinaro, J.; Orell, P.; Primmer, C.R. Rapid Evolution in Salmon Life History Induced by Direct and Indirect Effects of Fishing. Science 1979 2022, 376, 420–423. [Google Scholar] [CrossRef]
  7. Litaay, C.; Opier, R.D.A.; Horhoruw, S.M.; Hehuwat, J.; Arfah, H.; Rugebregt, M.J. Post-Catch Handling Affects the Quality of Skipjack Tuna (Katsuwonus pelamis). IOP Conf. Ser. Earth Environ. Sci. 2023, 1224, 12014. [Google Scholar] [CrossRef]
  8. Cortés-Sánchez, A.D.J.; Diaz-Ramírez, M.; Torres-Ochoa, E.; Espinosa-Chaurand, L.D.; Rayas-Amor, A.A.; Cruz-Monterrosa, R.G.; Aguilar-Toalá, J.E.; de la Paz Salgado-Cruz, M. Processing, Quality and Elemental Safety of Fish. Appl. Sci. 2024, 14, 2903. [Google Scholar] [CrossRef]
  9. Ljubojević Pelić, D.; Radosavljević, V.; Pelić, M.; Živkov Baloš, M.; Puvača, N.; Jug-Dujaković, J.; Gavrilović, A. Antibiotic Residues in Cultured Fish: Implications for Food Safety and Regulatory Concerns. Fishes 2024, 9, 484. [Google Scholar] [CrossRef]
  10. Lulijwa, R.; Rupia, E.J.; Alfaro, A.C. Antibiotic Use in Aquaculture, Policies and Regulation, Health and Environmental Risks: A Review of the Top 15 Major Producers. Rev. Aquac. 2020, 12, 640–663. [Google Scholar] [CrossRef]
  11. Singh, G.; Sharma, S. Heavy Metal Contamination in Fish: Sources, Mechanisms and Consequences. Aquat. Sci. 2024, 86, 107. [Google Scholar] [CrossRef]
  12. Liu, X.; Liu, Z.; Zhao, H.; Shi, R.; Wang, J.; Liu, J.; Di, S.; Wang, Z.; Wang, X.; Liu, L.; et al. Development and Validation of Modified QuEChERS Method Combined with LC-MS/MS for High-Throughput Analysis of per- and Polyfluoroalkyl Substances in Fish Samples. Microchem. J. 2024, 199, 110037. [Google Scholar] [CrossRef]
  13. Saba, F.; Qayoom, I.; Fatima, M.; Abubakr, A.; Khan, S.; Mushtaq, T.; Rasool, F.; Akhter, M. Understanding Applications of Chromatographic Techniques in Fisheries: Present Status and Future Perspective. Proc. Indian Natl. Sci. Acad. 2025. [Google Scholar] [CrossRef]
  14. Karahan, A.; Douek, J.; Paz, G.; Stern, N.; Kideys, A.E.; Shaish, L.; Goren, M.; Rinkevich, B. Employing DNA Barcoding as Taxonomy and Conservation Tools for Fish Species Censuses at the Southeastern Mediterranean, a Hot-Spot Area for Biological Invasion. J. Nat. Conserv. 2017, 36, 1–9. [Google Scholar] [CrossRef]
  15. Yavuzer, E. Determination of Fish Quality Parameters with Low Cost Electronic Nose. Food Biosci. 2021, 41, 100948. [Google Scholar] [CrossRef]
  16. Svalheim, R.A.; Jensen, T.K.; Hustad, A. Impact of Gillnet Soaking Time on Survival, Stress Physiology, and Muscle Quality in Atlantic Cod (Gadus morhua). Anim. Welf. 2025, 34, e64. [Google Scholar] [CrossRef] [PubMed]
  17. Landge, A.T.; Mano, A.; Akansha; Nikhil, K.; Abuthagir Iburahim, S. IoT and AI Applications in Fishing: Smart Fishing for Sustainable Practices. In Information Technology in Fisheries and Aquaculture; Ganie, P.A., Posti, R., Pandey, P.K., Eds.; Springer Nature: Singapore, 2025; pp. 59–70. [Google Scholar]
  18. Yan, J.; Guo, X.; Duan, S.; Jia, P.; Wang, L.; Peng, C.; Zhang, S. Electronic Nose Feature Extraction Methods: A Review. Sensors 2015, 15, 27804–27831. [Google Scholar] [CrossRef]
  19. Binson, V.A.; Thomas, S. Integrated Sensor System for Real-Time Monitoring and Detection of Fish Quality and Spoilage. Proc. West. Mark. Ed. Assoc. Conf. 2024, 104, 26. [Google Scholar] [CrossRef]
  20. Setyagraha, M.R.M.; Nurqamaradillah, H.A.; Hermawan, L.M.; Pratama, N.R.; Novamizanti, L.; Wijaya, D.R. A Portable Real-Time Electronic Nose for Evaluating Seafood Freshness Using Machine Learning. IEEE Access 2025, 13, 99583–99602. [Google Scholar] [CrossRef]
  21. Eze, E.; Kirby, S.; Attridge, J.; Ajmal, T. Aquaculture 4.0: Hybrid Neural Network Multivariate Water Quality Parameters Forecasting Model. Sci. Rep. 2023, 13, 16129. [Google Scholar] [CrossRef]
  22. Cordero Mancilla, M.A.; Moncada, W.; Silva Alvarado, V.L. A New Machine Learning Algorithm to Simulate the Outlet Flow in a Reservoir, Based on a Water Balance Model. Limnol. Rev. 2025, 25, 29. [Google Scholar] [CrossRef]
  23. Wijaya, D.R.; Syarwan, N.F.; Nugraha, M.A.; Ananda, D.; Fahrudin, T.; Handayani, R. Seafood Quality Detection Using Electronic Nose and Machine Learning Algorithms with Hyperparameter Optimization. IEEE Access 2023, 11, 62484–62495. [Google Scholar] [CrossRef]
  24. Silva Alvarado, V.L.; Diaz, F.J.; Parra, L.; Lloret, J.; Aldana, C.; Saavedra, Y. Proposal for a Gas Sensor Device to Classify Hydrobiological Species and Estimate Non-Refrigeration Time. IEEE Sens. J. 2025, 25, 18015–18022. [Google Scholar] [CrossRef]
  25. Roque d’orbcastel, E.; Bettarel, Y.; Dellinger, M.; Sadoul, B.; Bouvier, T.; Amandé, J.M.; Dagorn, L.; Geffroy, B. Measuring Cortisol in Fish Scales to Study Stress in Wild Tropical Tuna. Environ. Biol. Fishes 2021, 104, 725–732. [Google Scholar] [CrossRef]
  26. Sureda, A.; Barceló, C.; Tejada, S.; Montero, I.; Langley, E.; Box, A. Physiological and Survival Effects of Capture of Red Scorpion Fish Scorpaena Scrofa (Osteichthyes: Scorpaenidae) by Different Fishing Gears in the Balearic Islands (Western Mediterranean). Fish. Res. 2020, 229, 105616. [Google Scholar] [CrossRef]
  27. Santos, M.N.; Gaspar, M.B.; Monteiro, C.C.; Vasconcelos, P. Gill Net and Long-Line Catch Comparisons in a Hake Fishery: The Case of Southern Portugal. Sci. Mar. 2002, 66, 433–441. [Google Scholar] [CrossRef]
  28. Toledo-Guedes, K.; Ulvan, E.M.; Uglem, I. Commercial Gillnetting Is More Stressful for Saithe (Pollachius virens L.) than Jigging: But Is Fillet Quality Affected? Aquat. Living Resour. 2016, 29, 203. [Google Scholar] [CrossRef]
  29. Badaoui, W.; Marhuenda-Egea, F.C.; Valero-Rodriguez, J.M.; Sanchez-Jerez, P.; Arechavala-Lopez, P.; Tole-do-Guedes, K. Metabolomic and Lipidomic Tools for Tracing Fish Escapes from Aquaculture Facilities. ACS Food Sci. Technol. 2024, 4, 871–879. [Google Scholar] [CrossRef]
  30. Lai, J.; Zhao, L.; Fan, Y.; Qu, X.; Liu, D.; Guo, Z.; Wang, Y.; Liu, Q.; Chen, Y. Using Whole Body Elemental Fingerprint Analysis to Distinguish Different Populations of Coilia nasus in a Large River Basin. Biochem. Syst. Ecol. 2015, 60, 249–257. [Google Scholar] [CrossRef]
  31. Bodin, N.; Amiel, A.; Fouché, E.; Sardenne, F.; Chassot, E.; Debrauwer, L.; Guillou, H.; Tremblay-Franco, M.; Canlet, C. NMR-Based Metabolic Profiling and Discrimination of Wild Tropical Tunas by Species, Size Category, Geographic Origin, and on-Board Storage Condition. Food Chem. 2022, 371, 131094. [Google Scholar] [CrossRef]
  32. Rosa, L.N.; Coqueiro, A.; Março, P.H.; Valderrama, P. Thermal Rice Oil Degradation Evaluated by UV–Vis-NIR and PARAFAC. Food Chem. 2019, 273, 52–56. [Google Scholar] [CrossRef]
  33. Rosa, L.N.; Gonçalves, T.R.; Gomes, S.T.M.; Matsushita, M.; Gonçalves, R.P.; Março, P.H.; Valderrama, P. N-Way NIR Data Treatment through PARAFAC in the Evaluation of Protective Effect of Antioxidants in Soybean Oil. Molecules 2020, 25, 4366. [Google Scholar] [CrossRef]
  34. Liu, H.; Zhao, B.; Zhang, X.; Li, L.; Zhao, Y.; Li, Y.; Duan, K. Investigating Biochar-Derived Dissolved Organic Carbon (DOC) Components Extracted Using a Sequential Extraction Protocol. Materials 2022, 15, 3865. [Google Scholar] [CrossRef]
  35. Balım, C.; Olgun, N.; Çalışan, M. Leveraging Feature Fusion of Image Features and Laser Reflectance for Automated Fish Freshness Classification. Sensors 2025, 25, 4374. [Google Scholar] [CrossRef] [PubMed]
  36. Shinoda, R.; Wu, H.; Murata, M.; Ohnuki, H.; Yoshiura, Y.; Endo, H. Development of an Optical Communication Type Biosensor for Real-Time Monitoring of Fish Stress. Sens. Actuators B Chem. 2017, 247, 765–773. [Google Scholar] [CrossRef]
  37. Shooshtari, M.; Salehi, A. An Electronic Nose Based on Carbon Nanotube-Titanium Dioxide Hybrid Nanostructures for Detection and Discrimination of Volatile Organic Compounds. Sens. Actuators B Chem. 2022, 357, 131418. [Google Scholar] [CrossRef]
  38. Veldhuizen, L.J.L.; Berentsen, P.B.M.; de Boer, I.J.M.; van de Vis, J.W.; Bokkers, E.A.M. Fish Welfare in Capture Fisheries: A Review of Injuries and Mortality. Fish. Res. 2018, 204, 41–48. [Google Scholar] [CrossRef]
  39. Kim, D.; Antariksa, G.; Handayani, M.P.; Lee, S.; Lee, J. Explainable Anomaly Detection Framework for Maritime Main Engine Sensor Data. Sensors 2021, 21, 5200. [Google Scholar] [CrossRef] [PubMed]
  40. Digre, H.; Tveit, G.M.; Solvang-Garten, T.; Eilertsen, A.; Aursand, I.G. Pumping of Mackerel (Scomber scombrus) Onboard Purse Seiners, the Effect on Mortality, Catch Damage and Fillet Quality. Fish. Res. 2016, 176, 65–75. [Google Scholar] [CrossRef]
  41. Zaragoza-Esquerdo, M.; Silva-Alvarado, V.L.; Parra, L.; Sendra, S.; Lloret, J.; Pulido-Velázquez, M. Performance of ML-Based Classification Models as Edge Computing in IoT Nodes for a Marine Observatory. In Learning and Analytics in Intelligent Systems; Springer Nature: Berlin/Heidelberg, Germany, 2025; Volume 48, pp. 87–98. [Google Scholar]
  42. Silva Alvarado, V.L.; Heydari, A.; Parra, L.; Lloret, J.; Tomas, J. Proposal for Low-Cost Optical Sensor for Measuring Flow Velocities in Aquatic Environments. Sensors 2024, 24, 6868. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.