1. Introduction
According to the China Fisheries Statistical Yearbook, the production of sea bass in aquaculture has consistently exceeded 100,000 tons in recent years, with Guangdong, Fujian, and other regions accounting for over 80% of the national output [
1]. Sea bass, as one of the key economic fish species and a leading variety in national marine aquaculture, requires precise feeding control to optimize aquaculture efficiency and economic returns. Overfeeding leads to feed wastage and an increase in harmful bacteria in the aquatic environment, raising maintenance costs. Conversely, insufficient feeding results in nutrient deficiencies and slowed growth. Therefore, determining the optimal feeding amount that ensures proper growth without waste is a pressing scientific challenge in aquaculture.
Traditional feeding prediction methods largely depend on historical data and expert experience, typically determining daily feed amounts based on feeding rate tables and the fish’s body weight [
2]. However, fish growth is rarely a simple linear process and typically follows a specific growth curve. These curves provide a more accurate reflection of fish weight changes at different growth stages, allowing for better predictions of their feed requirements [
3]. For example, Xiaoyan et al. [
4] applied the logistic growth curve model to analyze the early growth of largemouth bass, demonstrating that the model accurately simulates the S-shaped growth curve. Similarly, Xiao Jun et al. [
5] utilized the logistic model to examine Nile tilapia growth, showing that the model effectively fits the growth curve with a high R
2 value (0.991), revealing dynamic growth patterns. Li Yuhu et al. [
6] used the Gompertz curve to simulate the growth of the vannamei shrimp, capturing its rapid and stable growth phases. Li Jun et al. [
7] employed the Von Bertalanffy model for lip fish growth in net-cage farming, which effectively described the log-growth and stable phases, providing valuable insights for aquaculture management. Therefore, establishing growth curves for sea bass and adjusting feeding amounts based on weight and growth stages allows for optimized feed delivery at each stage, improving feed conversion efficiency and reducing costs. Although growth curves are essential, they fail to account for the significant effects of water quality and meteorological factors on sea bass feeding and growth.
Real-time water quality monitoring is an indispensable practice in aquaculture. Adjusting feeding amounts based on measured water quality parameters can help reduce feed costs. Buentello et al. [
8] identified that factors such as water temperature and dissolved oxygen influence daily feed demand and feed conversion efficiency. For example, peak feeding demand for spotted bass occurs at 27.1 °C with 100% dissolved oxygen saturation. Wu Qiangze [
9] developed an intelligent feeding system using fuzzy logic control, incorporating dissolved oxygen, water temperature, and fish weight as input variables, with feeding amount as the output. Wu et al. [
10] introduced an adaptive neural network fuzzy inference system to estimate feeding amounts based on dissolved oxygen levels, achieving a prediction accuracy of 97.89%, providing an efficient feeding decision-making tool. Chen et al. [
11] combined backpropagation neural networks with evolutionary algorithms to predict fish food intake using variables such as group size, average weight, dissolved oxygen, and water temperature. Their model demonstrated a correlation coefficient of 0.96 between predicted and actual feeding amounts, offering a robust method for feed intake prediction. While meteorological factors, such as temperature, light, and wind speed, may not directly influence sea bass feeding, they can affect feeding behavior by altering the aquatic environment and sea bass physiology [
12]. Extended cloudy weather and low light conditions can reduce dissolved oxygen levels, potentially impairing feeding and growth. Thus, accurate feeding prediction models must incorporate meteorological influences to improve feeding management. However, most existing models rely on linear regression or traditional machine learning techniques, which are unable to fully capture the complex nonlinear relationships in time-series data involving biomass, water quality, and meteorological factors. This limitation underscores the need for more advanced predictive models to enhance both forecast accuracy and management efficiency in aquaculture.
Recently, the transformer model has demonstrated significant success in natural language processing due to its powerful self-attention mechanism, which overcomes the inefficiencies, gradient explosion, and parallelization issues inherent in traditional models when processing long sequences [
13]. The transformer excels at temporal modeling, enabling the capture of global dependencies in input data. Zhu et al. [
14] developed a semi-supervised transformer network based on adaptive density proxies for fish density estimation in recirculating aquaculture systems, showcasing improved estimation performance and providing a novel tool for aquaculture management. Liu et al. [
15] integrated the Swin transformer model with multi-scale feature fusion to propose an advanced underwater fish segmentation method. This approach leverages self-attention mechanisms to capture long-range dependencies and integrates multi-resolution features, enhancing performance in complex scenarios. Thus, applying transformer models to predict sea bass feeding amounts offers a promising solution to improve aquaculture management accuracy and efficiency. However, when dealing with long time-series data, transformer models face challenges in extracting complex nonlinear relationships among multiple features, limiting their ability to effectively integrate data from varying time scales in sea bass feeding prediction. Liu et al. [
16] and Zhang et al. [
17] addressed this limitation by using multi-scale feature extraction and sparse connections, significantly improving prediction accuracy in time-series data and compensating for the transformer model’s feature extraction shortcomings.
In summary, this study addresses the shortcomings of traditional feeding prediction methods that rely on historical data and experience, as well as the challenges posed by non-linear fish growth and the failure to account for water quality and meteorological influences. First, the logistic growth curve is applied to model the growth stages of sea bass and establish the relationship between biomass and feeding amount. Given the significant impact of water quality and meteorological factors on sea bass feeding behavior in real aquaculture settings, this study further explores the complex interactions between environmental parameters and feeding amounts to enhance model prediction accuracy. Additionally, the original feature set often contains highly correlated features that introduce redundancy, increasing model complexity and reducing generalization ability. To address this, a Spearman correlation analysis and random forest-based feature optimization method is proposed to construct a more precise and effective feature set. Since biomass, water quality, and meteorological data exhibit time-series characteristics, the transformer model is used to explore complex nonlinear relationships within time-series data. A dual-encoder architecture is proposed to handle inputs with different time scales for historical feeding amounts, biomass, water quality, and meteorological data. Furthermore, a multi-scale feature fusion approach is introduced to address inconsistencies across time scales, enhancing the model’s ability to capture temporal dependencies. This results in the development of an optimized transformer-based feeding prediction model for sea bass, which is compared with other models to validate its accuracy and reliability.
4. Discussion
4.1. Relationship with Previous Studies and Hypothesis Verification
The results of this study are consistent with and supplement the existing research on aquaculture feeding prediction. The application of the logistic growth curve model in describing the growth of sea bass is in line with the research conclusions of Xiaoyan et al. [
4] and Xiao Jun et al. [
5], who also used the logistic model to effectively simulate the growth process of fish, which confirms that the growth curve model can better reflect the nonlinear growth characteristics of fish and provides a reasonable basis for establishing the relationship between biomass and feeding amount. This overcomes the defect that traditional methods only rely on historical data and experience to predict feeding amount and cannot adapt to the dynamic changes of fish growth [
2].
In terms of feature optimization, the Spearman + RF method screens out key features that have a significant impact on the feeding amount, which is consistent with the research of Buentello et al. [
9] that water quality factors such as water temperature and dissolved oxygen affect fish feeding behavior. At the same time, this method removes redundant features, reduces the complexity of the model, and improves the generalization ability, which is also supported by the view of Chen et al. [
11] that optimizing the feature set is beneficial to improving the performance of the prediction model.
The excellent performance of the proposed FO-DEMST model in feeding amount prediction verifies the hypothesis that integrating multi-scale feature fusion and dual-encoder structure into the transformer model can improve the prediction accuracy. This is similar to the research results of Liu et al. [
16] and Zhang et al. [
17], who pointed out that processing time series data with multi-scale methods can better capture the temporal dependencies in the data. The dual-encoder structure in the model separately processes different types of data, which solves the problem that the traditional transformer model has difficulty in effectively handling data with different time scales in aquaculture scenarios [
14], and further confirms the effectiveness of the model design.
4.2. Implications of the Findings
The improvement of the prediction accuracy of the FO-DEMST model has important practical significance for aquaculture production. Accurate feeding amount prediction can avoid the waste of feed caused by overfeeding and the slow growth of fish caused by underfeeding, thereby reducing aquaculture costs and improving economic benefits. At the same time, reducing feed waste can also reduce the pollution of the aquaculture environment caused by residual feed, which is conducive to the sustainable development of aquaculture.
The model is applicable to both open pond aquaculture and closed recirculating aquaculture systems, which expands its scope of application. In open ponds, where environmental factors such as meteorology and water quality are more variable, the model can comprehensively consider these factors to make more accurate predictions. In recirculating aquaculture systems with relatively stable environmental conditions, the model can also play a role in precise feeding by relying on key parameters, providing technical support for intelligent aquaculture.
4.3. Future Research Directions
Although the FO-DEMST model has achieved good results, there are still some aspects that can be improved in future research. Firstly, the research data comes from a single aquaculture site, and the model’s adaptability in different regions and aquaculture environments needs to be further verified. Expanding the sample range and including more diverse aquaculture scenarios can enhance the generalization ability of the model.
Secondly, the current model does not take into account some biological factors such as the health status of sea bass and the stocking density. Adding these factors into the model can make the prediction more in line with the actual situation of aquaculture. For example, combining with underwater monitoring technology to obtain the activity status and health information of fish in real time, and integrating them into the model as features.
In addition, the current model mainly focuses on short-term feeding amount prediction. In the future, we can explore the long-term prediction of feeding amount, considering the seasonal changes and long-term growth trends of fish, so as to provide more comprehensive decision-making support for aquaculture production planning and feed management.
Finally, the computational complexity of the FO-DEMST model is relatively high. Reducing the model’s complexity while ensuring prediction accuracy is also a direction worthy of research, and is conducive to the popularization and application of the model in actual aquaculture production.
5. Conclusions
This study presents a model that combines feature optimization with an enhanced transformer architecture for predicting the feeding amounts of European seabass (Dicentrarchus labrax). By using a logistic growth curve to model the varying growth characteristics of the seabass at different stages, the relationship between biomass and feeding amounts is established. Since feeding behavior in aquaculture is influenced by factors such as water quality and environmental conditions, the model incorporates the relationship between the farming environment and feeding amounts. Feature optimization was performed through Spearman correlation analysis and random forest-based feature selection, resulting in a refined and effective feature set.
Additionally, the FO-DEMST model processes input data at different temporal granularities, effectively capturing multi-scale temporal dependencies to achieve accurate predictions. Experimental results show that after feature optimization, the FO-DEMST model achieves an MSE of 0.42 and an MAE of 0.31 for feeding amount prediction. These results demonstrate a significant improvement in prediction accuracy over the transformer model. Furthermore, the FO-DEMST model outperforms CNN, LSTM, and transformer models in terms of both accuracy and efficiency, confirming the superiority and effectiveness of the FO-DEMST model.
The improved prediction accuracy of the FO-DEMST model, as evidenced by reductions in MSE and MAE, translates into tangible benefits for aquaculture operations. Specifically, more precise feeding predictions can lead to significant savings in feed costs—by minimizing waste—and reduce management time through optimized feeding schedules, ultimately enhancing overall farm efficiency and sustainability.
This model is particularly suited for both open pond systems and closed-loop recirculating aquaculture systems (RAS). In open pond settings, where environmental factors like water temperature, dissolved oxygen, and weather conditions can fluctuate widely, the inclusion of these variables in the model allows for more accurate feeding predictions. Similarly, in RAS, where precise control over water quality parameters is possible, the model’s ability to integrate detailed environmental data enhances its predictive accuracy. However, the implementation may require adjustments depending on the specific monitoring capabilities and operational scales of different facilities.