AGREE-YOLO: A Framework for Seafood Recognition and Cross-Cultural Gastronomic Recommendation
Abstract
1. Introduction
2. Related Work
3. Methodology
3.1. System Architecture of AGREE-YOLO
| Algorithm 1 End-to-end workflow of AGREE-YOLO |
| Input: Raw seafood image I, user natural-language query Q Output: Culturally adapted recipe R, dish image D |
| 1: B ← YOLOv13_GSConv_WIoU (I) → Bounding boxes & class labels 2: for each detection b in B do |
| 3: confidence ← b.confidence 4: if confidence ≥ 0.83 then |
| 5: store_into_MySQL (b.class_label, b.confidence, timestamp) 6: end if 7: end for 8: refined_query ← USRR (Q) → User Seafood Requirement Refinemen 9: SQL_stmt ← rookie_text2data (refined_query) → Convert NL to SQL 10: records ← rookie_execute_sql (SQL_stmt) → Fetch from MySQ 11: cuisine ← CCS (records) → Conditional Country Selection 12: R ← CMG (records, cuisine) → Cooking Method Generato 13: D ← SDP (records, cuisine) → Seafood Dish Picture Generator 14: return (R, D) |
3.2. YOLO-Based Seafood Detection
3.2.1. Dataset Preparation
3.2.2. Lightweight, Real-Time-Optimized YOLOv13 Architecture

- Parameter Reduction: The hybrid convolutional strategy of GSConv minimizes the number of trainable parameters.
- Real-Time Performance: The reduced computational load increases the inference speed, which is crucial for real-time applications.
- Detection Accuracy: The dynamic focusing mechanism of the WIoU improves the localization accuracy, which is reflected by higher mean average precision (mAP) and recall values.
3.3. Data Storage and Management
- Detection_ID (Identity Document): An autoincrementing primary key that uniquely indexes each detection event, thereby enabling granular tracking of individual results;
- Detection_Timestamp: To record the precise date and time at which each seafood detection event is executed by the YOLO-based recognition module, thereby enabling temporal traceability, chronological querying, and synchronization with downstream agent-mediated recommendation processes.
- Yolo_Confidence_Score: A DECIMAL field with a CHECK constraint (≥0.83) for filtering low-reliability detections to ensure that only high-confidence results are used for recommendation logic;
- Image_Source_ID: A unique, nonnull VARCHAR field that records the identifier (e.g., file path or dataset ID) of the original image, thereby enabling traceability back to raw imagery for result verification or reanalysis;
- Nutritional_Ingredient: A nonnull ENUM field restricted to “Holothurian,” “Echinus,” or “Scallop,” which ensures standardized classification of detected seafood and eliminates ambiguity in agent queries.
3.4. ChatFlow Integration in the Dify Environment
3.5. Seafood-Centric Agent-Based LLM Reasoning for Culturally Adaptive and Psychologically Pleasing Recipe Generation in the Dify Environment
4. Experimental Results
4.1. Experimental Evaluation of YOLO-Based Seafood Detection
4.2. Operational Validation of the AGREE Framework in the Dify Environment
- (1)
- Evaluation methodology and metrics. A dedicated human evaluation framework was designed and implemented to assess the end-to-end performance of the cross-cultural gastronomic recommendation system. The evaluation was conducted using a 5-point Likert scale (1 = strongly disagree, 5 = strongly agree) with three core quantitative metrics:
- Recommendation usefulness: Measured the practicality and implementability of generated recipes for real culinary scenarios.
- Cultural authenticity: Assessed alignment of recipes, cooking procedures, and dish presentations with traditional regional culinary standards.
- System usability: Evaluated the intuitiveness, stability, and ease of interaction of the AGREE-YOLO pipeline.
- (2)
- Statistical outcomes of human user evaluation. The human evaluation yielded robust statistical results that strongly validate the system’s performance:
- The average score for recommendation usefulness reached 4.72 out of 5.00, demonstrating high practical value for real-world cooking applications.
- The average score for cultural authenticity was 4.68 out of 5.00, confirming strict alignment with regional culinary norms and expert-recognized cooking procedures.
- The overall user satisfaction rate was 96.8%, reflecting strong acceptance among both professional chefs and ordinary consumers.
- Personalization effectiveness was validated via comparative analysis, confirming that generated recipes and dish images were accurately tailored to user-specified cultural preferences and ingredient contexts.
5. Discussion
5.1. Performance of the Optimized YOLOv13 Model for Seafood Detection
5.2. Efficacy of the AGREE Framework in Cross-Cultural Recommendation
5.3. Practical Implications and Limitations
5.4. Comparison with Related Work
6. Conclusions and Future Work
6.1. Conclusions
6.2. Future Directions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- De Cock, A.; Forio, M.A.E.; Hansen, H.H.; Jacxsens, L.; Lachat, C.; Goethals, P. Integrated control of seafood safety and quality beyond farm-to-fork: AI-related opportunities and challenges. Trends Food Sci. Technol. 2025, 163, 105096. [Google Scholar] [CrossRef]
- Xu, M.Y.; Fang, D.L.; Kimatu, B.M.; Lyu, L.; Wu, W.L.; Cao, F.L.; Li, W.L. Recent advances in anthocyanin-based films and its application in sustainable intelligent food packaging: A review. Food Control 2024, 162, 110431. [Google Scholar] [CrossRef]
- Zhu, J.D.; He, W.W.; Weng, W.D.; Zhang, T.; Mao, Y.Z.; Yuan, X.T.; Ma, P.Z.; Mao, G.J. An Embedding Skeleton for Fish Detection and Marine Organisms Recognition. Symmetry 2022, 14, 1082. [Google Scholar] [CrossRef]
- Taylor, B.; Ofori, K.F.; Parsaeimehr, A.; Evrendilek, G.A.; Attarwala, T.; Ozbay, G. Exploring the Complexities of Seafood: From Benefits to Contaminants. Foods 2025, 14, 1461. [Google Scholar] [CrossRef]
- Mahbubi, A.; Fatoni, A.; Iskandar, S.P.; Yunita, E. Promoting local food into gastro-tourism at the Gunung Kubing Geosite of Belitong UNESCO global geopark. Int. J. Gastron. Food Sci. 2025, 42, 101339. [Google Scholar] [CrossRef]
- Ge, C.; Zhang, G.J.; Wang, Y.J.; Shao, D.D.; Song, X.J.; Wang, Z.W. Research Status and Development Trends of Artificial Intelligence in Smart Agriculture. Agriculture 2025, 15, 2247. [Google Scholar] [CrossRef]
- Wu, H.Y.; Jia, B.W.; Yuan, X.M. LLM-led vision-spectral fusion: A zero-shot approach to temporal fruit image classification. Neural Netw. 2026, 194, 108155. [Google Scholar] [CrossRef]
- Li, H.L.; Li, J.; Wei, H.B.; Liu, Z.; Zhan, Z.F.; Ren, Q.L. Slim-neck by GSConv: A lightweight-design for real-time detector architectures. J. Real-Time Image Process. 2024, 21, 62. [Google Scholar] [CrossRef]
- Wang, J.L.; Qin, C.C.; Hou, B.B.; Yuan, Y.; Zhang, Y.K.; Feng, W.F. LCGSC-YOLO: A lightweight apple leaf diseases detection method based on LCNet and GSConv module under YOLO framework. Front. Plant Sci. 2024, 15, 1398277. [Google Scholar] [CrossRef]
- Su, B.; Zhu, Y.Y.; Lin, Y.F. Pest-YOLOv8: Enhanced detection for small and complex agricultural pests using triple attention and wise-IoU. J. Plant Dis. Prot. 2025, 132, 135. [Google Scholar] [CrossRef]
- Tang, Y.S.; Zhang, Y.; Xiao, J.R.; Cao, Y.; Yu, Z.J. An Enhanced Shuffle Attention with Context Decoupling Head with Wise IoU Loss for SAR Ship Detection. Remote Sens. 2024, 16, 4128. [Google Scholar] [CrossRef]
- Fu, T.Q.; Hu, Q.; Zhao, J.W.; Jiang, G.Y.; Shan, L.H.; Rong, Y. Underwater target detection and recognition based on cross-modal fusion of flow and electric information. Measurement 2025, 246, 116681. [Google Scholar] [CrossRef]
- Joshi, A.; Pandey, N.; Diwakar, M.; Singh, P.; Shankar, A.; Alqahtani, F. WBD-YOLO-AM: YOLOv8 with attention module and IoT-based wild boar detection and deterrence system for safeguarding small millets. Turk. J. Agric. For. 2025, 49, 769–786. [Google Scholar] [CrossRef]
- Zhang, W.X.; Shi, X.W.; Jiang, M.L.; Zhang, A.; Zeng, L.G.; Al-qaness, M.A.A. Improved you only look once for weed detection in soybean field under complex background. Eng. Appl. Artif. Intell. 2025, 151, 110762. [Google Scholar] [CrossRef]
- Dong, C.L.; Ying, H.C.; Hu, R.J.; Xu, Y.Y.; Chen, J.T.; Zhuang, F.Z.; Wu, J. A Progressively-Passing-Then-Disentangling Approach to Recipe Recommendation. IEEE Trans. Multimed. 2025, 27, 2760–2771. [Google Scholar] [CrossRef]
- Foster, J.; Brintrup, A. Aiding food security and sustainability efforts through graph neural network-based consumer food ingredient detection and substitution. Sci. Rep. 2025, 13, 18809. [Google Scholar] [CrossRef]
- Vats, M.; Flinders, B.; Visvikis, T.; Dawid, C.; Hofmann, T.F.; Cuypers, E.; Heeren, R.M.A. Mass Spectrometry Imaging for Spatial Ingredient Classification in Plant-Based Food. J. Am. Soc. Mass Spectrom. 2024, 36, 100–107. [Google Scholar] [CrossRef]
- Enriquez, J.P.; Archila-Godinez, J.C. Social and cultural influences on food choices: A review. Crit. Rev. Food Sci. Nutr. 2022, 62, 3698–3704. [Google Scholar] [CrossRef] [PubMed]
- Hu, Y.; Zhang, X.M.; Fang, Y.K.; Gao, Z.F. The influence of multicultural experience on attitudes towards new foods in the U.S. Appetite 2025, 206, 107822. [Google Scholar] [CrossRef]
- Khanna, S.K. Cultural Influences on Food: Dietary and Health Implications. Ecol. Food Nutr. 2021, 60, 633–635. [Google Scholar] [CrossRef]
- Krzyzewska, A. AI Advice for Amateur Food Production: Assessing Sustainability of LLM Recommendations. Sustainability 2025, 17, 10466. [Google Scholar] [CrossRef]
- Mamun, A.; Arefeen, A.; Racette, S.B.; Sears, D.D.; Whisner, C.M.; Buman, M.P.; Ghasemzadeh, H. LLM-Powered Prediction of Hyperglycemia and Discovery of Behavioral Treatment Pathways from Wearables and Diet. Sensors 2025, 25, 5372. [Google Scholar] [CrossRef]
- Wang, X.L.; Min, W.Q.; Sheng, G.R.; Song, J.R.; Yang, Y.C.; Yao, T.; Jiang, S.Q. LLM-informed global-local contextualization for zero-shot food detection. Pattern Recognit. 2026, 173, 112928. [Google Scholar] [CrossRef]
- Nian, F.D.; Hu, Y.J.; Gu, Y.H.; Wu, Z.Z.; Yang, S.M.; Shu, J.H. Ingredient-guided multi-modal interaction and refinement network for RGB-D food nutrition assessment. Digit. Signal Process. 2024, 153, 104664. [Google Scholar] [CrossRef]
- Qiao, G.H.; Nong, L.M.; Cheng, C.Y.; Shen, Z.W.; Zhu, J.L.; Li, H. ProFood: Progressive RGB-D fusion network for food detection in complex diet scenes. J. Food Compos. Anal. 2026, 149, 108773. [Google Scholar] [CrossRef]
- Zhang, E.S.; Li, A.Z.; Zhang, G.X.; Lu, W.H.; Zhang, Q.X.; Chen, L.; Jiang, L.; Ju, P.; Qu, F.L. A shikimic acid derived carbon dots (SACNDs-FITC) for multi-modal detection and removal of Hg2+: Probe design, sensing performance, and applications in food analysis. Spectrochim. Acta Part A-Mol. Biomol. Spectrosc. 2025, 331, 125765. [Google Scholar] [CrossRef]
- Öttl, A.; Termansen, M. Agent-Based Modelling of food systems: A scoping review on incorporation of behavioural insights. Environ. Model. Softw. 2025, 193, 106617. [Google Scholar] [CrossRef]
- Wang, Y.K.; Yang, Y.; Slanzi, C.M.; Li, X.L.; Ojeda, A.; Paro, F.; Deblais, L.; Yakubu, H.; Hassen, B.M.; Game, H.; et al. Quantitative multi-pathway assessment of exposure to Escherichia coli for infants in Rural Ethiopia. PLoS Neglected Trop. Dis. 2025, 19, e0013154. [Google Scholar] [CrossRef] [PubMed]
- Li, J.X.; Yan, Z.P. EAMSF-DETR: Edge-aware multi-scale feature fusion network based on DETR for underwater object detection. Opt. Laser Technol. 2026, 194, 114357. [Google Scholar] [CrossRef]
- Hu, Z.Y.; Chen, Q. MOA-YOLO: An Accurate, Real-Time, and Lightweight YOLOv10-Based Algorithm for Deep-Sea Fish Detection. IEEE Sens. J. 2025, 25, 23933–23947. [Google Scholar] [CrossRef]
- Mou, D.; Wei, Z.H.; Ni, L.; Song, N.; Sun, Y.W.; Chu, W.Z.; Jin, B.K. LLM-enhanced representation learning for graph collaborative filtering recommendation models. J. Intell. Inf. Syst. 2025, 63, 1179–1202. [Google Scholar] [CrossRef]
- Kopitar, L.; Bedrac, L.; Strath, L.J.; Bian, J.; Stiglic, G. Improving Personalized Meal Planning with Large Language Models: Identifying and Decomposing Compound Ingredients. Nutrients 2025, 17, 1492. [Google Scholar] [CrossRef] [PubMed]
- Chen, J.; Er, M.J. A hybrid architecture based on structured state space sequence model and convolutional neural network for real-time object detection. Eng. Appl. Artif. Intell. 2025, 156, 111067. [Google Scholar] [CrossRef]
- Ahmadian, S.; Rostami, M.; Jalali, S.M.J.; Oussalah, M.; Farrahi, V. A healthy and reliable rating profile expansion approach to address data sparsity in food recommendation systems. Knowl. Inf. Syst. 2025, 67, 3699–3735. [Google Scholar] [CrossRef]
- Cao, Y.Y.; Zhao, Z.X.; Huang, Y.; Lin, X.; Luo, S.Y.; Xiang, B.R.; Yang, H.C. Case instance segmentation of small farmland based on Mask R-CNN of feature pyramid network with double attention mechanism in high resolution satellite images. Comput. Electron. Agric. 2023, 212, 108073. [Google Scholar] [CrossRef]
- Feng, J.H.; Zhao, X.R.; Zhu, T.Y.; Li, T.; Qiu, Z.C.; Li, Z.W. Detection mature bud for daylily based on Faster R-CNN integrated with CBAM. IEEE Access 2023, 11, 81646–81655. [Google Scholar] [CrossRef]
- Ma, P.H.; Tsai, S.W.; He, Y.Y.; Jia, X.X.; Zhen, D.Y.; Yu, N.; Wang, Q.; Ahuja, J.K.C.; Wei, C. Large language models in food science: Innovations, applications, and future. Trends Food Sci. Technol. 2024, 148, 104488. [Google Scholar] [CrossRef]
- Rezayi, S.; Liu, Z.L.; Wu, Z.H.; Dhakal, C.; Ge, B.; Dai, H.X.; Mai, G.C.; Liu, N.H.; Zhen, C.; Liu, T.M.; et al. Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications. IEEE Trans. Big Data 2025, 11, 1235–1246. [Google Scholar] [CrossRef]
- Chen, H.Y.; Li, Y.Z.; Xu, P.; Li, J.Q.; Noor, A.; Zhou, X.Y.; He, W.C.; Wang, T.R.; Mou, Z.Y.; Song, L.G.; et al. Octopus-inspired soft gripper with embedded triboelectric tactile sensor for underwater target recognition and grasp. Nano Energy 2025, 140, 111007. [Google Scholar] [CrossRef]
- Dong, K.; Li, D.Y.; Zhang, J.J.; Zhao, X.L.; Dong, L.J.; Zhang, P. RV-YOLO: Real-time object detection algorithm for rail transit platform scenarios. J. Real-Time Image Process. 2025, 22, 143. [Google Scholar] [CrossRef]
- Su, Z.B.; Zhao, K.Q.; Fan, Z.H.; Guo, X.H. RTPV-YOLO: Real-Time Photovoltaic Detection With UAV-Based Thermal and RGB Imaging. IEEE Trans. Aerosp. Electron. Syst. 2025, 61, 13799–13810. [Google Scholar] [CrossRef]
- Li, H.B.; Zhu, J.Y.; Mao, X.; Hao, X.L.; Li, S.Y.; Yu, Q.Y.; Shi, Y.; Qian, J.P. Achieving precise cropland parcel extraction from remote sensing images through integration of segment anything model and adaptive mask refinement. Comput. Electron. Agric. 2026, 243, 111347. [Google Scholar] [CrossRef]
- Chen, J.L.; Cui, Q.W.; Ye, Y. 3D reconstruction and landscape restoration of garden landscapes: An innovative approach combining deep features and graph structures. Front. Environ. Sci. 2025, 13, 1556042. [Google Scholar] [CrossRef]
- Xiao, Z.Y.; Diao, G.; Deng, Z.H. Fine grained food image recognition based on swin transformer. J. Food Eng. 2024, 380, 112134. [Google Scholar] [CrossRef]
- Kostic, M.; Sarac, V.; Narandzic, T.; Kovacevic, D.B. Digital and Green Technological Drivers of Transformation in the Agri-Food Sector. Foods 2026, 15, 1081. [Google Scholar] [CrossRef] [PubMed]
- Benyezza, H.; Bouhedda, M.; Kara, R.; Rebouh, S. Smart platform based on IoT and WSN for monitoring and control of a greenhouse in the context of precision agriculture. Internet Things 2023, 23, 100830. [Google Scholar] [CrossRef]
- Soekamto, Y.S.; Lim, A.; Limanjaya, L.C.; Purwanto, Y.K.; Lee, S.H.; Kang, D.K. Pic2Plate: A Vision-Language and Retrieval-Augmented Framework for Personalized Recipe Recommendations. Sensors 2025, 25, 449. [Google Scholar] [CrossRef] [PubMed]
- Morales-Garzón, A.; Gutiérrez-Batista, K.; Martin-Bautista, M.J. Adaptafood: An intelligent system to adapt recipes to specialised diets and healthy lifestyles. Multimed. Syst. 2025, 31, 87. [Google Scholar] [CrossRef]







| Column Name | Data Type | Constraints | Purpose |
|---|---|---|---|
| Detection_ID | INT | PRIMARY KEY, AUTO_INCREMENT | Unique index for each detection event |
| Yolo_Confidence_Score | DECIMAL (3.2) | CHECK (yolo_confidence_score ≥ 0.83) | Ensures high-reliability detection data |
| Detection_Timestamp | DATETIME | NOT NULL | Timestamp for detection traceability and synchronization |
| Image_Source_ID | VARCHAR (80) | UNIQUE, NOT NULL | Enables traceability to original imagery |
| Nutritional_Ingredient | VARCHAR (60) | NOT NULL, ENUM (‘Holothurian’, ‘Echinus’, ‘Scallop’) | Standardized ingredient classification |
| Hyperparameter | Value |
|---|---|
| batch size | 16 |
| learning rate | 0.01 |
| epochs | 300 |
| imgsz | 640 |
| mosaic | 1.0 |
| mixup | 0.0 |
| copy_paste | 0.1 |
| Model | Inference Time (ms) | P (%) | R (%) | Params (M) | GFLOPS |
|---|---|---|---|---|---|
| SSD | 7.38 | 0.613 | 0.546 | 14.12 | 36.4 |
| Faster R-CNN | 46.3 | 0.764 | 0.685 | 44.18 | 203.7 |
| YOLOv8n | 1.2 | 0.859 | 0.823 | 2.58 | 6.6 |
| YOLOv11n | 1.3 | 0.867 | 0.835 | 2.51 | 6.4 |
| YOLOv13n | 2.2 | 0.873 | 0.831 | 2.45 | 6.2 |
| YOLOv13n-GSConv-WIoU (Ours) | 2.0 | 0.912 | 0.873 | 2.35 | 6.1 |
| System | Visual Input | Seafood Detection Precision (%) | Cross-Cultural Adaptability (%) | End-to-End Automation (%) | Visual–Textual Integration (%) | Database Query Accuracy (%) | Image Generation |
|---|---|---|---|---|---|---|---|
| Pic2Plate [47] | Yes | 77 * | 23.5 | 40 | 68.0 | N/A † | No |
| KERL [32] | No | N/A | 31.2 | 20 | 0 | 56.7 | No |
| Adaptafood [48] | Yes | 0 ‡ | 45.8 | 60 | 52 | N/A † | No |
| AGREE-YOLO (Ours) | Yes | 91.2 ‡ | 92.7 | 100 | 89.5 | 94.3 | Yes |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Hou, M.; Liu, S.; Wei, J.; Zhi, K.; Liu, M.; Lin, C. AGREE-YOLO: A Framework for Seafood Recognition and Cross-Cultural Gastronomic Recommendation. Foods 2026, 15, 1795. https://doi.org/10.3390/foods15101795
Hou M, Liu S, Wei J, Zhi K, Liu M, Lin C. AGREE-YOLO: A Framework for Seafood Recognition and Cross-Cultural Gastronomic Recommendation. Foods. 2026; 15(10):1795. https://doi.org/10.3390/foods15101795
Chicago/Turabian StyleHou, Mingxin, Shucheng Liu, Jianhua Wei, Kunfang Zhi, Mingxin Liu, and Cong Lin. 2026. "AGREE-YOLO: A Framework for Seafood Recognition and Cross-Cultural Gastronomic Recommendation" Foods 15, no. 10: 1795. https://doi.org/10.3390/foods15101795
APA StyleHou, M., Liu, S., Wei, J., Zhi, K., Liu, M., & Lin, C. (2026). AGREE-YOLO: A Framework for Seafood Recognition and Cross-Cultural Gastronomic Recommendation. Foods, 15(10), 1795. https://doi.org/10.3390/foods15101795

