Enhanced SOLOv2: An Effective Instance Segmentation Algorithm for Densely Overlapping Silkworms
Abstract
1. Introduction
- Dataset Construction: A dedicated dataset for silkworm segmentation was built, comprising 1264 images captured under high-density breeding conditions to encompass diverse scenarios. This dataset provides essential data support for silkworm instance segmentation research.
- Baseline Model Identification: A systematic comparison of mainstream instance segmentation methods (including YOLOv5n-seg, YOLOv5s-seg, YOLACT, SOLOv1, SOLOv2, and Mask R-CNN) was conducted for silkworm segmentation. Based on comprehensive evaluation metrics, SOLOv2 was identified as the optimal baseline model.
- Enhanced SOLOv2 Architecture: An improved SOLOv2 architecture with dual-branch modifications was proposed. In the ResNet backbone network, three modules (LDConv, HWD, and EAMF-Net) were introduced to adapt to silkworm morphological variations, enhance feature extraction for partially visible targets, and resolve geometric continuity in occluded regions. In the mask feature branch, three components (DySample, ASFF, and SimAM) were sequentially integrated for improving mask generation quality.
- Comprehensive Experimental Validation: Extensive experiments were conducted on the self-constructed dataset, with both qualitative and quantitative results validating the effectiveness and accuracy of the proposed algorithm for silkworm segmentation. Notably, the method demonstrates superior segmentation precision for partially visible silkworms and achieves higher boundary adherence when segmenting overlapping silkworm regions.
2. Materials and Methods
2.1. Overall Workflow
2.2. Dataset Construction
2.3. Baseline Model Selection for Silkworm Instance Segmentation
2.4. Enhanced SOLOv2 Model
2.4.1. SOLOv2 Model Overview
2.4.2. Feature-Enhanced Backbone Network
- 1.
- Linear Deformable Convolution (LDConv)
- 2.
- Haar Wavelet-Based Downsampling (HWD)
- 3.
- Edge-Augmented Multi-attention Fusion Network (EAMF-Net)
2.4.3. Enhanced Mask Feature Branch Network
- 1.
- Dynamic Upsampling Module (Dysample)
- 2.
- Adaptively Spatial Feature Fusion (ASFF)
- 3.
- Simple Attention Module (SimAM)
3. Experimental Results and Discussion
3.1. Experimental Setup
3.2. Results Analysis
3.2.1. Silkworm Segmentation Metrics
3.2.2. Ablation Study on Improved SOLOv2
3.2.3. Comparative Experiments with Mainstream Instance Segmentation Methods
3.2.4. Visual Evaluation
3.2.5. Segmentation Confidence
3.3. Feasibility Analysis for Deployment on Low-Resource Devices
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Yang, S.; Zuo, X.; Zhao, Z. Research on bombyx mori recognition in rearing seats based on image processing technology. Acta Sericol. Sin. 2022, 48, 5. [Google Scholar]
- Hu, X.; Qian, L.; Zhang, D. Research on segmentation technology of Nosema bombycis images. Mod. Electron. Tech. 2018, 41–44. [Google Scholar]
- Luo, Y.; Wen, C.; Wen, J.; Xu, Q.; Liang, X.; Yang, J.; Lan, Z. Data density-based RGB color component feature extraction and area measurement method for silkworm. J. Asia Pac. Entomol. 2023, 26, 102153. [Google Scholar] [CrossRef]
- Gowramma, B.H.; Poornima, B.; Ranjana, B.J. Enhancing Silkworm Breeding Through Advanced Image Segmentation: A Comparative Study of U-Net, Mask R-CNN, And FCN. In Proceedings of the 2024 First International Conference on Innovations in Communications, Electrical and Computer Engineering (ICICEC), Davangere, India, 24–25 October 2024. [Google Scholar]
- Wang, Y.; Jiang, X.; Liu, Y. Design and implementation of silkworm detection system based on deep learning. Electron. Compon. Inf. Technol. 2024, 8, 67–70. [Google Scholar]
- He, R.; Zheng, K.; Wei, Q. Research on silkworm recognition and counting in industrialized rearing based on improved Mask R-CNN model. Smart Agric. 2022, 4, 163–173. [Google Scholar]
- Meng, K.; Xu, K.; Cattani, P.; Mei, S. Camouflaged cotton bollworm instance segmentation based on PVT and Mask R-CNN. Comput. Electron. Agric. 2024, 226, 109450. [Google Scholar] [CrossRef]
- Tassis, L.M.; de Souza, J.E.T.; Krohling, R.A. A deep learning approach combining instance and semantic segmentation to identify diseases and pests of coffee leaves from in-field images. Comput. Electron. Agric. 2021, 186, 106191. [Google Scholar] [CrossRef]
- Uygun, T.; Ozguven, M.M. Determination of tomato leafminer: Tuta absoluta (Meyrick) (Lepidoptera: Gelechiidae) damage on tomato using deep learning instance segmentation method. Eur. Food Res. Technol. 2024, 250, 1837–1852. [Google Scholar] [CrossRef]
- Chen, L.; Strauch, M.; Daub, M.; Jansen, M.; Luigs, H.-G.; Merhof, D. Instance Segmentation of Nematode Cysts in Microscopic Images of Soil Samples. In Proceedings of the 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Berlin, Germany, 23–27 July 2019. [Google Scholar]
- Guo, Y.; Gao, J.; Wang, X.; Jia, H.; Wang, Y.; Zeng, Y.; Tian, X.; Mu, X.; Chen, Y.; OuYang, X. Precious Tree Pest Identification with Improved Instance Segmentation Model in Real Complex Natural Environments. Forests 2022, 13, 2048. [Google Scholar] [CrossRef]
- Hariharan, B.; Arbeláez, P.; Girshick, R.; Malik, J. Simultaneous detection and segmentation. In Proceedings of the Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, 6–12 September 2014; Proceedings, Part VII 13. Springer International Publishing: Berlin/Heidelberg, Germany, 2014; pp. 297–312. [Google Scholar]
- Yuan, J.; Cheng, C.; Liu, Z.; Li, H. A segmentation method for high-density overlapping silkworms based on the SOLOv2 framework. J. Phys. Conf. Ser. 2025. submitted. [Google Scholar]
- Wang, X.; Zhang, R.; Kong, T.; Li, L.; Shen, C. SOLOv2: Dynamic and Fast Instance Segmentation. In Proceedings of the 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, BC, Canada, 6–12 December 2020. [Google Scholar]
- Zhang, X.; Song, Y.; Song, T.; Yang, D.; Ye, Y.; Zhou, J.; Zhang, L. LDConv: Linear deformable convolution for improving convolutional neural networks. Image Vis. Comput. 2024, 149, 105190. [Google Scholar] [CrossRef]
- Xu, G.; Liao, W.; Zhang, X.; Li, C.; He, X.; Wu, X. Haar wavelet downsampling: A simple but effective downsampling module for semantic segmentation. Pattern Recognit. 2023, 143, 109819. [Google Scholar] [CrossRef]
- Gao, S.; Zhang, P.; Yan, T.; Lu, H. Multi-scale and detail-enhanced segment anything model for salient object detection. In Proceedings of the 32nd ACM International Conference on Multimedia, Melbourne, Australia, 28 October 2024. [Google Scholar]
- Liu, W.; Lu, H.; Fu, H.; Cao, Z. Learning to upsample by learning to sample. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Paris, France, 1–6 October 2023. [Google Scholar]
- Liu, S.; Huang, D.; Wang, Y. Learning spatial fusion for single-shot object detection. arXiv 2019, arXiv:1911.09516. [Google Scholar] [CrossRef]
- Yang, L.; Zhang, R.Y.; Li, L.; Xie, X. Simam: A simple, parameter-free attention module for convolutional neural networks. In Proceedings of the International Conference on Machine Learning, Vienna, Austria, 18–24 July 2021. [Google Scholar]
- Huang, Z.; Huang, L.; Gong, Y.; Huang, C.; Wang, X. Mask scoring r-cnn. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019. [Google Scholar]
- Cai, Z.; Vasconcelos, M. Cascade R-CNN: High quality object detection and instance segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 43, 1483–1498. [Google Scholar] [CrossRef] [PubMed]
- Tian, Z.; Shen, C.; Chen, H. Conditional convolutions for instance segmentation. In Computer Vision–ECCV 2020, Proceedings of the 16th European Conference, Glasgow, UK, 23–28 August 2020; Proceedings, Part I; Springer International Publishing: Glasgow, UK, 2020; Volume 16, pp. 282–298. [Google Scholar]
Module | SOLOv2 | AP | APs | APm | APl | Size (Mb) | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
Baseline | × | × | × | × | × | × | 0.809 | 0.626 | 0.818 | 0.856 | 178 |
LDConv | √ | × | × | × | × | × | 0.817 | 0.638 | 0.823 | 0.867 | 160 |
HWD | √ | √ | × | × | × | × | 0.828 | 0.669 | 0.837 | 0.872 | 208 |
Dysample | √ | √ | √ | × | × | × | 0.833 | 0.682 | 0.848 | 0.871 | 209 |
ASFF | √ | √ | √ | √ | × | × | 0.840 | 0.687 | 0.847 | 0.876 | 209 |
SimAM | √ | √ | √ | √ | √ | × | 0.843 | 0.707 | 0.851 | 0.869 | 209 |
EAMF-Net | √ | √ | √ | √ | √ | √ | 0.851 | 0.728 | 0.858 | 0.876 | 306 |
Category | Model | AP | APS | APm | APl | Size (Mb) | Time (ms) |
---|---|---|---|---|---|---|---|
Two-stage | Mask Scoring R-CNN | 0.794 | 0.753 | 0.786 | 0.834 | 464 | 120.8 |
Cascade Mask R-CNN | 0.721 | 0.734 | 0.704 | 0.791 | 564 | 140.2 | |
Single-stage | SOLOv2 | 0.809 | 0.626 | 0.818 | 0.856 | 178 | 88.3 |
CondInst | 0.814 | 0.661 | 0.818 | 0.871 | 236 | 100.5 | |
Ours | 0.851 | 0.728 | 0.858 | 0.876 | 306 | 105.4 |
Confidence Levels | SOLOv2 (%) | Ours (%) |
---|---|---|
Highly confident (>0.9) | 9.61 | 88.80 |
Confident (0.7–0.9) | 78.83 | 9.70 |
Low confidence (<0.7) | 19.57 | 1.50 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yuan, J.; Li, H.; Cheng, C.; Liu, Z.; Wu, S.; Guo, D. Enhanced SOLOv2: An Effective Instance Segmentation Algorithm for Densely Overlapping Silkworms. Sensors 2025, 25, 5703. https://doi.org/10.3390/s25185703
Yuan J, Li H, Cheng C, Liu Z, Wu S, Guo D. Enhanced SOLOv2: An Effective Instance Segmentation Algorithm for Densely Overlapping Silkworms. Sensors. 2025; 25(18):5703. https://doi.org/10.3390/s25185703
Chicago/Turabian StyleYuan, Jianying, Hao Li, Chen Cheng, Zugui Liu, Sidong Wu, and Dequan Guo. 2025. "Enhanced SOLOv2: An Effective Instance Segmentation Algorithm for Densely Overlapping Silkworms" Sensors 25, no. 18: 5703. https://doi.org/10.3390/s25185703
APA StyleYuan, J., Li, H., Cheng, C., Liu, Z., Wu, S., & Guo, D. (2025). Enhanced SOLOv2: An Effective Instance Segmentation Algorithm for Densely Overlapping Silkworms. Sensors, 25(18), 5703. https://doi.org/10.3390/s25185703