Local Polar Coordinate Feature Representation and Heterogeneous Fusion Framework for Accurate Leaf Image Retrieval
Abstract
1. Introduction
- We propose a novel shape representation, LPCFR, which captures the spatial distribution from two locally orthogonal perspectives while simultaneously extracting local curvature characteristics, thus improving the discriminative power of shape representation.
- We develop a new heterogeneous fusion framework, HFER, which enhances the compatibility and robustness of heterogeneous features by encoding both local contextual information and structural relations.
- Extensive experiments on both species-level and cultivar-level leaf datasets validate the effectiveness and generality of the proposed representation and fusion framework, consistently outperforming state-of-the-art methods.
2. Related Work
3. The Methodology
3.1. Local Polar Coordinate Feature Representation
3.1.1. Definition
3.1.2. Characters of LPCFR
3.2. Heterogeneous Feature Fusion with Exponential Weighting and Ranking
4. Experiments
4.1. Experimental Details
4.1.1. Experimental Setting
4.1.2. Evaluation Metrics
4.1.3. Baseline Methods
4.1.4. Datasets
- ICL220. The ICL220 leaf dataset [8] is a classic database, primarily designed for assessing the performance of plant recognition methods. It is collected from 220 plant species, each of which involves at least 26 leave images. All the images have a resolution of approximately 250 × 500 pixels and exhibit low intra-class variation alongside medium inter-class similarity. Here, the same equal set is adopted as in this paper [45], namely, taking the first 26 leaf images from each species and discarding the remainder for obtaining the ICL220 leaf subset with a total of 26 × 220 = 5720 images. Sample leaves from each species in the benchmark leaf datasets are displayed in Figure 7. Obviously, some leaves belonging to different species are very similar in shape.
- MEW2012. The second dataset used to test the recognition performance of our approach is the MEW2012 leaf dataset [44]. This dataset contains 9745 leaf images categorized into 153 species and each category has at least 50 leaf images. The images have a resolution of approximately 1000 × 2000 pixels and demonstrate moderate intra-class variation as well as moderate inter-class similarity. Sample leaves from each species in the MEW2012 leaf dataset are displayed in Figure 8. It is evident that the shapes of some images are highly similar, which makes the plant leaf recognition very challenging.
- SoyCultivar200. To further access the effectiveness and generality of the proposed methods, we perform retrieval experiments on the SoyCultivar200 [20] leaf dataset, which consists of three subsets. Each subset contains leaves collected from 200 soybean cultivars, with 10 leaf samples obtained from the upper, middle, and lower parts of each cultivar, respectively. The images possess a resolution of roughly 2000 × 3000 pixels and exhibit low variability within the same class, while showing a high degree of similarity between different classes. Figure 9, Figure 10, and Figure 11, respectively, show the sample leaves from the upper, middle, and lower parts of selected cultivars on the SoyCultivar200 dataset. It can be observed that the SoyCultivar200 dataset poses great challenges due to the high morphological similarity among different cultivars, which increases the difficulty of image retrieval. We conduct single leaf image pattern retrieval experiments on the three subsets. In addition, the 6000 leaf images are grouped into 2000 groups for joint pattern matching. These two retrieval tasks effectively reflect the performance of the proposed methods.
4.2. Experimental Results
4.2.1. Species-Level ICL220 Leaf Dataset
4.2.2. Species-Level MEW2012 Leaf Dataset
4.2.3. Cultivar-Level SoyCultivar200 Leaf Dataset
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Adamek, T.; O’Connor, N. A multiscale representation method for nonrigid shapes with a single closed contour. IEEE Trans. Circuits Syst. Video Technol. 2004, 14, 742–753. [Google Scholar] [CrossRef]
- Wang, X.; Feng, B.; Bai, X.; Liu, W.; Latecki, L.J. Bag of contour fragments for robust shape classification. Pattern Recognit. 2014, 47, 2116–2125. [Google Scholar] [CrossRef]
- Zheng, Y.; Guo, B.; Li, C.; Yan, Y. A Weighted Fourier and Wavelet-Like Shape Descriptor Based on IDSC for Object Recognition. Symmetry 2019, 11, 693. [Google Scholar] [CrossRef]
- Zhang, Z.; Peng, Y.; Duan, X.; Hou, Q.; Li, Z. Dual-axis generalized cross attention and shape-aware network for 2D medical image segmentation. Biomed. Signal Process. Control 2025, 107, 107791. [Google Scholar] [CrossRef]
- Feng, Y.F.; Shen, L.Y.; Yuan, C.M.; Li, X. Deep shape representation with sharp feature preservation. Comput.-Aided Des. 2023, 157, 103468. [Google Scholar] [CrossRef]
- Zhang, S.; Zhang, C.; Wang, X. Plant species recognition based on global–local maximum margin discriminant projection. Knowl.-Based Syst. 2020, 200, 105998. [Google Scholar] [CrossRef]
- Saleem, G.; Akhtar, M.; Ahmed, N.; Qureshi, W. Automated analysis of visual leaf shape features for plant classification. Comput. Electron. Agric. 2019, 157, 270–280. [Google Scholar] [CrossRef]
- Hu, R.X.; Jia, W.; Ling, H.; Huang, D. Multiscale distance matrix for fast plant leaf recognition. IEEE Trans. Image Process. Publ. IEEE Signal Process. Soc. 2012, 21, 4667–4672. [Google Scholar]
- Sokic, E.; Konjicija, S. Phase preserving Fourier descriptor for shape-based image retrieval. Signal Process. Image Commun. 2016, 40, 82–96. [Google Scholar] [CrossRef]
- Chen, X.; Wang, B. Invariant leaf image recognition with histogram of Gaussian convolution vectors. Comput. Electron. Agric. 2020, 178, 105714. [Google Scholar] [CrossRef]
- Yang, C. Plant leaf recognition by integrating shape and texture features. Pattern Recognit. 2021, 112, 107809. [Google Scholar] [CrossRef]
- Wu, H.; Fang, L.; Yu, Q.; Jingrong, Y.; Yang, C. Plant leaf identification based on shape and convolutional features. Expert Syst. Appl. 2023, 219, 119626. [Google Scholar] [CrossRef]
- Zhao, C.; Chan, S.F.; Cham, W.K.; Chu, L. Plant Identification using leaf shapes—A pattern counting approach. Pattern Recognit. 2015, 48, 3203–3215. [Google Scholar] [CrossRef]
- de Souza, M.M.; Medeiros, F.N.; Ramalho, G.L.; de Paula, I.C., Jr.; Oliveira, I.N. Evolutionary optimization of a multiscale descriptor for leaf shape analysis. Expert Syst. Appl. 2016, 63, 375–385. [Google Scholar] [CrossRef]
- Ben Haj Rhouma, M.; Žunić, J.; Younis, M.C. Moment invariants for multi-component shapes with applications to leaf classification. Comput. Electron. Agric. 2017, 142, 326–337. [Google Scholar] [CrossRef]
- Mahajan, S.; Raina, A.; Gao, X.Z.; Kant Pandit, A.; Yin, P.Y. Plant Recognition Using Morphological Feature Extraction and Transfer Learning over SVM and AdaBoost. Symmetry 2021, 13, 356. [Google Scholar] [CrossRef]
- Wu, H.; Fang, L.; Yu, Q.; Yang, C. Composite descriptor based on contour and appearance for plant species identification. Eng. Appl. Artif. Intell. 2024, 133, 108291. [Google Scholar] [CrossRef]
- Wang, B.; Gao, Y. Hierarchical String Cuts: A Translation, Rotation, Scale, and Mirror Invariant Descriptor for Fast Shape Retrieval. IEEE Trans. Image Process. 2014, 23, 4101–4111. [Google Scholar] [CrossRef]
- Yang, C.; Lyu, W.; Yu, Q.; Jiang, Y.; Zheng, Z. Learning a discriminative region descriptor for fine-grained cultivar identification. Comput. Electron. Agric. 2025, 229, 109700. [Google Scholar] [CrossRef]
- Chen, X.; Wang, B.; Gao, Y. Symmetric binary tree based co-occurrence texture pattern mining for fine-grained plant leaf image retrieval. Pattern Recognit. 2022, 129, 108769. [Google Scholar] [CrossRef]
- Lv, Z.; Zhang, Z. Research on plant leaf recognition method based on multi-feature fusion in different partition blocks. Digit. Signal Process. 2023, 134, 103907. [Google Scholar] [CrossRef]
- Wang, B.; Gao, Y.; Yuan, X.; Xiong, S. Local R-symmetry co-occurrence: Characterising leaf image patterns for identifying cultivars. IEEE/ACM Trans. Comput. Biol. Bioinform. 2022, 19, 1018–1031. [Google Scholar] [CrossRef]
- Hamrouni, L.; Kherfi, M.L.; Aiadi, O.; Benbelghit, A. Plant Leaves Recognition Based on a Hierarchical One-Class Learning Scheme with Convolutional Auto-Encoder and Siamese Neural Network. Symmetry 2021, 13, 1705. [Google Scholar] [CrossRef]
- Beikmohammadi, A.; Faez, K.; Motallebi, A. SWP-LeafNET: A novel multistage approach for plant leaf identification based on deep CNN. Expert Syst. Appl. 2022, 202, 117470. [Google Scholar] [CrossRef]
- Cui, J.; Zhang, X.; Zhang, J.; Han, Y.; Ai, H.; Dong, C.; Liu, H. Weed identification in soybean seedling stage based on UAV images and Faster R-CNN. Comput. Electron. Agric. 2024, 227, 109533. [Google Scholar] [CrossRef]
- Sharma, S.; Vardhan, M. AELGNet: Attention-based Enhanced Local and Global Features Network for medicinal leaf and plant classification. Comput. Biol. Med. 2025, 184, 109447. [Google Scholar] [CrossRef]
- Abbasi, S.; Mokhtarian, F.; Kittler, J. Curvature scale space image in shape similarity retrieval. Multimed. Syst. 1999, 7, 467–476. [Google Scholar] [CrossRef]
- Cao, J.; Wang, B.; Brown, D.B. Similarity based leaf image retrieval using multiscale R-angle description. Inf. Sci. 2016, 374, 51–64. [Google Scholar] [CrossRef]
- Ni, F.; Wang, B. Integral contour angle: An invariant shape descriptor for classification and retrieval of leaf images. In Proceedings of the IEEE International Conference on Image Processing, Athens, Greece, 7–10 October 2018; pp. 1223–1227. [Google Scholar]
- Barré, P.; Stöver, B.C.; Müller, K.F.; Steinhage, V. Leafsanp: A computer vision system for automatic plant species identification. Ecol. Inform. 2017, 40, 50–56. [Google Scholar] [CrossRef]
- Yang, C.; Yu, Q. Multiscale Fourier descriptor based on triangular features for shape retrieval. Signal Process. Image Commun. 2019, 71, 110–119. [Google Scholar] [CrossRef]
- Bay, H.; Ess, A.; Tuytelaars, T.; Gool, L.V. Speeded-Up robust features (SURF). Comput. Vis. Image Underst. 2008, 110, 346–359. [Google Scholar] [CrossRef]
- Qi, X.; Xiao, R.; Li, C.G.; Qiao, Y.; Guo, J.; Tang, X. Pairwise rotation invariant co-occurrence local binary pattern. IEEE Trans. Pattern Anal. Mach. Intell. 2014, 36, 2199–2213. [Google Scholar] [CrossRef]
- Tang, Z.; Su, Y.; Er, M.J.; Qi, F.; Zhang, L.; Zhou, J. A local binary pattern based texture descriptors for classification of tea leaves. Neurocomputing 2015, 168, 1011–1023. [Google Scholar] [CrossRef]
- Dyrmann, M.; Karstoft, H.; Midtiby, H.S. Plant species classification using deep convolutional neural network. Biosyst. Eng. 2016, 151, 72–80. [Google Scholar] [CrossRef]
- Liu, G.H.; Li, Z.Y.; Yang, J.Y.; Zhang, D. Exploiting sublimated deep features for image retrieval. Pattern Recognit. 2024, 147, 110076. [Google Scholar] [CrossRef]
- Liu, G.H.; Yang, J.Y. Deep-seated features histogram: A novel image retrieval method. Pattern Recognit. 2021, 116, 107926. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015; pp. 1097–1105. [Google Scholar]
- Wu, H.; Fang, L.; Yu, Q.; Yang, C. Deep convolutional feature aggregation for fine-grained cultivar recognition. Knowl.-Based Syst. 2023, 275, 110688. [Google Scholar] [CrossRef]
- Chen, X.; Wang, B. Symmetry-constrained linear sliding co-occurrence LBP for fine-grained leaf image retrieval. Comput. Electron. Agric. 2024, 218, 108741. [Google Scholar] [CrossRef]
- Belongie, S.; Malik, J.; Puzicha, J. Shape matching and object recognition using shape contexts. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 509–522. [Google Scholar] [CrossRef]
- Ling, H.; Jacobs, D. Shape classification using the inner-distance. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 29, 286–299. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Novotný, P.; Suk, T. Leaf recognition of woody species in Central Europe. Biosyst. Eng. 2013, 115, 444–452. [Google Scholar] [CrossRef]
- Wang, B.; Brown, D.; Gao, Y.; Salle, J.L. MARCH: Multiscale-arch-height description for mobile retrieval of leaf images. Inf. Sci. 2014, 302, 132–148. [Google Scholar] [CrossRef]
Dataset | #Classes 1 | Resolution | Intra-Class Variation | Inter-Class Similarity | Notes |
---|---|---|---|---|---|
ICL220 | 220 | 250 × 500 2 | Low | Medium | Species-level |
MEW2012 | 153 | 1000 × 2000 2 | Medium | Medium | Species-level |
SoyCultivar200 | 200 | 2000 × 3000 2 | Low | High | Cultivar-level |
Method | Bulls-Eye | Feature Dimension | Average Retrieval Time |
---|---|---|---|
MDM [8] | 50.51 | 16384 | 132.06 |
MFD [31] | 64.06 | 360 | 135.04 |
HoGCV [10] | 63.09 | 3600 | 724.01 |
IMTD [12] | 67.16 | 300 | 347.84 |
LPCFR | 70.31 | 1540 | 286.49 |
Method | RsCoM (60.63) | SBT (64.38) | Resnet50 (58.86) | DSFH (59.59) | SDFH (78.92) |
MDM (50.51) | 68.15 | 68.45 | 74.90 | 68.45 | 73.40 |
MFD (64.06) | 73.56 | 72.67 | 82.46 | 78.66 | 82.99 |
HoGCV (63.09) | 71.85 | 72.03 | 81.43 | 77.20 | 81.33 |
IMTD (67.16) | 73.47 | 73.21 | 82.75 | 79.88 | 84.36 |
LPCFR (70.31) | 73.23 | 73.97 | 84.01 | 81.64 | 86.16 |
Method | RsCoM (60.63) | SBT (64.38) | Resnet50 (58.86) | DSFH (59.59) | SDFH (78.92) |
MDM (50.51) | 67.17/68.15 | 67.39/68.45 | 74.17/74.90 | 66.42/68.45 | 69.37/73.40 |
MFD (64.06) | 72.61/73.56 | 71.90/72.67 | 81.94/82.46 | 77.51/78.66 | 80.15/82.99 |
HoGCV (63.09) | 70.91/71.85 | 71.07/72.03 | 80.94/81.43 | 76.13/77.20 | 78.62/81.33 |
IMTD (67.16) | 72.31/73.47 | 72.35/73.21 | 82.38/82.75 | 79.22/79.88 | 82.04/84.36 |
LPCFR (70.31) | 71.89/73.23 | 72.94/73.97 | 83.61/84.01 | 81.53/81.64 | 84.63/86.16 |
Method | Bulls-Eye |
---|---|
MDM [8] | 50.63 |
MFD [31] | 65.23 |
HoGCV [10] | 68.02 |
IMTD [12] | 74.35 |
LPCFR | 78.91 |
Method | RsCoM (76.77) | SBT (81.07) | Resnet50 (61.48) | DSFH (65.15) | SDFH (75.79) |
MDM (50.63) | 77.29 | 79.33 | 74.18 | 69.60 | 72.17 |
MFD (65.23) | 83.54 | 84.95 | 82.86 | 81.19 | 83.57 |
HoGCV (68.02) | 81.07 | 83.34 | 82.18 | 80.38 | 82.19 |
IMTD (74.35) | 84.69 | 86.61 | 85.83 | 86.16 | 88.20 |
LPCFR (78.91) | 84.35 | 87.28 | 86.10 | 87.58 | 89.59 |
Method | RsCoM (76.77) | SBT (81.07) | Resnet50 (61.48) | DSFH (65.15) | SDFH (75.79) |
MDM (50.63) | 77.24/77.29 | 79.20/79.33 | 73.72/74.18 | 68.04/69.60 | 70.07/72.17 |
MFD (65.23) | 83.53/83.54 | 84.89/84.95 | 82.60/82.86 | 80.17/81.19 | 82.08/83.57 |
HoGCV (68.02) | 81.05/81.07 | 83.25/83.34 | 81.86/82.18 | 79.59/80.38 | 81.00/82.19 |
IMTD (74.35) | 84.65/84.69 | 86.60/86.61 | 85.64/85.83 | 85.70/86.16 | 87.47/88.20 |
LPCFR (78.91) | 84.28/84.35 | 87.25/87.28 | 85.82/86.10 | 87.37/87.58 | 89.16/89.59 |
Method | Upper | Middle | Lower | Joint |
---|---|---|---|---|
MDM [8] | 23.02 | 24.52 | 26.00 | 43.16 |
MFD [31] | 26.65 | 28.28 | 28.89 | 49.58 |
HoGCV [10] | 29.04 * | 31.48 * | 32.48 * | 59.81 * |
IMTD [12] | 28.57 | 30.73 | 33.11 | 56.79 |
LPCFR | 35.62 | 37.56 | 40.10 | 70.99 |
Method | RsCoM (42.80) | SBT (47.57 *) | Resnet50 (32.65) | DSFH (54.30) | SDFH (46.70) |
MDM (23.02) | 42.94 | 41.28 | 37.07 | 48.83 | 39.22 |
MFD (26.65) | 45.32 | 44.05 | 39.56 | 51.70 | 42.86 |
HoGCV (29.04 *) | 45.86 | 45.76 | 42.66 | 55.90 | 46.38 |
IMTD (28.57) | 45.93 | 45.67 | 41.05 | 54.31 | 45.84 |
LPCFR (35.62) | 48.80 | 51.12 | 45.73 | 62.15 | 53.92 |
Method | RsCoM (43.09) | SBT (48.40 *) | Resnet50 (33.63) | DSFH (46.26) | SDFH (50.20) |
MDM (24.52) | 44.69 | 44.13 | 39.83 | 46.01 | 42.77 |
MFD (28.28) | 47.13 | 46.12 | 41.56 | 49.46 | 46.11 |
HoGCV (31.48 *) | 48.03 | 48.27 | 44.52 | 53.91 | 50.71 |
IMTD (30.73) | 47.72 | 47.77 | 42.77 | 51.75 | 48.75 |
LPCFR (37.56) | 50.47 | 53.21 | 47.24 | 59.35 | 57.34 |
Method | RsCoM (41.62) | SBT (47.92 *) | Resnet50 (33.01) | DSFH (36.12) | SDFH (49.20) |
MDM (26.00) | 44.34 | 44.52 | 39.06 | 40.70 | 42.89 |
MFD (28.89) | 46.34 | 46.39 | 40.47 | 43.38 | 45.59 |
HoGCV (32.48 *) | 47.07 | 48.62 | 43.45 | 47.30 | 50.31 |
IMTD (33.11) | 47.57 | 48.95 | 42.76 | 47.09 | 49.71 |
LPCFR (40.10) | 49.31 | 53.26 | 46.13 | 52.35 | 56.64 |
Method | RsCoM (70.46) | SBT (81.69 *) | Resnet50 (61.56) | DSFH (74.61) | SDFH (78.25) |
MDM (43.16) | 76.72 | 78.55 | 71.58 | 77.79 | 75.09 |
MFD (49.58) | 79.88 | 80.99 | 74.83 | 81.23 | 78.91 |
HoGCV (59.81 *) | 80.53 | 82.72 | 78.19 | 84.98 | 83.35 |
IMTD (56.79) | 80.80 | 83.29 | 77.30 | 84.91 | 83.33 |
LPCFR (70.99) | 83.73 | 86.84 | 82.11 | 90.16 | 88.72 |
Method | RsCoM (41.62) | SBT (47.92 *) | Resnet50 (33.01) | DSFH (36.12) | SDFH (49.20) |
MDM (26.00) | 43.91/44.34 | 43.30/44.52 | 38.76/39.06 | 40.30/40.70 | 40.78/42.89 |
MFD (28.89) | 45.87/46.34 | 45.40/46.39 | 40.18/40.47 | 42.71/43.38 | 42.88/45.59 |
HoGCV (32.48 *) | 46.98/47.07 | 48.25/48.62 | 43.35/43.45 | 47.09/47.30 | 48.67/50.31 |
IMTD (33.11) | 47.29/47.57 | 48.00/48.95 | 42.65/42.76 | 46.56/47.09 | 47.68/49.71 |
LPCFR (40.10) | 48.65/49.31 | 53.27/53.26 | 45.89/46.13 | 52.79/52.35 | 56.57/56.64 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ye, M.; Cheng, Y.; Yuan, Y.; Yu, D.; Jin, G. Local Polar Coordinate Feature Representation and Heterogeneous Fusion Framework for Accurate Leaf Image Retrieval. Symmetry 2025, 17, 1049. https://doi.org/10.3390/sym17071049
Ye M, Cheng Y, Yuan Y, Yu D, Jin G. Local Polar Coordinate Feature Representation and Heterogeneous Fusion Framework for Accurate Leaf Image Retrieval. Symmetry. 2025; 17(7):1049. https://doi.org/10.3390/sym17071049
Chicago/Turabian StyleYe, Mengjie, Yong Cheng, Yongqi Yuan, De Yu, and Ge Jin. 2025. "Local Polar Coordinate Feature Representation and Heterogeneous Fusion Framework for Accurate Leaf Image Retrieval" Symmetry 17, no. 7: 1049. https://doi.org/10.3390/sym17071049
APA StyleYe, M., Cheng, Y., Yuan, Y., Yu, D., & Jin, G. (2025). Local Polar Coordinate Feature Representation and Heterogeneous Fusion Framework for Accurate Leaf Image Retrieval. Symmetry, 17(7), 1049. https://doi.org/10.3390/sym17071049