Meta-Learning in Land Use and Land Cover Classification: Review and Perspective
Highlights
- Optimization-based and metric-based meta-learning dominate LULC classification research, with MAML and its variants being the most widely adopted, while memory-augmented methods remain underexplored due to computational overhead on high-dimensional remote sensing data.
- Meta-learning consistently outperforms conventional pre-training followed by fine-tuning under significant domain shifts across multiple data modalities, by acquiring cross-task structural knowledge rather than reusing instance-level features.
- Temporal dynamics modeling and multimodal data integration remain in early stages, calling for unified meta-learning frameworks that jointly address cross-regional, cross-temporal, and cross-modal generalization challenges arising from spatial heterogeneity.
- The integration of meta-learning with remote sensing foundation models represents a promising pathway toward operationally deployable LULC systems, combining large-scale representation learning with rapid few-shot adaptation mechanisms.
Abstract
1. Introduction
2. Review Methodology
2.1. Literature Search Strategy
2.2. Screening and Selection Criteria
2.3. Review Framework
3. Meta-Learning Paradigms
3.1. Meta-Learning Fundamentals
3.2. Memory-Augmented Meta-Learning
3.3. Optimization-Based Meta-Learning
3.4. Metric-Based Meta-Learning
3.5. Section Summary
- Memory-augmented meta-learning can dynamically adjust its internal memory state for task-specific adaptation, making it conceptually suitable for scenarios requiring incremental knowledge accumulation. However, it often exhibits limited transferability across substantially different domains and demands considerable computational resources and memory capacity, which constrains its practical applicability to multimodal remote sensing data.
- Optimization-based methods possess the widest applicability across diverse tasks due to their model-agnostic nature, allowing flexible integration with diverse backbone architectures commonly used in remote sensing (e.g., CNNs and Transformers). However, it remains computationally intensive owing to bilevel optimization, particularly the second-order gradient computation required by MAML.
- Metric-based methods are relatively straightforward and computationally efficient at inference time, yet their performance is sensitive to the quality of the learned embedding space and is largely confined to supervised classification settings.
4. LULC Application in Remote Sensing
4.1. Overview of Meta-Learning Applications in LULC
4.2. Label Scarcity in LULC
4.3. Cross-Region and Cross-Domain Generalization in LULC
4.4. Temporal Dynamics Modeling in LULC
4.5. Multimodal Data Integration in LULC
5. Discussion
5.1. Key Findings
5.2. Comparison with Related Paradigms
5.3. Current Limitations
5.4. Future Directions
6. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Alem, A.; Kumar, S. Deep Learning Methods for Land Cover and Land Use Classification in Remote Sensing: A Review. In Proceedings of the 2020 8th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), Noida, India, 4–5 June 2020; pp. 903–908. [Google Scholar]
- Ma, Y.; Chen, S.; Ermon, S.; Lobell, D.B. Transfer Learning in Environmental Remote Sensing. Remote Sens. Environ. 2024, 301, 113924. [Google Scholar] [CrossRef]
- Vivone, G.; Deng, L.-J.; Deng, S.; Hong, D.; Jiang, M.; Li, C.; Li, W.; Shen, H.; Wu, X.; Xiao, J.-L.; et al. Deep Learning in Remote Sensing Image Fusion: Methods, Protocols, Data, and Future Perspectives. IEEE Geosci. Remote Sens. Mag. 2024, 13, 269–310. [Google Scholar] [CrossRef]
- Lou, C.; Al-Qaness, M.A.A.; AL-Alimi, D.; Dahou, A.; Abd Elaziz, M.; Abualigah, L.; Ewees, A.A. Land Use/Land Cover (LULC) Classification Using Hyperspectral Images: A Review. Geo-Spat. Inf. Sci. 2025, 28, 345–386. [Google Scholar] [CrossRef]
- Wang, J.; Haining, R.; Zhang, T.; Xu, C.; Hu, M.; Yin, Q.; Li, L.; Zhou, C.; Li, G.; Chen, H. Statistical Modeling of Spatially Stratified Heterogeneous Data. Ann. Am. Assoc. Geogr. 2024, 114, 499–519. [Google Scholar] [CrossRef]
- Lu, S.; Guo, J.; Zimmer-Dauphinee, J.R.; Nieusma, J.M.; Wang, X.; VanValkenburgh, P.; Wernke, S.A.; Huo, Y. Vision Foundation Models in Remote Sensing: A Survey. IEEE Geosci. Remote Sens. Mag. 2025, 13, 190–215. [Google Scholar] [CrossRef]
- Hospedales, T.M.; Antoniou, A.; Micaelli, P.; Storkey, A.J. Meta-Learning in Neural Networks: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 5149–5169. [Google Scholar] [CrossRef]
- Tong, X.-Y.; Xia, G.-S.; Lu, Q.; Shen, H.; Li, S.; You, S.; Zhang, L. Land-Cover Classification with High-Resolution Remote Sensing Images Using Transferable Deep Models. Remote Sens. Environ. 2020, 237, 111322. [Google Scholar] [CrossRef]
- Alzubaidi, L.; Bai, J.; Al-Sabaawi, A.; Santamaría, J.; Albahri, A.S.; Al-dabbagh, B.S.N.; Fadhel, M.A.; Manoufali, M.; Zhang, J.; Al-Timemy, A.H.; et al. A Survey on Deep Learning Tools Dealing with Data Scarcity: Definitions, Challenges, Solutions, Tips, and Applications. J. Big Data 2023, 10, 46. [Google Scholar] [CrossRef]
- Sun, X.; Wang, B.; Wang, Z.; Li, H.; Li, H.; Fu, K. Research Progress on Few-Shot Learning for Remote Sensing Image Interpretation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 2387–2402. [Google Scholar] [CrossRef]
- Mohammadi, S.; Belgiu, M.; Stein, A. Few-Shot Learning for Crop Mapping from Satellite Image Time Series. Remote Sens. 2024, 16, 1026. [Google Scholar] [CrossRef]
- Ma, Y.; Zhang, Z.; Yang, H.L.; Yang, Z. An Adaptive Adversarial Domain Adaptation Approach for Corn Yield Prediction. Comput. Electron. Agric. 2021, 187, 106314. [Google Scholar] [CrossRef]
- Rußwurm, M.; Wang, S.; Korner, M.; Lobell, D. Meta-Learning for Few-Shot Land Cover Classification. In Proceedings of the Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, Seattle, WA, USA, 14–19 June 2020; pp. 200–201. [Google Scholar]
- Finn, C.; Abbeel, P.; Levine, S. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks. In Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, Australia, 6–11 August 2017. [Google Scholar]
- Rußwurm, M.; Wang, S.; Kellenberger, B.; Roscher, R.; Tuia, D. Meta-Learning to Address Diverse Earth Observation Problems across Resolutions. Commun. Earth Environ. 2024, 5, 37. [Google Scholar] [CrossRef]
- Park, S.-H.; Syazwany, N.S.; Lee, S.-C. Meta-Feature Fusion for Few-Shot Time Series Classification. IEEE Access 2023, 11, 41400–41414. [Google Scholar] [CrossRef]
- Gama, P.H.T.; Oliveira, H.; dos Santos, J.A.; Cesar, R.M. An Overview on Meta-Learning Approaches for Few-Shot Weakly-Supervised Segmentation. Comput. Graph. 2023, 113, 77–88. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- Thrun, S.; Pratt, L. Learning to Learn: Introduction and Overview. In Learning to Learn; Thrun, S., Pratt, L., Eds.; Springer: Boston, MA, USA, 1998; pp. 3–17. [Google Scholar]
- Vinyals, O.; Blundell, C.; Lillicrap, T.; Kavukcuoglu, K.; Wierstra, D. Matching Networks for One Shot Learning. In Proceedings of the 30th International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2016; pp. 3637–3645. [Google Scholar]
- Franceschi, L.; Frasconi, P.; Salzo, S.; Grazzi, R.; Pontil, M. Bilevel Programming for Hyperparameter Optimization and Meta-Learning. In Proceedings of the International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018. [Google Scholar]
- Vettoruzzo, A.; Bouguelia, M.-R.; Vanschoren, J.; Rögnvaldsson, T.; Santosh, K. Advances and Challenges in Meta-Learning: A Technical Review. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 4763–4779. [Google Scholar] [CrossRef]
- Santoro, A.; Bartunov, S.; Botvinick, M.; Wierstra, D.; Lillicrap, T. Meta-Learning with Memory-Augmented Neural Networks. In Proceedings of the 33rd International Conference on Machine Learning (ICML 2016), New York, NY, USA, 20–22 June 2016. [Google Scholar]
- Graves, A.; Wayne, G.; Danihelka, I. Neural Turing Machines. arXiv 2014, arXiv:1410.5401. [Google Scholar] [CrossRef]
- Mishra, N.; Rohaninejad, M.; Chen, X.; Abbeel, P. A Simple Neural Attentive Meta-Learner. In Proceedings of the 6th International Conference on Learning Representations (ICLR 2018), Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
- Garnelo, M.; Rosenbaum, D.; Maddison, C.J.; Ramalho, T.; Saxton, D.; Shanahan, M.; Teh, Y.W.; Rezende, D.J.; Eslami, S.M.A. Conditional Neural Processes. arXiv 2018, arXiv:1807.01613. [Google Scholar]
- Munkhdalai, T.; Yu, H. Meta Networks. Proc. Mach. Learn. Res. 2017, 70, 2554–2563. [Google Scholar]
- Ramalho, T.; Garnelo, M. Adaptive Posterior Learning: Few-Shot Learning with a Surprise-Based Memory Module. arXiv 2019, arXiv:1902.02527. [Google Scholar]
- Chen, X.; Wang, Z.; Tang, S.; Muandet, K. MATE: Plugging in Model Awareness to Task Embedding for Meta Learning. In Proceedings of the Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2020. [Google Scholar]
- Liu, X.; Tian, X.; Lin, S.; Qu, Y.; Ma, L.; Yuan, W.; Zhang, Z.; Xie, Y. Learn from Concepts: Towards the Purified Memory for Few-Shot Learning. In Proceedings of the International Joint Conference on Artificial Intelligence, Montréal, QC, Canada, 19–26 August 2021. [Google Scholar]
- Du, Y.; Shen, J.; Zhen, X.; Snoek, C.G.M. EMO: Episodic Memory Optimization for Few-Shot Meta-Learning. In Proceedings of the CoLLAs, Montréal, QC, Canada, 22–25 August 2023. [Google Scholar]
- Chen, X.; Shi, M. Memory-Guided Network with Uncertainty-Based Feature Augmentation for Few-Shot Semantic Segmentation. In Proceedings of the 2024 IEEE International Conference on Multimedia and Expo (ICME), Niagara Falls, ON, Canada, 15–19 July 2024; pp. 1–6. [Google Scholar]
- Ravi, S.; Larochelle, H. Optimization as a Model for Few-Shot Learning. arXiv 2017, arXiv:1707.05041. [Google Scholar]
- Nichol, A.; Achiam, J.; Schulman, J. On First-Order Meta-Learning Algorithms. arXiv 2018, arXiv:1803.02999. [Google Scholar]
- Raghu, A.; Raghu, M.; Bengio, S.; Vinyals, O. Rapid Learning or Feature Reuse? Towards Understanding the Effectiveness of MAML. arXiv 2019, arXiv:1909.09157. [Google Scholar]
- Oh, J.; Yoo, H.; Kim, C.; Yun, S.-Y. BOIL: Towards Representation Change for Few-Shot Learning. arXiv 2021, arXiv:2008.08882. [Google Scholar] [CrossRef]
- Grant, E.; Finn, C.; Levine, S.; Darrell, T.; Griffiths, T. Recasting Gradient-Based Meta-Learning as Hierarchical Bayes. arXiv 2018, arXiv:1801.08930. [Google Scholar]
- Finn, C.; Xu, K.; Levine, S. Probabilistic Model-Agnostic Meta-Learning. arXiv 2019, arXiv:1806.02817. [Google Scholar] [CrossRef]
- Kim, T.; Yoon, J.; Dia, O.; Kim, S.; Bengio, Y.; Ahn, S. Bayesian Model-Agnostic Meta-Learning. arXiv 2018, arXiv:1806.03836. [Google Scholar] [CrossRef]
- Jamal, M.A.; Qi, G.-J.; Shah, M. Task-Agnostic Meta-Learning for Few-Shot Learning. arXiv 2018, arXiv:1805.07722. [Google Scholar]
- Rusu, A.A.; Rao, D.; Sygnowski, J.; Vinyals, O.; Pascanu, R.; Osindero, S.; Hadsell, R. Meta-Learning with Latent Embedding Optimization. arXiv 2019, arXiv:1807.05960. [Google Scholar] [CrossRef]
- Jiang, X.; Havaei, M.; Varno, F.; Chartrand, G.; Chapados, N.; Matwin, S. Learning to Learn with Conditional Class Dependencies. In Proceedings of the 7th International Conference on Learning Representations (ICLR 2019), New Orleans, LA, USA, 6–9 May 2019. [Google Scholar]
- Li, Z.; Zhou, F.; Chen, F.; Li, H. Meta-SGD: Learning to Learn Quickly for Few-Shot Learning. arXiv 2017, arXiv:1707.09835. [Google Scholar]
- Sun, S.; Gao, H. Meta-AdaM: A Meta-Learned Adaptive Optimizer with Momentum for Few-Shot Learning. In Proceedings of the Proceedings of the 37th International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2023. [Google Scholar]
- Lee, Y.; Choi, S. Gradient-Based Meta-Learning with Learned Layerwise Metric and Subspace. arXiv 2018, arXiv:1801.05558. [Google Scholar]
- Zhou, F.; Wu, B.; Li, Z. Deep Meta-Learning: Learning to Learn in the Concept Space. arXiv 2018, arXiv:1802.03596. [Google Scholar] [CrossRef]
- Liu, G.; Wang, T.; Zhang, S.; He, K. Generating Pseudo-Labels Adaptively for Few-Shot Model-Agnostic Meta-Learning. In Proceedings of the British Machine Vision Conference, London, UK, 21–24 November 2022. [Google Scholar]
- Lee, J.-J.; Yoon, S.W. XB-MAML: Learning Expandable Basis Parameters for Effective Meta-Learning with Wide Task Coverage. arXiv 2024, arXiv:2403.06768. [Google Scholar]
- Snell, J.; Swersky, K.; Zemel, R. Prototypical Networks for Few-Shot Learning. In Proceedings of the 31st International Conference on Neural Information Processing Systems; Curran Associates Inc.: Red Hook, NY, USA, 2017; pp. 4080–4090. [Google Scholar]
- Sung, F.; Yang, Y.; Zhang, L.; Xiang, T.; Torr, P.H.S.; Hospedales, T.M. Learning to Compare: Relation Network for Few-Shot Learning. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 1199–1208. [Google Scholar]
- Zheng, Y.; Wang, R.; Yang, J.; Xue, L.; Hu, M. Principal Characteristic Networks for Few-Shot Learning. J. Vis. Commun. Image Represent. 2019, 59, 563–573. [Google Scholar] [CrossRef]
- Chowdhury, R.R.; Bathula, D.R. IPNET: Influential Prototypical Networks for Few Shot Learning. arXiv 2022, arXiv:2208.09345. [Google Scholar] [CrossRef]
- Li, X.; Yu, L.; Fu, C.-W.; Fang, M.; Heng, P.-A. Revisiting Metric Learning for Few-Shot Image Classification. Neurocomputing 2020, 406, 49–58. [Google Scholar] [CrossRef]
- He, X.; Li, F.; Liu, L. Task-Adaptive Relation Dependent Network for Few-Shot Learning. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN); IEEE: New York, NY, USA, 2021; pp. 1–8. [Google Scholar]
- Li, J.; Tong, J.; Gao, G.; Xu, W. Attention-Enhanced Relation Network for Few-Shot Image Classification. In Proceedings of the 2023 6th International Conference on Image and Graphics Processing, Chongqing, China, 6–8 January 2023; pp. 197–203. [Google Scholar]
- Ren, L.; Duan, G.; Huang, T.; Kang, Z. Multi-Local Feature Relation Network for Few-Shot Learning. Neural Comput. Appl. 2022, 34, 7393–7403. [Google Scholar] [CrossRef]
- Zhou, F.; Wang, P.; Zhang, L.; Wei, W.; Zhang, Y. Revisiting Prototypical Network for Cross Domain Few-Shot Learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–23 June 2023; pp. 20061–20070. [Google Scholar]
- Zhang, L.; Liu, J.; Luo, M.; Chang, X.; Zheng, Q.; Hauptmann, A.G. Scheduled Sampling for One-Shot Learning via Matching Network. Pattern Recognit. 2019, 96, 106962. [Google Scholar] [CrossRef]
- Wang, J.; Zhai, Y. Prototypical Siamese Networks for Few-Shot Learning. In Proceedings of the 2020 IEEE 10th International Conference on Electronics Information and Emergency Communication (ICEIEC); IEEE: New York, NY, USA, 2020; pp. 178–181. [Google Scholar]
- Zhang, Y.; Kang, Z. TPN: Transferable Proto-Learning Network towards Few-Shot Document-Level Relation Extraction. In Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN); IEEE: New York, NY, USA, 2024; pp. 1–9. [Google Scholar]
- Jiang, Z.; Feng, Z.; Niu, B. Prototype-Neighbor Networks with Task-Specific Enhanced Meta-Learning for Few-Shot Classification. Neural Netw. 2025, 190, 107761. [Google Scholar] [CrossRef]
- Gao, K.; Liu, B.; Yu, X.; Zhang, P.; Tan, X.; Sun, Y. Small Sample Classification of Hyperspectral Image Using Model-Agnostic Meta-Learning Algorithm and Convolutional Neural Network. Int. J. Remote Sens. 2021, 42, 3090–3122. [Google Scholar] [CrossRef]
- Amoako, P.Y.O.; Cao, G.; Shi, H.; Arthur, J.K.; Agyenim, Y.O.B. Meta-Learning with Orthogonal Softmax Layer (MLOSL) for Small Sample Hyperspectral Image Classification. Multimed. Tools Appl. 2025, 84, 1–30. [Google Scholar] [CrossRef]
- Li, Z.; Lu, F.; Zou, J.; Hu, L.; Zhang, H. Generalized Few-Shot Meets Remote Sensing: Discovering Novel Classes in Land Cover Mapping via Hybrid Semantic Segmentation Framework. In Proceedings of the 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); IEEE: New York, NY, USA, 2024; pp. 2744–2754. [Google Scholar]
- Jia, Y.; Sun, C.; Gao, J.; Wang, Q. Few-Shot Remote Sensing Scene Classification via Parameter-Free Attention and Region Matching. ISPRS J. Photogramm. Remote Sens. 2025, 227, 265–275. [Google Scholar] [CrossRef]
- Swaminathan, B.; Palani, S.; Vairavasundaram, S. Meta Learning-Based Dynamic Ensemble Model for Crop Selection. Appl. Artif. Intell. 2022, 36, 2145646. [Google Scholar] [CrossRef]
- Rußwurm, M.; Wang, S.; Tuia, D. Humans Are Poor Few-Shot Classifiers for Sentinel-2 Land Cover. In Proceedings of the IGARSS 2022—2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; pp. 4859–4862. [Google Scholar]
- Wang, S.; Rußwurm, M.; Körner, M.; Lobell, D.B. Meta-Learning for Few-Shot Time Series Classification. In Proceedings of the IGARSS 2020—2020 IEEE International Geoscience and Remote Sensing Symposium; IEEE: New York, NY, USA, 2020; pp. 7041–7044. [Google Scholar]
- Deng, B.; Jia, S.; Shi, D. Deep Metric Learning-Based Feature Embedding for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2019, 58, 1422–1435. [Google Scholar] [CrossRef]
- Xi, B.; Li, J.; Li, Y.; Song, R.; Hong, D.; Chanussot, J. Few-Shot Learning with Class-Covariance Metric for Hyperspectral Image Classification. IEEE Trans. Image Process. 2022, 31, 5079–5092. [Google Scholar] [CrossRef] [PubMed]
- Wang, C.; Zheng, W.; Sun, X.; Zhou, J.; Lu, J. Probabilistic Deep Metric Learning for Hyperspectral Image Classification. Pattern Recognit. 2025, 157, 110878. [Google Scholar] [CrossRef]
- Wu, G.; Cong, L.; Huang, C.; Ju, Y.; Jiang, J.; Chen, C. Meta-Learning Framework for Effective Few Shot Time Series Prediction. In Proceedings of the 2025 IEEE 5th International Conference on Power, Electronics and Computer Applications (ICPECA); IEEE: New York, NY, USA, 2025; pp. 18–22. [Google Scholar]
- Dai, M.; Xing, S.; Xu, Q.; Wang, H.; Li, P.; Sun, Y.; Pan, J.; Li, Y. Learning Transferable Cross-Modality Representations for Few-Shot Hyperspectral and LiDAR Collaborative Classification. Int. J. Appl. Earth Obs. Geoinf. 2024, 126, 103640. [Google Scholar] [CrossRef]
- Zhang, K.; Han, Y.; Chen, J.; Zhang, Z.; Wang, S. Semantic Segmentation for Remote Sensing Based on RGB Images and Lidar Data Using Model-Agnostic Meta-Learning and Partical Swarm Optimization. IFAC-PapersOnLine 2020, 53, 397–402. [Google Scholar] [CrossRef]
- Qiao, X.; Xing, L.; Han, A.; Liu, W.; Liu, B. Multi-Scale Fusion for Few-Shot Remote Sensing Image Classification. Int. J. Remote Sens. 2023, 44, 6012–6032. [Google Scholar] [CrossRef]
- Wang, Y.; Yao, Q.; Kwok, J.T.; Ni, L.M. Generalizing from a Few Examples: A Survey on Few-Shot Learning. ACM Comput. Surv. 2020, 53, 1–34. [Google Scholar] [CrossRef] [PubMed]
- Lake, B.M.; Salakhutdinov, R.; Tenenbaum, J.B. Human-Level Concept Learning through Probabilistic Program Induction. Science 2015, 350, 1332–1338. [Google Scholar] [CrossRef] [PubMed]
- Liu, B.; Yu, X.; Yu, A.; Zhang, P.; Wan, G.; Wang, R. Deep Few-Shot Learning for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2018, 57, 2290–2304. [Google Scholar] [CrossRef]
- Jiang, X.; Zhou, N.; Li, X. Few-Shot Segmentation of Remote Sensing Images Using Deep Metric Learning. IEEE Geosci. Remote Sens. Lett. 2022, 19, 6507405. [Google Scholar] [CrossRef]
- Li, X.; Deng, J.; Fang, Y. Few-Shot Object Detection on Remote Sensing Images. IEEE Trans. Geosci. Remote Sens. 2021, 60, 5601614. [Google Scholar] [CrossRef]
- Alajaji, D.; Alhichri, H.S.; Ammour, N.; Alajlan, N. Few-Shot Learning for Remote Sensing Scene Classification. In Proceedings of the 2020 Mediterranean and Middle-East Geoscience and Remote Sensing Symposium (M2GARSS); IEEE: New York, NY, USA, 2020; pp. 81–84. [Google Scholar]
- Li, Y.; Yang, J. Few-Shot Cotton Pest Recognition and Terminal Realization. Comput. Electron. Agric. 2020, 169, 105240. [Google Scholar] [CrossRef]
- Demir, I.; Koperski, K.; Lindenbaum, D.; Pang, G.; Huang, J.; Basu, S.; Hughes, F.; Tuia, D.; Raskar, R. DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW); IEEE: New York, NY, USA, 2018; pp. 172–17209. [Google Scholar]
- Yan, J.; Wang, L.; Song, W.; Chen, Y.; Chen, X.; Deng, Z. A Time-Series Classification Approach Based on Change Detection for Rapid Land Cover Mapping. ISPRS J. Photogramm. Remote Sens. 2019, 158, 249–262. [Google Scholar] [CrossRef]
- Sajid, M.; Khan, H.H.; Khan, A.; Ahmad, R.; Khan, A.; Siraj, G.; Ansari, A.A. Application of Sentinel-1 SAR Data for Flood Monitoring in the Lower Ganges Basin: A Time-Series Analysis of 2021 Flood in Bihar. Discov. Sens. 2025, 1, 12. [Google Scholar] [CrossRef]
- Peng, K.; Jiang, W.; Hou, P.; Wu, Z.; Cui, T. Detailed Wetland-Type Classification Using Landsat-8 Time-Series Images: A Pixel-and Object-Based Algorithm with Knowledge (POK). GIScience Remote Sens. 2024, 61, 2293525. [Google Scholar] [CrossRef]
- Jiang, J.; Chen, C.; Lackinger, A.; Li, H.; Li, W.; Pei, Q.; Dustdar, S. MetaTrans-FSTSF: A Transformer-Based Meta-Learning Framework for Few-Shot Time Series Forecasting in Flood Prediction. Remote Sens. 2025, 17, 77. [Google Scholar] [CrossRef]
- Li, J.; Hong, D.; Gao, L.; Yao, J.; Zheng, K.; Zhang, B.; Chanussot, J. Deep Learning in Multimodal Remote Sensing Data Fusion: A Comprehensive Review. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102926. [Google Scholar] [CrossRef]
- Gominski, D.; Gouet-Brunet, V.; Chen, L. Cross-Dataset Learning for Generalizable Land Use Scene Classification. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA, 19–20 June 2022; pp. 1381–1390. [Google Scholar]
- Lu, X.; Gong, T.; Zheng, X. Domain Mapping Network for Remote Sensing Cross-Domain Few-Shot Classification. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5606411. [Google Scholar] [CrossRef]
- Xiong, Z.; Zhang, F.; Wang, Y.; Shi, Y.; Zhu, X.X. EarthNets: Empowering Artificial Intelligence for Earth Observation. IEEE Geosci. Remote Sens. Mag. 2024, 13, 45–78. [Google Scholar] [CrossRef]
- Wang, Y.; Albrecht, C.M.; Braham, N.A.A.; Mou, L.; Zhu, X.X. Self-Supervised Learning in Remote Sensing: A Review. IEEE Geosci. Remote Sens. Mag. 2022, 10, 213–247. [Google Scholar] [CrossRef]
- Li, L.; Zhang, W.; Zhang, X.; Emam, M.; Jing, W. Semi-Supervised Remote Sensing Image Semantic Segmentation Method Based on Deep Learning. Electronics 2023, 12, 348. [Google Scholar] [CrossRef]
- Wang, S.; Chen, W.; Xie, S.M.; Azzari, G.; Lobell, D.B. Weakly Supervised Deep Learning for Segmentation of Remote Sensing Imagery. Remote Sens. 2020, 12, 207. [Google Scholar] [CrossRef]
- Pan, S.J.; Yang, Q. A Survey on Transfer Learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
- Bommasani, R.; Hudson, D.A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M.S.; Bohg, J.; Bosselut, A.; Brunskill, E.; et al. On the Opportunities and Risks of Foundation Models. arXiv 2021, arXiv:2108.07258. [Google Scholar] [CrossRef]
- Mai, G.; Huang, W.; Sun, J.; Song, S.; Mishra, D.; Liu, N.; Gao, S.; Liu, T.; Cong, G.; Hu, Y.; et al. On the Opportunities and Challenges of Foundation Models for Geoai (Vision Paper). ACM Trans. Spat. Algorithms Syst. 2024, 10, 1–46. [Google Scholar] [CrossRef]
- Bi, H.; Feng, Y.; Tong, B.; Wang, M.; Yu, H.; Mao, Y.; Chang, H.; Diao, W.; Wang, P.; Yu, Y.; et al. RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 48, 4388–4405. [Google Scholar] [CrossRef]
- Liu, F.; Chen, D.; Guan, Z.; Zhou, X.; Zhu, J.; Ye, Q.; Fu, L.; Zhou, J. RemoteCLIP: A Vision Language Foundation Model for Remote Sensing. IEEE Trans. Geosci. Remote. Sens. 2024, 62, 1–16. [Google Scholar] [CrossRef]
- Zhang, W.; Cai, M.; Zhang, T.; Zhuang, Y.; Mao, X. EarthGPT: A Universal Multi-Modal Large Language Model for Multi-Sensor Image Comprehension in Remote Sensing Domain. IEEE Trans. Geosci. Remote. Sens. 2024, 62, 1–20. [Google Scholar]
- Yu, Z.; Liu, C.; Liu, L.; Shi, Z.; Zou, Z. MetaEarth: A Generative Foundation Model for Global-Scale Remote Sensing Image Generation. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 47, 1764–1781. [Google Scholar] [CrossRef]
- Cepeda, V.V.; Nayak, G.K.; Shah, M. GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-Localization. arXiv 2023, arXiv:2309.16020. [Google Scholar]
- Zhu, J.; Yao, L.; Liu, F.; Zhang, C.; Shen, C.; Zhou, J. A Encoder-Decoder Framework for Foundation Model-Based Remote Sensing Semantic Segmentation. In Proceedings of the 6th ACM International Conference on Multimedia in Asia Workshops; Association for Computing Machinery: New York, NY, USA, 2024. [Google Scholar]
- Li, X.; Wen, C.; Hu, Y.; Zhou, N. RS-CLIP: Zero Shot Remote Sensing Scene Classification via Contrastive Vision-Language Supervision. Int. J. Appl. Earth Obs. Geoinf. 2023, 124, 103497. [Google Scholar] [CrossRef]
- Huang, Z.; Yan, H.; Zhan, Q.; Yang, S.; Zhang, M.; Zhang, C.; Lei, Y.; Liu, Z.; Liu, Q.; Wang, Y. A Survey on Remote Sensing Foundation Models: From Vision to Multimodality. arXiv 2025, arXiv:2503.22081. [Google Scholar] [CrossRef]
- Hu, E.J.; Shen, Y.; Wallis, P.; Allen-Zhu, Z.; Li, Y.; Wang, S.; Wang, L.; Chen, W. LoRA: Low-Rank Adaptation of Large Language Models. arXiv 2021, arXiv:2106.09685. [Google Scholar]
- Hu, Z.; Wei, Y.; Shen, L.; Yuan, C.; Tao, D. LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs. In Proceedings of the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 11–15 June 2025. [Google Scholar]
- Choe, S.K.; Mehta, S.V.; Ahn, H.; Neiswanger, W.; Xie, P.; Strubell, E.; Xing, E. Making Scalable Meta Learning Practical. arXiv 2023, arXiv:2310.05674. [Google Scholar] [CrossRef]
- Li, J.; Huang, X.; Tu, L. WHU-OHS: A Benchmark Dataset for Large-Scale Hersepctral Image Classification. Int. J. Appl. Earth Obs. Geoinf. 2022, 113, 103022. [Google Scholar] [CrossRef]
- Mani, A.; Gorbachev, S.; Yan, J.; Dixit, A.; Shi, X.; Li, L.; Sun, Y.; Chen, X.; Wu, J.; Deng, J.; et al. OHID-1: A New Large Hyperspectral Image Dataset for Multi-Classification. Sci. Data 2025, 12, 251. [Google Scholar] [CrossRef] [PubMed]
- Nguyen, T.T.; Huynh, T.T.; Ren, Z.; Nguyen, P.L.; Liew, A.W.-C.; Yin, H.; Nguyen, Q.V.H. A Survey of Machine Unlearning. ACM Trans. Intell. Syst. Technol. 2024, 16, 1–46. [Google Scholar] [CrossRef]
- Gu, A.; Dao, T. Mamba: Linear-Time Sequence Modeling with Selective State Spaces. arXiv 2024, arXiv:2312.00752. [Google Scholar] [CrossRef]
- Aleissaee, A.A.; Kumar, A.; Anwer, R.M.; Khan, S.; Cholakkal, H.; Xia, G.-S.; Khan, F.S. Transformers in Remote Sensing: A Survey. arXiv 2022, arXiv:2209.01206. [Google Scholar] [CrossRef]
- Bao, M.; Lyu, S.; Xu, Z.; Zhou, H.; Ren, J.; Xiang, S.; Li, X.; Cheng, G. Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook. Remote. Sens. 2025, 18, 594. [Google Scholar] [CrossRef]
- Wang, F.; Wang, H.; Wang, Y.; Wang, D.; Chen, M.; Zhao, H.; Sun, Y.; Wang, S.; Lan, L.; Yang, W.; et al. RoMA: Scaling up Mamba-Based Foundation Models for Remote Sensing. arXiv 2025, arXiv:2503.10392. [Google Scholar]










| Method | Year | Memory Mechanism | Core Innovation |
|---|---|---|---|
| MANN [23] | 2016 | External content-addressable matrix (NTM-based) | Pioneered memory-augmented meta-learning with LRUA-based writing for persistent representation-label storage |
| SNAIL [25] | 2018 | Implicit attention-based memory | Replaces explicit memory with interleaved temporal convolution and soft attention for high-bandwidth retrieval |
| CNPs [26] | 2018 | Compressed task embedding | Aggregates support set into a single task representation; enables calibrated uncertainty estimation |
| Meta Networks [27] | 2017 | Dual-stream fast-weight generation | Meta-learner generates fast weights for cross-task adaptation; base learner handles task-specific objectives |
| APL [28] | 2019 | Surprise-based incremental buffer | Selectively retains high-information samples to mitigate memory growth |
| MATE [29] | 2020 | Kernel mean embedding with adaptive attention | Models inter-task distributional differences and task complexity jointly |
| GAM [30] | 2021 | Graph-based memory aggregation | Aggregates knowledge from past tasks via GNNs in a model-agnostic manner |
| EMO [31] | 2023 | Gradient-history memory | Stores prior-task gradient histories to assist parameter updates when current-task signals are weak |
| CSM [32] | 2024 | Learnable memory vectors | Extracts base-class object patterns and encodes query features to align base-novel distributions |
| Method | Year | Direction | Core Innovation |
|---|---|---|---|
| MAML [14] | 2017 | Foundational | Bilevel optimization with task-specific inner-loop adaptation and shared outer-loop initialization |
| Meta-LSTM [33] | 2017 | Foundational | Uses LSTM to explicitly model the optimization trajectory and learn task-specific update rules |
| Reptile [34] | 2018 | Computational simplification | First-order approximation of MAML through iterative task sampling; avoids second-order gradients |
| ANIL [35] | 2020 | Computational simplification | Restricts inner-loop updates to the classifier head, leveraging feature reuse |
| BOIL [36] | 2021 | Computational simplification | Updates only the feature extractor; emphasizes representation change for cross-domain scenarios |
| LLAMA [37] | 2018 | Probabilistic extension | Recasts MAML as hierarchical Bayesian inference with approximate curvature estimation |
| PLATIPUS [38] | 2018 | Probabilistic extension | Samples diverse task-specific models from a learned distribution via a probabilistic graphical model |
| BMAML [39] | 2018 | Probabilistic extension | Integrates MAML with Stein variational gradient descent for non-parametric Bayesian meta-learning |
| TAML [40] | 2018 | Robustness and generalization | Promotes unbiased initialization by maximizing label entropy to reduce meta-overfitting |
| LEO [41] | 2019 | Robustness and generalization | Performs adaptation in a learned low-dimensional latent embedding for efficient optimization |
| CAML [42] | 2019 | Robustness and generalization | Conditionally transforms feature representations based on class-level dependencies |
| Meta-SGD [43] | 2017 | Meta-learned optimizer | Meta-learns initialization, per-parameter learning rates, and update directions jointly |
| Meta-Adam [44] | 2023 | Meta-learned optimizer | Incorporates weight-update history and momentum to predict adaptive learning rates |
| MT-net [45] | 2018 | Recent advance | Introduces layer-wise subspace learning with meta-learned distance metrics |
| DEML [46] | 2018 | Recent advance | Enables meta-learning in concept space rather than instance space |
| GP-MAML/ANIL/BOIL [47] | 2022 | Recent advance | Leverages pseudo-labeled query samples to enrich the support set during training |
| XB-MAML [48] | 2024 | Recent advance | Dynamically expandable basis parameters linearly combined to form task-specific initializations |
| Method | Year | Direction | Core Innovation |
|---|---|---|---|
| Matching Network [20] | 2016 | Foundational | Attention-based mapping from support set to label predictions via differentiable nearest-neighbor mechanism |
| Prototypical Network [49] | 2017 | Foundational | Represents each class by the mean embedding of its support examples; classifies query samples by nearest-prototype distance |
| Relation Network [50] | 2018 | Foundational | Learned deep relation module replaces fixed distance functions for non-linear similarity scoring |
| Principal Characteristics Net [51] | 2019 | Prototype refinement | Contribution-based weighting of embedded vectors for more expressive class prototypes |
| IPNET [52] | 2022 | Prototype refinement | Weights support samples by Maximum Mean Discrepancy to reduce outlier influence |
| K-Tuple Network [53] | 2020 | Relational modeling | Captures multi-sample relational structures during episodic training |
| Task-Adaptive Relation-Dependent Network [54] | 2021 | Relational modeling | Addresses train–test distribution bias via distribution-shifting and fine-grained feature comparison |
| Attention-Enhanced Relation Network [55] | 2023 | Relational modeling | Integrates adaptive-kernel and cross-channel attention to encode multi-scale features |
| MLFRNet [56] | 2022 | Relational modeling | Models local feature relationships using cosine-distance-based attention |
| LDP-Net [57] | 2023 | Cross-domain generalization | Dual-branch global–local knowledge distillation with EMA updates |
| SS-Matching Networks [58] | 2019 | Cross-domain generalization | Difficulty-aware metric with scheduled sampling for progressive training |
| Prototypical Siamese Networks [59] | 2020 | Cross-domain generalization | Siamese architecture with dedicated module for refined prototypical representation |
| TPN [60] | 2024 | Hybrid and neighbor-based | Transferable proto-learner with NOTA calibration and virtual adversarial training |
| PNN [61] | 2025 | Hybrid and neighbor-based | Combines Prototypical Networks with KNN-inspired Neighbor Network and hybrid data augmentation |
| Paradigm | Method | Complexity | Dominant Cost |
|---|---|---|---|
| Memory-augmented | MANN [23] | ) | Content-based read/write to external memory at every time step over sequence length L |
| Memory-augmented | CNPs [26] | ) | Single deterministic aggregation of support set into a task representation; no per-step memory access |
| Optimization-based | MAML [14] | ) | Bilevel optimization requires backpropagation through the inner-loop computation graph |
| Optimization-based | Reptile [34] | ) | Avoids second-order gradients via iterative task sampling |
| Optimization-based | ANIL [35] | ) | Inner-loop updates restricted to the classifier head |
| Metric-based | Matching Network [20] | ) | Attention over the entire support set for each query |
| Metric-based | Prototypical Network [49] | ) | Prototype computation followed by distance comparison; no inner-loop adaptation |
| Metric-based | Relation Network [50] | ) | Learned relation module replaces fixed distance, applied to each query–class pair |
| Paradigm | Core Mechanism | Strengths | Limitations | Suitability for LULC |
|---|---|---|---|---|
| Memory-augmented meta-learning | Encode support-set information into external or internal memory modules; retrieve relevant content for query prediction via attention or content-addressable access | Supports incremental knowledge retention; suitable for cross-temporal pattern retrieval | High computational and memory overhead; limited transferability across substantially different domains | Underexplored in LULC; potential for dynamic monitoring and incremental class discovery |
| Optimization-based meta-learning | Learn a shared parameter initialization that can be rapidly adapted to new tasks through a few gradient steps (bilevel optimization) | Model-agnostic; flexible integration with diverse backbones (CNN, Transformer); broad task applicability | Computationally intensive due to second-order gradients; sensitive to task distribution | Most widely adopted in LULC; applied across optical, hyperspectral, time series, and multimodal data |
| Metric-based meta-learning | Learn an embedding space in which query-to-support distance determines classification; non-parametric inference at test time | Computationally efficient at inference; no task-specific fine-tuning required | Performance sensitive to embedding quality; largely confined to supervised classification | Well-suited for hyperspectral image classification and operational large-scale mapping |
| LULC Challenge | Meta-Learning Paradigms | Data Modalities | Representative References |
|---|---|---|---|
| Label scarcity (Section 4.2) | Optimization-based; Metric-based | Optical image; Hyperspectral image | Gao et al. (2021) [62]; Amoako et al. (2025) [63]; Li et al. (2024) [64]; Jia et al. (2025) [65]; Swaminathan et al. (2022) [66] |
| Cross-region and cross-domain shifts (Section 4.3) | Optimization-based; Metric-based | Multispectral (Sentinel-2); Hyperspectral image | Rußwurm et al. (2020) [13]; Rußwurm et al. (2022) [67]; Wang et al. (2020) [68]; Deng et al. (2019) [69]; Xi et al. (2022) [70]; Wang et al. (2025) [71] |
| Temporal dynamics modeling (Section 4.4) | Optimization-based | Satellite time series image | Park et al. (2023) [16]; Mohammadi et al. (2024) [11]; Wu et al. (2025) [72]; Jiang et al. (2025) [61] |
| Multimodal data integration (Section 4.5) | Optimization-based; Metric-based | LiDAR and Hyperspectral image; Multi-resolution image; multispectral image | Dai et al. (2024) [73]; Rußwurm et al. (2024) [15]; Zhang et al. (2020) [74]; Qiao et al. (2023) [75] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
He, W.; Li, L.; Wu, H.; Gao, X.; Yang, Y.; Zhang, Z.; Yang, X.; Ge, Y. Meta-Learning in Land Use and Land Cover Classification: Review and Perspective. Remote Sens. 2026, 18, 1879. https://doi.org/10.3390/rs18121879
He W, Li L, Wu H, Gao X, Yang Y, Zhang Z, Yang X, Ge Y. Meta-Learning in Land Use and Land Cover Classification: Review and Perspective. Remote Sensing. 2026; 18(12):1879. https://doi.org/10.3390/rs18121879
Chicago/Turabian StyleHe, Wei, Lianfa Li, Haoxiong Wu, Xilin Gao, Yichen Yang, Zixuan Zhang, Xiaomei Yang, and Yong Ge. 2026. "Meta-Learning in Land Use and Land Cover Classification: Review and Perspective" Remote Sensing 18, no. 12: 1879. https://doi.org/10.3390/rs18121879
APA StyleHe, W., Li, L., Wu, H., Gao, X., Yang, Y., Zhang, Z., Yang, X., & Ge, Y. (2026). Meta-Learning in Land Use and Land Cover Classification: Review and Perspective. Remote Sensing, 18(12), 1879. https://doi.org/10.3390/rs18121879

