Remote Sensing Agent: Reshaping the Paradigm of Remote Sensing Information Processing
Highlights
- We define a customized “4+1” core characteristic framework for Remote Sensing Agent tailored to geospatial Earth observation.
- Three paradigm shifts of remote sensing processing are systematically summarized from initiation, execution and evaluation dimensions.
- RS Agent will grow into a groundbreaking driving force in the era of geospatial intelligence, contributing to Earth observation and sustainable development.
- Promising technical routes toward dynamic geoscience knowledge evolution and multi-agent coordination are outlined for subsequent RS Agent development.
Abstract
1. Introduction
2. Core Characteristics of Remote Sensing Agent
- Multimodal Spatial PerceptionThe agent perceives multi-source remote sensing data (optical, SAR, hyperspectral, LiDAR, etc.), spatial structures, geographic location, terrain conditions, and dynamic environmental changes, establishing a spatially aware representation of the observed scene [22]. Unlike general AI agents, Remote Sensing Agent integrates multimodal remote sensing perception channels to achieve comprehensive scene cognition. For example, it combines optical data’s high spectral resolution with SAR data’s all-weather observation capability, and supplements terrain details with LiDAR data, effectively overcoming the limitations of single-source data. In complex mountainous areas, it can perceive terrain slopes and aspect through LiDAR, and combine hyperspectral data to identify subtle changes in vegetation coverage, laying a foundation for subsequent task execution.
- Goal-Driven Spatial Mission PlanningFor typical remote sensing Earth observation scenarios, the agent no longer merely performs post hoc data processing, but smartly carries out full-process observation resource deployment and spatial task arrangement. Oriented to practical geospatial demands such as emergency disaster response and regional environmental monitoring, the agent independently selects suitable satellite platforms, matches spaceborne optical and SAR sensors with ground monitoring equipment, formulates reasonable joint observation time windows, and optimizes actual observation routes and scanning trajectories. Combined with real-time meteorological constraints, terrain occlusion conditions, and professional geoscientific knowledge, the agent continuously devises the overall observation scheme. For example, in flood emergency monitoring, the agent can automatically switch and schedule all-weather satellite resources to avoid cloud interference, coordinate multi-source space–ground integrated sensors, and adjust on-site observation sequences, rather than following fixed, offline processing schedules. This spatial mission planning capability effectively links high-level user observation intentions with actual on-demand Earth observation deployment [6].
- Geoscientific Knowledge ReferenceThe agent constructs and maintains a systematic geoscientific knowledge base, which includes spectral characteristics of ground objects, spatial topology relationships, surface environmental laws, and professional remote sensing processing specifications. This knowledge base serves as a core reference for the agent, and provides comprehensive or real-time prior information during perception, planning, and execution, replacing the simple information storage of general agents. For instance, in crop growth monitoring, it can call the spectral characteristic knowledge of different crop growth stages to accurately invert vegetation chlorophyll content; in urban land cover classification, it refers to spatial topology relationships to distinguish between residential areas and industrial zones, improving classification accuracy. Moreover, this knowledge base can be continuously updated through the feedback loop (which is the fourth characteristic), integrating new research results and practical experience to enhance the agent’s adaptability [15].
- Geospatial Workflow ExecutionAs the one of the core parts of Remote Sensing Agent, this component is responsible for executing a sequence of interrelated remote sensing processing tasks. Through professional tool scheduling and workflow orchestration, it converts multi-source raw observation data into decision-supporting analysis results and updates geospatial states in real time. The agent implements a full-stack remote sensing processing chain including data preprocessing, radiometric and atmospheric correction, feature extraction, spatial modeling, and intelligent interpretation, while supporting tool invocation, script generation, and observation device interaction. It can automatically invoke professional tools such as SNAP for atmospheric correction, GDAL for raster reprojection and QGIS for spatial modeling, and generate Python scripts to automate the processing workflow. For example, in coastal zone environmental monitoring, it completes the entire process from satellite data acquisition, radiometric correction, to coastal erosion extraction, and utilizes geoscientific knowledge to select the optimal feature extraction algorithm, ensuring the reliability of monitoring results. Such tool-augmented execution capability has been formally evaluated in recent benchmarks [5]. The core controller of this module is a large language model (LLM) oriented to remote sensing domain, which understands geospatial tasks, schedules professional tools, and coordinates the entire execution process. The LLM is fine-tuned with remote sensing text data and geospatial instruction data, and integrated with visual-language models to adapt to multimodal remote sensing data.
- Feedback LoopServing as the “+1” unifying mechanism, the feedback loop connects multimodal spatial perception, goal-driven spatial mission planning, geoscientific knowledge reference, and geospatial workflow execution. It evaluates task performances, transmits environmental feedback to upstream modules, updates geoscientific knowledge, helps to optimize the workflow, and forms a complete, autonomous, and evolvable closed-loop intelligence system. For example, if the extraction accuracy of a target detection task is lower than the expectation, the feedback loop will trace the error source—whether it is insufficient perception data or inappropriate algorithm parameters—and adjust the data collection scheme or optimize the algorithm with reference to geoscientific knowledge, realizing iterative improvement of task performances [17]. Therefore, the “+1” feedback loop is a cross-cutting mechanism that connects all four core components. When the workflow or task planning requires access to geoscientific knowledge, or when the perception module needs to provide references for other decision-making links, all such interactions are mainly realized through this “+1” feedback loop of the remote sensing agent.
3. Threefold Reshaping the Paradigm of Remote Sensing Information Processing
4. Discussion and Outlook
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| GeoAI | Geospatial Artificial Intelligence |
| LVLM | Large Vision-Language Model |
| UAVs | Unmanned Aerial Vehicles |
| GDAL | Geospatial Data Abstraction Library |
| SNAP | Sentinel Application Platform |
References
- Shang, Y.; Cheng, B.; Zhang, Z.; Wang, Q.; Jin, P.; Huang, L.; Liu, C.; Huang, L.; Ding, X.; Shen, T.; et al. Artificial Intelligence for Remote Sensing: Progress, Challenges, and Perspectives. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2026, 19, 6840–6874. [Google Scholar] [CrossRef]
- Feng, P.; Lv, Z.; Ye, J.; Wang, X.; Huo, X.; Yu, J.; Xu, W.; Zhang, W.; Bai, L.; He, C.; et al. Earth-agent: Unlocking the full landscape of earth observation with agents. arXiv 2025, arXiv:2509.23141. [Google Scholar]
- Awesome-Remote-Sensing-Agents. 2026. Available online: https://github.com/PolyX-Research/Awesome-Remote-Sensing-Agents (accessed on 9 June 2026).
- Xu, W.; Yu, Z.; Mu, B.; Wei, Z.; Zhang, Y.; Li, G.; Wang, J.; Peng, M. RS-Agent: Automating remote sensing tasks through intelligent agent. arXiv 2024, arXiv:2406.07089. [Google Scholar]
- Shabbir, A.; Munir, M.A.; Dudhane, A.; Sheikh, M.U.; Khan, M.H.; Fraccaro, P.; Moreno, J.B.; Khan, F.S.; Khan, S. Thinkgeo: Evaluating tool-augmented agents for remote sensing tasks. arXiv 2025, arXiv:2505.23752. [Google Scholar]
- Yao, L.; Xu, S.; Liu, F.; Zhang, C.; Yao, B.; Min, R.; Li, Y. RemoteAgent: Bridging Vague Human Intents and Earth Observation with RL-based Agentic MLLMs. arXiv 2026, arXiv:2604.07765. [Google Scholar]
- Li, W. Artificial Intelligence in Earth Science: A GeoAI Perspective. J. Geophys. Res. Mach. Learn. Comput. 2025, 2, e2025JH000691. [Google Scholar] [CrossRef]
- Li, W. GeoAI: Where machine learning and big data converge in GIScience. J. Spat. Inf. Sci. 2020, 20, 71–77. [Google Scholar] [CrossRef]
- Esri. GeoAI: What Is GeoAI? Official Documentation; Esri: Redlands, CA, USA, 2026. [Google Scholar]
- Tang, J.; Yan, Y.; Wang, Q.; Xia, Y.; Geng, B.; Chen, J.; Ma, K.; Zhai, Y.; He, Q.; Shao, W.; et al. Intelligent Remote Sensing Agents: A Survey. Available online: https://www.researchgate.net/publication/403817822_Intelligent_Remote_Sensing_Agents_A_Survey (accessed on 9 June 2026).
- Talemi, N.A.; Boone, J.; Afghah, F. Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems. arXiv 2026, arXiv:2601.01891. [Google Scholar] [CrossRef]
- Guo, H.; Su, X.; Wu, C.; Du, B.; Zhang, L.; Li, D. Remote Sensing ChatGPT: Solving Remote Sensing Tasks with ChatGPT and Visual Models. In Proceedings of the IGARSS 2024—2024 IEEE International Geoscience and Remote Sensing Symposium; IEEE: Piscataway, NJ, USA, 2024; pp. 11474–11478. [Google Scholar] [CrossRef]
- Zhu, L.; Wu, J.; Wang, B.; Zhang, G.; Wang, J.; Chen, S.; Tan, M. RS-AGENT: Large Language Models Guided Agent System for Remote Sensing Image Generation. In Proceedings of the IGARSS 2024—2024 IEEE International Geoscience and Remote Sensing Symposium; IEEE: Piscataway, NJ, USA, 2024; pp. 7020–7024. [Google Scholar] [CrossRef]
- Zhao, S.; Liu, F.; Zhang, X.; Chen, H.; Gu, X.; Jiang, Z.; Ling, F.; Fei, B.; Zhang, W.; Wang, J.; et al. OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation. Version Number: 1. arXiv 2026, arXiv:2603.22148. [Google Scholar] [CrossRef]
- Xiao, A.; Cheng, S.; Xu, Y.; Ren, Y.; Chen, H.; Yokoya, N. GeoMMBench and GeoMMAgent: Toward Expert-Level Multimodal Intelligence in Geoscience and Remote Sensing. arXiv 2026, arXiv:2604.08896. [Google Scholar]
- Liu, Z.; Zhao, D.; Yuan, B.; Jiang, Z. RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents. IEEE Trans. Geosci. Remote Sens. 2025, 63, 5611814. [Google Scholar] [CrossRef]
- Liu, C.; Chen, K.; Zhang, H.; Qi, Z.; Zou, Z.; Shi, Z. Change-Agent: Toward Interactive Comprehensive Remote Sensing Change Interpretation and Analysis. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5635616. [Google Scholar] [CrossRef]
- Lee, C.; Paramanayakam, V.; Karatzas, A.; Jian, Y.; Fore, M.; Liao, H.; Yu, F.; Li, R.; Anagnostopoulos, I.; Stamoulis, D. Multi-Agent Geospatial Copilots for Remote Sensing Workflows. In Proceedings of the IGARSS 2025—2025 IEEE International Geoscience and Remote Sensing Symposium; IEEE: Piscataway, NJ, USA, 2025; pp. 1084–1089. [Google Scholar] [CrossRef]
- Yao, L.; Liu, F.; Xu, S.; Zhang, C.; Min, R.; Di, S.; Zheng, Y. RemoteZero: Geospatial Reasoning with Zero Human Annotations. arXiv 2026, arXiv:2605.04451. [Google Scholar] [CrossRef]
- Yao, L.; Liu, F.; Lu, H.; Zhang, C.; Min, R.; Xu, S.; Di, S.; Peng, P. Remotereasoner: Towards unifying geospatial reasoning workflow. Proc. AAAI Conf. Artif. Intell. 2026, 40, 11883–11891. [Google Scholar] [CrossRef]
- Munir, M.A.; Sheikh, M.U.; Shabbir, A.; Khan, M.H.; Khan, F.; Zhu, X.X.; Demir, B.; Khan, S. Agentic AI for Remote Sensing: Technical Challenges and Research Directions. arXiv 2026, arXiv:2604.24919. [Google Scholar] [CrossRef]
- Hu, H.; Wang, P.; Feng, Y.; Wei, K.; Yin, W.; Diao, W.; Wang, M.; Bi, H.; Kang, K.; Ling, T.; et al. RingMo-Agent: A unified remote sensing foundation model for multi-platform and multi-modal reasoning. arXiv 2025, arXiv:2507.20776. [Google Scholar]
- Hashemi, M.; Züfle, A. A Comprehensive Survey of Agentic AI for Spatio-Temporal Data. Preprints 2026, 2026012236. [Google Scholar] [CrossRef]
- Fang, K.; Feng, H.; Cai, J.; Xia, Y.; Zhao, Y.; Wen, Z.; Wang, W.; Khowaja, S.A.; Dev, K.; Gadekallu, T.R. Large Language Models for Interpreting the Earth: A New Paradigm from Perception to Decision Intelligence. 2026. Available online: https://www.techrxiv.org/doi/pdf/10.36227/techrxiv.176857876.68610969/v1?onload=true (accessed on 9 June 2026).
- Sun, Z.; Zhou, Y.; Yang, J. An LLM-based Multi-Agent System for Remote Sensing Analysis. Big Earth Data 2026, 10, 1–25. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Liu, P.; Zhuang, R. Remote Sensing Agent: Reshaping the Paradigm of Remote Sensing Information Processing. Remote Sens. 2026, 18, 1980. https://doi.org/10.3390/rs18121980
Liu P, Zhuang R. Remote Sensing Agent: Reshaping the Paradigm of Remote Sensing Information Processing. Remote Sensing. 2026; 18(12):1980. https://doi.org/10.3390/rs18121980
Chicago/Turabian StyleLiu, Peng, and Rongkai Zhuang. 2026. "Remote Sensing Agent: Reshaping the Paradigm of Remote Sensing Information Processing" Remote Sensing 18, no. 12: 1980. https://doi.org/10.3390/rs18121980
APA StyleLiu, P., & Zhuang, R. (2026). Remote Sensing Agent: Reshaping the Paradigm of Remote Sensing Information Processing. Remote Sensing, 18(12), 1980. https://doi.org/10.3390/rs18121980

