- Article
Robust and Fast Sensing of Urban Flood Depth with Social Media Images Using Pre-Trained Large Models and Simple Edge Training
- Lin Lin,
- Zhenli Zeng and
- Chaoqing Tang
- + 2 authors
Accurately estimating urban floodwater depth is a critical step in enhancing urban resilience and strengthening disaster prevention and mitigation capabilities. Traditional methods relying on hydrological monitoring stations and numerical simulations suffer from limitations such as sparse spatial coverage, insufficient validation data, limited accuracy, and delayed fast performance. In contrast, social media data—characterized by its vast volume and fast availability, can effectively compensate for these shortcomings. When processed using artificial intelligence (AI) algorithms, such data can significantly improve credibility, disaster perception speed, and water depth estimation accuracy. To address these challenges, this paper proposes a robust and widely applicable method for rapid urban flood depth perception. The approach integrates AI technology and social media data to construct an AI framework capable of perceiving urban physical parameters through multimodal big data fusion without costly model training. By leveraging the near real-time and widespread nature of social media, an automated web crawler collects flood images and their textual descriptions (including reference objects), eliminating the need for additional hardware investments. The framework uses predefined prompts and pre-trained models to automatically perform relevance verification, duplicate filtering, object detection, and feature extraction, requiring no manual data annotation or model training. With only a minimal amount of water depth annotated data and compressed cross-modal feature vectors as training input, a lightweight Multilayer Perceptron (MLP) achieves high-precision depth estimation based on reference objects. This method avoids the need for large-scale model fine-tuning, allowing rapid training even on devices without GPUs. Experiments demonstrate that the proposed method reduces the Mean Square Error (MSE) by over 80%, processes each image in less than 0.5 s (more than 20 times faster than existing large-model approaches), and exhibits strong robustness to changes in perspective and image quality. The solution is fully compatible with existing infrastructure such as surveillance cameras, offering an efficient and reliable approach for fast flood monitoring in urban hydrology and water engineering applications.
17 November 2025






