Open Access This article is
- freely available
ISPRS Int. J. Geo-Inf. 2018, 7(4), 147; https://doi.org/10.3390/ijgi7040147
Foreword to the Special Issue on Machine Learning for Geospatial Data Analysis
Department of Civil, Environmental and Geomatic Engineering, Institute of Geodesy and Photogrammetry, ETH Zürich, Stefano-Franscini-Platz 5, 8093 Zürich, Switzerland
Remote Sensing Group, IGG, University of Bonn, Nussallee 15, 53115 Bonn, Germany
Swiss Data Science Center, ETH Zürich, Universitätstrasse 25, 8006 Zürich, Switzerland
Crop and Environmental Sciences Department, Harper Adams University, Edgmond, Newport TF10 8NB, UK
Author to whom correspondence should be addressed.
Received: 9 April 2018 / Accepted: 10 April 2018 / Published: 13 April 2018
Advances in machine learning research are pushing the limits of geographical information sciences (GIScience) by offering accurate procedures to analyze small-to-big GeoData. This Special Issue groups together six original contributions in the field of GeoData-driven GIScience that focus mainly on three different areas: extraction of semantic information from satellite imagery, image recommendation, and map generalization. Different technical approaches are chosen for each sub-topic, from deep learning to latent topic models.
Keywords:geospatial machine learning; big data; classification; remote sensing; GIS; GIScience
Machine learning is revolutionizing digital and data-intensive disciplines by offering tools to analyze and extract valuable information from very large quantities of unstructured data. Geographical Information Sciences (GIScience) in general are no less affected by such change, and are among the disciplines that could benefit the most from tailored machine learning solutions. Modern GIScience research is characterized by very large and unstructured sources of geolocated data, from which it is often required to extract high level information in the form of spatial semantics, spatial object relationships, trajectories, or more generally, numeric tags associated to objects embedded in geographical coordinates.
Modern machine and deep learning together with the rapid development of hardware and open source software libraries allows us to approach geospatial applications that were beyond reach a few years ago. Methods must scale to massive amounts of data, for both training and prediction steps. In parallel, the increasing availability of cloud computing services and affordable graphics processing units (GPUs) eases accessibility to huge computing power. In light of these trends, the geospatial community is moving quickly towards data-driven (deep) machine and learning tools to solve challenging open research questions.
This Special Issue assembles six novel contributions in different areas of GeoData-driven machine learning. Topics span different disciplines of GIScience: generation of street address from satellite imagery , land-cover classification of polarimetric Synthetic Aperture Radar (PolSAR) images , extraction of buildings from maps to perform generalization , land-cover classification from satellite image time series , automatic selection of buildings based on cartographic constraints  and satellite image retrieval and recommendation .
In addition to tackling different applications, the papers employ a variety of machine learning tools to accomplish their goals. Deep learning is used in two works [1,4]: in the former, a convolutional neural network extracts roads from satellite images as an initial step to generate street addresses, while in the latter, a sequential convolutional recurrent neural network provides robust end-to-end land-cover and land-use mapping from satellite image time series. Decision trees in the form of single decision trees  or random forests  are used for building detection in the first and land-cover classification in the second contribution. Lee, et al.  also test other standard machine learning approaches such as support vector machines, k-nearest-neighbor and naïve Bayes classification to explore which methodologies are most appropriate for map generalization purposes. Map generalization is also at the core of , who approach building selection with genetic algorithms that are constrained by cartographic and contextual knowledge. Finally, Zhang, et al.  use a latent topic model including space and time to retrieve and recommend remote sensing imagery.
From these contributions it is apparent that machine learning offers solutions that are highly competitive when compared to humans. Although not yet rivalling the human performance in terms of accuracy on several applications, such methods allow us to process and interpret very large quantities of data, which would be impossible to treat manually. For instance, the address generation methods in Demir, et al.  could be scaled at a global level or the method presented in Russwurm and Körner  could be extended to very long time series and large geographical areas with minimal user intervention. Modern applications of machine learning should always allow for an open door for flexible inclusion of more data or deployment to large scale datasets.
We currently observe that machine learning tools are quickly becoming standard for analyzing geospatial data. Widespread use of machine learning across a large variety of disciplines fosters collaborative research efforts. Even colleagues new to the field can quickly learn and apply machine learning due to many well-designed free tutorials and access to open source software libraries on the web. In order to further boost research in machine learning applied to GIScience, there is a need to share data and source code publicly, to design and maintain convincing benchmarking activities, and to validate methods quantitatively on open datasets of realistic (large) size.
The guest editors would like to thank all the authors for their excellent contributions, the reviewers for the timely and constructive feedbacks and the assistant editors of the ISPRS International Journal of Geo-Information for the kind and prompt help.
All authors contributed equally to the writing of the editorial.
Conflicts of Interest
The authors declare no conflict of interest.
- Demir, I.; Hughes, F.; Raj, A.; Dhruv, K.; Muddala, S.M.; Garg, S.; Doo, B.; Raskar, R. Generative street addresses from satellite imagery. ISPRS Int. J. Geo-Inf. 2018, 7, 84. [Google Scholar] [CrossRef]
- Hänsch, R.; Hellwich, O. Classification of PolSAR images by stacked random forests. ISPRS Int. J. Geo-Inf. 2018, 7, 74. [Google Scholar] [CrossRef]
- Lee, J.; Jang, H.; Yang, J.; Yu, K. Machine learning classification of buildings for map generalization. ISPRS Int. J. Geo-Inf. 2017, 6, 309. [Google Scholar] [CrossRef]
- Russwurm, M.; Körner, M. Multi-temporal land cover classification with sequential recurrent encoders. ISPRS Int. J. Geo-Inf. 2018, 7, 129. [Google Scholar] [CrossRef]
- Wang, L.; Guo, Q.; Lui, Y.; Sun, Y.; Wei, Z. Contextual building selection based on a genetic algorithm in map generalization. ISPRS Int. J. Geo-Inf. 2017, 6, 271. [Google Scholar] [CrossRef]
- Zhang, X.; Chen, D.; Liu, J. A space-time periodic task model for recommendation of remote sensing images. ISPRS Int. J. Geo-Inf. 2018, 7, 40. [Google Scholar] [CrossRef]
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).