remotIO: A Sentinel-1 Multi-Temporal InSAR Infrastructure Monitoring Service with Automatic Updates and Data Mining Capabilities

: Multi-temporal synthetic aperture radar interferometry (MT-InSAR) is nowadays a well-developed remote sensing technique for monitoring of Earth’s surface deformation. The availability of regular and open Copernicus Sentinel-1 satellite data with enhanced spatiotemporal coverage has recently stimulated several initiatives for development of new monitoring services which can help to respond to emergencies faster and apply resilience measures more accurately as compared to conventional ground-based techniques. In this paper, the alpha version of the remotIO (Retrieval of Motions and Potential Deformation Threats) system is presented. It is currently able to provide continuous and autonomous updates of MT-InSAR results and post-processing methodology over sites with active deformation hazards to ease the interpretation and facilitate decision-supporting tools for on-time situational awareness. Our post-processing approach implemented in remotIO’s web application has proven to be useful in ﬁltering the resultant deformation maps and in pinpointing problematic zones with potential ground deformation threats also over low-coherent areas.


Introduction
In the last decades, synthetic aperture radar interferometry (InSAR) has proven to be a cost-effective technique for measuring subtle deformation with precision at the order of a centimeter to as much as a few millimeters. InSAR employs satellite radar images acquired by sensors with all-weather, day/night, high spatial resolution and wide area coverage imaging capabilities. Furthermore, satellite radar images have been acquired for almost 28 years already, and new images are continuously collected all over the world.
Conventional in-situ monitoring techniques provide point-wise information of the displacement activity; however, they often lack efficient and timely updates. InSAR as a remote sensing technique could complement or sometimes outperform ground-based methods by its ability to monitor tiny surface deformations over wide-areas without the need for in-situ observations or special equipment installation in the areas of interest.
The spatial distribution of velocities as visible from a standard point cloud map might be affected by noises, which also depends on the quality of data processing and on the ability of InSAR interpreter to analyze the data. While producing final deformation maps, usually thresholds are applied in order to sample only highly coherent radar targets. Thresholding on coherence might, however, cause misinterpretation of the results over areas undergoing more complex deformation scenarios. If the discrepancies in the areas of moderate coherence share similar behavior in terms of estimated parameters, it is important to take into account their spatial correlation for correct inference.
Linear trends (i.e., displacement velocities) might be used to analyze the movements; however, alone, they are not providing information on the probability of movement for intervening areas. Velocities alone are sometimes not sufficient for interpretation purposes, especially for those areas that are affected by other than (usually assumed) linear movements.
This research aims to further develop a proposed approach [36] for the detection of different deformation patterns caused by different triggering factors, which are not recognizable by conventional analyses of the average line-of-sight (LOS) velocities alone.
For a better disentangling between technological constraints of InSAR and actual deformation processes, it is necessary to develop an effective identification methods and event recognition analysis to adopt appropriate risk mitigation strategies. Although various methodologies were implemented to map potentially hazardous areas from MT-InSAR displacement maps [37][38][39][40][41][42][43][44], most of them are adversely affected by deficiencies: • They are based on the assessment of a single variable, e.g., velocity. • They rely only on displacement time series analysis, producing less reliable outputs for smaller or non-equally sampled datasets, while ignoring underlying spatial relations. • They lack robustness for performing analysis solely on a set of highly coherent points. • They require manual handling of the data in different software or Geographic Information Systems (GIS).

•
They incorporate distortion to the spatial composition of point networks by involving interpolation or gridding. • Their application is limited to a one-off activity and utilization of a single technique (e.g., clustering only).

•
They do not support the analysis of more complex deformation scenarios over low-coherent areas.
Research and development of this topic aims to propose an effective way of extracting useful information from Multi-temporal InSAR deformation maps while providing an innovative approach for detection of outliers and potential deformation threats and for correct handling of the information over low-coherent areas, as also recommended in [45].
Our proposed data mining approach [36] allows for minimization of outliers in final results while preserving spatial dependency among low-coherent observations. Such localities might represent areas with fast deformation processes, seasonal changes and other effects or represent noise and inconsistencies added into the final results. These would remain undetected, especially over wide areas monitored by Sentinel-1, as their coherence is often weakened and by the rules of standard thresholding procedure they would be discarded. The information and full range of coherence over low-coherent areas might, however, be exploited in a similar way as the coherence is used in thematic mapping applications such as change detection.
While the operational nature of Sentinel-1 opens for the exploitation of InSAR data on an unprecedented level, the InSAR community faces new challenges [46], among them large data volumes and inadequate tools. The research and development of both the methodological approach and remotIO's web-based platform are sought to enhance good practice and user experience and promote the diffusion of data derived from Sentinel-1 interferometry.

Study Area and Data
Within the scope of remotIO project, continuous MT-InSAR monitoring and all displacement products are provided for following pilot scenarios in Slovakia:

Methods
The remotIO (Retrieval of Motions and Potential Deformation threats) system has been developed under European Space Agency PECS (Plan for European Cooperating States) project in Slovakia. The remotIO platform is currently capable of providing autonomous Multi-temporal InSAR (MT-InSAR) processing and generation of regularly updated displacement maps.

General Platform Design
The service components are integrated into one system which is composed of: (a) Data Collection Server and Database Server; (b) Data Processing Server; and (c) Web server ( Figure 1). The system provides computational resources, data storage and tools for pre-processing and automated MT-InSAR processing and generation of added value products which are published on the Web server.
The data analytics services of the remotIO system represent a set of tools operating at Data Processing Server layer of the system (Figure 1, pink layer). These tools are utilized for performing MT-InSAR analysis and retrieval of displacement maps and displacement time series over monitored sites. Analytics services are also responsible for regular and automatic updates of the displacement products after initial processing is carried out. Finally, analytics services allow for post-processing of deformation products and generation of added value products, as described in Section 4.

Pre-Processing
The initial MT-InSAR-processing for each site is performed as the first step [1,2]. The processing chain for initial MT-InSAR processing is summarized in Figure 2. All available Sentinel-1A/B SLC images (with processing unit corresponding to approximately 20 km × 20 km crop) acquired over the areas of interest (Table 1) are co-registered on the sampling grid of a Master acquisition [47]. Single Master image is selected for each track based on optimal distribution of normal and temporal baselines. Precise Orbit Ephemeris (POE) are preferably used for the processing if available. Since precise orbits are only published after 20 days from the actual acquisition, for images acquired in a span of last 20 days, the only applicable are Restituted Orbits which are available few hours after satellite image acquisition. The digital elevation model SRTM-3 (Shuttle Radar Topography Mission) is used for removal of the topographic phase component. To get precise geolocation of PS (Persistent Scatterer) points, single Ground Control Point (GCP) is selected [48][49][50]. Full-resolution data are analyzed on a pixel-by-pixel basis to identify a sparse grid of points, usually corresponding to man-made structures (e.g., buildings, bridges, monuments, antennas, poles, conducts, etc.). The amplitude dispersion is used for pre-selection of the persistent scattering points for further processing [1,2]. A threshold on amplitude dispersion (0.42) [51] is used to create a network of pre-selected points for estimation of preliminary parameters and Atmospheric Phase Screen (APS). Considering the time periods, a large amount of acquisitions and the applied post-processing filtering, a slightly relaxed threshold of 0.42 is used to increase the densities of analyzed points (including scatterers that might be incoherent for short periods of time, relative to dataset timespan). After APS removal, phase time series are analyzed. Two key parameters are estimated here: residual height and annual displacement velocity. As with classical methodology of MT-InSAR [1,2] analysis, a linear model assumption for the deformation estimates is used. Finally, for each PS, a displacement time series related to the reference point identified in the stable area are estimated. Thanks to availability of both sensing geometries (ascending/descending), Line-of-Sight (LOS) velocities are decomposed into vertical and horizontal component in descending azimuth look direction [52] where a three-step approach is applied: • LOS velocities are re-sampled into regular grid of 50 m × 50 m.

•
The grid cells are assigned with the average of all LOS velocities allocated within individual cells.

Automated Processing
After initial MT-InSAR processing is performed for each site (Section 3.2), the set of software triggers controls each step of the system and handles processing of updated data stacks automatically ( Figure 2). Data Collection Server is updated regularly every 4 h. After new image is allocated and downloaded in the main data repository, the Database Server updates the list of stored images. Processing Server regularly checks the database of available imagery every 1 h, and, if a new image is available from respective orbit track and monitored site, the automatic processing is performed. Software triggers are operating in periods of 4 h for downloading of the data, 1 h for checking the availability of new imagery and 3 h on average to carry out new processing. For the areas not exceeding approximately 80 km × 80 km, new results are produced every six days based on revisit periods of Sentinel-1A/B and within an 8-h window after every new acquisition is made available on Sentinels Scientific Data Hub (SciHub).
Currently, SARproZ software [53] is used to perform automatic processing, however any InSAR software can be used in processing chain (see MT-InSAR processing chain in Figure 2). Processing is controlled by programmatic scripts which are run from the Matlab command window. In such a way, it is possible to control each step of MT-InSAR processing, from mean reflectivity map generation, through ground control points selection, atmospheric phase screen estimation and removal, to generation of standardized Comma-Separated Value (CSV) files with all estimated parameters and displacement time series. CSV files are then linked to Web Server, where results are visualized in form of interactive displacement maps. The communication between individual components of the system (Figure 2) is maintained by the set of algorithms which are operating continuously as background processes. The solution for continuous monitoring is fixed to the time of initial pre-processing of available data stack (see Section 3.2) in order to provide computationally and timely efficient updates. Newly acquired images are thus added to an existing solution and existing data stack. Pre-selected points for processing are not identical, although they are pre-selected with the same value of amplitude dispersion (0.42) [1,2,20,51]. New persistent scatterers will thus appear in an updated product, while identical points from new and previous datasets are tracked, and, if their coherence is significantly lost (or they will not appear in an updated dataset), the points are assigned with "Out of Service" label (see Analytical Tools Toolbar in Figure 3).
Standard remotIO's outputs are geolocated point clouds of PS points (Levels 1-3), or regularly sampled grid of points (Level 0) with a corresponding unique identification, estimated displacement parameters and their precision, quality indicators, ETRS-89 coordinates, local datum coordinates as well as physical heights and decision-support parameters. Metadata (acquisition geometry-related, e.g. incidence and heading, master date, next acquisition dates and reference point coordinates) are included in header. Comma-separated value (CSV) files for all product tiers (Levels 0-3) are downloadable directly from the platform ( Figure 3). All full-resolution products ( Table 2) are updated continuously.
Displacement maps with Line-of-sight (LOS) velocities from ascending and descending orbit passes constitute a Level 1 product tier of the remotIO platform (Table 2), while Level 2 is extended for full displacement time series which are interactively visualized in GUI of the platform. LOS velocities decomposed into horizontal and vertical displacement velocity components [52,55] represent Level 0 tier (Table 2). Level 3 product tier (see Section 4) contains all Level 2 components, with addition of decision-support parameters (Section 4.2), which are represented by analytical tools, namely data mining filtering mode, low-coherent alerts, displacement alarms, risk ratings and current condition parameters, as shown in Figure 3.

Post-Processing
The standard quality indicator of displacement velocity and residual height estimates is the ensemble temporal coherence. The widely used approach in reporting the final MT-InSAR results is to impose a simple threshold (e.g., >0.7) on its value. However, some points could contain important information even when their temporal coherence is lower.
For points with low temporal coherence, the results are not reliable as the functional model employed within processing of InSAR time series is not matching the actual observations. Low values of temporal coherence are usually associated with limitations and constraints of InSAR technology [1,2,20,21]. Points with lower temporal coherence might also represent areas where deformation processes progress rapidly, in a way that was not assumed in applied mathematical models. In some cases, and specifically over small areas, reliable estimates in low-coherent zones could be obtained by extending the standard linear model for deformation estimates [1,2], by re-assessing solution space boundaries or by constructing complex mathematical models which better describe real deformation scenario. Wrong model drops the estimate of ensemble coherence, biases the estimate of parameters and increases chances of PS candidates rejection, even though it is in fact a PS (e.g., undergoing strong non-linear deformation), which needs to be considered during re-assessment of the processing strategy and applied models.
Our goal is to perform autonomous post-processing analysis of Multi-temporal InSAR results via multivariate analysis of all major variables (such as velocity, residual height, temporal coherence, etc.) and: (i) search for outliers; (ii) decide about removing/leaving them inside the dataset; and (iii) reduce noise incorporated in the final results. The primary aim is to look for the characteristics that could lead to the optimal results in terms of quality and interpretation purposes. Some outlying measurements could point the attention of an expert user towards the locations affected by motions which are un-modeled (e.g., non-linear displacements, seasonal fluctuations, sudden changes such as in the case of landslide activation, etc.). After detecting anomalous behavior, the processing strategy of expert users can be refined and real deformation profiles could be extracted for these areas. If the potential deformation threat has been identified, the non-expert user should be aware of such a situation in order to adopt appropriate prevention activities. Another advantage is the possibility for such post-processing procedure to operate autonomously and generate timely information for monitored sites. The research and development step was associated with the existing prototype first formulated in [36]. The techniques implemented in [36] allowed for extraction of location-, data-and application-driven outliers and has proven to be useful for extracting PS points with strong ground deformation processes, although their coherence is weakened and by the rules of standard thresholding procedure they would be discarded. However, these techniques were not tailored for automatic updates of MT-InSAR analysis over pilot test sites and do not account for easy-access to data, intuitiveness in interpreting the results and effectiveness in terms of providing timely information for better situational awareness. The research and development process was focused on implementing different data-mining strategies (see Section 4.1) to analyze Multi-temporal InSAR point clouds in various applications over pilot test-sites to possibly identify areas with significant movement. The modules implemented in [36] were rectified and a systematic methodology is developed and summarized in following sections.
The development process allowed for designing a set of decision-supporting tools for detection and easy interpretation of problematic areas (Section 4.2).

Data Mining Methodology
Given a set of persistent scatterers (PS) that have undergone standard MT-InSAR processing (Figure 2), the resultant spatial data are represented by multiple variables estimated in the PS position. These variables are: the annual displacement velocity, height or residual height with respect to reference DEM and corresponding standard deviations and temporal coherence, all with respect to an arbitrary reference point. The proposed data mining methodology consists of five consecutive steps, entailing outlier candidate detection, all nearest neighbors search, multivariate analysis in local neighborhood, decision making process and time series warning (see workflow diagram in Figure 3). The goal of outlier detection is to separate a core of reliable observations from outlying ones. Assume a sufficiently large sample size from a population with a finite level of variance. In the initial step, all MT-InSAR estimates within population are tested for normal distribution (population size and processing time requirements are summarized in Section 4.1.6, Table 5). We aim to analyze observations which are far from the fitted distribution. As the first step, a robust covariance estimate to data is applied, and a confidence region (ellipse in 2D, Figure 4) to the central data points is fitted. For each dataset, we consider that 40% of samples might correspond to noise and points outside the confidence region are considered outlier candidates (OCs). The whole process estimates the inliers in a robust way, while the robust form of Mahalanobis distance (or robust distance) is used to derive a measure of outlyingness.
If we assume that the data are d-dimensional and are stored in an n × d data matrix X = (x 1 , . . . , x n ) T with x i = (x i1 , . . . , x id ) T we can compute robust estimates of covariance by the Minimum Covariance Determinant (MCD) method [56][57][58]. For explanatory purposes, we stick to two-dimensional data and variables of velocity and residual height (Figure 4). The MCD estimator, firstly introduced by the authors of [56,57], is defined as the mean and the covariance matrix of the half set H containing h observations whose covariance matrix has the smallest determinant. The usual value of h, i.e. the one which achieves the highest breakdown point, is h = n samples +n variables +1 2 , where n samples is the sample size and n variables is the number of variables. In general, all h-subsets cannot be considered, and one must rely on approximate algorithms, such as those introduced by computationally efficient FAST-MCD algorithm [58]. A FastMCD algorithm [58,59] was implemented for the computation of MCD estimator. outliers. As indicated in above paragraph, the goal is to find n samples +n variables +1 2 observations whose empirical covariance has the smallest determinant. Such "pure" subset of observations is then used to compute standard estimates of location and covariance. The locations or the robust distances accurately reflect the true distribution of the observations [56]. Minimum Covariance Determinant (MCD) is a robust estimator and guarantees that the estimation is resistant to erroneous observations. A high level of statistical significance α= 0.4, or a so-called statistical tolerance of 60%, represents the assumption that 40% of measurements might correspond to erroneous observations. This is given by MCD(x) ≤ χ 2 d,0.6 where χ 2 d,0.6 is the 0.6 quantile of the chi-squared distribution with d degrees of freedom. The argument of χ 2 d,0.6 represents a threshold for which data points becomes inliers or outliers ( Figure 5). This proportion is the value given to the contamination parameter (i.e., outliers fraction) of the algorithm. Inliers are labeled 1, while outlier candidates are labeled -1 and this information is stored in 'Evaluation' column of the rejection table (Table 3, Section 4.1.4).
Such separation of data into inliers and outliers does not imply that we are straight discarding the outliers. The outliers are further analyzed, so we instead prefer to use the term outlier candidates (OCs). OCs are not necessarily errors. The unusually high level of statistical tolerance in place (Figure 4) is justified by the fact that outlier candidates are further analyzed in accordance with spatial properties of the points in their neighborhood. This has an inherent benefit for the initial selection of the points for MT-InSAR processing, as parameter settings using, for example, amplitude dispersion [1,2] might be loosened to increase the densities of analyzed points.

Step 2: All Nearest Neighbors Search
Performing outlier detection methodology as the first step of the proposed procedure is efficient since it is not necessary to perform clustering and/or nearest neighbor queries for every point inside the dataset, as proposed in [36]. Finding all nearest neighbors is performed in 50-m radius of every OC point. The radius should be chosen large enough with respect to data resolution (50-100 m for Sentinel-1), such that it still yields the same segmentation but maximizes the number of visited neighbors per query. All nearest neighbor search inside a given radius is an integral part of our approach since we aim to evaluate the underlying spatial relations of OCs among observations in the neighborhood. The fundamental output of this step is a list of neighboring points for each OC inside a dataset.
As a spatial-partitioning data structure, a kD-tree method [60,61] was selected as the most efficient data structure for arbitrary dimensions with quick lookup times for radially bounded queries. kD-trees are a widely used approach for performing nearest neighbors search in spatial data analysis, and research is currently concentrated on memory-efficient implementations for large-scale datasets. The expected complexity of the nearest neighbor lookup is O (log (n)) and the construction time of a kD-tree is O (n log (n)) [60].

Step 3: Multivariate Analysis in Local Neighborhood
The variables defined in location of each OC are compared towards variables of the neighboring points. Clusters of outliers ( Figure 5) which are identified by looking for connected components of undirected graph, employing graph theory [62], are the main output of this step (Step 3).
The OCs are tested for the presence or non-presence of outliers in the neighborhood based on the evaluation labels (Table 3) provided from MCD method (Step 1) and list of neighboring points from kD-tree method (Step 2). In Step 3, the OCs with the presence of other outliers in the neighborhood are segmented into wider groups and are being evaluated separately. The segmentation of dataset into outliers clusters is performed to analyze statistical properties of OCs against representative proportion of PS points in spatial domain, since systematic errors might cause clusters of PSs separated by a large distance to behave differently from what is being anticipated in mathematical model applied amongst the majority of points [63,64].
All OCs which were not classified as grouped are considered isolated. Each isolated outlier with temporal coherence greater than selected minimum is kept inside the dataset. For OCs in groups and isolated outliers, the comparison is performed only with neighbors labeled as inliers (Evaluation = 1), in order to match the behavior of outliers towards proportion of the "normal" data. If the value for particular variable deviates more than the rejection criterion [65] of absolute deviation from the median of all inliers in the neigborhood [36,66], it is labeled 1. This way, OCs' variables are labeled inside the rejection table (Table 3) with 0 (non-deviating) or 1 (deviating). Such labels are used to identify spatially correlated clusters and separate them from noise (Step 4). Outliers clusters with common statistical properties (Step 4) are sought to be preserved in final results as potential indicators of anomalous behavior. The re-assessment of the processing methodology might be required for low-coherent inliers (as discussed in Section 6), which is also the reason they are designated as potential 'problems' inside the 'Decision Label' of rejection table (Table 3) and in the following Step 4.

Step 4: Decision Making Process
The process of extracting useful information from OCs is represented by decision making-process ( Figure 5) performed on the basis of rejection table (Table 3). The variable columns (Velocity, Height, etc.) are pre-assigned with 0 or 1 based on median absolute deviation [36,66] from prevailing values of inliers (Evaluation = 1) in the neighborhood (radius) or inside the cluster of outliers. If there are no inliers, OCs are considered isolated ('Isolated problem'). If the comparison with inliers in neighborhood does not exceed the rejection criterion [65,66], a point is kept inside a dataset as 'Unknown problem'. For outlier candidates (OCs) where individual variables exceed the rejection criterion, the problematic variables are flagged 1 ('Height problem' or 'Velocity problem'). 'Low-coherent Alerts' (Table 3) are indicated for groups of OCs sharing most common behavior, indicating that the respective part of the monitored area exhibits similar statistical properties and might represent a potential deformation threat.
Pre-assignment of labels for problematic variables in each outlier candidate (OC) and groups of outlier candidates are used to form classes 'Decision' and 'Decision Labels' ( Table 3). The example in Table 3 is limited to variables of velocity and residual height as those constitute the primary estimates of MT-InSAR methodology. Decision classes are further utilized for designing set of decision-supporting tools which are accessible from the Graphical User Interface (GUI) of the remotIO web-app via the 'Analytical Tools' toolbar ( Figure 9).   Along with the multiple estimated parameters (displacement velocity and residual height for the scope of this paper), each PS point disposes of displacement time series, updated regularly inside remotIO platform. Since the interest of end-users is mainly focused on obtaining information on the current state of the monitored area, the time series warning step is dedicated to evaluating the behavior of the most recent observation. The time series analysis is based on the approach proposed by Raspini et al. [28]. Our approach is designed to evaluate the amount of deviation of the most recent observation with respect to the linear trend of observations in the last 30 days. Likewise, the total amount of deviation from the linear trend of historical data over arbitrary time-span can be used to set early warning mechanism. The procedure holds for ( Figure 6):  For each PS point, the 'Delta' parameter value is estimated and assigned with a sign (+/−) and is then used to assign parameters for visualizing different characteristics of current warning state (Table 4)  For the displacement alarms exceeding the phase change detectable in a single pixel (half the wavelength, i.e. 28 mm), the possibility of unwrapping error cannot be rejected. Moreover, considering the worst case scenario of scatterer with Signal-to-Clutter ratio as low as 4 dB, the three-sigma confidence bounds yield approximate threshold of 15 mm for false alarm rejection. The intervals of 'Risk ratings' have been set in accordance with end-user requirements and observed deformation scenarios over pilot monitoring sites (see Section 5). Although we use the term 'risk', we are actually referring to the 'vulnerability' (Minor, Moderate, Significant, and Major) of detected zones where displacement alarms are triggered.

Comparison with Previous Version
Since the foundations of post-processing methodology were firstly introduced by Bakon et al. [36], changes introduced in current implementation are summarized in Table 5. The execution times of Bakon et al. [36] and newly developed version were tested on datasets with different total amount of points. The computations were performed on a server setup with the following characteristics: 24 CPUs (Intel R Xeon R CPU E5-2680 v4 @ 2.40GHz), 256GB RAM (16 × 16 GB DIMM DRAM EDO), 1 TB system SSD for processing and OS GNU/Linux Ubuntu 18.04 LTS. The results are shown in Table 5.

Added-Value Products
'Low-coherent Alerts', 'Displacement Alarms', 'Risk Ratings' and 'Current Condition' parameters (Section 4.1) are utilized for designing a set of decision-supporting tools which are accessible from the Graphical User Interface (GUI) of the remotIO web-app via 'Analytical Tools' toolbar under Level 3 product tier of the remotIO's framework. A set of GUI operators (Figure 9) allows end-users to further explore information stored over filtered deformation maps.
After activating the 'Data mining mode' under Level 3 product tier, final results are filtered and could be superimposed with added-value information ('Low-coherent Alerts', 'Displacement Alarms', 'Risk Ratings' and 'Current Condition'). When this information is matched together (see Section 5), the end-user should be able to easily interpret results which are continuously and regularly updated.
Such methodology is beneficial for improving point densities, for tackling outliers with extremes in any variable and focusing attention towards evaluation or re-assessment of low-coherent areas.
With our approach, the final set of information provided by MT-InSAR analysis is enhanced for statistical layers of: (i) low-coherent inliers and 'Low-coherent Alerts' which directs the attention of end-users towards problematic areas or zones with potential deformation threats; (ii) 'Displacement Alarms' to evaluate the progress of displacement time series in recent periods; (iii) 'Risk Ratings' to assess the amount of displacement in the most recent satellite acquisition; and (iv) 'Current Condition' for overview of the actual movement tendency (up/down) of the observed point. As a result, the final set of standardized products provided under the remotIO's decision-support framework is composed of three layers in comparison to traditional procedures, as schematically illustrated in Figure 7.

Results
Several case studies from monitoring urban environments are presented in this work in order to demonstrate the ability of our data mining approach to filter final deformation maps. The use of a full range of a coherence value enabled the detection of areas affected by significant movements or added inconsistencies. The preliminary results over three pilot monitoring scenarios ((1) Landslides; (2) Mining subsidence; and (3) Dams) are presented in this section.

Landslides (Upper Nitra; Hradec, Velka and Mala Lehotka)
Since conventional terrestrial techniques provide only point-wise information on the landslide's activity without timely and efficient updates, the use of MT-InSAR technology as implemented within remotIO framework is foreseen to improve situational awareness in the area. For this particular case, our product can help demarcate distinct areas of motion; better understand the surface deformation pattern of landslides and its spatial and temporal evolution; and assess the success of remediation works. Monitored movements in the interval of several millimeters up to two centimeters per year are ideally traceable with mid-resolution C-band data of Sentinel-1.
remotIO's pilot scenario for the continual monitoring case are the areas of Prievidza city districts Hradec, Vel'ká and Malá Lehôtka affected by recent activation of these geohazards. Ongoing landslides activity and its consequence (damaged buildings and infrastructure) emphasize the requirement for operational capabilities of monitoring schemes. Moreover, areas of lower coherence due to accelerating or decelerating movements are to be expected; hence, our data-mining approach could help attain sufficient attention towards problematic sectors for deeper investigation.
The PS deformation maps of remotIO Level 1 products ( Figure S1a,b) are visualized in the area of 2.5 km × 2.5 km (6.25 km 2 ). The reference point is set in the stable part of Prievidza city, 1 km from the area of interest. The areas affected by strong deformation process (up to 2 cm/year) are evident over the vast majority of Hradec and Velka Lehotka suburbs. Several PSs in central part of Hradec village (northeastern one) even exceed the velocity of 2 cm/year.
Thanks to the availability of both sensing geometries (ascending and descending), a decomposition to vertical and horizontal component in descending azimuth look direction is possible [52].
The deformation pattern detected in ascending track (uplift) progresses in the opposite direction to descending track (subsidence). This is a consequence of prevailing horizontal (westward) direction of actual displacement vector, as illustrated by remotIO Level 0 product in Figure S2.
Since observed landslide is substantially active and several buildings are subject to continual damage, assessment of remediation works and operational routine monitoring is of utmost importance. The capabilities of decision-support analytics under the Level 3 product tier of the remotIO platform are presented here for the area of Hradec village ( Figure S1). The areas affected by active deformation processes (up to 2 cm/year, show in red) are visible over the vast majority of Hradec suburb from Sentinel-1's descending track No. 51 ( Figure S1). Filtering capabilities of 'Data mining mode' and proposed methodology are presented in Figure 8. Higher-order products for final MT-InSAR analysis provided by remotIO's decision-support framework.
As observed in Figure 8, many of the points are discarded with thresholding on temporal coherence (>0.7). More than 300 points and additional alerts and alarms (as summarized in Table 6) from both orbit tracks are preserved in final results when data mining is used. Besides increasing the amount of preserved points in final results, there are several 'Low-coherent Alerts' in the area and some 'Displacement Alarms' with 'Major' risk ratings that are concentrated in the southern part of the village, as showcased in remotIO's web-app environment in Figure 9. Several buildings would have been left without any information if a simple thresholding procedure were applied. Additionally, newly preserved 'Low-coherent Alerts' and 'Displacement Alarms' might be crucial for the identification of buildings affected by motions in very recent periods.
Detailed and frequent remotIO's Multi-temporal InSAR displacement time series in Level 2 and 3 products helped the monitoring authority (SGUDS) to reveal the slow deceleration of movement activity and overall stabilization of landslide body over past years for both Hradec and Velka Lehotka (example shown in Figure 10). SGUDS authority considers information provided by remotIO as good assessment indicator of remediation works already performed after critical situation in 2014 as well as for planning and good positioning of further such remediation works.
The example of displacement time-series that were recorded for series of alerts and alarms is shown in Figure 11. 'Displacement Alarm' triggered for the point inside Hradec village reveals movement of −11.0 mm with change detected in April 2019. Such points could indicate a need for field inspection, prevention activities or re-assessment of processing methodology for mitigation of systematic errors (such as local inclusion of non-linear model or adaptation of solution space).

Mining Subsidence (Upper Nitra, Kos Village)
For the area of active mining in Kos municipality, the information about current conditions of ground stability plays an important role in safety surveillance of the residential area. The monitoring authority (Hornonitrianske bane (HB) Prievidza Mining Company) actively surveys impacted part of the Kos village via monthly leveling measurements (Figure 15), where subsidence of up to -3 cm per year has already been confirmed by remotIO's products ( Figure 13). The inhabited zones close to rapidly subsiding belt are of highest importance for operational monitoring. Since those are commonly less coherent areas, our data-mining approach could retrieve valuable information for some of the critical zones ( Figure 12).
Lack of coherent points in the central part of the former Kos village is caused by significant geomorphological changes at the scale of several meters over past years which led to formation of sinkholes and wetlands and consequent demolition of buildings. Since results from the ascending sensing geometry (Figure 12a) exhibit total loss of a coherence over the affected area, retrieval of information from ascending track over critical zone is not possible. However, southeastern and northwestern parts of the village are still inhabited and are in focus of monitoring authority. 'Low-coherent Alerts' and 'Displacement Alarms' detected in the monitored area from descending track ( Figure 13) increased the overall densities of observed points and pin-pointed areas with potentially dangerous progression of displacement time series in current monitoring periods.
For example, low-coherent inlier (0.44) was retrieved inside area above mining corridor cut (Figure 14), exhibiting accelerating displacement behavior due to ongoing mining operations.
Monthly leveling measurements are realized under HBP authority over dedicated monitoring network in the precinct of Koš village since April 2016, which was determined by the new mining corridors in a close vicinity of inhabited parts of the village.
Ground-truth validation of remotIO Level 2 products is performed automatically via time series comparison at double-differenced level. For each leveling benchmark, corresponding nearest coherent PS point is identified and its LOS displacement time series are approximately projected into vertical using acquisition's local incidence angle for particular PS. Both leveling and projected PS displacement time series are transformed with respect to a common reference epoch (zero-epoch of leveling campaigns) and common reference point (reference leveling benchmark) to obtain displacement double-differences. An example of displacement time series comparison ( Figure 15) with simple linear regression shows significant correlation at three-sigma level. In addition, two-sample Kolmogorovov-Smirnovov test was used to confirm whether unequally sampled time-series follow same probability distribution.
The high temporal redundancy and dense spatial coverage of PS points is superior to leveling and hence valuable for monitoring authority.     Figure 15. Example of remotIO InSAR vs. leveling double-differenced displacement time series projected to vertical at selected leveling benchmarks.

Dams (Gabcikovo Waterworks)
The monitoring tasks over the first level of Gabcikovo waterworks-the Cunovo dam-focus on the main parts of the waterworks: power plant, chambers, levees and weir on both stages. These are monitored by semi-annual precise leveling measurements. Another task focuses on the problematic part of the dam, which corresponds to the original riverbed of the Danube River. It is under investigation due to persistent seasonal movements (e.g., thermal expansion due to temperature changes and dam oscillations related to the water level change) that are difficult to observe using standard geodetic methods or employing standard assumptions on linear movement trend and correspondent coherence loss. Those seasonal signals are examined by remotIO products. From both ascending and descending sensing geometries of Sentinel-1 over Cunovo dam ( Figure S3), good spatial coverage is attained for all structural parts of the waterworks body.
Such coverage is especially valuable in the areas lacking leveling ground control points, providing added-value for monitoring authority in critical parts of waterworks structure where realization of terrestrial measurements would not be possible. This is the case of retrieved low-coherent points ( Figure 16) with series of alerts and alarms (Table 6) triggered for the part of the dam over-passing former river bed with strong seasonal effects recorded in displacement time series, as shown in the example (Figure 17). Although those movements are expected for such type of construction and the monitoring authority is well aware of them, even slight changes in its amplitude might be an indicator of unexpected activity.
The temporal sampling of six days is far superior to semi-annual leveling field campaigns. The higher temporal sampling is mostly appreciated by monitoring authority representatives. It is clear from the results in Figure 18, showing leveling and InSAR equivalent double-differences (projected to vertical) comparison, that even Sentinel-1 C-band data can map fine seasonal displacement signals over dams body.
The major parts of second waterworks level-Gabcikovo dam-which are under vital monitoring efforts correspond to the main body, hydropower plant and two lock chambers for ships passage. The MT-InSAR results as provided by remotIO have spotted displacement over specific parts of the Gabcikovo dam, which is visualized for both Sentinel-1's sensing geometries (ascending/descending) in Figure S4. Beside the subsidence tendencies over the central part of the Gabcikovo dam corresponding to the area between locks and hydro power plant ( Figure S4), several more points over the dam locks have been preserved in final results utilizing data mining procedure. For the 'Displacement alarms' triggered over this part of the dam, the abnormal progression of movement with a total displacement of −7.3 mm (significantly smaller than 28 mm phase ambiguity) in January 2019 has been recorded in time series, as shown in Figure 19. Such points of interest identified by data mining analytics are further evaluated under cooperation with monitoring authority. This is also an example how the use of remotIO's added value information might help to focus the attention of experts for upcoming acquisitions and set the priority areas for deeper investigation.    The total number of points and additional issued warnings (alerts and alarms) preserved by data mining procedure in comparison to simple thresholding procedure (temporal coherence >0.7) is summarized in Table 6.

Discussion
remotIO concept represents a first stage of a complete MT-InSAR processing framework intended for monitoring of man-made objects and infrastructure mostly over urban areas. remotIO components have been developed to target zones with the highest displacement rates or displacement rate changes (e.g., landslides or undermining subsidence as presented in this manuscript) and to support structural health monitoring tasks (e.g., monitoring of buildings, bridges, dams or cities' districts). The system was scaled to provide analysis over small-to-medium sized regions (5 km × 5 km up to 80 km × 80 km) and support the full life cycle of providing MT-InSAR monitoring products: from initial processing of full available archives of Sentinel-1 over monitoring sites, through continuous automatic updates, post-processing methodology for filtering resultant deformation maps and generation of added value products, until the delivery of results via web-based platform where all outputs are accessible to end-users.
The core focus of the manuscript is in post-processing methodology for designing a set of higher-order products that can be utilized by both expert and non-expert users. The information stored over problematic points should be perceived as indicative, since retrieval of reliable deformation estimates is not possible without appropriate re-assessment of the processing methodology and internal cross-validation. To partially address this issue, a new parallel processing is started immediately for problematic zones with images acquired in a span of more than 20 days (to acquire precise orbits) and with selected monthly, quarterly or semi-annual updates to avoid possible false alarms. Since the alpha version of the remotIO system (TRL 4) has been introduced to users over pilot monitoring sites (Section 5) only recently (end of April 2020), the false alarm rate is subject to further research and validation with ground truth observations.
The proposed filtering approach was tested on standard MT-InSAR processing employing linear displacement model. As the current application use cases are focused on monitoring man-made structures for the continuous periods, distributed scatterers are not incorporated into remotIO framework, although they can be included if proven necessary by insufficient spatial coverage.
It is important to note that, without proper refinement of MT-InSAR processing strategy and mitigation of systematic errors, inconsistencies to final results might be added. Pin-pointed problematic zones, especially over low-coherent areas, shall be further analyzed in detail, for example by applying different mathematical models for deformation estimates [25,68,69] or individual scatterer models for extracted low-coherent points [18,70,71], by exploiting partially coherent targets [72][73][74] or utilization of high-resolution X-band data (TerraSAR-X, Cosmo-SkyMed) might be required for more detailed structural analysis and precise engineering tasks. The system is scalable and fully open for integration of other satellite SAR data and new study areas. Additional work shall be focused on effective dissemination of actual alerts and false positives caused by processing model inconsistencies (i.e., unwrapping errors). Based on triggered alerts and alarms, expert InSAR users can advise or alone modify processing methodology before providing final statements and conclusions. This is also an example of how both expert and non-expert conceptualizations of operational InSAR service could be met in a single monitoring system. Similar studies addressing operational exploitation of Sentinel-1 measurements can be found in [27][28][29][30][31][32][33][34]. These studies are additionally oriented on nationwide monitoring schemes, where we foresee the possibility of joint exploitation of targeted analysis over small-to-medium sized regions (remotIO) with state-wide monitoring initiatives in Slovakia [75] and Europe [76].

Conclusions
In this work, an alpha version of an infrastructure monitoring and decision-support framework remotIO is presented. It provides regularly and automatically updated MT-InSAR analysis in three product tiers ( Table 2). The third-level added-value product comprises post-processing procedure based on MCD outlier detection, space-partitioning and multivariate analysis, which allows the use of a full range of temporal coherence to filter the resultant datasets of outliers hence avoiding simple thresholding.
The use of data mining analytics on the basis of standard MT-InSAR products helps non-experts users: (a) to ease the interpretation of results; (b) to set the priority areas for deeper investigation in terms of systematic errors mitigation and correction of the processing procedures; and (c) to activate prevention measures or suitable risk mitigation activities when real motions are confirmed.
remotIO's platform and analytical tools are integrated into the web-app graphical user interface, made accessible from [77].
The preliminary results over three pilot monitoring scenarios ((1) Landslides; (2) Mining subsidence; and (3) Dams) demonstrate the ability of a prototype to filter final deformation maps and improve point densities over zones with the highest displacement rates and low-coherent areas. By triggering a series of alerts and alarms over monitored sites, the system provides continuous information over areas where deformation processes are progressing also in recent monitoring periods. These areas are further investigated under cooperation with end-users that are involved in remotIO's pilot monitoring scenarios. The initial end-users feedback on platform's practical exploitation is positive.
Finally, existing ground truth data from conventional terrestrial geodetic (leveling and GNSS) and geotechnical (inclinometry) techniques carried out under authority of end-users of particular pilot monitoring scenarios shall be further utilized for validation of data-mining procedure outputs, especially over low-coherent zones.  The view expressed herein can in no way be taken to reflect the official opinion of the European Space Agency.
Acknowledgments: Sentinel-1 data were provided by ESA under free, full, and open data policy adopted for the Copernicus program. Data were processed by SARPROZ using Matlab and Google Maps. The authors would like to thank State Geological Institute of Dionyz Stur and Hornonitrianske bane Prievidza, a.s. and Vodohodpodarska vystavba, s.p. (section TBD) for providing ground truth data, discussion and collaboration. This work was supported by the VEGA grant agency under project 2/0100/20.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript: