1. Introduction
Content-Based Image Retrieval (CBIR) is an important research area in computer vision and information retrieval. It enables users to search for images based on their visual content rather than relying on text-based descriptions or metadata. CBIR systems use features extracted from images to compute the similarity between a reference image and other samples in a database. In the case of SAR images, CBIR can be challenging due to the unique properties of the data, such as the speckle noise, number of looks and spatial resolution to name a few. In the literature, there are different techniques to perform CBIR using SAR images [
1,
2,
3,
4,
5,
6,
7].
SAR systems offer advantages as they are not dependent on solar light and atmospheric conditions [
8,
9]. However, analyzing SAR images is difficult due to the multiplicative granulation caused by speckle, which results from the interference of electromagnetic waves. Although filters can be applied to reduce this effect, it may come at the cost of reduced image quality. Therefore, statistical modeling techniques are used to process and interpret these images. Here, the
distribution was adopted to model intensity SAR data [
10,
11,
12] and to develop the CBIR-SAR.
In general, the CBIR systems available in the literature use similarity measures that do not consider that image regions can be discriminated using the estimated parameters of the image model. In fact, it is possible to understand an image as a set of regions that can be described by different probability laws [
13]. Thus, it is possible to use stochastic distances to statistically discriminate distributions for SAR image retrieval purposes.
Thus, the aim of this paper is to advance the state-of-the-art on the use of a stochastic distance to overcome the limitation of the classic CBIR to deal with SAR images due to the speckle noise. To achieve this goal, the triangular stochastic distance measures the similarity between regions with different roughness levels or textures based on the parameters of the distribution to develop the CBIR-SAR.
To estimate the parameters of the
model, the Maximum Likelihood Estimation (
) [
14] and the Fast Log-Cumulants Method (
) [
12] are applied to the intensity images. Additionally, a stochastic similarity measure uses the estimated parameters [
13,
15], which are the SAR image features, to distinguish image regions. After obtaining the distances, they are arranged in the ascending order and generate a ranking list of image regions. Then, the performance of the information retrieval system is evaluated by using the Mean Average Precision (MAP) [
16].
The main contributions of this work are as follows: (a) development of a reproducible CBIR system based on the intensity SAR data model that evaluates two estimation parameter methods as image feature extractors; and (b) establishment of a knowledge-based information retrieval inspired by stochastic distances and data modeling extensible to other statistical models and stochastic distances.
This paper unfolds as follows:
Section 2 describes the fundamental background required to develop the proposed approach.
Section 3 introduces a system for content-based SAR image retrieval using a stochastic distance.
Section 4 shows the results and discussions, and
Section 5 summarizes our findings and conclusions.
2. Background on SAR Data Modeling
This section presents the statistical model for intensity SAR images, i.e., the distribution, the parameter estimation methods, and the stochastic distance.
2.1. Statistical Model for Intensity SAR Data
Here, the
distribution was adopted to model SAR data in intensity. The probability density function of the
is defined as [
10]:
with
. The parameters
and
correspond to the roughness and scale, respectively,
is the gamma function defined as [
17]:
with
and the
L parameter is known as the number of looks, which represents the speckle control parameter. The influence of the parameters
in the shape of the
distribution can be seen in
Figure 1.
Figure 1a shows the effect of varying the parameter
for the same
and
. Notice that when the value of
decreases, the probability density function tends to be more homogeneous, representing image regions that are more homogeneous.
Figure 1b shows the effect of varying the parameter
for the same
and
. In [
18], the
parameter is related to the brightness of the scene.
Finally,
Figure 1c shows the effect of the number of looks, i.e., varying the parameter
L for the same
and
. It also exhibits that the number of looks has an effect mostly on the distribution of very small values. Notice that when
L decreases, the
distribution assigns more probability to small values of the return
z, yielding less contrasted images [
19] and consequently more challenging images.
The great advantage of using the
distribution is that it can accurately model homogeneous, heterogeneous and extremely heterogeneous regions of an intensity SAR image [
10]. This flexibility has motivated some important applications, such as SAR image segmentation driven by the
Shannon entropy [
20] and autoregressive moving average process for speckled data [
21], among others.
2.2. Parameter Estimation for the Distribution
To estimate the parameters of the
distribution [
10], two methods were selected: the Maximum Likelihood Estimation [
13] and the Fast Log-Cumulants Method [
12]. Here, the number of looks,
L, is considered known. The
L parameter is generally an integer provided by the SAR sensor, and in this paper, it is a priori information. However, in the absence of it, the number of looks can be estimated from real SAR data, and it is therefore interesting to call the equivalent number of looks [
22].
2.2.1. Maximum Likelihood Estimation ()
Consider a random variable
Z with probability density function
, where
is the vector of parameters to be estimated and (
) is a random sample of size
n from the variable
Z. The likelihood function
is defined by [
14]:
which is interpreted as a function of
. The maximum likelihood estimator of
will be the one that maximizes
.
For the
distribution, the likelihood function is given by [
13]:
The values of
and
that maximize Equation (
4) can be obtained by finding the solution of the following system of nonlinear equations.
where
is the digamma function [
17,
23] and
and
are the maximum likelihood estimators for
and
, respectively. Given that there are no feasible inverse functions for
, the nonlinear system in Equation (
5) cannot be solved explicitly, and therefore, it requires iterative procedures. In fact, it is a difficult task, in particular in textureless areas [
19].
In order to solve this problem here, we applied a numerical routine based on the Broyden–Fletcher–Goldfarb–Shanno algorithm (
), which was implemented in R software through the
maxLik function. A good solution consists of using estimates obtained with the Method of Moments [
10] as the initial guess for the
algorithm. The algorithm
starts at initial estimates for the optimal values of
and
and proceeds iteratively to search for better estimates at each stage.
2.2.2. Fast Log-Cumulants Method ()
This method presents analytical expressions for estimating the roughness
and scale
parameters of the
distribution. Using
, the roughness parameter can be estimated by the following expression [
12]:
where
stands for the modulus operator,
is the trigamma function [
17] and
is the log-cumulant of order 2. Usually,
must be replaced by the corresponding sample log-moment of order
and
2, respectively, which can be calculated as [
24]:
with
,
being a sample of a random variable
Z. After obtaining
, the
parameter can be estimated by:
where
and
is the digamma function [
17].
2.3. Stochastic Distance
Contrast analysis often addresses the problem of quantifying how distinguishable two image regions are from each other [
13], and a distance metric plays a crucial role to perform it. Similarly, CBIR systems rely on image content similarity and require distance metrics. Due to the statistical properties of SAR data, CBIR systems require suitable distances. In fact, stochastic distances are relevant tools for SAR image analysis and understanding, since they are capable of assessing differences between regions in a scene.
In this paper, the triangular distance was selected due to its favorable properties, such as not requiring an extreme computational effort and being effective for both extremely heterogeneous and homogeneous regions in SAR images [
25]. Similarly to [
13], we deal with distances between the same distributions, and thus, only their parameters are relevant. The triangular distance is described as:
where
and
are random variables defined in the same probability space, with probability density functions
and
, respectively, where
and
are arrays of parameters.
3. Proposed Methodology
The proposed CBIR method is based on the knowledge that comes from the statistical model of real SAR images. It uses intensity images acquired from synthetic aperture radar. The database consists of 25 clipping of images of each region (e.g., water, oil spill, forest) from UAVSAR and 25 images of each region (e.g., water, urban area, and forest) from OrbiSAR-2 and ALOS PALSAR. The Maximum Likelihood Estimation and the fast approach of the Log-Cumulative Method are the methods that estimate the roughness and scale parameters for each image region. The number of looks provided by the SAR sensors is equal to 1.
Table 1 presents this radar information. The reason for dealing with single-look images is that they are markedly affected by speckle and pose potential challenges for the algorithm. Thus, all tests are performed with single-look images.
Figure 2 depicts the overall pipeline of the proposed CBIR. It shows that for the SAR image database, the parameters of the
distribution for each image region are estimated using
and
. The main reason for using these parameter estimation methods is to assess the performance of the proposed algorithm on each one.
The estimates and the similarity measure are inputs to CBIR-SAR to compute the triangular stochastic distance between each query image and the other samples. Finally, the system outputs distances which provide a ranking list of the retrieved samples of water, oil spill, forest and urban area. Then, the MAP measure assesses the results. The higher the MAP value is, the better the performance of CBIR-SAR.
3.1. SAR Database
The real SAR images were acquired from three different radars: single-look images from the airborne Uninhabited Aerial Vehicle Synthetic Aperture Radar (UAVSAR) from the National Aeronautics and Space Administration (NASA); OrbiSAR-2 from BRADAR; and ALOS Phased Array type L-band Synthetic Aperture Radar (ALOS PALSAR) from the National Aeronautics and Space Administration (NASA). To create the database, each image was manually cut and categorized into three regions according to the coverage area. The set of samples comprises 25 patches of 100 × 100 pixels for each region of each radar polarization channel, i.e., HH, HV, VH and VV. The VH channel of ALOS PALSAR was unavailable.
Figure 3 exhibits samples of patches, where
correspond to water,
are forest regions and
are urban areas.
Table 1 presents important physical information about the three radars that acquired the real SAR images.
3.2. Parameter Estimation
In this paper, the and methods estimate the roughness and scale parameters of the model for each image sample. In terms of CBIR-SAR, the parameter estimation extracts statistical information of images, and it corresponds to the feature extraction step of a CBIR system. Finally, the parameter estimates are inputs to the triangular stochastic distance. Given an image as an input query, CBIR-SAR calculates the distance between it and all images from the database.
3.3. CBIR-SAR
CBIR systems search for similar images in a database by assessing the similarity between the reference image and the other images in the database [
26]. It uses visual image features to compute distances and arrange them in ascending order and generate a ranking list of images. Here, the visual features are the roughness (
) and scale (
) parameters of the
model.
The proposed CBIR based on a stochastic approach recognizes and retrieves SAR image regions such as water, forest, urban areas, and oil spills, taking into account the model and the stochastic distance between same distributions, which is possibly indexed by different parameters. Then, CBIR-SAR performs image retrieval matching samples by statistical similarity.
3.4. Mean Average Precision (MAP)
For performance evaluation, the Mean Average Precision (MAP) [
16] is used, as:
where
is the precision until the position
i of the ranking, and
is equal to 1 if the image
i of the ranking
i belongs to the same class as the query image
Q and 0 otherwise.
is the number of images in the ranking, and
S is the number of images of the same class obtained by the query image. The MAP is obtained by the average AP for all images of the ranking within the range
and higher values indicate better performance.
4. Results and Discussion
This section presents the results and discussions based on the experiments carried out with CBIR-SAR applied to synthetic and real SAR images acquired from three different SAR sensors. The statistical computational environment used to carry out the experiments was the R language [
27], version 4.1.2, on a machine with a CPU of up to 4.20 GHz and 16 GB of RAM.
4.1. Experiments with Synthetic SAR Images
Inspired by [
22], the intensity SAR images were simulated using:
where
S represents the simulated image,
is the inverse function of the
F-Snedecor distribution, with
and
being the degrees of freedom and
U is a random variable with uniform distribution over the interval
. To generate SAR images from Equation (
11), the following steps are required:
- (i)
Define the parameters .
- (ii)
Generate U, with desirable size, from the uniform distribution over the interval .
- (iii)
Put
U in Equation (
11). The result will be a synthetic intensity
image, with parameters defined in step (i).
Three different sets of intensity SAR images
were simulated using Equation (
11), where each set comprises 25 image samples of 100 × 100 pixels. Our simulations considered the
parameter values referring to extremely heterogeneous regions (urban areas) in the interval
for
as well as to heterogeneous regions (forest areas). The values are in the interval
for
and homogeneous regions (water, oil) are in the interval
for
. With regard to the other parameters, the
values were adopted in the interval
∈ (1, 7) and
L = 1. After generating the synthetic SAR image database, it we applied the CBIR methodology described in
Section 3.
Table 2 presents the results of the experiments. It shows the MAP values using both estimators and three regions with different roughness levels.
The results show the relevance of the texture degree and the parameter estimation to obtain MAP values close to 1.0. The higher the degree of texture, the better the performance of CBIR-SAR. obtained better results than because the latter encountered a numerical convergence problem in the process. However, MAP reached values above 0.81 for all scenarios.
4.2. Experiments with Real SAR Images
In this study, the proposed approach was evaluated using three real images from the SAR database, and 25 clippings of the respective regions of size 100 × 100 were created from each original image, resulting in a total of 75 images for each database. The roughness and scale parameters were estimated using and . Then, we calculatedthe ranking list based on the obtained stochastic distances and the MAP measure value for each image region.
Table 3 presents the results for the different sensors, polarization channels, and bands. The L-band UAVSAR radar database showed promising results using both estimators and all polarization channels. The method efficiently identified homogeneous regions with consistent results, especially for the VV polarization. This polarization provides better contrast between the oil slick and the ocean, which indicates that it is a suitable channel to detect roughness and brightness variations from the sea surface, such as those caused by oil spills.
The experiments with OrbiSAR-2 achieved the highest MAP values for urban areas for all channels. On the other hand, MAP values around 0.5 indicate that CBIR-SAR incorrectly considered water and forest samples with similar texture. Both and provided good estimates and hence MAP values above 0.88 for water samples in VV. The HH channel achieved the best result for forest.
The results with ALOS PALSAR for HH and HV were superior to VV, mostly in extremely heterogeneous regions. On the other hand, VV presented the best results for water. Overall, CBIR-SAR performed better in homogeneous regions for both L-band sensors. The worst results for VV refer to heterogeneous regions, i.e., forest.
5. Conclusions
The proposed method is based on the statistical data modeling and stochastic distance to perform CBIR on single-look SAR images. Single-look SAR images are more challenging due to the marked presence of the speckle noise. The distribution models the SAR intensity data, and its parameters are estimated by two estimation methods, i.e., the Maximum Likelihood Estimation and the Fast Log-Cumulant Method. The input features of the CBIR system comprise the estimated parameters of the image region, and the triangular distance compares the similarity between image regions. These estimated parameters represent the roughness and mean brightness features which discriminate image regions. In fact, the stochastic distance is highly dependent on the estimation quality of the parameters. Both estimation methods were tested: due to its good properties and due to its estimation speed.
Our experiments demonstrated the effectiveness of the proposed method with synthetic and real SAR images. For both estimators, CBIR-SAR achieved the highest MAP values for extremely heterogeneous regions. These results confirmed that the proposed method is able to retrieve regions of similar texture based on the roughness parameter and hence, it is highly dependent on the region texture. However, it accomplished satisfactory results for homogeneous regions, and it was capable of revealing subtle texture differences between water and oil spills. Both synthetic and real SAR data results support that the proposed CBIR-SAR provides an effective solution to overcome the challenges posed by speckle for SAR image retrieval. We also demonstrated the role of the statistical data model, estimation methods and stochastic distance in the development of an expert system that can organize and query SAR image regions more efficiently. Tests with other stochastic distances and data may extend the method for other scenarios and applications, such as change detection and classification.
Among the advantages of the proposed methodology, we can list the following. CBIR-SAR is able to retrieve different image regions based on the parameters of the model, which are image features embodied in the stochastic triangular distance. In addition, our approach deals with raw data and hence it does not require a preprocessing step due to the speckle noise. Lastly, the methodology presented promising results on both parameter estimation methods. Moreover, CBIR-SAR results may also drive other image processing techniques, such as SAR image segmentation and classification. A disadvantage of this methodology is that the estimation method requires the use of numerical techniques.
Author Contributions
Conceptualization: A.D.S. and F.N.S.M.; Methodology: A.D.S., P.H.d.S.S., R.R.V.S., F.A.À.R. and F.N.S.M.; Formal analysis and investigation: A.D.S. and F.N.S.M.; Writing: A.D.S., P.H.d.S.S., R.R.V.S., F.A.À.R. and F.N.S.M.; Supervision: F.N.S.M. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Fundação de Amparo à Pesquisa do Estado do Piauí (FAPEPI) grant number Edital 001/2022.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
No new data were created or analyzed in this study. Data sharing is not applicable to this article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Krishna, G.S.; Prakash, N. Deep learning for efficient and multi-labelled classification of synthetic aperture radar images. Evol. Syst. 2021, 12, 741–754. [Google Scholar] [CrossRef]
- Alshehri, M. A content-based image retrieval method using neural network-based prediction technique. Arab. J. Sci. Eng. 2020, 45, 2957–2973. [Google Scholar] [CrossRef]
- Zhang, K.; Li, B.; Tao, R. SAR image retrieval based-on fly algorithm. In Proceedings of the Tenth International Conference on Advanced Computational Intelligence (ICACI), Xiamen, China, 29–31 March 2018; pp. 502–507. [Google Scholar] [CrossRef]
- Tang, X.; Jiao, L.; Emery, W.J. SAR Image Content Retrieval Based on Fuzzy Similarity and Relevance Feedback. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 1824–1842. [Google Scholar] [CrossRef]
- Jiao, L.; Tang, X.; Hou, B.; Wang, S. SAR Images Retrieval Based on Semantic Classification and Region-Based Similarity Measure for Earth Observation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 3876–3891. [Google Scholar] [CrossRef]
- Tang, X.; Jiao, L. Fusion Similarity-Based Reranking for SAR Image Retrieval. IEEE Geosci. Remote Sens. Lett. 2017, 14, 242–246. [Google Scholar] [CrossRef]
- Schroder, M.; Rehrauer, H.; Seidel, K.; Datcu, M. Interactive learning and probabilistic retrieval in remote sensing image archives. IEEE Trans. Geosci. Remote Sens. 2000, 38, 2288–2298. [Google Scholar] [CrossRef]
- Argenti, F.; Lapini, A.; Bianchi, T.; Alparone, L. A tutorial on speckle reduction in synthetic aperture radar images. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–35. [Google Scholar] [CrossRef] [Green Version]
- Nobre, R.H.; Rodrigues, F.A.A.; Marques, R.C.P.; Nobre, J.S.; Neto, J.F.S.R.; Medeiros, F.N.S. SAR Image Segmentation with Renyi’s Entropy. IEEE Signal Process. Lett. 2016, 23, 1551–1555. [Google Scholar] [CrossRef]
- Frery, A.C.; Muller, H.J.; Yanasse, C.C.F.; Sant’Anna, S.J.S. A model for extremely heterogeneous clutter. IEEE Trans. Geosci. Remote Sens. 1997, 35, 648–659. [Google Scholar] [CrossRef]
- Gambini, J.; Cassetti, J.; Lucini, M.M.; Frery, A.C. Parameter Estimation in SAR Imagery Using Stochastic Distances and Asymmetric Kernels. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 365–375. [Google Scholar] [CrossRef] [Green Version]
- Rodrigues, F.A.A.; Nobre, J.S.; Vigélis, R.; Liesenberg, V.; Marques, R.C.P.; Medeiros, F.N.S. A Fast Approach for the Log-Cumulants Method Applied to Intensity SAR Image Processing. In Proceedings of the 2020 IEEE Latin American GRSS and ISPRS Remote Sensing Conference (LAGIRS), Santiago, Chile, 22–26 March 2020; pp. 494–498. [Google Scholar] [CrossRef]
- Nascimento, A.D.C.; Cintra, R.J.; Frery, A.C. Hypothesis testing in speckled data with stochastic distances. IEEE Trans. Geosci. Remote Sens. 2010, 48, 373–385. [Google Scholar] [CrossRef]
- Casella, G.; Berger, R.L. Statistical Inference; Cengage Learning: Boston, MA, USA, 2021. [Google Scholar]
- Braga, I.H.T.; Sacramento, V.P.; Oliveira, L.C.C.; Medeiros, F.N.S.; Rodrigues, F.A.Á. Ocean surface change detection from remote sensing image based on stochastic similarity measure. Braz. J. Water Resour. 2022, 27. [Google Scholar] [CrossRef]
- Wang, B.; Brown, D.; Gao, Y.; La Salle, J. MARCH: Multiscale-arch-height description for mobile retrieval of leaf images. Inf. Sci. 2015, 302, 132–148. [Google Scholar] [CrossRef]
- Arfken, G.B.; Weber, H.J. Mathematical Methods for Physicists; AAPT: College Park, MD, USA, 1999. [Google Scholar]
- Bustos, O.H.; Lucini, M.M.; Frery, A.C. M-Estimators of Roughness and Scale for -Modelled SAR Imagery. EURASIP J. Appl. Signal Process. 2002, 1, 105–114. [Google Scholar]
- Frery, A.C.; Wu, J.; Gomez, L. Sar Image Analysis—A Computational Statistics Approach; Wiley: Hoboken, NJ, USA, 2022. [Google Scholar]
- Ferreira, J.A.; Nascimento, A.D.C. Shannon Entropy for the Model: A New Segmentation Approach. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 2547–2553. [Google Scholar] [CrossRef]
- Junior, P.M.A.; Nascimento, A.D.C. ARMA process for speckled data. J. Stat. Comput. Simul. 2021, 91, 3125–3153. [Google Scholar] [CrossRef]
- Marques, R.C.P.; Medeiros, F.N.; Nobre, J.S. SAR image segmentation based on level set approach and model. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 2046–2057. [Google Scholar] [CrossRef]
- Abramowitz, M.; Stegun, I.A. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, 9th ed.; Dover: New York, NY, USA, 1964. [Google Scholar]
- Nicolas, J.M. Introduction aux statistiques de deuxième espèce: Applications des logs-moments et des logs-cumulants à l’analyse des lois d’images radar. TS. Trait. Signal 2002, 19, 139–167. [Google Scholar]
- Rey, A.; Revollo Sarmiento, N.; Frery, A.C.; Delrieux, C. Automatic Delineation of Water Bodies in SAR Images with a Novel Stochastic Distance Approach. Remote Sens. 2022, 14, 5716. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, D.; Lu, G.; Ma, W.Y. A survey of content-based image retrieval with high-level semantics. Pattern Recognit. 2007, 40, 262–282. [Google Scholar] [CrossRef]
- R Core Team. R: A language and environment for statistical computing. MSOR Connect. 2014, 1. [Google Scholar]
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).