A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms
Abstract
:1. Introduction
2. Cloud-Computing-Based CBIR Framework in SaaS Mode for Hyperspectral Remote Imagery Repository
2.1. Cloud-Computing-Based CBIR Architecture in SaaS Mode
2.2. Storage Method for Hyperspectral Remote Images
2.3. Method of the CBIR System
- Image input. In this step, an image to be retrieve is uploaded to HDFS.
- Endmember extraction. The system extracts the image’s endmember information automatically.
- Signature comparison. For all spectral signatures in the spectral library, the system calculates the SAD with all the endmembers extracted in step 2 and identities the extracted endmembers according to SAD scores.
- Abundance filtration. One or many endmembers interested in are chosen and minimum abundance filters for each endmember are defined. In this case, the image is retrieved only if the matched endmember contains a higher total abundance in the scene than the predefined minimum abundance threshold.
- Results display and manual selection. The retrieved hyperspectral remote images are showed in table form in the web for users to select.
3. Hyperspectral Imagery Distributed Retrieval
3.1. Parallel PPI
- Generate a set of Y random skewers where denotes a random vector.
- Iterate Y times, for each , we project all the pixel vectors onto this to record those sample vectors that are at its extreme positions denoted as and . Let .
- Remove those pixels with low frequency, and spectral angle distance (SAD) is used to calculate the similarity between any two pixels in to remove those similar pixels. Finally, the all remaining pixels in are endmembers M.
- We divide the original hyperspectral remote sensing image into m partitions and store them on HDFS in a distributed way automatically.
- We read the hyperspectral dataset from HDFS to ByteRdd as a stream of bytes using built-in newAPIHadoopFile() method. After that, ByteRdd is converted into DataRDD according to the format of remote sensing image. It is noteworthy that the spectral data in DataRDD is complete spectral.
- We generate Y random skewers in the driver and broadcast them to all executors through resource manager. Thus, the data in each partition shares the same skewers.
- The following steps are similar to the steps of the traditional PPI algorithm. Perform the map operation on DataRDD and, in each executor, project the pixel onto each skewer to find pixels that are at its extreme positions to form a set denoted by , where and denote the maximal and minimal projections on every skewer, and and denote their corresponding positions in the entire HSI, respectively.
- The reduce operation is conducted to submit all in each partition to driver and compute the maximal and minimal projects on every skewer over again. The result will be cached in driver, denoted by .
- Count pixel purity index from denoted by , including and filter a part of endmembers in , which is similar through SAD algorithm. Final, find the sample vectors M which are repeated most often.
3.2. Parallel SAD Implemented
- First, the spectral library is read from the MySQL server to a LibRDD instance which is divided into n partitions. Every partition contains a list of elements which are tuples, including name, wavelength, and reflectance. It is obvious that R is always much smaller than N, so we broadcast to computing nodes through Spark broadcast mechanism.
- Perform the map operation on . The map operation is in charge of calculating the SAD value between in each partition and in broadcast. In order to solve the problem that the wavelengths of the results are different from the wavelengths of the spectral library, we take wavelengths in the same range and align the spectrum by linear interpolation.
- Conduct the reduce operation on to filter out the minimum SAD value for each spectral vector and identify the spectrum type.
3.3. Parallel Sum-to-One Constrained Least Squares (PSCLS) Abundance Inversion
4. Experiments
4.1. Experiment Setup
4.2. Evaluation of Parallel Unmixing Accuracy
4.3. Computational Performance Evaluation
5. Discussion
5.1. Method Advantage Analysis
5.2. Limitation Analysis
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Bioucas-Dias, J.M.; Plaza, A.; Camps-Valls, G.; Scheunders, P.; Nasrabadi, N.; Chanussot, J. Hyperspectral remote sensing data analysis and future challenges. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–36. [Google Scholar] [CrossRef] [Green Version]
- Kruse, F.A.; Boardman, J.W.; Huntington, J.F. Comparison of airborne hyperspectral data and EO-1 Hyperion for mineral mapping. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1388–1400. [Google Scholar] [CrossRef] [Green Version]
- Liu, X.; Gao, Z.; Ning, J.; Yu, X.; Zhang, Y. An improved method for mapping tidal flats based on remote sensing waterlines: A case study in the Bohai Rim, China. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 5123–5129. [Google Scholar] [CrossRef]
- Zhao, J.; Xu, T.; Xiao, J.; Liu, S.; Mao, K.; Song, L.; Yao, Y.; He, X.; Feng, H. Responses of water use efficiency to drought in southwest China. Remote Sens. 2020, 12, 199. [Google Scholar] [CrossRef] [Green Version]
- Cao, L.; Chen, X.; Zhang, C.; Kurban, A.; Qian, J.; Pan, T.; Yin, Z.; Qin, X.; Ochege, F.U.; Maeyer, P.D. The global spatiotemporal distribution of the mid-tropospheric CO2 concentration and analysis of the controlling factors. Remote Sens. 2019, 11, 94. [Google Scholar] [CrossRef] [Green Version]
- Kupilik, M.; Witmer, F.D.; MacLeod, E.; Wang, C.; Ravens, T. Gaussian process regression for arctic coastal erosion forecasting. IEEE Trans. Geosci. Remote Sens. 2018, 57, 1256–1264. [Google Scholar] [CrossRef]
- Cao, R.; Feng, Y.; Liu, X.; Shen, M.; Zhou, J. Uncertainty of vegetation green-up date estimated from vegetation indices due to snowmelt at northern middle and high latitudes. Remote Sens. 2020, 12, 190. [Google Scholar] [CrossRef] [Green Version]
- Zhang, Z.; Shao, Y.; Tian, W.; Wei, Q.; Zhang, Y.; Zhang, Q. Application potential of GF-4 images for dynamic ship monitoring. IEEE Geosci. Remote Sens. Lett. 2017, 14, 911–915. [Google Scholar] [CrossRef]
- Arias, L.; Cifuentes, J.; Marín, M.; Castillo, F.; Garcés, H. Hyperspectral imaging retrieval using MODIS satellite sensors applied to volcanic ash clouds monitoring. Remote Sens. 2019, 11, 1393. [Google Scholar] [CrossRef] [Green Version]
- Yu, W.; Ma, M. Scale mismatch between in situ and remote sensing observations of land surface temperature: Implications for the validation of remote sensing LST products. IEEE Geosci. Remote Sens. Lett. 2014, 12, 497–501. [Google Scholar]
- Ke, C. Military object detection using multiple information extracted from hyperspectral imagery. In Proceedings of the IEEE International Conference on Progress in Informatics and Computing, Nanjing, China, 27–29 October 2017; pp. 124–128. [Google Scholar]
- Rathore, M.M.U.; Paul, A.; Ahmad, A.; Chen, B.; Huang, B.; Ji, W. Real-time big data analytical architecture for remote sensing application. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 4610–4621. [Google Scholar] [CrossRef]
- Arrow, K.; Solow, R.; Portney, P.R.; Leamer, E.E.; Radner, R.; Schuman, H. Report of the NOAA panel on contingent valuation. Fed. Regist. 1993, 58, 4601–4614. [Google Scholar]
- Justice, C.; Giglio, L.; Korontzi, S.; Owens, J.; Morisette, J.T.; Roy, D.; Descloitres, J.; Alleaume, S.; Petitcolin, F.; Kaufman, Y. The MODIS fire products. Remote Sens. Environ. 2002, 83, 244–262. [Google Scholar] [CrossRef]
- Sevilla, J.; Plaza, A. A new digital repository for hyperspectral imagery with unmixing-based retrieval functionality implemented on GPUs. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 2267–2280. [Google Scholar] [CrossRef]
- Yu, H.; Gao, L.; Liao, W.; Zhang, B.; Aleksandra, P.; Wilfried, P. Multiscale superpixel-level subspace-based support vector machines for hyperspectral image classification. IEEE Geosci. Remote Sens. Lett. 2017, 14, 2142–2146. [Google Scholar] [CrossRef]
- Luo, W.; Gao, L.; Zhang, R.; Andrea, M.; Zhang, B. Bilinear normal mixing model for spectral unmixing. IET Image Processing 2019, 13, 344–354. [Google Scholar] [CrossRef]
- Wu, Z.; Shi, L.; Li, J.; Wang, Q.; Sun, L.; Wei, Z.; Plaza, J.; Plaza, A. GPU parallel implementation of spatially adaptive hyperspectral image classification. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 4, 1–13. [Google Scholar] [CrossRef]
- Zheng, X.; Xue, Y.; Guang, J.; Liu, J. Remote sensing data processing acceleration based on multi-core processors. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Beijing, China, 10–15 July 2017; pp. 641–644. [Google Scholar]
- Rabe, B.R.; Clifford, M.; Miles, N. Storage Area Network (SAN) Management System for Discovering SAN Components Using a SAN Management Server. U.S. Patent 7194538B1, 20 March 2007. [Google Scholar]
- Chen, P.M.; Lee, E.K.; Gibson, G.A.; Katz, R.H.; Patterson, D.A. RAID: High-performance, reliable secondary storage. ACM Comput. Surv. 1994, 26, 145–185. [Google Scholar] [CrossRef] [Green Version]
- Gibson, G.A.; van Meter, R. Network attached storage architecture. Commun. ACM 2000, 43, 37–45. [Google Scholar] [CrossRef]
- Zheng, P.; Wu, Z.; Zhang, W.; Li, M.; Yang, J.; Zhang, Y.; Wei, Z. An unmixing-based content retrieval method for hyperspectral imagery repository on cloud computing platform. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 23–27 July 2018; pp. 3583–3586. [Google Scholar]
- Zinno, I.; Elefante, S.; Mossucca, L.; De Luca, C.; Manunta, M.; Terzo, O.; Lanari, R.; Casu, F. A first assessment of the P-SBAS DInSAR algorithm performances within a cloud computing environment. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2015, 8, 4675–4686. [Google Scholar] [CrossRef]
- Wu, Z.; Li, Y.; Plaza, A.; Li, J.; Xiao, F.; Wei, Z. Parallel and distributed dimensionality reduction of hyperspectral data on cloud computing architectures. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 2270–2278. [Google Scholar] [CrossRef]
- Cafaro, M.; Giovanni, A. Grids, clouds, and virtualization. In Grids, Clouds and Virtualization; Springer: London, UK, 2011; pp. 1–21. [Google Scholar]
- Zaharia, M.; Xin, R.S.; Wendell, P.; Das, T.; Armbrust, M.; Dave, A.; Meng, X.; Rosen, J.; Venkataraman, S.; Franklin, M.J.; et al. Apache spark: A unified engine for big data processing. Commun. ACM 2016, 59, 56–65. [Google Scholar] [CrossRef]
- Shvachko, K.; Kuang, H.; Radia, S.; Chansler, R. The hadoop distributed file system. In Proceedings of the IEEE Symposium on Mass Storage Systems and Technologies, Lake Tahoe, NV, USA, 3–7 May 2010; pp. 1–10. [Google Scholar]
- Kruse, F.A.; Lefkoff, A.B.; Boardman, J.W.; Heidebrecht, K.B.; Shapiro, A.T.; Barloon, P.J.; Goetz, A. The Spectral Image Processing System (SIPS): Software for integrated analysis of AVIRIS data. In Proceedings of the Third Annual JPL Airborne Geoscience Workshop, Pasadena, CA, USA, 1–5 June 1992. [Google Scholar]
- Molinier, M.; Laaksonen, J.; Hame, T. Detecting man-made structures and changes in satellite imagery with a content-based information retrieval system built on self-organizing maps. IEEE Trans. Geosci. Remote Sens. 2007, 45, 861–874. [Google Scholar] [CrossRef] [Green Version]
- Veganzones, M.A.; Grana, M. A spectral/spatial CBIR system for hyperspectral images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2012, 5, 488–500. [Google Scholar] [CrossRef]
- Chang, C.I.; Plaza, A. A fast iterative algorithm for implementation of pixel purity index. IEEE Geosci. Remote Sens. Lett. 2006, 3, 63–67. [Google Scholar] [CrossRef]
- Chaudhry, F.; Wu, C.; Liu, W.; Chang, C.; Plaza, A. Pixel purity index-based algorithms for endmember extraction from hyperspectral imagery. Recent Adv. Hyperspectral Signal Image Process. 2006, 37, 29–62. [Google Scholar]
- Wild, E.C. USGS library training and outerach: Finding and using scientific literature and data. In Geoscience Information Services: “Peak” Performances; Geoscience Information Society: Alexandria, VA, USA, 2010; Volume 51. [Google Scholar]
- Bertschinger, E. Multiscale Gaussian random fields and their application to cosmological simulations. Astrophys. J. Suppl. Ser. 2001, 137, 1. [Google Scholar] [CrossRef]
- Vane, G.; Green, R.O.; Chrien, T.G.; Enmark, H.T.; Hansen, E.G.; Porter, W.M. The airborne visible/infrared imaging spectrometer (AVIRIS). Remote Sens. Environ. 1993, 44, 127–143. [Google Scholar] [CrossRef]
- Sun, J.; Zhang, Y.; Wu, Z.; Zhu, Y.; Yin, X.; Ding, Z.; Wei, Z.; Plaza, J.; Plaza, A. An efficient and scalable framework for processing remotely sensed big data in cloud computing environments. IEEE Trans. Geosci. Remote Sens. 2019, 57, 4294–4308. [Google Scholar] [CrossRef]
- Wu, Z.; Sun, J.; Zhang, Y.; Zhu, Y.; Li, J.; Plaza, A.; Benediktsson, J.A.; Wei, Z. Scheduling-guided automatic processing of massive hyperspectral image classification on cloud computing. IEEE Trans. Cybern. 2020. [Google Scholar] [CrossRef]
Software | Release Version |
---|---|
Spark | 2.3.3 |
Hadoop | 2.7.3 |
OpenStack | Queens |
MySQL | 5.7.1 |
Java | 1.8.0_201 |
Scala | 2.11.8 |
Tomcat | 9.0 |
Spring | 4.3.1.RELEASE |
MyBatis | 1.3.1 |
Version | Cores | Dataset_C 2 (1.40 GB) | Dataset_C 3 (2.80 GB) | Dataset_C 4 (5.60 GB) | Dataset_C 5 (11.20 GB) | Dataset_C 4 (22.40 GB) | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
Time (min) | Speedup (x) | Time (min) | Speedup (x) | Time (min) | Speedup (x) | Time (min) | Speedup (x) | Time (min) | Speedup (x) | ||
Serial version | 1 | 144 | - | 288 | - | 582 | - | 1146 | - | 2268 | - |
Distributed version | 16 | 11 | 13.1 | 20 | 14.4 | 39 | 14.9 | 78 | 14.7 | 156 | 14.5 |
32 | 6.1 | 23.6 | 12 | 24.0 | 23 | 25.3 | 45 | 25.5 | 90 | 25.2 | |
64 | 5.2 | 27.7 | 9.8 | 29.4 | 20 | 29.1 | 38 | 30.2 | 72 | 31.5 | |
120 | 5.4 | 26.7 | 9.9 | 29.1 | 20 | 29.1 | 39 | 29.4 | 78 | 29.1 |
Version | Cores | Dataset_C 2 (1.40 GB) | Dataset_C 3 (2.80 GB) | Dataset_C 4 (5.60 GB) | Dataset_C 5 (11.20 GB) | Dataset_C 4 (22.40 GB) | |||||
---|---|---|---|---|---|---|---|---|---|---|---|
Time (s) | Speedup (x) | Time (s) | Speedup (x) | Time (s) | Speedup (x) | Time (s) | Speedup (x) | Time (s) | Speedup (x) | ||
Serial version | 1 | 144 | - | 288 | - | 582 | - | 1146 | - | 2268 | - |
Distributed version | 16 | 11 | 13.1 | 20 | 14.4 | 39 | 14.9 | 78 | 14.7 | 156 | 14.5 |
32 | 8 | 8.25 | 13 | 9.92 | 19 | 13.58 | 34 | 15.18 | 60 | 17.00 | |
64 | 10 | 6.60 | 14 | 9.21 | 20 | 12.9 | 33 | 15.64 | 66 | 15.45 | |
120 | 16 | 4.13 | 18 | 7.17 | 24 | 10.75 | 35 | 14.74 | 59 | 17.29 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zheng, P.; Wu, Z.; Sun, J.; Zhang, Y.; Zhu, Y.; Shen, Y.; Yang, J.; Wei, Z.; Plaza, A. A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms. Remote Sens. 2021, 13, 176. https://doi.org/10.3390/rs13020176
Zheng P, Wu Z, Sun J, Zhang Y, Zhu Y, Shen Y, Yang J, Wei Z, Plaza A. A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms. Remote Sensing. 2021; 13(2):176. https://doi.org/10.3390/rs13020176
Chicago/Turabian StyleZheng, Peng, Zebin Wu, Jin Sun, Yi Zhang, Yaoqin Zhu, Yuan Shen, Jiandong Yang, Zhihui Wei, and Antonio Plaza. 2021. "A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms" Remote Sensing 13, no. 2: 176. https://doi.org/10.3390/rs13020176
APA StyleZheng, P., Wu, Z., Sun, J., Zhang, Y., Zhu, Y., Shen, Y., Yang, J., Wei, Z., & Plaza, A. (2021). A Parallel Unmixing-Based Content Retrieval System for Distributed Hyperspectral Imagery Repository on Cloud Computing Platforms. Remote Sensing, 13(2), 176. https://doi.org/10.3390/rs13020176