3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning
Abstract
:1. Introduction
2. Related Works
2.1. UAS Based Photogrammetric Imaging
2.2. Grid Structures 3D Reconstruction
2.3. Deep Convolutional Neural Networks
- CNNs for single-photo 3D reconstruction: multiple neural network models were proposed for reconstruction of objects and buildings from a single image using conditional generative adversarial networks (GAN) [41,42,43,44,45,46,47]. While deep models such as Pix2Vox [44] and Z-GAN [47] proved to reconstruct complex structures from a single photo, but a large training dataset is required to achieve the desired quality. However, no public datasets of wire structures are available to date to train such models.
- CNNs for feature matching: the presented approaches [48,49,50,51,52] seem to outperform handcrafted feature detectors/descriptor methods. Still, their performance is closely related to the similarity of local image patches in the training dataset with respect to the images used during inference. However, repeating metal beams of wire structures are not present in modern datasets.
- CNNs for semantic image segmentation and boosting of SfM/MVS procedures: CNN methods [53,54,55,56,57,58,59] have also demonstrated their potential for detecting a numerous number of elements in the images and then boost the processing pipeline in terms of constrained tie point extraction or semantic multi-view stereo [60,61,62]. The advantages of image masking for dense point cloud generation are well known in the literature [62,63,64]. While there are multiple readily available segmenation models for oblique aerial photos [63] or buildings [64,65], the generation of pixel-level semantic segmentation for sparse wire objects is challenging. The analysis of repetitive patterns [66,67] allows to partly solve this problem for opaque objects (e.g., skyscrapers). Still, for objects with holes, such methods do not provide robust results. Generative Adversarial Networks (GANs) [68,69] have demonstrated a significant improvement for models that generate high fidelity output such as color images and semantic segmentation. Luc et al. [70] has proposed an adversarial framework for learning a robust semantic segmentation models capable of reconstructing fine details in the input imagery. Luc proposed to use masked images as an input for the discriminator. The discriminator observes color images masked with real masks and masks predicted by the framework. It learns to distinguish ‘real’ images and ‘fake’ images. This allows to provide a meaningful adversarial loss that improves the quality of segmentation in terms of small objects and object boundaries. So, considering image segmentation and masking as a key point for repetitive and self-occlusive structures 3D reconstruction from images, some deep network models were presented: MobileNetV2 [54], a fast network leveraging inverted residuals and linear bottlenecks; UPerNet [71] model, a multi-task network that uses internal feature map fusing to increase the labelling accuracy; HRNetV2 [72] which utliizes high-resolution representation and multiple streams of different spatial sizes to perform high-fidelity image segmentation. These CNN models serve as baselines and a starting point for developing our deep learning technique for accurate and robust image segmentation for further multi-view stereo 3D reconstruction of the Shukhov Radio tower.
3. Shukhov Tower and UAS-Based Surveying
3.1. Shukhov Radio Tower as a Photogrammetric Challenge
- The tower’s size (137 m height) and shape require some specific means for acquiring the necessary images keeping appropriate scale and ray intersection angles. UASs give a solution for this challenge allowing to acquire images of such huge-sized and hardly-get object according the specific requirements.
- The 3D surveying’s design and preparation must consider that the historical monument is now an operating radio translation tower: radio transmitters located on the tower disturb UAS control and operations.
- An effective image processing should minimize manual operation and also be able to handle holes, wire structures, repeated elements and shiny surfaces. This challenge can be answer with deep learning technique for detecting tower elements in images (Section 4).
3.2. UAS-Based Survey
3.3. Standard Imagery Processing
4. Deep-Learning Aided Iimage-Based 3D Reconstruction
4.1. Deep Learning Approach
4.2. WireNetV2 Model Architecture
- (i)
- It had low generalization ability and failed to label wire parts with a similar texture but different structure, such as the upper levels of the tower, if the training dataset included only ground-truth masks of the bottom and middle levels;
- (ii)
- The segmentation had soft edges at sharp corners that were caused by the negative log likelihood loss. Such soft edges reduced the matching accuracy during the sparse key-point matching stage.
4.3. WireNetV2 Loss Function
4.4. WireNet Training Dataset
- pixel-wise segmentation into two classes “tower” and “background”,
- generation of training labels.
5. Results
5.1. Training Process and Performance of WireNetV2 Model
5.2. Quantitative and Qualitative Evaluation of the WireNetV2 Model
5.3. Image-Based 3D Reconstruction
5.4. Textured 3D Model
6. Discussion
7. Conclusions
Author Contributions
Funding
Conflicts of Interest
Abbreviations
CNN | Convolution Neural Network |
GAN | Generative Adversarial Network |
GCP | Ground Control Point |
GSD | Ground Sample Distance |
IoU | Intersection-over-Union |
MVS | Multi-view Stereo |
RMSE | Root Mean Square Error |
SfM | Structure from Motion |
SGD | Stochastic Gradient Descend |
UAS | Unmanned Aerial System |
UAV | Unmanned Aerial Vehicle |
References
- Colomina, I.; Molina, P. Unmanned aerial systems for photogrammetry and remote sensing: A review. ISPRS J. Photogramm. Remote Sens. 2014, 92, 79–97. [Google Scholar] [CrossRef] [Green Version]
- Nex, F.; Remondino, F. UAV for 3D mapping applications: A review. Appl. Geomat. 2014, 6, 1–15. [Google Scholar] [CrossRef]
- Granshaw, S.I. RPV, UAV, UAS, RPAS … or just drone? Photogramm. Rec. 2018, 33, 160–170. [Google Scholar] [CrossRef]
- Hassanalian, M.; Abdelkefi, A. Classifications, applications, and design challenges of drones: A review. Prog. Aerosp. Sci. 2017, 91, 99–131. [Google Scholar] [CrossRef]
- Giordan, D.; Hayakawa, Y.; Nex, F.; Remondino, F.; Tarolli, P. Review article: The use of remotely piloted aircraft systems (RPASs) for natural hazards monitoring and management. Nat. Hazards Earth Syst. Sci. 2018, 18, 1079–1096. [Google Scholar] [CrossRef] [Green Version]
- Leonov, A.V.; Anikushkin, M.N.; Ivanov, A.V.; Ovcharov, S.V.; Bobkov, A.E.; Baturin, Y.M. Laser scanning and 3D modeling of the Shukhov hyperboloid tower in Moscow. J. Cult. Herit. 2015, 16, 551–559. [Google Scholar] [CrossRef]
- Stathopoulou, E.K.; Welponer, M.; Remondino, F. Open-source image-based 3d reconstruction pipelines: Review, comparison and evaluation. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W17, 331–338. [Google Scholar] [CrossRef] [Green Version]
- Mandlburger, G.; Pfennigbauer, M.; Schwarz, R.; Flory, S.; Nussbaumer, L. Concept and Performance Evaluation of a Novel UAV-Borne Topo-Bathymetric LiDAR Sensor. Remote Sens. 2020, 12, 986. [Google Scholar] [CrossRef] [Green Version]
- Anthony, D.; Elbaum, S.; Lorenz, A.; Detweiler, C. On crop height estimation with UAVs. In Proceedings of the 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems, Chicago, IL, USA, 14–18 September 2014; pp. 4805–4812. [Google Scholar] [CrossRef] [Green Version]
- Candiago, S.; Remondino, F.; De Giglio, M.; Dubbini, M.; Gattelli, M. Evaluating Multispectral Images and Vegetation Indices for Precision Farming Applications from UAV Images. Remote Sens. 2015, 7, 4026–4047. [Google Scholar] [CrossRef] [Green Version]
- Kameyama, S.; Sugiura, K. Estimating Tree Height and Volume Using Unmanned Aerial Vehicle Photography and SfM Technology, with Verification of Result Accuracy. Drones 2019, 3, 26. [Google Scholar] [CrossRef]
- Nex, F.; Remondino, F. Preface: Latest Developments, Methodologies, and Applications Based on UAV Platforms. Drones 2019, 3, 26. [Google Scholar] [CrossRef] [Green Version]
- Rinaudo, F.; Chiabrando, F.; Lingua, A.; Spanò, A. Archaeological site monitoring: UAV photogrammetry can be an answer. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2012, 39, 583–588. [Google Scholar] [CrossRef] [Green Version]
- Knyaz, V.; Chibunichev, A.; Zhuravlev, D. Multisource data fusion for documenting archaeological sites. In Image and Signal Processing for Remote Sensing XXIII; Bruzzone, L., Ed.; International Society for Optics and Photonics (SPIE): Bellingham, WA, USA, 2017; Volume 10427, pp. 508–516. [Google Scholar] [CrossRef]
- Sauerbier, M.; Eisenbeiss, H. UAVs For The Documentation Of Archaeological Excavations. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2010, 38, 526–531. [Google Scholar]
- Radoglou-Grammatikis, P.; Sarigiannidis, P.; Lagkas, T.; Moscholios, I. A compilation of UAV applications for precision agriculture. Comput. Netw. 2020, 172, 107148. [Google Scholar] [CrossRef]
- Grenzdorffer, G.J.; Engel, A.; Teichert, B. The Photogrammetric Potential of Low-Cost UAVS in Forestry and Agriculture. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2008, 31, 1207–1214. [Google Scholar]
- Hildmann, H.; Kovacs, E. Review: Using Unmanned Aerial Vehicles (UAVs) as Mobile Sensing Platforms (MSPs) for Disaster Response, Civil Security and Public Safety. Drones 2019, 3, 59. [Google Scholar] [CrossRef] [Green Version]
- Gonzalez, L.; Montes, G.; Puig, E.; Johnson, S.; Mengersen, K.; Gaston, K. Unmanned Aerial Vehicles (UAVs) and Artificial Intelligence Revolutionizing Wildlife Monitoring and Conservation. Sensors 2016, 16, 97. [Google Scholar] [CrossRef] [Green Version]
- Jakovljevic, G.; Govedarica, M.; Alvarez-Taboada, F. A Deep Learning Model for Automatic Plastic Mapping Using Unmanned Aerial Vehicle (UAV) Data. Remote Sens. 2020, 12, 1515. [Google Scholar] [CrossRef]
- Ya’acob, N.; Zolkapli, M.; Johari, J.; Yusof, A.L.; Sarnin, S.S.; Asmadinar, A.Z. UAV environment monitoring system. In Proceedings of the 2017 International Conference on Electrical, Electronics and System Engineering (ICEESE), Kanazawa, Japan, 9–10 November 2017; pp. 105–109. [Google Scholar] [CrossRef]
- Iglesias, L.; De Santos-Berbel, C.; Pascual, V.; Castro, M. Using Small Unmanned Aerial Vehicle in 3D Modeling of Highways with Tree-Covered Roadsides to Estimate Sight Distance. Remote Sens. 2019, 11, 2625. [Google Scholar] [CrossRef] [Green Version]
- Knyaz, V.A.; Chibunichev, A.G. Photogrammetric techniques for road surface analysis. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2016, 41, 515–520. [Google Scholar] [CrossRef]
- Romero-Chambi, E.; Villarroel-Quezada, S.; Atencio, E.; Munoz-La Rivera, F. Analysis of Optimal Flight Parameters of Unmanned Aerial Vehicles (UAVs) for Detecting Potholes in Pavements. Appl. Sci. 2020, 10, 4157. [Google Scholar] [CrossRef]
- Wefelscheid, C.; Hänsch, R.; Hellwich, O. Three-dimensional building reconstruction using images obtained by unmanned aerial vehicles. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2011, 38, 183–188. [Google Scholar] [CrossRef] [Green Version]
- Qin, R.; Grün, A.; Huang, X. UAV project—Building a reality-based 3D model of the NUS (National University of Singapore) campus. In Proceedings of the 33rd Asian Conference on Remote Sensing, Pattaya, Thailand, 26–30 November 2012. [Google Scholar]
- Calì, M.; Ambu, R. Advanced 3D Photogrammetric Surface Reconstruction of Extensive Objects by UAV Camera Image Acquisition. Sensors 2018, 18, 2815. [Google Scholar] [CrossRef] [Green Version]
- Hein, D.; Kraft, T.; Brauchle, J.; Berger, R. Integrated UAV-Based Real-Time Mapping for Security Applications. ISPRS Int. J. Geo Inf. 2019, 8, 219. [Google Scholar] [CrossRef] [Green Version]
- Liu, L.; Ceylan, D.; Lin, C.; Wang, W.; Mitra, N.J. Image-Based Reconstruction of Wire Art. ACM Trans. Graph. 2017, 36, 1–11. [Google Scholar] [CrossRef] [Green Version]
- Hofer, M.; Wendel, A.; Bischof, H. Line-based 3D Reconstruction of Wiry Objects. In Proceedings of the 18th Computer Vision Winter Workshop, Petersburg, Russia, 16–18 July 2013; pp. 78–85. [Google Scholar]
- Bacharidis, K.; Sarri, F.; Ragia, L. 3D Building Façade Reconstruction Using Deep Learning. ISPRS Int. J. Geo Inf. 2020, 9, 322. [Google Scholar] [CrossRef]
- Martin, T.; Montes, J.; Bazin, J.C.; Popa, T. Topology-Aware Reconstruction of Thin Tubular Structures. In SIGGRAPH Asia 2014 Technical Briefs; Association for Computing Machinery: New York, NY, USA, 2014. [Google Scholar] [CrossRef]
- Huang, H.; Wu, S.; Cohen-Or, D.; Gong, M.; Zhang, H.; Li, G.; Chen, B. L1-Medial Skeleton of Point Cloud. ACM Trans. Graph. 2013, 32, 65-1–65-8. [Google Scholar] [CrossRef] [Green Version]
- Morioka, K.; Ohtake, Y.; Suzuki, H. Reconstruction of Wire Structures from Scanned Point Clouds. In Advances in Visual Computing; Bebis, G., Boyle, R., Parvin, B., Koracin, D., Li, B., Porikli, F., Zordan, V., Klosowski, J., Coquillart, S., Luo, X., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; pp. 427–436. [Google Scholar]
- Su, I.; Qin, Z.; Saraceno, T.; Krell, A.; Mühlethaler, R.; Bisshop, A.; Buehler, M.J. Imaging and analysis of a three-dimensional spider web architecture. J. R. Soc. Interface 2018, 15, 20180193. [Google Scholar] [CrossRef] [Green Version]
- Nooruddin, M.; Rahman, M. Improved 3D Reconstruction for Images having Moving Object using Semantic Image Segmentation and Binary Masking. In Proceedings of the 2018 4th International Conference on Electrical Engineering and Information Communication Technology (iCEEiCT), Dhaka, Bangladesh, 13–15 September 2018; pp. 32–37. [Google Scholar] [CrossRef]
- Mohammed, H.M.; El-Sheimy, N. Segmentation of image pairs for 3d reconstruction. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W16, 175–180. [Google Scholar] [CrossRef] [Green Version]
- Ketcha, M.D.; Silva, T.D.; Uneri, A.; Kleinszig, G.; Vogt, S.; Wolinsky, J.P.; Siewerdsen, J.H. Automatic masking for robust 3D-2D image registration in image-guided spine surgery. In Medical Imaging 2016: Image-Guided Procedures, Robotic Interventions, and Modeling; Webster, R.J., III, Yaniv, Z.R., Eds.; International Society for Optics and Photonics (SPIE): Bellingham, WA, USA, 2016; Volume 9786, pp. 98–104. [Google Scholar] [CrossRef] [Green Version]
- Kaneko, M.; Iwami, K.; Ogawa, T.; Yamasaki, T.; Aizawa, K. Mask-SLAM: Robust Feature-Based Monocular SLAM by Masking Using Semantic Segmentation. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Salt Lake City, UT, USA, 18–22 June 2018; pp. 371–3718. [Google Scholar] [CrossRef]
- Wan, Q.; Li, Y.; Cui, H.; Feng, Z. 3D-Mask-GAN:Unsupervised Single-View 3D Object Reconstruction. In Proceedings of the 2019 6th International Conference on Behavioral, Economic and Socio-Cultural Computing (BESC), Beijing, China, 28–30 October 2019; pp. 1–6. [Google Scholar] [CrossRef]
- Girdhar, R.; Fouhey, D.F.; Rodriguez, M.; Gupta, A. Learning a Predictable and Generative Vector Representation for Objects. In Computer Vision—ECCV 2016; Leibe, B., Matas, J., Sebe, N., Welling, M., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 484–499. [Google Scholar]
- Shin, D.; Fowlkes, C.C.; Hoiem, D. Pixels, Voxels, and Views: A Study of Shape Representations for Single View 3D Object Shape Prediction. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 3061–3069. [Google Scholar] [CrossRef] [Green Version]
- Choy, C.B.; Xu, D.; Gwak, J.; Chen, K.; Savarese, S. 3D-R2N2: A Unified Approach for Single and Multi-view 3D Object Reconstruction. In Computer Vision—ECCV 2016; Leibe, B., Matas, J., Sebe, N., Welling, M., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 628–644. [Google Scholar] [CrossRef] [Green Version]
- Xie, H.; Yao, H.; Sun, X.; Zhou, S.; Zhang, S. Pix2Vox: Context-Aware 3D Reconstruction From Single and Multi-View Images. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October–2 November 2019; pp. 2690–2698. [Google Scholar] [CrossRef] [Green Version]
- Shin, D.; Ren, Z.; Sudderth, E.B.; Fowlkes, C.C. 3D Scene Reconstruction With Multi-Layer Depth and Epipolar Transformers. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October–2 November 2019; pp. 2172–2182. [Google Scholar] [CrossRef] [Green Version]
- Knyaz, V.A.; Kniaz, V.V.; Remondino, F. Image-to-Voxel Model Translation with Conditional Adversarial Networks. In Computer Vision—ECCV 2018; Springer: Cham, Switzerland, 2018; pp. 601–618. [Google Scholar] [CrossRef]
- Kniaz, V.V.; Remondino, F.; Knyaz, V.A. Generative adversarial networks for single photo 3D reconstruction. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W9, 403–408. [Google Scholar] [CrossRef] [Green Version]
- Yi, K.M.; Trulls, E.; Lepetit, V.; Fua, P. LIFT: Learned Invariant Feature Transform. In Computer Vision—ECCV 2018; Springer: Cham, Switzerland, 2018; pp. 467–483. [Google Scholar] [CrossRef] [Green Version]
- Ono, Y.; Trulls, E.; Fua, P.; Yi, K.M. LF-Net: Learning Local Features from Images. In Proceedings of the Annual Conference on Neural Information Processing Systems 2018, NeurIPS 2018, Montréal, QC, Canada, 3–8 December 2018; pp. 6237–6247. [Google Scholar]
- Christiansen, P.H.; Kragh, M.F.; Brodskiy, Y.; Karstoft, H. UnsuperPoint: End-to-end Unsupervised Interest Point Detector and Descriptor. arXiv 2019, arXiv:1907.04011. [Google Scholar]
- Shen, X.; Wang, C.; Li, X.; Yu, Z.; Li, J.; Wen, C.; Cheng, M.; He, Z. RF-Net: An End-to-End Image Matching Network based on Receptive Field. arXiv 2019, arXiv:1906.00604. [Google Scholar]
- Kniaz, V.V.; Mizginov, V.; Grodzitsky, L.; Bordodymov, A. GANcoder: Robust feature point matching using conditional adversarial auto-encoder. In Optics, Photonics and Digital Technologies for Imaging Applications VI; Schelkens, P., Kozacki, T., Eds.; International Society for Optics and Photonics (SPIE): Bellingham, WA, USA, 2020; Volume 11353, pp. 59–68. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the International Conference on Medical image computing and Computer-Assisted Intervention, Shenzhen, China, 13–17 October 2019; Springer: Berlin/Heidelberg, Germany, 2015; pp. 234–241. [Google Scholar]
- Sandler, M.; Howard, A.G.; Zhu, M.; Zhmoginov, A.; Chen, L. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2018, Salt Lake City, UT, USA, 18–22 June 2018; pp. 4510–4520. [Google Scholar] [CrossRef] [Green Version]
- Minaee, S.; Boykov, Y.; Porikli, F.M.; Plaza, A.J.; Kehtarnavaz, N.; Terzopoulos, D. Image Segmentation Using Deep Learning: A Survey. arXiv 2020, arXiv:2001.05566. [Google Scholar]
- Kniaz, V.V. Conditional GANs for semantic segmentation of multispectral satellite images. In Image and Signal Processing for Remote Sensing XXIV; Bruzzone, L., Bovolo, F., Eds.; International Society for Optics and Photonics (SPIE): Bellingham, WA, USA, 2018; Volume 10789, pp. 259–267. [Google Scholar] [CrossRef]
- Kniaz, V.V. Deep learning for dense labeling of hydrographic regions in very high resolution imagery. In Image and Signal Processing for Remote Sensing XXV; Bruzzone, L., Bovolo, F., Eds.; International Society for Optics and Photonics (SPIE): Bellingham, WA, USA, 2019; Volume 11155, pp. 283–292. [Google Scholar] [CrossRef]
- Shelhamer, E.; Long, J.; Darrell, T. Fully Convolutional Networks for Semantic Segmentation. arXiv 2016, arXiv:1605.06211. [Google Scholar] [CrossRef]
- Christiansen, P.; Nielsen, L.N.; Steen, K.A.; Jørgensen, R.N.; Karstoft, H. DeepAnomaly: Combining Background Subtraction and Deep Learning for Detecting Obstacles and Anomalies in an Agricultural Field. Sensors 2016, 16, 1904. [Google Scholar] [CrossRef] [Green Version]
- Huang, P.; Matzen, K.; Kopf, J.; Ahuja, N.; Huang, J. DeepMVS: Learning Multi-view Stereopsis. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; pp. 2821–2830. [Google Scholar]
- Kuhn, A.; Sormann, C.; Rossi, M.; Erdler, O.; Fraundorfer, F. DeepC-MVS: Deep Confidence Prediction for Multi-View Stereo Reconstruction. arXiv 2019, arXiv:1912.00439. [Google Scholar]
- Stathopoulou, E.K.; Remondino, F. Multi-view stereo with semantic priors. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, XLII-2/W15, 1135–1140. [Google Scholar] [CrossRef] [Green Version]
- Wei, Z.; Wang, Y.; Yi, H.; Chen, Y.; Wang, G. Semantic 3D Reconstruction with Learning MVS and 2D Segmentation of Aerial Images. Appl. Sci. 2020, 10, 1275. [Google Scholar] [CrossRef] [Green Version]
- De Nunzio, G. A software tool for the semi-automatic segmentation of architectural 3D models with semantic annotation and Web fruition. ACTA IMEKO 2018, 7, 64–72. [Google Scholar] [CrossRef]
- Stathopoulou, E.K.; Remondino, F. Multi view stereo with semantic priors. arXiv 2020, arXiv:2007.02295. [Google Scholar] [CrossRef] [Green Version]
- Roberts, R.; Sinha, S.N.; Szeliski, R.; Steedly, D. Structure from motion for scenes with large duplicate structures. In CVPR 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 3137–3144. [Google Scholar]
- Jiang, N.; Tan, P.; Cheong, L.F. Seeing double without confusion: Structure-from-motion in highly ambiguous scenes. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 6–21 June 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 1458–1465. [Google Scholar]
- Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.C.; Bengio, Y. Generative Adversarial Nets. In Proceedings of the Annual Conference on Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; pp. 2672–2680. [Google Scholar]
- Isola, P.; Zhu, J.; Zhou, T.; Efros, A.A. Image-to-Image Translation with Conditional Adversarial Networks. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, 21–26 July 2017; pp. 5967–5976. [Google Scholar] [CrossRef] [Green Version]
- Luc, P.; Couprie, C.; Chintala, S.; Verbeek, J. Semantic Segmentation using Adversarial Networks. arXiv 2016, arXiv:1611.08408. [Google Scholar]
- Xiao, T.; Liu, Y.; Zhou, B.; Jiang, Y.; Sun, J. Unified Perceptual Parsing for Scene Understanding. In Computer Vision—ECCV 2018; Springer: Cham, Switzerland, 2018; pp. 432–448. [Google Scholar] [CrossRef] [Green Version]
- Sun, K.; Zhao, Y.; Jiang, B.; Cheng, T.; Xiao, B.; Liu, D.; Mu, Y.; Wang, X.; Liu, W.; Wang, J. High-Resolution Representations for Labeling Pixels and Regions. arXiv 2019, arXiv:1904.04514. [Google Scholar]
- Kniaz, V.V.; Zheltov, S.Y.; Remondino, F.; Knyaz, V.A.; Bordodymov, A.; Gruen, A. Wire structure image-based 3D reconstruction aided by deep learning. ISPRS Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2020, 43, 435–441. [Google Scholar] [CrossRef]
- Zhang, H.; Xu, T.; Li, H.; Zhang, S.; Wang, X.; Huang, X.; Metaxas, D.N. StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 41, 1947–1962. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.; Gimelshein, N.; Antiga, L.; et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library. In Proceedings of the Annual Conference on Neural Information Processing Systems 2019, NeurIPS 2019, Montréal, QC, Canada, 3 December 2019. [Google Scholar]
- Schönberger, J.L.; Zheng, E.; Frahm, J.M.; Pollefeys, M. Pixelwise View Selection for Unstructured Multi-View Stereo. In Computer Vision—ECCV 2016; Leibe, B., Matas, J., Sebe, N., Welling, M., Eds.; Springer International Publishing: Cham, Switzerland, 2016; pp. 501–518. [Google Scholar] [CrossRef]
- Schönberger, J.L.; Frahm, J. Structure-from-Motion Revisited. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 4104–4113. [Google Scholar] [CrossRef]
- Wu, B.; Zhou, Y.; Qian, Y.; Gong, M.; Huang, H. Full 3D reconstruction of transparent objects. ACM Trans. Graph. 2018, 37, 103:1–103:11. [Google Scholar] [CrossRef] [Green Version]
- Bianco, S.; Ciocca, G.; Marelli, D. Evaluating the Performance of Structure from Motion Pipelines. J. Imaging 2018, 4, 98. [Google Scholar] [CrossRef] [Green Version]
- Atcheson, B.; Ihrke, I.; Heidrich, W.; Tevs, A.; Bradley, D.; Magnor, M.A.; Seidel, H. Time-resolved 3d capture of non-stationary gas flows. ACM Trans. Graph. 2008, 27, 132. [Google Scholar] [CrossRef]
- Ji, Y.; Ye, J.; Yu, J. Reconstructing Gas Flows Using Light-Path Approximation. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, 23–28 June 2013; pp. 2507–2514. [Google Scholar] [CrossRef] [Green Version]
- Ihrke, I.; Magnor, M.A. Image-based tomographic reconstruction of flames. In ACM SIGGRAPH 2004 Sketches (SIGGRAPH’04); Association for Computing Machinery: New York, NY, USA, 2004. [Google Scholar] [CrossRef] [Green Version]
- Wu, Z.; Zhou, Z.; Tian, D.; Wu, W. Reconstruction of three-dimensional flame with color temperature. Vis. Comput. 2015, 31, 613–625. [Google Scholar] [CrossRef] [Green Version]
- Hofer, M.; Maurer, M.; Bischof, H. Efficient 3D scene abstraction using line segments. Comput. Vis. Image Underst. 2017, 157, 167–178. [Google Scholar] [CrossRef]
Parameter | Value |
---|---|
Camera type | Mirrorless interchangeable |
lens digital camera | |
Lens: | E-mount lens |
Focal length | 16 mm |
Image sensor | Exmor APS-C HD CMOS |
23.4 × 15.6 mm | |
Total pixel number | Appr. 14,600,000 pix |
ISO sensitivity: | Auto, ISO 200 to 12,800 |
Exposure compensation: | EV (1/3 EV step) |
Shutter | Electronically-controlled, |
vertical-traverse, focal-plane | |
Speed range: | 1/4000 s to 30 s |
Parameter | Value |
---|---|
Brand | Ascending Techn |
Max. payload [kg] | 0.75 |
Max. stay in the air [min] | 22 |
Max. speed [km/h] | 60 |
Max. height above sea [m] | 1000 |
Propulsion | Electric |
ϕ/wingspan [cm] | 82 |
Height [cm] | 12.5 |
Weight [kg] | 0.98 |
Weight of battery [kg] | 0.45 |
Number of rotors | 8 |
Transport on human back | Y |
Name | Kernel | Str. | Ch. I/O | In Res. | Out Res. | Recep. Field | Input |
---|---|---|---|---|---|---|---|
conv0 | 2 | 9/64 | 4 | 3 RGB images multiplied by masks | |||
conv1 | 2 | 64/128 | 7 | conv0 | |||
conv2 | 2 | 128/256 | 16 | conv1 | |||
conv3 | 2 | 256/256 | 32 | conv2 | |||
conv4 | 2 | 256/256 | 34 | conv3 | |||
conv5 | 2 | 256/256 | 70 | conv4 | |||
conv6 | 1 | 256/1 | 70 | conv5 |
HRNetV2 [72] | MobileNetV2 [54] | UPerNet [71] | WireNetV2 | |
---|---|---|---|---|
I | 0.771 | 0.585 | 0.704 | 0.762 |
II | 0.799 | 0.554 | 0.770 | 0.826 |
III | 0.769 | 0.597 | 0.730 | 0.803 |
average | 0.780 | 0.579 | 0.735 | 0.797 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Knyaz, V.A.; Kniaz, V.V.; Remondino, F.; Zheltov, S.Y.; Gruen, A. 3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning. Remote Sens. 2020, 12, 3128. https://doi.org/10.3390/rs12193128
Knyaz VA, Kniaz VV, Remondino F, Zheltov SY, Gruen A. 3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning. Remote Sensing. 2020; 12(19):3128. https://doi.org/10.3390/rs12193128
Chicago/Turabian StyleKnyaz, Vladimir A., Vladimir V. Kniaz, Fabio Remondino, Sergey Y. Zheltov, and Armin Gruen. 2020. "3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning" Remote Sensing 12, no. 19: 3128. https://doi.org/10.3390/rs12193128
APA StyleKnyaz, V. A., Kniaz, V. V., Remondino, F., Zheltov, S. Y., & Gruen, A. (2020). 3D Reconstruction of a Complex Grid Structure Combining UAS Images and Deep Learning. Remote Sensing, 12(19), 3128. https://doi.org/10.3390/rs12193128