3D Computer Vision and Smart Building and City

Journal Name	Impact Factor	CiteScore	Launched Year	First Decision (median)	APC
Buildings buildings	3.1	4.4	2011	15.1 Days	CHF 2600
Drones drones	4.8	7.4	2017	20.8 Days	CHF 2600
Energies energies	3.2	7.3	2008	16.8 Days	CHF 2600
Sensors sensors	3.5	8.2	2001	17.8 Days	CHF 2600
Sustainability sustainability	3.3	7.7	2009	17.9 Days	CHF 2400

14 pages, 17481 KB

Open AccessArticle

Integration with Visual Perception—Research on the Usability of a Data Visualization Interface Layout in Zero-Carbon Parks Based on Eye-Tracking Technology

by Guangxu Li, Lingyu Wang and Jie Hu

Sustainability 2023, 15(14), 11102; https://doi.org/10.3390/su151411102 - 17 Jul 2023

Cited by 6 | Viewed by 3629

Abstract

With the continued application of data visualization technology in sustainable development, the construction of carbon emission monitoring platforms is becoming increasingly popular in industrial parks. However, there are many kinds of such interfaces, the usability of which remains unclear. Therefore, in order to [...] Read more.

With the continued application of data visualization technology in sustainable development, the construction of carbon emission monitoring platforms is becoming increasingly popular in industrial parks. However, there are many kinds of such interfaces, the usability of which remains unclear. Therefore, in order to explore the usability of current carbon emission visualization interfaces in parks and put forward humanized optimization strategies for their subsequent design, this study used eye-tracking technology to analyze the data readability of six types of layouts from three aspects of visual perception features: integrity, understandability, and selectivity. Quantitative data from eye movement experiments and visual perception characteristics were evaluated using a Likert scale in an analysis of different layouts, and the correlation data between three visual perception characteristics and the readability of different layout data were obtained using an SPSS tool. The results show that, compared with a layout containing 3D graphics, the pure data type of interface has a shorter task completion time and higher readability; however, it provides fewer choices for users and is less interesting. In addition, there is a significant negative correlation between integrity and task completion time; the more complete the interface layout, the shorter the task completion time. In summary, a certain correlation was found between visual perception characteristics and the readability of interface layout using this method. At the same time, the advantages and disadvantages of different interface layouts were also analyzed, and more humanized optimization directions and strategies were devised. This is vital for aiding subsequent research on the influence of specific layout elements to optimize visualization interfaces that display carbon emission data. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

27 pages, 15219 KB

Open AccessArticle

SGGTSO: A Spherical Vector-Based Optimization Algorithm for 3D UAV Path Planning

by Wentao Wang, Chen Ye and Jun Tian

Drones 2023, 7(7), 452; https://doi.org/10.3390/drones7070452 - 7 Jul 2023

Cited by 21 | Viewed by 3898

Abstract

The application of 3D UAV path planning algorithms in smart cities and smart buildings can improve logistics efficiency, enhance emergency response capabilities as well as provide services such as indoor navigation, thus bringing more convenience and safety to people’s lives and work. The [...] Read more.

The application of 3D UAV path planning algorithms in smart cities and smart buildings can improve logistics efficiency, enhance emergency response capabilities as well as provide services such as indoor navigation, thus bringing more convenience and safety to people’s lives and work. The main idea of the 3D UAV path planning problem is how to plan to get an optimal flight path while ensuring that the UAV does not collide with obstacles during flight. This paper transforms the 3D UAV path planning problem into a multi-constrained optimization problem by formulating the path length cost function, the safety cost function, the flight altitude cost function and the smoothness cost function. This paper encodes each feasible flight path as a set of vectors consisting of magnitude, elevation and azimuth angles and searches for the optimal flight path in the configuration space by means of a metaheuristic algorithm. Subsequently, this paper proposes an improved tuna swarm optimization algorithm based on a sigmoid nonlinear weighting strategy, multi-subgroup Gaussian mutation operator and elite individual genetic strategy, called SGGTSO. Finally, the SGGTSO algorithm is compared with some other classical and novel metaheuristics in a 3D UAV path planning problem with nine different terrain scenarios and in the CEC2017 test function set. The comparison results show that the flight path planned by the SGGTSO algorithm significantly outperforms other comparison algorithms in nine different terrain scenarios, and the optimization performance of SGGTSO outperforms other comparison algorithms in 24 CEC2017 test functions. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

28 pages, 12417 KB

Open AccessArticle

Analysis of Polarization Detector Performance Parameters on Polarization 3D Imaging Accuracy

by Pengzhang Dai, Dong Yao, Tianxiang Ma, Honghai Shen, Weiguo Wang and Qingyu Wang

Sensors 2023, 23(11), 5129; https://doi.org/10.3390/s23115129 - 27 May 2023

Cited by 5 | Viewed by 3240

Abstract

Three-dimensional (3D) reconstruction of objects using the polarization properties of diffuse light on the object surface has become a crucial technique. Due to the unique mapping relation between the degree of polarization of diffuse light and the zenith angle of the surface normal [...] Read more.

Three-dimensional (3D) reconstruction of objects using the polarization properties of diffuse light on the object surface has become a crucial technique. Due to the unique mapping relation between the degree of polarization of diffuse light and the zenith angle of the surface normal vector, polarization 3D reconstruction based on diffuse reflection theoretically has high accuracy. However, in practice, the accuracy of polarization 3D reconstruction is limited by the performance parameters of the polarization detector. Improper selection of performance parameters can result in large errors in the normal vector. In this paper, the mathematical models that relate the polarization 3D reconstruction errors to the detector performance parameters including polarizer extinction ratio, polarizer installation error, full well capacity and analog-to-digital (A2D) bit depth are established. At the same time, polarization detector parameters suitable for polarization 3D reconstruction are provided by the simulation. The performance parameters we recommend include an extinction ratio ≥ 200, an installation error

\in

[−1°, 1°], a full-well capacity ≥ 100

K e^{-}

, and an A2D bit depth ≥ 12 bits. The models provided in this paper are of great significance for improving the accuracy of polarization 3D reconstruction. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

16 pages, 3853 KB

Open AccessArticle

A Lightweight UAV System: Utilizing IMU Data for Coarse Judgment of Loop Closure

by Hongwei Zhu, Guobao Zhang, Zhiqi Ye and Hongyi Zhou

Drones 2023, 7(6), 338; https://doi.org/10.3390/drones7060338 - 23 May 2023

Viewed by 3299

Abstract

Unmanned aerial vehicles (UAVs) can experience significant performance issues during flight due to heavy CPU load, affecting their flight capabilities, communication, and endurance. To address this issue, this paper presents a lightweight stereo-inertial state estimator for addressing the heavy CPU load issue of [...] Read more.

Unmanned aerial vehicles (UAVs) can experience significant performance issues during flight due to heavy CPU load, affecting their flight capabilities, communication, and endurance. To address this issue, this paper presents a lightweight stereo-inertial state estimator for addressing the heavy CPU load issue of ORB-SLAM. It utilizes nonlinear optimization and features to incorporate inertial information throughout the Simultaneous Localization and Mapping (SLAM) pipeline. The first key innovation is a coarse-to-fine optimization method that targets the enhancement of tracking speed by efficiently addressing bias and noise in the IMU parameters. A novel visual–inertial pose graph is proposed as an observer to assess error thresholds and guide the system towards visual-only or visual–inertial maximum a posteriori (MAP) estimation accordingly. Furthermore, this paper introduces the incorporation of inertial data in the loop closure thread. The IMU data provide displacement direction relative to world coordinates, which is serving as a necessary condition for loop detection. The experimental results demonstrate that our method maintains excellent localization accuracy compared to other state-of-the-art approaches on benchmark datasets, while also significantly reducing CPU load. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

25 pages, 7806 KB

Open AccessArticle

A Cross-Source Point Cloud Registration Algorithm Based on Trigonometric Mutation Chaotic Harris Hawk Optimisation for Rockfill Dam Construction

by Bingyu Ren, Hao Zhao and Shuyang Han

Sensors 2023, 23(10), 4942; https://doi.org/10.3390/s23104942 - 21 May 2023

Cited by 6 | Viewed by 3189

Abstract

A high-precision three-dimensional (3D) model is the premise and vehicle of digitalising hydraulic engineering. Unmanned aerial vehicle (UAV) tilt photography and 3D laser scanning are widely used for 3D model reconstruction. Affected by the complex production environment, in a traditional 3D reconstruction based [...] Read more.

A high-precision three-dimensional (3D) model is the premise and vehicle of digitalising hydraulic engineering. Unmanned aerial vehicle (UAV) tilt photography and 3D laser scanning are widely used for 3D model reconstruction. Affected by the complex production environment, in a traditional 3D reconstruction based on a single surveying and mapping technology, it is difficult to simultaneously balance the rapid acquisition of high-precision 3D information and the accurate acquisition of multi-angle feature texture characteristics. To ensure the comprehensive utilisation of multi-source data, a cross-source point cloud registration method integrating the trigonometric mutation chaotic Harris hawk optimisation (TMCHHO) coarse registration algorithm and the iterative closest point (ICP) fine registration algorithm is proposed. The TMCHHO algorithm generates a piecewise linear chaotic map sequence in the population initialisation stage to improve population diversity. Furthermore, it employs trigonometric mutation to perturb the population in the development stage and thus avoid the problem of falling into local optima. Finally, the proposed method was applied to the Lianghekou project. The accuracy and integrity of the fusion model compared with those of the realistic modelling solutions of a single mapping system improved. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

27 pages, 7713 KB

Open AccessArticle

Thermal Characterization of Buildings with as-is Thermal-Building Information Modelling

by Víctor Pérez-Andreu, Antonio Adán Oliver, Carolina Aparicio-Fernández and José-Luis Vivancos Bono

Buildings 2023, 13(4), 972; https://doi.org/10.3390/buildings13040972 - 6 Apr 2023

Cited by 4 | Viewed by 3099

Abstract

Developing methodologies to accurately characterise the energy conditions of existing building stock is a fundamental aspect of energy consumption reduction strategies. To that end, a case study using a thermal information modelling method for existing buildings (as-is T-BIM) is reported. This proposed new [...] Read more.

Developing methodologies to accurately characterise the energy conditions of existing building stock is a fundamental aspect of energy consumption reduction strategies. To that end, a case study using a thermal information modelling method for existing buildings (as-is T-BIM) is reported. This proposed new method is based on the automatic processing of 3D thermal clouds of interior zones of a building that generates a semantic proprietary model that contains time series of surface temperatures assigned to its surface elements. The proprietary as-is T-BIM automatically generates an as-is BEM model with gbXML standards for energy simulation. This is a multi-zone energy model of the building. In addition, the surface temperature data series of the as-is T-BIM model elements permit the calculation of their thermal transmittances, increasing the calibration options of the obtained as-is BEM model. To test the as-is TBIM method, a case study compares the as-is BEM model obtained by as-is T-BIM methods with the one obtained by standard methods for the same building. The results demonstrate differences in geometry, transmittance, and infiltration values, as well as insignificant differences in annual air conditioning energy consumption or the comfort parameters tested. This seems to indicate shorter modelling times and greater accuracy of the as-is T-BIM model. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

27 pages, 11254 KB

Open AccessArticle

A Robust Real-Time Ellipse Detection Method for Robot Applications

by Wenshan He, Gongping Wu, Fei Fan, Zhongyun Liu and Shujie Zhou

Drones 2023, 7(3), 209; https://doi.org/10.3390/drones7030209 - 17 Mar 2023

Cited by 11 | Viewed by 6509

Abstract

Over the years, many ellipse detection algorithms have been studied broadly, while the critical problem of accurately and effectively detecting ellipses in the real-world using robots remains a challenge. In this paper, we proposed a valuable real-time robot-oriented detector and simple tracking algorithm [...] Read more.

Over the years, many ellipse detection algorithms have been studied broadly, while the critical problem of accurately and effectively detecting ellipses in the real-world using robots remains a challenge. In this paper, we proposed a valuable real-time robot-oriented detector and simple tracking algorithm for ellipses. This method uses low-cost RGB cameras for conversion into HSV space to obtain reddish regions of interest (RROIs) contours, effective arc selection and grouping strategies, and the candidate ellipses selection procedures that eliminate invalid edges and clustering functions. Extensive experiments are conducted to adjust and verify the method’s parameters for achieving the best performance. The method combined with a simple tracking algorithm executes only approximately 30 ms on a video frame in most cases. The results show that the proposed method had high-quality performance (precision, recall, F-Measure scores) and the least execution time compared with the existing nine most advanced methods on three public actual application datasets. Our method could detect elliptical markers in real-time in practical applications, detect ellipses adaptively under natural light, well detect severely blocked and specular reflection ellipses when the elliptical object was far from or close to the robot. The average detection frequency can meet the real-time requirements (>10 Hz). Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

14 pages, 5319 KB

Open AccessArticle

Physical Structure Expression for Dense Point Clouds of Magnetic Levitation Image Data

by Yuxin Zhang, Lei Zhang, Guochen Shen and Qian Xu

Sensors 2023, 23(5), 2535; https://doi.org/10.3390/s23052535 - 24 Feb 2023

Viewed by 2295

Abstract

The research and development of an intelligent magnetic levitation transportation system has become an important research branch of the current intelligent transportation system (ITS), which can provide technical support for state-of-the-art fields such as intelligent magnetic levitation digital twin. First, we applied unmanned [...] Read more.

The research and development of an intelligent magnetic levitation transportation system has become an important research branch of the current intelligent transportation system (ITS), which can provide technical support for state-of-the-art fields such as intelligent magnetic levitation digital twin. First, we applied unmanned aerial vehicle oblique photography technology to acquire the magnetic levitation track image data and preprocessed them. Then, we extracted the image features and matched them based on the incremental structure from motion (SFM) algorithm, recovered the camera pose parameters of the image data and the 3D scene structure information of key points, and optimized the bundle adjustment to output 3D magnetic levitation sparse point clouds. Then, we applied multiview stereo (MVS) vision technology to estimate the depth map and normal map information. Finally, we extracted the output of the dense point clouds that can precisely express the physical structure of the magnetic levitation track, such as turnout, turning, linear structures, etc. By comparing the dense point clouds model with the traditional building information model, experiments verified that the magnetic levitation image 3D reconstruction system based on the incremental SFM and MVS algorithm has strong robustness and accuracy and can express a variety of physical structures of magnetic levitation track with high accuracy. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

16 pages, 6307 KB

Open AccessArticle

Event-Guided Image Super-Resolution Reconstruction

by Guangsha Guo, Yang Feng, Hengyi Lv, Yuchen Zhao, Hailong Liu and Guoling Bi

Sensors 2023, 23(4), 2155; https://doi.org/10.3390/s23042155 - 14 Feb 2023

Cited by 9 | Viewed by 5333

Abstract

The event camera efficiently detects scene radiance changes and produces an asynchronous event stream with low latency, high dynamic range (HDR), high temporal resolution, and low power consumption. However, the large output data caused by the asynchronous imaging mechanism makes the increase in [...] Read more.

The event camera efficiently detects scene radiance changes and produces an asynchronous event stream with low latency, high dynamic range (HDR), high temporal resolution, and low power consumption. However, the large output data caused by the asynchronous imaging mechanism makes the increase in spatial resolution of the event camera limited. In this paper, we propose a novel event camera super-resolution (SR) network (EFSR-Net) based on a deep learning approach to address the problems of low spatial resolution and poor visualization of event cameras. The network model is capable of reconstructing high-resolution (HR) intensity images using event streams and active sensor pixel (APS) frame information. We design the coupled response blocks (CRB) in the network that are able of fusing the feature information of both data to achieve the recovery of detailed textures in the shadows of real images. We demonstrate that our method is able to reconstruct high-resolution intensity images with more details and less blurring in synthetic and real datasets, respectively. The proposed EFSR-Net can improve the peak signal-to-noise ratio (PSNR) metric by 1–2 dB compared with state-of-the-art methods. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

18 pages, 14274 KB

Open AccessArticle

Development of a Construction-Site Work Support System Using BIM-Marker-Based Augmented Reality

by Jae-Wook Yoon and Seung-Hyun Lee

Sustainability 2023, 15(4), 3222; https://doi.org/10.3390/su15043222 - 10 Feb 2023

Cited by 15 | Viewed by 5006

Abstract

Augmented reality (AR) in 3D has been proposed as a way to overcome the shortcomings of 2D drawings. In particular, marker-based AR is known to be more accurate in implementation, but it is not easy to use on construction sites because it requires [...] Read more.

Augmented reality (AR) in 3D has been proposed as a way to overcome the shortcomings of 2D drawings. In particular, marker-based AR is known to be more accurate in implementation, but it is not easy to use on construction sites because it requires more time and effort to create corresponding markers for information. Therefore, the purpose of this study was to develop a building information modeling (BIM)-based AR construction work support system that can be applied to construction sites by automatically generating markers. The system algorithm consists of three modules. The first module classifies and groups the objects of the BIM-based 3D model by work order. The second is used to reconstruct the 3D model by groups and automatically generate the corresponding individual markers for each object. The third specifies the marker position and implements AR by automatically matching of 3D model objects to the corresponding markers. To verify this system, a case study was implemented by projecting the BIM-marker-based AR of a 3D model on an existing building. The results show that the developed system provides 3D models and work-related information in AR at the correct scale, size, and location. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

17 pages, 4799 KB

Open AccessArticle

Dynamic Occlusion Modeling and Clearance Control of the Visual Field of Curved Highway Roadside Landscape

by Jian Xiao, Xudong Zha, Liulin Yang and Jie Wei

Sustainability 2023, 15(4), 3200; https://doi.org/10.3390/su15043200 - 9 Feb 2023

Viewed by 2283

Abstract

In order to solve the control of the degree of anti-occlusion of the roadside landscape of the expressway curve according to the drivers’ visual characteristics during high-speed driving, a dynamic space model of the visual process of curved highway roadside landscape was established, [...] Read more.

In order to solve the control of the degree of anti-occlusion of the roadside landscape of the expressway curve according to the drivers’ visual characteristics during high-speed driving, a dynamic space model of the visual process of curved highway roadside landscape was established, and the calculation equation of roadside landscape visual field was derived. The dynamic occlusion ratio was defined by space coordinates, and the judgment model was proposed for the de-occlusion of the roadside landscape visual field. According to the standard design parameters of the G4 Highway Hunan section, the occlusion laws were analyzed by MATLAB for different widths and different heights of obstructions as well as the same widths and heights of obstructions at different positions in the roadside landscape visual field, thus the control value and control content of anti-occlusion clearance for the roadside landscapes were proposed. The results show that the anti-occlusion clearance control range of the roadside landscape is 270 m at the design speed of 120 km/h, 220 m at 100 km/h, and 170 m at 80 km/h. The control value of clearance width is 25 m, and the control value of clearance height is recommended to be 20 m. Within the scope of highway land expropriation, it is recommended to land expropriation 25 m wide from the road boundary. The research provides model support for building the closeness and openness of highway roadside landscape. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

17 pages, 3290 KB

Open AccessArticle

Research on Public Space Micro-Renewal Strategy of Historical and Cultural Blocks in Sanhe Ancient Town under Perception Quantification

by Wenqing Ding, Qinqin Wei, Jing Jin, Juanjuan Nie, Fanfan Zhang, Xiaotian Zhou and Youhua Ma

Sustainability 2023, 15(3), 2790; https://doi.org/10.3390/su15032790 - 3 Feb 2023

Cited by 24 | Viewed by 5021

Abstract

The public space environment of historical and cultural blocks is inseparable from human activities, which affects tourists’ behavior and perception activities. Through the evaluation of tourists’ environmental behavior perception, the relationship between spatial characteristics and tourists’ perception is fully considered, which plays an [...] Read more.

The public space environment of historical and cultural blocks is inseparable from human activities, which affects tourists’ behavior and perception activities. Through the evaluation of tourists’ environmental behavior perception, the relationship between spatial characteristics and tourists’ perception is fully considered, which plays an important role in the protection and development of public space in historical and cultural blocks. This paper takes the historical and cultural block of Sanhe Ancient Town in Hefei as the research area, focusing on the public space of the block. Through the analysis of the semantic differential method and eye movement legal quantitative analysis, from the angle of psychological perception and visual perception, we carried out an analysis of the historical and cultural block’s public space, and built the double sense of a comprehensive evaluation system of parsing the historical and cultural blocks with the whole situation of public space and the rule of the performance. The results show that: (1) the visual perception preference for spatial elements is in the order of architectural structure > green landscape > architectural decoration > commercial activities > participants > pavement > street furniture > others. (2) There is a significant correlation but not a complete convergence of psychological perception and visual perception. (3) Buildings, structures, and space formats play a key role in creating a sense of space scale, with the former playing a positive role and the latter a negative role. (4) The visual attraction of a green landscape is strong and can improve the visual quality of space. The research found that there are evaluation differences between the visual perception and psychological perception of spatial elements, which are significantly correlated but not fully convergent. Through quantitative analysis and the interpretation of tourists’ perception from different perspectives, we can take relevant improvement and optimization measures for many deficiencies of public space in historical and cultural districts. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

19 pages, 10037 KB

Open AccessArticle

Affordable Robotic Mobile Mapping System Based on Lidar with Additional Rotating Planar Reflector

by Janusz Będkowski and Michał Pełka

Sensors 2023, 23(3), 1551; https://doi.org/10.3390/s23031551 - 31 Jan 2023

Cited by 5 | Viewed by 4633

Abstract

This paper describes an affordable robotic mobile 3D mapping system. It is built with Livox Mid–40 lidar with a conic field of view extended by a custom rotating planar reflector. This 3D sensor is compared with the more expensive Velodyne VLP 16 lidar. [...] Read more.

This paper describes an affordable robotic mobile 3D mapping system. It is built with Livox Mid–40 lidar with a conic field of view extended by a custom rotating planar reflector. This 3D sensor is compared with the more expensive Velodyne VLP 16 lidar. It is shown that the proposed sensor reaches satisfactory accuracy and range. Furthermore, it is able to preserve the metric accuracy and non–repetitive scanning pattern of the unmodified sensor. Due to preserving the non–repetitive scan pattern, our system is capable of covering the entire field of view of 38.4 × 360 degrees, which is an added value of conducted research. We show the calibration method, mechanical design, and synchronization details that are necessary to replicate our system. This work extends the applicability of solid–state lidars since the field of view can be reshaped with minimal loss of measurement properties. The solution was part of a system that was evaluated during the 3rd European Robotics Hackathon in the Zwentendorf Nuclear Power Plant. The experimental part of the paper demonstrates that our affordable robotic mobile 3D mapping system is capable of providing 3D maps of a nuclear facility that are comparable to the more expensive solution. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

12 pages, 3518 KB

Open AccessArticle

BIM Style Restoration Based on Image Retrieval and Object Location Using Convolutional Neural Network

by Yalong Yang, Yuanhang Wang, Xiaoping Zhou, Liangliang Su and Qizhi Hu

Buildings 2022, 12(12), 2047; https://doi.org/10.3390/buildings12122047 - 22 Nov 2022

Cited by 4 | Viewed by 2611

Abstract

BIM is one of the main technical ways to realize building informatization, and the model’s texture is essential to its style design during BIM construction. However, the texture maps provided by mainstream BIM software are not realistic enough and monotonous to meet the [...] Read more.

BIM is one of the main technical ways to realize building informatization, and the model’s texture is essential to its style design during BIM construction. However, the texture maps provided by mainstream BIM software are not realistic enough and monotonous to meet the actual needs of users for the model style. Therefore, an interior furniture BIM style restoration method was proposed based on image retrieval and object location using convolutional neural network. First, two types of furniture images, namely grayscale contour images from BIM software and real images from the Internet, were collected to train the following network model. Second, a multi-feature weighted fusion neural network model based on an attention mechanism (AM-rVGG) was proposed, which focused on the structural information of furniture images to retrieve the most similar real image, and then some furniture image patches from the retrieved one were generated with object location and random cropping techniques as the candidate texture maps of the furniture BIM. Finally, the candidate ones were fed back into the BIM software to realize the restoration of the furniture BIM style. The experimental results showed that the average retrieval accuracy of the proposed network model was 83.1%, and the obtained texture maps could effectively restore the real style of the furniture BIM. This work provides a new idea for restoring the realism in other BIM. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

26 pages, 10353 KB

Open AccessArticle

INV-Flow2PoseNet: Light-Resistant Rigid Object Pose from Optical Flow of RGB-D Images Using Images, Normals and Vertices

by Torben Fetzer, Gerd Reis and Didier Stricker

Sensors 2022, 22(22), 8798; https://doi.org/10.3390/s22228798 - 14 Nov 2022

Viewed by 2710

Abstract

This paper presents a novel architecture for simultaneous estimation of highly accurate optical flows and rigid scene transformations for difficult scenarios where the brightness assumption is violated by strong shading changes. In the case of rotating objects or moving light sources, such as [...] Read more.

This paper presents a novel architecture for simultaneous estimation of highly accurate optical flows and rigid scene transformations for difficult scenarios where the brightness assumption is violated by strong shading changes. In the case of rotating objects or moving light sources, such as those encountered for driving cars in the dark, the scene appearance often changes significantly from one view to the next. Unfortunately, standard methods for calculating optical flows or poses are based on the expectation that the appearance of features in the scene remains constant between views. These methods may fail frequently in the investigated cases. The presented method fuses texture and geometry information by combining image, vertex and normal data to compute an illumination-invariant optical flow. By using a coarse-to-fine strategy, globally anchored optical flows are learned, reducing the impact of erroneous shading-based pseudo-correspondences. Based on the learned optical flows, a second architecture is proposed that predicts robust rigid transformations from the warped vertex and normal maps. Particular attention is paid to situations with strong rotations, which often cause such shading changes. Therefore, a 3-step procedure is proposed that profitably exploits correlations between the normals and vertices. The method has been evaluated on a newly created dataset containing both synthetic and real data with strong rotations and shading effects. These data represent the typical use case in 3D reconstruction, where the object often rotates in large steps between the partial reconstructions. Additionally, we apply the method to the well-known Kitti Odometry dataset. Even if, due to fulfillment of the brightness assumption, this is not the typical use case of the method, the applicability to standard situations and the relation to other methods is therefore established. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

21 pages, 4502 KB

Open AccessArticle

A Laboratory and Field Universal Estimation Method for Tire–Pavement Interaction Noise (TPIN) Based on 3D Image Technology

by Hui Wang, Xun Zhang and Shengchuan Jiang

Sustainability 2022, 14(19), 12066; https://doi.org/10.3390/su141912066 - 23 Sep 2022

Cited by 81 | Viewed by 3812

Abstract

Tire–pavement interaction noise (TPIN) accounts mainly for traffic noise, a sensitive parameter affecting the eco-based maintenance decision outcome. Consistent methods or metrics for lab and field pavement texture evaluation are lacking. TPIN prediction based on pavement structural and material characteristics is not yet [...] Read more.

Tire–pavement interaction noise (TPIN) accounts mainly for traffic noise, a sensitive parameter affecting the eco-based maintenance decision outcome. Consistent methods or metrics for lab and field pavement texture evaluation are lacking. TPIN prediction based on pavement structural and material characteristics is not yet available. This paper used 3D point cloud data scanned from specimens and road pavement to conduct correlation and clustering analysis based on representative 3D texture metrics. We conducted an influence analysis to exclude macroscope pavement detection metrics and macro deformation metrics’ effects (international roughness index, IRI, and mean profile depth, MPD). The cluster analysis results verified the feasibility of texture metrics for evaluating lab and field pavement wear, differentiating the wear states. TPIN prediction accuracy based on texture indicators was high (R² = 0.9958), implying that it is feasible to predict the TPIN level using 3D texture metrics. The effects of pavement texture changes on TPIN can be simulated by laboratory wear. Full article

(This article belongs to the Topic 3D Computer Vision and Smart Building and City)

► Show Figures

Figure 1

Topic Menu

Topic Editors

3D Computer Vision and Smart Building and City

Topic Information

Keywords

Participating Journals

Published Papers (16 papers)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Topic Menu

Topic Editors

3D Computer Vision and Smart Building and City

Topic Information

Keywords

Participating Journals

Related Topics

Published Papers (16 papers)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI