Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (21)

Search Parameters:
Keywords = signed distance function (SDF)

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
20 pages, 11095 KB  
Article
SRNN: Surface Reconstruction from Sparse Point Clouds with Nearest Neighbor Prior
by Haodong Li, Ying Wang and Xi Zhao
Appl. Sci. 2026, 16(3), 1210; https://doi.org/10.3390/app16031210 - 24 Jan 2026
Viewed by 581
Abstract
Surface reconstruction from 3D point clouds has a wide range of applications. In this paper, we focus on the reconstruction from raw, sparse point clouds. Although some existing methods work on this topic, the results often suffer from geometric defects. To solve this [...] Read more.
Surface reconstruction from 3D point clouds has a wide range of applications. In this paper, we focus on the reconstruction from raw, sparse point clouds. Although some existing methods work on this topic, the results often suffer from geometric defects. To solve this problem, we propose a novel method that optimizes a neural network (referred to as signed distance function) to fit the Signed Distance Field (SDF) from sparse point clouds. The signed distance function is optimized by projecting query points to its iso-surface accordingly. Our key idea is to encourage both the direction and distance of projection to be correct through the supervision provided by a nearest neighbor prior. In addition, we mitigate the error propagated from the prior function by augmenting the low-frequency components in the input. In our implementation, the nearest neighbor prior is trained with a large-scale local geometry dataset, and the positional encoding with a specified spectrum is used as a regularization for the optimization process. Experiments on the ShapeNetCore dataset demonstrate that our method achieves better accuracy than SDF-based methods while preserving smoothness. Full article
(This article belongs to the Special Issue Technical Advances in 3D Reconstruction—2nd Edition)
Show Figures

Figure 1

24 pages, 11080 KB  
Article
Graph-Based and Multi-Stage Constraints for Hand–Object Reconstruction
by Wenrun Wang, Jianwu Dang, Yangping Wang and Hui Yu
Sensors 2026, 26(2), 535; https://doi.org/10.3390/s26020535 - 13 Jan 2026
Viewed by 401
Abstract
Reconstructing hand and object shapes from a single view during interaction remains challenging due to severe mutual occlusion and the need for high physical plausibility. To address this, we propose a novel framework for hand–object interaction reconstruction based on holistic, multi-stage collaborative optimization. [...] Read more.
Reconstructing hand and object shapes from a single view during interaction remains challenging due to severe mutual occlusion and the need for high physical plausibility. To address this, we propose a novel framework for hand–object interaction reconstruction based on holistic, multi-stage collaborative optimization. Unlike methods that process hands and objects independently or apply constraints as late-stage post-processing, our model progressively enforces physical consistency and geometric accuracy throughout the entire reconstruction pipeline. Our network takes an RGB-D image as input. An adaptive feature fusion module first combines color and depth information to improve robustness against sensing uncertainties. We then introduce structural priors for 2D pose estimation and leverage texture cues to refine depth-based 3D pose initialization. Central to our approach is the iterative application of a dense mutual attention mechanism during sparse-to-dense mesh recovery, which dynamically captures interaction dependencies while refining geometry. Finally, we use a Signed Distance Function (SDF) representation explicitly designed for contact surfaces to prevent interpenetration and ensure physically plausible results. Through comprehensive experiments, our method demonstrates significant improvements on the challenging ObMan and DexYCB benchmarks, outperforming state-of-the-art techniques. Specifically, on the ObMan dataset, our approach achieves hand CDh and object CDo metrics of 0.077 cm2 and 0.483 cm2, respectively. Similarly, on the DexYCB dataset, it attains hand CDh and object CDo values of 0.251 cm2 and 1.127 cm2, respectively. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

21 pages, 4137 KB  
Article
Physics-Informed Neural Networks Simulation and Validation of Airflows in Three-Dimensional Upper Respiratory Tracts
by Mohamed Talaat, Xiuhua Si, Haibo Dong and Jinxiang Xi
Fluids 2025, 10(12), 306; https://doi.org/10.3390/fluids10120306 - 25 Nov 2025
Cited by 3 | Viewed by 3045
Abstract
Accurate and efficient simulation of airflows in human airways is critical for advancing the understanding of respiratory physiology, disease diagnostics, and inhalation drug delivery. Traditional computational fluid dynamics (CFD) provides detailed predictions but is often mesh-sensitive and computationally expensive for complex geometries. In [...] Read more.
Accurate and efficient simulation of airflows in human airways is critical for advancing the understanding of respiratory physiology, disease diagnostics, and inhalation drug delivery. Traditional computational fluid dynamics (CFD) provides detailed predictions but is often mesh-sensitive and computationally expensive for complex geometries. In this study, we explored the usage of physics-informed neural networks (PINNs) to simulate airflows in three geometries with increasing complexity: a duct, a simplified mouth–lung model, and a patient-specific upper airway. Key procedures to implement PINN training and testing were presented, including geometry preparation/scaling, boundary/constraint specification, training diagnostics, nondimensionalization, and inference mapping. Both the laminar PINN and SDF–mixing-length PINN were tested. PINN predictions were validated against high-fidelity CFD simulations to assess accuracy, efficiency, and generalization. The results demonstrated that nondimensionalization of the governing equations was essential to ensure training accuracy for respiratory flows at 1 m/s and above. Hessian-matrix-based diagnosis revealed a quick increase in training challenges with flow speed and geometrical complexity. Both the laminar and SDF–mixing-length PINNs achieved comparable accuracy to corresponding CFD predictions in the duct and simplified mouth–lung geometry. However, only the SDF–mixing-length PINN adequately captured flow details unique to respiratory morphology, such as obstruction-induced flow diversion, recirculating flows, and laryngeal jet decay. The results of this study highlight the potential of PINNs as a flexible alternative to conventional CFD for modeling respiratory airflows, with adaptability to patient-specific geometries and promising integration with static or real-time imaging (e.g., 4D CT/MRI). Full article
(This article belongs to the Special Issue Respiratory Flows)
Show Figures

Figure 1

51 pages, 9631 KB  
Review
Review of Physics-Informed Neural Networks: Challenges in Loss Function Design and Geometric Integration
by Sergiy Plankovskyy, Yevgen Tsegelnyk, Nataliia Shyshko, Igor Litvinchev, Tetyana Romanova and José Manuel Velarde Cantú
Mathematics 2025, 13(20), 3289; https://doi.org/10.3390/math13203289 - 15 Oct 2025
Cited by 13 | Viewed by 13296
Abstract
Physics-Informed Neural Networks (PINNs) represent a transformative approach to solving partial differential equation (PDE)-based boundary value problems by embedding physical laws into the learning process, addressing challenges such as non-physical solutions and data scarcity, which are inherent in traditional neural networks. This review [...] Read more.
Physics-Informed Neural Networks (PINNs) represent a transformative approach to solving partial differential equation (PDE)-based boundary value problems by embedding physical laws into the learning process, addressing challenges such as non-physical solutions and data scarcity, which are inherent in traditional neural networks. This review analyzes critical challenges in PINN development, focusing on loss function design, geometric information integration, and their application in engineering modeling. We explore advanced strategies for constructing loss functions—including adaptive weighting, energy-based, and variational formulations—that enhance optimization stability and ensure physical consistency across multiscale and multiphysics problems. We emphasize geometry-aware learning through analytical representations—signed distance functions (SDFs), phi-functions, and R-functions—with complementary strengths: SDFs enable precise local boundary enforcement, whereas phi/R capture global multi-body constraints in irregular domains; in practice, hybrid use is effective for engineering problems. We also examine adaptive collocation sampling, domain decomposition, and hard-constraint mechanisms for boundary conditions to improve convergence and accuracy and discuss integration with commercial CAE via hybrid schemes that couple PINNs with classical solvers (e.g., FEM) to boost efficiency and reliability. Finally, we consider emerging paradigms—Physics-Informed Kolmogorov–Arnold Networks (PIKANs) and operator-learning frameworks (DeepONet, Fourier Neural Operator)—and outline open directions in standardized benchmarks, computational scalability, and multiphysics/multi-fidelity modeling for digital twins and design optimization. Full article
Show Figures

Figure 1

25 pages, 2435 KB  
Article
FAIS: Fully Automatic Indoor Surveying Framework of Terrestrial Laser Scanning Point Clouds in Large-Scale Indoor Environments
by Wenhao Li, Tong Jia, Shiyi Guo, Yunchun Zhou, Yizhe Liu and Hao Wang
Remote Sens. 2025, 17(16), 2863; https://doi.org/10.3390/rs17162863 - 17 Aug 2025
Viewed by 1503
Abstract
This article presents a novel fully automatic indoor surveying (FAIS) framework for large-scale indoor environments using a Terrestrial Laser Scanning (TLS) hardware system. Traditional methods for indoor surveying are labor-intensive and time-consuming, as they rely on manually positioning scanners for data capture and [...] Read more.
This article presents a novel fully automatic indoor surveying (FAIS) framework for large-scale indoor environments using a Terrestrial Laser Scanning (TLS) hardware system. Traditional methods for indoor surveying are labor-intensive and time-consuming, as they rely on manually positioning scanners for data capture and placing markers for registration. What is more, positioning scanners manually may cause uneven scanning or rescanning, including unstructured areas specifically. To ensure full coverage of the scene, we precisely obtain the number and location of scan stations through the Signed Distance Function (SDF) based method. Meanwhile, we propose an efficient large-scale dense point cloud registration method without markers. The proposed framework is adapted to environments where the scanner operates on a flat surface, such as office spaces, theater stage spaces, urban areas, and some cultural heritage scenic areas. Experiments demonstrate that the proposed method decreases computation time and obtains a more complete point cloud. Full article
Show Figures

Figure 1

21 pages, 3828 KB  
Article
High-Precision 3D Reconstruction in Complex Scenes via Implicit Surface Reconstruction Enhanced by Multi-Sensor Data Fusion
by Quanchen Zhou, Jiabao Zuo, Wenhao Kang and Mingwu Ren
Sensors 2025, 25(9), 2820; https://doi.org/10.3390/s25092820 - 30 Apr 2025
Cited by 9 | Viewed by 3882
Abstract
In this paper, we investigate implicit surface reconstruction methods based on deep learning, enhanced by multi-sensor data fusion, to improve the accuracy of 3D reconstruction in complex scenes. Existing single-sensor approaches often struggle with occlusions and incomplete observations. By fusing complementary information from [...] Read more.
In this paper, we investigate implicit surface reconstruction methods based on deep learning, enhanced by multi-sensor data fusion, to improve the accuracy of 3D reconstruction in complex scenes. Existing single-sensor approaches often struggle with occlusions and incomplete observations. By fusing complementary information from multiple sensors (e.g., multiple cameras or a combination of cameras and depth sensors), our proposed framework alleviates the issue of missing or partial data and further increases reconstruction fidelity. We introduce a novel deep neural network that learns a continuous signed distance function (SDF) for scene geometry, conditioned on fused multi-sensor feature representations. The network seamlessly merges multi-modal data into a unified implicit representation, enabling precise and watertight surface reconstruction. We conduct extensive experiments on 3D datasets, demonstrating superior accuracy compared to single-sensor baselines and classical fusion methods. Quantitative and qualitative results reveal that multi-sensor fusion significantly improves reconstruction completeness and geometric detail, while our implicit approach provides smooth, high-resolution surfaces. Additionally, we analyze the influence of the number and diversity of sensors on reconstruction quality, the model’s ability to generalize to unseen data, and computational considerations. Our work highlights the potential of coupling deep implicit representations with multi-sensor fusion to achieve robust 3D reconstruction in challenging real-world conditions. Full article
(This article belongs to the Section Sensing and Imaging)
Show Figures

Figure 1

19 pages, 2793 KB  
Article
Neuronal Mesh Reconstruction from Image Stacks Using Implicit Neural Representations
by Xiaoqiang Zhu, Yanhua Zhao and Lihua You
Mathematics 2025, 13(8), 1276; https://doi.org/10.3390/math13081276 - 12 Apr 2025
Cited by 1 | Viewed by 3255
Abstract
Reconstructing neuronal morphology from microscopy image stacks is essential for understanding brain function and behavior. While existing methods are capable of tracking neuronal tree structures and creating membrane surface meshes, they often lack seamless processing pipelines and suffer from stitching artifacts and reconstruction [...] Read more.
Reconstructing neuronal morphology from microscopy image stacks is essential for understanding brain function and behavior. While existing methods are capable of tracking neuronal tree structures and creating membrane surface meshes, they often lack seamless processing pipelines and suffer from stitching artifacts and reconstruction inconsistencies. In this study, we propose a new approach utilizing implicit neural representation to directly extract neuronal isosurfaces from raw image stacks by modeling signed distance functions (SDFs) with multi-layer perceptrons (MLPs). Our method accurately reconstructs the tubular, tree-like topology of neurons in complex spatial configurations, yielding highly precise neuronal membrane surface meshes. Extensive quantitative and qualitative evaluations across multiple datasets demonstrate the superior reliability of our approach compared to existing methods. The proposed method achieves a volumetric reconstruction accuracy of up to 98.2% and a volumetric IoU of 0.90. Full article
(This article belongs to the Special Issue Mathematical Applications in Computer Graphics)
Show Figures

Figure 1

17 pages, 14630 KB  
Article
Three-Dimensional Shape Reconstruction from Digital Freehand Design Sketching Based on Deep Learning Techniques
by Ding Zhou, Guohua Wei and Xiaojun Yuan
Appl. Sci. 2024, 14(24), 11717; https://doi.org/10.3390/app142411717 - 16 Dec 2024
Cited by 2 | Viewed by 3243
Abstract
This paper proposes a method for 3D reconstruction from Freehand Design Sketching (FDS) in architecture and industrial design. The implementation begins by extracting features from the FDS using the self-supervised learning model DINO, followed by the continuous Signed Distance Function (SDF) regression as [...] Read more.
This paper proposes a method for 3D reconstruction from Freehand Design Sketching (FDS) in architecture and industrial design. The implementation begins by extracting features from the FDS using the self-supervised learning model DINO, followed by the continuous Signed Distance Function (SDF) regression as an implicit representation through a Multi-Layer Perceptron network. Taking eyeglass frames as an example, the 2D contour and freehand sketch optimize the alignment by their geometrical similarity while exploiting symmetry to improve reconstruction accuracy. Experiments demonstrate that this method can effectively reconstruct high-quality 3D models of eyeglass frames from 2D freehand sketches, outperforming existing deep learning-based 3D reconstruction methods. This research offers practical information for understanding 3D modeling methodology for FDS, triggering multiple modes of design creativity and efficient scheme adjustments in industrial or architectural conceptual design. In conclusion, this novel approach integrates self-supervised learning and geometric optimization to achieve unprecedented fidelity in 3D reconstruction from FDS, setting a new benchmark for AI-driven design processes in industrial and architectural applications. Full article
Show Figures

Figure 1

19 pages, 50560 KB  
Article
Garment Recognition and Reconstruction Using Object Simultaneous Localization and Mapping
by Yilin Zhang and Koichi Hashimoto
Sensors 2024, 24(23), 7622; https://doi.org/10.3390/s24237622 - 28 Nov 2024
Cited by 1 | Viewed by 1839
Abstract
The integration of robotics in the garment industry remains relatively limited, primarily due to the challenges in the highly deformable nature of garments. The objective of this study is thus to explore a vision-based garment recognition and environment reconstruction model to facilitate the [...] Read more.
The integration of robotics in the garment industry remains relatively limited, primarily due to the challenges in the highly deformable nature of garments. The objective of this study is thus to explore a vision-based garment recognition and environment reconstruction model to facilitate the application of robots in garment processing. Object SLAM (Simultaneous Localization and Mapping) was employed as the core methodology for real-time mapping and tracking. To enable garment detection and reconstruction, two datasets were created: a 2D garment image dataset for instance segmentation model training and a synthetic 3D mesh garment dataset to enhance the DeepSDF (Signed Distance Function) model for generative garment reconstruction. In addition to garment detection, the SLAM system was extended to identify and reconstruct environmental planes, using the CAPE (Cylinder and Plane Extraction) model. The implementation was tested using an Intel Realsense® camera, demonstrating the feasibility of simultaneous garment and plane detection and reconstruction. This study shows improved performance in garment recognition with the 2D instance segmentation models and an enhanced understanding of garment shapes and structures with the DeepSDF model. The integration of CAPE plane detection with SLAM allows for more robust environment reconstruction that is capable of handling multiple objects. The implementation and evaluation of the system highlight its potential for enhancing automation and efficiency in the garment processing industry. Full article
(This article belongs to the Special Issue Advances in Sensing, Control and Path Planning for Robotic Systems)
Show Figures

Figure 1

17 pages, 12577 KB  
Article
Neural Surfel Reconstruction: Addressing Loop Closure Challenges in Large-Scale 3D Neural Scene Mapping
by Jiadi Cui, Jiajie Zhang, Laurent Kneip and Sören Schwertfeger
Sensors 2024, 24(21), 6919; https://doi.org/10.3390/s24216919 - 28 Oct 2024
Cited by 2 | Viewed by 2639
Abstract
Efficiently reconstructing complex and intricate surfaces at scale remains a significant challenge in 3D surface reconstruction. Recently, implicit neural representations have become a popular topic in 3D surface reconstruction. However, how to handle loop closure and bundle adjustment is a tricky problem for [...] Read more.
Efficiently reconstructing complex and intricate surfaces at scale remains a significant challenge in 3D surface reconstruction. Recently, implicit neural representations have become a popular topic in 3D surface reconstruction. However, how to handle loop closure and bundle adjustment is a tricky problem for neural methods, because they learn the neural parameters globally. We present an algorithm that leverages the concept of surfels and expands relevant definitions to address such challenges. By integrating neural descriptors with surfels and framing surfel association as a deformation graph optimization problem, our method is able to effectively perform loop closure detection and loop correction in challenging scenarios. Furthermore, the surfel-level representation simplifies the complexity of 3D neural reconstruction. Meanwhile, the binding of neural descriptors to corresponding surfels produces a dense volumetric signed distance function (SDF), enabling the mesh reconstruction. Our approach demonstrates a significant improvement in reconstruction accuracy, reducing the average error by 16.9% compared to previous methods, while also generating modeling files that are up to 90% smaller than those produced by traditional methods. Full article
(This article belongs to the Section Remote Sensors)
Show Figures

Figure 1

20 pages, 4626 KB  
Article
Three-Dimensional Reconstruction of Indoor Scenes Based on Implicit Neural Representation
by Zhaoji Lin, Yutao Huang and Li Yao
J. Imaging 2024, 10(9), 231; https://doi.org/10.3390/jimaging10090231 - 16 Sep 2024
Viewed by 3546
Abstract
Reconstructing 3D indoor scenes from 2D images has always been an important task in computer vision and graphics applications. For indoor scenes, traditional 3D reconstruction methods have problems such as missing surface details, poor reconstruction of large plane textures and uneven illumination areas, [...] Read more.
Reconstructing 3D indoor scenes from 2D images has always been an important task in computer vision and graphics applications. For indoor scenes, traditional 3D reconstruction methods have problems such as missing surface details, poor reconstruction of large plane textures and uneven illumination areas, and many wrongly reconstructed floating debris noises in the reconstructed models. This paper proposes a 3D reconstruction method for indoor scenes that combines neural radiation field (NeRFs) and signed distance function (SDF) implicit expressions. The volume density of the NeRF is used to provide geometric information for the SDF field, and the learning of geometric shapes and surfaces is strengthened by adding an adaptive normal prior optimization learning process. It not only preserves the high-quality geometric information of the NeRF, but also uses the SDF to generate an explicit mesh with a smooth surface, significantly improving the reconstruction quality of large plane textures and uneven illumination areas in indoor scenes. At the same time, a new regularization term is designed to constrain the weight distribution, making it an ideal unimodal compact distribution, thereby alleviating the problem of uneven density distribution and achieving the effect of floating debris removal in the final model. Experiments show that the 3D reconstruction effect of this paper on ScanNet, Hypersim, and Replica datasets outperforms the state-of-the-art methods. Full article
(This article belongs to the Special Issue Geometry Reconstruction from Images (2nd Edition))
Show Figures

Figure 1

18 pages, 8632 KB  
Article
RobotSDF: Implicit Morphology Modeling for the Robotic Arm
by Yusheng Yang, Jiajia Liu, Hongpeng Zhou, Afimbo Reuben Kwabena, Yuqiao Zhong and Yangmin Xie
Sensors 2024, 24(16), 5248; https://doi.org/10.3390/s24165248 - 14 Aug 2024
Cited by 4 | Viewed by 3985
Abstract
The expression of robot arm morphology is a critical foundation for achieving effective motion planning and collision avoidance in robotic systems. Traditional geometry-based approaches usually suffer from the contradiction between the high demand for computing resources for fine expression and the insufficient detail [...] Read more.
The expression of robot arm morphology is a critical foundation for achieving effective motion planning and collision avoidance in robotic systems. Traditional geometry-based approaches usually suffer from the contradiction between the high demand for computing resources for fine expression and the insufficient detail expression caused by the pursuit of efficiency. The signed distance function addresses these drawbacks due to its ability to handle complex and arbitrary shapes and lower computational requirements. However, conventional robotic morphology methods based on the signed distance function often face challenges when the robot moves dynamically, since robots with different postures are modeled as independent individuals but the postures of robots are infinite. In this paper, we introduce RobotSDF, an implicit morphology modeling approach that can express the robot shape of arbitrary posture precisely. Instead of depicting a whole model of the robot arm, RobotSDF models the robot morphology as integrated implicit joint models driven by joint configurations. In this approach, the dynamic shape change process of the robot is converted into the coordinate transformations of query points within each joint’s coordinate system. Experimental results with the Elfin robot demonstrate that RobotSDF can accurately depict robot shapes across different postures up to the millimeter level, which exhibits 38.65% and 66.24% improvement over the Neural-JSDF and configuration space distance field algorithms, respectively, in representing robot morphology. We further verified the efficiency of RobotSDF through collision avoidance in both simulation and actual human–robot collaboration experiments. Full article
(This article belongs to the Section Sensors and Robotics)
Show Figures

Figure 1

23 pages, 12227 KB  
Article
3D Reconstruction of Ancient Buildings Using UAV Images and Neural Radiation Field with Depth Supervision
by Yingwei Ge, Bingxuan Guo, Peishuai Zha, San Jiang, Ziyu Jiang and Demin Li
Remote Sens. 2024, 16(3), 473; https://doi.org/10.3390/rs16030473 - 25 Jan 2024
Cited by 26 | Viewed by 8961
Abstract
The 3D reconstruction of ancient buildings through inclined photogrammetry finds a wide range of applications in surveying, visualization and heritage conservation. Unlike indoor objects, reconstructing ancient buildings presents unique challenges, including the slow speed of 3D reconstruction using traditional methods, the complex textures [...] Read more.
The 3D reconstruction of ancient buildings through inclined photogrammetry finds a wide range of applications in surveying, visualization and heritage conservation. Unlike indoor objects, reconstructing ancient buildings presents unique challenges, including the slow speed of 3D reconstruction using traditional methods, the complex textures of ancient structures and geometric issues caused by repeated textures. Additionally, there is a hash conflict problem when rendering outdoor scenes using neural radiation fields. To address these challenges, this paper proposes a 3D reconstruction method based on depth-supervised neural radiation fields. To enhance the representation of the geometric neural network, the addition of a truncated signed distance function (TSDF) supplements the existing signed distance function (SDF). Furthermore, the neural network’s training is supervised using depth information, leading to improved geometric accuracy in the reconstruction model through depth data obtained from sparse point clouds. This study also introduces a progressive training strategy to mitigate hash conflicts, allowing the hash table to express important details more effectively while reducing feature overlap. The experimental results demonstrate that our method, under the same number of iterations, produces images with clearer structural details, resulting in an average 15% increase in the Peak Signal-to-Noise Ratio (PSNR) value and a 10% increase in the Structural Similarity Index Measure (SSIM) value. Moreover, our reconstruction model produces higher-quality surface models, enabling the fast and highly geometrically accurate 3D reconstruction of ancient buildings. Full article
Show Figures

Graphical abstract

20 pages, 3353 KB  
Article
ISHS-Net: Single-View 3D Reconstruction by Fusing Features of Image and Shape Hierarchical Structures
by Guoqing Gao, Liang Yang, Quan Zhang, Chongmin Wang, Hua Bao and Changhui Rao
Remote Sens. 2023, 15(23), 5449; https://doi.org/10.3390/rs15235449 - 22 Nov 2023
Cited by 4 | Viewed by 3062
Abstract
The reconstruction of 3D shapes from a single view has been a longstanding challenge. Previous methods have primarily focused on learning either geometric features that depict overall shape contours but are insufficient for occluded regions, local features that capture details but cannot represent [...] Read more.
The reconstruction of 3D shapes from a single view has been a longstanding challenge. Previous methods have primarily focused on learning either geometric features that depict overall shape contours but are insufficient for occluded regions, local features that capture details but cannot represent the complete structure, or structural features that encode part relationships but require predefined semantics. However, the fusion of geometric, local, and structural features has been lacking, leading to inaccurate reconstruction of shapes with occlusions or novel compositions. To address this issue, we propose a two-stage approach for achieving 3D shape reconstruction. In the first stage, we encode the hierarchical structure features of the 3D shape using an encoder-decoder network. In the second stage, we enhance the hierarchical structure features by fusing them with global and point features and feed the enhanced features into a signed distance function (SDF) prediction network to obtain rough SDF values. Using the camera pose, we project arbitrary 3D points in space onto different depth feature maps of the CNN and obtain their corresponding positions. Then, we concatenate the features of these corresponding positions together to form local features. These local features are also fed into the SDF prediction network to obtain fine-grained SDF values. By fusing the two sets of SDF values, we improve the accuracy of the model and enable it to reconstruct other object types with higher quality. Comparative experiments demonstrate that the proposed method outperforms state-of-the-art approaches in terms of accuracy. Full article
Show Figures

Figure 1

20 pages, 11594 KB  
Article
Sat-Mesh: Learning Neural Implicit Surfaces for Multi-View Satellite Reconstruction
by Yingjie Qu and Fei Deng
Remote Sens. 2023, 15(17), 4297; https://doi.org/10.3390/rs15174297 - 31 Aug 2023
Cited by 16 | Viewed by 5104
Abstract
Automatic reconstruction of surfaces from satellite imagery is a hot topic in computer vision and photogrammetry. State-of-the-art reconstruction methods typically produce 2.5D elevation data. In contrast, we propose a one-stage method directly generating a 3D mesh model from multi-view satellite imagery. We introduce [...] Read more.
Automatic reconstruction of surfaces from satellite imagery is a hot topic in computer vision and photogrammetry. State-of-the-art reconstruction methods typically produce 2.5D elevation data. In contrast, we propose a one-stage method directly generating a 3D mesh model from multi-view satellite imagery. We introduce a novel Sat-Mesh approach for satellite implicit surface reconstruction: We represent the scene as a continuous signed distance function (SDF) and leverage a volume rendering framework to learn the SDF values. To address the challenges posed by lighting variations and inconsistent appearances in satellite imagery, we incorporate a latent vector in the network architecture to encode image appearances. Furthermore, we introduce a multi-view stereo constraint to enhance surface quality. This constraint minimizes the similarity between image patches to optimize the position and orientation of the SDF surface. Experimental results demonstrate that our method achieves superior visual quality and quantitative accuracy in generating mesh models. Moreover, our approach can learn seasonal variations in satellite imagery, resulting in texture mesh models with different and consistent seasonal appearances. Full article
Show Figures

Graphical abstract

Back to TopTop