Search Results (3,309)

Search Parameters:
Keywords = 3d point clouds

25 pages, 11545 KiB  
Article
Workpiece Coordinate System Measurement for a Robotic Timber Joinery Workflow
by Francisco Quitral-Zapata, Rodrigo García-Alvarado, Alejandro Martínez-Rocamora and Luis Felipe González-Böhme
Buildings 2025, 15(15), 2712; https://doi.org/10.3390/buildings15152712 (registering DOI) - 31 Jul 2025
Abstract
Robotic timber joinery demands integrated, adaptive methods to compensate for the inherent dimensional variability of wood. We introduce a seamless robotic workflow to enhance the measurement accuracy of the Workpiece Coordinate System (WCS). The approach leverages a Zivid 3D camera mounted in an eye-in-hand configuration on a KUKA industrial robot. The proposed algorithm applies a geometric method that strategically crops the point cloud and fits planes to the workpiece surfaces to define a reference frame, calculate the corresponding transformation between coordinate systems, and measure the cross-section of the workpiece. This enables reliable toolpath generation by dynamically updating WCS and effectively accommodating real-world geometric deviations in timber components. The workflow includes camera-to-robot calibration, point cloud acquisition, robust detection of workpiece features, and precise alignment of the WCS. Experimental validation confirms that the proposed method is efficient and improves milling accuracy. By dynamically identifying the workpiece geometry, the system successfully addresses challenges posed by irregular timber shapes, resulting in higher accuracy for timber joints. This method contributes to advanced manufacturing strategies in robotic timber construction and supports the processing of diverse workpiece geometries, with potential applications in civil engineering for building construction through the precise fabrication of structural timber components. Full article
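The frame-definition step described in this abstract can be illustrated with a minimal sketch. It assumes the plane fitting has already produced two roughly perpendicular face normals and a chosen origin; the function names (`workpiece_frame`, `to_workpiece`) are hypothetical and are not taken from the article.

```python
import math

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def _normalize(v):
    n = math.sqrt(_dot(v, v))
    return tuple(c / n for c in v)

def workpiece_frame(top_normal, side_normal):
    """Return orthonormal axes (x, y, z) built from two fitted face normals.

    Hypothetical helper: z follows the top-face normal, y is the side-face
    normal with its z component removed (Gram-Schmidt), and x = y cross z
    completes a right-handed frame.
    """
    z = _normalize(top_normal)
    d = _dot(side_normal, z)
    y = _normalize(tuple(s - d * c for s, c in zip(side_normal, z)))
    x = _cross(y, z)
    return x, y, z

def to_workpiece(point, origin, axes):
    """Express a point given in the camera/robot frame in the workpiece frame."""
    rel = tuple(p - o for p, o in zip(point, origin))
    return tuple(_dot(rel, axis) for axis in axes)

# Example: a slightly tilted side normal is orthogonalised against the top normal.
axes = workpiece_frame((0.0, 0.0, 2.0), (0.0, 1.0, 0.1))
local = to_workpiece((1.0, 2.0, 3.0), (1.0, 1.0, 1.0), axes)
```

Orthogonalising the second normal against the first is one simple way to tolerate faces that are not exactly perpendicular, which is plausible for timber but is an assumption here rather than the authors' stated procedure.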
(This article belongs to the Special Issue Architectural Design Supported by Information Technology: 2nd Edition)

15 pages, 2290 KiB  
Article
Research on Automatic Detection Method of Coil in Unmanned Reservoir Area Based on LiDAR
by Yang Liu, Meiqin Liang, Xiaozhan Li, Xuejun Zhang, Junqi Yuan and Dong Xu
Processes 2025, 13(8), 2432; https://doi.org/10.3390/pr13082432 (registering DOI) - 31 Jul 2025
Abstract
The detection of coils in reservoir areas is part of the environmental perception technology of unmanned cranes. In order to improve the perception ability of unmanned cranes to include environmental information in reservoir areas, a method of automatic detection of coils based on two-dimensional LiDAR dynamic scanning is proposed, which realizes the detection of the position and attitude of coils in reservoir areas. This algorithm realizes map reconstruction of 3D point cloud by fusing LiDAR point cloud data and the motion position information of intelligent cranes. Additionally, a processing method based on histogram statistical analysis and 3D normal curvature estimation is proposed to solve the problem of over-segmentation and under-segmentation in 3D point cloud segmentation. Finally, for segmented point cloud clusters, coil models are fitted by the RANSAC method to identify their position and attitude. The accuracy, recall, and F1 score of the detection model are all higher than 0.91, indicating that the model has a good recognition effect. Full article

21 pages, 2245 KiB  
Article
Extraction of Corrosion Damage Features of Serviced Cable Based on Three-Dimensional Point Cloud Technology
by Tong Zhu, Shoushan Cheng, Haifang He, Kun Feng and Jinran Zhu
Materials 2025, 18(15), 3611; https://doi.org/10.3390/ma18153611 (registering DOI) - 31 Jul 2025
Abstract
The corrosion of high-strength steel wires is a key factor impacting the durability and reliability of cable-stayed bridges. In this study, the corrosion pit features on a high-strength steel wire, which had been in service for 27 years, were extracted and modeled using three-dimensional point cloud data obtained through 3D surface scanning. The Otsu method was applied for image binarization, and each corrosion pit was geometrically represented as an ellipse. Key pit parameters—including length, width, depth, aspect ratio, and a defect parameter—were statistically analyzed. Results of the Kolmogorov–Smirnov (K–S) test at a 95% confidence level indicated that the directional angle component (θ) did not conform to any known probability distribution. In contrast, the pit width (b) and defect parameter (Φ) followed a generalized extreme value distribution, the aspect ratio (b/a) matched a Beta distribution, and both the pit length (a) and depth (d) were best described by a Gaussian mixture model. The obtained results provide valuable reference for assessing the stress state, in-service performance, and predicted remaining service life of operational stay cables. Full article
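For readers unfamiliar with the Otsu method mentioned in this abstract, the following is a minimal sketch of the generic algorithm on integer intensities. It assumes the scan has already been rasterised into integer values in the range 0 to 255 (for example a depth image); that preprocessing and the function names are assumptions, not details from the article.

```python
def otsu_threshold(values, bins=256):
    """Return the bin index t that maximises between-class variance.

    Values <= t form class 0 and values > t form class 1.
    Assumes every value is an integer in [0, bins).
    """
    hist = [0] * bins
    for v in values:
        hist[v] += 1
    total = len(values)
    sum_all = sum(i * h for i, h in enumerate(hist))

    best_t, best_var = 0, -1.0
    w0 = 0      # number of samples in class 0 so far
    sum0 = 0    # intensity sum of class 0 so far
    for t in range(bins):
        w0 += hist[t]
        sum0 += t * hist[t]
        if w0 == 0:
            continue            # class 0 still empty
        w1 = total - w0
        if w1 == 0:
            break               # class 1 empty from here on
        mu0 = sum0 / w0
        mu1 = (sum_all - sum0) / w1
        between = w0 * w1 * (mu0 - mu1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t

def binarize(values, t):
    """Map each value to 1 if it exceeds the threshold, else 0."""
    return [1 if v > t else 0 for v in values]

# Example: two well-separated intensity groups.
sample = [10] * 50 + [200] * 50
t = otsu_threshold(sample)
mask = binarize(sample, t)
```

Whether pits end up as class 0 or class 1 depends on how depth is encoded in the image, so the polarity of `binarize` would need to be checked against the actual data.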
(This article belongs to the Section Construction and Building Materials)
28 pages, 2174 KiB  
Article
Validating Lava Tube Stability Through Finite Element Analysis of Real-Scene 3D Models
by Jiawang Wang, Zhizhong Kang, Chenming Ye, Haiting Yang and Xiaoman Qi
Electronics 2025, 14(15), 3062; https://doi.org/10.3390/electronics14153062 (registering DOI) - 31 Jul 2025
Abstract
The structural stability of lava tubes is a critical factor for their potential use in lunar base construction. Previous studies could not reflect the details of lava tube boundaries and perform accurate mechanical analysis. To this end, this study proposes a robust method to construct a high-precision, real-scene 3D model based on ground lava tube point cloud data. By employing finite element analysis, this study investigated the impact of real-world cross-sectional geometry, particularly the aspect ratio, on structural stability under surface pressure simulating meteorite impacts. A high-precision 3D reconstruction was achieved using UAV-mounted LiDAR and SLAM-based positioning systems, enabling accurate geometric capture of lava tube profiles. The original point cloud data were processed to extract cross-sections, which were then classified by their aspect ratios for analysis. Experimental results confirmed that the aspect ratio is a significant factor in determining stability. Crucially, unlike the monotonic trends often suggested by idealized models, analysis of real-world geometries revealed that the greatest deformation and structural vulnerability occur in sections with an aspect ratio between 0.5 and 0.6. For small lava tubes buried 3 m deep, the ground pressure they can withstand does not exceed 6 GPa. This process helps identify areas with weaker load-bearing capacity. The analysis demonstrated that a realistic 3D modeling approach provides a more accurate and reliable assessment of lava tube stability. This framework is vital for future evaluations of lunar lava tubes as safe habitats and highlights that complex, real-world geometry can lead to non-intuitive structural weaknesses not predicted by simplified models. Full article

31 pages, 11269 KiB  
Review
Advancements in Semantic Segmentation of 3D Point Clouds for Scene Understanding Using Deep Learning
by Hafsa Benallal, Nadine Abdallah Saab, Hamid Tairi, Ayman Alfalou and Jamal Riffi
Technologies 2025, 13(8), 322; https://doi.org/10.3390/technologies13080322 - 30 Jul 2025
Abstract
Three-dimensional semantic segmentation is a fundamental problem in computer vision with a wide range of applications in autonomous driving, robotics, and urban scene understanding. The task involves assigning semantic labels to each point in a 3D point cloud, a data representation that is inherently unstructured, irregular, and spatially sparse. In recent years, deep learning has become the dominant framework for addressing this task, leading to a broad variety of models and techniques designed to tackle the unique challenges posed by 3D data. This survey presents a comprehensive overview of deep learning methods for 3D semantic segmentation. We organize the literature into a taxonomy that distinguishes between supervised and unsupervised approaches. Supervised methods are further classified into point-based, projection-based, voxel-based, and hybrid architectures, while unsupervised methods include self-supervised learning strategies, generative models, and implicit representation techniques. In addition to presenting and categorizing these approaches, we provide a comparative analysis of their performance on widely used benchmark datasets, discuss key challenges such as generalization, model transferability, and computational efficiency, and examine the limitations of current datasets. The survey concludes by identifying potential directions for future research in this rapidly evolving field. Full article
(This article belongs to the Section Information and Communication Technologies)

16 pages, 5301 KiB  
Article
TSINet: A Semantic and Instance Segmentation Network for 3D Tomato Plant Point Clouds
by Shanshan Ma, Xu Lu and Liang Zhang
Appl. Sci. 2025, 15(15), 8406; https://doi.org/10.3390/app15158406 - 29 Jul 2025
Viewed by 100
Abstract
Accurate organ-level segmentation is essential for achieving high-throughput, non-destructive, and automated plant phenotyping. To address the challenge of intelligent acquisition of phenotypic parameters in tomato plants, we propose TSINet, an end-to-end dual-task segmentation network designed for effective and precise semantic labeling and instance recognition of tomato point clouds, based on the Pheno4D dataset. TSINet adopts an encoder–decoder architecture, where a shared encoder incorporates four Geometry-Aware Adaptive Feature Extraction Blocks (GAFEBs) to effectively capture local structures and geometric relationships in raw point clouds. Two parallel decoder branches are employed to independently decode shared high-level features for the respective segmentation tasks. Additionally, a Dual Attention-Based Feature Enhancement Module (DAFEM) is introduced to further enrich feature representations. The experimental results demonstrate that TSINet achieves superior performance in both semantic and instance segmentation, particularly excelling in challenging categories such as stems and large-scale instances. Specifically, TSINet achieves 97.00% mean precision, 96.17% recall, 96.57% F1-score, and 93.43% IoU in semantic segmentation and 81.54% mPrec, 81.69% mRec, 81.60% mCov, and 86.40% mWCov in instance segmentation. Compared with state-of-the-art methods, TSINet achieves balanced improvements across all metrics, significantly reducing false positives and false negatives while enhancing spatial completeness and segmentation accuracy. Furthermore, we conducted ablation studies and generalization tests to systematically validate the effectiveness of each TSINet component and the overall robustness of the model. This study provides an effective technological approach for high-throughput automated phenotyping of tomato plants, contributing to the advancement of intelligent agricultural management. Full article

22 pages, 5896 KiB  
Article
Point Cloud Generation Method Based on Dual-Prism Scanning with Multi-Parameter Optimization
by Yuanfeng Zhao, Zhen Zheng and Hong Chen
Photonics 2025, 12(8), 764; https://doi.org/10.3390/photonics12080764 - 29 Jul 2025
Viewed by 136
Abstract
This study addresses two critical challenges in biprism-based laser scanning systems: the lack of a comprehensive mathematical framework linking prism parameters to scanning performance, and unresolved theoretical gaps regarding parameter effects on point cloud quality. We propose a multi-parameter optimization method for point cloud generation using dual-prism scanning. By establishing a beam pointing mathematical model, we systematically analyze how prism wedge angles, refractive indices, rotation speed ratios, and placement configurations influence scanning performance, revealing their coupled effects on deflection angles, azimuth control, and coverage. The non-paraxial ray tracing method combined with the Möller–Trumbore algorithm enables efficient point cloud simulation. Experimental results demonstrate that our optimized parameters significantly enhance point cloud density, uniformity, and target feature integrity while overcoming limitations of traditional database construction methods. This work provides both theoretical foundations and practical solutions for high-precision 3D reconstruction in high-speed rendezvous scenarios such as missile-borne laser fuzes, offering advantages in cost-effectiveness and operational reliability. Full article
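The Möller–Trumbore algorithm named in this abstract is a standard ray-triangle intersection test. The sketch below shows the textbook form only; how the article couples it with non-paraxial ray tracing through the prisms is not reproduced, and the function name `ray_triangle` is a hypothetical choice.

```python
def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def ray_triangle(origin, direction, v0, v1, v2, eps=1e-9):
    """Return the ray parameter t of the hit point, or None if there is no hit.

    The hit point is origin + t * direction. Rays parallel to the triangle
    plane and hits behind the origin are reported as misses.
    """
    e1 = _sub(v1, v0)
    e2 = _sub(v2, v0)
    p = _cross(direction, e2)
    det = _dot(e1, p)
    if abs(det) < eps:
        return None                     # ray is parallel to the triangle plane
    inv_det = 1.0 / det
    tvec = _sub(origin, v0)
    u = _dot(tvec, p) * inv_det         # first barycentric coordinate
    if u < 0.0 or u > 1.0:
        return None
    q = _cross(tvec, e1)
    v = _dot(direction, q) * inv_det    # second barycentric coordinate
    if v < 0.0 or u + v > 1.0:
        return None
    t = _dot(e2, q) * inv_det
    return t if t > eps else None

# Example: a ray pointing straight down onto a triangle in the z = 0 plane.
tri = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0))
hit = ray_triangle((0.25, 0.25, 1.0), (0.0, 0.0, -1.0), *tri)
miss = ray_triangle((2.0, 2.0, 1.0), (0.0, 0.0, -1.0), *tri)
```

In a point cloud simulation, each simulated beam direction would be tested against the target mesh and the nearest positive `t` kept as the return for that beam; that usage is an illustration, not a description of the authors' simulator.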

21 pages, 3699 KiB  
Article
Three-Dimensional Extended Target Tracking and Shape Learning Based on Double Fourier Series and Expectation Maximization
by Hongge Mao and Xiaojun Yang
Sensors 2025, 25(15), 4671; https://doi.org/10.3390/s25154671 - 28 Jul 2025
Viewed by 211
Abstract
This paper investigates the problem of tracking targets with unknown but fixed 3D star-convex shapes using point cloud measurements. While existing methods typically model shape parameters as random variables evolving according to predefined prior models, this evolution process is often unknown in practice. We propose a particular approach within the Expectation Conditional Maximization (ECM) framework that circumvents this limitation by treating shape-defining quantities as parameters estimated directly via optimization. The objective is the joint estimation of target kinematics, extent, and orientation in 3D space. Specifically, the 3D shape is modeled using a radial function estimated via double Fourier series (DFS) expansion, and orientation is represented using the compact, singularity-free axis-angle method. The ECM algorithm facilitates this joint estimation: an Unscented Kalman Smoother infers kinematics in the E-step, while the M-step estimates DFS shape parameters and rotation angles by minimizing regularized cost functions, promoting robustness and smoothness. The effectiveness of the proposed algorithm is substantiated through two experimental evaluations. Full article
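The axis-angle orientation representation mentioned in this abstract can be illustrated with Rodrigues' rotation formula. This is a generic sketch of that formula, not the paper's estimator; the function name `rotate_axis_angle` is hypothetical.

```python
import math

def rotate_axis_angle(v, axis, angle):
    """Rotate vector v by `angle` radians about `axis` (right-hand rule).

    Uses Rodrigues' formula:
        v' = v cos(a) + (k x v) sin(a) + k (k . v) (1 - cos(a))
    where k is the unit vector along `axis`.
    """
    norm = math.sqrt(sum(c * c for c in axis))
    k = tuple(c / norm for c in axis)
    cos_a, sin_a = math.cos(angle), math.sin(angle)
    k_dot_v = sum(a * b for a, b in zip(k, v))
    k_cross_v = (k[1] * v[2] - k[2] * v[1],
                 k[2] * v[0] - k[0] * v[2],
                 k[0] * v[1] - k[1] * v[0])
    return tuple(v[i] * cos_a + k_cross_v[i] * sin_a + k[i] * k_dot_v * (1.0 - cos_a)
                 for i in range(3))

# Example: a quarter turn about the z axis maps the x axis onto the y axis.
rotated = rotate_axis_angle((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), math.pi / 2)
```

The representation needs only three numbers (axis scaled by angle) when stored compactly, which is why it is often described as minimal; the sketch keeps axis and angle separate for readability.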

20 pages, 2776 KiB  
Article
Automatic 3D Reconstruction: Mesh Extraction Based on Gaussian Splatting from Romanesque–Mudéjar Churches
by Nelson Montas-Laracuente, Emilio Delgado Martos, Carlos Pesqueira-Calvo, Giovanni Intra Sidola, Ana Maitín, Alberto Nogales and Álvaro José García-Tejedor
Appl. Sci. 2025, 15(15), 8379; https://doi.org/10.3390/app15158379 - 28 Jul 2025
Viewed by 148
Abstract
This research introduces an automated 3D virtual reconstruction system tailored for architectural heritage (AH) applications, contributing to the ongoing paradigm shift from traditional CAD-based workflows to artificial intelligence-driven methodologies. It reviews recent advancements in machine learning and deep learning—particularly neural radiance fields (NeRFs) and its successor, Gaussian splatting (GS)—as state-of-the-art techniques in the domain. The study advocates for replacing point cloud data in heritage building information modeling workflows with image-based inputs, proposing a novel “photo-to-BIM” pipeline. A proof-of-concept system is presented, capable of processing photographs or video footage of ancient ruins—specifically, Romanesque–Mudéjar churches—to automatically generate 3D mesh reconstructions. The system’s performance is assessed using both objective metrics and subjective evaluations of mesh quality. The results confirm the feasibility and promise of image-based reconstruction as a viable alternative to conventional methods. The study successfully developed a system for automated 3D mesh reconstruction of AH from images. It applied GS and Mip-splatting for NeRFs, proving superior in noise reduction for subsequent mesh extraction via surface-aligned Gaussian splatting for efficient 3D mesh reconstruction. This photo-to-mesh pipeline signifies a viable step towards HBIM. Full article

28 pages, 10524 KiB  
Article
Automating Three-Dimensional Cadastral Models of 3D Rights and Buildings Based on the LADM Framework
by Ratri Widyastuti, Deni Suwardhi, Irwan Meilano, Andri Hernandi and Juan Firdaus
ISPRS Int. J. Geo-Inf. 2025, 14(8), 293; https://doi.org/10.3390/ijgi14080293 - 28 Jul 2025
Viewed by 302
Abstract
Before the development of 3D cadastre, cadastral systems were based on 2D representations, which now require transformation or updating. In this context, the first issue is that existing 2D rights are not aligned with recent 3D data acquired using advanced technologies such as Unmanned Aerial Vehicle–Light Detection and Ranging (UAV-LiDAR). The second issue is that point clouds of objects captured by UAV-LiDAR, such as fences and exterior building walls, are often neglected. However, these point cloud objects can be utilized to adjust 2D rights to correspond with recent 3D data and to update 3D building models with a higher level of detail. This research leverages such point cloud objects to automatically generate 3D rights and building models. By combining several algorithms and concepts, namely Iterative Closest Point (ICP), Random Forest (RF), Gaussian Mixture Model (GMM), Region Growing, the Polyfit method, and the orthogonality concept, an automatic workflow for generating 3D cadastral models is developed. The proposed workflow improves the horizontal accuracy of the updated 2D parcels from 1.19 m to 0.612 m. The floor area of the 3D models improves by approximately ±3 m². Furthermore, the resulting 3D building models provide approximately 43% to 57% of the elements required for 3D property valuation. The case study is located in Indonesia.

17 pages, 8512 KiB  
Article
Interactive Holographic Display System Based on Emotional Adaptability and CCNN-PCG
by Yu Zhao, Zhong Xu, Ting-Yu Zhang, Meng Xie, Bing Han and Ye Liu
Electronics 2025, 14(15), 2981; https://doi.org/10.3390/electronics14152981 - 26 Jul 2025
Viewed by 252
Abstract
Against the backdrop of the rapid advancement of intelligent speech interaction and holographic display technologies, this paper introduces an interactive holographic display system. This paper applies 2D-to-3D technology to acquisition work and uses a Complex-valued Convolutional Neural Network Point Cloud Gridding (CCNN-PCG) algorithm to generate a computer-generated hologram (CGH) with depth information for application in point cloud data. During digital human hologram building, 2D-to-3D conversion yields high-precision point cloud data. The system uses ChatGLM for natural language processing and emotion-adaptive responses, enabling multi-turn voice dialogs and text-driven model generation. The CCNN-PCG algorithm reduces computational complexity and improves display quality. Simulations and experiments show that CCNN-PCG enhances reconstruction quality and speeds up computation by over 2.2 times. This research provides a theoretical framework and practical technology for holographic interactive systems, applicable in virtual assistants, educational displays, and other fields. Full article
(This article belongs to the Special Issue Artificial Intelligence, Computer Vision and 3D Display)

34 pages, 12831 KiB  
Article
Behavior of Large-Diameter Circular Deep Excavation Under Asymmetric Surface Surcharge
by Ping Zhao, Youqiang Qiu, Feng Liu, Zhanqi Wang and Panpan Guo
Symmetry 2025, 17(8), 1194; https://doi.org/10.3390/sym17081194 - 25 Jul 2025
Viewed by 215
Abstract
Circular deep excavations, characterized by their symmetrical geometry, are commonly employed in constructing foundations for large-span suspension bridges and as launching shafts for shield tunneling. However, the mechanical behavior of such excavations under asymmetric surface surcharge remains inadequately understood due to a paucity of relevant investigations. This study addresses this knowledge gap by establishing a three-dimensional finite element model (3D-FEA) based on the anchor deep excavation project of a specific bridge. The model is utilized to investigate the influence of asymmetric surcharge on the forces and deformations within the supporting structure. The results show that both the internal force and displacement cloud diagrams of the support structure exhibit asymmetric characteristics. The distribution of displacement and internal forces has spatial effects, and the maximum values all occur in the areas where asymmetric loads are applied. The maximum values of the displacement, axial force, and shear force of underground continuous walls increase with the increase in the excavation depth. The total displacement curves all show the feature of a “bulging belly”. The maximum displacement is 13.3 mm. The axial force is mainly compression, with a maximum value of −9514 kN/m. The maximum positive and negative values of the shear force are 333 kN/m and −705 kN/m, respectively. The bending moment diagram of different monitoring points shows the characteristics of “bow knot”. The maximum values of the positive bending moment and negative bending moment are 1509.4 kN·m/m and −2394.3 kN·m/m, respectively. The axial force of the ring beam is mainly compression, with a maximum value of −5360 kN, which occurs in ring beams 3, 4, and 5. The displacement cloud diagram of the support structure under symmetrical loads shows symmetrical characteristics. 
Under different load conditions, the displacement curve of the diaphragm wall shows the characteristics of “bulge belly”. The forms of loads with displacements from largest to smallest at the same position are as follows: asymmetric loads, symmetrical loads, and no loads. These findings provide valuable insights for optimizing the structural design of similar deep excavation projects and contribute to promoting sustainable urban underground development. Full article
(This article belongs to the Special Issue Symmetry, Asymmetry and Nonlinearity in Geomechanics)

21 pages, 3816 KiB  
Article
A K-Means Clustering Algorithm with Total Bregman Divergence for Point Cloud Denoising
by Xiaomin Duan, Anqi Mu, Xinyu Zhao and Yuqi Wu
Symmetry 2025, 17(8), 1186; https://doi.org/10.3390/sym17081186 - 24 Jul 2025
Viewed by 247
Abstract
Point cloud denoising is essential for improving 3D data quality, yet traditional K-means methods relying on Euclidean distance struggle with non-uniform noise. This paper proposes a K-means algorithm leveraging Total Bregman Divergence (TBD) to better model geometric structures on manifolds, enhancing robustness against noise. Specifically, TBDs—Total Logarithm, Exponential, and Inverse Divergences—are defined on symmetric positive-definite matrices, each tailored to capture distinct local geometries. Theoretical analysis demonstrates the bounded sensitivity of TBD-induced means to outliers via influence functions, while anisotropy indices quantify structural variations. Numerical experiments validate the method’s superiority over Euclidean-based approaches, showing effective noise separation and improved stability. This work bridges geometric insights with practical clustering, offering a robust framework for point cloud preprocessing in vision and robotics applications. Full article
(This article belongs to the Section Mathematics)

26 pages, 15535 KiB  
Article
BCA-MVSNet: Integrating BIFPN and CA for Enhanced Detail Texture in Multi-View Stereo Reconstruction
by Ning Long, Zhengxu Duan, Xiao Hu and Mingju Chen
Electronics 2025, 14(15), 2958; https://doi.org/10.3390/electronics14152958 - 24 Jul 2025
Viewed by 129
Abstract
The 3D point cloud generated by MVSNet has good scene integrity but lacks sensitivity to details, causing holes and non-dense areas in flat and weak-texture regions. To address this problem and enhance the point cloud information of weak-texture areas, the BCA-MVSNet network is proposed in this paper. The network integrates the Bidirectional Feature Pyramid Network (BIFPN) into the feature processing of the MVSNet backbone network to accurately extract the features of weak-texture regions. In the feature map fusion stage, the Coordinate Attention (CA) mechanism is introduced into the 3D U-Net to obtain direction-related position information along the channel dimension, improve detail feature extraction, optimize the depth map, and improve depth accuracy. The experimental results show that BCA-MVSNet not only improves the accuracy of detail texture reconstruction but also effectively controls the computational overhead. On the DTU dataset, the Overall and Comp metrics of BCA-MVSNet are reduced by 10.2% and 2.6%, respectively; on the Tanks and Temples dataset, the Mean metric over the eight scenes is improved by 6.51%. Three scenes were also captured with a binocular camera; combining the camera parameters with the BCA-MVSNet model yields excellent reconstruction quality in the weak-texture areas.

25 pages, 2129 KiB  
Article
Zero-Shot 3D Reconstruction of Industrial Assets: A Completion-to-Reconstruction Framework Trained on Synthetic Data
by Yongjie Xu, Haihua Zhu and Barmak Honarvar Shakibaei Asli
Electronics 2025, 14(15), 2949; https://doi.org/10.3390/electronics14152949 - 24 Jul 2025
Viewed by 198
Abstract
Creating high-fidelity digital twins (DTs) for Industry 4.0 applications is fundamentally reliant on the accurate 3D modeling of physical assets, a task complicated by the inherent imperfections of real-world point cloud data. This paper addresses the challenge of reconstructing accurate, watertight, and topologically sound 3D meshes from sparse, noisy, and incomplete point clouds acquired in complex industrial environments. We introduce a robust two-stage completion-to-reconstruction framework, C2R3D-Net, that systematically tackles this problem. The methodology first employs a pretrained, self-supervised point cloud completion network to infer a dense and structurally coherent geometric representation from degraded inputs. Subsequently, a novel adaptive surface reconstruction network generates the final high-fidelity mesh. This network features a hybrid encoder (FKAConv-LSA-DC), which integrates fixed-kernel and deformable convolutions with local self-attention to robustly capture both coarse geometry and fine details, and a boundary-aware multi-head interpolation decoder, which explicitly models sharp edges and thin structures to preserve geometric fidelity. Comprehensive experiments on the large-scale synthetic ShapeNet benchmark demonstrate state-of-the-art performance across all standard metrics. Crucially, we validate the framework’s strong zero-shot generalization capability by deploying the model, trained exclusively on synthetic data, to reconstruct complex assets from a custom-collected industrial dataset without any additional fine-tuning. The results confirm the method’s suitability as a robust and scalable approach for 3D asset modeling, a critical enabling step for creating high-fidelity DTs in demanding, unseen industrial settings.