This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications
by
Daniel Thomanek
Daniel Thomanek *
and
Clemens Gühmann
Clemens Gühmann *
Chair of Electronic Measurement and Diagnostic Technology, Institute of Energy and Automation Technology, Faculty IV-Electrical Engineering and Computer Science, Technische Universität Berlin, Einsteinufer 17, 10587 Berlin, Germany
*
Authors to whom correspondence should be addressed.
Submission received: 2 April 2025
/
Revised: 1 May 2025
/
Accepted: 8 May 2025
/
Published: 11 May 2025
Featured Application
Monocular depth estimation models often produce non-metric (relative) depth, and even when they aim to predict metric depth, they tend to perform poorly on unseen data, as demonstrated in our study. However, many real-world applications of monocular depth estimation require accurate metric measurements, for example, to verify compliance with legal regulations. As part of the EU-funded research project BerDiBa, we investigated the calibration of depth estimation models for metric accuracy in the railway domain. Our goal was to establish a foundation for a specific use case: measuring the encroachment of vegetation into the structural gauge. This is a critical challenge, as such vegetation can obstruct and distort sensor data or, in the case of hard wooden plants, even cause physical damage to trains. We aimed to develop a method that enables reliable metric scaling of any recent monocular depth estimation model, regardless of its training data or camera parameters, in order to meet legal requirements.
Abstract
Three-dimensional reconstruction using monocular camera images is a well-established research topic. While multi-image approaches like Structure from Motion produce sparse point clouds, single-image depth estimation via machine learning promises denser results. However, many models estimate relative depth, and even those providing metric depth often struggle with unseen data due to unfamiliar camera parameters or domain-specific challenges. Accurate metric 3D reconstruction is critical for railway applications, such as ensuring structural gauge clearance from vegetation to meet legal requirements. We propose a novel method to scale 3D point clouds using the track gauge, which typically only varies in very limited values between large areas or countries worldwide (e.g., m in Europe). Our approach leverages state-of-the-art image segmentation to detect rails and measure the track gauge from a train driver’s perspective. Additionally, we extend our method to estimate a reasonable railway-specific extrinsic camera calibration. Evaluations show that our method reduces the average Chamfer distance to LiDAR point clouds from m (benchmark UniDepth) to m for image-wise calibration and m for average calibration.
Share and Cite
MDPI and ACS Style
Thomanek, D.; Gühmann, C.
Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Appl. Sci. 2025, 15, 5361.
https://doi.org/10.3390/app15105361
AMA Style
Thomanek D, Gühmann C.
Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Applied Sciences. 2025; 15(10):5361.
https://doi.org/10.3390/app15105361
Chicago/Turabian Style
Thomanek, Daniel, and Clemens Gühmann.
2025. "Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications" Applied Sciences 15, no. 10: 5361.
https://doi.org/10.3390/app15105361
APA Style
Thomanek, D., & Gühmann, C.
(2025). Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Applied Sciences, 15(10), 5361.
https://doi.org/10.3390/app15105361
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.