Next Article in Journal
FLE-YOLO: A Faster, Lighter, and More Efficient Strategy for Autonomous Tower Crane Hook Detection
Previous Article in Journal
UMFNet: Frequency-Guided Multi-Scale Fusion with Dynamic Noise Suppression for Robust Low-Light Object Detection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications

Chair of Electronic Measurement and Diagnostic Technology, Institute of Energy and Automation Technology, Faculty IV-Electrical Engineering and Computer Science, Technische Universität Berlin, Einsteinufer 17, 10587 Berlin, Germany
*
Authors to whom correspondence should be addressed.
Appl. Sci. 2025, 15(10), 5361; https://doi.org/10.3390/app15105361
Submission received: 2 April 2025 / Revised: 1 May 2025 / Accepted: 8 May 2025 / Published: 11 May 2025

Featured Application

Monocular depth estimation models often produce non-metric (relative) depth, and even when they aim to predict metric depth, they tend to perform poorly on unseen data, as demonstrated in our study. However, many real-world applications of monocular depth estimation require accurate metric measurements, for example, to verify compliance with legal regulations. As part of the EU-funded research project BerDiBa, we investigated the calibration of depth estimation models for metric accuracy in the railway domain. Our goal was to establish a foundation for a specific use case: measuring the encroachment of vegetation into the structural gauge. This is a critical challenge, as such vegetation can obstruct and distort sensor data or, in the case of hard wooden plants, even cause physical damage to trains. We aimed to develop a method that enables reliable metric scaling of any recent monocular depth estimation model, regardless of its training data or camera parameters, in order to meet legal requirements.

Abstract

Three-dimensional reconstruction using monocular camera images is a well-established research topic. While multi-image approaches like Structure from Motion produce sparse point clouds, single-image depth estimation via machine learning promises denser results. However, many models estimate relative depth, and even those providing metric depth often struggle with unseen data due to unfamiliar camera parameters or domain-specific challenges. Accurate metric 3D reconstruction is critical for railway applications, such as ensuring structural gauge clearance from vegetation to meet legal requirements. We propose a novel method to scale 3D point clouds using the track gauge, which typically only varies in very limited values between large areas or countries worldwide (e.g., 1.435 m in Europe). Our approach leverages state-of-the-art image segmentation to detect rails and measure the track gauge from a train driver’s perspective. Additionally, we extend our method to estimate a reasonable railway-specific extrinsic camera calibration. Evaluations show that our method reduces the average Chamfer distance to LiDAR point clouds from 1.94 m (benchmark UniDepth) to 0.41 m for image-wise calibration and 0.71 m for average calibration.
Keywords: railway; predictive maintenance; driver assistance; monocular metric 3D reconstruction; metric depth estimation; scaling; extrinsic calibration; depth; disparity; image segmentation; LiDAR railway; predictive maintenance; driver assistance; monocular metric 3D reconstruction; metric depth estimation; scaling; extrinsic calibration; depth; disparity; image segmentation; LiDAR

Share and Cite

MDPI and ACS Style

Thomanek, D.; Gühmann, C. Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Appl. Sci. 2025, 15, 5361. https://doi.org/10.3390/app15105361

AMA Style

Thomanek D, Gühmann C. Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Applied Sciences. 2025; 15(10):5361. https://doi.org/10.3390/app15105361

Chicago/Turabian Style

Thomanek, Daniel, and Clemens Gühmann. 2025. "Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications" Applied Sciences 15, no. 10: 5361. https://doi.org/10.3390/app15105361

APA Style

Thomanek, D., & Gühmann, C. (2025). Metric Scaling and Extrinsic Calibration of Monocular Neural Network-Derived 3D Point Clouds in Railway Applications. Applied Sciences, 15(10), 5361. https://doi.org/10.3390/app15105361

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop