1. Introduction
In recent years, the emergence of breakthrough technologies aimed at optimizing production processes has led to remarkable growth, marking the beginning of what is often referred to as the fifth industrial revolution. This transformative period, known as Industry 5.0, is characterized by the seamless integration of manufacturing workflows with information systems and communication technologies, where the Industrial Internet of Things (IIoT) and Industrial Augmented Reality (IAR) play pivotal roles [
1,
2,
3].
Advancements in digital technologies have opened new avenues for innovation in the shipbuilding industry. The integration of digital tools, such as Extended Reality (XR), offers transformative potential for improving efficiency, accuracy and collaboration in complex industrial processes. Augmented Reality (AR) and Mixed Reality (MR), in particular, have emerged as powerful technologies capable of overlaying virtual information onto the real-world environment, enabling real-time visualization and interaction [
4].
Specifically, this research is framed within a joint project carried out in association with Navantia, one of the largest European shipbuilders, and Universidade da Coruña (Spain). The authors of this article work in a research line called “Digital Worker”, aimed at analyzing the potential of IAR and Industrial Mixed Reality (IMR) applications for Shipyard 5.0 [
5].
In this context, the construction of large vessels is a highly complex process that involves multiple specialized tasks and extensive coordination across various disciplines. Shipyards, as hubs of maritime engineering, rely on cutting-edge technologies and well-structured workflows to meet the demands of modern shipbuilding. However, according to industry experts and data provided by Navantia, some shipyard processes still depend on traditional methodologies, such as the use of 2D blueprints and extensive paper-based documentation. While these conventional approaches have been widely used, they often lead to inefficiencies and increase the risk of errors, particularly in tasks requiring precision during assembly procedures. Among the various processes involved in ship construction, electrical outfitting tasks are particularly relevant. These tasks consist of designing and installing the support brackets, pass-through openings and other components of the electrical network of the vessel. As reported by Navantia, during the project analysis and design phase, electrical outfitting work is primarily performed indoors in workshops, where the assembly process traditionally relies heavily on paper documentation.
Therefore, and due to the large amount of required documentation, efforts were made to develop an MR application to speed up the placement and installation of electrical outfitting during the assembly process of the vessel parts (blocks) and to improve operator efficiency. Operators are intended to use smart glasses that superimpose the electrical outfitting elements directly on the real environment so that these can be used as a control reference.
Research as well as commercial efforts have explored the integration of speech control or voice assistance with AR and MR systems in industrial contexts. In the maritime domain, in [
6] the author has tested AR wearables for vessel repair and maintenance processes since 2019, enabling real-time, voice-assisted remote guidance for shipyard personnel. Moreover, in [
7] the S.A.M.I.R. architecture was proposed to combine AR with Natural Language Processing for tele-maintenance in industrial automation settings. Academic research has also analyzed the performance challenges of Automatic Speech Recognition (ASR) on Microsoft HoloLens 2, specifically by simulating an industrial environment inside a recording studio, which revealed that online ASR capabilities degrade significantly when sound pressure levels exceed 64.2 dB(A) [
8]. In addition, recent literature reviews on Voice User Interfaces (VUIs) for the manufacturing industry identified maintenance as a frequent application scenario and showed the increasing combination of VUIs with AR glasses in these settings [
9]. However, the existing approaches predominantly rely on Cloud-based ASR services or support only major languages and do not address the specific challenges of low-resource languages within harsh shipyard environments. Furthermore, the automatic generation of precision dimensions between 3D objects as part of an integrated MR workflow represents a capability that is not reported by prior AR/MR industrial applications with on-device ASR capabilities.
Specifically, this article includes the following main contributions, which have not been found together in previous works:
Automatic generation of dimensions between two 3D objects (e.g., a support and a component of the structure, or a support and a pass-through).
A Galician on-device ASR system that allows operators to interact with the MR application hands-free with just the use of their voice.
Finally, it is worth mentioning that this article provides a broad and exhaustive set of tests aimed at evaluating the proposed system. Through the usability tests, the NASA Task Load Index (NASA-TLX) methodology was adapted to XR experiences and later transformed into an automated process that collects application usage data. Another experiment was conducted in which a group of shipyard workers tested the application and validated aspects such as its efficiency and its usefulness as a replacement for traditional workflows. In addition, the tests carried out on the ASR system demonstrate its robust performance in noisy environments such as factories and industrial workshops.
The rest of this article is structured as follows.
Section 2 reviews the state of the art of AR and MR solutions that assist in industrial environments, with a particular focus on the shipbuilding industry.
Section 3 details the design of the proposed system, and
Section 4 explains how the proposed features were implemented.
Section 5 details the experiments and validation tests that were carried out both in the research phase and at the workshops, which involved the end-users of the application. Finally,
Section 7 presents the conclusions.
2. State of the Art
2.1. AR/MR in the Construction Industry
In recent years, the integration of MR technologies into industrial environments has gained increasing attention due to its potential to enhance productivity, safety and collaboration. Within the Architecture, Engineering, Construction and Operations (AECO) industry, significant advancements have been made in utilizing MR to address complex challenges.
For instance, the authors of [
10] provided a comprehensive review of MR applications in the AECO industry, highlighting their use in tasks such as design visualization, construction planning and real-time collaboration. This study underscored the transformative impact of MR in reducing errors and improving decision-making processes. Expanding on this topic, [
11] explores the role of AR and MR in remote collaboration on physical tasks. In this article, various AR/MR systems were reviewed, showing examples where experts and operators interact in real-time across geographically dispersed locations. Following from the previous literature, a study has also been found which examined Virtual Reality (VR) and AR applications in construction safety, identifying their effectiveness in mitigating workplace hazards [
12].
Moreover, Building Information Modeling (BIM) has emerged as a critical enabler of MR technologies in construction, as can be seen in [
13], where the authors proposed a BIM data flow architecture integrated with AR/VR technologies, demonstrating its applicability across various AECO use cases. Meanwhile, the Digital Twin (DT) paradigm further enhances MR capabilities: [
14] provided a detailed literature review on DT applications in construction, illustrating their potential to optimize workflows and resource management.
Focusing on shipbuilding-specific research, MR technologies have also gained traction. For example, authors in [
15] created a framework that enables product designers to browse part assembly sequences for process validation.
On the other hand, some projects have been found while looking at the wider market for commercial solutions, although they are not entirely fitted to this specific industrial area. Several mobile device applications currently address 3D visualization and interaction, offering a range of functionalities but with limitations in scope for precise industrial applications. For example, Polycam [
16] integrates photogrammetry with AR to create and visualize realistic 3D models, allowing users to measure and view point clouds or mesh-based FBX models in AR environments. BIMx [
17], linked to Archicad [
18], facilitates 3D model visualization and project management by combining plans and models, with built-in measurement tools. While widely used in architectural and construction contexts, it lacks AR capabilities, which restricts its potential for real-time alignment with physical environments. eDrawings [
19] focuses on Computer Aided Design (CAD) file visualization using AR and VR technologies, allowing users to interact with and explore 3D models in real-world environments. However, this last solution is only available for mobile devices, which lack the power and display quality of purpose-built hardware such as dedicated Head-Mounted Display (HMD) smart glasses.
2.2. Automatic Dimension Generation Between 3D Objects
Numerous studies were found that explore the use of XR technologies for the analysis and interactive visualization of 3D models within the AECO industry, as previously stated in
Section 2.1.
In contrast, it has been considerably challenging to find literature addressing the specific functionality of acquiring measurements within a 3D model. Eventually, a study was identified in which a VR system was developed for the visualization, inspection and measurement of 3D models obtained through photogrammetry [
20]. To perform the measurements, users are required to place two markers on the 3D model, and then the system calculates the Euclidean distance between the two points. This makes the measuring process dependent on the accessibility of the physical space to be measured, as well as on the capability to perform an exhaustive photogrammetry analysis of the surrounding area.
However, a system like this does not meet the high standards of accuracy required for high-precision assembly tasks in a shipyard. The system proposed in this article tackles this dependency, since the user would only need to specify the two elements to be measured, and the system would automatically obtain the minimum distance between the two objects.
2.3. Automatic Speech Recognition in Industrial Metaverse Applications
Modern ASR systems have been shown to play a key role regarding the level of intuitiveness of interactions in IMR environments [
21], particularly in scenarios like shipbuilding, where traditional touch- or gesture-based interfaces may be impractical in certain situations [
22].
In these settings, workers often require their hands to be free in order to perform complex tasks or operate certain tools or machinery, and environmental factors such as gloves, dust or liquids further complicate the use of standard MR input paradigms.
Moreover, in the ASR space, low-resource languages such as Galician have recently garnered attention due to the lack of the large training datasets that are available for more widely spoken languages such as English or Mandarin [
23]. Specific advancements in ASR for low-resource languages have leveraged techniques such as transfer learning, where models trained on large corpora are later fine-tuned with smaller datasets [
24]. Multilingual models have also proven effective by extracting shared linguistic features across languages and then fine-tuning the resulting model to a low-resource language [
25]. Moreover, the use of semi-supervised or unsupervised learning methods has enabled the use of unlabeled data to further train and enhance model performance [
26,
27].
Regarding standalone MR devices like the Microsoft HoloLens 2 used in this implementation, the use of transformer-based model architectures with Connectionist Temporal Classification (CTC) encoding such as the one present in the Wav2Vec 2.0 architecture tends to be more efficient than other legacy approaches [
28]. These models optimized with quantization, pruning and other overhead reduction techniques have shown that real-time processing of the user’s voice is possible, even in such computationally constrained devices [
29]. Effective Active Noise Cancellation (ANC) algorithms and spatial audio processing have also been previously demonstrated as being key necessities in order to ensure precise and clear audio capture as well as accurate command recognition [
30]. This is key in the Industrial Metaverse, as such processing has been shown to enable higher transcription accuracy without the need to increase the computational capabilities of the embedded devices or the complexity of the ASR models [
31]. In shipyards specifically, this can be one of the main challenges, as very high levels of noise are prevalent due to the continuous use of heavy machinery and tools [
32].
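As an illustration of the CTC decoding step used by such transformer-based models, the following minimal sketch shows greedy CTC decoding: the most likely token is taken per audio frame, consecutive repeats are collapsed and the blank symbol is removed. The vocabulary, frame sequence and blank symbol are illustrative assumptions, not the actual Wav2Vec 2.0 implementation.

```python
# Minimal sketch of greedy CTC decoding, as used by Wav2Vec 2.0-style
# acoustic models: pick the most likely token per frame, collapse
# consecutive repeats, then drop the blank symbol.
# (Illustrative only; the vocabulary and frame outputs are made up.)

BLANK = "_"  # CTC blank token

def ctc_greedy_decode(frame_tokens):
    """Collapse consecutive repeats, then remove blank tokens."""
    decoded = []
    prev = None
    for tok in frame_tokens:
        if tok != prev:       # collapse consecutive repeats
            if tok != BLANK:  # drop blanks
                decoded.append(tok)
        prev = tok
    return "".join(decoded)

# Per-frame argmax output for the (hypothetical) word "ola":
frames = ["_", "o", "o", "_", "l", "l", "_", "a", "a", "_"]
print(ctc_greedy_decode(frames))  # -> ola
```

Note that the blank token is what allows genuine double letters to survive decoding: "l", "_", "l" decodes to "ll", whereas "l", "l" collapses to a single "l".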
2.4. Analysis of the State of the Art
The previously analyzed studies provide a strong groundwork for the design and development of an MR application for shipyard operators. These studies serve as a foundational reference for applying MR in complex industrial domains, including shipyards, where similar challenges arise. Specifically, the AECO industry concepts align with shipyard operations, particularly in scenarios requiring critical collaboration between on-site workers and remote engineers [
10,
11]. Moreover, these findings apply directly to shipyard environments, where safety is paramount during assembly [
2]. By integrating BIM or DT technologies, the proposed applications address precision and safety challenges in complex assembly processes.
However, while prior work has explored MR applications for visualization, collaboration and industrial training, relatively few studies have addressed measurement capabilities in VR and AR contexts. These existing methods typically rely on manual marker placement, which introduces human error. The proposed system overcomes these limitations by automatically computing distances between selected 3D elements, ensuring high-precision measurements without user-dependent placement inaccuracies.
In addition, the solution presented in this article also incorporates on-device ASR capabilities for MR in noisy environments. The reviewed literature rarely addresses this complex subject, especially for low-resource languages like Galician. Although industrial MR solutions often integrate speech recognition, they frequently rely on cloud-based services, which raises concerns regarding latency and connectivity in large-scale shipyards. To solve this, the proposed system introduces a Galician on-device ASR model optimized for real-time, offline processing on Microsoft HoloLens 2 smart glasses. This implementation significantly improves hands-free interaction, allowing operators to issue commands and retrieve information efficiently, even in high-noise settings.
Regarding performance benchmarks, the previous work of the authors of this article evaluated the performance of a collaborative AR framework in shipyard environments, demonstrating communication latencies below 5 ms for regular packets and anchor transmission times between 2 and 30 seconds [
22]. The existing literature also reports efficiency improvements from MR/AR systems in various industrial contexts, such as 47% to 78% time reductions in railway assembly tasks [
33]. However, such performance benchmarks are highly domain-specific as well as task-specific, with little direct applicability from one field or task to another. This applies especially to the unique scenario presented in this article, which combines automatic dimension generation, on-device ASR for a low-resource language and deployment inside a shipyard. Thus, this article prioritizes real-world validation with end users over cross-domain benchmarks, which ensures that any findings are directly applicable to the electrical outfitting context inside shipyards.
Building on this approach, capability-oriented comparisons reveal that, unlike prior work in the literature that achieved 47% to 78% efficiency gains in railway assembly tasks [
33], this system integrates automatic dimension generation within the application, as well as low-resource on-device ASR, for shipyard electrical outfitting tasks, a capability combination that is absent in the current literature. While [
10] reviews MR visualization and [
20] relies on manual photogrammetry markers, neither provides an automatic measurement feature validated in an industrial shipyard.
Recent shipyard technology surveys, including the 2025 Mari4_YARD project, confirm that shipyard environments remain particularly challenging for MR device deployment [
4]. In the application presented in this article, the on-device ASR implementation achieves a 100% intent recognition rate with 2.76 s latency and 16.5% CER, and at the same time avoids cloud connectivity dependencies, unlike the solutions available in the literature [
8].
Building on these insights, this research applies MR technologies to shipbuilding with the aim of improving the effectiveness, accuracy and reliability of operators during electrical outfitting assembly and inspection, ultimately contributing to the evolution of the Shipyard 5.0 paradigm.
3. Analysis and Design of the Proposed System
3.1. Main Goals of the System
The aim of the proposed work, as was briefly explained before, is to develop an application for HoloLens 2 smart glasses that streamlines and enhances the efficiency of workshop operators in the placement and installation tasks of electrical elements during the assembly process of a vessel. Therefore, the goal of the glasses is to present the 3D elements directly onto the real hull structure and to provide all needed dimensions, so they can be used as a reference by the workshop operators to measure where to install the final elements.
Specifically, Navantia provides additional requirements to be achieved by the application:
The application must show the detailed information associated with each electrical outfitting element in line with a full digital process.
The application must show the blueprints associated with each electrical outfitting element: measurements, position relative to centerline/frame, rotation, etc.
3.2. Design Requirements
The requirements to be fulfilled by the application developed as part of this project are listed below:
Visualization of the 3D models of electrical components in their corresponding positions.
Display of centerline, frame and longitudinal identifiers along the structure.
Visualization of the 3D model of the filling (space reserved for cables, etc.).
Possibility to show and hide each of these 3D objects.
Display of dimensions for each electrical element.
Tags/labels with associated information for each electrical element.
Search by element identifier code.
Filtering elements by groups: block number, stage, parametric control and PBS.
Optional use of Bluetooth wireless peripherals to enter text.
System that allows scene switching between different block scenes.
Automatic dimension generation system. The main feature of this system is the generation of a dimension between two elements selected by the user. The positioning of the dimensions will be specific and adapted as appropriate for each type of element. In addition, the dimensions shall be positioned between the closest faces of the 3D elements. Dimensions shall also be split along the X, Y and Z axes, allowing the user to obtain the values parallel to the centerline, the vertical and the frames of the vessel. Finally, the possibility to move dimensions to prevent them from being partially or totally hidden by other 3D objects will also be provided.
Implementation and deployment of an on-device ASR system for voice interaction and control of the User Interface (UI) elements and functionalities via voice commands in Spanish or Galician.
User monitoring system. Measurement of times, clicks and gaze–object collisions for each performed task. Measurement of ASR-related metrics such as latency, Character Error Rate (CER) and Word Error Rate (WER) as well as relevant voice samples.
System for automatic collection and storage of application usage data.
Generation of alerts and notifications in association with an electrical element. These alerts and notifications include small descriptions that can be input using the on-device ASR system.
4. Implementation of the System
This HoloLens application has been developed using Unity [
34] and Microsoft’s Mixed Reality Toolkit 3 (MRTK3) [
35]—a toolkit that provides ready-to-use objects and functionalities to be included in the application, such as buttons or dialogue panels. This greatly facilitates the application development process, since all that is required is to add these objects to the scene and configure them according to the needs of the project. At the time the project was initiated, the available versions of this software were Unity 2021.3.6f1 and MRTK 3 beta version 3.0.0-pre.14. In a later stage, the project was updated to Unity 6.0.23f1 and MRTK 3.2.2.
4.1. UI: 3D Environment
The MR application consists of a very important visual part: the 3D virtual replica of the elements that make up the vessel section that is being built. These 3D elements, which can be seen in
Figure 1, are the following: the hull and internal structure (represented with burgundy and mustard colors), electrical outfitting elements (mustard, purple and other colored wall-mounted structures) and the filling (space reserved for cables) in a strong and soft shade of pink.
Inside the workshops, in order to orient themselves within the construction blocks that will form the final ship, operators use a 2-axis coordinate system referring to the centerline (an imaginary line that runs through the ship from bow to stern) and the frames (imaginary lines parallel to each other and perpendicular to the centerline, which could be referred to as the “ribs” of the ship).
4.2. UI: Interacting with the Application
For the user interaction with the buttons (Pressable Button), both far and near interaction were enabled, allowing users to operate the buttons at a distance or to press them directly with their finger as if they were tactile buttons (near interaction). Both options are provided to give freedom of movement and to make the User Experience (UX) as organic as possible, thus accommodating each user, as some find it more comfortable to interact from a distance while others prefer to tap directly on the buttons and objects in the scene. Furthermore, from previous experience and also through the conducted tests, it has been confirmed that some users are only comfortable interacting in one of the two ways (far/near).
4.3. UI: Menu Panels
In addition to the direct interaction of the user with the 3D scene, a menu was also designed through which the user can navigate to access the rest of the application’s functionalities. This menu is divided into different panels, one per functionality, each accessible by clicking the corresponding button on the left sidebar. The menu has a button that allows the user to anchor the panel, fixing its position and stopping it from following the user around. In addition, all panels have a white bottom bar that can be used to move and position the menu wherever the user finds most convenient.
The first panel, ‘
Mostrar/ocultar elementos’ (show/hide elements), can be seen in
Figure 2a. Through this panel, the following elements can be selected to make them visible or hide them: block structure, filling, centerline and frame identifiers and, lastly, the labels associated with each electrical outfitting element. In addition, the automatic generation of dimensions can also be activated or deactivated from this panel.
The next tab is the panel for searching and filtering electrical elements, illustrated in
Figure 2b. In this case, the panel is divided into two parts: the upper section for the search (using the floating keyboard to enter the identifier of the element in the text field) and, on the other hand, the lower section for group filtering by types/groupers (where the options are selected from the drop-down menus).
On the third menu panel, which can be seen in
Figure 3a, the start and end of a task are managed. In addition, next to the start button of each task there is also a counter with the number of clicks made during the task and the time elapsed since the button was pressed. Additionally, when a task is started, a floating orange panel appears to the left of the menu, which also shows the timer and contains a button to end the task. This small panel remains visible until the task is finished, regardless of whether the user switches to another panel. It thus allows the user to navigate freely through the rest of the menu and, at the same time, serves as a reminder to mark the task as completed.
The next menu tab is very straightforward, as it currently consists of only one button, as can be seen in
Figure 3b. The idea of this panel is to be able to change to a different block at any time, making it possible to navigate through the vessel without the need to restart the application.
4.4. UI: Element Tags
In order to identify each of the electrical outfitting elements that can be viewed in the application, tags have been added for each one of the brackets and pass-throughs. An example of these labels can be found in
Figure 4. In the upper left corner of the label, a green icon is added to indicate whether the electrical element is registered as installed in the system.
In addition, this resource is used to extend the information offered on each of the elements. To this end, a drop-down label has been designed, as shown in
Figure 4b. In its collapsed version, only the identifier of each element is visible, as well as a button to display the label and another to show or hide the element’s dimensions. The displayed label contains all the information regarding the corresponding element, while maintaining the buttons mentioned above. This information is currently contained in the blueprints or data sheets used for the correct positioning of each element in the block structure.
On the other hand, as shown in
Figure 4b, a small panel was added at the top of the displayed label where the information of the assembly order of that element is specified. Additionally, in the big drop-down version of the label, a new button allows the creation of a notification associated to this specific element (for more information see
Section 4.7). When the notification is sent, the background of the element identifier lights up to indicate that this electrical element has a pending, unresolved error.
4.5. Search and Filtering System
The search system integrated in this project is essential for finding and positioning the electrical outfitting elements in a fast, comfortable and efficient way, since within one of the blocks that will form part of the final vessel there can be tens to hundreds of elements in the electrical outfitting area alone. This means that, when combined with the remaining building areas, finding a specific element at an advanced stage of the vessel construction can become a tedious and time-consuming task.
For this reason, and in order to enhance the digitalization in the workshops, a new feature has been implemented. This enables each element to be searched for by using its identifier code. The search menu consists of a panel with a text box into which the identifier is typed (as previously shown in
Figure 2b). For this, the virtual keyboard provided by the HoloLens is used. After entering the code, only the elements that match the text will be visible. For this purpose, a light bulb with a flashing effect has been added to help identify the matching elements of the specified search, as illustrated in
Figure 5. As soon as the user starts typing, the search is carried out without the need to press any confirmation button.
On the other hand, within workshops there is also the need to classify elements and situations in which only electrical outfitting elements of a certain type are to be installed or inspected. For these cases, the workshops use specific codes which are called groupers.
For this purpose, a panel has been designed within the main menu for the filtering of elements, from which the following types of groupers can be selected: type of element (support/pass-through), block (a fixed value for each scene that cannot be changed manually), construction stage, parametric control and PBS. As can be seen in
Figure 6, a drop-down subpanel menu has been added to make the selection of groupers as comfortable as possible. In this way, the user interacts as little as possible with the application, avoiding the need to type the acronyms or names of the filters to be applied.
This filtering functionality behaves in the same way as the element search: selecting any of the options in the drop-down menus hides all the elements except those that meet the selected criteria. The filters can also be disabled individually, or all criteria can be cleared at once using the top button on the panel.
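The combined search and filter behavior can be sketched as a simple visibility predicate: an element remains visible only if it matches the typed identifier text and every active grouper filter. The field names and sample data below are hypothetical, not Navantia’s actual schema.

```python
# Simplified model of the element search/filter logic: an element stays
# visible only if it matches the identifier search text AND every active
# grouper filter. Field names and values are illustrative only.

elements = [
    {"id": "SUP-001", "type": "support", "stage": "S1", "pbs": "P10"},
    {"id": "SUP-002", "type": "support", "stage": "S2", "pbs": "P10"},
    {"id": "PAS-001", "type": "pass-through", "stage": "S1", "pbs": "P20"},
]

def visible_elements(elements, search_text="", filters=None):
    filters = filters or {}          # e.g. {"stage": "S1"}; empty = all cleared
    result = []
    for el in elements:
        if search_text and search_text.lower() not in el["id"].lower():
            continue                 # search-as-you-type: substring match
        if any(el.get(k) != v for k, v in filters.items()):
            continue                 # must satisfy every active grouper
        result.append(el)
    return result

print([e["id"] for e in visible_elements(elements, "SUP")])
# -> ['SUP-001', 'SUP-002']
print([e["id"] for e in visible_elements(elements, filters={"stage": "S1"})])
# -> ['SUP-001', 'PAS-001']
```

Disabling one filter simply removes its key from the dictionary, and clearing all criteria corresponds to an empty dictionary and empty search string, which makes every element visible again.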
4.6. Automatic Dimension Generation System
The aim of the automatic dimension generation system is to create a tool that allows users to generate the dimensions they request by simply clicking on two elements of the vessel, whether they are brackets, crosspieces or structural sections.
The first step to achieve this functionality is to detect clicks and register which element the user is interacting with. To do this, the implementation made for user monitoring, detailed in
Section 4.9, is reused and extended so that the application tracks which object is being clicked on. Then, each structural or electrical outfitting element handles the click internally, returning the point at which the dimension has to be placed, since each type of element is dimensioned differently. During the analysis process, each of the elements was standardized to ensure that dimensions are generated consistently, always referring to the same point regardless of where the user clicks on the structural or electrical outfitting element.
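This per-element click handling can be modeled as each element type exposing its own standardized dimensioning point. The sketch below uses hypothetical anchor rules and geometry; it is not the actual Unity component code.

```python
# Each element type returns its own standardized dimensioning point, so
# dimensions are generated consistently no matter where the user clicks.
# The anchor rules and offsets here are illustrative assumptions.

class Element:
    def __init__(self, origin):
        self.origin = origin  # (x, y, z) of the element's reference corner

    def dimension_anchor(self):
        raise NotImplementedError

class Bracket(Element):
    def dimension_anchor(self):
        # e.g. brackets could be dimensioned from their base corner
        return self.origin

class PassThrough(Element):
    def dimension_anchor(self):
        # e.g. pass-throughs from the center of their opening
        # (hypothetical 5 cm offset on x and y)
        x, y, z = self.origin
        return (x + 0.05, y + 0.05, z)

clicked = PassThrough((1.0, 2.0, 0.0))
print(clicked.dimension_anchor())
```

When the user clicks two elements, the system only needs to query `dimension_anchor()` on each one; the click coordinates themselves never influence where the dimension attaches.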
Throughout this process, care is also taken to ensure that the dimensions are positioned on the closest faces of the elements so that the metal thickness is not included in the measurement. This is especially important when dimensioning between the structure and another element, as the wall of the structure may be several centimeters thick, and the user wants the shortest distance between the elements, not a measurement from the middle or center of gravity of the 3D model.
Moreover, Navantia requests the possibility of separating the dimensions into the three axes, X, Y and Z, so that the measurements can always be taken in the stern–bow, port–starboard or vertical direction. For this reason, auxiliary axes have been added, as shown in
Figure 7, separating the dimension into three measurements, one for each axis.
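The two behaviors above, measuring between the closest faces and splitting the result into per-axis values, can be sketched for the simplified case of axis-aligned bounding boxes. The geometry values below are illustrative, not taken from the actual vessel models.

```python
# Sketch: per-axis gap between the closest faces of two axis-aligned
# bounding boxes. Measuring face-to-face (rather than center-to-center)
# keeps the wall thickness out of the result, and the per-axis values
# correspond to the stern-bow, port-starboard and vertical measurements.
# Boxes are (min_corner, max_corner) tuples of (x, y, z) coordinates.

def face_gap_per_axis(box_a, box_b):
    gaps = []
    for axis in range(3):
        a_min, a_max = box_a[0][axis], box_a[1][axis]
        b_min, b_max = box_b[0][axis], box_b[1][axis]
        if a_max < b_min:        # A entirely before B on this axis
            gaps.append(b_min - a_max)
        elif b_max < a_min:      # B entirely before A
            gaps.append(a_min - b_max)
        else:                    # ranges overlap: closest faces touch
            gaps.append(0.0)
    return tuple(gaps)

structure_wall = ((0.0, 0.0, 0.0), (0.25, 3.0, 3.0))   # 25 cm thick wall
bracket = ((0.75, 1.0, 1.0), (1.0, 1.25, 1.25))
print(face_gap_per_axis(structure_wall, bracket))
# -> (0.5, 0.0, 0.0): 50 cm from the wall's inner face, not its middle
```

Note that the real 3D elements are not axis-aligned boxes, so the actual system works with the standardized anchor faces of each element type; the sketch only illustrates why face-to-face measurement excludes the wall thickness.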
In this system a final functionality was added, namely the possibility of repositioning the dimensions. In this way, the user can position them in a place where they are not completely or partially hidden by other elements of the scene. Furthermore, there is also the possibility to delete the dimension with the button that appears below the number in the middle of the dimension.
4.7. Notification System
The notification system was designed to send alerts from the MR application to Navantia’s cloud, specifying problems detected in the workshops. These warnings could indicate, for example, an error in the information, faulty material or that the server reports an element as already installed when in fact it is not. Errors are easy to detect while using the MR application, either during installation or when inspecting the vessel, so it is very useful that notifications can be generated from within the application itself.
The menu shown in
Figure 8a was designed for the creation of the notifications. In this panel, it is possible to select which department will receive the notification from the list of recipients; subsequently, a drop-down menu appears for each recipient showing the specific error to be notified. A description text field is also included where a more detailed description of the error can be added.
To increase the ease of use of this notification interface by avoiding the virtual keyboard as much as possible, a voice dictation button was added next to the description text field, as can be seen in
Figure 8b. This button, when pressed, allows the user to input the description by freely speaking in either Spanish or Galician, and the underlying on-device ASR system will transcribe what the user says into the description text field. Further implementation details regarding the on-device ASR system are provided in
Section 4.8.
Moreover, the notification also includes the possibility of taking a screenshot of what is seen through the Microsoft HoloLens 2 smart glasses, so that it can be attached to the notification to expand the information about the error in a visual way.
Currently, the alerts are locally stored in a CSV file. The data stored in the CSV are: date and time at which the notification is created; element identifier, if applicable, on which the notification is being created; description; recipient of the notification (engineering, installer, material or other) and notification type (specific type depending on the recipient).
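The CSV layout described above can be sketched as follows. This is a minimal Python illustration, not the application’s implementation: the function name, file name, field names and the example recipient/type strings are assumptions for the sketch.

```python
import csv
from datetime import datetime, timezone

# Column layout mirroring the fields described in the text (illustrative names).
FIELDS = ["timestamp", "element_id", "description", "recipient", "type"]

def log_notification(path, description, recipient, notif_type, element_id=""):
    """Append one alert as a CSV row, writing the header on first use."""
    try:
        new_file = open(path, encoding="utf-8").read() == ""
    except FileNotFoundError:
        new_file = True
    with open(path, "a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        if new_file:
            writer.writeheader()
        writer.writerow({
            "timestamp": datetime.now(timezone.utc).isoformat(),
            "element_id": element_id,   # empty when not applicable
            "description": description,
            "recipient": recipient,     # e.g., engineering / installer / material
            "type": notif_type,         # specific type depending on the recipient
        })

# Hypothetical alert, including an element identifier.
log_notification("alerts.csv", "Bracket missing on frame 12",
                 "engineering", "information_error", element_id="BRK-042")
```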
4.8. Integration of Voice Control Using On-Device ASR
The ASR system to be implemented required support for both Spanish and Galician, a low-resource language spoken in northwestern Spain. Due to both privacy and connectivity concerns inside the shipyard, it also had to run fully on-device, without access to the Internet. Furthermore, given the specific noise challenges commonly encountered in shipyards, such as those caused by the operation of heavy machinery and power tools, the ASR system also required the use of Active Noise Cancellation (ANC) during the capture of the user’s voice.
Microsoft HoloLens 2 smart glasses provide several ways to access the sound capture Application Programming Interfaces (APIs), which expose the embedded 5-microphone array, noise cancellation algorithms and the beamformed voice capture options available for each API. These APIs include the Unity API, the Mixed Reality Toolkit (MRTK) API and a custom sound capture implementation that uses the Windows Runtime (WinRT) API primitives present in the Windows Holographic OS. This last, WinRT-based implementation showed solid results in sound quality and noise cancellation capabilities.
Following initial hardware compatibility and efficiency tests on Microsoft HoloLens 2 (for in-depth details as well as comparative benchmarks, see [
36]), the final system deployed a fine-tuned Wav2Vec 2.0 ASR model. This model features 94.4 M parameters in the Open Neural Network eXchange (ONNX) format, running on top of the ONNX Runtime.
The final model’s activations and weights were also quantized dynamically to 8-bit integers, which reduced the model size by 68%, from an initial 361 MB to just 116 MB. Lastly, to decode the model’s CTC-encoded outputs, a native CTC decoder using the Beam Search algorithm was adapted to the ARM64 architecture and included as a separate Dynamic Link Library (DLL). Further in-depth details regarding the fine-tuning process and the datasets used can be found in [
37], and details regarding sound capture APIs, quantization techniques and the on-device ASR pipeline including the CTC decoder used can be found in [
36].
This inference and decoding pipeline is used in order to infer which voice command the user is trying to execute when the voice control is activated using the button seen in
Figure 9. In order to map the model’s decoded output to each of the available voice commands, the Levenshtein distance between the output of the model and each of the available commands is used. Although the implementation differs, the Levenshtein distance between two strings $a$ and $b$ (of lengths $|a|$ and $|b|$, respectively) is defined recursively by Equation (1):

$$
\operatorname{lev}(a, b) =
\begin{cases}
|a| & \text{if } |b| = 0, \\
|b| & \text{if } |a| = 0, \\
\operatorname{lev}\bigl(\operatorname{tail}(a), \operatorname{tail}(b)\bigr) & \text{if } a[0] = b[0], \\
1 + \min
\begin{cases}
\operatorname{lev}\bigl(\operatorname{tail}(a), b\bigr) \\
\operatorname{lev}\bigl(a, \operatorname{tail}(b)\bigr) \\
\operatorname{lev}\bigl(\operatorname{tail}(a), \operatorname{tail}(b)\bigr)
\end{cases} & \text{otherwise,}
\end{cases}
\tag{1}
$$

where $\operatorname{tail}(x)$ denotes the string $x$ without its first character.
The application then calls the function associated with whichever voice command has the shortest Levenshtein distance to the decoded output of the model.
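This matching step can be sketched in a few lines. The text notes that the actual implementation differs (an iterative dynamic-programming form is used here rather than the recursive definition), and the command list below is a small illustrative subset, not the application’s full command set:

```python
def levenshtein(a: str, b: str) -> int:
    """Iterative DP computation of the Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (ca != cb)))    # substitution
        prev = cur
    return prev[-1]

# Illustrative subset of Galician command phrases mapped to actions.
COMMANDS = {"amosa a estrutura": "show_structure",
            "agocha a estrutura": "hide_structure",
            "borra todas as cotas": "delete_all_dimensions"}

def match_command(transcription: str) -> str:
    """Return the action whose phrase is closest to the ASR output."""
    return COMMANDS[min(COMMANDS, key=lambda c: levenshtein(transcription, c))]

# A noisy transcription still maps to the intended command.
print(match_command("borra toda as cota"))  # → delete_all_dimensions
```

Because the winner is chosen by minimum distance rather than exact match, moderate transcription errors still resolve to the intended command, which is consistent with the intent recognition results reported in Section 5.3.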
4.9. User Monitoring Systems and Automatic Collection of Usage Data
The user monitoring system was initially created to track the tasks performed and the time spent on each. For this purpose, the collected data included: date, finish_timestamp (i.e., the time at which the task is completed), task number identifier, task name, duration (i.e., seconds elapsed since the start of the task), and the number of clicks performed during the task.
Next, the system was expanded to also monitor user interactions, adding functionality to track clicks between the user’s hand-rays or fingertips and the in-application collidable elements, such as the floating menu buttons or the scene tags.
In order to gather more interaction data and avoid limiting the implementation to hand interactions alone, the gaze monitoring subsystem was used to collect information on which elements were being gazed at. Collisions between the user’s gaze and the objects in the scene, both collidable (buttons and tags) and non-collidable (the 3D model of the ship), were recorded. The data collected regarding the user’s gaze were: date, timestamp, exit_eyegaze_timestamp and the name of the object with which the user’s gaze collided.
In order to detect clicks, the relevant objects such as menu buttons were made interactable, and the application was made to hook on to the OnClick event of these objects in order to record an entry every time this event got triggered.
Similarly, all elements in the scene that could be gazed at (this includes both menus and the 3D model of the ship itself) were made interactable using OpenXR functionalities. Interactions provided by OpenXR included events such as HoverEnter and HoverExited that were used to store the timestamps of when a user started to look at an element and when such a user stopped looking at the element.
To prevent loss of information, the previously mentioned data were gathered and saved to the same CSV file automatically every 30 seconds, adding one row per individual interaction. Later, prior to the analysis phase, any incomplete entries were removed during the data cleaning process to avoid biasing the results.
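The buffer-and-flush behavior described above can be sketched as follows. This is a Python illustration of the pattern only: the class name, field names and the explicit `flush()` call are assumptions (in the application, flushing happens automatically on a 30-second cadence inside the Unity app):

```python
import csv, io, time

class InteractionLogger:
    """Buffer interaction rows and flush them periodically
    (every 30 s in the application; illustrative field names)."""
    FIELDS = ["date", "timestamp", "event", "target"]

    def __init__(self, stream, interval=30.0):
        self.writer = csv.DictWriter(stream, fieldnames=self.FIELDS)
        self.writer.writeheader()
        self.buffer = []
        self.interval = interval
        self.last_flush = time.monotonic()

    def record(self, event, target):
        now = time.time()
        self.buffer.append({
            "date": time.strftime("%Y-%m-%d", time.localtime(now)),
            "timestamp": f"{now:.3f}",
            "event": event,        # e.g., OnClick, HoverEnter, HoverExited
            "target": target,      # clicked or gazed-at object name
        })
        if time.monotonic() - self.last_flush >= self.interval:
            self.flush()

    def flush(self):
        """Write buffered rows so at most one interval of data can be lost."""
        self.writer.writerows(self.buffer)
        self.buffer.clear()
        self.last_flush = time.monotonic()

stream = io.StringIO()
log = InteractionLogger(stream)
log.record("OnClick", "menu_search_button")
log.record("HoverEnter", "ship_3d_model")
log.flush()
```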
The user monitoring system was further enhanced to capture detailed metrics related to the on-device ASR system usage. For each voice interaction that the user triggered, the system recorded data that included the original audio sample processed by the ASR system, the raw decoded transcription produced by the ASR model and decoder, the final command mapping after processing through the Levenshtein distance matching algorithm, and accuracy metrics, measured between the raw transcription and the expected phrase of the executed command, such as WER, CER, and the end-to-end latency of the ASR pipeline.
The ASR data collection was triggered automatically whenever the user touched the voice command button in the floating menu. The system started the audio capture process through the WinRT API, processed the audio through the ASR model running on ONNX Runtime and logged all the previously mentioned data upon completion of the voice command to a CSV file automatically, with each row in such a file corresponding to each interaction between the user and the ASR system.
To ensure data integrity and prevent information loss, the ASR system performed the logging operations immediately after each voice command was processed. Any errors that arose during the operation of the ASR pipeline were also logged to a separate .log file, storing any available voice samples for future debugging of such errors.
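The WER and CER values logged per voice interaction are standard edit-distance ratios between the reference phrase and the transcription. A minimal sketch of the two metrics (illustrative, not the application’s implementation):

```python
def edit_distance(ref, hyp):
    """Levenshtein distance over arbitrary token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        cur = [i]
        for j, h in enumerate(hyp, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (r != h)))     # substitution
        prev = cur
    return prev[-1]

def wer(ref: str, hyp: str) -> float:
    """Word Error Rate: word-level edits over reference word count."""
    words = ref.split()
    return edit_distance(words, hyp.split()) / len(words)

def cer(ref: str, hyp: str) -> float:
    """Character Error Rate: character-level edits over reference length."""
    return edit_distance(list(ref), list(hyp)) / len(ref)

# One substituted word out of four gives a WER of 0.25.
print(wer("borra todas as cotas", "borra toda as cotas"))  # → 0.25
```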
5. Evaluation
The tests in a real environment were conducted at Navantia’s facilities in the Ferrol shipyard. The objective of the tests performed in the workshops was for the end users to test and validate the application and to decide whether what had been developed so far met the established requirements or whether modifications and improvements to the functionalities were needed.
5.1. Usability Validation
Besides the routine test mentioned above, where it is checked that the application works correctly, validating the application’s usability is essential, especially for MR technologies. Users often lack familiarity with smart glasses or may even be using them for the first time.
Therefore, NASA-TLX was adopted, one of the best known tools for assessing the subjective workload required to perform a task [
38]. NASA-TLX evaluates workload through six dimensions: mental, physical and temporal demands, performance, effort and frustration. By combining subjective ratings for each dimension, it calculates an overall workload score, providing insights into the user’s perception of task difficulty.
NASA-TLX methodology involves two phases. Prior to task execution, users weight each dimension through binary comparisons to identify the most relevant factors. Following task completion, users rate each dimension using a 20-point scale, with scores ranging from 0 to 100.
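The two-phase procedure above reduces to a simple weighted mean: each dimension receives a weight (0 to 5, from the 15 binary comparisons) and a 0–100 rating, and the overall workload is the sum of weight × rating divided by 15. A sketch, with invented responses purely for illustration (not data from this study):

```python
# Standard NASA-TLX weighted scoring (responses below are illustrative).
DIMENSIONS = ["mental", "physical", "temporal",
              "performance", "effort", "frustration"]

def tlx_score(weights: dict, ratings: dict) -> float:
    """Overall workload = sum(weight * rating) / 15."""
    assert sum(weights.values()) == 15  # 15 binary comparisons in total
    return sum(weights[d] * ratings[d] for d in DIMENSIONS) / 15

weights = {"mental": 5, "physical": 1, "temporal": 2,
           "performance": 3, "effort": 3, "frustration": 1}
ratings = {"mental": 60, "physical": 30, "temporal": 45,
           "performance": 30, "effort": 50, "frustration": 15}
print(tlx_score(weights, ratings))  # → 45.0
```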
While widely used since the 1980s [
38], the application of NASA-TLX in XR remains limited. Although the SIM-TLX variant [
39] exists for VR, it focuses heavily on simulated environments. This makes it less applicable to AR and MR, which are built upon the real world. Furthermore, SIM-TLX incorporates ten dimensions: mental, physical and temporal demands, frustration, task complexity, situational stress, distraction, perceptual strain, task control and presence. These new dimensions increase the binary comparisons from 15 to 45, resulting in a tedious evaluation process. Many of its dimensions, such as presence or situational stress, do not align well with all XR scenarios.
Taking all of this information into consideration, the following XR NASA-TLX was proposed. By adapting this methodology to XR, it measures the mental workload of operators using smart glasses as a substitute for traditional blueprints and techniques. It has five additional dimensions tailored to XR experiences: physical comfort, visual comfort, general comfort, ease of use and application usability. A previous study carried out by the authors provides an in-depth description of the proposed methodology [
40].
During the usability validation process, an initial test compared traditional paper-based tasks with the MR application. This study revealed difficulties in data collection and user collaboration (for further information, refer to our previous work). Consequently, a user monitoring system (
Section 4.9) was designed to automatically collect usage metrics within the MR application. This approach eliminates the need for manual feedback from the users and provides objective data on interactions, clicks, focused objects and usage time.
After this automatic collection system was implemented and incorporated into the MR application, a second study was conducted. This test, involving 23 shipyard workers and engineers, evaluated how familiarity with MR and HoloLens 2 affects interaction metrics.
Regarding participant selection, a convenience sampling approach was employed where all available workers who attended the scheduled testing session were included as participants. This sampling strategy was methodologically appropriate for three reasons: First, access to shipyard workers is restricted by production schedules and may also be affected by safety protocols. Second, the study required participants with domain-specific expertise in electrical outfitting tasks. Third and last, real-world trials with actual end-users were prioritized over artificial laboratory conditions. Although a larger pool of potential end users was summoned from the workforce, the final sample comprised only those who were finally present at the testing session.
To ensure methodological rigor, the study was performed with a sample size of 23 participants and randomization was implemented to mitigate order biases. All participants followed a standardized protocol with identical briefing materials, task instructions and testing conditions. The shipyard block location, Microsoft HoloLens 2 smart glasses and application configuration remained constant across trials. To eliminate observer bias, data collection was fully automated using the monitoring system described in
Section 4.9 with identical CSV logging formats and sampling frequencies for all participants. Such CSV files were automatically timestamped and were anonymized to protect participant privacy.
The study acknowledges potential selection bias inherent in convenience sampling; however, the 23 participants represent a high percentage of the total workers initially summoned, with non-participation attributed primarily to schedule conflicts rather than self-exclusion based on technical ability. This approach aims to balance such methodological rigor with the practical constraints of industrial field research.
Participants were asked to explore the application freely, with a brief introduction provided only to those unfamiliar with the capabilities of the application. The data (see
Figure 10) showed that users familiar with the application interacted more frequently and explored the environment more deeply, as indicated by a higher number of objects viewed. However, this metric is also influenced by the duration of use, as longer usage times likely lead to more interactions.
Finally, a third experiment confirmed that prior knowledge positively influences efficiency and interaction depth (
Figure 11). Experienced users showed a consistent gaze-per-minute rate but required less time to locate functionalities, reducing the need for visual searching. The data also highlighted significant differences in efficiency (clicks per minute) between the most- and least-experienced users.
Overall, the automatically collected data prove that prior experience improves the effectiveness of the application. Additionally, the XR NASA-TLX improves sensitivity to XR-specific workload factors, providing richer insights into user performance in industrial shipyard operations.
5.2. Functionality Validation
The final tests for the validation of the application were carried out on one of the blocks included in the application. To conduct this experiment, a diverse group of shipyard professionals performed or simulated their daily tasks using Microsoft HoloLens 2 smart glasses and the developed application. After completing the requested tasks, users filled in a form with their feedback on the experience. The tasks performed during this test are listed below:
to take distance measurements by using the automatic dimensions feature, allowing the user to decide which objects to measure;
to obtain the extended information of some of the equipment by using the corresponding label;
to display the pre-calculated dimensions of some of the electrical outfitting elements;
to use the search panel to find a specific element;
to create and to send a notification to alert a department of the shipyard that there is a problem in the installation of an element;
finally, to make use of the voice commands feature to show and hide virtual elements of the vessel.
The analysis of the data obtained during the tests and the results of the surveys carried out, which can be summarized in
Figure 12, has led to the following findings.
The trials involved 24 participants: 13 coordinators, 5 inspectors and 6 workshop operators responsible for vessel installation and construction. Notably, only 4 of the 24 participants had prior experience with XR devices. Despite this lack of familiarity, which typically implies a steep learning curve, 96% of users (95% CI [87.8%, 100%]) found the application easy to use and reported satisfactory image and graphic quality.
Regarding user autonomy, 67% of participants stated they could use the application without assistance. However, 8 users (33%) felt they would need expert help to operate the system correctly. Among these 8 individuals, only two reported initial difficulties and only one struggled to complete the requested tasks.
Overall, 45.83% of users reported no problems during testing, 37.5% experienced minor initial issues that they quickly resolved and 16.67% (4 people) faced difficulties with specific tasks. These results suggest that, while the application is intuitive, the technology’s inherent learning curve requires a brief adaptation period before users become fully independent.
Regarding professional utility, 100% of participants agreed that the application would improve efficiency in assembly and inspection tasks. Specifically, 13 participants found the application accurate enough for assembly, while 22 out of 24 confirmed it provided the necessary precision for inspection. Only 3 users suggested adding extra functionality to support their work. Although some users (45.83%) expressed concerns about the absolute precision for certain tasks, only a single user felt that new functionalities were required.
In terms of usability and impact, all participants believed the application would reduce errors during the assembly phase. Furthermore, every participant agreed that the tool would decrease inspection times, with only one person doubting its potential to reduce assembly time (95.8% agreement).
In conclusion, the tests demonstrate that the application effectively enhances shipyard operations such as assembly, inspection and issue reporting. Even for XR novices, the system remains intuitive and significantly boosts perceived efficiency. These results highlight the potential of MR to streamline shipyard workflows, provided a short introductory period is included for new users.
5.3. On-Device ASR System Evaluation
In order to evaluate the effectiveness of the integrated on-device ASR system, as well as its accuracy and latency inside the shipyard, the participants who took part in the functionality validation tests were asked to issue a series of voice commands in Galician during the tests in the block of the ship. The tested voice commands included, but were not limited to:
Toggling visibility of structural or electrical outfitting elements (e.g., “Amosa/agocha a estrutura” (show/hide the structure)).
Managing virtual dimensions that were automatically generated (e.g., “Borra todas as cotas” (delete all dimensions)).
Throughout the testing phase, participants were instructed to also utilize the speech-to-text dictation functionality when issuing alerts within the notifications panel, enabling them to freely articulate detailed descriptions for the newly created alerts.
As can be seen in
Figure 13, the on-device ASR system demonstrated robust performance in a noisy and crowded environment like the shipyard. It achieved a mean CER of 16.5% (SD = 9.9%, tested against a minimum acceptable threshold of 30% for voice command recognition applications) and, with these results, obtained 100% intent recognition accuracy, meaning that the application always executed the action the user intended with the issued voice command. Moreover, after analyzing the metadata and recordings of the tests performed, it was observed that the overlapping voices in the confined workspace where the test was carried out were the main obstacle for the ASR system, rather than the more intense background noise of heavy machinery and power tools. An example of the enclosed space where the tests were carried out can be seen in
Figure 14.
Figure 13 also highlights that the latency remained consistently below 3 seconds across all the tests performed by the users, with a mean value of 2.76 s (SD = 0.12 s).
6. Limitations and Future Work
Several limitations exist regarding scalability and production deployment, especially hardware-wise. The Microsoft HoloLens 2 has limited battery life and constrained computational capabilities, and comfort during long shifts can be significantly impacted. To mitigate these issues, users can utilize external batteries (as can be seen in
Figure 14) or rely on Edge Computing devices to handle complex computational tasks. For full-scale production, industrial-grade editions of the headset, such as the Trimble XR10, may offer better safety and durability.
Full deployment also requires robust device management processes. This includes protocols for application updates and infrastructure for data synchronization with existing shipyard systems. Moreover, the harsh shipyard environment (which includes dust, humidity and salty air, as well as harsh chemicals) may accelerate device wear. Maintaining accuracy also requires regular eye-tracking calibration for each operator. In addition, the ASR model may undergo periodic updates in case new technical terminology is added or if shipyard workflows evolve.
The integration of MR hardware is designed to complement existing inspection and assembly protocols rather than replace them abruptly. Transitioning from paper-based to digital workflows requires a phased adoption strategy. During this period, both systems will coexist to ensure operational continuity. By aligning virtual overlays with traditional processes, the system enhances data accessibility while minimizing disruption.
However, compatibility with the whole shipyard IT infrastructure remains a challenge. Integrating the application with Product Lifecycle Management (PLM), CAD and Enterprise Resource Planning (ERP) platforms requires standardized data formats and synchronization protocols. Furthermore, user authentication must align with the strict shipyard Identity and Access Management (IAM) systems.
Regarding the evaluation methodology and possible biases, the participants in the functionality validation tests did not have prior experience with MR devices in most cases (only 4 out of the 24 participants had used MR devices previously), which could have negatively influenced their perception of the ease of use of the application and its effectiveness. However, most participants were shipyard workers and engineers with domain expertise in electrical outfitting tasks, which adds a possible bias towards positive feedback on the utility provided by the application in their specific tasks even if this validates its practical applicability for those specific shipyard workers.
In terms of accuracy, while the ASR system achieved a 100% intent recognition accuracy, the CER of 16.5% indicates that transcription errors still occurred, especially in cases where secondary voices from other users were heard in the background, making it difficult for the ASR model to distinguish between the main user’s voice and background voices that should be ignored.
Future steps to improve speech recognition accuracy include fine-tuning the model for the specific noise conditions typical of shipyards, which can be done by expanding the datasets used for fine-tuning and augmenting them with noise injection or reverberation to simulate these environments. The inclusion of a Language Model (LM) for LM-guided decoding is also a viable way to increase transcription accuracy, but the limited computational capabilities of the Microsoft HoloLens 2 make this approach impractical if a positive user experience and application performance are to be maintained. Cloud-based adaptation would be technically feasible, but it would compromise the offline operational requirement essential for shipyard environments with unreliable connectivity and rigid security protocols.
A potential future mitigation strategy in this direction would be the usage of class-based language modeling techniques, where a general ASR model handles common speech, while a separate finite-state component manages the more predictable transcription patterns of technical identifiers prevalent in the shipyard (e.g., “Block 511”, “Component ABC123”…). Regarding the ASR system, possible domain biases can arise, for example, due to the use of certain respiratory protective equipment, which can modify voice frequencies and create a specific acoustic and phonetic set of characteristics.
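The class-based idea can be illustrated with a toy regex-based post-processor for identifier patterns. Everything here is an assumption for the sketch (the digit vocabulary, pattern and function name); a production system would likely use a proper finite-state transducer rather than regular expressions:

```python
import re

# Map spoken Galician digit words to digits so that predictable patterns
# like "bloque cinco un un" normalize to "Block 511" (toy vocabulary).
DIGITS = {"cero": "0", "un": "1", "dous": "2", "tres": "3", "catro": "4",
          "cinco": "5", "seis": "6", "sete": "7", "oito": "8", "nove": "9"}

ALT = "|".join(DIGITS)
# "bloque" followed by one or more digit words, each on a word boundary.
PATTERN = re.compile(rf"bloque ((?:{ALT})\b(?: (?:{ALT})\b)*)")

def normalize_identifiers(text: str) -> str:
    """Rewrite spelled-out block identifiers into canonical form."""
    def repl(match):
        return "Block " + "".join(DIGITS[w] for w in match.group(1).split())
    return PATTERN.sub(repl, text)

print(normalize_identifiers("bloque cinco un un"))  # → Block 511
```

Routing such predictable spans through a deterministic component leaves the general ASR model responsible only for free-form speech.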
Finally, other specific lines of future work include the automatic positioning of virtual elements based on CAD models of the blocks of the ship and their overlap with the real-world environment, which would reduce the need for manual adjustments by the user and would improve the overall user experience when working with the application. In addition, the use of Artificial Vision models such as Instance Segmentation models to automatically identify real-world elements and to perform automatic alignment monitoring could further enhance the ease of use and accuracy of the application.
7. Conclusions
This study presented the design and development of an MR application for Microsoft HoloLens 2 smart glasses, aimed at improving the efficiency and accuracy of shipyard workers during the installation of electrical outfitting systems. By leveraging the capabilities of MR, the application enables operators to visualize virtual elements overlaid directly on real-world vessel structures.
Despite 83% of participants having no prior XR experience, 96% found the application easy to use, and 67% reported being able to operate it independently after an initial short adaptation period.
The functionality validation tests conducted on the block of the ship with electrical outfitting tasks showed that 100% of users agreed that the application would reduce assembly errors and enhance efficiency in assembly and inspection tasks. The perceived precision varied by task type: 92% of participants confirmed sufficient accuracy for inspection tasks.
The on-device ASR system proved robust in the tested industrial conditions, achieving 100% intent recognition accuracy, a CER of 16.5% and an average response latency of 2.76 s, which ensures that real-time interaction is feasible.
Based on the obtained results, the application demonstrated potential to reduce reliance on paper-based blueprints through real-time dimension visualization.
In conclusion, within the specific scope of the tested electrical outfitting tasks in the block of the ship, this article demonstrates the potential of MR as an effective tool for inspection tasks and provides practical results for further improvement towards its use in assembly tasks, thus contributing to the evolving ecosystem of MR technologies in the naval industry.
Author Contributions
Design, A.V.-B. and A.V.-P.; software, A.V.-B. and A.V.-P.; experiments, A.V.-B. and A.V.-P.; writing—original draft preparation, A.V.-B. and A.V.-P.; writing—review and editing, A.V.-B., A.V.-P., T.M.F.-C., J.V.-M. and P.F.-L.; supervision, J.V.-M., T.M.F.-C. and P.F.-L.; funding acquisition, T.M.F.-C. All authors have read and agreed to the published version of the manuscript.
Funding
This work has been supported by Centro Mixto de Investigación UDC-NAVANTIA (IN853C 2022/01), funded by GAIN (Xunta de Galicia) and ERDF Galicia 2021–2027.
Institutional Review Board Statement
This study involves only fully anonymized data and does not include any collection of personal or sensitive data. The study did not require ethical approval.
Informed Consent Statement
Verbal informed consent was obtained from the participants. The rationale for utilizing verbal consent is that the study involved minimal risk, did not include the collection of sensitive personal data and was conducted in an academic context. Verbal consent ensured accessibility and voluntary participation while maintaining ethical standards.
Data Availability Statement
Data are contained within the article.
Acknowledgments
We would like to thank all the Navantia personnel who collaborated with us both during the design and development of the electrical outfitting MR tool and also during the tests carried out in the shipyard workshops. A special mention should be given to the cooperation and support provided by Marcos Varela-Vigo and Alejandra Caamaño-Pestonit throughout all the steps of the project. We would also like to extend our recognition to Oscar Blanco-Novoa, former researcher at CEMI who collaborated during the design process of the MR application.
Conflicts of Interest
The authors declare no conflict of interest. Javier Vilar-Martínez was employed by Navantia S. A. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
References
- Munirathinam, S. Industry 4.0: Industrial internet of things (IIoT). In Advances in Computers; Elsevier: Amsterdam, The Netherlands, 2020; Volume 117, pp. 129–164. [Google Scholar]
- de Souza Cardoso, L.F.; Mariano, F.C.M.Q.; Zorzal, E.R. A survey of industrial augmented reality. Comput. Ind. Eng. 2020, 139, 106159. [Google Scholar] [CrossRef]
- Sharma, A.; Mehtab, R.; Mohan, S.; Mohd Shah, M.K. Augmented reality—An important aspect of Industry 4.0. Ind. Robot. Int. J. Robot. Res. Appl. 2022, 49, 428–441. [Google Scholar] [CrossRef]
- Grazi, L.; Feijoo Alonso, A.; Gąsiorek, A.; Pertusa Llopis, A.M.; Grajeda, A.; Kanakis, A.; Rodriguez Vidal, A.; Parri, A.; Vidal, F.; Ergas, I.; et al. Methodology and Challenges of Implementing Advanced Technological Solutions in Small and Medium Shipyards: The Case Study of the Mari4_YARD Project. Electronics 2025, 14, 1597. [Google Scholar] [CrossRef]
- Navantia. Shipyard 5.0 Strategic Network, Products Catalog. Available online: https://www.navantia.es/en/catalog/ (accessed on 26 August 2025).
- Hound, N. Wärtsilä Moves Towards Remote Guidance for Vessel Repair and Maintenance. 2019. Available online: https://www.iims.org.uk/wartsila-moves-towards-remote-guidance-for-vessel-repair-and-maintenance (accessed on 30 December 2025).
- De Felice, F.; Cannito, A.R.; Monte, D.; Vitulano, F. S.A.M.I.R.: Supporting Tele-Maintenance with Integrated Interaction Using Natural Language and Augmented Reality. In Human-Computer Interaction—INTERACT 2021: 18th IFIP TC 13 International Conference, Bari, Italy, 30 August–3 September 2021, Proceedings, Part V; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2021; Volume 12936, pp. 280–284. [Google Scholar] [CrossRef]
- Rosilius, M.; Spiertz, M.; Wirsing, B.; Geuen, M.; Bräutigam, V.; Ludwig, B. Impact of Industrial Noise on Speech Interaction Performance and User Acceptance when Using the MS HoloLens 2. Multimodal Technol. Interact. 2024, 8, 8. [Google Scholar] [CrossRef]
- Ludwig, H.; Schmidt, T.; Kühn, M. Voice user interfaces in manufacturing logistics: A literature review. Int. J. Speech Technol. 2023, 26, 627–639. [Google Scholar] [CrossRef]
- Cheng, J.C.; Chen, K.; Chen, W. State-of-the-art review on Mixed Reality applications in the AECO industry. J. Constr. Eng. Manag. 2020, 146, 03119009. [Google Scholar] [CrossRef]
- Wang, P.; Bai, X.; Billinghurst, M.; Zhang, S.; Zhang, X.; Wang, S.; He, W.; Yan, Y.; Ji, H. AR/MR remote collaboration on physical tasks: A review. Robot. Comput.-Integr. Manuf. 2021, 72, 102071. [Google Scholar] [CrossRef]
- Li, X.; Yi, W.; Chi, H.L.; Wang, X.; Chan, A.P. A critical review of Virtual and Augmented Reality (VR/AR) applications in construction safety. Autom. Constr. 2018, 86, 150–162. [Google Scholar] [CrossRef]
- Schiavi, B.; Havard, V.; Beddiar, K.; Baudry, D. BIM data flow architecture with AR/VR technologies: Use cases in architecture, engineering and construction. Autom. Constr. 2022, 134, 104054. [Google Scholar] [CrossRef]
- Opoku, D.G.J.; Perera, S.; Osei-Kyei, R.; Rashidi, M. Digital twin application in the construction industry: A literature review. J. Build. Eng. 2021, 40, 102726. [Google Scholar] [CrossRef]
- Wang, J.; Zhu, M.; Fan, X.; Yin, X.; Zhou, Z. Multi-channel augmented reality interactive framework design for ship outfitting guidance. IFAC-PapersOnLine 2020, 53, 189–196. [Google Scholar] [CrossRef]
- Polycam. Cross-Platform 3D Scanning Floor Plans & Drone Mapping. Available online: https://poly.cam/ (accessed on 26 August 2025).
- Graphisoft. BIMx. Available online: https://graphisoft.com/solutions/bimx/ (accessed on 26 August 2025).
- Graphisoft. Archicad. Available online: https://graphisoft.com/solutions/archicad/ (accessed on 26 August 2025).
- eDrawings. View CAD Files in AR/VR. Available online: https://www.edrawingsviewer.com/view-cad-files-arvr (accessed on 26 August 2025).
- Tadeja, S.K.; Rydlewicz, W.; Lu, Y.; Bubas, T.; Rydlewicz, M.; Kristensson, P.O. Measurement and inspection of photo-realistic 3-D VR models. IEEE Comput. Graph. Appl. 2021, 41, 143–151. [Google Scholar] [CrossRef]
- Połap, D. Voice Control in Mixed Reality. In Proceedings of the 2018 Federated Conference on Computer Science and Information Systems (FedCSIS); ACSIS: Marlton, NJ, USA, 2018; pp. 497–500. [Google Scholar] [CrossRef]
- Vidal-Balea, A.; Blanco-Novoa, O.; Fraga-Lamas, P.; Vilar-Montesinos, M.; Fernández-Caramés, T.M. Creating collaborative Augmented Reality experiences for industry 4.0 training and assistance applications: Performance evaluation in the shipyard of the future. Appl. Sci. 2020, 10, 9073. [Google Scholar] [CrossRef]
- Sailor, H.; Patil, A.; Patil, H. Advances in Low Resource ASR: A Deep Learning Perspective. In Proceedings of the 2018 Speech and Language Technology in Under-Resourced Languages (SLTU), Gurugram, India, 29–31 August 2018; pp. 15–19. [Google Scholar] [CrossRef]
- Azizah, K.; Adriani, M.; Jatmiko, W. Hierarchical Transfer Learning for Multilingual, Multi-Speaker, and Style Transfer DNN-Based TTS on Low-Resource Languages. IEEE Access 2020, 8, 179798–179812. [Google Scholar] [CrossRef]
- Conneau, A.; Khandelwal, K.; Goyal, N.; Chaudhary, V.; Wenzek, G.; Guzmán, F.; Grave, E.; Ott, M.; Zettlemoyer, L.; Stoyanov, V. Unsupervised Cross-lingual Representation Learning at Scale. arXiv 2019, arXiv:1911.02116. [Google Scholar]
- Pratap, V.; Sriram, A.; Tomasello, P.; Hannun, A.; Liptchinsky, V.; Synnaeve, G.; Collobert, R. Massively Multilingual ASR: 50 Languages, 1 Model, 1 Billion Parameters. arXiv 2020, arXiv:2007.03001. [Google Scholar] [CrossRef]
- Baevski, A.; Zhou, Y.; Mohamed, A.; Auli, M. wav2vec 2.0: A framework for self-supervised learning of speech representations. Adv. Neural Inf. Process. Syst. 2020, 33, 12449–12460. [Google Scholar]
- Kim, C.; Gowda, D.; Lee, D.; Kim, J.; Kumar, A.; Kim, S.; Garg, A.; Han, C. A Review of On-Device Fully Neural End-to-End Automatic Speech Recognition Algorithms. In Proceedings of the 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 1–4 November 2020; pp. 277–283. [Google Scholar]
- Marchisio, A.; Hanif, M.A.; Khalid, F.; Plastiras, G.; Kyrkou, C.; Theocharides, T.; Shafique, M. Deep Learning for Edge Computing: Current Trends, Cross-Layer Optimizations, and Open Research Challenges. In Proceedings of the 2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), Miami, FL, USA, 15–17 July 2019; pp. 553–559. [Google Scholar] [CrossRef]
- Schröter, H.; Rosenkranz, T.; Escalante-B, A.N.; Maier, A. Low Latency Speech Enhancement for Hearing Aids Using Deep Filtering. IEEE/ACM Trans. Audio Speech Lang. Proc. 2022, 30, 2716–2728. [Google Scholar] [CrossRef]
- Park, S.M.; Kim, Y.G. A Metaverse: Taxonomy, Components, Applications, and Open Challenges. IEEE Access 2022, 10, 4209–4251. [Google Scholar] [CrossRef]
- Chute, D.O. Noise Control Methods for Shipbuilding. National Shipbuilding Research Program. 2012. Available online: https://www.nsrp.org/wp-content/uploads/2015/09/Deliverable-2012-424-Noise_Control_Methods_Final_Report-Atrium.pdf (accessed on 27 January 2026).
- Garcia, C.; Ortega, M.; Ivorra, E.; Contero, M.; Mora, P.; Alcañiz, M.L. Holorailway: An augmented reality system to support assembly operations in the railway industry. Adv. Manuf. 2024, 12, 764–783. [Google Scholar] [CrossRef]
- Unity. Unity Real-Time Development Platform|3D, 2D, VR & AR Engine. Available online: https://unity.com/ (accessed on 26 August 2025).
- Microsoft. Mixed Reality Toolkit 3. Available online: https://learn.microsoft.com/en-us/windows/mixed-reality/mrtk-unity/mrtk3-overview/ (accessed on 26 August 2025).
- Valladares-Poncela, A.; Fraga-Lamas, P.; Fernández-Caramés, T.M. On-Device Automatic Speech Recognition for Low-Resource Languages in Mixed Reality Industrial Metaverse Applications: Practical Guidelines and Evaluation of a Shipbuilding Application in Galician. IEEE Access 2025, 13, 77017–77038. [Google Scholar] [CrossRef]
- Froiz-Míguez, I.; Fraga-Lamas, P.; Fernández-Caramés, T.M. Design, Implementation, and Practical Evaluation of a Voice Recognition Based IoT Home Automation System for Low-Resource Languages and Resource-Constrained Edge IoT Devices: A System for Galician and Mobile Opportunistic Scenarios. IEEE Access 2023, 11, 63623–63649. [Google Scholar] [CrossRef]
- Hart, S.G.; Staveland, L.E. Development of NASA-TLX (Task Load Index): Results of empirical and theoretical research. In Advances in Psychology; Elsevier: Amsterdam, The Netherlands, 1988; Volume 52, pp. 139–183. [Google Scholar]
- Harris, D.; Wilson, M.; Vine, S. Development and validation of a simulation workload measure: The simulation task load index (SIM-TLX). Virtual Real. 2020, 24, 557–566. [Google Scholar] [CrossRef]
- Vidal-Balea, A.; Fraga-Lamas, P.; Fernández-Caramés, T.M. Advancing NASA-TLX: Automatic User Interaction Analysis for Workload Evaluation in XR Scenarios. In Proceedings of the 2024 IEEE Gaming, Entertainment, and Media Conference (GEM), Turin, Italy, 5–7 June 2024; IEEE: New York, NY, USA, 2024; pp. 1–6. [Google Scholar]
Figure 1.
Elements visible in the application that constitute the structure and electrical elements of this section of the vessel. © Picture courtesy of Navantia S. A. S.M.E.
Figure 2.
Overview of the application: (a) main panel to toggle the visibility of items; (b) panel for searching elements by identifier and filtering items by item type.
Figure 3.
Overview of the application: (a) panel to administer the task manager; (b) panel for switching between the vessel’s blocks.
Figure 4.
Overview of the application: (a) collapsed tag: shows element IDs; (b) extended tag: presents all information related to the given element.
Figure 5.
Light bulb animation that helps to visually identify the elements that meet the search criteria.
Figure 6.
User interface panel that allows the user to filter results by item type, block, construction stage, parametric control and PBS.
Figure 7.
Automatic dimension generation system: (a) dimension created between two main electrical outfitting supports; (b) dimension created between a support and the structure, displaced from its original position, with auxiliary lines indicating the points to which the dimension refers. © Pictures courtesy of Navantia S. A. S.M.E.
Figure 8.
Overview of the application: (a) notification panel; (b) voice dictation feature added to the notification panel.
Figure 9.
Main menu navigation pane showing the voice control button used to activate the on-device ASR system.
Figure 10.
Number of total interactions (left) and focused objects (right) grouped by the users’ level of knowledge of the application, showing that prior experience with the application influences interaction type and frequency.
Figure 11.
Results of the experiment showing the influence of prior MR experience on several interaction metrics, such as usage time, gazes per minute, clicks per minute and focused objects.
Figure 12.
Percentage of positive answers per user type for each question in the survey.
Figure 13.
Heatmap showing the distribution of latency and character error rate during the ASR system tests inside the shipyard.
Figure 14.
Workshop scenario where the tests were performed. © Picture courtesy of Navantia S. A. S.M.E.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.