Reprint

Automatic Speech Recognition and Understanding in Air Traffic Management

Edited by
March 2024
318 pages
  • ISBN978-3-7258-0316-3 (Hardback)
  • ISBN978-3-7258-0315-6 (PDF)

This book is a reprint of the Special Issue Automatic Speech Recognition and Understanding in Air Traffic Management that was published in

Engineering
Summary

Automatic Speech Recognition and Understanding (ASRU) has the potential to reduce air traffic controllers’ (ATCos) workload and to enhance air traffic management (ATM) safety. Automatic Speech Recognition (ASR) transforms voice signals into a sequence of words, e.g., “speed bird four one six descend flight level one two zero”. Automatic Speech Understanding extracts the meaning from this word sequence, e.g., that the aircraft with the callsign “BAW416” should “DESCEND” to roughly “twelve thousand feet”. The Special Issue contains 12 articles authored by 54 different authors, working for 23 institutions that are located in 13 countries on four continents. These articles discuss (a) ontologies for modeling words and semantics, (b) the extraction of aircraft callsigns and complex commands, (c) ASRU support for ATCos through callsign highlighting, the filling of aircraft radar labels and flight strips for approach, tower, and apron environments, (d) supporting simulation pilots, (e) speech activity detection, speaker role classification, natural language processing, English language identification, (f) combining ASRU with surveillance data, (g) joining speech and gaze data, and (h) a safety assessment.

Format
  • Hardback
License
© 2022 by the authors; CC BY-NC-ND license
Keywords
speech recognition; human–computer interaction; situational awareness; air traffic management; air traffic controller; flight callsign; ASR; VRS; air traffic controller training; simulation-pilot agent; BERT; automatic speech recognition and understanding; speech synthesis; automatic speech recognition; natural language understanding; semantic interpretation; air traffic control; radio communications; intent representation; semantic ontology; performance metrics; automatic speech recognition; automatic speech understanding; air traffic management; air traffic controller; radar label; human factors; assistant system; human-in-the-loop simulation; air traffic controller; multiple remote tower; assistant-based speech recognition; automatic speech recognition and understanding; electronic flight strips; air traffic controller; simulation pilot; workload; assistant-based speech recognition; automatic speech recognition and understanding; apron control; STARFiSH; speech recognition; voice-driven control; acoustic model; grammar network; syntax analysis; semantic analysis; unmanned aerial vehicle (UAV); UAV control; speaker clustering; speaker role detection; air traffic control communications; automatic speech recognition and understanding; OpenSky Network; callsign recognition; ADS-B data; safety assessment; air traffic control; automatic speech recognition; workload; situational awareness; en-route sector; approach sector; fatigue recognition; air traffic controller; feature fusion; multi-mode; ATC; ASR; HMM; DNN; RNN; WER; VHF; ADS-B; METAR; GMTT; speech corpus; deep speech; call sign detection; levenshtein distance; fuzzy string matching