1. Introduction
With the progressive implementation of remote tower technology in civil aviation operations, the working paradigm of air traffic controllers (ATCOs) is shifting from traditional physical towers to highly virtualized, information-intensive remote environments. This transformation not only redefines the controller’s operational workspace, but also alters how traffic information is perceived and processed through human–machine interfaces. Within a remote tower setting, controllers are required to monitor virtual tower views composed of high-resolution multi-channel surveillance displays for extended periods, relying on electronic flight strips, radar systems, and voice communications to manage arriving and departing aircraft with precision [1,2]. These workspaces are typically enclosed and lit primarily by artificial lighting, lacking the natural light variation that aids visual comfort. The combination of prolonged screen exposure and high cognitive demands makes controllers particularly vulnerable to visual fatigue, manifesting as reduced accommodative function, diminished attention, and slowed responses—factors that could compromise operational stability and aviation safety [3].
Visual fatigue refers to a functional decline in the visual system resulting from sustained and intense visual tasks, often caused by prolonged strain on the eye-brain system and associated physiological or psychological stressors [4,5]. Existing assessment methods are generally categorized into subjective evaluations and objective measurements, targeting individual perception and physiological responses, respectively. Subjective methods rely on standardized questionnaires to quantify perceived fatigue following specific tasks. Commonly used instruments include the Visual Fatigue Questionnaire (VFQ), Subjective Symptoms Questionnaire (SSQ), and scenario-specific visual discomfort scales [6,7,8,9,10]. For example, Auffret et al. found through self-report measures that both short- and long-term screen exposure significantly exacerbates subjective visual fatigue symptoms [11]. Although subjective questionnaires are easy to administer and effectively capture participants’ personal experiences, they are inherently prone to cognitive biases and self-perception errors, which limit their objectivity.
Objective assessments quantify visual fatigue through a variety of physiological markers, including visual function tests, biosignal monitoring, and neural activity analysis. Ophthalmologic parameters such as contrast sensitivity, near point of accommodation (NPA), near point of convergence (NPC), and tear film breakup time (TBUT) are commonly used to evaluate structural and functional changes in the visual system. These features reflect a person’s visual discrimination ability, ocular flexibility, binocular coordination, and ocular surface health, offering robust objective evidence of visual fatigue [10,12,13]. For example, Rossi et al. systematically evaluated visual fatigue in video terminal operators using contrast sensitivity and related measures [13]. Among eye-tracking (ET) metrics, blink frequency, pupil diameter, saccade velocity, and fixation duration are widely adopted to assess task intensity and fatigue levels [14]. For instance, Wang et al. developed a real-time visual fatigue assessment model based on unobtrusive eye-tracking and blink features [15]. Electroencephalography (EEG) also provides critical neurophysiological insights into visual fatigue. Indicators such as the theta-to-beta ratio (θ/β), power spectral entropy, and regional power dynamics have been shown to correlate with declining alertness and increasing cognitive load [16,17,18]. For example, Lee et al. found that visual stimulation under 2D/3D, AR, and VR conditions elicited distinct patterns of delta, theta, and alpha wave activity in the frontal, occipital, and parietal cortices [19]. Additionally, electrocardiogram (ECG) and electrodermal activity (EDA) measures are frequently used to evaluate autonomic nervous system responses and arousal states under fatigue. ECG-derived heart rate variability (HRV) indicators such as LF, HF, and RMSSD, along with skin conductance level (SCL) and skin conductance responses (SCRs), have proven effective in identifying fatigue-related changes [4,20,21]. For example, Wang et al. employed EEG-ECG multimodal fusion to improve real-time fatigue detection in driving scenarios, while Sameri et al. validated the utility of EDA in monitoring cybersickness and visual fatigue in VR environments [22,23].
Several modeling approaches have been proposed to enhance fatigue state recognition accuracy and generalizability. Yuan et al. developed a dynamic Bayesian network (DBN)-based visual fatigue model, which improved the inference performance but showed limitations in handling static or segmented task scenarios [24]. Tian et al. introduced a fusion model (DSF) based on entropy-CRITIC weighting to evaluate visual fatigue in SSVEP-based BCI systems [25]. Despite its interpretability, the system’s application was confined to BCI environments. Shi et al. built a deep learning model (AtLSMMs) incorporating display spectral characteristics and EEG time-series data, achieving three-class fatigue classification with promising accuracy, though generalization across participants remained limited [26]. Lu et al. proposed a weakly supervised graph convolutional network (WSGCN-VD) to handle noisy labels in EEG-based visual discomfort detection, but the model’s complexity and lack of interpretability hinder practical deployment [27].
To date, research on visual fatigue has made meaningful progress in multimodal modeling and feature integration. However, key challenges remain in model interpretability, individual adaptability, and context-specific validation. Meanwhile, existing frameworks, such as ICAO’s Fatigue Risk Management System (FRMS) and Eurocontrol’s Human Performance guidelines, primarily emphasize organizational and scheduling controls—such as duty time limitations and rest periods—rather than real-time physiological monitoring and individualized fatigue recognition [28,29,30,31]. While these frameworks provide essential safety foundations, they are not tailored to emerging digital ATC environments, where visual-cognitive demands evolve rapidly and are highly context-sensitive.
In particular, the unique characteristics of remote tower environments—virtualized visual displays, artificial lighting, and sustained demand for situational awareness—may lead to fatigue mechanisms that differ significantly from those in conventional towers or other fatigue-prone settings [3]. As remote towers are designed to maintain or exceed the safety levels of traditional towers, it is imperative to investigate the physiological manifestations and recognition mechanisms of controller visual fatigue in this setting.
The main contributions of this paper are as follows:
A comprehensive multimodal data acquisition framework was established, incorporating subjective questionnaires, ophthalmologic parameters, and physiological recordings (ET, EEG, ECG, EDA), to systematically evaluate visual fatigue during apron control tasks in the remote tower environment;
A two-stage recognition model based on LightGBM and multilayer perceptron (MLP) was proposed, balancing feature interpretability with nonlinear modeling capacity to improve the prediction accuracy;
Based on feature importance rankings and model outcomes, the physiological patterns of remote tower controllers under visual fatigue were analyzed, providing a scientific basis for human factor monitoring and intervention in remote tower environments.
2. Experiment
2.1. Participants
A total of 36 qualified participants were recruited for this study. All participants were undergraduate students majoring in air traffic control at the College of Air Traffic Management, Civil Aviation Flight University of China, who had completed training for ATC positions and passed the relevant competency assessments. All participants met the Class I medical certification requirements issued by the Civil Aviation Administration of China. The mean age was 22.2 years (SD = 0.94), with binocular uncorrected visual acuity of LogMAR ≤ 0.0 and no history of ophthalmologic disorders.
The experimental protocol strictly adhered to the ethical principles of the Declaration of Helsinki. Participation was entirely voluntary, and the participants were informed of their right to withdraw at any stage. All data were anonymized to ensure confidentiality and participant safety.
2.2. Experimental Scenario
The experiment was conducted using the Tower Client simulator, a high-fidelity remote tower simulation platform capable of rendering high-resolution visual environments, processing live traffic data streams, and supporting interactive scheduling tasks. The simulator allowed the experimenters to configure detailed traffic scenarios via an instructor interface. Controllers operated the system using remote tower displays, executing voice instructions, electronic flight strip management, and taxi route planning tasks.
To enhance ecological validity, the simulation was conducted in a fully enclosed control room with constant artificial lighting to replicate the visual load typical of real remote tower environments. To simulate realistic air-ground communication, the experimenters acted as pilots from a physically isolated room using a simulated radio system for real-time verbal interaction with the participants.
The virtual airport environment, referred to as “Hansha Airport”, was modeled after Wuhan Tianhe International Airport (ZHHH) and Changsha Huanghua International Airport (ZGHA), capturing representative features of apron control scenarios. The experiment simulated real apron operations including aircraft arrival, taxiing, stand allocation, and departure scheduling. A fixed flight schedule was adopted, with traffic volume maintained within a relatively stable range. Multiple trial runs were conducted prior to formal data collection to validate the scenario’s stability and timing. The traffic configuration is shown in Table 1.
2.3. Procedure
Participants were instructed to avoid consuming stimulants (e.g., coffee, strong tea) 24 h prior to the experiment and to maintain a minimum of 7 h of sleep the night before to ensure that they entered the experiment free from fatigue.
Before the formal task, each participant received a detailed briefing from an experimenter outlining the procedure, safety considerations, and possible risks. Participants were required to provide informed consent after fully understanding the purpose and protocol of the study. A 5-min pre-task visual relaxation period was conducted to eliminate the baseline visual strain. Subsequently, participants completed a standardized visual fatigue questionnaire to assess the baseline subjective fatigue, followed by a set of ophthalmologic assessments including visual acuity, accommodation response, NPA, NPC, contrast sensitivity, and TBUT.
Following the preparation phase, participants entered a 60-min apron control simulation. During this task, ET, EEG, ECG, and EDA data were continuously recorded in real time. After completing the task, participants immediately underwent a second ophthalmologic assessment and filled out the visual fatigue questionnaire again to capture the post-task changes in both the subjective and objective fatigue indicators. Each participant completed two sessions scheduled at different times to allow for sufficient visual recovery and ensure data independence. The full experimental flow is illustrated in Figure 1A.
2.4. Data Collection
Eye-tracking data were recorded at 100 Hz using Tobii Pro Glasses 3 (Tobii AB, Stockholm, Sweden). ECG signals were acquired with the ErgoLAB biosensing wearable device (Kingfa, Guangzhou, China) at a sampling rate of 512 Hz. EEG data were collected using the ErgoLAB Portable EEG system (32 channels, 512 Hz, Kingfa, China), and EDA signals were obtained using an ErgoLAB wireless EDA sensor (bilateral hand electrodes, 512 Hz, Kingfa, China). The data acquisition devices and experimental setup are shown in Figure 1B.
In addition to physiological signals, participants completed a subjective fatigue questionnaire at the start and end of each session. The questionnaire was developed based on the established visual fatigue literature and clinical ophthalmologic standards, targeting symptoms such as eye discomfort, blurred vision, accommodation difficulties, attention issues, and emotional responses [20]. The Remote Tower Visual Fatigue Questionnaire (RTVF-Q) includes 12 items across six domains. Each item is rated for frequency and severity (0–5 scale), with total scores ranging from 0 to 120, where higher scores indicate greater fatigue. A sample of the RTVF-Q is presented in Table 2.
Objective visual function assessments were also conducted before and after each task. Six key ophthalmologic parameters were measured: best-corrected distance visual acuity (Snellen E-chart), NPC (via RAF rule), NPA (via accommodative rule), accommodation response time (via CV-7800 autorefractor, Ming Sing Optical R&D, Ningbo, China), contrast sensitivity (Pelli–Robson chart under standard lighting), and TBUT (via fluorescein strips, Jingming, Shenzhen, China, and slit-lamp microscope OVS-2, WBO, Guangzhou, China). These indicators, combined with the questionnaire scores, formed the basis for labeling the fatigue states during model training.
During each 60-min session, multimodal data were segmented from two periods: minutes 2–7 after task onset and the final 5 min of the task. The first two minutes were excluded to allow the participants to acclimate and to avoid data distortion caused by the initial cognitive and physiological transitions. Each data segment was paired with corresponding subjective questionnaire scores and objective visual function indicators to comprehensively characterize the fatigue states.
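As a small illustration of the windowing described above, the following sketch slices a continuously recorded, timestamped signal into the two analysis segments. The DataFrame layout and column name (time_s) are assumptions for illustration, not the authors’ implementation.

```python
# Illustrative sketch: extract the "early" (minutes 2-7) and "late" (final
# 5 min) windows from one 60-min session of a timestamped recording.
import pandas as pd

def extract_windows(df: pd.DataFrame, session_len_min: float = 60.0) -> dict:
    t = df["time_s"]  # seconds since task onset (assumed column)
    early = df[(t >= 2 * 60) & (t < 7 * 60)]
    late = df[t >= (session_len_min - 5) * 60]
    return {"early": early, "late": late}
```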
3. Visual Fatigue Modeling
To explore the physiological mechanisms and recognition strategies of visual fatigue among remote tower air traffic controllers, this study developed a multimodal modeling framework based on physiological signals. The overall process consists of dataset construction with fatigue labels and a two-stage recognition model using multimodal inputs.
In the initial stage, eye-tracking, EEG, ECG, and EDA signals were preprocessed and statistically analyzed to select features exhibiting significant differences before and after the task. Fatigue labels were generated by integrating subjective questionnaire scores and objective ophthalmologic parameters. A LightGBM model was then used to assess the feature importance and perform further selection. The most relevant features were subsequently fed into a multilayer perceptron (MLP) for classification. The complete modeling framework is illustrated in Figure 2.
3.1. Dataset Construction
3.1.1. Data Preprocessing and Feature Selection
All physiological signals underwent preprocessing based on established protocols in the literature, with adjustments made according to observations during the current experimental procedures to ensure data quality and stability [32,33,34,35,36].
Eye-tracking signals were processed using the ErgoLAB platform V1.0.7. A moving median filter was applied, and gaps less than 75 ms were linearly interpolated. Fixations were identified with a minimum duration of 60 ms, and adjacent fixations shorter than 75 ms or with angular distances less than 0.5° were merged. Saccades were detected using a velocity threshold of 2 pixels/ms with a duration range of 10–200 ms. Pupil diameter data were rescaled and interpolated. Blinks were identified using a duration threshold of 70–350 ms.
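For illustration only, the snippet below expresses two of the threshold rules above as simple predicates (gaze-loss episodes of 70–350 ms treated as blinks; velocity-threshold saccade detection). The actual processing was performed in the ErgoLAB platform, so these functions and their inputs are assumptions.

```python
# Minimal sketch of the duration/velocity thresholds described above
# (not the ErgoLAB implementation).
def is_blink(gap_duration_ms: float) -> bool:
    # Blinks were identified using a duration threshold of 70-350 ms.
    return 70.0 <= gap_duration_ms <= 350.0

def is_saccade(peak_velocity_px_per_ms: float, duration_ms: float) -> bool:
    # Saccades: velocity threshold of 2 pixels/ms, duration 10-200 ms.
    return peak_velocity_px_per_ms >= 2.0 and 10.0 <= duration_ms <= 200.0
```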
EEG, ECG, and EDA signals were processed using MATLAB R2024a and relevant toolbox extensions. EEG signals were processed using EEGLAB (v2024.0.0) [37]. Data were re-referenced to the average of the M1 and M2 electrodes and filtered with a 0.1–30 Hz bandpass filter to remove drift and high-frequency noise. Independent component analysis (ICA) was performed to remove ocular and muscle artifacts. EEG was recorded with a 32-channel system based on the international 10–20 system to achieve high spatial resolution. Electrodes were grouped into four functional brain regions: frontal, parietal, occipital, and temporal, enabling region-specific feature extraction [38]. Fast Fourier transform (FFT) was used to compute the relative power in the delta (δ), theta (θ), alpha (α), and beta (β) bands. The average of each region’s electrodes was used to represent that region’s neural activity. Additional features included spectral entropy and power band ratios such as θ/β, (α + θ)/β, α/β, and (α + θ)/(α + β). The electrode layout and regional mapping are shown in Figure 3.
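As an illustration of how such spectral features can be derived, the sketch below computes relative band powers, the ratio features, and spectral entropy for a single channel using an FFT-based Welch PSD estimate. It is a sketch under assumed inputs, not the authors’ MATLAB pipeline.

```python
# Illustrative computation of relative band power, band ratios, and spectral
# entropy for one EEG channel (assumed 1-D array at 512 Hz).
import numpy as np
from scipy.signal import welch

BANDS = {"delta": (0.5, 4), "theta": (4, 8), "alpha": (8, 13), "beta": (13, 30)}

def eeg_band_features(signal: np.ndarray, fs: int = 512) -> dict:
    freqs, psd = welch(signal, fs=fs, nperseg=fs * 4)
    in_range = (freqs >= 0.5) & (freqs <= 30)
    total_power = psd[in_range].sum()
    feats = {name: psd[(freqs >= lo) & (freqs < hi)].sum() / total_power
             for name, (lo, hi) in BANDS.items()}
    # Ratio features used in the study.
    feats["theta/beta"] = feats["theta"] / feats["beta"]
    feats["alpha/beta"] = feats["alpha"] / feats["beta"]
    feats["(alpha+theta)/beta"] = (feats["alpha"] + feats["theta"]) / feats["beta"]
    feats["(alpha+theta)/(alpha+beta)"] = (feats["alpha"] + feats["theta"]) / (feats["alpha"] + feats["beta"])
    # Spectral entropy of the normalized PSD.
    p = psd[in_range] / total_power
    feats["spectral_entropy"] = float(-(p * np.log2(p + 1e-12)).sum())
    return feats
```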
Raw ECG and EDA signals were filtered using adaptive bandpass filtering and Bior4.4 wavelet denoising. ECG R-peaks were detected using HEPLAB (v1.0.0), with a heart rate limit of 120 bpm and detection threshold at 70%. From the RR intervals, time-domain HRV features were extracted, and FFT was applied to obtain the frequency-domain indices [39]. EDA signals were processed using LEDALAB (v3.2.5) to separate the SCL and SCR components [40].
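The HRV indices referenced here can be sketched as follows: RMSSD from successive RR differences, and LF, HF, and LF/HF from a PSD of the evenly resampled RR series. This is an illustrative sketch with assumed inputs, not the HEPLAB-based procedure used in the study.

```python
# Sketch of RMSSD and frequency-domain HRV indices from RR intervals (ms).
import numpy as np
from scipy.interpolate import interp1d
from scipy.signal import welch

def hrv_features(rr_ms: np.ndarray) -> dict:
    diffs = np.diff(rr_ms)
    rmssd = float(np.sqrt(np.mean(diffs ** 2)))          # time-domain index
    # Resample the irregularly spaced RR series to a 4 Hz tachogram.
    t = np.cumsum(rr_ms) / 1000.0
    fs = 4.0
    t_even = np.arange(t[0], t[-1], 1.0 / fs)
    rr_even = interp1d(t, rr_ms, kind="cubic")(t_even)
    freqs, psd = welch(rr_even - rr_even.mean(), fs=fs, nperseg=min(256, len(rr_even)))
    lf = psd[(freqs >= 0.04) & (freqs < 0.15)].sum()     # low-frequency power
    hf = psd[(freqs >= 0.15) & (freqs < 0.40)].sum()     # high-frequency power
    return {"RMSSD": rmssd, "LF": float(lf), "HF": float(hf), "LF/HF": float(lf / hf)}
```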
After preprocessing, paired sample t-tests were conducted to evaluate whether each feature differed significantly before and after the task. The significance level was set at α = 0.05. Features showing statistically significant differences (p < 0.05) were retained as candidate variables for subsequent modeling.
Table 3 summarizes the significant features identified through statistical testing, listing the direction of change for each feature before and after the task as well as the corresponding t-values and p-values. These results serve to verify the discriminative power of each feature as an input variable for modeling.
A total of 26 features across multiple physiological domains exhibited significant changes before and after the task. These were selected as the initial input variables for the subsequent machine learning models.
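The pre/post screening step can be illustrated with scipy’s paired t-test; the DataFrame layout (one row per sample, one column per candidate feature, aligned across pre- and post-task measurements) is an assumption.

```python
# Sketch of the paired-t-test feature screening described above.
import pandas as pd
from scipy.stats import ttest_rel

def screen_features(pre: pd.DataFrame, post: pd.DataFrame, alpha: float = 0.05) -> pd.DataFrame:
    rows = []
    for col in pre.columns:
        t_val, p_val = ttest_rel(pre[col], post[col])    # paired pre/post comparison
        rows.append({"feature": col, "t": t_val, "p": p_val, "retained": p_val < alpha})
    return pd.DataFrame(rows).sort_values("p")
```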
3.1.2. Fatigue Label Construction
To construct reliable fatigue labels, visual fatigue was defined as a multidimensional state involving both physiological changes and subjective perception. A combined labeling strategy was used.
For objective indicators, six ophthalmologic parameters were selected (visual acuity, NPA, NPC, accommodation response time, contrast sensitivity, TBUT). Each was normalized using a linear mapping, where 0 indicates no fatigue and 1 indicates severe fatigue. Thresholds for normalization were based on the clinical literature and experimental observations [12,41,42]. To avoid scaling and redundancy issues, a principal component analysis (PCA) was applied to assign weights to these features. The final weights are shown in Table 4.
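One plausible way to derive such PCA-based weights is sketched below: each indicator is weighted by its absolute loadings aggregated in proportion to explained variance, then normalized to sum to one. This is an assumption about the weighting scheme; the study’s actual weights are those reported in Table 4.

```python
# Sketch of deriving weights for the six normalized ophthalmologic
# indicators from a PCA decomposition (assumed scheme, for illustration).
import numpy as np
from sklearn.decomposition import PCA

def pca_weights(X: np.ndarray) -> np.ndarray:
    """X: (n_samples, 6) matrix of normalized indicator scores in [0, 1]."""
    pca = PCA().fit(X)
    loadings = np.abs(pca.components_)                   # (n_components, 6)
    w = pca.explained_variance_ratio_ @ loadings         # variance-weighted loadings
    return w / w.sum()                                   # normalize to sum to 1
```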
For the subjective dimension, the RTVF-Q questionnaire was used to obtain the controllers’ perceived level of visual fatigue. The total score of the scale was normalized and denoted as S_sub, representing the intensity of each individual’s self-reported fatigue experience during the task. To reflect the important role of subjective perception in fatigue determination, and to account for both the sensitivity of the questionnaire and the inherent variability of self-assessment, the present study assigned a weight w_sub to the subjective dimension.
To justify the inclusion and weighting of subjective data, we conducted a Pearson correlation analysis between the total RTVF-Q scores and each of the six objective ophthalmologic indicators. The results revealed moderate to strong positive correlations in most cases (r values ranging from 0.41 to 0.67, p < 0.05), suggesting that the subjective assessments were meaningfully aligned with the physiological changes associated with visual fatigue. These findings support the integration of both subjective and objective components in fatigue labeling. The adopted weighting scheme thus reflects the dual nature of visual fatigue as a psychophysiological state and enhances the robustness of fatigue annotation.
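The correlation check described above can be reproduced with a short script; the column name rtvfq_total and the indicator columns are assumed names for illustration.

```python
# Sketch of the Pearson correlation analysis between RTVF-Q totals and the
# six objective ophthalmologic indicators.
import pandas as pd
from scipy.stats import pearsonr

def questionnaire_vs_objective(df: pd.DataFrame, objective_cols: list[str]) -> pd.DataFrame:
    out = []
    for col in objective_cols:
        r, p = pearsonr(df["rtvfq_total"], df[col])
        out.append({"indicator": col, "r": r, "p": p})
    return pd.DataFrame(out)
```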
The final visual fatigue composite score, denoted F, was constructed to characterize the overall visual fatigue level associated with each segment of data and is defined as follows:
F = Σ_{i=1}^{6} w_i · O_i + w_sub · S_sub
where O_i denotes the i-th normalized ophthalmologic indicator, w_i is its PCA-derived weight (Table 4), and S_sub and w_sub are the normalized RTVF-Q score and its weight defined above. The overall rating F ranges from 0 to 1, with larger values indicating a more severe visual fatigue state.
Furthermore, to classify the visual fatigue state of remote tower controllers in a scientifically grounded manner, the present study utilized the composite score F as the basis for classification. A discrimination threshold ε was defined to determine the fatigue status: if F ≥ ε, the controller is considered to be in a state of visual fatigue.
Combining the characteristics of remote tower tasks—which are prone to inducing early-onset fatigue—with the established definitions of subjective and objective fatigue thresholds, the final threshold ε was determined by referencing the distribution of subjective and objective scores along with the median of F, and was used to classify fatigue and non-fatigue states. This binary visual fatigue label was then added to the standardized multimodal feature dataset, resulting in a final high-dimensional feature sample set with fatigue state annotations. Among the samples, fatigued and non-fatigued instances accounted for 47.9% and 52.1%, respectively, maintaining an overall balance between positive and negative classes.
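For clarity, the labeling rule can be written compactly as below. The weight vector, w_sub, and ε are placeholders here, since the study’s specific values are those reported in Table 4 and the text; this is a sketch, not the authors’ code.

```python
# Sketch of the composite-score labeling rule defined above.
import numpy as np

def fatigue_label(obj_indicators: np.ndarray,  # (n_samples, 6), normalized to [0, 1]
                  obj_weights: np.ndarray,     # PCA-derived weights for the 6 indicators
                  s_sub: np.ndarray,           # normalized RTVF-Q score per sample
                  w_sub: float,
                  epsilon: float) -> np.ndarray:
    f = obj_indicators @ obj_weights + w_sub * s_sub     # composite score F
    return (f >= epsilon).astype(int)                    # 1 = fatigued, 0 = non-fatigued
```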
3.2. LightGBM-MLP Recognition Model
This study proposed a two-stage recognition model integrating LightGBM and MLP to predict visual fatigue states in a remote tower context. The model aims to combine interpretability in feature selection with the nonlinear modeling power of neural networks. Both stages were implemented in Python 3.6.0 and executed on a Windows 11 workstation with an i9-12900K CPU and 64 GB RAM.
In the first stage, LightGBM was used to rank the importance of the 26 features. LightGBM is a GBDT-based ensemble method that constructs trees sequentially to minimize residuals and global loss. The objective function is as follows:
Obj = Σ_i l(y_i, ŷ_i) + Σ_{k=1}^{t} Ω(f_k)
where l(·) is the loss function, y_i is the binary visual fatigue status label derived from the composite score, and ŷ_i is the predicted value. Ω(f_k) is a regularization term used to constrain model complexity, and f_1, …, f_t denote the first t decision trees.
To comprehensively evaluate feature importance, this study considered two metrics provided by LightGBM: the “Split” index based on split frequency, and the “Gain” index based on information gain. These two indices respectively reflect the frequency with which a feature is used and its contribution to model performance. Specifically, Split represents the total number of times a feature is used for node splitting across all trees; a higher Split value indicates that the feature is frequently utilized in the modeling process. Gain, on the other hand, represents the total information gain obtained when a feature is used to split nodes; a higher Gain value implies that the feature has a more direct and critical impact on the model’s predictive performance. Split and Gain were ranked and normalized independently. Features appearing in the intersection set of the two rankings were assigned an overall importance score (OIS) to facilitate quantitative selection. The calculation of OIS is defined as follows:
OIS_i = λ · (Split_i / Σ_j Split_j) + (1 − λ) · (Gain_i / Σ_j Gain_j)
Here, OIS_i denotes the composite importance score of feature i, where Split_i and Gain_i represent the number of splits and the total information gain associated with feature i, respectively. The denominators correspond to the sums of each metric across all features, used for normalization. The parameter λ is introduced to adjust the relative weights of Split and Gain. In this study, λ was set to 0.5 to balance the general usage frequency and the information contribution of features during the selection process. This design aimed to avoid the bias introduced by relying on a single metric and to enhance both the robustness of feature selection and the interpretability of the model.
Finally, the top 12 features, which together accounted for 84.80% of the total OIS, were selected based on the comprehensive importance ranking and used as the input feature set for subsequent model training, as illustrated in Figure 4.
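The first stage can be sketched as follows: fit a LightGBM classifier, read out the Split- and Gain-based importances, combine them into the OIS defined above with λ = 0.5, and keep the top 12 features. The hyperparameters shown (e.g., n_estimators) are placeholders, not the study’s exact configuration.

```python
# Sketch of LightGBM-based feature selection using the OIS defined above.
import lightgbm as lgb
import numpy as np
import pandas as pd

def select_features(X: pd.DataFrame, y: np.ndarray, lam: float = 0.5, k: int = 12) -> list[str]:
    model = lgb.LGBMClassifier(objective="binary", n_estimators=200).fit(X, y)
    split = model.booster_.feature_importance(importance_type="split").astype(float)
    gain = model.booster_.feature_importance(importance_type="gain")
    ois = lam * split / split.sum() + (1 - lam) * gain / gain.sum()
    ranking = pd.Series(ois, index=X.columns).sort_values(ascending=False)
    return ranking.head(k).index.tolist()
```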
In the second stage, the key features selected in the first stage were input into an MLP model. Through a fully connected multi-layer architecture, the MLP was used to capture high-level interactions among features from different modalities, ultimately yielding the classification output for the visual fatigue states. As a typical feedforward neural network, the MLP is well-suited for modeling complex nonlinear mappings from structured input data. It consists of an input layer, multiple hidden layers, and an output layer, with full connectivity between adjacent layers.
The forward propagation process of the MLP can be expressed as follows:
a^(l) = f(W^(l) · a^(l−1) + b^(l)),  ŷ = a^(L)
Here, a^(l) denotes the activation output of the l-th layer, while W^(l) and b^(l) represent the weight matrix and bias term, respectively; f(·) is the activation function, and ŷ is the final predicted output of the last layer L. Through iterative training, the MLP is capable of automatically learning higher-order nonlinear interactions from the structured input features, effectively capturing synergistic patterns and potential correlations across different modalities.
Furthermore, to achieve optimal model performance and generalization capability, a systematic hyperparameter tuning process was conducted. Based on the structural characteristics of the model and theoretical considerations, the initial value ranges of key hyperparameters were first determined. Subsequently, a tree-structured Parzen estimator (TPE)-based Bayesian optimization algorithm, combined with fivefold cross-validation, was used to efficiently search the hyperparameter space. The final network architecture comprised three hidden layers with 24, 48, and 24 neurons, respectively. The ReLU function was selected as the activation function, and the dropout rate was set to 0.22. The initial learning rate was fixed at 0.01, and the maximum number of training epochs was set to 500. To further mitigate overfitting and enhance training efficiency, an early stopping mechanism was introduced: training was terminated if no significant improvement was observed on the validation set for 30 consecutive epochs, with the best-performing model parameters retained.
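A minimal sketch of a network with the reported architecture (hidden layers of 24, 48, and 24 neurons, ReLU activations, dropout rate 0.22) is shown below in PyTorch for illustration; implementation details beyond these reported hyperparameters, including the framework itself, are assumptions.

```python
# Sketch of the three-hidden-layer MLP described above (24-48-24, ReLU,
# dropout 0.22), outputting a binary fatigue probability.
import torch.nn as nn

def build_mlp(n_features: int = 12) -> nn.Sequential:
    return nn.Sequential(
        nn.Linear(n_features, 24), nn.ReLU(), nn.Dropout(0.22),
        nn.Linear(24, 48), nn.ReLU(), nn.Dropout(0.22),
        nn.Linear(48, 24), nn.ReLU(), nn.Dropout(0.22),
        nn.Linear(24, 1), nn.Sigmoid(),
    )
```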
3.3. Model Performance Evaluation
To comprehensively evaluate the performance of the proposed multimodal visual fatigue recognition model, a 12-fold cross-validation strategy was adopted. Specifically, the dataset was divided into 12 groups, each consisting of data from 3 participants. Each group was used once as the validation set, while the remaining groups served as the training set.
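One way to realize this participant-grouped 12-fold split is scikit-learn’s GroupKFold, which keeps all data from a participant in the same fold (36 participants / 12 folds = 3 participants per fold); this is an illustrative sketch, not necessarily the authors’ splitting code.

```python
# Sketch of a participant-grouped 12-fold cross-validation split.
from sklearn.model_selection import GroupKFold

def participant_folds(X, y, participant_ids, n_splits: int = 12):
    gkf = GroupKFold(n_splits=n_splits)
    for train_idx, val_idx in gkf.split(X, y, groups=participant_ids):
        yield train_idx, val_idx
```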
Balanced accuracy (BA) and F1 score were selected as the primary performance metrics. Balanced accuracy accounts for class imbalance and is calculated as:
BA = (TP / (TP + FN) + TN / (TN + FP)) / 2
In this context, TP and TN represent the number of correctly predicted fatigue and non-fatigue samples, respectively. FN refers to the number of fatigue samples incorrectly classified as non-fatigue, while FP refers to the number of non-fatigue samples misclassified as fatigue.
The F1 score is used as a comprehensive measure that balances the model’s Precision and Recall, and is calculated as:
F1 = 2 · Precision · Recall / (Precision + Recall)
with:
Precision = TP / (TP + FP),  Recall = TP / (TP + FN)
Precision indicates the proportion of samples predicted by the model as “fatigued” that are actually fatigued, while Recall denotes the proportion of all true fatigue samples that were correctly identified. The F1 score represents the harmonic mean of Precision and Recall, and is particularly suitable for evaluating the model’s ability to recognize critical classes under imbalanced sample conditions.
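Both metrics correspond directly to standard scikit-learn functions, as in the short sketch below (for illustration only).

```python
# Sketch of computing the two evaluation metrics defined above.
from sklearn.metrics import balanced_accuracy_score, f1_score

def evaluate(y_true, y_pred) -> dict:
    return {
        "balanced_accuracy": balanced_accuracy_score(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
    }
```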
Table 5 presents the classification performance of the proposed model under different combinations of physiological modalities. A clear upward trend was observed as more modalities were integrated. When using eye-tracking (ET) features alone, the model achieved a moderate performance (mean BA = 0.67, F1 = 0.66), indicating limited discriminative capacity with oculomotor data alone. The addition of EEG features led to a substantial performance boost (mean BA = 0.78, F1 = 0.77), underscoring the contribution of neurophysiological indicators in capturing cognitive fatigue states. Incorporating ECG data further enhanced the classification accuracy (mean BA = 0.86, F1 = 0.84), likely due to its reflection of autonomic regulatory changes under fatigue. The inclusion of EDA features yielded the best results (mean BA = 0.92, F1 = 0.90), highlighting the added value of arousal-level signals. In several folds, the model even achieved perfect scores (BA = 1, F1 = 1), demonstrating the strong synergistic effect of multimodal fusion. These findings reinforce the effectiveness of the proposed approach and support the use of integrated physiological data in visual fatigue modeling.
The results demonstrate that the LightGBM-MLP multimodal model developed in this study achieved a relatively high accuracy and stable performance in the context of remote tower operations. The findings also confirm the complementary and integrative advantages of multi-source physiological features in visual fatigue recognition. It is worth noting that although numerous studies have focused on visual fatigue recognition in other high-risk scenarios, systematic modeling in the emerging but critical field of remote tower air traffic management remains in its early stages. This study achieved relatively accurate fatigue recognition in this specific setting, providing both a methodological foundation and a performance reference for future research in the field.
4. Discussion
In this study, twelve high-importance features were identified based on LightGBM feature importance analysis, spanning four modalities: ET, EEG, ECG, and EDA. These features were then used to accurately classify the visual fatigue states. Collectively, they reflect multidimensional physiological changes that occur in remote tower controllers during apron control tasks, offering quantitative evidence for understanding the underlying mechanisms of visual fatigue.
In the eye-tracking modality, pupil diameter emerged as the most important feature and was significantly reduced under fatigue, suggesting impaired pupillary adjustment and decreased visual alertness. The increase in incomplete blink rate and blink frequency with fatigue indicates weakened eyelid muscle control and ocular dryness, manifesting as overt signs of discomfort. Additionally, prolonged blink duration reflects delayed eye closure movements and reduced motor responsiveness. Together, these changes outline the evolution of visual system fatigue under prolonged visual load and are consistent with prior research findings [14].
In the EEG modality, increases in the θ/β ratio at Fz, α/β ratio at Pz, and overall (θ + α)/(α + β) ratio under fatigue reflect a dominance of low-frequency activity, alongside reduced frontal executive function and posterior spatial awareness. These patterns align with the theoretical expectation of diminished cognitive resource allocation during fatigue [17,43]. Furthermore, the decline in power spectral entropy at Cz suggests reduced spectral complexity and monotonous brain activity, indicating impaired adaptability of the neural system to continuous task demands [44].
ECG and EDA serve as indicators of autonomic nervous system activity and arousal levels. In the ECG modality, an increase in the LF/HF ratio and a decrease in RMSSD reflect heightened sympathetic dominance and weakened parasympathetic regulation [20]. In the EDA modality, reduced SCL and fewer SCRs point to lowered physiological arousal under fatigue [45]. These observations suggest that the high cognitive and perceptual demands of remote tower operations may contribute to persistent autonomic tension and reduced heart rate variability during the progression of visual fatigue.
Overall, these features span four physiological systems—ocular activity, cortical electrical activity, autonomic regulation, and physiological arousal—forming a multimodal profile of visual fatigue in remote tower controllers. The findings confirm that visual fatigue is not solely a function of ocular strain, but is accompanied by broader physiological changes. This underscores the necessity and effectiveness of employing a multimodal fusion approach for fatigue recognition in this context.
It is also important to recognize that remote tower environments differ from conventional tower settings in several ways that may influence the fatigue dynamics. While traditional towers provide controllers with direct line-of-sight to the airfield, natural lighting, and real-world depth cues, remote towers depend on mediated displays and artificial illumination. These conditions may lead to a greater reliance on screen-based scanning, limited peripheral awareness, and prolonged accommodation demands. As such, visual fatigue in remote towers may occur more rapidly or manifest differently than in traditional towers, highlighting the importance of specialized models like the one proposed in this study.
It should also be noted that all participants in this study were professionally trained ATC students. Although they possessed adequate task capabilities, differences may exist between them and experienced controllers in terms of operational strategies, stress tolerance, and cognitive resilience. Future studies should therefore expand the sample to include in-service controllers to enhance the applicability and generalizability of the findings. Moreover, individual differences across age groups in visual sensitivity and neural regulation should be further considered to improve model adaptability and robustness.
5. Conclusions
This study focused on air traffic control tasks in remote tower environments and developed a visual fatigue recognition method based on multimodal physiological signals. The effectiveness and feasibility of using multimodal data for visual fatigue modeling were systematically validated. A high-fidelity remote tower simulation environment was constructed to collect multi-channel physiological signals including ET, EEG, ECG, and EDA. Fatigue state labels were generated by combining subjective visual fatigue questionnaires with objective ophthalmologic parameters, resulting in a high-confidence dataset.
Based on this, a two-stage recognition model integrating LightGBM and MLP was proposed and implemented. Under 12-fold cross-validation, the model achieved strong performance (Balanced Accuracy = 0.92, F1 = 0.90). The results further indicate that visual fatigue in remote tower environments involves coordinated changes across multiple systems including ocular activity, cortical neural activity, autonomic regulation, and physiological arousal.
In summary, this study provides both a theoretical foundation and a technical approach for visual fatigue recognition in remote tower settings, and establishes a practical basis for future developments in human factor evaluation, fatigue prediction, and dynamic intervention mechanisms. In future applications, the proposed model could be integrated into remote tower systems to support real-time fatigue mitigation—for example, through adaptive interface adjustments, feedback prompts to controllers when fatigue thresholds are detected, or seamless integration into existing tower management and safety assurance platforms. These capabilities would enable proactive fatigue management and promote sustainable operator performance. Future work may incorporate dynamic task load and individual variability to expand the model’s applicability across diverse tasks and populations and to advance the engineering deployment and intelligent development of human factor monitoring in remote tower systems.