1. Introduction
Accurately measuring the workload (WL) of operators in human–computer interaction systems is crucial for ensuring the safety of the system. Although the advancement of automation technology has reduced the physical burden on operators, at the same time, the amount of information output by the system is also rapidly increasing. This change is particularly prominent when performing complex multi-window tasks, such as operating manned vehicles in emergency situations, where operators need to monitor the status of multiple subsystems in real time and effectively perform multiple tasks under tight time pressure. This working environment can easily lead to operators bearing excessive WL, thereby increasing the operational risks of human–computer systems [
1,
2]. Therefore, this study aims to propose a new WL measurement method to better understand the WL of operators in complex environments, and based on this, optimize task allocation [
3].
WL is a multidimensional concept, influenced by factors such as workload requirement, time constraint, operator ability, and behavioral performance [
4]. Boff et al. [
5] defined WL as the proportion of psychological resources required by the operator in the task. Brookhuis et al. [
6] argued that WL is influenced by task characteristics and personal attributes, including age, experience, motivation, task execution strategies, physical states, and emotional states [
7]. These multiple influencing factors make measuring WL a difficult challenge. Now, the common methods for measuring WL can be divided into three categories: subjective measurements, performance measurements, and physiological measurements [
8,
9,
10,
11].
The subjective measurement methods require operators to self-assess the WL level when experienced in the current or recently completed tasks, including the NASA-TLX scale, Bedford scale, and the revised Cooper Harper scale [
12,
13,
14]. The NASA-TLX scale developed by Hart et al. can comprehensively assess WL [
12]. This scale mainly covers six dimensions: mental demand, physical demand, temporal demand, effort, own performance, and frustration level [
15,
16]. The scale was originally designed to meet the needs of the aerospace industry, and over time, its application scope has significantly expanded. Nowadays, the NASA-TLX scale is not limited to the aviation industry but is widely used in various fields such as transportation, nuclear power plants, healthcare, and human–computer interaction, becoming an important tool for assessing WL in these fields [
17,
18,
19]. The subjective measurement methods are widely used due to their ease of management and efficiency [
20,
21]. However, the method requires the operator to temporarily interrupt the ongoing task during use. Therefore, it is difficult to assess the WL of an operator in real time using subjective measurement methods. In addition, the WL measurements obtained by the operator through the subjective measurement methods may differ from the actual WL experienced [
22].
The physiological measurement methods provide an objective and real-time approach for assessing WL by analyzing physiological data from subjects during task execution. In previous studies, the correlations have been extensively examined between the WL level of subjects and their physiological indicators including electrooculography (EOG), electrocardiography (ECG), electroencephalography (EEG), respiration, skin conductance, blood pressure, and eye-tracking data [
23,
24]. Due to its sensitivity to transient fluctuations in WL, the physiological measurement methods have been adopted by many scholars to evaluate the WL level of operators in the work environment [
25,
26,
27]. However, these methods require operators to wear physiological monitoring equipment, which not only increases the overall evaluation cost but may also cause interference with the performance of operators during task execution.
The performance is one of the main indicators for measuring WL, which can directly reflect the operational effectiveness of human–machine interaction systems [
28,
29]. Performance measurement methods are divided into primary task method and secondary task method. As the WL level increases, operators will consume more cognitive resources, leading to a decrease in performance [
30].
Siegel and Wolf [
31] proposed the Time Line Analysis and Prediction (TLAP) method to quantify time load by the ratio of required time to available time. Although the impact of task type on WL is not considered, the TLAP method provides valuable insights for subsequent research on WL measurement. For example, Liu et al. [
31] proposed a pilot’s WL prediction method based on timeline analysis. Park et al. [
32] proposed the Adaptive Control of Thought-Rationale (ACT-R) model with cognitive architecture, which was experimentally validated through flight missions. However, both methods require the decomposition of specific human–machine systems and tasks into multiple modules. The standards for task decomposition have not yet been unified, which limits the application of these methods in other fields.
In sum, the current methods for measuring WL have their own application scopes and limitations. Firstly, the subjective measurement method is a post-event measurement method that cannot measure WL in real time, which restricts our timely intervention in task allocation of human–machine interaction systems. Secondly, although physiological measurement methods can measure WL in real time, these methods often cause unnecessary interference for operators and affect the effectiveness of task execution. Finally, the performance-based real-time WL measurement methods have received widespread attention but are mainly suitable for specific fields such as flight missions, and their universality in other fields such as manned vehicles is limited. In addition, although there are some performance-based measurement methods for WL that could be used to monitor changes in operator’s WL in real time, the effective rules for setting WL classification thresholds are still lacking. Wickens pointed out that the WL threshold could be objectively defined by an observed slowing in the rate of performance associated with increases in overall task demands [
33]. But the current setting of WL threshold is mainly determined based on the subjective experiences of designers, which lead to the lack of commonality [
34,
35,
36].
To solve the issued mentioned above, this study aims to establish a measurement method for WL to improve the problem of High Workload (HWL) among operators in multitasking environments of manned vehicles. And the WL assessment thresholds were further defined based on the decreasing tendency of performance accuracy associated with increased task demand. This study provides a theoretical and practical basis for the optimal design and adaptive adjustment of human–machine dynamic function allocation, which supports the improvement of operator efficiency and system safety.
  2. WL Measurement Method
The operators in the manned vehicle are responsible for monitoring and operating computers and instruments. The differences between this multitasking system and others are that operators must access the human–machine interfaces of each subsystem through separate windows, and each subtask system demands the operator’s complete attention. Multitasking systems with a high frequency of human–machine interaction are prone to put operators at HWL and even overload, thus creating safety risks. Therefore, the WL measurement method for operators in the manned vehicle was proposed. This method can help provide a basis for task allocation between humans and machines, and thus reduce the occurrence and duration of HWL states in operators.
  2.1. Method Design
Given the limitations of existing methods [
37], a new measurement method of WL was proposed based on the concepts of Task Demand Load (TDL) and Subject Load Capacity (SLC). The TDL refers to the workload imposed on operators by monitoring or operating systems [
38,
39], which is associated with the task attributes (such as task type and task difficulty). Besides the TDL, the workload capacity provided by an operator also varies with the task attributes, especially for multitasking human–machine interaction systems. Therefore, a new concept of SLC was defined in this study that refers to the ability of operators to complete tasks within limited available resources.
On the basis of existing methods for quantifying task complexity, this study proposes the quantization formulas for TDL, SLC, and Average Workload (AWL). The TDL and SLC were calculated considering two dimensions: the task complexity based on entropy theory, and the time factors including the planned completion time and the realized completion time. Based on the above definitions, the WL level of operator during tasks could be measured and classified in terms of thresholds related to performance accuracy.
Figure 1 illustrates the specific implementation process of the WL measurement method, as described below.
 Step 1: Multitasking system analysis and task complexity estimation. The interface elements and operational flow were summarized via the system tasks analysis. On this basis, the interface–information interaction, the action control, and the interface control behavior were graphed. The types and numbers of nodes in the above three behavioral graphs were used as inputs of Equation (1), and the four sub-complexities were obtained including actions logic complexity, action size complexity, interface–information interaction complexity, and interface control behavior complexity. The specific methodology is described in 
Section 2.2.
Step 2: Calculation of AWL based on TDL and SLC. The TDL and SLC were calculated based on task complexity and time factors, and then were used to measure AWL combined with other relevant factors including task frequency, task period, and actual service time of subjects. The specific methodology is described in 
Section 2.3.
Step 3: Classification of WL based on defined thresholds. The elevated WL can easily lead to operational errors, which may pose potential safety risks in human–machine systems. Therefore, the WL thresholds could be defined based on the task accuracy. Based on the established thresholds, the AWL measurement results could be classified into different levels. The specific methodology is described in 
Section 2.4.
  2.2. Task Complexity
The information theory has been proved feasible and effective in quantitative assessment of task complexity. Shannon introduced entropy as the most important measure into information theory. Entropy is a fundamental measure that includes information, decision-making, and uncertainty, and plays a core role in the field of information theory. To evaluate task complexity, Park et al. [
40] and Davis et al. [
41] adopted the entropy measurement method within the context of program control graphs. Mowshwitz [
42] introduced two different entropy measures for graphical representations: first-order entropy for evaluating the logic of program control diagrams and second-order entropy for evaluating the complexity of program control diagrams. It is important to note that a higher entropy value signifies increased graph complexity. Equation (1) is the formal representation of the entropy measurement method [
42]:
        where 
H is the entropy value; 
N is the number of node types; 
p(
Ai) is the probability of the node belonging to class 
Ai.
Based on the entropy theory, the task complexity for multitasking systems was estimated from four different dimensions: interface–information interaction complexity, actions logic complexity, action size complexity, and interface control behavior complexity [
41]. The interface–information interaction complexity refers to the intricacies of information interaction between operational behaviors and HMI, which could be quantified based on the interface–information interaction graph. The actions logic complexity and action size complexity, derived from the actions control graph, elucidate the logical sequence and the volume of requisite actions during task execution [
43]. The interface control behavior complexity could be calculated based on the interface control behavior graph.
The operational requirements of the multitasking system can be divided into task monitoring and task processing. Accordingly, the task complexity in this study is divided into the monitoring complexity and the processing complexity. The monitoring complexity is determined using Equation (2) [
43]. The processing complexity is determined by Equation (3) [
43].
        
        where 
HM is the monitoring complexity; 
HP is the processing complexity; 
HIIIC is the interface–information interaction complexity; 
HALC is the actions logic complexity; 
HASC is the action size complexity; 
HICBC is the interface control behavior complexity.
  2.3. AWL Measurement
The TDL describes the inherent attribute of the task, independent of the operator’s ability. It is related to factors such as task complexity, task uncertainty, task frequency, and task performance requirements [
44].
The existing scholarly literature has substantiated that the TDL can be quantified by assessing various task system parameters, with findings consistently highlighting the salient role of task complexity in influencing TDL relative to other determinants [
40]. Furthermore, the TDL is closely intertwined with temporal constraints, commonly referred to as time pressure [
45]. Consequently, the TDL in this study is quantified by considering the task complexity and the temporal requirements of the task, as follows:
        where 
TDLi is the task demand load of task 
i; 
 is the monitoring complexity of task 
i; 
 is the processing complexity of task 
i; 
 is the maximum monitoring time of task 
i;  is maximum processing time of task 
i.Besides the task-related factors, the WL is also influenced by the individual factors [
6]. For example, the operators exhibit varied responses when participating in different task processes. Therefore, the SLC, defined in this study, is associated with the task parameters (such as the task complexity) and the operation proficiency (such as the actual operating time in Equation (5)).
        
        where 
SLCi is the subject load capacity; 
 is the monitoring complexity of task 
i; 
 is the processing complexity of task 
i; 
 is the actual monitoring time of subject for task 
i; 
 is the actual processing time of subject for task 
i.
The WL can be viewed as a resource pool. The available resources that operators can activate vary with the task complexity; meanwhile, the requirements for resources are different when performing various tasks. Therefore, this study compares SLC to activated available resources, while TDL is likened to occupied resources. And the WL is defined as the ratio of occupied resources to activated available resources, with the value range of 0 to 1. Three cases of the instantaneous WL are considered in this study as follows:
- When the subject is not actively engaged in task processing (SLCi = 0), the instantaneous WL is defined as 0. 
- When SLC matches or exceeds TDL (TDLi ≤ SLCi), the subject’s instantaneous WL is calculated as the ratio between the two. 
- When SLC falls short of meeting TDL (TDLi > SLCi), the upper limit of WL is reached, designated as 1 in this study. 
The mathematical expression for instantaneous WL is as follows:
        where 
WLi is the instantaneous 
WL at task 
i; 
SLCi is the subject workload capacity for task 
i; 
TDLi is the task demand load for task 
i.
The assessment of instantaneous WL only captures its fluctuations throughout the duration of task. To measure the WL of operator at a specific moment, the AWL is introduced as the accumulated WL per unit time, as described in Equation (7) [
45]. The lower AWL in subjects implies a greater store of cognitive resources, thereby enhancing their capacity to effectively manage new tasks.
        
        where 
ti is the service time for task 
i; 
N represents the total number of tasks; 
T is the duration of task time, 
T = 600 s.
  2.4. WL Assessment Threshold
The multitasking systems in the manned vehicle mainly lead to HWL conditions and rarely cause operator underload, which is the main reason for many manned vehicle accidents. The human–machine dynamic functional allocation system closely links intelligent auxiliary support to the operator’s WL level, providing automated support when the WL is too high [
46]. The AWL measurements are able to assess changes in the real-time WL of the operator. However, in order to further clarify the intervention time for automation in human–machine dynamic function allocation systems, it is important to clarify the category in which the WL is located.
When task demand exceeds resource supply, operators would be in the state of overload, and their performance will sharply decline. The red line in 
Figure 2 shows the performance breakpoint. This red line divides the supply and demand space into two regions. The left region is the “remaining resources region”, and the right region is the “overload region” [
47,
48]. Therefore, the setting of the red line is crucial for the evaluation of WL.
In order to ensure that operators have sufficient residual capacity to maintain a certain level of work performance in emergency situations, the WL assessment thresholds were defined in this study by determining the maximum allowable level of WL based on performance transition points. Based on these thresholds, the WL state was identified for better guidance of the high-performance allocation strategies of human–machine collaboration.
  3. Experimental Methods
  3.1. Subjects
Forty-five subjects between the ages of 20 and 27 years (23.03 ± 1.50 years) were recruited to participate in the ECAR system experiment. All subjects were physically healthy, right-handed, with normal or corrected vision, and with normal electrocardiograms. Prior to the formal experiment, an 8-h task training was conducted for each subject to ensure that they are fully familiar with the task operation of ECAR system under all conditions. All subjects had a good mental state before or during the experiment. All subjects completed the test without dropping out and received financial compensation after the experiment ended.
  3.2. ECAR System Tasks
In the manned vehicle, the operator carries out multi-window tasks through the Environmental Control and Atmospheric Regeneration (ECAR) system. The ECAR system is a task platform developed in PyCharm Community Edition software (version 2020.2.4). It integrates the Environmental Control System (ECS) and the Atmospheric Regeneration System (ARS). The Human Machine Interfaces (HMIs) of the ECAR system are shown in 
Figure 3.
Figure 3a shows the HMI of the ACS. The specific task operation process of the ECS is as follows: firstly, the operator observes the abnormal temperature and humidity parameters (ECS1), and clicks the “TEMP” or “RH” button (ECS2); secondly, the operator observes the abnormal equipment parameters (ECS3) and analyzes the cause of the failure (ECS4); finally, the operator selects the “Air conditioning fault level” (ECS5) and clicks the appropriate “Air conditioning fault reporting” button (ECS6).
 Figure 3b shows the HMI of the ARS. The specific task operation process of the ARS is as follows: firstly, the operator observes the abnormal atmospheric parameters (ARS1), and clicks the “O2”, “CO2” or “TC” button (ARS2); secondly, the operator observes the abnormal equipment parameters (ARS3) and analyzes the cause of the failure (ARS4); finally, the operator selects the “Fault level” (ARS5) and clicks the appropriate “Fault reporting” button (ARS6).
   3.3. Task Complexity of ECAR System
Taking the ECS as an example, the two types of complexity (monitoring complexity and processing complexity) and the sub complexity (interface–information interaction complexity, actions logic complexity, action size complexity, and interface control behavior complexity) were calculated.
The interface–information interaction complexity is equal to the second-order entropy of the interface–information interaction graph, as shown in 
Figure 4. In second-order entropy, if the amount of information flowing in and out is consistent, it is a type of node. The number of second-order entropy nodes in 
Figure 4 is 20. The calculated result is as follows:
Figure 5 shows the actions control graph. The actions logic complexity is equal to the first-order entropy of the actions control graph. In first-order entropy, when the elements and quantities of information flowing in and out are consistent, it is a type of node. The number of first-order entropy nodes in 
Figure 5 is 5. The specific formula is as follows:
 The action size complexity is equal to the second-order entropy of the actions control graph. The number of second-order entropy nodes is 16. The action size complexity value is as follows:
Similarly, the interface control behavior complexity is equal to the second-order entropy of the information control behavior graph. As shown in 
Figure 6, the number of second-order entropy nodes is 7. The interface control behavior complexity value is as follows:
According to Equations (2) and (3), the monitoring complexity and processing complexity are as follows:
        where 
HM is the monitoring complexity; 
HP is the processing complexity; 
HIIIC is the interface–information interaction complexity; 
HALC is the actions logic complexity; 
HASC is the action size complexity; 
HICBC is the interface control behavior complexity.
The sub-complexity values (
HIIIC, 
HALC, 
HASC, and 
HICBC) and the complexity type (
HM and 
HP) of subtasks in ECAR system were summarized in 
Table 1.
  3.4. Experimental Design
In order to verify the effectiveness of the proposed WL measurement method, the experiments, adopting the within-subject design, were conducted with the three WL levels for the ECAR system tasks. The duration of each experimental test was set to 10 min in this study. The three WL levels were designed by setting different trigger numbers of tasks in fixed task duration of 10 min: LWL including eight tasks, MWL including 20 tasks, and HWL including 36 tasks. The dependent variables included the subjective NASA-TLX scale, work performance, and eye movement indicators.
  3.5. Experimental Procedure
The experimental procedure is shown in 
Figure 7. Before the formal experiment began, the researchers carefully explained the experimental process to the subjects. Considering individual differences among the subjects, the researchers made appropriate adjustments to the seat height and distance from the operating computer based on their height. The computer screen used in the experiment was set to 14 inches in size and the screen brightness was controlled at 80%. Throughout the testing phase, the temperature in the laboratory was maintained between 21 °C and 25 °C, the relative humidity was maintained between 20% and 30%, and the working surface illumination was 500 lx.
During the experiments, the subjects were required to perform three groups of ECAR system tasks with different WL levels in the order designed by the Latin square method. The NASA-TLX scale was required to be filled out after completing the tasks at each WL level, and a 10-min break was reserved between adjacent WL levels. This study was approved by the Institutional Review Board (IRB) of Beihang University.
  3.6. Experimental Data
The experimental data consists of four types: subjective scale, work performance, eye movement, and AWL, as detailed in 
Table 2.
The NASA-TLX scale was adopted as the subjective tool for evaluating WL from six factors, namely mental demand, physical demand, temporal demand, effort, own performance, and frustration level. The use of the NASA-TLX scale is mainly divided into two parts. Firstly, the subjects rate each dimension based on their feelings and experiences during task execution; secondly, the subjects obtained the weight of each dimension by comparing the importance of the six dimensions [
10]. Finally, the overall WL score is calculated by multiplying the weighted scores of each dimension by their scale scores, and then adding these products together. The higher the score is, the higher the WL is.
The actual operational data of ECAR tasks, including task execution accuracy, monitoring time, processing time, task accuracy, and task execution frequency, were automatically recorded by the computer system. The operator’s work time was calculated as the sum of the monitoring time and the processing time. The work time percentage is the ratio of the operator’s work time to the overall task duration (the task duration in this study was 10 min).
The visual attentional behavior of subjects was recorded using the aSee Pro desktop eye tracking device and aSee Studio software (version 0.2.29.3) in order to analyze its relationship with WL. ASee Pro utilizes telemetry-based gaze tracking technology based on human eye 3D models, without the need for fixed heads, and can accurately calculate gaze coordinates within a distance range of 55 cm to 80 cm. The pupil diameter, fixation duration, saccade range, and saccade velocity were used as indicators of eye movement. Among them, the pupil diameter is closely related to the changes of subject’s state and the way of information processing and is a sensitive indicator for measuring WL [
49]. The fixation duration refers to the duration length of the subject’s fixation at each fixation point. The longer the fixation duration, the more difficult it is for the subject to process the information [
50]. The saccade amplitude refers to the length of the subject’s eye span from the end of the previous fixation behavior to the beginning of the next fixation behavior [
51]. The more difficult the task, the smaller the saccade amplitude will be. The saccade velocity refers to the distance between the fixation points of the subjects per second [
52]. It can indicate the efficiency of the subject’s search for information in the current visual area, and the faster saccade velocity represents the more efficient the subject’s search for information [
52].
The AWL is quantitatively obtained based on the task complexity, maximum allowable time, actual work time, and task frequency. The detailed calculation process of task complexity is described in 
Section 3.2. The maximum allowable monitoring time for each subtask in the ECAR system is 5 s, and the maximum allowable processing time is 20 s. The actual monitoring time, the actual processing time, and the task frequency of each subtask in the ECAR system are recorded by the computer.
The repeated measures analysis of variance (ANOVA) was performed using the SPSS 25.0 to test the significance between all measurements. Given that the ANOVA necessitates adherence to normality and homogeneity of variance assumptions for all measures, the Shapiro-Wilk test was initially employed in this study to assess normality. Unfortunately, the examination revealed that none of the dependent variables met the criteria for normality. Subsequently, the one-way ANOVA was applied to assess homogeneity of variance, revealing that only NASA-TLX results demonstrated homogeneity. Considering the non-compliance with normality and homogeneity of variance, the Kruskal–Wallis test was adopted to examine the impact of WL levels on all measures. Additionally, the Spearman correlation analysis was performed using the open-source statistical software package R 3.6.1 to investigate the correlation between AWL measurement results and subjective NASA-TLX scales, work performance, and eye movement indicators. Statistical analysis was performed using α = 0.05.
  4. Results
  4.1. WL Measurement Results
Table 3 presents the measurement results under different WL conditions. 
Figure 8 shows the measurements under three different WL conditions including the calculated AWL, NASA-TLX score, task accuracy, work time percentage, pupil diameter, fixation duration, saccade amplitude, and saccade velocity.
 As shown in 
Figure 8a,b, the AWL measurements and NASA-TLX scores increased significantly with the increase of designed WL levels, indicating the sensitivity of the three designed WL levels and the effectiveness of the AWL measurement method.
As shown in 
Figure 8c,d, as the designed WL levels increased, the task accuracy decreases significantly, and meanwhile the work time percentage increased significantly. These results illustrated that an increase in the WL level caused subjects to spend more time processing tasks, while further contributing to subject errors.
Figure 8e–h depict the changes in eye movement of subjects when performing the tasks with different WL levels. The statistical results indicated that there was no significant effect of WL level on pupil diameter and saccade velocity. Moreover, the fixation duration was significantly higher at LWL than that at HWL, and the saccade amplitude under LWL was significantly lower than that under MWL and HWL. These results indicated that tasks with HWL level can lead to shorter fixation duration for subjects and force them to process more information in one fixation.
 The above measurement results not only validated the effectiveness of the AWL measurement method proposed in this study, but also revealed the effects of changes in the WL level on the work performance and eye movement indexes.
  4.2. WL Correlation Analysis
Figure 9 shows the correlation analysis results of all measurements during the processing of the ECAR system task. The absolute value of the correlation coefficient is regarded as extremely weak correlation between 0 and 0.1, weak correlation between 0.1 and 0.3, moderate correlation between 0.3 and 0.5, strong correlation between 0.5 and 0.7, and extremely strong correlation between 0.7 and 1.
 The correlation analysis results showed that there were significant and very strong positive correlations between the AWL measurements and the NASA-TLX scores (r = 0.75, p < 0.01) and work time percentage (r = 0.97, p < 0.01). A significant and very strong negative correlation was found between the AWL measurements and the task accuracy (r = −0.73, p < 0.01). And the significant and weak correlations were found between the AWL measurements and the fixation duration (r = −0.28, p < 0.05).
Based on the correlation between the AWL measurement results and the NASA-TLX scale, task accuracy, and working time percentage, it is proven that the AWL measurement results are effective in evaluating the operator’s WL level. The larger the AWL measurement results, the higher the operator’s WL level.
  4.3. WL Assessment Threshold
The WL assessment thresholds were defined by integrating subjects’ task accuracy with the calculated AWL measurements. In 
Figure 10, the solid black line represented the task accuracy, and the black dashed line denoted the fitted trend line for task accuracy. Similarly, the solid red line represented the AWL measurements calculated by the method proposed in this study, and the red dashed line indicated the fitted trend line for AWL measurements. Notably, as the designed WL level increased, the AWL measurements simultaneously increased, and the task execution accuracy progressively declined.
When the trend line of task accuracy reached or exceeded 0.95, most subjects attained 100% task accuracy, signifying optimal performance of the human–machine system. At this point, the corresponding AWL measurement of 0.25 was designated as the LWL threshold. The performance of the human–machine system was deemed satisfactory when the trend line of task accuracy ranged between 0.90 and 1. However, the security of the human–machine system was threatened when the fitted trend line of task execution accuracy fell below 0.90. Consequently, the AWL value of 0.4, corresponding to the task execution accuracy of 0.9, was designated as the HWL threshold.
To summarize, based on the AWL measurement calculated using the method proposed in this study, the subject’s WL state can be classified into different level. Specifically, subjects were regarded as being in the LWL state when the AWL measurement was below 0.25, in the MWL state when the AWL measurement was between 0.25 and 0.40, and in the HWL state when the AWL measurement surpassed 0.40.
  6. Conclusions
In the manned vehicle, the multitasking human–machine interaction system often leads to HWL for operators. The WL measurement method was proposed for multitasking processing based on TDL and SLC, in response to the limitations of traditional WL measurement methods in human–machine collaboration systems. To evaluate the effectiveness of this method, the ECAR system tasks with varying WL levels were designed, and 45 subjects were engaged in the experimental study. The main conclusions are summarized as follows:
- In the measurement results of the AWL and subjective NASA-TLX scale, work time percentage increased significantly with the increase of WL level. The measurement results of accuracy and fixation duration decreased significantly with the increase of WL level. 
- There were very strong correlations existed between AWL measurements and NASA-TLX results (r = 0.75, p < 0.01), task execution accuracy (r = −0.73, p < 0.01), and work time percentage (r = 0.97, p < 0.01), respectively. 
- Thresholds for WL levels were established based on task execution accuracy. The thresholds for LWL and HWL were 0.25 and 0.40, respectively. Based on the established WL thresholds, this study evaluated the AWL measurements for three categories. 
In conclusion, the results indicate that the AWL measurement method proposed in this study can effectively assess the WL of operators in real time, and can also automatically trigger the optimal task allocation strategy in human–machine dynamic function allocation system to keep operators in the appropriate WL state, thus optimizing the performance of the human–machine system.