1. Introduction
Increasing automation and the rapidly growing use of robots in industrial as well as social settings result in a stronger need for research on collaboration between humans and robots. Key factors for a safe and successful combination of human and robot abilities include acceptance of and trust in the robot, as well as the absence of negative emotions towards it [1]. In order to prevent physical and psychological harm to humans, it is essential to reduce these negative emotions and to increase trust and acceptance [2,3].
Research has already shown that the introduction of collaborative robots into the workplace can lead to multiple negative emotions among workers, ranging from fear of losing their jobs [2] to increased stress levels while working with the robot, resulting in a reduction in workers’ mental health [4,5]. Moreover, incorrect behavior of industrial robots can increase mental stress in test subjects, who subsequently change their behavior when working with the robot, increasing the risk of errors [6].
In addition to methods for subjectively measuring trust, acceptance, fear, and emotions, there are also objective measures that can supplement the subjective results. The psychological processes that arise when experiencing a situation are often accompanied by physical reactions [7]. This fact can be used to make emotional, cognitive, and physical strain measurable and thus more objective. This also applies to the strain experienced during human–robot interaction, which makes it an interesting topic for this field.
These psychophysiological measures are characterized by several properties that support their use in the analysis of situation-related stress. On the one hand, psychophysiological measures are non-invasive; on the other hand, in most cases they cannot be intentionally influenced by humans [8]. Due to the high complexity of psychological processes, a single psychophysiological measure is usually not sufficient to provide an in-depth picture [7]. Therefore, several of these measures should be combined and supplemented with subjective and technical data whenever possible; a multidimensional approach is advisable. The collection of psychophysiological data, such as electrodermal activity and electrocardiography, is well suited for recording the aforementioned stress levels and potentially negative emotional states of humans during interactions with an industrial robot [9,10].
Virtual and augmented reality simulations have emerged as promising tools for training operators in collaborative tasks, potentially reducing anxiety before working with real robots [11,12]. Nikolaidis et al. [13] showed that so-called cross-training, in which humans and robots swap roles, tended to increase trust in the robot. Furthermore, Palmarini et al. [12] used augmented reality (AR) technology to develop an interface that successfully contributed to increasing trust in human–robot collaboration (HRC). Additionally, immersive virtual training has been suggested as a way of safely and harmlessly investigating human behavior when working with robots in high-risk situations [14].
However, it remains partially unclear whether subjective feelings and psychophysiological reactions observed during virtual training accurately reflect those experienced in real-world human–robot interactions. Addressing this knowledge gap is crucial for designing effective training protocols that build trust and ensure safety. Therefore, the aim of this exploratory study was to examine the emotional and psychophysiological reactions of individuals while performing collaborative tasks with an industrial robot in both real and virtual working environments. The following research questions were addressed:
Are there differences regarding subjective measures [(I) perceived valence, arousal, and dominance; (II) usefulness and satisfaction; (III) trust; (IV) mental demand, performance, and effort] during the human–robot interaction of men and women at three levels of complexity (low, medium, and high) in real and virtual working environments?
Are there differences regarding objective measures [(V) cardiovascular and (VI) electrodermal activity] during the human–robot interaction of men and women at three levels of complexity (low, medium, and high) in real and virtual working environments?
2. Materials and Methods
2.1. Study Participants
A total of 46 participants (23 female), aged between 20 and 58 years (M = 26.63, SD = 7.68), took part in the study; 16 participants (35%) had already worked with a robot. Furthermore, they reported having little to very much VR experience (M = 3.07, SD = 0.95; 5-point scale ranging from 1 = no VR experience to 5 = very much VR experience). Additionally, all participants were asked to assess their general attitude towards working with industrial robots. Overall, they assessed it as partly good to very good (M = 4.35, SD = 0.53; 5-point scale ranging from 1 = very poor to 5 = very good). All participants provided their informed consent at the beginning of the study and participated voluntarily. Ethical approval for this study was obtained from the ethics committee of Furtwangen University (approval number: 22-030).
2.2. Study Design
A mixed design was used for the exploratory study. The independent variables included gender (IV1; men, women) and training method (IV2; real environment, virtual environment). Furthermore, the participants were required to perform three different interaction tasks (IV3; repeated-measures factor) within each training condition, with three different levels of complexity (low, medium, high). The tasks were characterized by a sequential progression in complexity, and the participants therefore completed them in ascending order. This procedure was intended to simulate real training conditions, facilitating familiarization with the task and simulating employee training for human–robot collaboration in an industrial context. Regarding the study design and the measured dependent variables, the presented study was based on previous experiments of the research group (e.g., Refs. [15,16,17]).
Furthermore, it is important to note that the distribution of gender and training method was nearly balanced, with 12 women and 11 men in the real training condition and 12 men and 11 women in the VR training condition, χ²(1) < 0.01, p = 1.000. The previous working experience with robots was also statistically equally distributed between the two training groups, χ²(1) = 0.10, p = 0.757. The same can be reported for previous VR experience, t(44) = −0.15, p = 0.879, as well as for the general attitude towards working with industrial robots, t(44) = 1.13, p = 0.267.
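Balance checks of this kind can be reproduced with standard tools. A minimal sketch in Python using SciPy is shown below; the 2 × 2 gender counts are taken from the text, while the rating vectors for the t-test are hypothetical placeholders, since the raw data are not part of this paper:

```python
import numpy as np
from scipy import stats

# 2 x 2 contingency table: rows = training condition (real, VR),
# columns = gender (women, men); counts as reported in the text.
gender_by_group = np.array([[12, 11],
                            [11, 12]])

# Yates-corrected chi-square test of independence (the common default
# for 2 x 2 tables, matching the reported chi2(1) < 0.01, p = 1.000).
chi2, p, dof, expected = stats.chi2_contingency(gender_by_group)

# Independent-samples t-test, e.g., for prior VR experience between groups
# (the ratings below are hypothetical, for illustration only).
vr_exp_real = [3, 4, 2, 3, 3, 4]
vr_exp_vr = [3, 3, 2, 4, 3, 3]
t, p_t = stats.ttest_ind(vr_exp_real, vr_exp_vr)
```

With the reported counts, the corrected chi-square statistic is effectively zero and p equals 1, reflecting the near-perfect balance of the two groups.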
The following dependent variables (DVs) were used. Subjective measures included valence, arousal, and dominance, measured using the self-assessment manikin [18], as well as usefulness and satisfaction, assessed using the acceptance scale of van der Laan et al. [19] (both ranging from −2 to +2). Furthermore, trust was measured using the subjective trust score for scenarios of human–robot interaction presented by Wagner-Hartl et al. [20] (5-point scale ranging from 1 = strongly disagree to 5 = strongly agree), and the three scales of mental demand, performance, and effort from the NASA Task Load Index (NASA-TLX) [21,22] were also used. Objective measures included cardiovascular activity, using heart rate (HR) and heart rate variability (HRV RMSSD; root mean square of successive differences), and electrodermal activity, analyzing skin conductance level (SCL), sum amplitude, non-specific skin conductance responses (NS.SCR), and mean sum amplitude (sum amplitude/NS.SCR). All psychophysiological parameters used were baseline corrected. The psychophysiological measures were recorded using movisens EcgMove4 [23] and EdaMove4 [24] (thenar/hypothenar, non-dominant hand) devices.
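The cardiovascular parameters can be derived directly from the RR interval series that ECG devices of this kind export. The following is a minimal sketch of HR and RMSSD in Python, not the movisens implementation:

```python
import numpy as np

def heart_rate_bpm(rr_ms):
    """Mean heart rate in beats per minute from RR intervals in milliseconds."""
    return 60000.0 / float(np.mean(rr_ms))

def rmssd(rr_ms):
    """HRV RMSSD: root mean square of successive RR interval differences (ms)."""
    diffs = np.diff(np.asarray(rr_ms, dtype=float))
    return float(np.sqrt(np.mean(diffs ** 2)))
```

For example, the RR series [800, 810, 790, 805] ms yields a mean heart rate of about 74.9 bpm and an RMSSD of about 15.5 ms.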
2.3. Materials and Procedure
Physical materials included two 3D-printed workpieces (100 mm × 50 mm × 10 mm), shown in Figure 1. The first workpiece (left side) featured embedded M6 threaded inserts on both sides, while the second workpiece (right side) comprised two M5 threaded inserts on the left, as visible in Figure 1. Custom 3D-printed grippers with interchangeable 11 mm and 13 mm socket attachments were designed for handling the screws used. Workpieces were secured to the HORST600 collaborative robot [25] using a 3D-printed fixture.
Figure 2 shows the setup of the real working environment. The HORST600 robot [25] used for the completion of the interactive tasks is visible on the right, with the workpieces on the table on the left side. A virtual replica of the workspace served as the virtual environment, with real-time robot data streamed via a Modbus TCP interface and replayed to ensure exactly the same motion of the virtual and physical robot. Special care was taken to create comparable conditions regarding the design of the virtual and the real environment. The workpiece and robot models were created in Blender, and the HTC Vive Pro HMD [26] provided immersive VR interaction. The hand controllers of the HTC Vive Pro were used for handling objects in the virtual environment. All robot interactions were recorded via a camera and logged in Excel.
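The Modbus TCP mirroring described above ultimately reduces to periodically reading a block of registers from the robot controller and converting the raw values to joint angles for the virtual model. The register encoding and scale factor below are assumptions for illustration, not taken from the HORST600 documentation:

```python
def decode_joint_angles(registers, scale=100.0):
    """Convert raw 16-bit Modbus register values to signed joint angles
    in degrees. Assumes two's-complement encoding and a hypothetical
    resolution of 0.01 degrees per count."""
    angles = []
    for raw in registers:
        if raw >= 0x8000:        # reinterpret as a negative 16-bit value
            raw -= 0x10000
        angles.append(raw / scale)
    return angles

# In the streaming loop, a Modbus TCP client would read one register per
# joint in each cycle and feed the decoded angles to the VR robot model.
```

With this assumed layout, a raw value of 9000 decodes to 90.0° and a value of 65036 (a negative 16-bit number) decodes to −5.0°.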
Upon arrival, participants were welcomed and introduced to the study (see Figure 3). After providing their informed consent, participants were asked to fill out a sociodemographic questionnaire. After the physiological sensors were mounted, a baseline of the psychophysiological measures was recorded while participants sat quietly facing a fixation cross on the wall in front of them. The baseline measurement had a duration of three minutes. After finishing the baseline measurement, the participants were instructed to complete the interaction task together with the robot. For this purpose, they were randomly assigned to either the VR or the real environment group.
Independent of group (virtual or real environment), the interaction tasks consisted of three experimental subtasks with increasing complexity (low, medium, high). The subtasks and their execution were identical under both the real and virtual environment conditions (see also Section 2.2) to ensure comparability between settings. The subtasks are shown in Table 1. Each subtask paired a human action with an automated robotic action. After each subtask, a subjective assessment with the questionnaires described in Section 2.2 was conducted. Additionally, prior to the second and third tasks, a short resting measurement of 90 s was conducted to assess recovery and transitional physiological states. The execution of each task took about 60 s (no difference between real and virtual environment), and the average time needed to answer the questionnaires after the tasks was 3 min. The total duration of the study was around 50 min.
Subjective data were collected via Unipark surveys, capturing sociodemographic data and participant perceptions as stated. As mentioned previously (see Section 2.2), the psychophysiological measures were recorded via ECG and EDA using movisens EcgMove4 [23] and EdaMove4 [24] devices. Initial quality checks of the recordings were performed immediately after each session, with any artefacts noted in the Excel log. Raw ECG and EDA data were processed using the movisens DataAnalyzer and custom software. Time windows were defined by the manually logged start and end times of each task. The baseline correction used the middle 90 s of the three-minute rest period; this window was chosen to avoid possible onset and offset effects within the baseline measurement. Subjective questionnaires were scored according to their respective published procedures. Any artefacts within the data were identified during processing and excluded before analysis.
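The windowing logic for the baseline (the middle 90 s of a 180 s rest recording) can be sketched as follows; the sampling rate and the simple difference-based correction are illustrative assumptions, not the exact DataAnalyzer procedure:

```python
import numpy as np

def middle_window_mean(samples, fs_hz, total_s=180.0, window_s=90.0):
    """Mean over the middle `window_s` seconds of a `total_s`-second rest
    recording, skipping the edges to avoid onset/offset effects."""
    start = int(round((total_s - window_s) / 2.0 * fs_hz))
    stop = start + int(round(window_s * fs_hz))
    return float(np.mean(samples[start:stop]))

def baseline_correct(task_mean, baseline_mean):
    """Difference-based baseline correction of a task-level parameter."""
    return task_mean - baseline_mean
```

For a 180 s recording at 4 Hz (720 samples), the function averages samples 180 through 539, i.e., seconds 45 to 135 of the rest period.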
2.4. Statistical Analyses
IBM SPSS Statistics (Version 30) and JASP were used to calculate the results. The statistical analyses were based on a significance level of 5%. Due to the exploratory approach of the study [27], tendencies towards significance were also analyzed, based on a significance level of 10%.
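This two-tier reporting convention (5% for significance, 10% for tendencies) can be stated compactly; the function below merely illustrates the decision rule:

```python
def classify_p(p):
    """Exploratory reporting convention: 'significant' below the 5% level,
    'tendency' (towards significance) between 5% and 10%."""
    if p < 0.05:
        return "significant"
    if p < 0.10:
        return "tendency"
    return "not significant"
```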
4. Discussion
The aim of the study was to examine the emotional and psychophysiological reactions of individuals performing collaborative tasks with an industrial robot in both real and virtual working environments. The underlying idea was to base the tasks on potential employee trainings for human–robot interaction within an industrial setting. Therefore, three different levels of task complexity (low, medium, high) were performed within a real or a virtual (VR) working environment that duplicated the real working environment in a virtual scenario. The analyzed levels of task complexity (training levels) built on each other and were always completed by the participants in ascending order. Their implementation followed frequently mentioned participant suggestions from previous research of the research group regarding requirements for VR trainings of human–robot interaction [17] and former findings regarding task complexity and acceptance of human–robot interaction [20].
To answer the research questions concerning differences with regard to the subjective measures, the results reveal differences in various analyzed variables. The working environment seems to have a significant effect on the perceived valence during human–robot interaction: the interaction with the robot was perceived as significantly more pleasant within the VR environment than within the real environment. Additionally, the VR training environment was perceived as significantly more useful than the real training environment. These results are in line with those of previous research [14,17]. Furthermore, they underline the importance of XR as a promising approach for the development of future trainings (e.g., [11,12,17]).
On the other hand, the participants of the VR group reported a significantly higher effort required to accomplish the level of performance during the training interaction with the robot than the group that performed the tasks in the real working environment. One explanation for this could be that virtual reality is not yet strongly integrated into everyday life, so the use of the controllers may still be unfamiliar; this may have influenced the observed results. Interestingly, the electrodermal responses of the participants pointed in the other direction: the sum amplitude was tendentially significantly higher during human–robot interaction in the real than in the VR environment, and also tendentially higher for women within these conditions. Following Boucsein and Backs [8], this can be interpreted such that a moderate increase in the amplitude of non-specific electrodermal responses is associated with increased cognitive activity (preparatory activation) and a marked increase with negatively toned emotions and emotional strain (affect arousal). One explanation could be that, for the real working environment, the highest task complexity was additionally assessed as significantly more mentally demanding than the lowest and medium levels of complexity. This result is in line with the results for the objective measure heart rate, which was significantly higher during the performance of tasks with medium and highest complexity than during the performance of the task with the lowest complexity. Following Boucsein and Backs [8], a (moderate) increase in heart rate can be interpreted as mental strain (preparatory activation). Furthermore, differences in task complexity were also visible in the electrodermal responses of the participants, especially between the levels of low and high complexity. Again, following Boucsein and Backs [8], an increase in the skin conductance level (SCL) can be interpreted as strain and high general arousal. Furthermore, women showed a significantly higher sum amplitude during the highest task complexity than during the lowest task complexity. In summary, the comparison of the subjective data and the objective psychophysiological measures indicates that the implementation of different levels of task complexity training was successful within the presented study.
Moreover, regarding the subjective measures, only one gender difference was shown. For perceived trust, the results suggest that women assessed tasks with the highest level of complexity significantly higher than tasks with the lowest level of complexity. Future research should take this into account, for example, by evaluating whether this could be an effect of habituation, or whether it reflects a perceived feeling, i.e., “if the robot can do this difficult task, then I can trust it”.
Like every study, this one exhibits some limitations. Elevated room temperatures caused the ECG and EDA sensors to fail or lose adhesion, compromising some psychophysiological recordings. Inadequate ventilation during hot summer conditions introduced artifacts into the EDA data, suggesting that future studies should ensure better climate control and more robust equipment. In the virtual scenario, HMD connection problems and alignment failures with the workpieces disrupted some trials. Future research should solve the problems that occurred in the VR environment and also include a larger sample of participants. The intended complexity gradient in the interaction tasks was only partially manifested in the subjective and physiological measures, leaving potential order and learning effects unexamined. Future studies should take this into account and expand this research.
Furthermore, future research should also investigate how effectively the training phase reduces participants’ anxiety and builds trust in the robot, with particular focus on the higher acceptance observed after virtual training. A larger, systematic, and adaptive training could be developed to specifically target negative emotions and strengthen operator confidence, leveraging the psychophysiological findings that virtual reality elicits responses similar to those in real-world interactions. Furthermore, subsequent studies should consider ergonomic factors, incorporating elements such as motion sensors and ergonomically oriented analysis. This would facilitate the investigation of potential differences in these parameters between virtual reality and real-world environments.