May I Assist You?—Exploring the Impact of Telepresence System Design on the Social Perception of Remote Assistants in Collaborative Assembly Tasks

Brade, Jennifer; Mandl, Sarah; Klimant, Franziska; Strobel, Anja; Klimant, Philipp; Dix, Martin

doi:10.3390/robotics14060073

by Jennifer Brade ¹,*, Sarah Mandl ², Franziska Klimant ¹, Anja Strobel ², Philipp Klimant ³,⁴ and Martin Dix ¹,⁴

¹ Professorship Production Systems and Processes, Chemnitz University of Technology, Reichenhainer Str. 70, 09126 Chemnitz, Germany

² Professorship Personality Psychology and Assessment, Chemnitz University of Technology, Wilhelm-Raabe-Straße 43, 09120 Chemnitz, Germany

³ Professorship Virtual Technologies, Hochschule Mittweida, University of Applied Sciences, Technikumplatz 17, 09648 Mittweida, Germany

⁴ Fraunhofer Institute for Machine Tools and Forming Technology IWU, Reichenhainer Straße 88, 09126 Chemnitz, Germany

* Author to whom correspondence should be addressed.

Robotics 2025, 14(6), 73; https://doi.org/10.3390/robotics14060073

Submission received: 26 March 2025 / Revised: 19 May 2025 / Accepted: 27 May 2025 / Published: 28 May 2025

(This article belongs to the Special Issue Extended Reality and AI Empowered Robots)

Abstract

Remote support in general is a method that saves time and resources. A relatively new and promising technology for remote support that combines video conferencing and physical mobility is that of telepresence systems. The remote assistant, that is, the user of said technology, gains both presence and maneuverability in the distant location. As telepresence systems vary greatly in their design, the question arises as to whether the design influences the perception of the remote assistant. Unlike pure design studies, the present work focuses not only on the design and evaluation of the telepresence system itself, but especially on its perception during a collaborative task involving a human partner visible through the telepresence system. This paper presents two studies in which participants performed an assembly task under the guidance of a remote assistant. The remote assistant was visible through differently designed telepresence systems that were evaluated in terms of social perception and trustworthiness. Four telepresence systems were evaluated in study 1 (N = 32) and five different systems in study 2 (N = 34). The results indicated that similarly designed systems showed only marginal differences, but a system that was designed to transport additional loads and was therefore less agile and rather bulky was rated significantly less positively regarding competence than the other systems. It is particularly noteworthy that it was not the height of the communication medium that was decisive for the rating, but above all, the agility and mobility of the system. These results provide evidence that the design of a telepresence system can influence the social perception of the remote assistant and therefore has implications for the acceptance and use of telepresence systems.

Keywords: telepresence system; assembly task; social perception; collaboration; human–robot interaction

1. Introduction

The increasing digitalization and automation of production systems has led to far-reaching changes in industrial processes in recent years. The use of telepresence systems (TPSs), which enable specialist personnel to intervene in work processes from remote locations, is gaining increasing attention. Telepresence can be facilitated through a diverse array of technological devices—encompassing basic video telecommunication systems to advanced augmented reality (AR) glasses and sophisticated telepresence robots. Telepresence robots are remote-controlled devices equipped with videoconferencing technology, allowing users to maintain a virtual presence in a distant location [1]. Telepresence robots offer several advantages over traditional methods like video calls or AR glasses: telepresence robots provide a more immersive, lifelike experience, allowing the remote participant to navigate in the distant location and engage naturally with the environment. They enable real-time presence, offering better spatial awareness and, depending on the model, the ability to physically move and interact with objects, which is particularly beneficial for tasks requiring hands-on assistance. Additionally, telepresence robots offer a greater sense of embodiment and social presence, making remote collaboration more intuitive and effective [2]. Although this technology is already being used in areas such as medicine [3] and education [4], its use in industrial settings is still limited. The reasons for this are complex and range from technical hurdles and a lack of standards to economic and organizational challenges. In addition, the question of possible deployment areas and tasks for the TPS prevails. In the following text, the term TPS is used in general and thus includes telepresence robots.

For telepresence systems to be used effectively in production systems, it is crucial to clearly define the areas of application. Not all processes are equally suitable for the use of telepresence. Fields of application that demand a high degree of flexibility, fast decision-making, and a high degree of human interaction appear to be particularly suitable. In maintenance and assembly, where complex assemblies with different parts and tools are often processed, telepresence offers the opportunity to make specialist knowledge available regardless of the experts’ current location. At the same time, it can help shorten the learning curve for less experienced employees by providing instructions and feedback in real time. To our knowledge, there is no formal research to date on the application of TPSs in industrial settings.

One key aspect that underlines the relevance of telepresence in production systems is the increasing shortage of skilled workers. In many industries, complex processes are carried out by workers who do not always have the necessary expertise. This can lead to errors, inefficient processes, and production downtime. Telepresence systems could play a crucial role here by enabling skilled personnel to monitor assembly processes remotely and instruct less qualified employees. This would not only improve the quality of processes but also ensure the availability of experts across regions.

An often underestimated but decisive factor for the success of telepresence systems is the design of the robots used. Design may be associated not only with functionality and efficiency, but also with the perception, acceptance, and subsequent use by employees [5,6]. In production systems, where robots [7] and telepresence robots are often integrated into complex, collaborative processes, the design needs to meet the specific requirements of the tasks. In functional terms, a telepresence robot should be able to move precisely and flexibly in a production environment. This includes not only navigating through dynamic work areas, but also interacting with tools, machines, and people. Ergonomic aspects play just as important a role as technological features, such as high-resolution cameras, intuitive controls, and real-time communication.

In addition to technical performance, the effect of the design on human interaction partners is of central importance. Research shows that the appearance and behavior of a robot can significantly influence cooperation and acceptance by humans [8]. Robots in general can be ascribed both positive and negative human attributes [9], which are associated with both the behavior of the robots and the design aspects [5,6]. For example, more human-like robots are perceived as more sociable, but less competent than less human-like robots [10]. Schouten et al. [2] showed that the use of a TPS led to more robot-like characteristics being attributed to the remote interaction partner. Therefore, it can be assumed that the design of the robot affects the evaluation of the remote assistant’s characteristics. A telepresence robot that is perceived as too mechanical or aloof could cause mistrust or apprehension. A human-like, empathetic design could help break down barriers and enable more effective collaboration. In addition to the social perception of the TPS, trust is also a key factor in human–robot collaboration. Various studies have been conducted to identify the factors that determine trust in robots, including studies by Bainbridge, et al. [11] and Tsui, et al. [12] who examined the influences of type, size, proximity, and behavior of the robot. In assembly, where clear communication and trust between those involved is essential, a telepresence robot that is designed accordingly could significantly improve collaboration. This is especially true in scenarios where skilled personnel give instructions remotely because the collaboration is relying on the interaction between the person on site and the remote personnel, with no in-person interaction taking place. Previous studies have thus shown that the design of a TPS has an effect on social perception. 
In previous studies, e.g., with videos [13], it became clear that the context of the interaction with the TPS played a major role and that the parameters sociability/morality, activity/cooperation, competence, and anthropomorphism (included in the Social Perception of Robots Scale (SPRS) by Mandl, Bretschneider, Asbrock, Meyer, and Strobel [10]) should be examined. Although anthropomorphism plays a more significant role in social contexts, it should also be investigated in industrial contexts to ensure a holistic examination of social factors and to allow for comparison with previous studies.

To ensure the effective use of telepresence systems in an industrial context, a negative impact of the design on communication and the attitude of the users should be avoided. It is therefore necessary to investigate how differently designed telepresence systems are perceived in different contexts and for different tasks. To approach this topic, two studies were carried out in which an assembly task under the guidance of a remote assistant was performed. A comparative analysis was conducted involving four (study 1) or five (study 2) distinct telepresence robots, respectively, alongside a tablet as a control condition, evaluated on social parameters such as sociability/morality, activity/cooperation, competence, and anthropomorphism. The primary objective of the current research was to explore the influence of the design variations of different TPSs on the social assessment of the remote assistant. The present study builds upon a previous investigation that had assessed analogous TPSs through video content [13], as it exclusively concentrates on the authentic interaction and collaborative fulfillment of a task. It should be emphasized that this study, in contrast to previous studies, focuses not only on the social perception of the various TPSs, but on potential associations between social perception and collaborative activities. The present study thus provides additional knowledge on how design aspects could strengthen or hinder interactions between the users of TPSs and others who are physically present. Nevertheless, it is noteworthy that there was an absence of direct engagement between the telepresence systems and the participants, given that all systems functioned solely as conduits for the remote assistant’s display. As a control variable, the participants’ knowledge regarding telepresence systems and robots, as well as their prior experiences with these technologies, were also examined and incorporated into the analyses. 
The following research questions were formulated:

Research question 1: Is there a difference in the perception of the TPSs with respect to (a) their human resemblance (anthropomorphism), and their perceived (b) sociability/morality, (c) activity/cooperation, and (d) competence?

Research question 2: Is there a difference between the TPSs regarding the trustworthiness placed in them?

2. Materials and Methods

2.1. Materials

To answer these research questions, two studies were designed to investigate the social perception of different TPSs. These two studies diverge in that in study 2, an additional TPS was accessible, which was evaluated in conjunction with the systems used in study 1. In both studies, an identical scenario was employed: participants were required to complete an easy assembly task under the guidance of a remote assistant. Two adjacent laboratories (situated at Chemnitz University of Technology) were utilized, one designated to the participant and the first experimenter (hereafter referred to as the experimenter), who oversaw and guided the participant while supplying all necessary materials, and the other intended for the second experimenter (hereafter designated as the remote assistant). The remote assistant managed all TPSs and was perceptible to the participants solely through these systems. In all experimental conditions, the remote assistant was female and of a comparable age to mitigate potential interference effects.

A wooden construction kit (“Maker M108”) produced by MATADOR (www.matador.at, accessed on 26 May 2025) was used for the assembly task so that all participants would be able to complete it regardless of their prior knowledge and craftsmanship skills. This kit is designed for children aged three and above and allows simple objects to be assembled from diverse components, including plates and cuboids of various dimensions, cubes, wheels, and pre-stretchers, as well as connecting pins (rods) of differing lengths and colors. The components were assembled without supplementary tools. The pliers and wedges included in the kit were used exclusively for disassembly, which was performed solely by the experimenter.

For the purpose of the studies, objects were meticulously selected, aiming for maximal similarity concerning their assembly procedures, assembly difficulty, the number of components, and recognizability of the resulting object. The participants were unaware of the specific object they were constructing; no visual representations of the final objects were provided. Consequently, the assembled objects were required to be easily recognizable. The assembly process was conducted under the supervision of a remote assistant, who articulated the assembly instructions to the participant. The instructions underwent testing, organization, standardization, and prior documentation. For each assembly task, the remote assistant was visible through one of the TPSs or, in the control condition, via the tablet. Each participant was tasked with assembling a stork as a preliminary tutorial object, guided by the experimenter. Study 1 encompassed the following objects, which were presented in a randomized sequence: penguin, ostrich, duck, deer, and robot, and in study 2, a cow was assembled as the 6th object in addition to these objects (see Figure 1 for all assembly objects).

To examine the impact of the system utilized for the presentation of the remote assistant, one control condition (tablet) alongside four (study 1) or five (study 2) different TPSs were used (Figure 2): Pepper, developed by Softbank Robotics (Figure 2a), temi, produced by temi (Figure 2b), Double 3, manufactured by Double Robotics (Figure 2c), the automated guided vehicle (AGV) “Hubert”, created by the Professorship Production Systems and Processes at Chemnitz University of Technology (Figure 2d), as well as the robotic platform “Hubertus”, which was also developed by the Professorship Production Systems and Processes (Figure 2e).

Pepper, Double 3, and temi share a comparable design that emulates the human form: the “body” of these robots is elongated vertically, with the communication medium, a tablet, positioned on top, thereby simulating a head. In contrast to these three robots, the AGVs “Hubert” and “Hubertus” have a functional design. “Hubert” was engineered for intralogistics, where it transports selected goods to assist pickers (for a detailed description, refer to [14]). For the current investigation, a tablet was affixed to the vehicle so that it could function as a remotely operated TPS. The design of the AGV “Hubertus” is, like that of “Hubert”, rather functional. It consists of a small mobile robot platform on which, as with “Hubert”, a tablet was mounted. Unlike temi, Pepper, and Double 3, both AGVs carried the tablet at approximately calf height relative to the participants. For all TPSs, care was taken to ensure that the remote assistant was adequately visible and displayed in a consistent manner across the systems.

2.2. Procedure

Prior to data collection, the present study was preregistered on OSF (https://osf.io/2ad9y/?view_only=aa6bf5a9cfc241f4a0fde5dd4fba6999, accessed on 26 May 2025). The procedure was evaluated and approved by the Ethics Committee at Chemnitz University of Technology (#101608307). It was judged not to require further ethical approval and thus to meet the criteria applied by the Ethics Committee, which include requirements regarding the sampling of healthy adults, voluntary attendance, noninvasive measures, no deception, and appropriate physical and mental demands on the subject. For each participant, the following study procedure (see Figure 3) was employed. After the participants had been welcomed by the experimenter in the laboratory, informed about the experiment, and had signed the informed consent form, the pre-assessment started: the participants responded to a brief demographic survey, rated their prior expertise regarding TPSs, and completed four personality inventories assessing need for cognition [15], affinity for technology interaction [16], anthropomorphic tendency [17], and negative attitudes toward robots [18]. These personality data were not analyzed in this paper but will be used in other studies and publications. The questionnaires were presented in a randomized order. Afterwards, the study material was presented to the participants, including the names of the components, which also appeared in the assembly instructions. Subsequently, the tutorial object was constructed under the supervision of the experimenter. Thereafter, the assembly of the test objects began. The participants were directed by the remote assistant, who gave the assembly instructions. The remote assistant was visible via a different TPS for each of the assembly objects. In the control condition, the remote assistant was visible through a tablet that was standing on the table. 
Both the sequence of the TPSs and the sequence of the assembly objects were randomized. Following each assembly, the experimenter removed the TPS from the laboratory, and the participant rated the TPS on the 18 adjectives from the SPRS [10], on one item each for competence and trustworthiness, and on the subjective difficulty of the assembly task. Subsequently, the next TPS was brought into the room, and the next assembly task was carried out. To reduce unsystematic variance in the execution of the study, we used a study protocol that specified the procedure. In this protocol, all deviations by the participants and all observations of the experimenter during the tasks were documented. After the main experiment, each participant was interviewed. We specifically asked how they perceived the different assembly tasks and the different types of assistance systems, whether they tended to pay attention to the TPSs during assembly, and whether they would subjectively say that the representation of the remote assistant had an impact on their ratings.
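The double randomization described above can be illustrated with a minimal sketch. The paper does not state how the sequences were generated (for example, whether they were counterbalanced), so the independent shuffling used here, the function name `draw_session`, and the seed are assumptions for illustration only.

```python
import random

# Conditions and objects of study 1 as named in the text.
TPS_STUDY1 = ["tablet", "Pepper", "temi", "Double 3", "Hubert"]
OBJECTS_STUDY1 = ["penguin", "ostrich", "duck", "deer", "robot"]


def draw_session(systems, objects, rng):
    """Pair a random order of systems with a random order of objects.

    Assumption: both orders are drawn independently and uniformly;
    the original randomization scheme is not described in the paper.
    """
    if len(systems) != len(objects):
        raise ValueError("each system needs exactly one assembly object")
    system_order = rng.sample(systems, len(systems))
    object_order = rng.sample(objects, len(objects))
    return list(zip(system_order, object_order))


if __name__ == "__main__":
    # Hypothetical per-participant seed so that a session can be reproduced.
    for trial, (system, obj) in enumerate(
        draw_session(TPS_STUDY1, OBJECTS_STUDY1, random.Random(1)), start=1
    ):
        print(f"trial {trial}: {system} guides assembly of the {obj}")
```

Seeding a separate generator per participant keeps each session reproducible without affecting the sequences of other participants.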

2.3. Items

Social Perception. The SPRS [10] was used to assess three factors of social perception: anthropomorphism, sociability/morality, and activity/cooperation. In total, 18 items were presented on a semantic differential to be rated on a five-point Likert scale and averaged into a scale. Anthropomorphism consisted of eight items (e.g., natural—artificial), morality/sociability of six items (e.g., sympathetic—unsympathetic), and activity/cooperation of four items (e.g., active—inert).
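As a minimal sketch of the scoring just described, the 18 item ratings can be averaged into the three scales. The item order (anthropomorphism first, then sociability/morality, then activity/cooperation) and the assumption that all items are already coded in the same direction are hypothetical; the actual item layout of the SPRS is documented in [10].

```python
from statistics import mean

# Hypothetical item positions: 8 + 6 + 4 = 18 items, as stated in the text.
SCALES = {
    "anthropomorphism": range(0, 8),
    "sociability_morality": range(8, 14),
    "activity_cooperation": range(14, 18),
}


def score_sprs(ratings):
    """Average 18 five-point ratings into the three SPRS scale scores.

    Assumption: ratings are already recoded so that higher values mean
    "more" of the respective attribute; no reverse-keying is done here.
    """
    if len(ratings) != 18:
        raise ValueError("expected 18 item ratings")
    if any(not 1 <= r <= 5 for r in ratings):
        raise ValueError("ratings must lie on the five-point scale (1 to 5)")
    return {name: mean(ratings[i] for i in items) for name, items in SCALES.items()}


if __name__ == "__main__":
    example = [2, 3, 2, 1, 2, 3, 2, 1, 4, 4, 5, 4, 3, 4, 5, 4, 4, 3]
    print(score_sprs(example))
```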

Competence. To assess the perceived competence of the TPSs, one item, rated as a semantic differential on a five-point Likert scale (competent—incompetent), analogous to previous studies, was used [13].

Trustworthiness. The short version of the Multi-Dimensional Measure of Trust Scale, Version 2 (MDMTv2; [19]), comprising one item that was translated into German in consultation with one author of the scale, B. Malle (personal communication, 20 July 2021), was used to assess trustworthiness. Participants were asked to rate how trustworthy they perceived the TPS in question to be on an eight-point scale, anchored at 0 = not at all and 7 = very much.

Difficulty. The subjective difficulty was assessed with the question: “How would you rate the difficulty of the task you have just completed?” It was rated on a five-point Likert scale, anchored at 1 = easy and 5 = difficult.

2.4. Participants

We conducted an a priori power analysis with G*Power (version 3.1.9.7) with the following parameters for ANOVA (repeated measures, within factors): effect size f = 0.25; alpha-error probability: 0.05; power: 0.95; number of groups = 1; number of measurements: 5; correlation among repeated measures = 0.5; non-sphericity correction = 1, resulting in a necessary total sample size of N = 31. Participants were acquired via social networks as well as university circulars and had to fulfil the following inclusion criteria: a fluent command of German and no physical limitation that would impede a two-handed assembly task.

In study 1, the final sample consisted of 32 participants (26 female, 6 male), and in study 2, 34 participants took part (24 female, 8 male, and 2 non-binary). To account for outliers in the data, box plots of the dependent variables (sociability/morality, activity/cooperation, competence, and anthropomorphism) were inspected so that multivariate outliers could be identified and removed. In both studies, no multivariate outliers were identified. During study 2, the AGV “Hubert” malfunctioned and could not be repaired, so six people completed study 2 without the AGV “Hubert” condition. All participants took part in the study voluntarily and were informed that they were free to abort the experiment at any time.

Study 1: The sample was highly educated, with 75% having obtained a high-school diploma and 25% having obtained a university degree. On average, the participants were 23.06 years old (SD = 3.17, range = 18–29). A total of 25% of the participants reported they had experience with TPSs and 21.88% reported they had prior knowledge about TPSs.

Study 2: The sample was equally highly educated, with 88% having acquired a high-school diploma and 12% having obtained a university degree. On average, the participants were 23.18 years of age (SD = 7.14, range = 18–60). A total of 26.47% of the participants declared they had prior experience with TPSs and 23.53% indicated they had prior knowledge regarding TPSs.

On average, the complete experimental session lasted, in both studies, around 60 min, and all participants received monetary compensation or course credit points.

3. Results

Since participants had to assemble different models, we first established whether the conditions were perceived as equally difficult (see Figure 4). Due to the non-normality of the data, a Kruskal–Wallis test was employed. In both studies, perceived difficulty differed between the objects (study 1: χ² (4) = 17.63, p = 0.001; study 2: χ² (5) = 28.76, p < 0.001), even though the mean difficulty of the tasks was low overall. The task condition “duck” was perceived as the most difficult in both studies. In study 1, the condition “duck” was perceived as more difficult than the conditions “penguin” (p = 0.001) and “deer” (p = 0.024). In study 2, the condition “penguin” was perceived as less difficult than the conditions “ostrich” (p = 0.004), “duck” (p < 0.001), and “robot” (p = 0.017). Additionally, the “duck” condition was perceived as more difficult than the “cow” condition (p = 0.023).

Overall, both studies showed a similar level of perceived difficulty for the objects to be assembled. When asked explicitly in the subsequent brief interview, however, twenty-five participants in study 1 stated that they found the tasks “simple”, and only one described the task as “challenging”. In study 2, 20 participants stated that they found the task “easy”, but 10 people also stated that the wording was not clear.

3.1. Quantitative Results

In the following, the results for the dimensions competence, anthropomorphism, sociability/morality, and activity/cooperation are presented separately for both studies:

In study 1, the dependent variables competence, anthropomorphism, sociability/morality, and activity/cooperation were not normally distributed. In study 2, competence, anthropomorphism, and sociability/morality were not normally distributed, but activity/cooperation was. For easier comparability, we used a non-parametric alternative to the analysis of variance, the Kruskal–Wallis test, for all dependent variables. When applicable, post hoc Dunn’s tests [20] with Bonferroni correction were computed.
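The analysis pipeline named here (Kruskal–Wallis test followed by Bonferroni-corrected Dunn’s tests) can be sketched with the Python standard library. This is an illustrative implementation under stated assumptions, not the software the authors used, which the text does not name; the function names and the toy ratings are hypothetical.

```python
import math
from itertools import combinations


def rank_with_ties(values):
    """Return average ranks (1-based) and the tie term sum(t^3 - t)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks, tie_term, i = [0.0] * len(values), 0, 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for pos in range(i, j + 1):
            ranks[order[pos]] = (i + j) / 2 + 1  # mean of ranks i+1 .. j+1
        t = j - i + 1
        tie_term += t**3 - t
        i = j + 1
    return ranks, tie_term


def chi2_sf(x, df):
    """Upper-tail probability of the chi-square distribution (integer df)."""
    if x <= 0:
        return 1.0
    if df % 2 == 0:  # closed form for even degrees of freedom
        term = total = 1.0
        for j in range(1, df // 2):
            term *= (x / 2) / j
            total += term
        return math.exp(-x / 2) * total
    total, term = 0.0, math.sqrt(x)  # closed form for odd degrees of freedom
    for j in range(1, (df - 1) // 2 + 1):
        if j > 1:
            term *= x / (2 * j - 1)
        total += term
    tail = math.sqrt(2 / math.pi) * math.exp(-x / 2) * total
    return math.erfc(math.sqrt(x / 2)) + tail


def kruskal_dunn(groups):
    """Kruskal-Wallis H test plus Bonferroni-corrected Dunn comparisons.

    `groups` maps a condition name to its list of ratings. The data must
    contain at least two distinct values, otherwise H is undefined.
    """
    names = list(groups)
    pooled = [v for name in names for v in groups[name]]
    n = len(pooled)
    ranks, tie_term = rank_with_ties(pooled)
    size, mean_rank, weighted, start = {}, {}, 0.0, 0
    for name in names:
        size[name] = len(groups[name])
        group_ranks = ranks[start:start + size[name]]
        start += size[name]
        mean_rank[name] = sum(group_ranks) / size[name]
        weighted += sum(group_ranks) ** 2 / size[name]
    h = 12 / (n * (n + 1)) * weighted - 3 * (n + 1)
    h /= 1 - tie_term / (n**3 - n)  # tie correction
    p = chi2_sf(h, len(names) - 1)
    # Dunn's z statistic uses the tie-corrected variance of the pooled ranks.
    base_var = n * (n + 1) / 12 - tie_term / (12 * (n - 1))
    pairs = list(combinations(names, 2))
    posthoc = {}
    for a, b in pairs:
        se = math.sqrt(base_var * (1 / size[a] + 1 / size[b]))
        z = (mean_rank[a] - mean_rank[b]) / se
        p_raw = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal p value
        posthoc[(a, b)] = min(1.0, p_raw * len(pairs))  # Bonferroni
    return h, p, posthoc


if __name__ == "__main__":
    toy = {"a": [1, 2, 3], "b": [4, 5, 6], "c": [7, 8, 9]}  # made-up ratings
    h, p, posthoc = kruskal_dunn(toy)
    print(f"H = {h:.2f}, p = {p:.4f}")
    for pair, p_adj in posthoc.items():
        print(pair, f"adjusted p = {p_adj:.4f}")
```

The Bonferroni step multiplies each raw p value by the number of pairwise comparisons (10 for five conditions, 15 for six) and caps the result at 1. Note that the test treats the ratings of each condition as a separate group.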

Figure 5 and Figure 6 depict the means and standard deviations for the dimensions of social perception for study 1. In research question 1, we asked whether the TPSs were rated differently with respect to (a) anthropomorphism, (b) sociability/morality, (c) activity/cooperation, and (d) competence.

For competence, we found that the TPSs were perceived differently (χ² (4) = 13.15, p = 0.011) with a small effect size (η² = 0.059). A post hoc Dunn’s test showed that the AGV “Hubert” was perceived as less competent than Double 3 (p = 0.048) and the tablet control condition (p = 0.010). None of the other comparisons were significant (Figure 5).

For anthropomorphism, we found that the TPSs were perceived differently (χ² (4) = 14.93, p = 0.005) with a moderate effect size (η² = 0.071). A post hoc Dunn’s test showed that the AGV “Hubert” was perceived as less anthropomorphic than Double 3 (p = 0.014), the tablet control condition (p = 0.021), and Pepper (p = 0.017). None of the other comparisons were significant (Figure 5).

In terms of perceived morality/sociability, we did not find any significant differences between the TPSs (χ² (4) = 5.68, p = 0.225) (Figure 6).

For activity/cooperation, the test showed that the TPSs were perceived differently (χ² (4) = 15.11, p = 0.005), with a moderate effect size (η² = 0.072). A post hoc Dunn’s test revealed that the AGV “Hubert” was perceived as less active/cooperative than Double 3 (p = 0.034) and Pepper (p = 0.005). None of the other comparisons were significant (Figure 6).
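The effect sizes reported for study 1 can be checked with a short sketch. The paper does not state which formula was used; the values are consistent with the common rank-based estimate η² = (H − k + 1)/(n − k), assuming n = 160 observations (32 participants × 5 conditions) and k = 5 conditions. The verbal labels assume conventional benchmarks of 0.01, 0.06, and 0.14, which match the labels used in the text.

```python
def eta_squared_h(h, k, n):
    """Rank-based eta squared from a Kruskal-Wallis H statistic.

    Assumption: eta^2 = (H - k + 1) / (n - k), with k groups and n
    observations in total; the paper does not name its formula.
    """
    return (h - k + 1) / (n - k)


def label(eta2):
    """Verbal label using assumed conventional benchmarks."""
    if eta2 < 0.01:
        return "negligible"
    if eta2 < 0.06:
        return "small"
    if eta2 < 0.14:
        return "moderate"
    return "large"


if __name__ == "__main__":
    # H statistics as reported for study 1 (chi-square with 4 df).
    study1 = {
        "competence": 13.15,
        "anthropomorphism": 14.93,
        "activity/cooperation": 15.11,
    }
    for dimension, h in study1.items():
        eta2 = eta_squared_h(h, k=5, n=160)
        print(f"{dimension}: eta^2 = {eta2:.3f} ({label(eta2)})")
```

Under these assumptions the sketch yields 0.059 (small), 0.071 (moderate), and 0.072 (moderate), in line with the values reported above.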

For study 2, the means and standard deviations for the dimensions of social perception are shown in Figure 7 and Figure 8. As in study 1, we found significant differences for all dimensions except sociability/morality (research question 1). In terms of competence, the findings indicated that the TPSs were perceived differently with statistical significance (χ² (5) = 19.66, p = 0.001), with a moderate effect size (η² = 0.074). Post hoc Dunn’s test identified that the AGV “Hubert” was perceived as less competent than temi (p = 0.024), Double 3 (p = 0.001), and Pepper (p = 0.004). Other comparisons did not show significant results (Figure 7).

Regarding anthropomorphism, the analysis revealed that the TPSs were perceived differently (χ² (5) = 35.33, p < 0.001), exhibiting a large effect size (η² = 0.153). Subsequent analysis using a post hoc Dunn’s test revealed that the AGV “Hubert” was regarded as less anthropomorphic compared to temi (p = 0.003), Double 3 (p = 0.001), and Pepper (p < 0.001). There was also a significant difference between the AGV “Hubertus” and Pepper (p = 0.002). No other comparisons reached statistical significance (Figure 7).

As in study 1, no significant differences regarding the perceived morality/sociability among the TPSs (χ² (5) = 7.76, p = 0.170) were identified (Figure 8).

In the domain of activity/cooperation, the analysis indicated that the TPSs were perceived in a disparate manner (χ² (5) = 22.93, p < 0.001), exhibiting a moderate effect size (η² = 0.091). A subsequent post hoc Dunn’s test demonstrated that the AGV “Hubert” was regarded as less active/cooperative in comparison to temi (p = 0.001), Double 3 (p = 0.019), and Pepper (p = 0.001). In addition, temi was rated as more active/cooperative than the tablet condition (p = 0.046). All the other comparisons did not reach statistical significance (Figure 8).

To answer research question 2 (see Figure 9), we investigated whether the TPSs were rated differently with respect to the trustworthiness placed in them. In study 1, we found significant differences in the attribution of trust (χ² (4) = 13.51, p = 0.009) with a moderate effect size (η² = 0.061). A post hoc Dunn’s test revealed that the AGV “Hubert” was perceived as less trustworthy than the tablet control condition (p = 0.011). None of the other comparisons in study 1 were significant. In study 2, the Kruskal–Wallis test turned out significant (χ² (5) = 12.68, p = 0.027). A post hoc Dunn’s test, however, did not indicate any significant differences between the TPSs.

3.2. Qualitative Results

A short interview was conducted after finishing the assembly tasks. In the following, the descriptive results on (1) which system the participants favored, (2) how participants felt about the different TPSs, (3) whether the participants were focused on the TPSs, the remote assistant, or the task at hand, and (4) whether they felt that the TPS had an effect on how they felt during the task, are reported. Figure 10 shows which of the TPSs were preferred by what percentage of the participants (e.g., 15.5% (study 1) and 14.7% (study 2) preferred the tablet over the TPS) and how the participants perceived and described the TPSs.

In both studies, none of the participants indicated that they focused solely on the TPSs; rather, their focus was on the assembly task or on the remote assistant. The only exception was Pepper, which eight of the thirty-two participants in study 1 and fourteen of the thirty-four participants in study 2 explicitly mentioned as drawing their attention and distracting them. Some participants also commented that they found Pepper interesting and scary at the same time.

In terms of the participants’ feelings toward the TPSs, in study 1, eleven participants answered that they did not feel the TPS had any effect on their emotions, albeit one elaborated that Pepper was distracting. Furthermore, two participants elaborated that they interacted “with the same person” or “with a tablet” and thus, the TPSs had no effect. Some participants stated that they felt “uneasy” or “distracted” by the TPSs. In study 2, twenty-six participants stated that the TPSs had an influence on their ratings and of these, seven participants directly stated that the level of the communication medium had an influence.

4. Discussion

Telepresence systems could be a good alternative to cost- and resource-intensive assembly and maintenance work in deployment areas such as production systems, since they provide the users with the ability to move in distant locations while providing video conferencing systems. Thus, they enable the user to not only interact socially with other people in a distant location, but also make it possible to move around, for instance to aid the workers with their expertise without the need to travel. However, whether the use of TPSs could really make a change will be dependent on factors founded in the actual technical abilities required and, importantly, the users on both sides of the technology. Social perception was found to be associated with the acceptance and subsequent actual use of different robots [21,22].

To this end, in the present work, we investigated the social perception of different telepresence systems, controlled by a remote assistant, in a collaborative task. The collaborative task consisted of the assembly of wooden models under the instruction of the remote assistant, who operated via different TPSs and a tablet as a control condition. Participants rated each TPS after the collaborative interaction in terms of the relevant dimensions of social perception: anthropomorphism, sociability/morality, activity/cooperation, competence, and trustworthiness.

In terms of competence, the AGV “Hubert” received the lowest descriptive values in both studies. In study 1, the AGV “Hubert” was perceived as significantly less competent than Double 3 and the tablet control condition. In study 2, however, the AGV “Hubert” was perceived as less competent not only than Double 3, but also than temi and Pepper. The tablet condition, however, scored slightly lower in study 2 than in study 1. Three implications can be derived from these results: First, the differences among the conditions were minimal and exhibited only a small effect size in both studies. This suggests that the evaluative responses were influenced more by the standardized instructions provided by the remote assistant, or by the perception of the remote assistant, across all conditions than by the TPSs. Second, in both studies, the AGV “Hubert” was the only TPS to be rated significantly less competent than other TPSs (i.e., than Double 3 and the tablet condition in study 1; than Double 3, temi, and Pepper in study 2). Since this effect was evident in both studies, it can be assumed that the additional assets of the AGV “Hubert” (such as transporting items), which were not used and thus not salient for the participants, were not perceived as useful; thus, it may not have been the best option for instructional purposes. As seen in Figure 10, “Hubert” is a guided vehicle designed to move items in an industrial scenario. In the present study, it was equipped with a tablet to enable communication, for example, in tasks where it moves items and a remote assistant needs to communicate with another person. Based on the results of study 1, it could be assumed that the height of the communication medium influenced the assessment. In contrast to the other TPSs, the tablet for the AGV “Hubert” was at a level of approximately 30 cm above the floor, whereas the tablets of the other TPSs were approximately at eye level for the seated participant.
However, the results of study 2 contradicted this, as the AGV “Hubertus” had its communication medium at a similar height to that of the AGV “Hubert”, and yet its competence ratings did not deviate significantly from those of the other TPSs. It can be concluded from these results that although the height of the communication medium was associated with the competence rating, as both AGVs scored descriptively lower than the other devices, it is not solely responsible for the lower rating. A third possible reason for the lower evaluation in terms of competence for the AGV “Hubert” could be the bulkiness of the TPS, which was intended for the transportation of loads. In contrast to the AGV “Hubert”, the AGV “Hubertus” is a rather agile, more maneuverable system and is similar to the other TPSs in its footprint (depth × width), though not in height.

The results regarding anthropomorphism showed that in both studies, the AGV “Hubert” was perceived as less anthropomorphic, with significantly lower values than Double 3 and Pepper. However, although all devices except the AGV “Hubert” showed similar anthropomorphism values in study 1, a more differentiated picture emerged in study 2: although temi, Double 3, and the tablet condition achieved similar values, this time, Pepper achieved the highest anthropomorphism values, which differed significantly from those of the AGV “Hubert” and the AGV “Hubertus”. Comparing the results of both studies, it could be concluded that although the same survey instruments were used, the participants in study 2 made more differentiated assessments. Given Pepper’s humanoid design, the results of study 2 are not surprising.

In both studies, our investigation did not reveal any significant differences in perceived sociability/morality, which suggests that this dimension is, in this scenario, only marginally connected to other social constructs and anthropomorphism. Given that morality and sociability are fundamentally human traits, it is plausible that the consistent visibility of the remote assistant providing instructions across all assembly tasks on the tablets led to an evaluation of the person rather than the technology itself. This implies that in this context, the presence of the remote assistant may overlay any attributes assigned to the technology, as the human qualities (and consequent perceptions, such as friendliness and morality) of the remote assistant were more pronounced than those of the technology. Moreover, the average ratings for this dimension were relatively high, indicating that the characteristics of the remote assistant likely exerted a favorable influence on the assessments.

The results for activity/cooperation in both studies showed significantly lower values for the AGV “Hubert” compared to Double 3 and Pepper. The AGV “Hubertus”, on the other hand, was perceived similarly to Double 3 and Pepper, which suggests that, once again, the massive, bulky design of the AGV “Hubert”, and not the height of the communication medium, influenced the rating. This assumption was also supported by the qualitative data from the interview, as 26% of the participants stated that the AGV “Hubertus” was “better” than the AGV “Hubert” because its design was smaller and more agile.

Regarding trust, the only significant difference emerged in study 1, between the AGV “Hubert” and the tablet control; no other comparison in study 1 was significant. In study 2, no differences in trust were found between the different TPS designs. These results suggest that trust may not be as strongly influenced by the design of the TPSs as expected. However, it should be noted that a single-item scale was used in both studies to keep the experiment economical. While this approach provided useful initial insights, further studies focusing on trust should employ multi-item scales to gain more nuanced and comprehensive insights into how different TPS designs influence trust formation.

Although a considerable number of participants expressed a preference for Pepper, this particular TPS was also identified as a source of distraction during the task. This may be attributed to Pepper’s erratic movements (e.g., gesturing with its arms and fingers, tilting its head); participants described “feeling distracted due to Pepper’s gaze and hand gestures.” In addition, some individuals reported experiencing discomfort while engaging with Pepper, specifically noting that it “felt as though a third party was present” and that the humanoid design, combined with the awareness of its mechanical nature, induced unease. Furthermore, several participants remarked that they directed more attention towards Pepper than towards the task at hand. It can be inferred that while Pepper was preferred, its application should be limited to contexts where such “distraction” is tolerable or even advantageous; it may prove less appropriate in settings such as educational telepresence or in activities requiring concentrated focus.

The verbal descriptions of the TPSs provided by the participants indicate that Pepper was regarded as the most engaging and “adorable” system, whereas temi and Double 3 were perceived in a more neutral light, characterized metaphorically as “a tablet on wheels,” facilitating amicable interactions. The AGV “Hubert” received a more unfavorable assessment. This can be attributed, in part, to the fact that its functional utility (load transportation) was neither acknowledged nor deemed necessary within the context of the scenario. Based on the information provided by the participants in study 2, it was also deduced that the height of the communication medium was perceived as unfavorable and obstructive in both AGVs, but that the agile and reduced design of the AGV “Hubertus” led to better social perception. Here, the TPSs’ design had a negative effect on the overall evaluation, thus highlighting the importance of appropriate design choices for TPSs in different scenarios. Subsequent studies should further investigate the influence of the height of the communication medium on the ratings and evaluate different design variants of AGVs in different contexts and tasks.

The assembly tasks were classified as ranging from easy to moderately difficult, which may have constrained the variability in the dataset. Due to the considerable disparity in difficulty posed by one specific task compared to the others, direct comparisons between this task and the various TPSs were unfeasible, given the randomized nature of the study. Consequently, for the purposes of replication, it is advisable to gather larger sample sizes to facilitate these comparisons. In future investigations, it would be beneficial to implement more challenging tasks to ascertain whether task difficulty influences the assessment of TPSs. Moreover, the fixed seating arrangement during the tasks inhibited the different TPSs from effectively showcasing their unique mobility capabilities and, particularly for the AGV “Hubert,” its ability to transport small items. Therefore, employing a variety of tasks could yield more insights into the social perception of diverse TPSs.

The findings indicate that participants exhibited a preference for TPSs that showed human-like characteristics in certain respects. At a descriptive level, the AGV “Hubert” received the lowest ratings across all evaluated metrics. The distinctions observed among the tested systems were negligible, with the exception of the AGV “Hubert.” While it could be assumed from the results of study 1 that both the design of the robots and the height of the communication medium significantly influenced the results, the role of the latter was put into perspective by the results of study 2: the communication medium of the AGV “Hubertus” was at the same height as that of the AGV “Hubert”, yet the AGV “Hubertus” achieved better values. It can therefore be assumed that the design of the “body” has a strong influence on the evaluation, as the better descriptive values of the AGV “Hubertus” can be attributed to its more maneuverable and agile design. Future research should explore whether these findings can be replicated in alternate environments (e.g., industrial settings) or if the functionality of the AGV “Hubert” positively impacts its evaluation when its additional features confer a practical advantage.

Nevertheless, the assembly task was successfully completed in every condition, that is, with every TPS, and none of the participants stated that there were differences in the difficulty of the tasks due to the different designs of the TPSs. A limiting factor, however, was that the task was relatively simple; future studies should examine whether differences in performance (errors and time) result from the use of different TPSs and, if such differences are found, how these are reflected in the social ratings.

The present studies are not without limitations. The remote assistant, who delivered the instructions for the assembly task, was female and of a comparable age across all experimental conditions to facilitate comparability. Nonetheless, subsequent research should further explore the hypothesized influence of the remote assistant on the assessment of the TPSs. An additional constraint arose from technical difficulties: in both experiments, the audio feature of Pepper’s tablet was inoperative; consequently, a separate tablet was used exclusively for audio playback. This alternative tablet was positioned in close proximity to both Pepper and the participants.

It should also be noted that, while the selected assembly task ensured standardization and accessibility, its simplicity did not reflect the procedural and cognitive demands of real-world industrial assembly tasks. The focus here was on the feasibility of assembly without special prior knowledge and without endangering the participants. Investigations in such a general framework should be followed by studies dealing with specific scenarios in which a TPS is needed, in order to draw conclusions and derive guidelines for the type of TPS to be used. In future studies, performance measures such as task completion time and error rates should also be recorded, as these objective metrics can complement subjective evaluations. In the present study, such measures were deliberately omitted to avoid placing unnecessary pressure on participants. Furthermore, the cooperation in both studies took place between the participant (who carried out the assembly) and the remote assistant (who read out the instructions). There was thus only limited cooperation, which could have influenced the evaluation of the TPSs. Therefore, in follow-up studies, the remote assistant’s part could be taken by another participant who would not read out prefabricated instructions but would explain them using visual material to simulate a real interaction.

Another limitation is that the studies were conducted at a university, so the sample was young, highly educated, and predominantly female. It is therefore not possible to generalize the results, and it was not possible to look for gender-based differences in TPS scores. The sample also raises the question of the extent to which the results are transferable to an industrial context, as most employees in the production system sector are male and do not have a university degree. To address these problems, replication studies are currently being conducted with an older and more gender-balanced participant sample. Additionally, studies involving industrial user groups are planned to better assess the relevance and applicability of the findings to real-world industrial environments.

In order to classify which types of tasks are suitable for working with TPSs in an industrial context, firstly, it is necessary to evaluate which potential industrial collaborative use cases of TPSs are conceivable. Secondly, these use cases should be classified so that the tasks can be clustered in terms of task complexity and the degree of interaction with regard to mobility and collaboration. In a third step, this clustering should then be incorporated into studies with different collaborative tasks so that it can be checked which tasks are suitable for which TPS and how the social perception of these systems turns out depending on the task type. This approach can ensure the effective use of TPSs in production engineering in the future.
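The clustering step described above can be sketched as a simple grouping of use cases by their task profile. This is only an illustration under stated assumptions: the three-level rating (`Level`), the `UseCase` fields, the function `cluster_use_cases`, and the example use cases are all hypothetical and are not taken from the studies.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Level(Enum):
    # Hypothetical three-step rating; the paper does not define a scale.
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"


@dataclass(frozen=True)
class UseCase:
    # All fields are illustrative assumptions.
    name: str
    complexity: Level      # task complexity
    mobility: Level        # how much the TPS must move during the task
    collaboration: Level   # how closely both sides must work together


def cluster_use_cases(use_cases):
    """Group use-case names by their (complexity, mobility, collaboration) profile."""
    clusters = defaultdict(list)
    for uc in use_cases:
        clusters[(uc.complexity, uc.mobility, uc.collaboration)].append(uc.name)
    return dict(clusters)


# Hypothetical examples, not taken from the studies.
EXAMPLES = [
    UseCase("seated guided assembly", Level.LOW, Level.LOW, Level.MEDIUM),
    UseCase("workstation quality check", Level.LOW, Level.LOW, Level.MEDIUM),
    UseCase("shop-floor maintenance walkthrough", Level.HIGH, Level.HIGH, Level.HIGH),
]

if __name__ == "__main__":
    for profile, names in cluster_use_cases(EXAMPLES).items():
        print([level.value for level in profile], names)
```

Each resulting cluster could then be paired with one collaborative task in a follow-up study, so that social perception ratings can be compared per task type rather than per individual use case.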

5. Conclusions

The study presented here found evidence that for an assembly task with wooden building blocks, participants preferred a more human-like TPS, particularly in terms of competence and activity/cooperation. However, the results also showed that this preference was not only due to the physical level of the communication medium, whereby communication at eye level was clearly preferred, but above all, to the fact that the system was agile and movable. Even if the task used here was solvable with all TPSs, these differences were reflected in the TPS ratings. It is therefore necessary to consider the context of use in order to reduce rejections and optimize the efficiency of task completion. Additionally, the findings indicate that increased mobility alone does not necessarily enhance the social perception of telepresence systems, as the tablet-based control condition was rated comparably or even more positively than some TPSs. This suggests that the use of TPSs must be carefully aligned with user expectations and task requirements. Further research is needed to systematically investigate which specific design features contribute to improved perception and collaboration quality in different application contexts. Based on the findings from the present study, further research is planned to deepen the understanding and practical applicability of TPSs in industrial contexts. The current study was designed as a static experimental scenario in which the TPSs remained stationary throughout the task. This setup did not fully capture the potential of mobile TPSs, particularly in terms of dynamic interaction and physical engagement within collaborative activities. To address this, future studies will incorporate mobility and interactivity more explicitly by implementing a scenario-based task format. These future experiments will incorporate assembly routines that require tool use, part differentiation, or timed actions, further reflecting actual workplace complexity. 
Through this approach, we aim to simulate the realistic industrial applications of TPSs within a controlled experimental framework. In this study, we have demonstrated that the choice of telepresence system is not trivial; rather, its deployment must be thoughtfully designed to align with the specific needs of the task and the user. Furthermore, we focused on an initial evaluation of TPSs, emphasizing simpler tasks to minimize distractions and provide a baseline for future studies. This has laid the groundwork for more in-depth investigations, where the influence of task complexity, user roles, and the specific design of TPSs will be explored to better understand how these systems can be optimized for real-world applications.

Author Contributions

Conceptualization, J.B. and S.M.; methodology, J.B. and S.M.; software, J.B. and S.M.; validation, J.B. and S.M.; formal analysis, S.M.; investigation, J.B. and S.M.; resources, A.S., F.K., P.K. and M.D.; data curation, J.B. and S.M.; writing—original draft preparation, J.B.; writing—review and editing, J.B., S.M., F.K., A.S., P.K. and M.D.; visualization, J.B. and S.M.; supervision, A.S. and P.K.; project administration, A.S. and P.K.; funding acquisition, A.S. and P.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—Project-ID 416228727–SFB 1410.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

AGV	Automated Guided Vehicle
AR	Augmented Reality
ATI	Affinity for Technology Interaction
SPRS	Social Perception of Robots Scale
TPS	Telepresence System

References

1. Björnfot, P.; Bergqvist, J.; Kaptelinin, V. Non-technical users’ first encounters with a robotic telepresence technology: An empirical study of office workers. Paladyn J. Behav. Robot. 2018, 9, 307–322.
2. Schouten, A.P.; Portegies, T.C.; Withuis, I.; Willemsen, L.M.; Mazerant-Dubois, K. Robomorphism: Examining the effects of telepresence robots on between-student cooperation. Comput. Hum. Behav. 2022, 126, 106980.
3. Zillner, C.; Turner, A.; Rockenbauer, G.; Röhsner, M.; Pletschko, T. Use of Telepresence Systems to enhance School Participation in Pediatric patients with chronic illnesses involving the CNS. Z. Neuropsychol. 2022, 33, 227–234.
4. Kasuk, T.; Virkus, S. Exploring the power of telepresence: Enhancing education through telepresence robots. Inf. Learn. Sci. 2024, 125, 109–137.
5. von der Pütten, A.M.; Krämer, N.C. A survey on robot appearances. In Proceedings of the Seventh Annual ACM/IEEE International Conference on Human-Robot Interaction, Boston, MA, USA, 5–8 March 2012; pp. 267–268.
6. Schaefer, K.E.; Sanders, T.L.; Yordon, R.E.; Billings, D.R.; Hancock, P.A. Classification of robot form: Factors predicting perceived trustworthiness. In Proceedings of the Human Factors and Ergonomics Society Annual Meeting, Boston, MA, USA, 22–26 October 2012; pp. 1548–1552.
7. Faccio, M.; Granata, I.; Menini, A.; Milanese, M.; Rossato, C.; Bottin, M.; Minto, R.; Pluchino, P.; Gamberini, L.; Boschetti, G. Human factors in cobot era: A review of modern production systems features. J. Intell. Manuf. 2023, 34, 85–106.
8. Goetz, J.; Kiesler, S.; Powers, A. Matching robot appearance and behavior to tasks to improve human-robot cooperation. In Proceedings of the 12th IEEE International Workshop on Robot and Human Interactive Communication (ROMAN 2003), Millbrae, CA, USA, 2 November 2003; pp. 55–60.
9. Sauppé, A.; Mutlu, B. The social impact of a robot co-worker in industrial settings. In Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems, Seoul, Republic of Korea, 18–23 April 2015; pp. 3613–3622.
10. Mandl, S.; Bretschneider, M.; Asbrock, F.; Meyer, B.; Strobel, A. The Social Perception of Robots Scale (SPRS): Developing and Testing a Scale for Successful Interaction Between Humans and Robots. In Proceedings of the Working Conference on Virtual Enterprises, Lisbon, Portugal, 19–21 September 2022; pp. 321–334.
11. Bainbridge, W.A.; Hart, J.; Kim, E.S.; Scassellati, B. The effect of presence on human-robot interaction. In Proceedings of the RO-MAN 2008 - The 17th IEEE International Symposium on Robot and Human Interactive Communication, Munich, Germany, 1–3 August 2008; pp. 701–706.
12. Tsui, K.M.; Desai, M.; Yanco, H.A. Considering the bystander’s perspective for indirect human-robot interaction. In Proceedings of the 2010 5th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Osaka, Japan, 2–5 March 2010; pp. 129–130.
13. Mandl, S.; Brade, J.; Bretschneider, M.; Asbrock, F.; Meyer, B.; Jahn, G.; Klimant, P.; Strobel, A. Perception of embodied digital technologies: Robots and telepresence systems. Hum.-Intell. Syst. Integr. 2023, 5, 43–62.
14. Winkler, S.; Weidensager, N.; Brade, J.; Knopp, S.; Jahn, G.; Klimant, P. Use of an automated guided vehicle as a telepresence system with measurement support. In Proceedings of the 2022 IEEE 9th International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications (CIVEMSA), Chemnitz, Germany, 15–17 June 2022; pp. 1–6.
15. Bless, H.; Wänke, M.; Bohner, G.; Fellhauer, R.F.; Schwarz, N. Need for cognition: Eine Skala zur Erfassung von Engagement und Freude bei Denkaufgaben. Z. Sozialpsychol. 1994, 25, 147–154.
16. Franke, T.; Attig, C.; Wessel, D. A personal resource for technology interaction: Development and validation of the affinity for technology interaction (ATI) scale. Int. J. Hum.–Comput. Interact. 2019, 35, 456–467.
17. Waytz, A.; Cacioppo, J.; Epley, N. Who sees human? The stability and importance of individual differences in anthropomorphism. Perspect. Psychol. Sci. 2010, 5, 219–232.
18. Bartneck, C.; Nomura, T.; Kanda, T.; Suzuki, T.; Kennsuke, K. A cross-cultural study on attitudes towards robots. In Proceedings of the 11th International Conference on Human-Computer Interaction, Las Vegas, NV, USA, 22–27 July 2005.
19. Ullman, D.; Malle, B.F. Measuring gains and losses in human-robot trust: Evidence for differentiable components of trust. In Proceedings of the 2019 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI), Daegu, Republic of Korea, 11–14 March 2019; pp. 618–619.
20. Dunn, O.J. Multiple comparisons using rank sums. Technometrics 1964, 6, 241–252.
21. Carpinella, C.M.; Wyman, A.B.; Perez, M.A.; Stroessner, S.J. The robotic social attributes scale (RoSAS) development and validation. In Proceedings of the 2017 ACM/IEEE International Conference on Human-Robot Interaction, Vienna, Austria, 6–9 March 2017; pp. 254–262.
22. McKee, K.R.; Bai, X.; Fiske, S.T. Warmth and competence in human-agent cooperation. Auton. Agents Multi-Agent Syst. 2024, 38, 23.

Figure 1. Assembly objects used in the studies.

Figure 2. Telepresence systems used in the studies. (a) Pepper; (b) temi; (c) Double 3; (d) AGV “Hubert”; (e) AGV “Hubertus”.

Figure 3. Procedure of the studies (* only in study 2).

Figure 4. Difficulty of the different assembly objects (left, study 1; right, study 2).

Figure 5. Results for the dimensions competence and anthropomorphism (study 1).

Figure 6. Results for the dimensions sociability/morality and activity/cooperation (study 1).

Figure 7. Results for the dimensions competence and anthropomorphism (study 2).

Figure 8. Results for the dimensions sociability/morality and activity/cooperation (study 2).

Figure 9. Results for the conditions for trustworthiness for both studies (left study 1, right study 2).

Figure 10. Description of the telepresence systems by the participants (in brackets, the frequency of mention) and the proportion of favored systems. In addition to these answers, the tablet was also named as a favorite (study 1: 15.5%; study 2: 5.9%).


© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
