The UP150: A Multifactorial Environmental Intervention to Promote Employee Physical and Mental Well-Being

Physical activity (PA) is a major health factor and studies suggest workplaces could promote PA by modifying office design, motivational strategies and technology. The present study aims to evaluate the efficiency of UP150, a multifactorial workplace intervention for the improvement and maintenance of the level of physical fitness (PF) and wellbeing. Forty-five employees were randomly divided into the experimental (EG) and control (CG) groups. The PF was assessed pre-post intervention using the cubo fitness test (CFT), the amount of PA was evaluated using the IPAQ questionnaire and accelerometers while the workload was assessed using the NASA-TLX questionnaire and psycho-physical health by using the SF-12 questionnaire. The EG worked in UP150 offices while the CG worked in their usual offices for 8 weeks. The EG and CG came back 4 weeks after the intervention for CFT retention. The EG improved CFT motor efficiency and the amount of moderate PA, while it reduced mental load. The EG retained reached motor efficiency levels 4 weeks after the intervention. No differences were found in IPAQ. The UP150 demonstrated to be a proactive environment and to be efficient in the promotion of PA, improving PF and mental health while decreasing mental load.


Introduction
The workplace represents one of the main causes of sedentarism and stress, which negatively affect the quality of life [1]. The incoming of the SARS-CoV-2 pandemic has changed usual working habits and has introduced the concept of smart working, which has caused a reduction in the costs for the companies accompanied by a reduction in employees' working involvement and a decrease in working performances [2,3]. To act on the movement's education and on employees' motivation, a methodology is proposed that can determine an easy and non-traumatic transition from the classic workplace concept (based on constriction, stress and health risks due to a sedentary lifestyle) to a new workplace environment and office's design concept, which consider the well-being and the caring of employees as central elements of companies' welfare strategies [4].
The innovation of this methodology consists of the insertion of systems that can increase the motivation to perform physical activity, through the increase in autonomy, relatedness, and the positive perception of self-motor competence [5]. More specifically, a training system is proposed based on effort perception, which aims to integrate physical activity to professional working rhythms, promoting a conscious physical practice, adequate to the individual's psycho-physical condition and to the working context.

•
Individual level: based on individual consultancies, a dynamic work station (standing and sitting desks) and on the use of treadmill or stairs in active pauses or obliged routes in the office. • Group level: introducing, in the working daily routine, group active pauses of 10 min, promoting online physical activities, walking groups and social support network. • Environmental communication level: using printed billboards or personalized online messages (in order to promote healthy behaviors and physical activity), pedometers and informative campaigns. • Policy and physical environment levels: promoting the creation of safe bicycle parking, the insert of changing rooms in the workplace and health programs.
In addition, the literature underlines that another important aspect to consider in the promotion of physical activity and creation of an active environment is motivation [37]. The lack of adequate motivation in conducting an active lifestyle could cause the early abandonment of the structured promotion program [38]. In order to prevent this issue, it is important to refer to a behavioral and motivational theoretical framework. In recent years, the self-determination theory (SDT) has been considered by many authors. This theory affirms that people are moved by three inner psychological needs: autonomy, competence and relatedness [39]. SDT has been widely suggested in the working context to develop the autonomy and proactivity of the employees in reaching working goals [40]. In addition, as evidenced by Deci and colleagues [41], autonomy, competence and relatedness can be considered as a mediator of health and wellness in the workplace and of physical activity [42]. The autonomy is the need to self-organize one's own experience and one's own behavior. Being autonomous means deciding and self-approving all actions in order to reach a specific goal [39]. In this psychological need, the goal setting theory [43] seems to be effective to promote motivation and autonomy.
In this theory, it is asserted that, when a task is perceived as too easy or too difficult, motivation decreases, while if the task is perceived as moderately difficult, motivation raises. This is supported by Brehm and colleagues who, moreover, assert that an increasing level of potential motivation due to a positive perception of the goal can increase the amount of effort dedicated in the achievement of the goal [44,45]. From this point of view, programs that aim to include physical activity promotion need to include adequate exercise in order to stimulate interest during practice. The competence is represented by the need of feeling efficient in one's own social and physical world, the task required has to be adequate to the subject's characteristics and the goals have to be clear. Relatedness is the need to feel connected with others, creating bonds and experimenting with belonging and intimacy; the individuals need to feel understood and positively evaluated.
Another important factor that may influence the employees' motivation can be perceived in the offices' design. It has been demonstrated that this latter factor has an important role in motivation enhancement [46]. In particular, the office's design can reduce sitting time in employees acting on the environmental/spatial factor as workstations, furniture, office size, office density, shared spaces, corridors and stairs [47]. As previously explained, the increase in motivation is helped by the development of competence, which is composed by the acquisition of abilities and knowledge [48]. In order to sensibilize, educate and increase the knowledge of the employees about healthy lifestyles, recent researches have considered the inclusion of a new professional figure in the workplace environment, the wellness coach [49,50].
Wellness coaches have been shown to have great efficacy health risk prevention and in the promotion of active lifestyles even in workplace [51]. Moreover, the education to move and be active, given by the wellness coaches, is essential to create new abilities due to new routines and automatism, which could facilitate the approach to physical activity during working hours [52]. The inclusion of this new type of professional inside the working environment could contribute in enhancing the motivation to pursue physical activity and active lifestyles [51].
According to Commissaris and colleagues, a systematic review underlines that multicomponent approaches (individual, organizational and environmental changes) can have more efficacy than single component approaches [53].
A study conducted by Maylor and colleagues [54] comprised a multi-component intervention named "Beat the Seat", which aimed to reduce the amount of time spent in the sitting position during the workday, intervening on organizational, individual and environmental elements. The results showed that the intervention contributed to reduce the time spent in the sitting position and increased the amounts of transitions from the sitting to standing position and the number of steps performed during the workday.
Another multi-component intervention, proposed by Nooijen and colleagues [55], aimed to promote physical activity through the use of the company gym, walks during the lunch breaks and standing or walking meetings. The study underlines that, after 6 months from the start of the intervention, employees' self-efficacy and motivation to approach physical activity increased significantly.
It is supposed that participants involved in the new proposed working environment could present higher levels of physical fitness and psychological well-being than a similar group in a standard working environment, considering both in-presence and smart working. Moreover, it is supposed that participants inserted in the new offices could increase the time spent in physical activities and maintain the improved physical fitness.

Sample
Forty-five employees volunteered to participate and were randomly divided into an experimental group (EG = 23) and control group (CG = 22). Only employees of the company that hosted the trial were included. Conversely, all the employees that presented psychological or physical disabilities were excluded from the experimental procedure. Moreover, the participants were excluded from the analysis if they did not complete all the expected evaluations or if they reached a percentage of work absenteeism greater than 10% during the experimental procedure.
During the experimental period, 5 participants (2 males and 1 female in the EG and 2 males in the CG) were no longer able to follow the procedures and were excluded from the analysis. At the end of the experimental procedure, both of groups were composed of 20 participants each (12 females and 8 males for each group). The participants of the EG were 31.7 ± 8.2 years old, with an average weight of 67.6 ± 17.0 kg and an average height of 1.7 ± 0.1 m (BMI= 22.6 ± 2.7 kg·m −2 ). Similarly, the participants of the CG were 32.0 ± 4.4 years old, with an average weight of 64.8 ± 9.9 kg and an average height of 1.7 ± 0.1 m (BMI= 22.9 ± 3.9 kg·m −2 ). The study was conducted in accordance with the declaration of Helsinki and was approved by the ethics committee of the University of Milan (14 September 2020, number 84/20).

Protocol
The study was conducted following the structure of the randomized controlled trial. The first phase (pre-test) aimed to investigate the characteristics of the sample, proposing the cubo fitness test (CFT), and a set of questionnaires. The CFT was administered in 3 different sessions every 5 days to assess reliability. Moreover, in this phase, all participants wore the accelerometers for a week (detection 1). In the second phase (training), the participants were equally divided into two groups (EG and CG) with an equal amount of weekly physical activity measured in the first phase with the accelerometers and the international physical activity questionnaire. The EG worked three times a week in the new concept office and two times a week in smart working for eight weeks. During this period, the EG was subjected to the experimental procedures. The CG worked alternatively in their normal offices separately from EG (three times a week) and in smart working (two times a week) as usual. The CG was not allowed to interact with the procedures determined for the EG. In this phase, the physical activity of both groups was measured with the accelerometers, alternatively one week each, assessing three weeks for each group during the entire experimental period (detection 2, detection 3 and detection 4). In addition, all the participants had to report daily the total quality recovery value (at work entrance), the adapted Borg scale's value (when leaving work) and the training load (calculated by multiplying the adapted Borg's value by the working minutes of the day). After the expected eight weeks, in the third phase (post-test), the participants repeated the CFT and the set of questionnaires. In the fourth phase, all participants interrupted the experimental procedures for four weeks, and subsequently they repeated the CFT and the international physical activity questionnaire (retention test). The entire protocol is shown in Figure 1.

Assessment
All tests were administered in a dedicated test room inside the workplace office with a fixed temperature of 22 • C and with a standard humidity percentage of 40%. During the entire experimental period, participants followed the national indications for SARS-CoV-2 prevention.

Assessment
All tests were administered in a dedicated test room inside the workplace office with a fixed temperature of 22 °C and with a standard humidity percentage of 40%. During the entire experimental period, participants followed the national indications for SARS-CoV-2 prevention.

Architectural Changes
The employees of the EG worked in new offices projected and developed by the specialized society "Progetto Design and Build S.r.l.". The new concept of the office, named "Ufficio Proattivo 150" (UP150), was projected not only in order to include a set of physical activities stations that aimed to integrate movement during active pauses or during workflow, but even in order to create an environment that can increase the motivation of employees and social relationships. The following section explains the details of the physical stations.
Check in wall station: Wall fixed disposal, divided into three backlit segments (low, medium and high), which had to be touched while illuminated. The low segment had to be touched with both hands performing squats; the medium segment had to be touched while performing wall push-ups; and the high segment had to be touched rising on toes and reaching as high as possible. Each segment lit up for 10 s during which the employee had to perform the respective exercise. Each exercise was repeated two times, with a total duration of 60 s.
Steps station: The step station was structured as an informal meeting area, composed of steps, where employees could sit on. The structure presented two steps (of 50 cm and 90 cm) useful for performing different type of exercises (step-ups, push-ups at different heights; sit-squats and stretching for lower limbs).
Meeting rooms' bike: The meeting rooms and the phone booth were equipped with a set of cycle ergometers that allowed to perform physical activity during meetings and work calls.

Architectural Changes
The employees of the EG worked in new offices projected and developed by the specialized society "Progetto Design and Build S.r.l.". The new concept of the office, named "Ufficio Proattivo 150" (UP150), was projected not only in order to include a set of physical activities stations that aimed to integrate movement during active pauses or during workflow, but even in order to create an environment that can increase the motivation of employees and social relationships. The following section explains the details of the physical stations.
Check in wall station: Wall fixed disposal, divided into three backlit segments (low, medium and high), which had to be touched while illuminated. The low segment had to be touched with both hands performing squats; the medium segment had to be touched while performing wall push-ups; and the high segment had to be touched rising on toes and reaching as high as possible. Each segment lit up for 10 s during which the employee had to perform the respective exercise. Each exercise was repeated two times, with a total duration of 60 s.
Steps station: The step station was structured as an informal meeting area, composed of steps, where employees could sit on. The structure presented two steps (of 50 cm and 90 cm) useful for performing different type of exercises (step-ups, push-ups at different heights; sit-squats and stretching for lower limbs).
Meeting rooms' bike: The meeting rooms and the phone booth were equipped with a set of cycle ergometers that allowed to perform physical activity during meetings and work calls.
Break room's treadmill: The break rooms were equipped with treadmills that allowed to walk and talk with colleagues during pauses.
Steppers (toilet, vending machines, standing desks): A set of steppers were placed in different office areas. The steppers allowed to log into many office workflow activities, such as washing and drying hands, buying coffee or snacks, or just stepping while working. To log into the activity, the employee had to perform at least 30 s of steps. The employee could also choose to bypass the procedure and to freely log into the activities without performing the exercise.
Rubber bands (break room and standing desks): A set of rubber bands with three different difficulties were placed in the break room and at the base of standing desks. These bands were useful to perform some simple exercises bound to muscular fitness for upper and lower limbs (at least 30 s for each exercise).
Reclining fitness bench: A dedicated area was equipped with a reclining bench that allowed the execution of stretching for lower limbs, sitting sit-ups with different inclinations of the bench and push-ups.
Wooden stick: The office's common areas were equipped with wooden sticks. These sticks allowed to perform exercises for shoulder and upper limb mobility.

App UP150
It is an application for mobility, developed by the society Business Integration Partners S.p.A. (Milan, Italy). The application includes the pocket trainer (PT), the training diary (TD), and physical activity score tools (PAS), which are explained in detail in the following sections. The App UP150 was developed to simplify the interaction process between the employee and the physical activity inside and outside the workplace. The application had to be activated at the entrance using the QR code linked to the "check in wall" or at home selecting a general check-in exercise. Before the "check in wall station" or any check in exercise, the application required the insertion of a value referred to the perception of total self-recovery using the TQR only once per day. Moreover, the App UP150 was able to connect the mobile phone to all the training stations using QR codes that permitted access to the description of the exercise, the suggested time and subsequently to the timer attached to the selected activity. The timer stopped when the employees decided to push the stop button on the mobile phone and the score reached in physical activity performed (see section on Physical Activity Score) was shown automatically. The same sequence could be used even outside the office, in this case the employee had to select a category of exercise (cardiorespiratory fitness, muscular fitness, articular fitness, sports activity and combined fitness) and to decide when to activate and to stop the timer. The TD system inserted in the application permitted to record all training activity, to compare the reached score with the weekly goal score assigned by the PT, and to give information about the duration and the perceived intensity of the exercise performed.

Pocket Trainer/Training Diary
The pocket trainer (PT) and the training diary (TD) are two components of a unique tool that aim to control and improve the index of motor efficiency (IME, see Section 2.5.1) and the participants' lifestyle by increasing physical activity, consciousness, and perception of their own psycho-physical condition. The PT assigns a goal score, dependent to the IME that has to be reached (or overpassed) during the week, performing different types of physical activity. The goal score is normalized based on the level of the participants. The minimum target score is 150 points (level C), the intermediate target score is 225 points (level B), while the maximum goal score is 300 points (level A) according to WHO recommendations (from 150 to 300 min of moderate physical activity [33]).

Physical Activity Score
The physical activity score (PAS) is an instrument that allows codifying and scoring the physical activity. It was necessary to allow the participants to reach the weekly goal score and record the physical activity performed inside and outside the office. The score is determined by the duration of exercise (in minutes) multiplied by an effort perception's coefficient. The coefficient is assigned as follows: the activities perceived from 0 to 3 (Light) on Borg's scale adapted from NSCA (2012) [56] are considered as coefficient 1; the activities perceived 4 or 5 (moderate) are considered as coefficient 1.5; and all the activities perceived 6 or more (vigorous) are considered as coefficient 2.

Wellness Coaches
The wellness coaches [49] were figures represented in this study by sport science graduate students. These specialists supported the employees during physical practice inside and outside the office. Furthermore, they personalized the exercises and the daily physical routine based on the employee's necessities (previous disease, injury or specific goal or necessity, based on CFT results). Moreover, their role consisted of making the employees aware of the benefits of good practices and healthy lifestyles. The coaches were available online 7 days per week and in office 2 days per week to demonstrate and to explain the correct execution of the exercises.

Self-Determination Methodology
The method used in this research followed the self-determination theory key points identified as the promotion of autonomy, competence, and relatedness [39]. Autonomy is promoted thanks to the possibility of choosing the type (cardiorespiratory, muscular, flexibility or combined physical fitness), the place (inside or outside the office) and the duration and the intensity of the physical activity to reach an assigned target score using the PT and the TD instruments.
Moreover, according to the considered literature, the architectural changes and the support of the application (App UP150) are thought to play an important role in autonomy promotion permitting to choose between many physical stations. Competence is guaranteed by the use of effort perception during physical activity. This method aimed to teach the employees how to practice physical fitness responsibly, respecting the participant's internal load and avoiding the risk of an inadequate intensity effort. Relatedness is promoted by encouraging the interaction with the wellness coaches and with the other employees during the active pauses or during breaks. Moreover, relatedness is encouraged by the specifically designed new architectural environment elements as meeting room's bike or break's room treadmill. To prevent the possible interference of the social context on the physical activity motivation and to enhance the perception of being socially connected [57], wellness coaches were fundamental to help the transition from the classic office concept to the present approach. The wellness coaches were asked to create an adequate working climate where physical activity during working time is not considered an embarrassing moment, but, conversely, a well-accepted opportunity by all colleagues.

Cubo Fitness Test
This test was administered to assess the physical levels of the employees in the office environment. The Cubo Fitness Test (CFT) [58] is composed by 5 submaximal tests based on effort and pain perception executed on a cube-shaped multifunctional instrument. These tests propose to evaluate cardiorespiratory fitness, muscular fitness and flexibility fitness, related to physical wellness and maintaining good health [59][60][61]. Each test gives a defined number of points depending on the test result. The maximal reachable points for each test was chosen taking into account the importance of each considered fitness category in preventing health and mortality risks. Cardiorespiratory fitness has been demonstrated to be fundamental in preventing cardiovascular diseases and other comorbidities, including hypertension, diabetes, heart failure and atrial fibrillation [62], representing the major cause of death worldwide [63,64]. A lower yet important contribution to health risk prevention is muscular fitness, which has been demonstrated to be efficient in reducing mortality risk [59,65]. Finally, flexibility fitness has been demonstrated to be effective in improving life quality [66], but no researches have demonstrated its efficacy in mortality risk prevention.
The index of motor efficiency (IME), resulting at the end of the five submaximal tests, ranges from 10 to 100 points. It summarizes the points reached in the 5 mentioned submaximal tests and was normalized based on age and sex. A score range lower than 33 points is considered a low level (level C), the score range included between 33 and 66 points is considered a medium level (level B) and the score range higher than 66 points is considered a high level (level A). The validity and the reliability of CFT were assessed in a previous research [58]. In the following section, all 5 submaximal tests are explained.
Ruffier test (RT) [67]: The participants had to sit and stand up from the cube with a frequency of 40 bpm for 30 times (or 45 s). During the test, we collected the resting hearth rate (HR0), the hearth rate (HR) immediately at the end of test (HR1) and the HR one minute after the end (HR2). The Ruffier index (RI) was calculated using the following formula: RI = (HR0 + HR1 + HR2 − 200)/100. Lower values of RI indicate a better performance. The height of the sitting position was modified using supports appropriately designed to maintain a 90 • knee angle for each participant during the sitting activity. The effort perception was requested at the end of the test using the adapted Borg's CR-10 scale [56]. The maximal acquirable score was 40 points.
Thirty second push-up test (PUT) [68,69]: The participants had to choose one of three difficulty levels in order to obtain an effort perception of "moderate" on the adapted Borg's CR-10 scale. The levels were defined by the different distances from the ground where the hands' support. Level 3 was the easiest (120 cm from the ground), followed by level 2 (60 cm from the ground) and level 1 (40 cm from the ground). After choosing, the test required the performance of the maximum possible number of push-ups in 30 s, with the subsequent assessment of target effort perception at the end of the test. The maximal acquirable score was 20 points.
Thirty second seated sit-up test (SUT) [68,69]: The participant had to choose one of three difficulty levels in order to obtain an effort perception of "moderate" on the adapted Borg's CR-10 scale. The levels were defined by a different inclination of the seat's back support. Similar to the PUT, Level 3 was the easiest (90 • of inclination), followed by level 2 (45 • of inclination) and level 1 (15 • of inclination). The participant was requested to sit at the edge of the sitting area and to perform the maximum possible number of seated sit-ups in 30 s, with the subsequent assessment of target effort perception at the end of the test. The maximal acquirable score was 20 points.
Shoulder mobility test (SMT) [70]: The main tool of this test is a graduated stick. This instrument is marked with a precise measurement in centimeters starting from 0 in the middle point and increasing the measurement in equal increments in both directions. The participant had to hold the stick with both hands at the same distance from point 0. Starting with a large distance, participants had to perform a backward and subsequently a forward circle with upper limbs extended without losing grip. The participants were asked to repeat the test gradually reducing the hands distance, until they reached their limit without pain perception (value 100 of SIS scale [71]). The maximal acquirable score was 10 points.
Chair sit and reach test (SRT) [72]: The participant was asked to sit on the cube at the edge of the sitting area similarly to the SUT and to lay one leg on the graduated board while the other one was bent with an angle of 90 • and with the foot on the ground. The centimeter placed on the graduated board was calibrated, making the point 0 to start next to the participant's heel. Starting from this point, the centimeter presents two ranges of values: a positive one from heel to the ground and a negative one from heel to the participant's hip. The participant was asked to slowly bend over (5 s) trying to reach or to overpass the heel with both hands performing the maximum stretching without pain (value 100 of SIS scale [71]), holding the position for 2 s while the evaluator measured the distance in centimeters from the hands' middle fingers to the heel and subsequently to return in 5 s. The same measurement was performed for both of legs and the points were assigned using the mean value. The maximal acquirable score was 10 points.

Questionnaires International Physical Activity Questionnaire (IPAQ)
The IPAQ is a validated questionnaire that estimates the amount and the intensity of weekly physical activity performed by adults between 18 and 65 years old [73]. The questionnaire consists of 9 questions from which it was possible to obtain a score in Met referred to total activities. Moreover, thanks to the questionnaire, it was possible to estimate the weekly minutes elapsed in sedentary behavior (during working week and during weekend) or in light, moderate and vigorous physical activity. According to the literature, a total score lower than 700 Met was considered as inactive, a total score from 700 to 2519 Met was considered as active and a score up to 2519 Met was considered very active.

NASA Task Load Index (NASA-TLX)
The NASA-TLX is an evaluation instrument useful to investigate a perceived workload referred to a specific activity [74,75]. In the present research, it was requested to evaluate the workload referred to the previous working week [76]. The questionnaire permitted to evaluate 6 perceived loads: the mental demand (MD), the physical demand (PD), the temporal demand (TD), the effort (EF), the performance (PE) and the frustration (FR). Moreover, a total score (TS) summarized the overall workload.

Short Form Healthy Survey (SF-12)
The SF-12 is a validated questionnaire that aims to evaluate the psycho-physical health status of the participants [77]. It is a short version of SF-36 and consist of 12 questions. The SF-12 permits to estimate the self-reported health status by evaluating two indexes, identified as the physical component summary (PCS-12) and the mental component summary (MCS-12). The survey presents six questions that investigate the PCS-12 through physical activity, the limitations due to physical health, physical pain and general health, while six questions investigate the MCS-12 through social activities, vitality, emotional status and mental health.

Accelerometers
In the present research, the triaxial accelerometers Axivity AX3 (Axivity Ltd., Newcastle upon Tyne, UK, 2013) were used in order to assess the amount of sedentary (lower than 1.5 Met), light (from 1.5 to 3 Met), moderate (from 3 to 6 Met) and vigorous (up to 6 Met) physical activities of the employees according to the literature [78]. The accelerometers' measurements had a range of recorded acceleration of ±16 g and data were collected with a frequency of 100 Hz [79]. The accelerometers were worn on the wrist of the non-dominant hand [80] from Monday to Friday; the raw triaxial data were downloaded from the devices and exported using OmGUI software version 1.24 (Axivity Ltd., Newcastle upon Tyne, UK, 2013).

Total Quality Recovery Scale (TQR)
In the present research, the TQR was proposed to evaluate the perceived recovery status of the employee before performing the CFT and during the entire working week. It is a validated scale that permits to evaluate the psycho-physical recovery referred to the last 24 h [81]. The TQR is composed by a range of value from 6 (very, very poor recovery) to 20 (very, very good recovery). The participant had to focus on his own recovery perception and, based on the scale's verbal anchor, to give the most adequate value. The range of values from 12 to 14 (reasonable recovery) was considered adequate to perform the CFT.

Training Load
Both the EG and CG were asked daily to record their effort perception, based on the adapted Borg's scale, referring to the performed working hours. The training load was calculated by multiplying the effort perception reported by the working minutes performed [82].

Statistical Analysis
The normal distribution of data was conducted using the Kolmogorov-Smirnov test. The reliability of the CFT was assessed through the interclass correlation coefficient (ICC). The homogeneity of the analyzed groups was assessed by performing the unpaired t-test or the respective non-parametric Mann-Whitney U test. To verify the efficacy of the intervention, an ANOVA 3 × 2 (pre/post/retention × EG/CG) was performed for the CFT and IPAQ data, while an ANOVA 2 × 2 (pre/post × EG/CG) was performed for the NASA-TLX and SF-12 data. When a non-parametric analysis occurred, a Friedman test (to assess the intra-group differences) with a Mann-Whitney U test (to assess the inter-group differences) was chosen to replace the ANOVA 3 × 2, while the Wilcoxon test (to assess the intra-group differences) with the Mann-Whitney U test (to assess the intergroup differences) was chosen to replace the ANOVA 2 × 2. The delta values (pre-post) differences for each variable between the EG and CG groups were analyzed and compared using the unpaired t-test or the respective Mann-Whitney U test.
To analyze the accelerometer data, an ANOVA 4 × 2 (detection × EG/CG) was performed, while a Friedman test (to assess the intra-group differences) with a Mann-Whitney U test (to assess the inter-group differences) was chosen for the non-normally distributed data. During the three weeks of training of the experimental group, an ANOVA 3 × 2 (detection × condition) was used to assess differences in amounts of minutes of exercise during each of the three detections of training for the accelerometer and pocket trainer measurements of the intensity of the exercise. The independent sample t-test or respective Mann-Whitney U test were used to detect differences between the minutes and scores of physical activity performed inside and outside the office measured with the training diary. Significance was set at 0.05 (2-tailed) for all analyses.
To analyze the TQR and the training load data, an ANOVA 8 × 2 (evaluated weeks × EG/CG) was performed, while a Friedman test (to assess the intra-group differences) with a Mann-Whitney U test (to assess the inter-group differences) was chosen for the nonnormally distributed data. The effect sizes for the repeated measurements using ANOVA were calculated as partial eta squared (η 2 p), using the small = 0.02, medium = 0.13 and large = 0.26 interpretation for effect size [83]. The effect size for the Mann-Whitney U test and Wilcoxon test was calculated as Pearson's r, using the small = 0.1, medium = 0.3 and large = 0.5 interpretation of the effect size [84]. Moreover, the effect size for the Friedman analysis was calculated using Kendall's W (W), using the small = 0.1, medium = 0.3 and large = 0.5 interpretation of the effect size [84]. All data analysis was conducted using the statistical packages for social sciences (SPSS version 21).

CFT
All CFT tests results are shown in Table 1, while the CFT's reliability data and the CFT's RPE data are shown in Appendix A Tables A1 and A2. CFT was found reliable for all its variables, and RPE measured in each test of CFT did not show any significative difference in between and within group analysis (p > 0.05).

IME
The index of motor efficiency test showed a significant interaction group by time (p = 0.008, η 2 = 0.141), a significant main effect of time (p < 0.001, η 2 = 0.351) and no significant main effect of group. Follow up tests revealed that, while the control group did not change over time from pre to post and from post to retention, the experimental group showed a significant increase in the test from pre to post (p < 0.001) and from pre to retention (p < 0.001), while no significant differences were found from post to retention, although there was a trend toward significance (p = 0.074). The analysis of the deltas showed a significant difference between groups regarding delta of the pre-post test (p < 0.001) and a trend to significance in the post-retention test (p = 0.052). Figure 2 shows the main results of IME.

IME
The index of motor efficiency test showed a significant interaction group by time (p = 0.008, η 2 = 0.141), a significant main effect of time (p < 0.001, η 2 = 0.351) and no significant main effect of group. Follow up tests revealed that, while the control group did not change over time from pre to post and from post to retention, the experimental group showed a significant increase in the test from pre to post (p < 0.001) and from pre to retention (p < 0.001), while no significant differences were found from post to retention, although there was a trend toward significance (p = 0.074). The analysis of the deltas showed a significant difference between groups regarding delta of the pre-post test (p < 0.001) and a trend to significance in the post-retention test (p = 0.052). Figure 2 shows the main results of IME.

SMT
The shoulder mobility test showed a significant interaction group by time (p < 0.001, η 2 = 0.243), a significant main effect of time (p < 0.001, η 2 = 0.299) and no significant main effect of group. Follow up tests revealed that, while in the control group the shoulder mobility did not change over time from pre to post to retention, the experimental group showed a significant difference in the test from pre to post (p < 0.001) and from post to retention (p = 0.017). The analysis of the deltas showed a significant difference between groups regarding delta of the pre-post test (p < 0.001) and post-retention test (p = 0.034). All participants reported a value of SIS scale of 100, which corresponds to the maximum stretching without pain as requested.

SRT
In the chair sit and reach test, a significant difference with the Friedman test (χ 2 (2) = 13.164, p < 0.001, W = 0.32) was found in the experimental group among the three different conditions (pre, post and retention). Follow up tests with Wilcoxon signed rank tests showed that there was a significant difference between pre and post conditions (Z = −3.024, p = 0.002, r = 0.67) and between post and retention condition (Z = −3.182, p = 0.001, r = 0.71). In the control group, no significant difference was found with the Friedman test. Mann-Whitney U test at post intervention showed no significant difference between groups.
The analysis of the deltas showed a significant difference between groups regarding the delta of the pre-post test (Z = −3.981, p < 0.00,1, r = 0.62) and a significant difference in the post-retention test (Z = −2.217, p < 0.027, r = 0.62). All participants reported a value of SIS scale of 100, which corresponds to the maximum stretching without pain as requested.

SUT
In the sit-up test, a significant difference with the Friedman test (χ 2 (2) = 10.107, p = 0.006, W = 0.25) was found in the experimental group among the three different conditions (pre, post and retention). Follow up tests with Wilcoxon signed rank tests showed that there was a significant difference between pre and post conditions (Z = −3.268, p = 0.001, r = 0,75), while no significant difference was observed between post and retention conditions. In the control group, no significant difference was found with the Friedman test. Mann-Whitney U test at post intervention showed no significant difference between groups.
The analysis of the deltas showed a significant difference (Z = −2.427, p = 0.015, r = 0.38) between groups regarding the delta of the pre-post test. No significant difference was found for the post-retention test.
The RPE measured for the sit-up test at pre, post and retention did not show any significant differences over time and between groups.

PUT
In the push up test, a significant difference with the Friedman test (χ 2 (2) = 11.414, p = 0.003, W = 0.33) was found in the experimental group among the three different conditions (pre, post and retention). Follow up tests with Wilcoxon signed rank tests showed a significant difference between pre and post conditions (Z = −3.361, p = 0.001, r = 0.75), while no significant difference was observed between post and retention conditions. In the control group, a significant difference was found with the Friedman test (χ 2 (2) = 17.492, p < 0.001, W = 0.51). Follow up tests with Wilcoxon signed rank tests showed a significant difference between pre and post conditions (Z = −3.420, p < 0.001, r = 0.76), while no significant difference was observed between post and retention conditions. Mann-Whitney U test at post intervention showed no significant difference between groups.
The analysis of the deltas showed no significant differences between groups regarding the delta of the pre-post test and post-retention test.
The RPE measured for the push up test at pre, post and retention did not show any significant interaction or main effects.

RT
The Ruffier test did not show any significant interaction group by time or main effect of group. However, a main effect of time was detected (p < 0.001, η 2 = 0.404). The Ruffier index decreased from pre to post (p < 0.001) with no differences between groups; however, it did not change between post and retention.
The analysis of the deltas showed no significant differences between groups regarding the delta of the pre-post test and post-retention test.
The RPE measured for the Ruffier test at pre, post and retention did not show any significant interaction or main effects.

IPAQ
No differences were found with the Friedman test in both groups for the physical activity questionnaire in any of the analyzed variables: total met, sedentary behavior during the weekend, sedentary behavior during the week, and light, moderate and vigorous physical activity. However, the Mann-Whitney U test revealed a significance for retention's total Met values (Z = 1.986, p = 0.049, r = 0.31) and for post-retention delta's light activities (Z = 2.222, p = 0.026, r = 0.35). All IPAQ data are shown in Table 2.

NASA-TLX
No significant interactions or main effects were found for temporal demand, frustration and the total score that summarized the total workload.
The physical demand increased significantly (experimental: Z = −2.894, p = 0.004, r = 0.65; control: Z = −2.897, p = 0.004, r = 0.65) from pre to post uniformly with no differences in the groups. The Mann-Whitney U test revealed that there is no significant difference between groups at pre and at post. The delta analysis did not show any significant differences between the groups.
For the performance scale, Wilcoxon signed rank tests showed a significance difference between pre and post for the control group (Z = −3.508, p < 0.001, r = 0.78), while no differences were reported for the experimental group. The Mann-Whitney U test revealed a significant difference (Z = −2.192, p = 0.028, r = 0.34) between the groups at pre-test, while no significant difference between groups was detected at post-test. The delta analysis showed a significant difference in performance (Z = −2.882, p = 0.003, r = 0.46) between the two groups.
For the mental demand scale, Wilcoxon signed rank tests showed a significance difference between pre and post for the experimental group (Z = −2.913, p = 0.004, r = 0.65), while no differences were reported for the control group. The Mann-Whitney U test revealed a significant difference (Z = −2.041, p = 0.047, r = 0.32) between the groups at post-test, while no significant difference between groups was detected at pre-test. Delta analysis showed a trend toward significant difference (Z = −1.726, p = 0.087, r = 0.27) between the two groups.
For the effort scale, Wilcoxon signed rank tests showed a significance difference between pre and post for the control group (Z = −3.463, p < 0.001, r = 0.77), while no differences were reported for the experimental group. The Mann-Whitney U test revealed a significant difference (Z = −2.663, p = 0.007, r = 0.42) between the groups at post-test, while no significant difference between groups was detected at pre-test. The delta analysis showed a significant difference in effort (Z = −3.246, p = 0.001, r = 0.51) between the two groups. Figure 3 shows the effort and mental demand results. All NASA-TLX data are shown in Table 3.

SF-12
The health questionnaire did not report any significant interaction (group by time) or main effects for the physical component (PCS). Regarding the mental component, a trend to interaction was found (p = 0.087, η 2 = 0.070) and a significant main effect of time (p = 0.03, η 2 = 0.154) was reported. The delta analysis showed a significant difference in the mental component (MCS) (Z = −2.102, p = 0.036, r = 0.33) between the two groups. SF-12 data are shown in Table 3. ences were reported for the experimental group. The Mann-Whitney U test revealed a significant difference (Z = −2.663, p = 0.007, r = 0.42) between the groups at post-test, while no significant difference between groups was detected at pre-test. The delta analysis showed a significant difference in effort (Z = −3.246, p = 0.001, r = 0.51) between the two groups. Figure 3 shows the effort and mental demand results. All NASA-TLX data are shown in Table 3.

Accelerometers
The accelerometers data are shown in Table 4. No significant differences among weeks and for both groups were found for sedentary, light and vigorous minutes of exercises. No significant difference was found for the control group for moderate exercise among the four detections. A significant difference was found for moderate minutes of exercise in the experimental group among the four detections with the Friedman test (χ 2 (3) = 18.200, p < 0.001, W = 0.30). Follow up tests with Wilcoxon signed rank tests showed that the first detection was significantly different from detection 2 (Z = −2.215, p = 0.027, r = 0.50), detection 3 (Z = −2.605, p = 0.009, r = 0.58) and detection 4 (Z = −2.668, p = 0.008, r = 0.60), while detection 2 was significantly different from detection 3 (Z = −2.012, p = 0.044, r = 0.31) and detection 4 (Z = −1.961, p = 0.050, r = 0.31).
No significant difference was found between detections 3 and 4. The Mann-Whitney U test did not show any significant inter group difference between the experimental and control groups for sedentary, light and vigorous minutes of exercise. However, in the moderate minutes category, a significant difference between the groups was found at detection 2 (Z = −2.336, p = 0.018). No other group differences were found in any other detection. 7.3 ± 8.8 4.2 ± 5.5 EG = experimental groups; CG = control group. Detection 1 represents the pre-assessment; detection 2 represents the first measurement during the intervention; detection 3 represents the second measurement; detection 4 represents the third measurement during the intervention. * = significant p-value (<0.05) in between groups analysis (EG vs. CG); # = significant p-value (<0.05) in within group analysis (difference with detection 1); § = significant p-value (<0.05) in withing group analysis (difference with detection 2).  Table 5.

Training Diary
From Table 6, it is possible to extrapolate that most of the exercises performed belonged to cardiorespiratory fitness (80% of the weekly activity), followed by muscular (9% of the weekly activity) and combined (9% of the weekly activity) fitness, while the less approached was the flexibility fitness (2% of the weekly activity). Nevertheless, flexibility fitness was practiced more inside the office than outside (59% vs. 41%), even without significant differences, while all other typologies of fitness were practiced more outside than inside the office (cardiorespiratory: 32% inside office, 68% outside; muscular: 29% inside office, 71% outside; combined: 25% inside office, 75% outside). Data are shown as mean ± standard deviation. PT's Detection 1 corresponds to the first measurement performed with the accelerometer during the second phase (detection 2); PT's detection 2 represents the second measurement performed with the accelerometer (detection 3); PT's detection 3 represents the third measurement (detection 4). The PT measurements shown represent the minutes of activity perceived as light, moderate and vigorous and reported in the App UP150 by the EG. * = significant p-value (<0.05) in between groups analysis (PT vs. accelerometer). Data refer to the mean weekly number of minutes and points accumulated. The table shown data refer to physical activity performed inside the office (inside office), outside the office (outside office) and the total physical activity obtained summed the inside and the outside office activities. * = significant p-value (<0.05) in between groups analysis (inside office vs. outside office).

TQR
No significant interaction or main effects were reported for the total quality recovery scale. All TQR data are reported in Appendix A Table A3.

Training Load
No significant interaction or main effects were reported for the training load. All training load data are reported in Appendix A Table A3.

Cubo Fitness Test
Based on the present data and comparing with the previous literature, CFT data were found reliable according to a previous research [58]. No differences were found in RPE values measured after each test of CFT, confirming the accuracy of the CFT instrument in performing measurements.
Analyzing the test results, CFT shows the efficacy of the intervention in the improvement of the level of IME. The EG passed from a low level (29.4 ± 13.7 a.u.) to a medium level (43.0 ± 15.6 a.u.). Consequently, it is reasonable to assert that the intervention has demonstrated efficacy in improving the EG's general physical condition according to the multi-factorial intervention proposed by Maylor and colleagues [54].
As explained previously, the IME is the final CFT score, composed by the sum of the individual submaximal tests' scores, and it is necessary to analyze the components that contributed to the overall improvement. The intervention significantly increased the levels of flexibility fitness (SMT and SRT) and muscular fitness (SUT). Considering flexibility fitness, the intervention improved both SMT and SRT in the EG, while the CG maintained the initial levels of flexibility. This effect could be due not only by the time spent performing flexibility exercises, but even by the concomitance of more factors, such as combined fitness (where a combination of muscular and flexibility fitness occurred) and muscular fitness [85]. In particular, muscular fitness has been demonstrated to have efficacy in improving flexibility when different muscle groups are involved alternatively [85]. Indeed, during the intervention, employees could freely approach different muscular exercises in the office and combined fitness permitted to increase the total amount of time spent performing activities related to flexibility fitness.
Analyzing the results of muscular fitness, the SUT showed an improvement in abdominal strength only for the EG. In the literature, it is well explained how the specific exercise of sit-up offers better results in the sit-up test than other exercises (curl-up or core stability exercises) [86,87]. From this point of view, the improvement in the SUT could be explained by the specificity of some exercises performed by the EG; indeed, sit-ups were one of the exercises proposed to the employees of the EG during the office intervention.
Differently, the results of PUT reported an increase in upper-limb muscular fitness for both groups. It should be noted that the intervention started soon before the end of the Italian lockdown restrictions for the SARS-CoV-2 pandemic (the intervention started 7 May 2021, while the main restrictions finished 24 May 2021). The reopening of gyms and sport centers could have permitted the improvement in some kind of fitness levels even in the control group. In particular, the push-ups could be defined as a commonly prescribed exercise that were recommended even to the computer worker population during the COVID-19 quarantine [88]. For this reason, PUT could have been influenced by a "background noise" that could have hidden the intervention efficacy.
Even if both the EG and CG improved their cardiovascular fitness results, the intervention was not able to create differences between the two groups. The lack of differences could be explained by the duration of the intervention and the intensity of the performed exercises. A meta-analysis of Boulè and colleagues [89] considered eight weeks of cardiorespiratory training intervention as the minimum inclusion criteria. Nevertheless, the duration of the intervention program of the present research seems not enough to see more significant differences in the cardiorespiratory fitness of the EG employees. Even in this case, the lack of a significant difference between the two groups could be due, as described for PUT, to the end of the lockdown, which could have motivated even the CG to engage in more physical activity after a prolonged period of restrictions.
Nevertheless, data presented in the researches of Branch and colleagues, and Dunn and colleagues [90,91] showed that, at moderate intensity, marked results in cardiorespiratory fitness could be obtained after a wider period of training (12-24 weeks); considering the current pandemic situation, extending the period of intervention of the present study could bring more marked results.
Analyzing the retention effects, the employees of EG maintained the new acquired level of IME. Analyzing the results of the five submaximal fitness test, the EG retained the values of RT, PUT and SUT, while the values of SMT and SRT decreased. The results seem to agree with the research conducted by Chen and colleagues [92] that showed how different types of exercises could provide different effects after an intervention of 8 weeks and a successive rest of 4 weeks. In particular, the resistance and aerobic exercises seem to be efficient in maintaining positive effects after a period of interruption of the activities (the considered intervention included 2 days of training per week). The loss of flexibility could find explanation in the interruption of the stress-reduction effect promoted by the intervention [93].
Indeed, the results of our study demonstrated that the intervention reduced the perception of mental demand and improved the perceived mental health; all these factors could be considered as clues of the anti-stress effect of the intervention [93,94]. The absence of the intervention during the 4 weeks that followed the study could have contributed to increase the related muscular tone stress, as shown in a research based on visual display workers [95] and, consequently, could have reduced flexibility fitness.

Questionnaires
The proposed questionnaires highlight a positive effect of the UP150 on the mental psycho-physical components. Based on the significances found in the effort variable of the NASA-TLX, it can be concluded that the amount of practiced physical activities and, more in general, the intervention had a contribution to maintain the initial level of working effort perception conversely to CG, which increased the weekly working effort perception in the last period of training. This could be caused by the coping effect of the physical exercise during working hours [96,97]. This is supported by the research conducted by Wandel and Roos [96] who showed that physical activity could represent an effective coping strategy, especially for high position workers. This is confirmed by the lower level of working mental effort found in the EG in the last days of the experimental period. Moreover, the trend evidenced by the SF-12 seems to follow the line drawn by the NASA-TLX's results; indeed, the MCS-12 showed a better improvement in the EG's mental condition, confirming the positive effect of the experimental procedure.
The IPAQ did not show significant differences between the two groups in pre-and post-training conditions. The result seems to be contradictory with the higher number of minutes spent performing moderate physical activity recorded by the accelerometers in the EG. This lack of consistency between the accelerometers data and IPAQ datahas been previously seen by Dyrstad et al. [98]. More specifically, Dyrstad highlighted that, in the IPAQ questionnaire, the participants reported less sedentary time, less moderate intensity and a higher level of vigorous intensity physical activity than what was measured by the accelerometers.
Concerning the retention effect, despite an absence of significant intra-group differences in IPAQ, the results showed a difference in total Met values in favor of the control group, and the delta analysis seems to show that this difference could be due to a difference in walking activities [92]. During the experimental interruption, the participants did not receive any indication about the behavior to maintain concerning physical fitness and this might have caused the difference in outdoor activities [92]. It is necessary to mention that, in this period, the experimental group was not allowed to interact with any experimental procedure. Nevertheless, the employees of the EG were able to maintain the newly acquired levels of physical fitness during the retention test as evidenced in the previous paragraph.

Accelerometer
The analysis performed with the accelerometers underlines in EG a significant increase in minutes spent performing moderate physical activity and a decreasing trend of minutes spent in sedentary behavior. Even without significance, the EG evidenced a decrease of 5% of sedentary activities (from 84% to 79% of the total monitored time), while the CG did not show a similar trend (from 82% to 81%). Comparing the obtained data with the literature, it is possible to assess that the EG started with a mean time spent in sedentary behavior of 8.14 ± 0.8 h/day, considered as a range of increase in mortality risk (from 7.5 to 9.0 h/day) by WHO guidelines [33], and finished the experimental procedure with 7.8 ± 0.9 h/day, moving gradually away from the risk range.

TQR and Training Load
The lack of significances in TQR analysis seems to indicate that both of groups experienced similar recovery conditions, which corresponded to a "reasonable recovery". The same can be asserted for the training load, which presented the same trend for both groups for the entire experimental period.
This last dataset appears to be at odds with the difference found in the effort values of the NASA-TLX, but it is necessary to specify that the NASA-TLX, as expressed by its protocol, is a self-reported questionnaire specifically that refers to the working task of the employee [74], and does not consider the physical fitness activity performed during working hours. Conversely, the training load was an evaluation of the general effort experienced during the working day which includes physical activity.

App UP150
The App UP150 permitted to collect data about the physical activity performed by the EG during the entire experimental period. In this way, it was possible to analyze and discuss the typology of fitness practiced, the reaching of the target score and the time spent conducting physical activity.
The app showed that all participants were able to reach the target score set by the CFT. Moreover, it is important to underline that about 75% of the target score (144.2 ± 74.4 a.u. of 191.9 ± 29.7 a.u.) was reached in the workplace during working hours. This could represent an important goal in this new workplace concept; physical activity was able to efficiently fit in the workflow, reaching the important percentage of recommended weekly physical activity [33]. The mean of the participants of the EG performed more than 350 min of physical activity during the week (summing the office physical activity and the outside office physical activity reported in the TD). Indeed, considering the accelerometer outcomes, it is possible to notice that the experimental group not only had an increased trend of moderate physical activity during the 8 weeks of intervention from 307.8 to 425.4 min (an increase of 3%), but even overpassed the minimum amounts of moderate minutes recommended of moderate physical activity. Nevertheless, it is possible to notice that the intensity of the activities perceived using the PT differs considerately from the intensity recorded by the accelerometers. The employees tended to underestimate the moderate physical activity, while tending to overestimate the light and vigorous physical activity.
Moreover, the results show that the participants of the EG interpretated most of the moderate intensity as light intensity. This phenomenon could be due to an easier perception of personal lower and upper bounds of effort, caused by the more shared meaning of 'no effort' and 'maximal effort' [99] during physical activity, while the different variations in effort could be more difficult to perceive. Another factor that could have contributed to increasing the difference between perceived and measured intensity of physical exercise is motivation. Based on Brehm's motivational theory, if the task is perceived as adequate, the potential motivation that concerns the activity increases [44,45]. The increase in the employees' motivation can influence the perception of practiced physical activity [44,45] and, as shown in the previous presented data, it is possible to notice that the minutes measured as moderate are more often perceived as light than vigorous.

Potential Limiations
The quarantine imposed to control the outbreak of SARS-CoV-2 could have influenced the participants' lifestyle and consequently some results. Therefore, it is necessary to investigate the efficacy of the UP150 during a normal period, outside the restrictions imposed by the pandemic. Moreover, the small sample size did not permit to compare the effect of the intervention on females and males separately.

Practical Applications and Future Perspective
Due to the modification of the classic workplace concept, even caused by the current pandemic crisis, the office needs to change old schemes, adapting to new working modali-ties that include both in-presence and smart working [4]. The office intended as a simple physical space of work must change to follow the employees' needs of organizational elasticity (both in-presence and smart working) and psycho-physical wellbeing [4]. From this new perspective, architectural changes could help the employees follow good practices for healthy behavior, facilitating the engagement with physical activity moments [100].
Moreover, the inclusion of a new professional figure in the form of the wellness coach could represent not only a facilitator of physical activity, but even a favoring element of social relationships that could have a positive impact on the working environment [101]. The technology must link the office (intended as physical space) with individual motivation (psychological level) and with opportunities of physical activities (fitness level) based on self-perception and regulation of effort. These three elements must be adaptable to the new working situation, switching from the in-presence to the smart-working modality and supporting the employees to develop healthy behaviors, even outside the working context.
In future investigations, a larger sample size could help to strengthen the effects of the experimental protocol.
Future research could focus the investigation on the longitudinal effects of the UP150 on the same proposed variables and could also prove the effectiveness of cardiorespiratory fitness and give better results in all other considered physical fitness categories [91]. Moreover, it could be useful to implement and verify the efficiency of new physical fitness stations linked to workflow moments or to active breaks, and the impact of the entire protocol on employees' illness and stress related absenteeism [35].

Conclusions
The UP150 workplace intervention, based on architectural, technological, physical, and methodological components, seems to be efficient in the promotion of physical activity and an active lifestyle. In particular, the UP150 improved the employees' index of motor efficiency, increasing flexibility fitness and part of muscular fitness. Considering the proposed questionnaires, the intervention decreased the work-based mental demand, maintained a fair level of working stress-related effort, evaluated with the NASA-TLX, and improved mental health, evaluated with the SF-12. Furthermore, the experimental procedure increased the number of moderate minutes of physical activity practiced during the working week, reaching and overpassing the minutes recommended by the literature.