1. Introduction
The COVID-19 pandemic impacted the lives and well-being of people worldwide. Early in 2020, the COVID-19 pandemic disrupted the normal lifestyle of people around the world, due to government-issued lockdowns which required people to spend significant time sheltering inside their homes. Countries instituted stay-at-home orders that resulted in transitioning from traditional in-person schooling and work to remote modalities, including travel bans, non-essential workplace closures, and restricting public gatherings. Restrictions caused social isolation, affecting mental and physical health [
1]. There is evidence that the psychological impact of the pandemic was strongly negative and lasted throughout the entirety of the 2020 pandemic year and continued to decline even after the pandemic was brought under better control [
2]. Additionally, there is concern that the sustained loss of access to parks and fitness centers might have resulted in an increase in sedentary behavior [
3]. After the pandemic was brought under better control, towards the end of 2020, the lifting of restrictions resulted in a return to normality in many aspects. Reopening public parks and fitness centers has allowed for a return to prior physical activity (PA) behavior, however, many factors such as masking requirements, fear of contracting the virus, and capacity limitations likely discouraged some people from returning to normal behavior. Furthermore, there are open questions regarding how long-term human behavior was affected from the pandemic.
Physical activity provides a range of benefits for overall well-being including alleviating stress and depression, maintaining a healthy body composition, improving sleep quality, and mitigating certain types of disease. Investigating the impact of the pandemic on physical activity behavior could provide valuable information for researchers, public health officials, policymakers, and employers to better understand and promote healthy PA during the remainder of this pandemic, future pandemics, and during normal, pandemic-free times. Furthermore, this pandemic provided an unprecedented opportunity to use sensing technology not available in previous pandemics to collect data on how people responded to conditions caused by the pandemic. This large-scale digital data on exercise habits holds potential to elucidate the policy, environmental, and personal influences that drive healthy lifestyle choices [
3,
4].
A majority of previous research on PA during the COVID-19 pandemic found that PA generally decreased [
5,
6]. These studies used a variety of data sources including survey questionnaires, digital step-counts, and internet search trends, and have yielded inconclusive insights into PA levels during the pandemic due to the heterogeneity of data sources, types of PA tracked (i.e., step-counts, active minutes, intensity of exercise), and subject population [
5,
6]. Stockwell et al. published a systematic review in early 2021 that included 45 papers reporting the changes in PA from before to during the COVID-19 pandemic; only four of these papers reported device-based measures and the rest based on questionnaires [
5]. Most research on PA uses self-reported survey data, which is subject to recall bias compared with objectively-measured activity data [
5,
7,
8]; furthermore, subjects are more likely to over or underestimate their PA when self-reporting [
9,
10]. Detailed interpretation of PA from questionnaires are discouraged [
9]. Objectively-tracked PA data is more sensitive than surveying [
9] and thus the literature highlights the need for more behavioral research based on objective measures in order to provide direct accurate comparisons [
5,
6,
9,
10]. There remains a large gap between questionnaire-based studies and objective studies using activity tracking devices.
Objective data can also be misleading because it can fail to capture true activity. Studies using smartphone activity-tracking data found that both step-counts and active time decreased during the 2–3 months following initial lockdowns [
11,
12]. While this research indicates that step-counts may have decreased, big data analyses suggest an increase in other measures of PA [
13,
14,
15,
16]. Fitbit, a large wearable device company, reported that overall steps were down from 2019 to 2020, however, active minutes increased for 42% of Fitbit users from 2019 to 2020 [
15]. Strava, a leading exercise-recording platform that reached 73 million users in 2020, reported that from 2019 to 2020 global activity counts increased 33% and global outdoor running activities increased 90% [
16]. In summary, while multiple sources of data suggest that step-counts declined, big data analysis from wearable device platforms suggests that people were logging dedicated exercise efforts more frequently than prior to the pandemic, perhaps in an effort to compensate for reduced daily-steps.
There are several studies examining a particular country’s exercise habits during initial lockdown periods [
11,
17,
18]; or evaluating exercise habits of particular groups like disabled adults [
19,
20], older adults [
21], working parents [
22]; or using survey data to compare between active and infrequent exercisers, [
23,
24]. Ding et al. reports survey results spanning 11 countries and 11,775 subjects, showing that residents of countries with more stringent COVID-19 response policies were more likely to be insufficiently active according to public health recommendations [
18]. While these studies are useful for understanding PA in particular groups, there is limited research directed at investigating governmental pandemic policies in coordination with physical activity in multiple countries. Research leveraging large objective datasets will improve understanding of PA during the COVID-19 pandemic.
In this work, we conduct a exploratory data analysis of wearable device data recorded internationally over a two-year period including a pre-pandemic year (2019) and a pandemic year (2020), using data obtained from the running wearable device company Stryd (
www.stryd.com; accessed on 6 October 2022). Our work includes descriptive and statistical analyses and elucidates the PA behaviors of dedicated runners, including time of exercise and activity levels, and compares them between years, countries, and severity of government COVID-19 containment policies. Specific contributions of this work include the following:
This work employs objective activity-tracking data from a wearable device to provide a direct comparison of running exercise behavior of dedicated runners between a pre-pandemic year and a COVID-19 pandemic year.
We elucidate how PA changes in dedicated runners at the country level and provide descriptive comparison between countries and pre-pandemic to pandemic years. We discuss specific pandemic policies enacted by countries and the observed PA of runners.
We show that from the pre-pandemic to the pandemic year there were several large and significant shifts in the time of day that runners log activity, and we break these patterns down by country.
2. Materials and Methods
2.1. Description of Wearable Device
Running activity data was provided by the company Stryd, which markets a footpod for runners and maintains an online analytics dashboard and smartphone application for users to track their training. The Stryd footpod attaches to shoelaces and can be connected via Bluetooth to a smartphone or fitness watch.
Stryd’s footpod is not used for continuous daily tracking; it is used only for in-run tracking. Stryd’s footpod costs $219 in early 2020 and is marketed towards dedicated runners with an interest in technically-detailed fitness data. Due to its price-point and marketing, Stryd users tend to be dedicated amateur runners, competitive runners, college, semi-professional, and professional runners, or triathletes. As such running is likely to be among the highest priority types of exercise for these users.
2.2. Data Cleaning
We obtained anonymized data collected from 1 November 2018 through 1 January 2021 according to Coordinated Universal Time (UTC). All activities represent runs and include fields such as unique user IDs, timestamps, distance traveled, elapsed time during the activity, average speed, local temperature, and starting Global Positioning System (GPS) coordinates. Some activities lack GPS coordinates due to setting configurations or if performed on a treadmill. We clean the data as shown in
Figure 1. Initial cleaning was performed, including dropping rows with blank usernames or without valid identifiers. A sample of the first author’s personal data is displayed in
Table S1 in Supplementary Materials.
For our analysis, only users who had logged a run on or prior to 3 January 2019 (UTC) were kept, all others were dropped. Near world-record caliber activities were dropped (0.78% of activities), defined as having average speeds exceeding 11 m/s, max speed exceeding 12 m/s, or having a combination of individual activity distance greater than 5000 m with an average speed greater than 6.4 m/s.
GPS coordinates were used to map activities to country and time zone with the Python packages timezonefinder and reversegeocode. Timestamps were converted from UTC time zone to the local time zone. 18.3% of 2019 activities and 0.7% of 2020 activities were without GPS data, due to advances in the Stryd platform’s compatibility and also perhaps due to decreased indoor exercise during COVID-19. To handle this difference in the number of activities lacking GPS data between 2019 and 2020, non-GPS runs were assigned to the country and time-zone where the majority of the individual’s runs with GPS data took place. Users without a single majority country over the two years and users with different majority countries between 2019 and 2020 were dropped from the analysis.
Users who had logged at least one activity before 3 January 2019 and at least 20 activities in each of 2019 and 2020 were included. After the cleaning steps, over 7000 unique users remained in our dataset. For the country specific analysis we select the 14 countries with the most Stryd users. The number of users within the 14 countries that we analyzed ranges from over 90 users to over 2000 users.
2.3. Analysis of Exercise Behaviors
2.3.1. Activity Counts
In this paper, activity counts are defined as the number of activities recorded in Stryd’s database. We examine activity counts for different time periods and countries.
2.3.2. Government Policy Strictness
Government policy strictness was quantified using Oxford’s Government Response Stringency Index (GRSI), a time-series government policy strictness index for individual countries capturing the harshness of several virus containment strategies including school, workplace, and public transport closures, cancellations of public events and gathering restrictions, stay-at-home requirements, restrictions on internal and international travel, and public information campaigns [
25]. The GRSI is on a scale of 0 to 100, with 0 indicating a lack of pandemic policy, and can be directly compared across countries. Our analysis employs the GRSI in order to explore physical activity changes in the pandemic year that correspond with major policy changes.
2.3.3. Activity Counts by Country
We selected 14 countries that are highly represented in our dataset. First, we examine total activity counts by year and country. The World Health Organization (WHO) declared COVID-19 as a global pandemic on 11 March 2020. To control for seasonality, we compare and report the change in activity counts from 2019 to 2020 using the 67-day periods before and after 11 March. Since 2020 is a leap year, we remove 29 February data from the analysis to keep the length of the time periods equal. We also plot the distribution of the number of activities logged in a year by each user, broken down by country and pre-pandemic and pandemic year.
We also conduct a time-series analysis of activity counts. We reported normalized activity counts by day and country. Normalized activity counts were calculated by dividing the number of activities that day in the given country by the number of users in the country. This normalization allows activity counts to be compared between countries, since the number of users differed in each country. Finally, a 14-day rolling average was applied to the activity counts to smooth out variations due to weather and weekday.
2.3.4. Time of Day That Athletes Run
Time of day of activities was calculated using GPS coordinates to identify the respective time zones and extract local time. To examine pandemic-related behavior, we restricted this analysis to data recorded after March 11th in both years, since 11 March 2020 was the WHO pandemic declaration date.
Activities were partitioned into one-hour bins for each hour of a 7-day week. Activity counts were normalized by dividing each bin activity count by the largest bin activity count. Then, the difference in activity counts in 2019 and 2020 were plotted for each bin.
Next, we defined three time periods: before work (12 AM to 8 AM), during standard working hours (8 AM to 5 PM), and after work (5 PM to 12 AM). These time periods are limited in that they do not represent all job types. Activity counts for each time period on each day were calculated then normalized equally by dividing by the largest bin size.
We hypothesized that the mass shift to working- and schooling-from-home provided more schedule flexibility. We conducted 2-tailed student’s t-tests paired by user to test for the difference between mean activity counts per time bin per day between 2019 and 2020. A test is conducted individually for each of the time bins on each day, comparing the binned activity counts of each user between years. This test is also conducted for each of the 14 selected countries, individually. We use Python’s SciPy package for this statistical analysis.
4. Discussion
We used objective wearable device data for a retrospective analysis into physical activity behaviors of Stryd users (dedicated runners) prior to and during the COVID-19 pandemic. Currently, the body of research on PA during COVID-19 relies heavily on data collected via surveying people about their past PA habits and via objective device-tracking of step counts. Both can be very misleading, with survey data subject to recall bias and exaggeration, and step counts subject to user nonwear for dedicated athletic activities, such as playing soccer, climbing, or running where a user does not carry their mobile device [
5,
6,
9,
10]. Our analysis provides insight into a unique group of active individuals who purchased a footpod for the purpose of tracking more detailed information on their runs and overall running fitness. As such, we expect that running is a high priority activity for users, and they are also likely to be diligent about wearing the footpod for runs. Furthermore, as a retrospective analysis, we eliminate sources of behavior bias that can result when subjects are aware of their behavior being studied.
Overall, our analysis revealed variations across countries in PA behaviors of more competitive runners. It is possible that the pandemic influenced this group in a different way than others and we cannot extrapolate these behaviors to other groups. Countries enacted different pandemic policies, with Italy, France, and Spain respectively enacting the strictest lockdowns out of the 14 countries examined. Interestingly, Italy and Spain had the largest decreases in activity during lockdown periods while France had a relatively minimal decrease in activity counts. Previous work found that countries with higher government pandemic stringency were more likely to be insufficiently active [
18]. France’s PA is notable and should be investigated further because France reached the second highest stringency index during their strict strict lockdown out of the countries we analyzed, but PA resembled the PA level of the pre-pandemic year. As discussed previously, France allowed outdoor exercise for residents close to their homes, while Spain and Italy did not allow public outdoor exercise.
Many countries experienced decreased PA during lockdown periods indicated by the spike in GRSI. Tison et al. examined smartphone step-counts during initial pandemic lockdowns and showed that Brazil, France, Italy, Japan, United Kingdom, and United States all experienced a decrease in steps comparing a pre-pandemic baseline on 11 February 2020 to every day in the period between 1 April and 1 June 2020 ranging from a 10% decrease to about a 47% decrease [
12]. While step-counts were down, likely due to adherence to lockdown measures, shifting to remote work, and public closures, our work suggests that step-counts my not accurately reflective overall PA. Our work suggests that dedicated runners in Japan, United Kingdom, and United States increased logged PA from before to during the lockdown. These three countries had region-dependent restrictions, but generally gatherings were restricted and Japan and United Kingdom both nationally only allowed solo outdoor exercise (
Table S3). Similarly, Venter et al. found an increase in recreational PA counts during initial lockdown using Strava data from 270,000 users logging runs, walks, hikes, and bike rides in Oslo, Norway [
28]. More studies need to utilize logged activity data to assess its use and investigate the differences from step-counts and questionnaire data for measuring PA.
Previous work suggests that there was an increase in people working from home during the pandemic and increased sedentary behavior [
5,
29]. In our analysis of exercise timing throughout the week, we found that during 2020, users logged significantly more activities within standard working hours on Monday through Friday weekdays compared with 2019, and significantly fewer activities on weekends during the working hour period. Venter et al. similarly found that more running, hiking, and walking activities were logged during daylight hours, and fewer were logged in the mornings and evenings corresponding to this definitions of before work, working hours, and after work [
28]. These effects may be due to the increased flexibility that came from pursuing work or education from home and the elimination of commuting. This may indicate that these times are preferred over times outside of working hours such as early morning or late evening, and it could also indicate that the reduced structure and increased flexibility of working at home may have contributed to decreased activity, or that people attempted to spread their exercise out over the day to avoid people and risk contracting the virus. Due to the large variation in pandemic workplace, school, public gathering, and other pandemic policy, COVID-19 case rates, health systems, and cultural and socioeconomic factors, it is not within the scope of this work to relate specific policies and cultural factors to observed PA behaviors within countries.
It is of widespread interest to determine whether working-from-home is a workplace policy that would promote healthy PA. Future work could investigate those who exercised outside of working hours before the pandemic and shifted to exercising during working hours during the pandemic and whether they experienced increased or decreased PA. Furthermore, future analysis could investigate whether activity increased from the pre-pandemic year of 2019 to later years of the pandemic, to reduce the affect of the initial shock and panic of the pandemic. Previous questionnaire-based research indicated that habitually active individuals experienced a decrease in PA, while less active individuals were more likely to have increased PA [
5,
17,
23,
24]. We are interested in whether this finding is supported by device-measured data, and if there is variation between countries.
This present study’s strengths include that it examines global PA and also the individual PA of 14 countries using objective data for direct comparison across countries and before to during the pandemic. Also, the study uses a large sample size of over 7000 subjects, reports analysis of logged physical activities instead of solely step-counts which most previous objective-tracking PA studies are limited by, and we cross-examine government pandemic response with PA levels. Limitations of this analysis are that it only explores the PA of dedicated runners and it cannot be extrapolated to other groups. Also, it cannot explain the changes in PA. Possible factors for decreased activity counts in 2020 are that the pandemic was an extremely stressful time due to economic instability, isolation, disruption of regular life, fear of contracting the virus, and government pandemic policy. Another explanation for lower activity logging is that there were widespread in-person race cancellations beginning in February, so users may have reduced their training due to race cancellations. The device strictly captures running activity of dedicated runners and misses other daily activity. Finally, our dataset may not have an accurate representation of socioeconomic groups who are less likely to own wearable devices.