Next Article in Journal
Communicating Sustainable Responsible Investments as Financial Advisors: Engaging Private Investors with Strategic Communication
Next Article in Special Issue
Immersive Robotic Telepresence for Remote Educational Scenarios
Previous Article in Journal
Forest Park Visitors Opinions and Willingness to Pay for Sustainable Development of the Germia Forest and Recreational Park
Previous Article in Special Issue
Evidence-Based Assessment of Student Performance in Virtual Worlds
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Visualizing Digital Traces for Sustainable Urban Management: Mapping Tourism Activity on the Virtual Public Space

1
Department of Architectural Representation, Barcelona School of Architecture, Universitat Politècnica de Catalunya, 08028 Barcelona, Spain
2
Department of Architectural Technology, Barcelona School of Architecture, Universitat Politècnica de Catalunya, 08028 Barcelona, Spain
*
Author to whom correspondence should be addressed.
Sustainability 2021, 13(6), 3159; https://doi.org/10.3390/su13063159
Submission received: 18 February 2021 / Revised: 9 March 2021 / Accepted: 10 March 2021 / Published: 13 March 2021
(This article belongs to the Special Issue Visual Technologies for Sustainable Digital Environments)

Abstract

:
One of the challenges of heritage cities is sustainably balancing mass tourism and the daily life of its residents. Urban policies can modulate the impact of tourism through regulations focusing on areas with outstanding visitor pressure, which must consequently be delimited accurately and objectively. Within a traditionally data-scarce discipline, urban practitioners can currently employ a wide range of tracking technologies, but because of their limitations can also greatly benefit from new sources of data from social media. Using Barcelona as a testbed, a methodology is presented to identify and visualize hot spots of visitor activity using more than a million public geotagged images collected from the Flickr photo-sharing community. Multiple complementary visualization approaches are discussed that are suitable for different scales of analysis, from global to sub-block resolution. The presented methodology is firmly grounded in a well-established spatial statistics framework, adapted to a “big data” environment, to extract knowledge from social media. It is designed to generalize to other urban settings, providing substantial advantages over other surveying methods in terms of cost-efficiency, scalability, and accuracy, while capturing the behavior of a larger number of participants and covering more extensive areas or temporal spans.

1. Introduction

1.1. Urban Planning and Sustainable Tourism Management

Since the mid-20th century, the number of people living in urban areas has steadily increased in absolute and relative terms [1]. With the majority of the world’s population living in urban areas, and projections indicating that over the next 30 years, most of the population growth will occur in cities, urban practitioners (planners and designers) require a deeper understanding of urban phenomena [2] and the mechanisms that drive their processes [3].
One of the challenges modern cities face is the management of mass tourism [4], which despite being capable of driving substantial economic growth [5], can hamper the daily life of their inhabitants [6] and negatively influence the experience of visitors themselves [7], as well as the environment [8].
However, the effects of these impacts are not distributed uniformly across the city [9], since the physical presence of visitors tends to concentrate on cultural heritage sites in city centers [10] while the effects on the residential market [11] or the environmental impact of the floating population [12] can affect other neighborhoods across the city.
Therefore, the accurate delimitation of outstanding visitor pressure areas is crucial for implementing policies that modulate these activities and keep them within their carrying capacity [13]. The taxation of specific uses, or subjecting tourism-related businesses to the procurement of a mandatory license within these zones, can sustainably balance local and visitor presence in the public space.

1.2. New Challenges in Urban Management

Urban planning and urban design shape the complexity of cities through building codes, urban regulations, or housing policies. While these disciplines have traditionally been data-scarce [14], the growing complexity of urban phenomena requires complementing traditional approaches [15,16] with data-driven methodologies [17], providing public authorities with actionable information [18] to respond to new challenges and guide urban policies [19].
In this context, urban planners and designers can significantly benefit from new sources of data from social media to complement long-established methodologies based on cadastral data [20]. However, this approach requires addressing several issues in the methodological framework [21], derived not only from the increased amount of data but also from the quality assurance necessary [22] because of the informal nature of these sources [23].
This paper analyzes the tourist activity from an urban management perspective using more than a million picture locations of Barcelona collected from the Flickr service [24]. The methodology focuses on the visualization aspects to extract knowledge from social media data effectively [25] as a communication tool to make its complexity interpretable intuitively for stakeholders, using a principled approach to data representation [26] and reproducibility [27].

1.3. Research Questions and Hypotheses

The main aim of our research is to investigate whether visualizing the digital traces of tourist activity can provide an advantage over established methods of tracking visitor behavior. The paper focuses on two main issues from a sustainable urban management perspective to understand the tourism activity using social media data: the origins of the visitors at a global scale and the location of the hot spots of visitor activity in the public space. The main hypothesis that supports research question is that the location of digital traces in the virtual space is a proxy that reflects the actual behavior of users in the physical public space, and in particular that picture-taking activity is closely aligned with visitor behavior.
A secondary research goal focuses on identifying the methodological aspects required to extract actionable knowledge from raw unstructured data from social media. The paper investigates the complete workflow developed in a case study, from the initial data collection, followed by the data cleaning and transformation, and finally the most effective visualization strategies to communicate the results to the intended audience. The main hypothesis of the secondary research goal is that a graphical summary can provide understanding to a non-expert audience, despite the size and complexity of the data sources.

2. Theoretical Framework

2.1. Capturing Human Spatial Behavior

The strategy of recording human activity in public spaces to support planning decisions—pioneered by Pushkarev [28] and Whyte [29]—has progressively percolated in the practice of urban planning and design [30], adopting a multifaceted approach [31]. This approach is critical to understand spatial behavior at the three main scales of analysis of the built environment: regional planning [32], urban planning and design [33], and architectural spaces in building interiors [34].
Recent technological advances in optical tracking [35] allow tracking behavior using cameras but raise privacy concerns, which can be mitigated using thermal imaging technologies [36] or motion sensing devices [37]. Other approaches use GPS technology [38] using handheld units [39], but require a sufficient number of volunteers that agree to participate in experiments [40]. The recruitment of participants for extended periods is also an issue in the emerging technologies of eye-tracking [41] and virtual reality simulations [42].
In contrast, the indirect tracking of spatial behavior can cover a broader temporal and spatial scope at the expense of reduced accuracy. The classic approach has relied on demographic [43] or transportation [44] statistics, but despite current advances in open data initiatives they are not universally applicable. Recent approaches aggregate data from cell towers [45], requiring ad hoc partnerships with telecommunications companies [46] at urban [47,48] or regional scales [49,50].
These technologies have been applied to tourism research [51], but can be expensive and time-consuming [52]. Moreover, they cannot be used in some research areas as they require designing and conducting an experiment, which is not always feasible, especially for retrospective or spatially extensive studies [53], in contrast to social media data [54].

2.2. Digital Traces, Picture Sharing and Tourism

Photography is inextricably associated with tourist activity, from the initial choice of destination browsing images on the Internet [55], through the motivations of the decision of taking a picture of oneself [56] or at a particular location [57], to the final selection of pictures posted online [58].
Current mobile devices are equipped with cameras, accurate GPS, and pervasive Internet connections, with user-friendly applications that allow users to post content on social networks. Since the introduction of the iPhone in 2007 and the generalization of smartphone use [59], the production and distribution of user-generated content of Web 2.0 services provide unique opportunities to gather data on user behavior at urban [60], regional [61], and global [62] scales.
Before geotagged media became pervasive, social media research in the tourism and hospitality field focused on travel planning from the consumer perspective, or the promotion of related services from the supplier perspective [63]. At the time of writing, location data from social media [64] can be mined from multiple platforms [65], and these emerging sources of data [66] can be used as a substitute for travel diaries [67] or replace resource-intensive field surveys [68].
The developed methodology focused on geotagged pictures collected from a photo-sharing community, as picture-sharing services typically provide a better location accuracy [69] than messaging platforms [70], which was required for the intended resolution of the analysis.

2.3. The Case of Barcelona

Promoted as a tourist destination since the beginning of the 20th century [71], and following several decades of international tourism expansion in Spain [72], Barcelona experimented an exceptional growth in the number of visitors [73] after the celebration of the Summer Olympic Games of 1992 [74] and is currently the 12th most visited urban destination in the world [75], holding the 6th position in the Airbnb top destination cities [76].
This sustained popularity as a tourist destination has situated Barcelona as an excellent candidate to test the proposed methodology, providing a significant volume of data for analysis as a thoroughly visited and photographed destination; according to Sightsmap and based on data from the discontinued Panoramio service, Barcelona was ranked as the third most photographed city in the world, while according to Flickr usage it was the 6th city in terms of the ratio of pictures of visitors versus its official population [77].
Moreover, from the perspective of its physical structure, the diversity of the urban fabric of Barcelona [78], where the historical evolution of the city has shaped a complex built environment [79] with a collage of different neighborhoods with their own distinct character and functional mix [80], has ensured that the developed methodology was suitable in a wide variety of urban morphologies and was capable of capturing the spatial dynamics of a rapidly changing city [81].

3. Data Collection

3.1. Retrieval of Picture Locations

Flickr is a photo-sharing community founded by the Canadian company Ludicorp in 2004, subsequently acquired by Yahoo in 2005, and owned by SmugMug since April 2018. The service allows users to share image content and receive feedback from its online community. Although it is possible to browse its catalog of public images without registration, the creation of an account is mandatory to upload content and use its social networking features.
The service supports storing spatial and temporal metadata if available, which are included by most location-enabled devices and stored in the Exchangeable Image File Format (EXIF) header of geotagged image files by default. Data can be accessed using a well-documented Application Programming Interface (API), which replicates most of the functionality available on its website through its entry point, which offers 222 methods at the time of writing.
Data collection was performed in March 2017 through the Flickr API using a custom script developed in the R programming language [82]. Using the flickr.photos.search method of the Flickr API, images were requested within a bounding rectangle encompassing the city limits of Barcelona in the WGS global reference system for geospatial information (EPSG:4326). The collected data consisted of 1,166,704 unique geotagged pictures and covered a period of 12 full years (from the beginning of 2005 to the end of 2016).
After parsing the 69 fields of the collected records in the JavaScript Object Notation (JSON) format, the coordinate pair of each geotagged picture was projected into the spatial reference used by the cartographic services of the city of Barcelona (EPSG:25831) with the maximum accuracy possible using the PROJ.4 library [83].
The locations outside the city limits, which included the points in the harbor within the Mediterranean Sea, were discarded with an overlay operation. Outliers that had the capacity to bias the results were excluded, in particular users with a disproportionate number of pictures (super-users) and coordinate pairs with an abnormal quantity of exactly coincident coordinate pairs, which were indicative of unreliable location precision. The resulting dataset contained 834,328 locations, a 28% reduction from the total points collected.
These raw point locations provided a detailed overview of the spatial distribution of the pictures publicly shared by Flickr users (Figure 1). However, the picture locations in this map did not provide information on whether the user who uploaded the picture to the Flickr service was a local or a visitor, making it unsuitable to identify the specific behavior of either group. In addition, while the map was capable of identifying the areas with a greater concentration of pictures, the large number of overlapping points could not provide an accurate representation of the true density in the most popular areas.

3.2. User Location Data

While it was possible to classify users into locals and visitors by analyzing their temporal behavior [84], this approach could incorrectly label a significant number of commuters from the metropolitan area as visitors, or conversely label recurring visitors taking advantage of short-haul low-cost flights as locals. To increase the accuracy of the classification, the information of the public profiles of the 34,283 users who posted the collected pictures was queried using the Flickr API, obtaining the time zone [85] for 72% of them using the flickr.people.getInfo method and, more importantly, the place of residence [86] of roughly 50% of the profiles, combining the information from the flickr.profile.getProfile and flickr.people.getInfo methods (Table 1).
The anonymity of these users was preserved at all times during research using only their Flickr-assigned unique identifier (NSID). While most users shared the same time zone as Barcelona (Figure 2a), the proportion of users with location data appeared constant across all time zones (Figure 2b). The maps indicated that the time zone corresponding to Barcelona had the largest number of users, which was consistent with an overrepresentation of locals or visitors from locations close to the destination. In contrast, the maps suggested that the proportion of users that provided location data in their profiles was consistent across all time zones.
This visual hypothesis was confirmed using the Cohen’s D statistic [87], confirming that there was not a statistically significant difference in user engagement (measured as the number of geotagged pictures uploaded) when the user’s profile included time zone information (p < 0.01) or when they included their location (p < 0.001), compared to users that provided no public data. It was therefore considered that there was not a consistent bias in the data.

4. Visualizing Global-Scale Data

4.1. Mapping Cities of Origin

The publicly shared location information collected from Flickr through its API consisted of unstructured text strings, with the name of the locality and/or the name of the country of residence of each user. This location information was not usable in the provided raw format, and it was necessary to extract the geographic coordinates corresponding to the named locations. This process required processing 10,908 unique user location strings with a custom script, obtaining 5693 unique geographic locations from which country and city of residence could be extracted and assigned to the corresponding users and—by extension—their pictures.
The coordinates of the user’s locations showed the spatial distribution of the visitors who posted a geotagged picture of Barcelona on Flickr, which were concentrated in the European Union (Figure 3), and in particular on the ‘blue banana’ European Megalopolis of the Manchester–Milan axis (which is well connected by land and air to Barcelona), but also on the East and West coasts of the United States of America.
While this visualization approach provided informative maps of the visitors’ locations, its interpretation can be challenging for stakeholders, because the spatial distribution of settlements is not uniform across all areas of the world and can suggest a higher concentration of visitors from areas where the population resides in small towns, and conversely underestimate areas with fewer but more populated cities. It also requires significant geographic knowledge to interpret the maps and extract meaningful conclusions.
Furthermore, the representation of locations as dots in a map makes overlapping locations in densely populated areas challenging to identify, although this can be partially addressed by overlaying geometries using transparency, or using the screen blending mode in image processing. Furthermore, encoding the number of users as the size of the dots increases the overplotting of locations, and introduces additional interpretation challenges because of the wide range of the values to represent (more than three orders of magnitude), as well as perceptual biases in symbol size judgment [26].

4.2. Treemap Representation

A treemap is a space-filling visualization [88] of hierarchical structures using a series of nested rectangles. The areas of those rectangles are proportional to an associated numerical value, and each rectangle can be recursively tiled into smaller rectangles corresponding to a sub-level. This representation summarizes the information hierarchically and with an efficient use of space, allowing the detection of patterns present in the data that could otherwise be impossible to identify during exploratory analysis. In contrast, presenting the information as a bubble map can hide patterns present in world-scale data, as discussed above.
The treemapify 2.5.5 R package [89] was used to summarize a global perspective of the visitors’ origins in a compact and intuitive visualization. The representation was split according the spatial granularity of the smaller aggregation unit, corresponding either to the city or the country of origin. To avoid clutter, only the units above a minimum user threshold were represented (1/1000 of its corresponding group), while the rest were consolidated into an overflow category.
The treemap corresponding to the smaller aggregation unit (city) was grouped according to seven geographic scopes around the city (Figure 4): municipal (corresponding to residents), metropolitan (well-connected cities in the functional area of influence), regional (level-2 administrative level), autonomic (level-1 administrative level), national (level-0 administrative level), and the countries that belonged to the Schengen area at the time of data collection. These scopes were chosen to reflect the different costs in the mobility patterns and other limitations in the travel (language, nationality, and border checks).
The results showed that Barcelona residents were the largest group, but their share was relatively small (13% of users). The most populous European and North American cities were well represented, as well as some smaller Catalan localities near Barcelona (Badalona, Sabadell, Terrassa), which were however surpassed in number by travelers from Madrid, suggesting a strong connection between the two largest metropolises in Spain, which are well-connected by air through one of the world’s busiest air shuttle routes and, more recently, through high-speed rail, in operation since 2008.
The treemap corresponding to the larger aggregation unit (country) was grouped according to the seven main regions [90] defined by the World Bank (Figure 5). This classification was designed to help the stakeholders focus their promotion efforts on specific countries, train the staff to handle specific languages, and tailor their offerings to cater to different cultural backgrounds.
This representation summarizes visually that the majority of users resided in European and Central Asia macro-region, which is home to more than 3/4 of the users. Spaniards were expectedly the most represented group, and beyond domestic visitors most users in this region originated from the United Kingdom, followed by Italy, France, Germany, and the Netherlands. In North America, the number of visitors from the United States was comparable in size to the United Kingdom, and Canadian visitors were similar in number to the ones coming from the Netherlands. In contrast, visitor counts from the whole of Latin America were about the same size as visitors from France, despite including Brazil, which accounted for almost the same number of users as Canada or Russia. Asian visitors were comparable in number to Germans. In contrast, visitors from the Middle East and North Africa, South Asia, or Sub-Saharan Africa were almost absent.

5. Spatial Analysis and Visualization

5.1. Picture Locations and User Origins

With these data, users were classified as locals when they belonged to one of the 164 municipalities of the Barcelona Metropolitan Region defined by the Regional Plan of Catalonia [91] and visitors otherwise. The origin of each user was assigned to all her pictures, resulting in 140,566 pictures from locals (17%), 348,197 from visitors (42%), and 345,565 for which the origin was not determined (41%). According to this classification, the number of pictures uploaded by locals and visitors did not show a statistically significant difference between both groups (p < 0.001), using Cohen’s D statistic.
The point locations were classified according to the origin of the corresponding user who shared the picture. The results were visualized in a grid of maps focusing on 12 landmarks of the city at the same scale (Figure 6), overlaid on a grayscale aerial photo. To avoid occlusions, the points were overlaid with the locals drawn on top of the visitors (because the smallest group was less likely to obscure the larger), with the pictures from users of unknown origin behind all of them.
The grid of maps revealed that the spatial distribution of the different user profiles was different across the city, exhibiting a higher intensity of visitors concentrated in the most recognizable landmarks, but also a higher spread around the areas of interest, while the local population tended to disperse more evenly across the city. However, beyond this visual assessment it was necessary to quantify the degree of clustering and dispersion to accurately measure the distinct spatial patterns of the behavior of locals and visitors.

5.2. Point Patterns

The visualization approach of the mosaic of maps of landmarks was very effective in showing the distribution of user origins in detailed scales, in particular when sorting the points in separate layers. However, it could not be applied to the whole area of study because of the differences in point density across the city.
The optimal visualization settings were dependent on both density and scale, because the point size and transparency designed for sparser areas were not suitable for the higher-density clusters, and vice versa. Likewise, increasing the magnification was not adequate for lower-density areas, and while it was adequate for interactive visualizations where users could focus on an area of interest it was not an option for printed output, where the scale and map extent are fixed.
One of the approaches to visualize the full extent of the city was using small multiples [25], separating the map representation into a lattice of equally sized panels with the same resolution and extent, each containing the point pattern for each of the user profiles. In this case, the representation was sensitive to the size of the point symbols, because larger symbols tended to overlap in higher-density areas. This issue was alleviated using small-sized points for the maps and using transparency to identify areas with different degrees of overlap.
The resulting maps (Figure 7) showed that the point pattern corresponding to the local population exhibits a more dispersed pattern, and while it concentrates in the central areas, it is spread more evenly throughout the city. In contrast, the visitors’ behavior is much more spatially clustered around touristic landmarks. The pictures taken by users whose origin could not be determined seem to exhibit a pattern between both extremes, indicating that they might correspond to a mixture of both profiles.
However, these qualitative judgments based on the visual analysis of the maps were adequate for exploratory analysis, but required a more accurate characterization based on the point pattern analysis framework.

5.3. Clustering

Although the spatial distribution of the points can be summarized into a single intensity value, the complexity of planar point patterns requires a different approach. However, purely graphical methods such as Fry plots [92] are not practical to compute for large sample sizes and are challenging to interpret visually. Therefore the majority of these summaries take the form of a function of some metric of the point pattern over a radius [93], generally plotted against the expectation of a Poisson process of the same intensity under the null hypothesis of Complete Spatial Randomness (CSR).
The K(r) function is the cumulative average number of points within a distance r of a typical point. In order to make the comparison of point patterns possible, the estimation is corrected for edge effects and divided by the intensity. The L(r) function is a common transformation of the K(r) function, which transforms the theoretical reference Poisson K-function into a straight line (Figure 8). The estimation was computed using the spatstat 1.64-1 [94] R package for the point patterns corresponding to the pictures taken by locals and visitors, and compared to the pictures without origin data. The corresponding function for the combined point patterns was also computed as a reference.
The results confirmed that the local pictures were more clustered than the rest of the patterns, as they were closer to the expected value under CSR. In contrast, the visitors exhibited a significantly higher clustering at all distances, but mostly higher at smaller ranges (up to 100 m), while the locals were more regular across all distance bands. The pictures from users whose origin could not be determined matched the global clustering pattern almost exactly, confirming the hypothesis that they consisted of a mixture of both profiles.

5.4. Rasterization

Compared to the maps, the graphical summary provided by the L(r) function described the clustering behavior of the different user profiles visually, and was supported by a robust spatial statistics framework. However, it could not provide an accurate representation of the specific areas of the city with a higher intensity of picture-taking activity.
Therefore, it was necessary to develop a map representation to overcome the issues identified in the point maps while keeping the advantages of small multiples. The approach used rasterization, counting the number of pictures in a grid of pixels (quadrat counts), and addressed the main issues in the discretization process [95]:
  • The size of the aggregation unit;
  • The transformation of the aggregated values;
  • The representation of the values using a color scale.
The size of the aggregation unit (pixel) was determined from the spatial distribution of the points themselves, with the objective of optimizing pixel size: grids of very small pixels tend to contain mostly empty cells, while grids of very large pixels do not provide enough spatial resolution at the intended scale of analysis. Therefore, the discretization of points into an infinitely fine grid of pixels would consist of only zeroes and ones, and conversely, a single pixel that covered the whole area of study would consist of a summary of the density without any variance.
The chosen criterion was based on the “empty space” F(r) function, sometimes called “spherical first contact distribution” or the “point-to-nearest-event distribution”. This function computes the probabilities (in the 0 to 1 range) of finding a point within a radius of any fixed reference location (Figure 9).
Because F(r) is defined as a cumulative distribution function, it was possible to define the pixel sizes where a certain fraction of the cells would have a point count different than zero, for each profile. These thresholds are shown as points on their respective curves, and their cutoffs as well as their corresponding pixel sizes (Table 2) were used to define a pixel size common for all maps. A pixel size of 25 × 25 m for the maps was chosen as a compromise value for all point patterns (shown as a vertical line in the figure).
The pixel values were transformed to convert the counts into an attractiveness metric [96], which consisted of a multiplier in relation to a baseline, defined as a hypothetical uniform distribution of pictures. This metric was invariant by construction to both the pixel size and the number of pixels in the observation window. The resulting values were log-transformed to fit the computed magnitudes into the dynamic range of the color scale of the map, where some locations have a picture interest more than a thousand times larger than the reference baseline.
The numeric values of the cells were visualized in a perceptual uniform color scale (Parula). The results showed a very detailed spatial distribution of events (Figure 10), but without the issues discussed in the plot of individual points [97]. The maps visually confirm the clustering of visitors in the major landmarks corresponding to the most famous buildings of the architect Antoni Gaudí, as well as their concentration around a few streets with a marked touristic character. In contrast, the panel corresponding to the locals confirms a more dispersed pattern around the central areas of the city but also reveals a high number of pictures taken by residents on the main landmarks of the city, indicative that the photographic attractiveness of these hotspots is aligned with the interests of both locals and the tourists, and that locals behave like tourists in these locations.

5.5. Kernel Density Estimation

The rasterized maps discussed above are useful to identify the landmarks in the city with the highest picture interest, but cannot produce the required delimitation of the areas of outstanding intensity. This delimitation is crucial to designate the areas where it is necessary to prioritize the implementation of policies that modulate tourist pressure, and requires a methodology that produces regions that can be delimited accurately, as a complement for maps that highlight specific landmarks.
The kernel density estimation (KDE) computes non-parametrically the intensity function of a point process [98] as a density map, and is suitable if this process is suspected to be inhomogeneous (Figure 8 and Figure 9). Although the result is a raster composed of small-sized pixels, the underlying estimated intensity function can theoretically be discretized into an arbitrarily fine grid of pixels.
The point patterns were smoothed with a gaussian kernel, with a standard deviation (sigma) of 250 m, and therefore there was 62.8% of the mass around each point in a 250 m radius, and 95.4% in a 512 m radius. The values were chosen following the L(r) function (Figure 8), considering the radius where the curves approximately began to increase monotonically. This method was also effective in smoothing small inaccuracies in the coordinates of the points.
The KDE results were computed in a grid with the same 25 × 25 m resolution as the rasterized maps and classified in quantiles for each of the resulting pixel values. The classification was diverging and symmetric around the median, and defined a neutral (non-significant) area corresponding to the interquartile range, as well as two extremes to identify cold and hot spots (lower and upper quartiles, respectively). These cold and hot areas were classified into three intensities, with the extreme values corresponding to the lower and upper 1% (Figure 11).
The results clearly showed the hot spots of the visitors’ activity, with had much more pronounced areas of interest around the landmarks in the city, compared to the local population. In particular, it was possible to identify the areas around two Gaudí landmarks (Sagrada Família and Park Güell), which in contrast did not appear as points of interest for the local population. The old quarter (Ciutat Vella) was identified as very popular area for both tourists and visitors, but in the case of tourists the area of interest was more compact, but extended through the axis of Passeig de Gràcia through another Gaudí landmark (La Pedrera). In addition, this approach was also capable of identifying and outline the “deserts”, or areas devoid of tourist activity.

6. Discussion

Although digital traces provide an unprecedented opportunity for data-driven research on tourist activity in urban settings, it faces significant challenges regarding collection, processing, modeling, and visualization. This paper describes all the steps in this process in a coherent methodology, but focuses on the visual aspects which are oftentimes neglected but are the critical last step of the data analysis process [99] to communicate the results to the stakeholders effectively.
Multiple complementary visualization approaches were discussed, which are suitable for different stages of the analysis, from the exploratory phase where hypotheses are formulated, to the delimitation of the areas of outstanding visitor pressure. It also approaches the analysis using a multi-scale approach, from the global to the detail scales, aligned with the workflow requirements of the urban planning practice.
The presented methodology is firmly grounded in a well-established spatial statistics framework, adapted to a “big data” environment, to extract knowledge from social media, condensing loosely structured data into effective and interpretable visualizations, following the principles of the grammar of graphics [100] in a reproducible computational environment.
The results reveal the distinct digital traces of locals and tourists, adapting the visualization strategy to the specific requirements of multiple scales and resolutions of analysis. Despite the complexity of the raw data, the resulting maps condense social media activity into informative representations that reduce the cognitive load of the intended audience to interpret the results.
The discussed methodology addresses the main research question regarding the suitability of social media data as an emerging source to investigate tourism behavior from a sustainable urban management perspective. The methodology was designed to be generalized to other areas of study and provides a substantial advantage over other surveying methods in terms of economy, feasibility, and accuracy. In particular, identifying the geographic origin of individuals is not possible with camera-based tracking, and obtaining data on the behavior of a large number of participants is not practical at the urban scale.
Despite these advantages, the collected data can be considered a convenience sample, and WEIRD (western, educated, industrialized, rich, and democratic) subjects can be overrepresented and therefore introduce biases in the results [101] because of differences in Flickr usage across regions. Despite this limitation, this paper assumes that visitor behavior can be considered independent of cultural background, but corrective measures such as stratified sampling should be implemented otherwise in future research.
Another limitation are the privacy concerns; while the research preserved the privacy of all users at all times, it was obvious that the possibility of tracking the behavior of a single individual was trivial with the publicly available data collected. The increasing sensitivity of users on this topic can prevent bona fide further research if these data becomes unavailable.
The secondary research question focused on the data processing required to graphically summarize the unstructured data collected from social media into informative representations adapted to the requirements of the practice of urban planners and designers. The research indicates that there is not a single strategy that can be applied in all situations, but on the contrary the scale of analysis and intended application dictate the most effective visualization approach. For this reason, the paper presents different methods to address the issues identified in this case study.
Despite the contributions presented to the sustainable tourism management field, there are many open avenues of investigation that can be pursued in further research, such as the application to other cases of study or the inclusion of the temporal variable in addition to the spatial location, which would result in significant contributions to the field and provide a more comprehensive understanding of the visitor behavior in urban settings.

Author Contributions

Conceptualization, F.V. and J.R.; methodology, F.V.; software, F.V.; validation, F.V. and J.R.; formal analysis, F.V.; investigation, F.V.; resources, F.V.; data curation, F.V.; writing—original draft preparation, F.V.; writing—review and editing, F.V. and J.R.; visualization, F.V.; supervision, F.V. and J.R.; project administration, F.V. and J.R.; funding acquisition, F.V. and J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Informed Consent Statement

Informed consent was waived because data was publicly available, and the anonymity of subjects was preserved at all times.

Data Availability Statement

Data not available due to privacy restrictions.

Acknowledgments

The authors thank the three anonymous reviewers for their time to critically read the manuscript and for their valuable comments that helped us improve and clarify the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. UN-Habitat. State of the World’s Cities 2010/2011—Cities for All: Bridging the Urban Divide; State of the World’s Cities; Earthscan: London, UK, 2010; ISBN 978-1-84971-175-3. [Google Scholar]
  2. Dovey, K.; Pafka, E. The Science of Urban Design? Urban Des. Int. 2015, 21, 1–10. [Google Scholar] [CrossRef]
  3. Bettencourt, L.M.A.; Lobo, J.; Helbing, D.; Kuhnert, C.; West, G.B. Growth, Innovation, Scaling, and the Pace of Life in Cities. Proc. Natl. Acad. Sci. USA 2007, 104, 7301–7306. [Google Scholar] [CrossRef] [Green Version]
  4. World Tourism Organization. “Overtourism”? Understanding and Managing Urban Tourism Growth beyond Perceptions; Centre of Expertise Leisure, Tourism & Hospitality, NHTV Breda University of Applied Science, NHL Stenden University of Applied Sciences, Eds.; World Tourism Organization (UNWTO): Madrid, Spain, 2018; ISBN 978-92-844-2007-0. [Google Scholar]
  5. Cárdenas-García, P.J.; Sánchez-Rivero, M.; Pulido-Fernández, J.I. Does Tourism Growth Influence Economic Development? J. Travel Res. 2015, 54, 206–221. [Google Scholar] [CrossRef]
  6. González-Pérez, J.M. The Dispute over Tourist Cities. Tourism Gentrification in the Historic Centre of Palma (Majorca, Spain). Tour. Geogr. 2019, 1–21. [Google Scholar] [CrossRef]
  7. Popp, M. Positive and Negative Urban Tourist Crowding: Florence, Italy. Tour. Geogr. 2012, 14, 50–72. [Google Scholar] [CrossRef]
  8. Jia, H.; Appolloni, A.; Wang, Y. Green Travel: Exploring the Characteristics and Behavior Transformation of Urban Residents in China. Sustainability 2017, 9, 1043. [Google Scholar] [CrossRef] [Green Version]
  9. Koens, K.; Postma, A.; Papp, B. Is Overtourism Overused? Understanding the Impact of Tourism in a City Context. Sustainability 2018, 10, 4384. [Google Scholar] [CrossRef] [Green Version]
  10. García-Hernández, M.; De la Calle-Vaquero, M.; Yubero, C. Cultural Heritage and Urban Tourism: Historic City Centres under Pressure. Sustainability 2017, 9, 1346. [Google Scholar] [CrossRef] [Green Version]
  11. Xu, F.; Hu, M.; La, L.; Wang, J.; Huang, C. The Influence of Neighbourhood Environment on Airbnb: A Geographically Weighed Regression Analysis. Tour. Geogr. 2019, 22, 1–18. [Google Scholar] [CrossRef]
  12. Sánchez-Galiano, J.-C.; Martí-Ciriquián, P.; Fernández-Aracil, P. Temporary Population Estimates of Mass Tourism Destinations: The Case of Benidorm. Tour. Manag. 2017, 62, 234–240. [Google Scholar] [CrossRef]
  13. O’Reilly, A.M. Tourism Carrying Capacity: Concept and Issues. Tour. Manag. 1986, 7, 254–258. [Google Scholar] [CrossRef]
  14. Miller, H.J. The Data Avalanche Is Here. Shouldn’t We Be Digging? J. Reg. Sci. 2010, 50, 181–201. [Google Scholar] [CrossRef]
  15. Lynch, K. The Image of the City; The MIT Press: Cambridge, MA, USA, 1960; ISBN 0-262-62001-4. [Google Scholar]
  16. Cullen, G. The Concise Townscape; Architectural Press: London, UK, 1961; ISBN 978-0-7506-2018-5. [Google Scholar]
  17. Decoding the City: Urbanism in the Age of Big Data, 1st ed; Offenhuber, D.; Ratti, C. (Eds.) Birkhauser Verlag AG: Basel, Switzerland, 2014; ISBN 978-3-03821-597-4. [Google Scholar]
  18. Batty, M. Does Big Data Lead to Smarter Cities? Problems, Pitfalls and Opportunities. ISJLP 2015, 11, 127–525. [Google Scholar]
  19. Higham, D.J.; Batty, M.; Bettencourt, L.M.A.; Greetham, D.V.; Grindrod, P. An Overview of City Analytics. R. Soc. Open Sci. 2017, 4, 161063. [Google Scholar] [CrossRef] [PubMed]
  20. Valls, F.; Garcia-Almirall, P.; Redondo, E.; Fonseca, D. From Raw Data to Meaningful Information: A Representational Approach to Cadastral Databases in Relation to Urban Planning. Future Internet 2014, 6, 612–639. [Google Scholar] [CrossRef] [Green Version]
  21. Serra-Coch, G.; Chastel, C.; Campos, S.; Coch, H. Graphical Approach to Assess Urban Quality: Mapping Walkability Based on the TOD-Standard. Cities 2018, 76, 58–71. [Google Scholar] [CrossRef] [Green Version]
  22. Miller, H.J.; Goodchild, M.F. Data-Driven Geography. GeoJournal 2015, 80, 449–461. [Google Scholar] [CrossRef]
  23. Mcardle, G.; Kitchin, R. Improving the Veracity of Open and Real-Time Urban Data. Built Environ. 2016, 42, 457–473. [Google Scholar] [CrossRef]
  24. Donaire, J.A.; Galí, N. La imagen turística de Barcelona en la comunidad de flickr. Cuad Tur. 2011, 291–303. [Google Scholar]
  25. Tufte, E.R. Envisioning Information; Graphics Press: Cheshire, CT, USA, 1990; ISBN 0-9613921-1-8. [Google Scholar]
  26. Wilkinson, L.; Wills, G. The Grammar of Graphics; Springer: New York, NY, USA, 2005; ISBN 0-387-24544-8. [Google Scholar]
  27. Peng, R.D. Reproducible Research in Computational Science. Science 2011, 334, 1226. [Google Scholar] [CrossRef] [Green Version]
  28. Pushkarev, B.S. Urban Space for Pedestrians: A Quantitative Approach, 1st ed.; The MIT Press: Cambridge, MA, USA, 1976; ISBN 0-262-16063-3. [Google Scholar]
  29. Whyte, W.H. The Social Life of Small Urban Spaces; Project for Public Spaces: New York, NY, USA, 1980; ISBN 978-0-9706324-1-8. [Google Scholar]
  30. Urbanism on Track: Application of Tracking Technologies in Urbanism; Schaick, J.; van Spek, S.C. (Eds.) Research in Urbanism Series; IOS Press: Amsterdam, The Netherlands, 2008; ISBN 978-1-58603-817-5. [Google Scholar]
  31. Gehl, J.; Svarre, B. How to Study Public Life, 2nd ed.; Island Press: Washington, DC, USA, 2013; ISBN 978-1-61091-423-9. [Google Scholar]
  32. Zhong, C.; Schläpfer, M.; Arisona, S.M.; Batty, M.; Ratti, C.; Schmitt, G. Revealing Centrality in the Spatial Structure of Cities from Human Activity Patterns. Urban Stud. 2015, 0042098015601599. [Google Scholar] [CrossRef] [Green Version]
  33. Willis, A.; Gjersoe, N.; Havard, C.; Kerridge, J.; Kukla, R. Human Movement Behaviour in Urban Spaces: Implications for the Design and Modelling of Effective Pedestrian Environments. Environ. Plan. B Plan. Des. 2004, 31, 805–828. [Google Scholar] [CrossRef] [Green Version]
  34. Penn, A. The Complexity of the Elementary Interface: Shopping Space. In Proceedings of the 5th International Space Syntax Symposium, Delft, The Netherlands, 13–17 June 2005; van Nes, A., Ed.; Volume 1, pp. 25–42. [Google Scholar]
  35. Aggarwal, J.K.; Ryoo, M.S. Human Activity Analysis: A Review. ACM Comput. Surv. 2011, 43, 16:1–16:43. [Google Scholar] [CrossRef]
  36. Nielsen, S.Z.; Gade, R.; Moeslund, T.B.; Skov-Petersen, H. Taking the Temperature of Pedestrian Movement in Public Spaces. Transp. Res. Procedia 2014, 2, 660–668. [Google Scholar] [CrossRef] [Green Version]
  37. Seer, S.; Brändle, N.; Ratti, C. Kinects and Human Kinetics: A New Approach for Studying Pedestrian Behavior. Transp. Res. Part C Emerg. Technol. 2014, 48, 212–228. [Google Scholar] [CrossRef]
  38. Shoval, N.; Kwan, M.-P.; Reinau, K.H.; Harder, H. The Shoemaker’s Son Always Goes Barefoot: Implementations of GPS and Other Tracking Technologies for Geographic Research. Geoforum 2014, 51, 1–5. [Google Scholar] [CrossRef]
  39. Van Der Spek, S. Spatial Metro—Tracking Pedestrians in Historic City Centres. Res. Urban. Ser. 2008, 1, 77–97. [Google Scholar] [CrossRef]
  40. Bauder, M.; Freytag, T. Visitor Mobility in the City and the Effects of Travel Preparation. Tour. Geogr. 2015, 17, 682–700. [Google Scholar] [CrossRef]
  41. Scott, N.; Zhang, R.; Le, D.; Moyle, B. A Review of Eye-Tracking Research in Tourism. Curr. Issues Tour. 2019, 22, 1244–1261. [Google Scholar] [CrossRef]
  42. Kuliga, S.F.; Thrash, T.; Dalton, R.C.; Hölscher, C. Virtual Reality as an Empirical Research Tool—Exploring User Experience in a Real Building and a Corresponding Virtual Model. Comput. Environ. Urban Syst. 2015, 54, 363–375. [Google Scholar] [CrossRef]
  43. Back, A.; Marjavaara, R. Mapping an Invisible Population: The Uneven Geography of Second-Home Tourism. Tour. Geogr. 2017, 19, 595–611. [Google Scholar] [CrossRef]
  44. Reades, J.; Zhong, C.; Manley, E.; Milton, R.; Batty, M. Finding Pearls in London’s Oysters. Built Environ. 2016, 42, 365–381. [Google Scholar] [CrossRef] [Green Version]
  45. González, M.C.; Hidalgo, C.A.; Barabási, A.-L. Understanding Individual Human Mobility Patterns. Nature 2008, 453, 779–782. [Google Scholar] [CrossRef]
  46. Reades, J.; Calabrese, F.; Sevtsuk, A.; Ratti, C. Cellular Census: Explorations in Urban Data Collection. IEEE Pervasive Comput. 2007, 6, 30–38. [Google Scholar] [CrossRef] [Green Version]
  47. Ratti, C.; Frenchman, D.; Pulselli, R.M.; Williams, S. Mobile Landscapes: Using Location Data from Cell Phones for Urban Analysis. Environ. Plann B Plann Des. 2006, 33, 727–748. [Google Scholar] [CrossRef]
  48. Kloeckl, K.; Senn, O.; Ratti, C. Enabling the Real-Time City: LIVE Singapore! J. Urban Technol. 2012, 19, 89–112. [Google Scholar] [CrossRef]
  49. Ahas, R.; Mark, Ü. Location Based Services—New Challenges for Planning and Public Administration? Futures 2005, 37, 547–561. [Google Scholar] [CrossRef]
  50. Ratti, C.; Sobolevsky, S.; Calabrese, F.; Andris, C.; Reades, J.; Martino, M.; Claxton, R.; Strogatz, S.H. Redrawing the Map of Great Britain from a Network of Human Interactions. PLoS ONE 2010, 5, e14248. [Google Scholar] [CrossRef] [Green Version]
  51. Shoval, N.; Ahas, R. The Use of Tracking Technologies in Tourism Research: The First Decade. Tour. Geogr. 2016, 18, 587–606. [Google Scholar] [CrossRef]
  52. Caldeira, A.M.; Kastenholz, E. Spatiotemporal Tourist Behaviour in Urban Destinations: A Framework of Analysis. Tour. Geogr. 2019. [Google Scholar] [CrossRef]
  53. Wood, S.A.; Guerry, A.D.; Silver, J.M.; Lacayo, M. Using Social Media to Quantify Nature-Based Tourism and Recreation. Sci. Rep. 2013, 3. [Google Scholar] [CrossRef] [PubMed]
  54. Jin, C.; Xu, J. Using User-Generated Content Data to Analyze Tourist Mobility between Hotels and Attractions in Cities. Environ. Plan. B Urban Anal. City Sci. 2020, 47, 826–840. [Google Scholar] [CrossRef]
  55. Lian, T.; Yu, C. Impacts of Online Images of a Tourist Destination on Tourist Travel Decision. Tour. Geogr. 2019, 1–30. [Google Scholar] [CrossRef]
  56. Carbon, C.-C. Universal Principles of Depicting Oneself across the Centuries: From Renaissance Self-Portraits to Selfie-Photographs. Front. Psychol. 2017, 8. [Google Scholar] [CrossRef] [Green Version]
  57. Patil, S.; Norcie, G.; Kapadia, A.; Lee, A. “Check out Where I Am!”: Location-Sharing Motivations, Preferences, and Practices. In Proceedings of the CHI ’12 Extended Abstracts on Human Factors in Computing Systems; ACM: New York, NY, USA, 2012; pp. 1997–2002. [Google Scholar]
  58. Donaire, J.A.; Camprubí, R.; Galí, N. Tourist Clusters from Flickr Travel Photography. Tour. Manag. Perspect. 2014, 11, 26–33. [Google Scholar] [CrossRef] [Green Version]
  59. Shoval, N. Sensing Tourists: Geoinformatics and the Future of Tourism Geography Research. Tour. Geogr. 2018, 20, 910–912. [Google Scholar] [CrossRef]
  60. Shao, H.; Zhang, Y.; Li, W. Extraction and Analysis of City’s Tourism Districts Based on Social Media Data. Comput. Environ. Urban Syst. 2017, 65, 66–78. [Google Scholar] [CrossRef]
  61. Bae, S.H.; Yun, H.J. Spatiotemporal Distribution of Visitors’ Geotagged Landscape Photos in Rural Areas. Tour. Plan. Dev. 2017, 14, 167–180. [Google Scholar] [CrossRef]
  62. Hawelka, B.; Sitko, I.; Beinat, E.; Sobolevsky, S.; Kazakopoulos, P.; Ratti, C. Geo-Located Twitter as Proxy for Global Mobility Patterns. Cartogr. Geogr. Inf. Sci. 2014, 41, 260–271. [Google Scholar] [CrossRef] [Green Version]
  63. Leung, D.; Law, R.; van Hoof, H.; Buhalis, D. Social Media in Tourism and Hospitality: A Literature Review. J. Travel Tour. Mark. 2013, 30, 3–22. [Google Scholar] [CrossRef]
  64. Andris, C. Integrating Social Network Data into GISystems. Int. J. Geogr. Inf. Sci. 2016, 30, 2009–2031. [Google Scholar] [CrossRef] [Green Version]
  65. Stock, K. Mining Location from Social Media: A Systematic Review. Comput. Environ. Urban Syst. 2018, 71, 209–240. [Google Scholar] [CrossRef]
  66. Zhang, J. Big Data and Tourism Geographies—An Emerging Paradigm for Future Study? Tour. Geogr. 2018, 20, 899–904. [Google Scholar] [CrossRef] [Green Version]
  67. Vu, H.Q.; Li, G.; Law, R.; Zhang, Y. Tourist Activity Analysis by Leveraging Mobile Social Media Data. J. Travel Res. 2018, 57, 883–898. [Google Scholar] [CrossRef] [Green Version]
  68. Hamstead, Z.A.; Fisher, D.; Ilieva, R.T.; Wood, S.A.; McPhearson, T.; Kremer, P. Geolocated Social Media as a Rapid Indicator of Park Visitation and Equitable Park Access. Comput. Environ. Urban Syst. 2018, 72, 38–50. [Google Scholar] [CrossRef]
  69. Hauff, C. A Study on the Accuracy of Flickr’s Geotag Data. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval; ACM: New York, NY, USA, 2013; pp. 1037–1040. [Google Scholar]
  70. Salas-Olmedo, M.H.; Moya-Gómez, B.; García-Palomares, J.C.; Gutiérrez, J. Tourists’ Digital Footprint in Cities: Comparing Big Data Sources. Tour. Manag. 2018, 66, 13–25. [Google Scholar] [CrossRef] [Green Version]
  71. Cócola Gant, A.; Palou Rubio, S. Tourism Promotion and Urban Space in Barcelona. Historic Perspective and Critical Review, 1900–1936. Doc. D’anàlisi Geogràfica 2015, 61, 461–482. [Google Scholar] [CrossRef] [Green Version]
  72. Balaguer, J.; Cantavella-Jordá, M. Tourism as a Long-Run Economic Growth Factor: The Spanish Case. Appl. Econ. 2002, 34, 877–884. [Google Scholar] [CrossRef] [Green Version]
  73. Tamajón, L.A.G.; Valiente, G.C. Cruise Tourism in Barcelona: From Marginality to International Leadership. Bol. Asoc. Geogr. Esp. 2012, 253–271+483–488. [Google Scholar]
  74. Herstein, R.; Berger, R. Hosting the Olympics: A City’s Make-or-Break Impression. J. Bus. Strategy 2013, 34, 54–59. [Google Scholar] [CrossRef]
  75. Hedrick-Wong, Y.; Choong, D. Global Destination Cities Index 2016; Mastercard: New York, NY, USA, 2016. [Google Scholar]
  76. Geerts, W. Top 100 City Destinations Ranking; Euromonitor International: London, UK, 2017. [Google Scholar]
  77. Paldino, S.; Bojic, I.; Sobolevsky, S.; Ratti, C.; González, M.C. Urban Magnetism through the Lens of Geo-Tagged Photography. Epj Data Sci. 2015, 4, 5. [Google Scholar] [CrossRef] [Green Version]
  78. Dovey, K.; Pafka, E. What Is Functional Mix? An Assemblage Approach. Plan. Theory Pract. 2017, 1–19. [Google Scholar] [CrossRef] [Green Version]
  79. Louf, R.; Barthelemy, M. A Typology of Street Patterns. J. R. Soc. Interface 2014, 11, 20140924. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  80. Porta, S.; Latora, V.; Wang, F.; Rueda, S.; Strano, E.; Scellato, S.; Cardillo, A.; Belli, E.; Càrdenas, F.; Cormenzana, B.; et al. Street Centrality and the Location of Economic Activities in Barcelona. Urban Stud 2012, 49, 1471–1488. [Google Scholar] [CrossRef] [Green Version]
  81. Mansilla, J.A.; Milano, C. Becoming Centre: Tourism Placemaking and Space Production in Two Neighborhoods in Barcelona. Tour. Geogr. 2019. [Google Scholar] [CrossRef]
  82. R Core Team. R. A Language and Environment for Statistical Computing; R Core Team R: Vienna, Austria, 1993. [Google Scholar]
  83. Knudsen, T.; Evers, K. Transformation Pipelines for PROJ.4. In Proceedings of the FIG Working Week 2017 Proceedings, Helsinki, Finland, 29 May–2 June 2017; Volume 19, p. 8050. [Google Scholar]
  84. Huang, Q.; Wong, D.W.S. Activity Patterns, Socioeconomic Status and Urban Spatial Structure: What Can Social Media Data Tell Us? Int. J. Geogr. Inf. Sci. 2016, 30, 1873–1898. [Google Scholar] [CrossRef]
  85. Chua, A.; Servillo, L.; Marcheggiani, E.; Moere, A.V. Mapping Cilento: Using Geotagged Social Media Data to Characterize Tourist Flows in Southern Italy. Tour. Manag. 2016, 57, 295–310. [Google Scholar] [CrossRef]
  86. Straumann, R.K.; Çöltekin, A.; Andrienko, G. Towards (Re)Constructing Narratives from Georeferenced Photographs through Visual Analytics. Cartogr. J. 2014, 51, 152–165. [Google Scholar] [CrossRef] [Green Version]
  87. Cohen, J. Statistical Power Analysis for the Behavioral Sciences, 2nd ed.; L. Erlbaum Associates: Hillsdale, NJ, USA, 1988; ISBN 978-0-8058-0283-2. [Google Scholar]
  88. Wickham, H.; Hofmann, H. Product Plots. IEEE Trans. Vis. Comput. Graph. 2011, 17, 2223–2230. [Google Scholar] [CrossRef] [PubMed]
  89. Wilkins, D. Treemapify: Draw Treemaps in “Ggplot2”. Adelaide, Australia. 2017. Available online: https://github.com/wilkox/treemapify/ (accessed on 8 January 2021).
  90. Kelso, N.V.; Patterson, T. Natural Earth Vector. Cartogr. Perspect. 2009, 45–50. [Google Scholar] [CrossRef]
  91. Departament de la Presidència. Pla Territorial General de Catalunya; Departament de la Presidència: Barcelona, Spain, 1995; Volume 2032, pp. 2621–2623.
  92. Fry, N. Random Point Distributions and Strain Measurement in Rocks. Tectonophysics 1979, 60, 89–105. [Google Scholar] [CrossRef]
  93. Baddeley, A.; Rubak, E.; Turner, R. Spatial Point Patterns: Methodology and Applications with R; CRC Press: Boca Raton, FL, USA, 2016; ISBN 978-1-4822-1020-0. [Google Scholar]
  94. Baddeley, A.; Turner, R. Spatstat: An R Package for Analyzing Spatial Point Patterns. J. Stat. Softw. 2005, 12. [Google Scholar] [CrossRef] [Green Version]
  95. Valls, F. Digital Traces and Urban Research: Barcelona Through Social Media Data. Ph.D. Thesis, Universitat Politècnica de Catalunya, Barcelona, Spain, 2019. [Google Scholar]
  96. Valls, F.; Roca, J. Identificación de Puntos de Interés Turístico a Través de Datos de Redes Sociales. In Proceedings of the XIII CTV 2019 Proceedings: XIII International Conference on Virtual City and Territory: “Challenges and Paradigms of the Contemporary City”, Barcelona, Spain, 2 October 2019; pp. 1–17. [Google Scholar]
  97. Bednar, J. Big Data Visualization with Datashader; Anaconda, Inc.: Austin, TX, USA, 2017; p. 12. [Google Scholar]
  98. Silverman, B.W. Density Estimation for Statistics and Data Analysis; Monographs on Statistics and Applied Probability; Chapman & Hall: London, UK; CRC: New York, NY, USA, 1986; ISBN 978-0-412-24620-3. [Google Scholar]
  99. Wickham, H.; Grolemund, G. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data, 1st ed.; O’Reilly Media: Sebastopol, CA, USA, 2016; ISBN 978-1-4919-1039-9. [Google Scholar]
  100. Wickham, H. A Layered Grammar of Graphics. J. Comput. Graph. Stat. 2010, 19, 3–28. [Google Scholar] [CrossRef]
  101. Henrich, J.; Heine, S.J.; Norenzayan, A. The Weirdest People in the World? Behav. Brain Sci. 2010, 33, 61–83. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Point cloud corresponding to the 1,166,704 unique geotagged pictures (in white) collected using the Flickr API in the WGS 84/Pseudo-Mercator—Spherical Mercator (EPSG:3857) projected coordinate system. Base map tiles by Carto, under CC BY 3.0. Data by OpenStreetMap, under Open Database License (ODbL).
Figure 1. Point cloud corresponding to the 1,166,704 unique geotagged pictures (in white) collected using the Flickr API in the WGS 84/Pseudo-Mercator—Spherical Mercator (EPSG:3857) projected coordinate system. Base map tiles by Carto, under CC BY 3.0. Data by OpenStreetMap, under Open Database License (ODbL).
Sustainability 13 03159 g001
Figure 2. Number of Flickr users who uploaded a geotagged picture of Barcelona according to the time zone they reside in (a) and the proportion of users that provide location data in their profile (b). Map projection uses the North Pole Lambert Azimuthal Equal Area (ESRI:102017). Time zone cartography from Natural Earth, in the public domain.
Figure 2. Number of Flickr users who uploaded a geotagged picture of Barcelona according to the time zone they reside in (a) and the proportion of users that provide location data in their profile (b). Map projection uses the North Pole Lambert Azimuthal Equal Area (ESRI:102017). Time zone cartography from Natural Earth, in the public domain.
Sustainability 13 03159 g002
Figure 3. Origins of the Flickr users with geotagged pictures of Barcelona. Dot size is proportional to the number of users, and symbols are overlaid using the QGIS software screen blend mode. Maps use the Plate Carrée Equirectangular projection (EPSG:4326/EPSG:32663) and the ETRS Lambert Azimuthal Equal-Area projection (EPSG:3035), respectively. Base cartography from Natural Earth, in the public domain.
Figure 3. Origins of the Flickr users with geotagged pictures of Barcelona. Dot size is proportional to the number of users, and symbols are overlaid using the QGIS software screen blend mode. Maps use the Plate Carrée Equirectangular projection (EPSG:4326/EPSG:32663) and the ETRS Lambert Azimuthal Equal-Area projection (EPSG:3035), respectively. Base cartography from Natural Earth, in the public domain.
Sustainability 13 03159 g003
Figure 4. Treemap of the user’s cities of residence, grouped and colored according to their geographic scope around the city of Barcelona, reflecting mobility patterns and travel limitations.
Figure 4. Treemap of the user’s cities of residence, grouped and colored according to their geographic scope around the city of Barcelona, reflecting mobility patterns and travel limitations.
Sustainability 13 03159 g004
Figure 5. Treemap of the user’s country of residence, grouped and colored according to their main region according to the World Bank classification.
Figure 5. Treemap of the user’s country of residence, grouped and colored according to their main region according to the World Bank classification.
Sustainability 13 03159 g005
Figure 6. Picture locations according to the origin of the user who uploaded it to the Flickr service in 12 landmarks in Barcelona. All grids are at the same scale and measure 500 m across. Locations are rendered in the following order from top to bottom: locals (blue), visitors (red), and undetermined (green). Base cartography by Institut Cartogràfic i Geològic de Catalunya (ICGC), under CC BY 4.0.
Figure 6. Picture locations according to the origin of the user who uploaded it to the Flickr service in 12 landmarks in Barcelona. All grids are at the same scale and measure 500 m across. Locations are rendered in the following order from top to bottom: locals (blue), visitors (red), and undetermined (green). Base cartography by Institut Cartogràfic i Geològic de Catalunya (ICGC), under CC BY 4.0.
Sustainability 13 03159 g006
Figure 7. Small multiple maps of the picture locations taken by locals, visitors, and users with unknown origin. Points are rendered with 10% opacity. Maps use the WGS 84/Pseudo-Mercator—Scheme 3857. projected coordinate system. Base map tiles by Stamen Design, under CC BY 3.0, using data from OpenStreetMap under ODbL. City limits cartography from CartoBCN, under CC BY 3.0.
Figure 7. Small multiple maps of the picture locations taken by locals, visitors, and users with unknown origin. Points are rendered with 10% opacity. Maps use the WGS 84/Pseudo-Mercator—Scheme 3857. projected coordinate system. Base map tiles by Stamen Design, under CC BY 3.0, using data from OpenStreetMap under ODbL. City limits cartography from CartoBCN, under CC BY 3.0.
Sustainability 13 03159 g007
Figure 8. Comparison of the summary L(r) functions of the point patterns of pictures from locals, visitors, and users with undetermined origin. The function for the combined point pattern is shown as a reference with a black dotted line. The expected value under CSR is shown as a solid gray line, and the approximate radius where the curves begin to increase monotonically is shown as a faint vertical dashed line.
Figure 8. Comparison of the summary L(r) functions of the point patterns of pictures from locals, visitors, and users with undetermined origin. The function for the combined point pattern is shown as a reference with a black dotted line. The expected value under CSR is shown as a solid gray line, and the approximate radius where the curves begin to increase monotonically is shown as a faint vertical dashed line.
Sustainability 13 03159 g008
Figure 9. Comparison of the summary F(r) functions of the point patterns of pictures from locals, visitors, and users with undetermined origin. The function for the combined point pattern is shown as a reference with a black dotted line. The evaluated thresholds are shown as points on their curve. The consensus 25 × 25 m pixel size is shown as a dashed vertical gray line.
Figure 9. Comparison of the summary F(r) functions of the point patterns of pictures from locals, visitors, and users with undetermined origin. The function for the combined point pattern is shown as a reference with a black dotted line. The evaluated thresholds are shown as points on their curve. The consensus 25 × 25 m pixel size is shown as a dashed vertical gray line.
Sustainability 13 03159 g009
Figure 10. Spatial distribution of the relative attractiveness of the urban spaces in Barcelona, according to the different user profiles. Map uses the ETRS89/UTM zone 31N (EPSG:25831) projection. City limits cartography from CartoBCN, under CC BY 3.0.
Figure 10. Spatial distribution of the relative attractiveness of the urban spaces in Barcelona, according to the different user profiles. Map uses the ETRS89/UTM zone 31N (EPSG:25831) projection. City limits cartography from CartoBCN, under CC BY 3.0.
Sustainability 13 03159 g010
Figure 11. Classified results of the Kernel Density Estimation, according to the different user proFigure 89./UTM zone 31N (EPSG:25831) projection. City limits cartography from CartoBCN, under CC BY 3.0.
Figure 11. Classified results of the Kernel Density Estimation, according to the different user proFigure 89./UTM zone 31N (EPSG:25831) projection. City limits cartography from CartoBCN, under CC BY 3.0.
Sustainability 13 03159 g011
Table 1. Flickr user’s location-related data available using the discussed methods. Sets overlap partially and therefore combined data collected from the flickr.profile.getProfile and flickr.people.getInfo methods yield location data for 50% of the users.
Table 1. Flickr user’s location-related data available using the discussed methods. Sets overlap partially and therefore combined data collected from the flickr.profile.getProfile and flickr.people.getInfo methods yield location data for 50% of the users.
Field NameTypeMethodValid% Vaild
locationplaceflickr.people.getInfo14,72043%
timezonemetadataflickr.people.getInfo24,68772%
profileurllinkflickr.people.getInfo34,279100%
hometownplaceflickr.profile.getProfile10,45630%
cityplaceflickr.profile.getProfile12,96938%
countryplaceflickr.profile.getProfile13,30239%
Table 2. Fraction of pixels with at least one location and three evaluated cutoffs for the threshold for each point pattern.
Table 2. Fraction of pixels with at least one location and three evaluated cutoffs for the threshold for each point pattern.
Pixel Size (per Fraction of Cells with Data)
Point PatternPixels with Data1/31/22/3
Locals25%33 m55 m91 m
Visitors25%35 m63 m108 m
Unknown33%26 m43 m71 m
Global44%18 m30 m51 m
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Valls, F.; Roca, J. Visualizing Digital Traces for Sustainable Urban Management: Mapping Tourism Activity on the Virtual Public Space. Sustainability 2021, 13, 3159. https://doi.org/10.3390/su13063159

AMA Style

Valls F, Roca J. Visualizing Digital Traces for Sustainable Urban Management: Mapping Tourism Activity on the Virtual Public Space. Sustainability. 2021; 13(6):3159. https://doi.org/10.3390/su13063159

Chicago/Turabian Style

Valls, Francesc, and Josep Roca. 2021. "Visualizing Digital Traces for Sustainable Urban Management: Mapping Tourism Activity on the Virtual Public Space" Sustainability 13, no. 6: 3159. https://doi.org/10.3390/su13063159

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop