Understanding Human Activities in Response to Typhoon Hato from Multi-Source Geospatial Big Data: A Case Study in Guangdong, China

: Every year typhoons severely disrupt the normal rhythms of human activities and pose serious threats to China’s coast. Previous studies have shown that the impact extent and degree of a typhoon can be inferred from various geolocation datasets. However, it remains a challenge to unravel how dwellers respond to a typhoon disaster and what they concern most in the places with signiﬁcant human activity changes. In this study, we integrated the geotagged microblogs with the Tencent’s location request data to advance our understanding of dweller’s collective response to typhoon Hato and the changes in their concerns over the typhoon process. Our results show that Hato induces both negative and positive anomalies in humans’ location request activities and such anomalies could be utilized to characterize the impacts of wind and rainfall brought by Hato to our study area, respectively. Topic analysis of Hato-related geotagged microblogs reveals that the negative location request anomalies are closely related to damage-related topics, whereas the positive anomalies to trafﬁc-related topics. The negative anomalies are signiﬁcantly correlated with economic loss and population affected at city level as suggested by an over 0.7 adjusted R 2 . The changes in the anomalies can be used to portray the response and recovery processes of the cities impacted.


Introduction
Tropical storms are one of the most destructive natural disasters. Statistics show that during the period from 1998 to 2017, tropical cyclones and typhoons affected around 726 million people and caused 233,000 deaths and USD 1.33 trillion of economic loss worldwide [1]. Furthermore, growing evidence has shown that typhoons occur less frequently but tend to be more intense in response to the global climate change [2][3][4][5]. Over the past 37 years, typhoons that hit East and Southeast Asia have intensified by 12-15% and the proportion of category 4 and 5 storms has doubled or even tripled [6].
China is one of the countries frequently affected by typhoons and has been impacted by eight storms per year from 2010 to 2018. Typhoons have caused over an average of 60 billion RMB Yuan direct economic loss, mainly in the coastal provinces [7]. Typhoon Hato is the most devastating typhoon that hit China in recent years, costing 32 deaths and a 29 billion RMB Yuan economic loss [7]. More studies are needed to help us better understand how dwellers respond to a natural hazard such as a typhoon and how the areas impacted recover after that.
Many data sources have been used to study how human activities change in response to a typhoon disaster. Among them, satellite images have been widely used to monitor and assess typhoon disasters [8], though it is not possible to monitor real-time human activities from remotely sensed data. Over recent years, the use of mobile phones and social media platforms has provided new insights on disaster management. The social sensing data [9], such as mobile phone records and social media posts, can record public behaviors in the physical and cyber space at high spatial and temporal resolutions and have been widely used in typhoon-related studies, in particularly in disaster-related information extraction [10][11][12][13][14][15], situation awareness [16][17][18][19][20], resilience assessment [21][22][23][24][25][26], and rapid damage assessment [27][28][29][30][31][32].
Previous typhoon studies on the use of social sensing data can be roughly grouped into two categories regarding the data utilized in previous studies. One is crowd positioning data represented by mobile phone signaling records. For example, Hong et al. [33] used twomonth positioning data from more than 800,000 anonymized mobile phones in Houston, Texas. They analyzed the mobility patterns of a crowd before, during, and after Hurricane Harvey and found clear socioeconomic and racial disparities in resilience capacity and evacuation patterns of the local communities. The phone records document the users' spatiotemporal attributes, from which the individual or collective human activities across a large geographic scale could be inferred. However, it is not possible to identify the specific types of human activities from phone records. For example, the phone record could not tell what a user is doing when she or he is making a phone call.
Social media data is another type of social sensing data that are widely used to study how dwellers respond to a natural hazard. For example, Huang et al. [17] first split the Twitter updates posted at different stages of a typhoon based on the topics. They then investigated the spatiotemporal patterns of human behaviors and reactions to the typhoon. Their research results provide detailed information about the disaster evolution and are helpful to the decision makers to track the ongoing development of a disaster. From the social media contents, human reactions to an event and spatial cognition of individuals affected could be inferred [9]. The social media data could also be used to track and monitor disaster situations, improve emergency response promptness, and act as a gauge of public interest or concerns [34,35]. Many studies have integrated social media data and remote sensing imagery to monitor disasters and assess disaster-induced damages [29,31,34]. These studies showed that social media data are valuable in providing disaster-related information that cannot be acquired by other sensors [36,37].
More recently, location request data have been used to estimate the collective human activities [38][39][40]. Dwellers' daily activities usually change dramatically during a natural hazard and such changes are recorded in the location request data in a near real-time manner across a large geographic scale. Previous studies have shown that a natural hazard such as a typhoon could significantly impact human activities, and such impacts could be inferred from the changes in the number of location requests (NLR) [41,42]. However, it remains unclear how dwellers respond to typhoon disasters and what the public concern most in the places with significant human activity changes. More studies are needed to integrate multi-source data to help us better understand the changes in human activities in response to typhoon disasters.
In this study, we utilized location request data and geotagged microblogs, to examine how dwellers respond to typhoon Hato. We first introduced our study area, the typhoon, and datasets in Sections 1 and 2. We then displayed our methods and our findings in Sections 3 and 4. Finally, the discussions and conclusions are presented in Sections 5 and 6.

Study Area and Typhoon Hato
Typhoon Hato is a strong tropical cyclone that struck Guangdong in August 2017. It caused 32 deaths and USD 4.34 billion in economic loss. Hato is the worst storm that has hit China in recent years. It is also one of the strongest typhoons that hit Macao over the Remote Sens. 2022, 14, 1269 3 of 17 past 50 years [43]. Hato started as a tropical depression over the northwest Pacific Ocean at 14:00 on 20 August and grew as a tropical storm the next day. On 23 August, it intensified as a strong typhoon and made landfall near Jinwan, Zhuhai at 12:50 p.m. It then quickly weakened to a tropical storm and swept through Guangdong ( Figure 1).
Guangdong is located in the southeast coast of China. It has a total population of 126.01 million and is the most populous province in China. As of 2020, over 74% of Guangdong's population live in urban areas. Over the years, Guangdong was also the province in China that was affected by typhoons most frequently [44]. hit China in recent years. It is also one of the strongest typhoons that hit Macao over the past 50 years [43]. Hato started as a tropical depression over the northwest Pacific Ocean at 14:00 on August 20 and grew as a tropical storm the next day. On August 23, it intensified as a strong typhoon and made landfall near Jinwan, Zhuhai at 12:50 p.m. It then quickly weakened to a tropical storm and swept through Guangdong ( Figure 1). Guangdong is located in the southeast coast of China. It has a total population of 126.01 million and is the most populous province in China. As of 2020, over 74% of Guangdong's population live in urban areas. Over the years, Guangdong was also the province in China that was affected by typhoons most frequently [44].

Data
Tencent's location big data portal [46] provides gridded numbers of location requests with a spatial resolution of 0.01 × 0.01 decimal degree. A user may generate a location request when she/he uses various mobile applications, including the most popular social media platform WeChat, and the world's largest travel service platform Didi, and one of China's most popular e-commerce platforms Jindong, etc. We collected the hourly NLR dataset from August 1 to August 31, 2017 from Tencent's location big data portal for this study.
The NLR dataset has been widely applied to estimate population dynamics [47][48][49] and previous studies have found that over 60% of the changes in the daily NLR could be explained by the variations in the daily number of tourists in tourist attractions [42], suggesting that the NLR dataset could be used as a proxy measure of the short-term collective human activities. It is worth noting that the non-geotagged human activities are not collected and thus are not included in the NLR dataset.
Geotagged microblogs were obtained from the Sina Weibo platform [50], which is one of the China's most popular social media sites and has 165 million daily active users. We collected over 1.5 million geotagged microblogs posted in Guangdong in August 2017. The microblog dataset includes the message texts and a range of additional information including message identifiers, user identifiers, follower counts, retweet statuses, self-reported location, and time stamps.

Data
Tencent's location big data portal [46] provides gridded numbers of location requests with a spatial resolution of 0.01 × 0.01 decimal degree. A user may generate a location request when she/he uses various mobile applications, including the most popular social media platform WeChat, and the world's largest travel service platform Didi, and one of China's most popular e-commerce platforms Jindong, etc. We collected the hourly NLR dataset from 1 August to 31 August 2017 from Tencent's location big data portal for this study.
The NLR dataset has been widely applied to estimate population dynamics [47][48][49] and previous studies have found that over 60% of the changes in the daily NLR could be explained by the variations in the daily number of tourists in tourist attractions [42], suggesting that the NLR dataset could be used as a proxy measure of the short-term collective human activities. It is worth noting that the non-geotagged human activities are not collected and thus are not included in the NLR dataset.
Geotagged microblogs were obtained from the Sina Weibo platform [50], which is one of the China's most popular social media sites and has 165 million daily active users. We collected over 1.5 million geotagged microblogs posted in Guangdong in August 2017. The microblog dataset includes the message texts and a range of additional information including message identifiers, user identifiers, follower counts, retweet statuses, selfreported location, and time stamps.
We also obtained the statistics of Hato-induced damage and loss by city from local governments. The statistics dataset includes the number of people who were impacted and direct economic loss. We also obtained hourly wind speed and rainfall data from the 21 meteorological stations in Guangdong in August 2017 from China Meteorological Data Service Centre [51]. Figure 2 shows our data processing and analysis processes. We first used the Seasonal Hybrid Extreme Studentized Deviate (S-H-ESD) method [52] to detect the anomalies in wind, rainfall time series, and NLR time series. Based on the typhoon process, we then defined the anomaly indicators to evaluate typhoon-triggered human activity changes at city scale. The Biterm topic model (BTM) [53] was utilized to categorize and analyze the topics in typhoon-related microblogs. We then analyzed how city dwellers respond to the Hato and assess cities' responses and recovery process. Finally, we analyzed the spatiotemporal variations in microblogs and investigated the microblog topics within areas with NLR anomalies by spatial overlaying of the abnormal area with microblogs.

Methods
In this study, we used the NLR as a proxy of the collective human activities. Accordingly, significantly increased or decreased NLR would reflect human activity anomalies across space and over time. We extracted public opinions from typhoon-related microblogs as microblogs can reflect users' perceptions of an event. We also obtained the statistics of Hato-induced damage and loss by city from local governments. The statistics dataset includes the number of people who were impacted and direct economic loss. We also obtained hourly wind speed and rainfall data from the 21 meteorological stations in Guangdong in August 2017 from China Meteorological Data Service Centre [51]. Figure 2 shows our data processing and analysis processes. We first used the Seasonal Hybrid Extreme Studentized Deviate (S-H-ESD) method [52] to detect the anomalies in wind, rainfall time series, and NLR time series. Based on the typhoon process, we then defined the anomaly indicators to evaluate typhoon-triggered human activity changes at city scale. The Biterm topic model (BTM) [53] was utilized to categorize and analyze the topics in typhoon-related microblogs. We then analyzed how city dwellers respond to the Hato and assess cities' responses and recovery process. Finally, we analyzed the spatiotemporal variations in microblogs and investigated the microblog topics within areas with NLR anomalies by spatial overlaying of the abnormal area with microblogs.

Methods
In this study, we used the NLR as a proxy of the collective human activities. Accordingly, significantly increased or decreased NLR would reflect human activity anomalies across space and over time. We extracted public opinions from typhoon-related microblogs as microblogs can reflect users' perceptions of an event.

Time Series Anomaly Detection
We used the S-H-ESD method to detect anomalies in the NLR time series. The S-H-ESD has six major steps (Equations (1)-(5)). We first decomposed a time series ( ) into seasonal ( ), trend ( ), and residual (R) components. We then found the most extreme value in the series R and calculated the corresponding statistic (Equations (2) and (3)). The was then compared with a critical value using Equation (4) to determine whether the extreme value was an outlier. If it was, we used Equation (4) to calculate its intensity and then removed the extreme value from the time series. If not, its was set to 0. The process was repeated for times and an anomaly time series corresponding to was constructed.

Time Series Anomaly Detection
We used the S-H-ESD method to detect anomalies in the NLR time series. The S-H-ESD has six major steps (Equations (1)-(5)). We first decomposed a time series (X) into seasonal (S X ), trend (T X ), and residual (R) components. We then found the most extreme value in the series R and calculated the corresponding statistic R j (Equations (2) and (3)). The R j was then compared with a critical value using Equation (4) to determine whether the extreme value was an outlier. If it was, we used Equation (4) to calculate its intensity I j and then removed the extreme value from the time series. If not, its I j was set to 0. The process was repeated for k times and an anomaly time series I corresponding to X was constructed.
5 of 17 where, k is the maximum number of assumed outliers, n is the sample number of the current time series, t α/2N, N−2 is the critical value of the t distribution at the significance level α/(2N) and with a degree of freedom N − 2. Table 1 lists the indicators that we used to describe typhoon and human activity at multiple scales. In this study, we described a typhoon process as the time series of its rainfall and maximum wind speed anomaly (I p ) and (I w ), which are the two major characteristics of a typhoon. In this study, the NLR anomalies were used to characterize abnormal human activities. A positive and a negative NLR anomaly implies significant increased and decreased location requests, respectively. A user may generate a location request when she/he is using the applications for navigation, car hailing, food and merchandise delivery, and social media check-ins, etc. As a result, the NLR anomalies may reflect multifaced human activity changes. We first calculated the positive and negative anomaly time series of each grid, respectively (denoted by I nlr (grid, pos) and I nlr (grid, neg)), based on the method described in Section 3.1. We then obtained the time series of the number of positive and negative grids by city through summarizing the number of abnormal grids within a city's administration boundary. In the end, we calculated the time series of NLR positive and negative anomalies at city scale (I nlr (pos) and I nlr ( neg)) using the same method in Section 3.1. In our study, the NLR anomaly by grid shows the influence of the typhoon on that grid whereas the NLR anomaly by city illustrates the spatial extent impacted by the typhoon within the city. In fact, the NLR positive and negative anomalies can be observed during both the periods with and without a typhoon. The city-level anomalies as calculated using the above approaches can eliminate the anomalies during the non-typhoon periods and can better reveal how a typhoon could impact human activities as we will discuss in Section 4.1. Table 1. Indicators used to describe Hato's impacts and human activity anomalies.

Objects Indicators Scale
Typhoon I p : The intensity of rainfall anomalies City; hourly I w : The intensity of wind speed anomalies City; hourly

Response to Typhoon and Recovery
We defined seven indicators to evaluate human activities under typhoon ( Figure 3 and Table 2). Empirically, a typhoon process normally consists of pre-disaster, ongoing response, recovery, and post-disaster recovery stages [54,55]. Variations in the NLR can be used to describe the human activities at the response and recovery stages. As illustrated in Figure 3, human activity anomalies increased significantly when an area was impacted by a typhoon, then gradually returned to normal after the typhoon weakened.
The recovery stage refers to the process over which human activities gradually bounce back normal and it is widely used to study a city's resilience [26,33,54]. In this study, we calculated the recovery time to evaluate how fast a city returns to normal after the typhoon. We then used the maximum and cumulative NLR anomalies to evaluate the impacts of a typhoon on a city. We also identified the linear relationship between the typhoon-induced damages and the maximum and cumulative NLR anomalies, respectively. by a typhoon, then gradually returned to normal after the typhoon weakened.
The recovery stage refers to the process over which human activities gradually bounce back normal and it is widely used to study a city's resilience [26,33,54]. In this study, we calculated the recovery time to evaluate how fast a city returns to normal after the typhoon. We then used the maximum and cumulative NLR anomalies to evaluate the impacts of a typhoon on a city. We also identified the linear relationship between the typhoon-induced damages and the maximum and cumulative NLR anomalies, respectively.  Cumulative NLR anomalies Duration of recovery t − t

Typhoon-Related Microblogs and Topic Analysis
We obtained 12,167 typhoon-Hato-related geotagged microblogs by examining the hashtags and extracting messages containing 'typhoon' or 'Hato'. We then cleaned the microblogs and removed all non-Chinese characters such as the punctuations, emojis, links, and other characters. Next, we manually removed the microblogs that were irrelevant to the typhoon disaster, such as those for advertising promotion. Finally, the sentences with a text length from 10 to 128 characters in the cleaned microblogs were then segmented and clustered into groups based on their topics.
We used the BTM to extract topics in microblogs. This model has been widely used in disaster-related microblog topic extraction [56][57][58][59], as it outperforms the conventional topic models such as LDA [60]. The BTM model learns the topics through modeling the word co-occurrence patterns in the whole corpus and cluster microblogs based on the frequency and distribution of the bi-terms in the sentences. However, it is an unsupervised model, and the users need to set the number of categories manually and interpret the meanings of the clustering results. In practice, the BTM first specifies the number of classification categories and then performs a series artificial selection and merging to obtain the best category number and topics. We followed the framework in [56] and fed the BTM with our cleaned microblogs. We started with 20 topics and with an increment of five

Typhoon-Related Microblogs and Topic Analysis
We obtained 12,167 typhoon-Hato-related geotagged microblogs by examining the hashtags and extracting messages containing 'typhoon' or 'Hato'. We then cleaned the microblogs and removed all non-Chinese characters such as the punctuations, emojis, links, and other characters. Next, we manually removed the microblogs that were irrelevant to the typhoon disaster, such as those for advertising promotion. Finally, the sentences with a text length from 10 to 128 characters in the cleaned microblogs were then segmented and clustered into groups based on their topics.
We used the BTM to extract topics in microblogs. This model has been widely used in disaster-related microblog topic extraction [56][57][58][59], as it outperforms the conventional topic models such as LDA [60]. The BTM model learns the topics through modeling the word co-occurrence patterns in the whole corpus and cluster microblogs based on the frequency and distribution of the bi-terms in the sentences. However, it is an unsupervised model, and the users need to set the number of categories manually and interpret the meanings of the clustering results. In practice, the BTM first specifies the number of classification categories and then performs a series artificial selection and merging to obtain the best category number and topics. We followed the framework in [56] and fed the BTM with our cleaned microblogs. We started with 20 topics and with an increment of five topics in each experiment until we obtained 40 topics. In the end, we selected the experiment with 30 topics as the results show the most recognizable keyword groups. We then labeled the topics based on the subcategories of disaster-related topics [61] and grouped the topics into nine major categories based on their parent categories. Finally, as shown in Table 3, we manually labeled samples that were randomly selected from the cleaned microblogs dataset and evaluated the accuracy of the results by using precision, recall, and accuracy (Equations (6)- (8)). The accuracy is 70.3%, which is similar to a previous study [59]. Overall where, k is the total number of classes. C i refers to a specific topic, TP is the number of the correctly classified microblogs, FP is the number of microblogs that are incorrectly classified in C i , FN is the number of microblogs that should be classified into topic C i but are classified into other topics. S is the total number of samples, in this case 600.

City-Level Human Activities in Response to Hato
Variations in wind speed and rainfall intensities show Hato's dynamic evolution and its physical impacts on our study area. Figure 4    The negative NLR anomalies are correlated with wind speed whereas the positive NLR anomalies are associated with typhoon-induced rainfall ( Figure 5). When impacted by Hato, human activities in 11 cities showed significantly negative NLR anomalies. Furthermore, the I w and I nlr (neg) curves of these 11 cities show a similar variation trend regarding the maximum (P), start time (t 1 ), and recovery time (T 2 ), particularly in Yunfu, Yangjiang, and Dongguan. The association between the NLR anomalies and the wind speed was further validated by statistical analyses (Figure 5). The Pearson r and Spearman rho between P(I nlr (neg)) and P(I w ) are 0.84 and 0.81, respectively, and between the t 1 (I nlr (neg)) and t 1 (I w ) are 0.87 and 0.88, respectively. The correlations are all statistically significant with p < 0.01.
However, four cities show positive NLR anomalies ( Figure 5), including Heyuan, Zhuhai, Maoming, and Zhanjiang, though Hato dumped significantly heavy rainfall to many cities (Figure 4b). We also found a positive relationship between the positive NLR anomalies and Hato-induced rainfall in these cities except Zhanjiang. Meanwhile, the durations of positive NLR anomaly processes in all these four cities were much shorter than those showing negative NLR anomalies, suggesting human activities were mainly affected by wind speed during typhoon Hato. It is not a surprise that heavy rainfall tends to change dwellers' normal activities significantly and trigger more location requests. For example, car hailing and food delivery demand, therefore location requests, may increase significantly due to the short and heavy rainfall. This finding is consistent with previous research [62].  Figure 6 shows how the 11 cities impacted respond to Hato. Generally, the cities impacted more severely by Hato have a longer response time ( ) though notable distinctions exist among cities (Figure 6a). The response time varies significantly in some cities such as Yangjiang, Maoming, and Guangzhou that are less affected by Hato. More specifically, Guangzhou shows the longest response time, which may be due to its large population. On the other hand, some cities were impacted more by Hato yet show a shorter response time. For example, Hato hit Yunfu four hours later after it landed in Zhuhai but its response duration lasted only four hours, possibly indicating that dwellers in Yunfu were The map shows the NLR anomalies by city with the insets showing the linear correlations and the statistical significance test of P(I nlr (neg)) and P(I w ), t 1 (I nlr (neg)) and t 1 (I w ), respectively. The square graphs on the left and bottom show the variation in I nlr (pos) and I p , I nlr (neg), and I w , respectively. Guangzhou shows the longest response time, which may be due to its large population. On the other hand, some cities were impacted more by Hato yet show a shorter response time. For example, Hato hit Yunfu four hours later after it landed in Zhuhai but its response duration lasted only four hours, possibly indicating that dwellers in Yunfu were well informed and responded rapidly before Hato entered the city's territory (Figure 6a).

Assessment of Typhoon Process, Cities' Responses, and Recovery
The recovery time (T 2 ) in different cities varied significantly across our study area (Figure 6b). Overall, the cities impacted more significantly showed a shorter recovery time period. More specifically, the recovery time of Zhuhai, Jiangmen, and Yunfu, which were on Hato's path, was between four and six hours. By contrast, the recovery time in the cities that were less impacted ranged from 3 to 9 h. Yangjiang had the longest recovery time of 9 h, highlighting the weaker disaster resilience in this city. The negative NLR anomalies are associated with Hato-induced damage at the city level. As shown in Table 4, the strongest correlation was found between the maximal NLR anomalies and the direct economic loss as suggested by an adjusted of 0.80; and between the cumulative NLR anomalies and the affected population (adjust = 0.84).
Such associations show that human activity anomalies could be used to support the rapid hazard-induced damage assessment.

Topics of Typhoon-Related Microblogs
The Hato-related microblogs are grouped into nine distinct and diverse categories (Table 5), which characterize the influences of the typhoon on dwellers in multiple aspects. For example, during the typhoon process, the warnings and alerts topic group generally covers the information about typhoon's landing location and early warning, transportation hub shutdown, and school closure announcements that were mainly issued by governments. The damage topic group includes the information about water outages, power outages, and fallen trees or branches. The topics related to traffic include information The negative NLR anomalies are associated with Hato-induced damage at the city level. As shown in Table 4, the strongest correlation was found between the maximal NLR anomalies and the direct economic loss as suggested by an adjusted R 2 of 0.80; and between the cumulative NLR anomalies and the affected population (adjust R 2 = 0.84). Such associations show that human activity anomalies could be used to support the rapid hazard-induced damage assessment.

Topics of Typhoon-Related Microblogs
The Hato-related microblogs are grouped into nine distinct and diverse categories (Table 5), which characterize the influences of the typhoon on dwellers in multiple aspects.
For example, during the typhoon process, the warnings and alerts topic group generally covers the information about typhoon's landing location and early warning, transportation hub shutdown, and school closure announcements that were mainly issued by governments. The damage topic group includes the information about water outages, power outages, and fallen trees or branches. The topics related to traffic include information about flight delays and subway suspensions. The microblogs also show a cooling topic as Hato brought cool air to Guangdong in a hot summer and dwellers were excited about that. The temporal variations in the number of typhoon-related microblogs (NTM) in different topic categories reveal the different concerns of the dwellers in response to Hato from 22 to 24 August (Figure 7). About 70% of the Hato-related microblogs were posted on 23 August, when the typhoon hit our study area most significantly. The largest number of Hato-related microblogs are found in the warning and damage categories, accounting for over half of all Hato-related microblogs. The topics in the warning and alerts, damage, temperature, traffic, and gratitude and praying categories show significant changes over time. As the typhoon approached and departed Guangdong, the number of the microblogs in the warning and alerts topic category gradually dropped, whereas those in the damage and gratitude and praying topic categories gradually increased over time. Furthermore, we also found more traffic-related microblogs were posted before the typhoon, more coolingrelated microblogs were posted before and after the typhoon. By contrast, microblogs in the caution and advice topic category were more popular during the typhoon process.

Topic Analysis within Areas with NLR Anomalies
The microblogs in the areas with positive and negative anomalies tend to center around a specific topic. We first identified whether a grid shows no, positive, or negative anomalies. A grid is identified as with a positive anomaly if it shows positive anomalies over a longer period than when it shows negative anomalies. We then used the Getis-Ord Gi* [63] method to examine whether there are significant hotspots of positive and negative anomalies (p < 0.05). Finally, we conducted the Two-Proportions test to examine if the proportion of the microblogs regarding a specific topic in an area with a positive NLR anomaly is significantly higher than that of the microblogs regarding the same topic in another area (Table 6). Table 6 shows the associations between the topics in the microblogs and the positive and negative NLR anomalies. In the areas with a positive NLR anomaly, the proportion of traffic-topic-related microblogs (the F column in Table 6) is significantly higher than in other areas though the proportion is only 8%. The numbers of damage-and concern and fear-related microblogs (B and E) in negative anomaly areas are both significantly higher than those in the areas with normal NLR, particularly in the hotspots with negative anomalies (28%). As a result, the negative NLR anomalies are well associated with disasterinduced damage and the positive anomalies associated with traffic-related topics, suggesting that negative anomalies could be used for rapid assessment of disaster loss. By contrast, positive NLR anomalies are likely to show the negative impact of Hato on traffic. Table 6. Results of the proportional test and their corresponding significance test results for various topic categories versus the areas with no, positive, negative anomaly areas (negative hotspots indicates the significant hotspots with negative anomalies). The values in the table represent the proportion of a specific topic-related microblog in relation to the total number of all microblogs in the area with a specific anomaly. Statistically significant values (p < 0.001) are presented in bold. Letters A-E in the first row correspond to the topic categories in Table 5.

Topic Analysis within Areas with NLR Anomalies
The microblogs in the areas with positive and negative anomalies tend to center around a specific topic. We first identified whether a grid shows no, positive, or negative anomalies. A grid is identified as with a positive anomaly if it shows positive anomalies over a longer period than when it shows negative anomalies. We then used the Getis-Ord Gi* [63] method to examine whether there are significant hotspots of positive and negative anomalies (p < 0.05). Finally, we conducted the Two-Proportions test to examine if the proportion of the microblogs regarding a specific topic in an area with a positive NLR anomaly is significantly higher than that of the microblogs regarding the same topic in another area (Table 6). Table 6. Results of the proportional test and their corresponding significance test results for various topic categories versus the areas with no, positive, negative anomaly areas (negative hotspots indicates the significant hotspots with negative anomalies). The values in the table represent the proportion of a specific topic-related microblog in relation to the total number of all microblogs in the area with a specific anomaly. Statistically significant values (p < 0.001) are presented in bold. Letters A-E in the first row correspond to the topic categories in Table 5.

Area_1
Area_2  Table 6 shows the associations between the topics in the microblogs and the positive and negative NLR anomalies. In the areas with a positive NLR anomaly, the proportion of traffic-topic-related microblogs (the F column in Table 6) is significantly higher than in other areas though the proportion is only 8%. The numbers of damage-and concern and fearrelated microblogs (B and E) in negative anomaly areas are both significantly higher than those in the areas with normal NLR, particularly in the hotspots with negative anomalies (28%). As a result, the negative NLR anomalies are well associated with disaster-induced damage and the positive anomalies associated with traffic-related topics, suggesting that negative anomalies could be used for rapid assessment of disaster loss. By contrast, positive NLR anomalies are likely to show the negative impact of Hato on traffic.
The associations between the areas with anomalies and the proportions of specifictopic microblogs pre-and post-Hato are consistent with the aforementioned proportion test results (Figure 8). Before the typhoon, negative anomalies are mainly found in Guangzhou and Shenzhen. During and after the typhoon, we found noticeable negative anomalies in Zhuhai and Jiangmen, where Hato landed, and most damage-related microblogs were posted. The associations between the areas with anomalies and the proportions of specifictopic microblogs pre-and post-Hato are consistent with the aforementioned proportion test results (Figure 8). Before the typhoon, negative anomalies are mainly found in Guangzhou and Shenzhen. During and after the typhoon, we found noticeable negative anomalies in Zhuhai and Jiangmen, where Hato landed, and most damage-related microblogs were posted. Although numerous positive anomalies are found in the areas when Hato is passing through, no significant changes in terms of the distribution of abnormal grids were found before and after the typhoon. This may also be the reason why only four cities show significant positive anomalies (Section 4.1). Moreover, traffic-related microblogs were posted during/after Hato and were mainly from Dongguan, Shenzhen, and Guangzhou. However, we cannot find significant a change of grids with positive anomalies in these three cities. More similar studies are needed to validate the findings regarding the positive anomaly during typhoons.
Our results cross-verified the associations between the proportion of specific topicrelated microblogs and the areas with NLR anomalies. The semantic information from microblogs provides further knowledge about how dwellers responded to Hato and what the public were concerned with most in the places with significant changes in human activities. Although numerous positive anomalies are found in the areas when Hato is passing through, no significant changes in terms of the distribution of abnormal grids were found before and after the typhoon. This may also be the reason why only four cities show significant positive anomalies (Section 4.1). Moreover, traffic-related microblogs were posted during/after Hato and were mainly from Dongguan, Shenzhen, and Guangzhou. However, we cannot find significant a change of grids with positive anomalies in these three cities. More similar studies are needed to validate the findings regarding the positive anomaly during typhoons.
Our results cross-verified the associations between the proportion of specific topicrelated microblogs and the areas with NLR anomalies. The semantic information from microblogs provides further knowledge about how dwellers responded to Hato and what the public were concerned with most in the places with significant changes in human activities.

Discussion
Results from this study reveal that typhoon Hato triggered significant positive and negative location request anomalies, and a negative NLR anomaly is tightly related to Hato's wind speed, whereas an NLR positive anomaly to the rainfall. Such findings are consistent with previous studies [54,55], showing that the typhoon had a significant impact on human activities, which could be inferred from the changes in the number of location requests. This study further examined the social media posts, which usually reflect the concerns of the people impacted and how the concerns change in response to the evolution of a natural hazard. In this study, we integrated the semantic information in the microblogs with Tencent's NLR data to investigate the collective human activities and public concerns in response to Hato as well as the possible reasons that trigger the NLR anomalies. We found that negative NLR anomalies are closely related to damage-related topics and negative emotion, whereas the positive anomalies to traffic-related topics. Our findings suggest that integration of multi-source data can further boost our understating of how people respond to a typhoon.
Many theoretical studies on urban resilience to disasters have been completed [41,54,55] and existing studies show that the urban recovery time after a natural hazard is associated with socioeconomic disparities [26,33,56]. In this study, we further explored the urban resilience from the NLR anomalies by examining the response and recovery time during Hato. Our results indicate that the response and recovery time of the cities that were affected by Hato shows significant disparities across our study area. Some cities impacted less significantly by Hato, such as Yangjiang and Dongguan, show a longer recovery time and a slower recovery pace than those that were impacted more significantly, highlighting these cities' weaker disaster resilience. Findings from this study show the potential of using Tencent's location request data to monitor the impacts of a typhoon. The framework we proposed in this study could be used to examine other typhoon processes and their impacts on cities with different geographic natural and social-economical settings. In fact, with slight modifications, our framework could also be used to study the urban resilience to other hazards across a large geographic scale. When more similar case studies become available, it is very possible to establish a quantitative system to evaluate urban resilience in response to a variety of hazards.
In this study, we found a moderate positive correlation between the maximal negative NLR anomalies and the direct economic loss; and between the cumulative NLR anomalies and the evacuated population. All measures are statistically significant with p < 0.05. Liu et al. [42] also analyzed the relationship between NLR activity and hazard-induced damage at city level. Many other studies have shown that online social media activity can be used for rapid assessment of damage caused by a large-scale disaster [22,27,[64][65][66]. For example, Kryvasheyeu, Y. et al. [27] found that per-capita Twitter activity in the USA strongly correlates with the per-capita economic damage inflicted by Hurricane Sandy. However, the results might be undermined as geotagged tweets only account for 1∼2% of all tweets [67], and a similar dataset shows its advantages in real-time disaster assessment.
In this study, we combined the NLR and microblog data to analyze the collective human activities and public concerns in response to Hato. Such integration can overcome the limitations of using a single data source to improve our understanding the impacts of a typhoon on human activities. The NLR data only capture collective and geotagged human activities with a fine temporal and spatial resolution yet without any specific information about the typhoon. The microblogs data can provide the public perception of the event but are not available in many cities, especially in the underdeveloped areas. A combination of the two datasets can significantly improve our understanding of the impacts of Hato on human activities. However, this study could be further improved. Firstly, this study only examined the human activities that could be inferred from TLR and geotagged microblogs. More geolocation datasets such as cell records and taxi trajectories data need to be examined. Secondly, the Tencent's location data and geotagged microblogs data can only be used to infer representation of certain groups of the users, and vulnerable groups in disasters such as children and elderly people may be underrepresented [36]. Third, more studies are needed as future work needs to be done to understand the abnormal collective human activities in response to hazards such as typhoons. There is a significant spatiotemporal gap between the microblogs and the areas with NLR anomalies. In other words, grids with NLR anomalies are numerous but without enough microblogs, mainly located in the well-developed cities such as Zhuhai, Shenzhen, and Guangzhou (Figure 8). Lacking a sufficient number of microblogs may undermine the study of the association between NLR anomalies and microblog topics. More similar studies are needed to evaluate the cities' resilience in the areas with different natural settings and socioeconomic development status. It is possible to build a more robust model to use TLR anomalies to predict disaster-induced loss when more case studies become available.

Conclusions
Previous studies have shown that typhoons could significantly impact human activities and the impact extent and degree can be inferred from the anomalies of location requests. However, it is impossible to reveal dwellers' concerns and how the concerns change from the spatial anomalies of location request data alone. This study advances our understanding of the collective human perceptions of a typhoon, the changes in the dwellers' concerns during a typhoon disaster, and the typhoon-induced spatial anomalies in human activities.
The main findings of this study include the following: (1) We found significant negative and positive humans' location request anomalies when Hato hit our study area and such anomalies could be utilized to characterize the impacts of wind and rainfall caused by Hato, respectively. Topic analysis based on geotagged microblogs reveals that the negative location request anomalies are closely related to the damage-related microblog topics, whereas the positive anomalies to traffic-related topics. (2) The Tencent's location request data can be used as a low-cost tool for decision-makers to monitor disaster situations. The response and recovery of the cities affected by Hato show significant disparities across our study area. Cities impacted less significantly by Hato show a longer recovery time and a slower recovery speed than those impacted more significantly. (3) The Tencent's location big data could become a potential tool for real-time and low-cost assessment of the typhoon-induced damage. The negative anomalies are significantly correlated with economic loss and population affected at city level, with an adjusted R 2 greater than 0.7. The framework in this study could be applied to examine other typhoon processes and other natural hazards to help us better understand how cities respond and recover from a hazard across a large geographic scale.