Next Article in Journal
Evaluation of an Oral Health Management Project in Connection to a Non-Communicable Disease Prevention and Management Project: A Case Study in South Korea
Previous Article in Journal
Whole Body Vibration Exposure Transmitted to Drivers of Heavy Equipment Vehicles: A Comparative Case According to the Short- and Long-Term Exposure Assessment Methodologies Defined in ISO 2631-1 and ISO 2631-5
Previous Article in Special Issue
Machine Learning, Deep Learning, and Mathematical Models to Analyze Forecasting and Epidemiology of COVID-19: A Systematic Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Correlation Analysis between Urban Elements and COVID-19 Transmission Using Social Media Data

1
Department of Urban Planning, School of Urban Design, Wuhan University, Wuhan 430072, China
2
Center for Geographic Analysis, Harvard University, Cambridge, MA 02138, USA
3
Department of Graphics and Digital Technology, School of Urban Design, Wuhan University, Wuhan 430072, China
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(9), 5208; https://doi.org/10.3390/ijerph19095208
Submission received: 16 March 2022 / Revised: 19 April 2022 / Accepted: 22 April 2022 / Published: 25 April 2022
(This article belongs to the Special Issue Advances in Spatial Epidemiology of COVID-19)

Abstract

:
The outbreak of the COVID-19 has become a worldwide public health challenge for contemporary cities during the background of globalization and planetary urbanization. However, spatial factors affecting the transmission of the disease in urban spaces remain unclear. Based on geotagged COVID-19 cases from social media data in the early stage of the pandemic, this study explored the correlation between different infectious outcomes of COVID-19 transmission and various factors of the urban environment in the main urban area of Wuhan, utilizing the multiple regression model. The result shows that most spatial factors were strongly correlated to case aggregation areas of COVID-19 in terms of population density, human mobility and environmental quality, which provides urban planners and administrators valuable insights for building healthy and safe cities in an uncertain future.

1. Introduction

The 2019 novel coronavirus [1,2], which was first reported in Wuhan, the capital of Hubei province, China, evolved into a worldwide public health event. As the main daily living space for residents, the city had a close internal relationship with people’s health [3,4,5] and there was a close correlation between personal activity related to the city’s attributes and the risk of infection [6]. Although pathogens were transmitted through mobile carriers, the urban environment shaped their flow [3]. The mechanism of the urban environment affected the spatial distribution of COVID-19, and characteristics of the area where cases gathered within the city remained unclear, which required detailed intra-urban analysis to reveal the spatial disparities and determinants of the COVID-19 pandemic.
Personal space and activities related to urban attributes were closely related to infection risk [6] at the city level, and existing research on COVID-19 and its impact factors in urban areas can be classified into three major themes, namely, population density, human mobility and environmental quality. First, studies have demonstrated the effect of spatial aggregation of the population on the susceptibility of people to COVID-19 [7,8]. Second, from the perspective of human mobility, since the distribution of public service facilities affected the urban residents’ mobility, the association of public facilities with the spatial variation of COVID-19 distribution should be further observed [9]. Lai et al. mentioned the impact of human mobility caused by urban traffic [7]. You et al. found that the distribution of commercial facilities and medical service facilities was related to the incidence of COVID-19 [6]. ImKampe et al. proved that COVID-19 outbreaks in schools did occur [10]. Patel et al. referred to the specific performance of food supply facilities during the pandemic [11]. Besides, administrative agencies, as a public service facility for people’s daily affairs, were usually ignored in the existing research [12]. Third, despite the different social, cultural, economic, political and environmental backgrounds around the world, residential environmental quality played an important role in curbing the pandemic and fighting diseases [13]. The relevance of poor communities in COVID-19 infection was also emphasized in existing research [14,15]. You et al. suggested that older communities in Wuhan should be renovated and the corridors of public space should be expanded to alleviate overcrowding [6]. In addition, air quality had also been proved to have a significant correlation with the COVID-19 pandemic [16,17,18] and open spaces such as green spaces and waters played a significant role in people maintaining distance and good health [19,20]. Furthermore, in addition to the elements mentioned in the above study, with the comprehensive consideration of the unique attributes of Wuhan (such as having a population with more than a million college students, etc.), seventeen explanatory variables possibly related to the distribution of the COVID-19 in the main urban area were selected.
Our intellectual contributions were threefold. Firstly, most existing studies had explored the impact of risk factors of COVID-19 at the country level [21], but lacked investigations of the spatial variations of the COVID-19 pandemic at the intra-urban scale, especially for Wuhan, the city where the first outbreak of the pandemic was reported. Our empirical explorations of Wuhan’s experiences of COVID-19 sought to inform detailed urban policy and governance for better preparedness and responses to public health crises.
Secondly, the main challenge hampering the research into spatial correlation between pandemics and urban spatial factors was the lack of the pandemic information on a fine spatial scale, such as on the community or kilometer grid scale. Since the pandemic information released by the Wuhan government was only at the administrative region level [6,22,23], there was no access to more detailed geospatial case data. Social media data were widely used in a myriad of studies related to COVID-19 for geotagging possible patients, analyzing the spatiotemporal dynamics of pandemics [8], studying psychological consequences of the disease [24], investigating the public’s response [25], characterizing the propagation of situational information [26], predicting re-outbreaks of the COVID-19 [27], and examining information flow during the pandemic [28]. Compared with the traditional case data, such social media data had the unique benefits of better accessibility and rich and accurate geographic information, and we innovatively used Weibo data to revisit the COVID-19 pandemic during the early stage in detail, which were found to have good sample coverage and were able to represent the spatial results of the natural transmission of COVID-19 in the main urban area of Wuhan to a certain extent [8].
Thirdly, we draw from the literature to develop a framework of urban elements for better detecting the mechanism of spatial factors affecting the spatial transmission of the disease in urban spaces. A myriad of studies focused on the influence of social and economic factors on COVID-19 [6,29], with no focus on the characteristics of the city itself. Most studies focused on the association of the built environment’s attributes with the spread of COVID-19 [30,31] while minimizing the impact of the natural environment in the urban space. Since the urban form generally encompasses a number of physical features and nonphysical characteristics [32], we hence conducted a novel framework including seventeen urban factors in terms of population density, human mobility and environmental quality to investigate the correlation between infectious outcomes and related urban factors.
Given all the above, the reason why we selected Wuhan as the study region can be attributed to the following three points. First, Wuhan was the first seriously impacted city by COVID-19 [8]. Second, the Weibo cases only occurred in Hubei province, and most of them were concentrated in Wuhan. Third, it was confirmed that the Weibo help data could well demonstrate the three phases of COVID-19’s early spread in Wuhan—scattered infection, community spread, and full-scale outbreak [8]—as well as reflecting the results of natural transmission with less intervention during the early stage of the pandemic.
In this study, we collected about 1200 Weibo messages, which included 729 valid records located in the main urban area of Wuhan, so as to represent the spatial variations of the COVID-19 pandemic in the city. Seventeen explanatory variables, from perspectives of population density, human mobility and environmental quality, namely, population, the elderly population, the third ring road, rivers (the Han river and the Yangtze river), markets, third-class hospitals, middle schools, universities, business points of interest (POIs), administration points of interest (POIs), bus stops, metro stations, house prices, age of buildings, air quality index, green spaces, and waters (rivers and lakes) were chosen to identify their influences on the spatial variation of COVID-19 in the main area of Wuhan. Considering the spatial correlation within Weibo COVID-19 cases and influencing factors, respectively, we applied the Euclidean Distance (ED), Kernel Density Estimation (KDE) and Inverse Distance Weighted (IDW) methods to process these elements, and adopted the multiple linear regression model to test the extent to which the severity of infection was impacted by urban spatial indicators. The possible impacts were further discussed so as to provide urban planners and administrators with valuable insights for building healthy and safe cities in a future of uncertainty.

2. Study Area and Data

2.1. Study Area

Wuhan is a city in Central China with a total area of 8569.15 square kilometers, a permanent resident population of 12.3265 million, and a total economic output of 1561.61 billion yuan, which was ranked among the top ten cities in China by the end of 2020. Wuhan is an important water, land and air transportation hub in China since it is located at the intersection of the golden waterway of the Yangtze River and the artery of the Jingguang railway and hence it has always been known as the “thoroughfare of nine provinces”.
Wuhan’s main urban area (MUA) is located in the center of the city, and it is the main gathering area for urban activities (Figure 1). According to the confirmed data released by the Wuhan Municipal Health Commission, as of 16 March 2022, Wuhan had reported a total of 50,537 confirmed cases, 76.034% of which were located in the main urban area of Wuhan (Table 1). Therefore, MUA was the region where cases were concentrated in Wuhan and the main administrative areas covered by MUA include Jiang’an, Jianghan, Qiaokou, Hanyang, Qingshan, Wuchang, Hongshan and Wuhan Economic Technological Development districts.
The traffic analysis zone (TAZ) based on the street network was the spatial statistical unit for this study. The average size of TAZ was 0.204 km2, with the smallest block being 0.007 km2 and the largest block being 5.851 km2. There were 1681 blocks in total.

2.2. Weibo COVID-19 Cases

In the early stage of the COVD-19 pandemic, due to its rapid outbreak, Wuhan issued a message on the closure of the city on 23 January 2020, which further aggravated the public’s panicked sentiment. A large number of people flocked to the hospital, leading to a patient overflow. Sina Weibo, one of the most influential social media platforms in China, opened a help-seeking channel for patients infected with COVID-19 suffering from pneumonia. Patients who could not receive timely treatment could be treated by publishing relevant basic information on the topic.
The Weibo help information was mainly distributed from 3 February to 12 February, and most of the information was posted from February 4th to 8th and stopped updating until the local government took a series of effective measures to supplement medical resources. The research collected about 1200 Weibo messages over this period of time, for which 729 valid records were located in the main urban area of Wuhan and basic information such as the patient’s age, gender, address, date of illness, and the number of family infections were obtained by eliminating invalid and duplicate information. The onset time of patients included in the effective records ranged from 20 December 2019 to 10 February 2020, and most of them were concentrated from 1 January 2020 to 6 February 2020 (Table 2).
The participation rates in the social media helpline among the COVID-infected at the administrative unit level (Table 3) were further obtained by calculating the ratio of infections in the Weibo information to the cumulative confirmed cases. Due to the sudden outbreak of the disease in the early stage, the officially confirmed data at the administrative region level could only be traced back to 5 March 2020, which may cause deviation to a certain extent. The result showed that the participation rates ranged from 1.46% to 2.49%, which could supplement the reliability of data to some extent.

2.3. Explanatory Variables

We selected seventeen explanatory variables from the perspectives of population density, human mobility and environmental quality (Table 4) to test the extent to which the severity of infection was influenced by urban spatial indicators.
In Wuhan, the third ring road and the rivers played a crucial role in separating the population distribution in the urban region [33], and hence the third ring road, rivers, population, and the susceptible population together were used as the measure of population density status. The information on the third ring road and rivers (the Han river and the Yangtze river) was provided by the local planning institute of Wuhan. Mobile phone data with age tags were used in this study to obtain the spatial distribution mapping of the population and the elderly population by the number of users that the base station served, which was summarized by matching base stations with the highest call frequency of users to the user ID.
Since the distribution of public service facilities affected the urban residents’ mobility [9], third-class hospitals, middle schools, universities, business facilities, administration facilities, bus stops, and metro stations were included to measure human mobility. In China, the market was the main place where people frequently entered and exited in their daily lives, and the Huanan Market especially was the first place in Wuhan where the pandemic outbreak was identified; hence, it was included as an element that affected human mobility [34,35,36]. Furthermore, the university was also included in the mobility theme since there were 84 universities located in Wuhan by the end of 2019. The POI data of facilities in 2019 were extracted from the Gaode Map API, which contained public service facilities (middle schools, universities, hospitals, administrations, markets, restaurants) and the transportation facilities (street network, bus stations and subway stations) in the main urban area of Wuhan.
In terms of the environmental quality, five variables were identified: house prices, age of buildings, air quality index, green spaces and waters (rivers and lakes). The sales prices and chronological information of residential properties were taken from the Anjuke website, which may partly reflect the quality of the residential environment. The Air Quality Index (AQI) data of 15 stations in 2019 in the main area of Wuhan were obtained from the local government and were used as the regional air quality indicator. Basic land use information such as green spaces and waters were provided by the local planning institute of Wuhan.
The average of the kernel density analysis of Weibo help cases in each TAZ unit was the dependent variable and seventeen processed variables (Table 4) in terms of the population density, human mobility and environmental quality were the explanatory variables, so as to explore the possible correlation between urban variables and COVID-19 transmission.

3. Methods

3.1. Kernel Density Analysis

The kernel density method was used to calculate the unit density of the measured values of points and line elements within a specified neighborhood, and could intuitively reflect the distribution of discrete measured values in the continuous area [37,38]. It considered the interaction between spatial elements as well as weight values for specific parameters. Kernel density analysis could be used for facility accessibility [39], the incidence of disease [40], regional analysis [41], traffic analysis [42], etc., and hence kernel density interpolation was used to characterize the distribution of COVID-19 cases from Weibo, the population and the elderly population, the accessibility of transportation facilities, educational facilities, commercial facilities and administrative facilities, and water delivery in the main area of Wuhan, since it was considered that these variables served the local urban area. The equation was as follows:
f ^ h ( x ) = 1 n h i = 1 n K ( x x i h )
where K was the kernel (a non-negative function), h > 0 was a smoothing parameter called the bandwidth, and x i was the sample point.
Due to the different nature of the facilities, the accessibility measurement should be adopted according to the daily use mode. The commonly used spatial Euclidean Distance (ED) was applied in the study to capture the closest distance from residents to the third ring road, rivers (the Han river and the Yangtze river), third-class hospitals [43], markets, and the surrounding green space [44], since it was considered that hospitals, green spaces, and markets served the entire urban area, and the distance from rivers and the third ring road was used to represent the location characteristics of residents.

3.2. Inverse Distance Weighted (IDW)

The Inverse Distance Weighted (IDW) method was used for interpolation to estimate unmeasured cell values. It assumed that things that were closer to each other were more similar than those which were farther apart. Greater weights were assigned to the elements which were closest to the measured cells; consequently, the allocated weights changed as an inverse function of the pth power of distance, where power function (p) was a positive real number [45], and the value of p was 2. The Inverse Distance Weighted method was usually used to predict air quality parameters [46,47], spatial rainfall distribution [48], surface water quality [49], etc., and hence it was applied to describe the air quality indicator (AQI) in the main urban area in Wuhan.

3.3. Multiple Linear Regression

Multiple Linear Regression was used to determine the relationship between a dependent variable (y) and more than one independent variable (x1, x2, …, xk) in the scientific research [50,51,52]. The general form of the multiple linear regression model was given by
y = β 0 + β 1 x 1   + β x 2 + + β k x k + ε ( k     17 )
where y was the density value of the Weibo population, x1, x2, …, xk were values of selected explanatory variables, β 0 was the intercept, β 1 , β 2 , …, β k were regression coefficients of factors, and ε was the random error.

4. Results

4.1. Spatial Variations of Weibo COVID-19 Cases

According to the number of household infectors reported in each record, the result of the kernel density estimation of COVID-19 cases of Weibo data is presented in Figure 2. The spatial distribution of cases showed relatively concentrated regional patterns in riverside (the Yangtze river and the Han river) areas of the Jiang’an, Jianghan, Qiaokou, Wuchang, and Hanyang districts, as well as the eastern area of East Lake High-Tech Development district.

4.2. Correlation Analysis

Weibo help data and seventeen influence factors were coded and the numerical statistical results are shown in Table 5. The scatter diagram of variables (Figure 3) showed the correlation between the density of COVID-19 cases from Weibo and the seventeen independent variables, and strong linear trends were determined between distance to the third ring road and hospitals, kernel density of the population, the elderly population, middle schools, bus stops and metro stations, and the age of buildings and the Weibo help cases with an absolute value of the Pearson’s r greater than 0.5.

4.3. Multiple Regression Model

As there may be collinearities between variables, this study conducted the calculation of linear regression (Table 6), in which the kernel density of the Weibo help cases was the dependent variable and the rest of the seventeen factors were used as independent variables, and the strength of correlation between the dependent variable and each influence elements changed. Fourteen explanatory variables, the Euclidean Distance (ED) to the third ring road, markets, hospitals, rivers, and green spaces, the Kernel Density Estimation (KDE) of the elderly, middle schools, universities, bus stops, metro stations and waters, house prices, age of buildings, and the Inverse Distance Weighted (IDW) value of air quality index (AQI) were obtained through the collinearity test and 5% significance level test, and the factors of the Kernel Density Estimation (KDE) of the population, business facilities and administrations were excluded from the model due to the existence of collinearity and failure to pass the significance test, respectively. Consequently, a linear regression model of fourteen selected factors was further constructed to observe the mechanism of the spatial distribution of COVID-19 and its determinant urban elements (Table 7). In general, the adjusted R-square of the model was 0.790, which indicated that the selected fourteen independent variables’ explanatory abilities for the dependent variable were credible.
The spatial distribution of selected fourteen explanatory variables in TAZ units were shown in Figure 4. Units with the highest elderly population density were concentrated in the core area of the Jianghan, Jiang’an, Wuchang, as well as Qingshan districts. Markets, third-class hospitals, middle schools and transportation (bus stops and metro stations) facilities presented relatively balanced distribution patterns on the whole. Universities in Wuhan were concentrated on the right bank of the Yangtze River. In terms of environmental quality elements, the distribution of green spaces and water spaces (rivers and lakes) were relatively discrete. High air quality indices were found in the Qingshan and eastern Hongshan districts, and it was shown that the air quality in the center area was the best. The core areas of Wuchang and Hongshan had relatively higher house prices. There were clusters of older buildings in the central areas of Wuchang, Hongshan, Jiang’an as well as Jianghan districts.
The visualization of the standardized coefficients of the fourteen factors regression model was shown in Figure 5 to clearly observe the influence mechanism of factors on the dependent variable. Influence factors of the Kernel Density Estimation (KDE) of the elderly population, middle schools, bus stations and metro stations, the Euclidean Distance (ED) to the third ring road and green spaces, and the Inverse Distance Weighted (IDW) value of air quality index (AQI) had positive correlations with the aggregation of COVID-19 infections. Among them, the Kernel Density Estimation (KDE) of bus stops and the elderly population, as well as the Euclidean Distance (ED) to the third ring road determined particularly significant positive effects, with coefficients of 0.321, 0.312 and 0.267, respectively. The Euclidean Distance (ED) to markets, hospitals and rivers, house prices, Kernel Density Estimation (KDE) of universities and waters, as well as the age of buildings had negative correlations with the density of cases within the city, among which, house prices and the Euclidean Distance (ED) to rivers showed stronger negative effects, with coefficients of −0.178 and −0.174, accordingly.

5. Discussion

The study aimed to explore the correlation between urban factors and the transmission of COVID-19 based on Weibo COVID-19 data and seventeen spatial indicators generated by multi-source data. Generally, the characteristics of case aggregation areas in the main urban area of Wuhan can be summed up in three features: high population density, high degree of human mobility and relatively poor environmental quality.

5.1. Impacts of Urban Factors on COVID-19 Transmission

In the early stage of the pandemic, residential areas located in the city center, which were measured by the distance to the third ring road, and developed regions along the river tended to be more densely populated, and hence the number of infected people was high, which was consistent with the findings of the literature [53] that larger metropolitan areas had a higher infection and mortality rates. The density of the elderly became the positively relevant factor for the spatial distribution of the patients, indicating that the elderly population was susceptible and was at great risk of serious disease and death in the pandemic [54,55]. In Milan, it was proved that elderly populations suffered from very poor accessibility to primary health services during the pandemic [56].
Second, the kernel density of bus stations, metro stations and middle schools became positively related factors to the spatial distribution of the patients, while the Euclidean Distance (ED) to hospitals and markets and Kernel Density Estimation (KDE) of universities were negatively related factors, which could be attributed to the layout of public facilities such as busses, metro stations, middle schools, hospitals and markets which increased human mobility, and the layout of relatively close universities decreased the degree of human mobility, and hence the probability of people-to-person contact was raised or declined [6]. Areas with better hospital access had higher risks of infection. A possible explanation for this is that due to the unbalanced spatial distribution of medical facilities, patients tended to go to large hospitals for treatment, causing a wider range of cross-infections, which made hospitals become high-risk places for the initial spread of COVID-19 [57].
Third, on the one hand, with the high-density development model of cities in recent years, high-rise residential buildings have become the normal form of residence. Consequently, public elevators and other infrastructures may be risk factors in some relatively new communities during the pandemic, while in old communities, the stairs were well ventilated, and hence led to fewer infections. Simultaneously, house prices were negatively correlated with infection density [15]. Communities with high house prices may have better management services and hence could be well protected during the pandemic, which indicated that the social polarization between the rich and the poor was one of the significant inequality factors during the pandemic. On the other hand, the air quality and the accessibility of open spaces such as green spaces and waters also had slight impacts on the variability of the spatial distribution of the disease, and the study considered that these impacts were formed through the decompositions of the virus and long-term potential effects on residents’ immunity [58]. Furthermore, poor air quality may increase the probability of residents suffering from respiratory diseases in winter, thereby raising the possibility of infection after going to hospitals.

5.2. Policy Recommendations for Urban Development

Cities are the habitats of residents. The COVID-19 pandemic showed that many cities in the world were insufficiently prepared to deal with the challenge it presented. Weak links in urban materials and infrastructure were revealed. The urban environment played an important role in suppressing the spread of the virus before the pandemic, and cities should further complete emergency planning and response measures to offset future risks after the pandemic [3], as the weaknesses should be resolved before the next pandemic [59]. The study proposed the following targeted urban planning and management recommendations based on the three characteristics of the areas where cases were highest.
First, areas with high-density populations and susceptible populations were more likely to be infected. For high-density urban areas, especially in Asia, city managers should give priority to identifying areas where the population and susceptible populations gather and implement strict controls so as to prevent the further spread of disease during a pandemic [21]. In Wuhan, the results revealed that areas far from the third ring road (with a positive coefficient of 0.267) or close to the Yangtze river (with a negative coefficient of −0.174) had a large number of people; although high-density cities had an efficient agglomeration effect, the risk of high-density population agglomeration should be reduced in future urban construction, such as multi-center construction and rational evacuation planning in urban areas [7]. In addition, the empirical results showed that the elderly have a high degree of positive relationships with the spatial variation of the disease (with a positive coefficient of 0.312) since they were more likely to develop severe illnesses due to their reduced immunity in the pandemic as the susceptible population. Policymakers could plan to provide specialized services by improving the level of services designed for the elderly, so as to promote healthy aging in urban areas [56].
Second, the high accessibility of public service facilities provided convenience for residents, but it also increased the mobility of the population, leading to a high risk of infection. The unfairness of the distribution of public service facilities and resources aggravated the excessive concentration and mobility of local nodes. The empirical results showed that infections were higher in areas close to hospitals, with a coefficient of −0.870, indicating that urban planning and management departments need to strengthen the rational layout of public service facilities and develop a fair allocation of public service resources, such as promoting the further improvement of the three-level diagnosis and treatment system and promoting remote diagnosis and treatment [60], so as to reduce excessive clustering in third-class hospitals. In addition, for high-risk buildings such as hospitals, defensive space design should be enhanced. On the other hand, the fragility of the food supply during the pandemic was also a major problem faced by people. Lai et al. proposed the idea of restoring self-sufficient urban agriculture, which was not applicable to Chinese cities [7]. Swiggy (similar to Uber Eats in the US), an Indian food delivery platform, used its advantages to ensure food supply for the vulnerable population in Bangalore [61]. Online group purchases and unified distribution via community management were two of China’s successful experiences in fighting the pandemic. In long-term planning and urban management, it is also necessary to consider the virtualized online operation of daily public service facilities such as markets, redefine the layout of the urban logistics system, and build a non-contact distribution and supply guarantee system for daily living materials. Public transportation also increased the risk of infection. In Wuhan, empirical evidence showed that places where bus and metro stations were densely distributed were accompanied by a high degree of disease transmission, among which the positive impact of bus stops was greater, with a coefficient of 0.321. Consequently, measures such as limiting the number of passengers per vehicle and increasing the number of bus or railway lines could be taken to reduce excessive crowds. At the same time, sufficient disinfection facilities should be arranged in public service facilities, as was shown in a nationwide survey in Spain, which could result in a greater willingness to use public transport in post-COVID-19 times [62]. Regarding modes of travel, residents need to be encouraged more to ride bicycles and walk. The pedestrian and bicycle lanes in Wuhan were relatively scattered and incomplete and hence a more connected and safer road network should be planned. For the university model, such as a city with nearly a hundred universities, the test in Wuhan suggested that the presence of universities was able to curb the spread of the disease to some extent [63], with a negative coefficient of −0.059, which demonstrated that it may be better to continue to maintain the closed management mode in China, since the experimental results proved that it had a positive impact on the fight against the pandemic.
Finally, high environmental quality can reduce the risk of infectious disease transmission. Studies have shown that newly built high-rise residential buildings in Wuhan had higher risks of infection than old multi-story residential buildings, with a negative coefficient of −0.100, and hence the study proposed that the proportion of future high-rise residential developments in cities should be appropriately reduced. On the other hand, the experimental results proved that better air quality, water distribution and green space accessibility had a positive impact on the fight against the pandemic in Wuhan, which shows that development managers should further focus on improving the quality of human settlements. In Australia, COVID-19 created more opportunities for people to come into contact with and learn about nature at home [13], while in China, the fact was that there were no adequate parks and open spaces where people in densely populated urban areas could walk [64,65,66], in spite of people’s growing interest in short walks [19,67]. Consequently, open public spaces, such as waters and green spaces, should be more balanced to guarantee fair accessibility for residents, since it was beneficial to the residents’ health [68,69], and was able to play a barrier function in the urban ecological space to enhance the city’s natural immunity and allow residents to live in a fair and resilient city environment.

5.3. Future Research and Limitations

This study provides a basic research paradigm for the influence mechanism of urban elements on COVID-19 at various stages of the pandemic development in urban spaces. Firstly, in general, in the early stage of the pandemic, as evidenced by this study, different cities around the world, with the same and different population volumes, could establish their own factor system that may affect disease transmission according to the common (density, public service facilities, open space, etc.) and unique attributes (mountain cities, lake cities, university cities, etc.) of the city, as well as from the three aspects of population density, human mobility and environmental quality, so as to identify related urban factors and help urban planners optimize the allocation of public service facilities, as well as rethinking the reasonable size and structure of the city. Secondly, with the mutation of the virus, the infectivity continued to increase and people entered the stage of normalization of social activities and pandemic prevention. The pandemic data became more accurate, transparent, standardized and refined, which helps researchers achieve refined exploration of the interaction between the spatial variation of disease and related factors in different urban regions; this also makes it possible to provide urban planners with more accurate and effective planning suggestions in the context of pandemic prevention.
Moreover, the research can be expanded on in the following aspects in the future: (1) Combined with more detailed flow data, a detailed agent model can be built for deep research of cities’ internal connections during the pandemic. (2) The model can be applied to the study of explorations of relationships between other chronic and acute diseases and cities.
The limitations of our data and analysis can be split into three aspects. Firstly, due to the distinct temporal character of the Weibo data, the effective sample size was only 729, and hence there was still missing disease information that involved privacy issues. Although the sample size was limited, the spatial coverage of the data was good, and the study applied Kernel Estimate Density (KDE) to generate surfaces of the pandemic in the main urban area of Wuhan, which could clearly show the difference in the level of disease prevalence between regions. However, the use of spatial estimates of Kernel Density Estimation (KDE) could cause potential bias in the regression analysis, which was non-evaluative and unavoidable in this study. Secondly, the study attempted to assess the bias of the Weibo help data by calculating the rate of participation in the helpline among those infected with COVID-19, while due to the limited availability of the public data, only the participation rate at the district level was obtained. This meant we were unable to prove that on a more granular scale, within each TAZ unit, people infected with COVID-19 had the same attitude to posts. Thirdly, the internal built-up environment of residents was not taken into account in the model. In general, this study can be regarded as proposing an idea based on limited and innovative data. If detailed data can be obtained, then more detailed correlations can be drawn in the same way.

6. Conclusions

During the outbreak of the pandemic, there was spatial variability of COVID-19 in urban spaces, and the identification of the variation, as well as their interaction mechanisms with urban elements, is of great significance in the fight against viruses and the future development of cities. The study contributes by proposing the linear regression model with innovative social media data and seventeen explanatory factors to achieve the quantitative study of the spread of COVID-19 within Wuhan city. Weibo case data provided urban scholars a unique opportunity to revisit the result of the natural development of COVID-19 in Wuhan city during the early stage of the pandemic, which was the main advantage compared to other city-level data. The data had a good sample coverage, which provided an opportunity to obtain disease information on a fine scale (such as TAZ unit level), since, even now, there is no way to obtain publicly accurate data in the early stage on a fine scale or district scale. Fourteen significant urban factors—the Euclidean Distance (ED) to the third ring road, markets, third-class hospitals, rivers and green space, the Kernel Density Estimation (KDE) values of the elderly population, middle schools, universities, bus stops, metro stations and waters, the Inverse Distance Weighted (IDW) value of air quality index (AQI), house prices, and the age of buildings—were finally selected to help us understand the distribution pattern of the COVID-19 in the urban space. Three characteristics of the areas of case aggregation were identified and the study further discussed the impacts and proposed detailed and targeted recommendations on the intensive management of high-density population and susceptible population areas, configuration and transformation of public facilities, and comprehensive improvement of urban environmental quality, to contribute to the construction of the healthy city which could well protect people from disease threats when faced with similar emergencies in the future.

Author Contributions

Conceptualization, L.L. and Z.P.; methodology, R.W.; data curation, H.W. and L.L.; writing—original draft preparation, L.L. and R.W.; writing—review and editing, Z.P. and H.W.; funding acquisition, Z.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number 52078390; National Natural Science Foundation of China, grant number 51978535.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data sharing not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sohrabi, C.; Alsafi, Z.; O’Neill, N.; Khan, M.; Kerwan, A.; Al-Jabir, A.; Iosifidis, C.; Agha, R. World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19). Int. J. Surg. 2020, 76, 71–76. [Google Scholar] [CrossRef] [PubMed]
  2. Wang, L.; Wang, Y.; Ye, D.; Liu, Q. Review of the 2019 novel coronavirus (SARS-CoV-2) based on current evidence. Int. J. Antimicrob. Agent 2020, 55, 105948. [Google Scholar] [CrossRef]
  3. Lai, K.Y.; Webster, C.; Kumari, S.; Sarkar, C. The nature of cities and the COVID-19 pandemic. Curr. Opin. Environ. Sustain. 2020, 46, 27–31. [Google Scholar] [CrossRef] [PubMed]
  4. Alirol, E.; Getaz, L.; Stoll, B.; Chappuis, F.; Loutan, L. Urbanisation and infectious diseases in a globalised world. Lancet Infect. Dis. 2011, 11, 131–141. [Google Scholar] [CrossRef]
  5. Matthew, R.A.; McDonald, B. Cities under Siege: Urban Planning and the Threat of Infectious Disease. J. Am. Plan. Assoc. 2007, 72, 109–117. [Google Scholar] [CrossRef]
  6. You, H.; Wu, X.; Guo, X. Distribution of COVID-19 Morbidity Rate in Association with Social and Economic Factors in Wuhan, China: Implications for Urban Development. Int. J. Environ. Res. Public Health 2020, 17, 3417. [Google Scholar] [CrossRef]
  7. Lai, S.; Leone, F.; Zoppi, C. COVID-19 and spatial planning. TeMA J. Land Use Mob. Environ. 2020, 231–246. [Google Scholar] [CrossRef]
  8. Peng, Z.; Wang, R.; Liu, L.; Wu, H. Exploring urban spatial features of COVID-19 transmission in Wuhan based on social media data. ISPRS Int. J. Geo-Inf. 2020, 9, 402. [Google Scholar] [CrossRef]
  9. Jin, X.; Leng, Y.; Gong, E.; Xiong, S.; Yao, Y.; Vedanthan, R.; Yang, Z.; Chen, K.; Wu, C.; Yan, L. Neighborhood-Level Public Facilities and COVID-19 Transmission: A Nationwide Geospatial Study In China. Medrxiv 2020. [Google Scholar] [CrossRef]
  10. Im Kampe, E.O.; Lehfeld, A.-S.; Buda, S.; Buchholz, U.; Haas, W. Surveillance of COVID-19 school outbreaks, Germany, March to August 2020. Eurosurveillance 2020, 25, 2001645. [Google Scholar] [CrossRef]
  11. Patel, A.; Shah, P. Rethinking slums, cities, and urban planning: Lessons from the COVID-19 pandemic. Cities Health 2020, 1–3. [Google Scholar] [CrossRef]
  12. Schomaker, R.M.; Bauer, M.W. What Drives Successful Administrative Performance during Crises? Lessons from Refugee Migration and the COVID-19 Pandemic. Public Admin. Rev. 2020, 80, 845–850. [Google Scholar] [CrossRef]
  13. Hu, R. Reinventing community in COVID-19: A case in Canberra, Australia. Soc.-Ecol. Pract. Res. 2020, 2, 237–241. [Google Scholar] [CrossRef]
  14. Berkowitz, R.L.; Gao, X.; Michaels, E.K.; Mujahid, M.S. Structurally vulnerable neighbourhood environments and racial/ethnic COVID-19 inequities. Cities Health 2020, 1–4. [Google Scholar] [CrossRef]
  15. Hu, M.; Roberts, J.D.; Azevedo, G.P.; Milner, D. The role of built and social environmental factors in COVID-19 transmission: A look at America’s capital city. Sustain. Cities Soc. 2021, 65, 102580. [Google Scholar] [CrossRef]
  16. Bashir, M.F.; Ma, B.; Komal, B.; Bashir, M.A.; Tan, D.; Bashir, M. Correlation between climate indicators and COVID-19 pandemic in New York, USA. Sci. Total Environ. 2020, 728, 138835. [Google Scholar] [CrossRef]
  17. Wu, X.; Nethery, R.C.; Sabath, B.M.; Braun, D.; Dominici, F. Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis. Sci. Adv. 2020, 6, eabd4049. [Google Scholar] [CrossRef]
  18. Doğan, B.; Jebli, M.B.; Shahzad, K.; Farooq, T.H.; Shahzad, U. Investigating the effects of meteorological parameters on COVID-19: Case study of New Jersey, United States. Environ. Res. 2020, 191, 110148. [Google Scholar] [CrossRef]
  19. Kleinschroth, F.; Kowarik, I. COVID-19 crisis demonstrates the urgent need for urban greenspaces. Front. Ecol. Environ. 2020, 18, 318–319. [Google Scholar] [CrossRef]
  20. Pandit, N. COVID-19: Urban Density’s Poetic Justice. Ecol. Econ. Soc. INSEE J. 2020, 3, 13–18. [Google Scholar] [CrossRef]
  21. Global Humanitarian Response Plan COVID-19. OCHA Geneva. Available online: https://reliefweb.int/sites/reliefweb.int/files/resources/Global%20Humanitarian%20Response%20Plan%20COVID-19.pdf (accessed on 1 July 2020).
  22. Ma, Y.; Zhao, Y.; Liu, J.; He, X.; Wang, B.; Fu, S.; Yan, J.; Niu, J.; Zhou, J.; Luo, B. Effects of temperature variation and humidity on the death of COVID-19 in Wuhan, China. Sci. Total Environ. 2020, 724, 138226. [Google Scholar] [CrossRef] [PubMed]
  23. Pan, A.; Liu, L.; Wang, C.; Guo, H.; Hao, X.; Wang, Q.; Huang, J.; He, N.; Yu, H.; Lin, X.; et al. Association of Public Health Interventions with the Epidemiology of the COVID-19 Outbreak in Wuhan, China. JAMA 2020, 323, 1915–1923. [Google Scholar] [CrossRef] [Green Version]
  24. Li, S.; Wang, Y.; Xue, J.; Zhao, N.; Zhu, T. The impact of COVID-19 epidemic declaration on psychological consequences: A study on active Weibo users. Int. J. Environ. Res. Public Health 2020, 17, 2032. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Zhao, Y.; Xu, H. Chinese public attention to COVID-19 epidemic: Based on social media. medRxiv 2020, 22, e18825. [Google Scholar]
  26. Li, L.; Zhang, Q.; Wang, X.; Zhang, J.; Wang, T.; Gao, T.-L.; Duan, W.; Tsoi, K.K.-F.; Wang, F.-Y. Characterizing the propagation of situational information in social media during COVID-19 epidemic: A case study on weibo. IEEE Trans. Comput. Soc. Syst. 2020, 7, 556–562. [Google Scholar] [CrossRef]
  27. Jahanbin, K.; Rahmanian, V. Using Twitter and web news mining to predict COVID-19 outbreak. Asian Pac. J. Trop. Med. 2020, 13, 378. [Google Scholar]
  28. Prabhakar Kaila, D.; Prasad, D.A. Informational flow on Twitter–Corona virus outbreak–topic modelling approach. Int. J. Adv. Res. Eng. Technol. 2020, 11, 128–134. [Google Scholar]
  29. Tisdell, C.A. Economic, social and political issues raised by the COVID-19 pandemic. Econ. Anal. Policy 2020, 68, 17–28. [Google Scholar] [CrossRef]
  30. Li, B.; Peng, Y.; He, H.; Wang, M.; Feng, T. Built environment and early infection of COVID-19 in urban districts: A case study of Huangzhou. Sustain. Cities Soc. 2021, 66, 102685. [Google Scholar] [CrossRef]
  31. Li, S.; Ma, S.; Zhang, J. Association of built environment attributes with the spread of COVID-19 at its initial stage in China. Sustain. Cities Soc. 2021, 67, 102752. [Google Scholar] [CrossRef] [PubMed]
  32. Jenks, M.; Jones, C. Dimensions of the Sustainable City; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  33. Peng, Z.; Wang, R.; Liu, L.; Wu, H. Fine-Scale Dasymetric Population Mapping With Mobile Phone and Building Use Data Based on Grid Voronoi Method. ISPRS Int. J. Geo-Inf. 2020, 9, 344. [Google Scholar] [CrossRef]
  34. Zhang, Y.; Pan, Y.; Zhao, X.; Shi, W.; Chen, Z.; Zhang, S.; Liu, P.; Xiao, J.; Tan, W.; Wang, D. Genomic characterization of SARS-CoV-2 identified in a reemerging COVID-19 outbreak in Beijing’s Xinfadi market in 2020. Biosaf. Health 2020, 2, 202–205. [Google Scholar] [CrossRef] [PubMed]
  35. Li, X.; Wang, Q.; Ding, P.; Cha, Y.e.; Mao, Y.; Ding, C.; Gu, W.; Wang, Y.; Ying, B.; Zhao, X. Risk factors and on-site simulation of environmental transmission of SARS-CoV-2 in the largest wholesale market of Beijing, China. Sci. Total Environ. 2021, 778, 146040. [Google Scholar] [CrossRef]
  36. Zhong, N. Management and prevention of SARS in China. Philos. Trans. R. Soc. Lond. Ser. B Biol. Sci. 2004, 359, 1115–1116. [Google Scholar] [CrossRef]
  37. Bowman, A.W.; Azzalini, A. Applied Smoothing Techniques for Data Analysis: The Kernel Approach with S-Plus Illustrations; OUP Oxford: Oxford, UK, 1997; Volume 18. [Google Scholar]
  38. Silverman, B.W. Density Estimation for Statistics and Data Analysis; Routledge: London, UK, 2018. [Google Scholar]
  39. Porta, S.; Strano, E.; Iacoviello, V.; Messora, R.; Latora, V.; Cardillo, A.; Wang, F.; Scellato, S. Street centrality and densities of retail and services in Bologna, Italy. Environ. Plan. Plan. Des. 2009, 36, 450–465. [Google Scholar] [CrossRef] [Green Version]
  40. Rybnikova, N.; Stevens, R.G.; Gregorio, D.I.; Samociuk, H.; Portnov, B.A. Kernel density analysis reveals a halo pattern of breast cancer incidence in Connecticut. Spat. Spatio-Temp. Epidemiol. 2018, 26, 143–151. [Google Scholar] [CrossRef] [PubMed]
  41. Yang, J.; Zhu, J.; Sun, Y.; Zhao, J. Delimitating urban commercial central districts by combining kernel density estimation and road intersections: A case study in nanjing city, china. ISPRS Int. J. Geo-Inf. 2019, 8, 93. [Google Scholar] [CrossRef] [Green Version]
  42. Prasannakumar, V.; Vijith, H.; Charutha, R.; Geetha, N. Spatio-temporal clustering of road accidents: GIS based analysis and assessment. Proc. Soc. Behav. Sci. 2011, 21, 317–325. [Google Scholar] [CrossRef] [Green Version]
  43. Nicholl, J.; West, J.; Goodacre, S.; Turner, J. The relationship between distance to hospital and patient mortality in emergencies: An observational study. Emerg. Med. J. 2007, 24, 665–668. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Coombes, E.; Jones, A.P.; Hillsdon, M. The relationship of physical activity and overweight to objectively measured green space accessibility and use. Soc. Sci. Med. 2010, 70, 816–822. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  45. Shukla, K.; Kumar, P.; Mann, G.S.; Khare, M. Mapping spatial distribution of particulate matter using Kriging and Inverse Distance Weighting at supersites of megacity Delhi. Sustain. Cities Soc. 2020, 54, 101997. [Google Scholar] [CrossRef]
  46. Jha, D.K.; Sabesan, M.; Das, A.; Vinithkumar, N.; Kirubagaran, R. Evaluation of Interpolation Technique for Air Quality Parameters in Port Blair, India. Univ. J. Environ. Res. Technol. 2011, 1, 301–310. [Google Scholar]
  47. Ma, J.; Ding, Y.; Gan, V.J.; Lin, C.; Wan, Z. Spatiotemporal prediction of PM2.5 concentrations at different time granularities using IDW-BLSTM. IEEE Access 2019, 7, 107897–107907. [Google Scholar] [CrossRef]
  48. Chen, F.-W.; Liu, C.-W. Estimation of the spatial rainfall distribution using inverse distance weighting (IDW) in the middle of Taiwan. Pad. Water Environ. 2012, 10, 209–222. [Google Scholar] [CrossRef]
  49. Yang, W.; Zhao, Y.; Wang, D.; Wu, H.; Lin, A.; He, L. Using principal components analysis and IDW interpolation to determine spatial and temporal changes of surface water quality of Xin’anjiang river in Huangshan, China. Int. J. Environ. Res. Public Health 2020, 17, 2942. [Google Scholar] [CrossRef] [PubMed]
  50. Yan, X.; Su, X. Linear Regression Analysis: Theory and Computing; World Scientific: Singapore, 2009. [Google Scholar]
  51. Seber, G.A.; Lee, A.J. Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2012; Volume 329. [Google Scholar]
  52. Montgomery, D.C.; Peck, E.A.; Vining, G.G. Introduction to Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2021. [Google Scholar]
  53. Hamidi, S.; Sabouri, S.; Ewing, R. Does Density Aggravate the COVID-19 Pandemic? J. Am. Plan. Assoc. 2020, 86, 495–509. [Google Scholar] [CrossRef]
  54. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef] [Green Version]
  55. Epidemiology Working Group for NCIP Epidemic Response, Chinese Center for Disease Control and Prevention. The epidemiological characteristics of an outbreak of 2019 novel coronavirus diseases (COVID-19) in China. Zhonghua Liuxingbingxue Zazhi 2020, 41, 145. [Google Scholar]
  56. Guida, C.; Carpentieri, G. Quality of life in the urban environment and primary health services for the elderly during the COVID-19 pandemic: An application to the city of Milan (Italy). Cities 2021, 110, 103038. [Google Scholar] [CrossRef]
  57. Zhou, F.; Yu, T.; Du, R.; Fan, G.; Liu, Y.; Liu, Z.; Xiang, J.; Wang, Y.; Song, B.; Gu, X. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: A retrospective cohort study. Lancet 2020, 395, 1054–1062. [Google Scholar] [CrossRef]
  58. Geary, R.S.; Wheeler, B.; Lovell, R.; Jepson, R.; Hunter, R.; Rodgers, S. A call to action: Improving urban green spaces to reduce health inequalities exacerbated by COVID-19. Prevent. Med. 2021, 145, 106425. [Google Scholar] [CrossRef] [PubMed]
  59. Bereitschaft, B.; Scheller, D. How Might the COVID-19 Pandemic Affect 21st Century Urban Design, Planning, and Development? Urban Sci. 2020, 4, 56. [Google Scholar] [CrossRef]
  60. Kapoor, A.; Guha, S.; Das, M.K.; Goswami, K.C.; Yadav, R. Digital healthcare: The only solution for better healthcare during COVID-19 pandemic? Ind. Heart J. 2020, 72, 61–64. [Google Scholar] [CrossRef] [PubMed]
  61. Swiggy Plans to Serve 5 Lakh Meals Daily to the Needy Amid Coronavirus Lockdown. Available online: https://www.deccanherald.com/business/business-news/swiggy-plans-to-serve-5-lakh-meals-daily-to-the-needy-amid-coronavirus-lockdown-820599.html (accessed on 1 July 2020).
  62. Awad-Núñez, S.; Julio, R.; Gomez, J.; Moya-Gómez, B.; González, J.S. Post-COVID-19 travel behaviour patterns: Impact on the willingness to pay of users of public transport and shared mobility services in Spain. Eur. Trans. Res. Rev. 2021, 13, 1–18. [Google Scholar] [CrossRef]
  63. Gang, F. Harmonious coexistence of university and city-on the planning and design of clustered open campus. New Arch. 2009, 5, 4–9. [Google Scholar]
  64. Haaland, C.; van Den Bosch, C.K. Challenges and strategies for urban green-space planning in cities undergoing densification: A review. Urban Forest. Urban Green. 2015, 14, 760–771. [Google Scholar] [CrossRef]
  65. Boulton, C.; Dedekorkut-Howes, A.; Byrne, J. Factors shaping urban greenspace provision: A systematic review of the literature. Landsc. Urban Plan. 2018, 178, 82–101. [Google Scholar] [CrossRef]
  66. Gray, S.; Kellas, A. COVID-19 Has Highlighted the Inadequate, and Unequal, Access to High Quality Green Spaces; BMJ Publishing Group: London, UK, 2020; p. 3. [Google Scholar]
  67. Zhu, J.; Xu, C. Sina microblog sentiment in Beijing city parks as measure of demand for urban green space during the COVID-19. Urban Forest. Urban Green. 2021, 58, 126913. [Google Scholar] [CrossRef]
  68. Kondo, M.C.; Fluehr, J.M.; McKeon, T.; Branas, C.C. Urban green space and its impact on human health. Int. J. Environ. Res. Public Health 2018, 15, 445. [Google Scholar] [CrossRef] [Green Version]
  69. Markevych, I.; Schoierer, J.; Hartig, T.; Chudnovsky, A.; Hystad, P.; Dzhambov, A.M.; De Vries, S.; Triguero-Mas, M.; Brauer, M.; Nieuwenhuijsen, M.J. Exploring pathways linking greenspace to health: Theoretical and methodological guidance. Environ. Res. 2017, 158, 301–317. [Google Scholar] [CrossRef]
Figure 1. Map of the study area in Wuhan, China: (a) the geographic location of the Wuhan, China; (b) the main urban area of Wuhan (MUA); and (c) administrative districts of Wuhan [8].
Figure 1. Map of the study area in Wuhan, China: (a) the geographic location of the Wuhan, China; (b) the main urban area of Wuhan (MUA); and (c) administrative districts of Wuhan [8].
Ijerph 19 05208 g001
Figure 2. Density map of Weibo help data.
Figure 2. Density map of Weibo help data.
Ijerph 19 05208 g002
Figure 3. Variable scatter plot. Note: ***, **, * represent significant at the level of 0.001, 0.01, 0.05, respective.
Figure 3. Variable scatter plot. Note: ***, **, * represent significant at the level of 0.001, 0.01, 0.05, respective.
Ijerph 19 05208 g003
Figure 4. Spatial distributions of explanatory variables. (a) The elderly density; (b) Distance to the third ring road; (c) Distance to rivers; (d) Middle school density; (e) Distance to markets; (f) Distance to Third-class hospitals; (g) Bus stop density; (h) Metro station density; (i)University density; (j) Distance to green space; (k) Water density; (l) Air quality index; (m) Housing price; (n) The age of buildings.
Figure 4. Spatial distributions of explanatory variables. (a) The elderly density; (b) Distance to the third ring road; (c) Distance to rivers; (d) Middle school density; (e) Distance to markets; (f) Distance to Third-class hospitals; (g) Bus stop density; (h) Metro station density; (i)University density; (j) Distance to green space; (k) Water density; (l) Air quality index; (m) Housing price; (n) The age of buildings.
Ijerph 19 05208 g004
Figure 5. Statistics chart of standardized coefficients.
Figure 5. Statistics chart of standardized coefficients.
Ijerph 19 05208 g005
Table 1. Case statistics of Wuhan.
Table 1. Case statistics of Wuhan.
DistrictsCumulative Confirmed CaseProportion
Jiang’an (MUA)656312.987%
Jianghan (MUA)524210.373%
Qiaokou (MUA)685413.562%
Hanyang (MUA)46919.282%
Wuchang (MUA)755114.942%
Qingshan (MUA)28045.548%
Hongshan (MUA)47209.340%
Dongxihu26375.218%
Caidian14242.818%
Jiangxia8751.731%
Huangpi21174.189%
Xinzhou10712.119%
East Lake Ecotourism Scenic District4830.956%
East Lake High-Tech Development District21744.302%
Wuhan Economic Technological Development District11082.192%
Other places2230.441%
Total50,537100.00%
As of 16 March 2022 source: Wuhan Municipal Commission of Health.
Table 2. Statistics of onset period.
Table 2. Statistics of onset period.
Date of IllnessNumber of Weibo Help Information
20–30 Novenmber 20193
1–31 Januaru 2020558
1–6 February 2020154
7–10 February 202014
Table 3. The participation rates in the helpline among the COVID infected.
Table 3. The participation rates in the helpline among the COVID infected.
DistrictsInfections in Weibo InformationCumulative Confirmed Casethe Participation Rate
Jianghan (MUA)7551371.46%
Qiaokou (MUA)12867891.89%
Wuchang (MUA)13574311.82%
Jiang’an (MUA)14565212.22%
Hanyang (MUA)11646612.49%
Hongshan (MUA)8746521.87%
Qingshan (MUA)6427732.31%
Table 4. Variable selection.
Table 4. Variable selection.
DimensionVariablesDefinitions
Dependent variableWeibo help case density (Weibo)The average KDE 1 value of COVID-19 infectors in each unit
Population densityPopulation density (Population)The average KDE 1 value of the population in each unit
The elderly population density (Elderly)The average KDE 1 value of the elderly population in each unit
Distance to the third ring road (Ring3)The average ED 2 value from units to the third ring road
Distance to rivers (River)The average ED 2 value from units to rivers
Human mobilityDistance to markets (Market)The average ED 2 value from units to markets
Distance to third-class hospitals (Hospital)The average ED 2 value from units to hospitals
Middle school density (M_school)The average KDE 1 value of middle schools in each unit
University density (University)The average KDE 1 value of universities in each unit
Business density (Business)The average KDE 1 value of business facilities in each unit
Administration density (Administration)The average KDE 1 value of administration facilities in each unit
Bus stop density (Bus)The average KDE 1 value of bus stop in each unit
Metro station density (Metro)The average KDE 1 value of metro station in each unit
Environmental qualityHousing price (Price)The average house price in each unit
Age of buildings (Year)The average age of buildings in each unit
Air quality index (AQI)The average IDW 3 value of air quality index in each unit
Distance to green spaces (Green)The average ED 2 value from units to green spaces
water density (water)The average KDE 1 value values of waters in each unit
1 KDE: The Kernel Density Estimation (see Methods for detailed information); 2 ED: Euclidean Distance; 3 IDW: The Inverse Distance Weighted (see Methods for detailed information).
Table 5. Variable statistics.
Table 5. Variable statistics.
Variable NameNumberMinimumMaximumMeanStd. Deviation
Weibo help density (Weibo)16810.0000 73,959.0440 28,405.5334 17,269.6339
Population density (Population)16812.5756 31,293.7673 8199.1254 5694.6717
The elderly population density (Elderly)16811.9645 2052.2024 653.7139 464.9645
Distance to the third ring road (Ring3)1681–6662.0000 10,621.0000 4499.3998 3162.0162
Distance to rivers (River)168170.0000 16,329.0000 3259.2570 3170.4919
Distance to markets (Market)168111.2599 4087.3986 410.5552 421.3228
Distance to third-class hospitals (Hospital)168123.0000 9432.0000 1913.4747 1535.0498
Middle school density (M_school)16811.0000 59.1887 15.1634 13.7686
University density (University)16811.0000 58.6333 3.6523 5.2843
Business density (Business)16811.0000 54.7805 3.2398 5.1925
Administration density (Administration)16811.0000 58.9231 4.2478 7.3873
Bus stop density (Bus)16811.4563 59.6098 27.6569 13.4843
Metro station density (Metro)16811.0000 57.6180 17.0827 14.5484
Housing price (Price)16817975.0000 40,989.0000 18,447.5544 3879.5026
Age of buildings (Year)16812.6891 28.0737 15.9859 4.5786
Air quality index (AQI)168180.3393 93.3462 86.6823 2.1607
Distance to green spaces (Green)16810.0000 6727.0000 662.9863 633.4966
water density (water)16811.0000 32.4762 9.8986 7.4350
Table 6. Model summary of seventeen urban factors.
Table 6. Model summary of seventeen urban factors.
ModelUnstandardized
Coefficients
Standardized
Coefficients
t-ValueSignificanceCollinearity
Statistics
BetaStd. ErrorBetaToleranceVIF
(Constant)−70,623.88610,753.331 −6.5680.000
Population−0.4750.083−0.157−5.7250.0000.1646.113
Elderly16.6971.280.45013.0410.0000.1039.714
Ring31.5390.1150.28213.3790.0000.2763.625
River−0.7810.088−0.143−8.9140.0000.4732.114
Market−2.1440.607−0.052−3.5310.0000.5581.794
Hospital−0.9230.186−0.082−4.9780.0000.452.223
M_school151.17421.8430.1216.9210.0000.4032.479
University−156.23944.016−0.048−3.550.0000.6741.483
Business−18.32143.511−0.006−0.4210.6740.7151.399
Administration25.64735.6310.0110.720.4720.5271.899
Bus417.36129.1460.32614.320.0000.2364.234
Metro142.75521.2090.1206.7310.0000.3832.610
Price−0.8860.079−0.199−11.2680.0000.3922.552
Year−426.75967.602−0.113−6.3130.0000.3812.626
AQI1132.903119.9150.1429.4480.0000.5431.840
Green1.0320.3690.0382.7960.0050.6671.499
Water−120.48734.619−0.052−3.480.0010.5511.816
R: 0.893R Square: 0.797Adjusted R Square: 0.794
Std. Error of the Estimate: 7831.238353
Table 7. Model summary of fourteen urban factors.
Table 7. Model summary of fourteen urban factors.
ModelUnstandardized
Coefficients
Standardized
Coefficients
t-ValueSignificanceCollinearity
Statistics
BetaStd. ErrorBetaToleranceVIF
(Constant)−80,667.56810,629.470 −7.5890.000
Elderly11.5990.8670.31213.3710.0000.2294.364
Ring31.4570.1140.26712.7500.0000.2863.501
River−0.9450.083−0.174−11.4030.0000.5401.853
Market−1.8680.608−0.046−3.0700.0020.5681.762
Hospital−0.8700.186−0.077−4.6730.0000.4562.194
M_school144.75822.0420.1156.5670.0000.4052.471
University−193.20443.907−0.059−4.4000.0000.6921.444
Bus411.54529.4330.32113.9830.0000.2374.226
Metro144.60921.1620.1226.8330.0000.3932.543
Price−0.7930.069−0.178−11.4710.0000.5191.928
Year−378.36766.530−0.100−5.6870.0000.4022.490
AQI1229.175117.0940.15410.4970.0000.5821.717
Green 0.9420.3730.0352.5270.0120.6681.496
Water−140.88434.832−0.061−4.0450.0000.5561.799
R: 0.890R Square: 0.792Adjusted R Square: 0.790
Std. Error of the Estimate: 7915.456643
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, R.; Liu, L.; Wu, H.; Peng, Z. Correlation Analysis between Urban Elements and COVID-19 Transmission Using Social Media Data. Int. J. Environ. Res. Public Health 2022, 19, 5208. https://doi.org/10.3390/ijerph19095208

AMA Style

Wang R, Liu L, Wu H, Peng Z. Correlation Analysis between Urban Elements and COVID-19 Transmission Using Social Media Data. International Journal of Environmental Research and Public Health. 2022; 19(9):5208. https://doi.org/10.3390/ijerph19095208

Chicago/Turabian Style

Wang, Ru, Lingbo Liu, Hao Wu, and Zhenghong Peng. 2022. "Correlation Analysis between Urban Elements and COVID-19 Transmission Using Social Media Data" International Journal of Environmental Research and Public Health 19, no. 9: 5208. https://doi.org/10.3390/ijerph19095208

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop