1. Introduction
Detection and classification of emotions from social streams are one of the main goals of Sentiment Analysis (SA). Generally, Deep Learning algorithms are executed to detect emotions from massive data [
1]. However, in cases where the size of the available labeled data is small, it is difficult and expensive to construct large training sets; using semi-supervised and unsupervised algorithms that also employ unlabeled data is an effective option [
2].
Semi-supervised learning algorithms [
3] use large unlabeled data in combination with the available labeled data in various learning processes. They can improve classification performances using unlabeled data to extract relevant information necessary for classifying data points [
4].
Unsupervised learning methods are applied for mining data when no annotation is available; various clustering algorithms were used to detect embedded knowledge in massive data streams [
5].
In [
6], a lightweight model based on an extension of the Fuzzy C-means algorithm (FCM) [
7,
8], called Extended Fuzzy C-Means (EFCM) [
9], is applied to social streams for emotion-based classification. Using this procedure, documents are classified into emotional categories by being assigned to the emotional category that corresponds to the fuzzy cluster to which it has the highest membership degree. The authors showed that this approach produces good classification accuracy.
This model is implemented in a Geographical Information System (GIS) in [
10] to classify urban districts based on the orientation of citizens detected from social streams.
The document classification model [
6] represents a significant trade-off between classification accuracy and computational complexity. However, it has two main critical points:
- -
The number of clusters must match the number of emotional categories, and there can be no a priori one-to-one mapping between clusters and emotional categories;
- -
The centers of the initial clusters are set randomly. This produces an average increase in execution times and allows the algorithm to converge toward a local minimum;
- -
the document is assigned to a single emotional category, not considering emotional categories corresponding to clusters to which the document belongs with a non-negligible membership degree.
A new framework for emotion-based classification from social streams, using an entropy-based weighted variant of FCM, named EwFCM, to classify the data gathered from streams, is provided to overcome the cluster center’s initialization problem in [
11]. EwFCM applies a fuzzy entropy measure to optimize the initialization of the cluster centers. Test results applied on Twitter datasets show that this emotion classification framework provides better results than the framework proposed in [
10] in terms of classification accuracy and execution time. However, this approach does not solve the problems connected to the constraint of fixing the number of clusters equal to the number of emotional categories and to the choice of neglecting the clusters to which the document belongs with a secondary but not negligible membership degree.
To overcome these drawbacks, a new fuzzy emotion-based classification method, called FREDoC (Fuzzy Relevance Emotions Document Classification), is proposed in [
12] in which, instead of executing a fuzzy clustering approach to classify a document by assigning it to the prevailing emotional category, an emotion relevance index is employed to determine if an emotional category in a text is relevant, and it is based on the well-known TF-IDF metric. Fuzzy linguistic labels are used to explain the emotional relevance, defined by constructing a specific fuzzy partition; the documents are classified considering those emotional categories whose relevance in the document is not negligible.
In [
13], the FREDoC emotion-based multiclassification method is implemented in a GIS framework to classify urban areas by the more relevant critical issues highlighted by citizens.
One of the main problems faced today in spatial analysis is the detection of urban or territorial areas where a specific phenomenon refers to hot spot analysis.
Formally, areas of the study area that have a high/low density of occurrences caused by the phenomenon under consideration are classified as hot/cold patches.
Statistical indices, such as Getis-Ord Gi* statistic [
14] and Local Index of Spatial Autocorrelation (LISA) [
15], are used to detect hot and cold spots; Gi* evaluates the spatial association of a spatial entity with surrounded ones within a specified distance of a single point. LISA statistics detect spatial autocorrelation between neighboring features, measuring the degree of similarity of data points close to a given data point within a specified threshold distance. LISA is used in [
16] for drought hot spot detection in northern Bangladesh.
Kernel Density Estimation (KDE) [
17,
18] is another well-known hot and cold spot detection technique; by identifying hot/cold spots as zones with high/low-intensity peaks relative to the surrounding areas, it provides a continuous surface representing the geographical distribution of the intensity of the phenomena.
These methods, while distinguishing hot and cold spots from areas in which the intensity of the phenomenon appears normal, are not able to accurately detect the location and extent of hot and cold spots.
To overcome this drawback, some authors propose partitive cluster algorithms to detect the extension and the shape of hot and cold spots. K-means [
19] is implemented in GIS environments to detect hot and cold spots in crime [
20,
21,
22] and fire analysis [
23,
24].
Recently, K-medoids [
25] was applied to detect hot spots in crime [
26] and disease analysis [
27], and FCM [
8,
28] was used to detect hot spots in crime analysis [
29,
30] and road traffic crashes [
31].
One of the main defects of hot spot detection methods based on partitive clustering algorithms is the impossibility of detecting the shape of hot and cold spots. In order to accurately detect the shape of the hot spots in [
32], a method of hot spot detection was adopted which makes use of the density-based Fast DBSCAN clustering algorithm [
33]; in [
34], a hybrid clustering combining a self-organizing map and a hierarchical clustering method is applied in a GIS platform to detect service levels of urban streets from GPS flow speeds data points.
Recently EFCM was implemented in a GIS framework in [
35] to detect hot spots in fire analysis and in [
36] to detect the spatial distribution and the evolution of hot spots in disease analysis. The shape of a hotspot is approximated by a circle on the map. This approach represents a trade-off between the computational speed of hot and cold spot detection methods based on partitive clustering methods and the accuracy of approaches using density-based clustering algorithms in identifying the form of hot and cold areas.
These hot and cold spot detection methods are performed on data points located on the map detecting those areas with a high and low density and intensity of data points. In this paper, we propose a different hot and cold spot detection approach in which areas detected as hot/cold spots are those in which a feeling of unease and discouragement expressed by citizens on social media prevails (it is not very present) and persists over time.
Social data can provide an extensive source of information useful for hot and cold spot analysis, both because they refer to the perceptions and moods of the citizens involved in the phenomenon analyzed and because they represent a massive data source that allows you to analyze how a phenomenon spreads and how it evolves. In addition, in many spatial analysis problems, it is useful to aggregate the information by sub-zone in which the study area is partitioned in order to classify these sub-zones on the basis of the impact of the phenomenon analyzed. The GIS-based framework in [
10] responds to this need by classifying these sub-zones on the basis of the perception and moods of citizens detected in posts inserted in social networks.
In many problems, the user is not interested in detecting the exact shape of hot/cold spots on the map but intends to evaluate if a sub-zone of the study area can be classified as a hot or cold spot, that is, if the phenomenon is particularly present or rare on the subzone for a defined period of time. This goal was pursued in [
35], which investigates whether urban agglomerations the City of London is divided into are more common and which sorts of catastrophic catastrophes endure over time. The idea on which this research is based consists of constructing a GIS-based framework for detecting hot and cold spots based on the information extracted from social streams, with the aim of determining which subzones of the study area are classified as hot/cold spots.
Unlike the method proposed in [
35], we do not use fuzzy-clustering-based hot spot detection methods, but we apply an emotion-based classification model to classify subzones of the study area based on the relevance of pleasant and unpleasant emotions of citizens determined by starting from social data. This approach allows for taking into account the hidden knowledge extracted from social streams that allow the evaluation of the persistence of the phenomenon in a sub-zone based on the moods of citizens.
Following [
6,
11,
12], we partition the emotions into pleasant and unpleasant, identifying as hot/cold spots a sub-zone classified as unpleasant/pleasant for a duration greater than a predetermined period.
We call our framework FESC (Fuzzy Emotion-based hot and cold Spots Classification).
The study area is partitioned into subzones, and the user defines the atomic time interval, called time frame, to be considered for the analysis of the evolution of the intensity of a phenomenon. A dictionary of emotion categories is created, divided into pleasant and unpleasant. The social data are extracted and grouped for subzone and time frame; after a text parting phase, they constitute the documents of a corpus. The dictionary of emotion categories is structured as in [
6], in which each emotion category is assigned a set of terms. The Term Frequency measures the relevance of a term in a document—The Term Frequency—Inverse Document Frequency index (TF-IDF) [
37], a well-known metric measuring the importance/relevance of a word in a text, is used to determine the relevance of a term in a document. Following [
6,
11,
12], the TF-IDF values measured for all terms assigned to an emotion category are summed and normalized to form the relevance of this emotion category in the document; then, the TDF-IDF values assigned to pleasant (unpleasant) categories are summed, obtaining a normalized index called pleasant (unpleasant) emotion relevance (for short Pleasant/Unpleasant ER) representing the relevance of pleasant (unpleasant) emotions in the document, where the sum of Pleasant and Unpleasant ER measures in a document is equal to 1.
To evaluate the intensity of the Pleasant/Unpleasant ER in a document, we define a fuzzy partition, built by starting from the emotional Pleasant/Unpleasant ER measure, described by fuzzy linguistic labels.
Each document is classified according to the intensity of the relevance of both pleasant and unpleasant emotions. The linguistic label value of the fuzzy set of the ER fuzzy partition, to which it has the highest membership degree, is used to classify it.
For instance, consider the Low, Medium-low, Medium, Medium-high, and High fuzzy sets as the Pleasant/Unpleasant ER fuzzy partition in
Figure 1. Let the pleasant ER measure of a document be equal to 0.59 and the unpleasant ER measure is given by 1 − 0.59 = 0.41.
Then, the document will be classified into the pleasant class Medium-high and the unpleasant class Medium.
If all the documents corresponding to a subzone’s time frames are classified with pleasant ER linguistic labels over a prefixed threshold, this subzone is labeled as a hot spot. Likewise, if those documents are classified with unpleasant ER linguistic labels over that threshold, the subzone is labeled as a cold spot.
Finally, a map of the hot and cold spots of the study area is built.
Figure 2 shows a schematized architecture of FESC.
Inputs for the framework are the study area divided into subzones, the data stream with the posts entered by citizens in the analyzed period, the Dictionary of the Emotional Word containing the dictionary of terms relating to the single emotional categories, and the ER fuzzy partition.
The output of the process is the hot and cold spots map which highlights the subzones classified as hot and cold spots, corresponding to the zones with unpleasant/pleasant emotions relevance classified above the threshold for all time frames.
A detailed description of the components of the framework is shown in
Section 3.
The main benefits of our method can be summarized as follows:
- -
Unlike other hot and cold spot detection methods, which take into consideration only localized events, FESC recognizes hot and cold spots by analyzing the moods of citizens extracted from social networks; this makes it possible to use hidden knowledge to evaluate, on the basis of citizens’ emotions, which subzones of the study area are more critical and which, on the other hand, are less affected by the phenomenon being investigated;
- -
FESC is computationally faster than cluster-based hot and cold spot detection methods; in fact, FESC does not need to detect the exact geometric shape of a spot but evaluates the prevailing emotional states of citizens to determine which subzones can be classified as hot or cold spots; this mode significantly reduces the CPU times of the algorithm;
- -
to overcome the critical issues of the cluster-based approaches adopted for the classification of documents based on the prevailing emotional category, which needs to fix the number of clusters equal to the number of emotional categories and to establish a one-to-one correspondence between clusters and emotional categories, FESC adopts an approach based on the construction of a fuzzy partition of the relevance of pleasant and unpleasant emotional categories.
The remaining portions of the essay are structured as follows: The introduction to formal concepts and ideas on hot (or cold) spots are provided in
Section 2, along with a synthetic description of the emotion-based document categorization approach [
6]. The logical overview described in
Section 3 serves as an introduction to the suggested framework. The findings of the tests and case studies that demonstrate the viability of the suggested strategy are presented in
Section 4. We discuss the paper’s closing findings in
Section 5.
3. The FESC Framework
In [
10], the social messages emotion classification method [
6] shown in
Figure 3 is implemented in a GIS-based framework in order to classify the districts of the municipality of Bologna (Italy). The study area is given by the municipality, and its six districts are the subzones in which the study area is spatially divided. Social messages are extracted in order to detect the moods of citizens connected to the livability of a district. Each document is related to Twitter social messages posted by citizens and tourists whose hashtags contain the name of a district. Each district is categorized using EFCM according to the dominant emotion category. The districts are then organized thematically depending on the dominant emotion type.
FESC extends this framework to study the temporal evolution of the intensity of a phenomenon in a subzone and detect if a subzone constitutes a hot or cold spot.
Figure 4 shows the logical view of our framework.
The social data extracted contain all the messages related to the analyzed phenomenon posted by users located in one of the subzones in which the study area is partitioned and in a specific period. This period is broken down into T time frames having equal duration.
Let the study area be partitioned into n atomic subzones s1, s2,…, sn, and the analyzed period be partitioned in T time frames. After cleaning up the text from irrelevant terms and executing the Part of Speech tagging, the Text Parsing component creates N documents aggregating messages related to the same subzone and the same time frame, where N = n∙T. The Term Filtering component annotates terms belonging to pleasant and unpleasant emotional categories in the Dictionary of EC words. The ERM Construction component computes the IF-TDF value of each category in the document and aggregates the values assigned to each pleasant/unpleasant category.
Let {} be the set of Mp pleasant emotional categories and {} be the set of Mu unpleasant emotional categories, where Mp + Mu = M.
If
j = 1,…,M i = 1,…, N is the relevance of the jth category in the ith document measured by (5), the relevance of the pleasant and unpleasant emotions in the document are given by:
where
.
The output of this process is a two-columns matrix called the Pleasant/Unpleasant ER matrix in which are assigned, for each document, the relevance of pleasant and unpleasant emotional categories given, respectively, by Equations (6) and (7).
The Document Classification component fuzzifies pleasant and unpleasant category relevancies using a user-built fuzzy pleasant/unpleasant category relevance partition called ECR fuzzy partition. The ith document is classified with pleasant/unpleasant emotion relevance equal to the label of the fuzzy set to which Rpi (Rui) belongs with the highest level of membership. The hot and cold spot thematic map creation component analyzes all the documents associated with a subzone, where each document refers to the emotional manifestations and discomforts expressed in social posts residing in the subzone in a given time frame. If all the documents are classified with the relevance of unpleasant emotions above a specified threshold, then the subzone is labeled as a hot spot; likewise, if all documents are classified with the relevance of pleasant emotions above a specified threshold Ȓ, the subzone is labeled as a cold spot.
As an example, we now consider the fuzzy partition in
Figure 1. Suppose we set Ȓ = Medium high as the threshold value for the relevance.
Let dh1, dh2,…, dhT be the T documents related to the hth subzone, where dht is the document referring to the posts inserted in the tth time frame. This subzone will be labeled as a hot spot if all its documents are classified with the relevance of unpleasant emotions greater or equal to Medium high. Likewise, it will be labeled as a cold spot if all its documents are classified with the relevance of pleasant emotions greater or equal to Medium-High.
The FESC algorithm, structured in pseudocode, is shown below.
In the preprocessing phase, the partitioning of the study area in subzones is performed; the analyzed period is partitioned in T time frames, and the pleasant and unpleasant ER fuzzy partition is created. After classifying each document, the number of related documents classified with pleasant/unpleasant emotion relevance greater than or equal to the threshold value Ȓ is calculated for each subzone. If this number is equal to the number of time frames T, then the subzone is labeled as a cold or hot spot. Finally, the hot and cold spots thematic map is created.
In Algorithm 1 the FESC method is schematized in pseudocode.
Algorithm 1: Fuzzy Emotion-based hot and cold Spots Classification (FESC). |
Partition the study area in n subzones; Partition the analyzed period in T time frames having an equal time width; Create the pleasant and unpleasant ER fuzzy partition; Create the Dictionary of EC words; Set the relevance threshold Ȓ; Extract the social messages; Clean the irrelevant texts in the posts; Create the Corpus of Documents aggregating the posts by subzone and time frames; For each document: Annotate terms belonging to pleasant and unpleasant emotional categories; For each emotional category: Compute the IF-TDF value by (4); Calculate the emotional relevance by (5); Next: Aggregate the emotional relevance of pleasant emotional categories by (6); Aggregate the emotional relevance of unpleasant emotional categories by (7); Classify the document assigning the pleasant and unpleasant emotion relevances; Next: For each subzone: ncold: = 0 // number of documents with pleasant emotion relevance ≥ Ȓ; nhot: = 0 // number of documents with unpleasant emotion relevance ≥ Ȓ; For each time frame: Rp: = pleasant emotion relevance of the corresponding document; Ru: = unpleasant emotion relevance of the corresponding document; If Rp = ≥ Ȓ Then: Ncold: = ncold + 1;
End if: If Ru = ≥ Ȓ Then: nhot: = nhot + 1;
End if:
Next: If ncold = T, Then: Label the subzone as a cold spot; Else: If ncold = T, Then: Label the subzone as a hot spot; End if: End if: Next: Create the hot and cold spots thematic map.
|
4. Test and Results
FESC was tested on a study area given by the 18 municipalities of the northeastern area of the province of Naples (Italy). The study area is a peripheral urban area characterized by the presence of a high population and building density. The phenomenon that has been analyzed concerns the discomfort of citizens caused by heat waves that occurred in the study area in the summer seasons between 2020 and 2022. The choice of this study area is dictated by the fact that it is a densely populated suburban area with recent popular building development, which can aggravate the discomfort of citizens during the presence of heat wave phenomena.
The map in
Figure 5 shows the study area.
To carry out the tests, tweets posted by users residing in the municipalities of the study area were extracted during the summer seasons of the time period analyzed. Each municipality constitutes a subzone, and the time period has been divided into three-time frames corresponding to the years 2020, 2021, and 2022.
The posts were extracted by selecting tweets with hashtags and keywords given at least one of the following words: heat, sultriness, heatwave, warm, scorching heat, and torrid heat.
The Python TWitter INTelligence library (TWINT) was used to extract the selected Twitter posts.
The FESC framework was implemented on a GIS platform built on the ESRI ArcGIS pro suite.
The number of tweets is about 622,100; on average, 2900 monthly tweets for each municipality. The Python Geocoder library was used to match the tweets to the corresponding municipality. The tests were executed using an Intel Core i7 Pentium having a 2.90 GHz clock frequency. The CPU time to execute the FESC algorithm on this dataset and to detect the final hot and cold spots was about 10.5 min.
After deleting noisy posts, posts referring to the same municipality and time frame have been merged into one document.
To construct the Dictionary of EC words was considered the set of 16 emotional categories, divided into eight pleasant and eight unpleasant, used in [
12]; these emotional categories are given by the basic and secondary emotional categories in Plutchik’s wheel of emotions [
39], listed in
Table 1.
By assigning to each emotional category, all the phrases that were semantically related to it but had been reduced to their stemmed form, the dictionary of emotional words was generated in accordance with [
12].
Following the Corpus Documents’ processing, as outlined in the Term Filtering phase, terms were associated with the dictionary category when they were discovered there.
For each document, fuzzified pleasant and unpleasant relevance indices where a document corresponds to a specific province and time frame were calculated.
Figure 6 shows the thematic maps of the pleasant (
Figure 6a) and unpleasant (
Figure 6b) emotion categories’ relevance measured in 2020.
The pleasant emotion relevance thematic map shows that in 2020 the municipalities of Brusciano, Castello di Cisterna, and Frattaminore were those in which pleasant emotional feelings prevailed, with the relevance of the pleasant emotion being Medium-high.
The unpleasant emotion relevance thematic map highlights that in 2020 the municipalities in which unpleasant emotions prevail are Caivano, Afragola, and Casoria, with High relevance, and Casandrino, Melito di Napoli and Casavatore, with Medium high relevance.
Table 2 shows in detail the membership degrees of each municipality to all pleasant emotion relevance fuzzy sets of the ECR fuzzy partition measured in 2020. The last column shows the relevance assigned to each municipality, given by the label of the fuzzy set with the greatest degree of membership.
In the municipalities of Brusciano, Castello di Cisterna, and Frattaminore, the pleasant relevance class with the highest membership degree (0.7, 0.7, and 0.5, respectively, highlighted in black) is Medium high. All other municipalities belong to the pleasant relevance classes Medium or Medium low.
The detailed results for the unpleasant emotion relevance are shown in
Table 3, where the membership degrees to each fuzzy set are shown for each municipality.
The municipalities of Afragola, Caivano, and Casoria belong to the unpleasant relevance class High, with membership degrees of 1.0, 1.0, and 0.9, respectively. The municipalities of Casandrino, Casavatore, and Melito di Napoli belong to the unpleasant relevance class Medium high with membership degrees of 0.8, 0.6, and 0.6, respectively. All other municipalities belong to the unpleasant relevance classes Medium or Medium low.
The results in
Table 2 and
Table 3, showing the pleasant and unpleasant emotion relevance assigned in 2020 to each municipality, highlight:
- -
The presence of an urban area of greater comfort felt by the citizen, which includes the neighboring municipalities of Brusciano and Castello di Cisterna:
- -
A larger urban area that includes municipalities in the central strip where a greater discomfort felt by the citizens prevails;
- -
Another urban area in the southwest of the study area covers the municipalities of Casandrino, Grumo Nevano, and Melito di Napoli.
Figure 7 shows the thematic maps of the pleasant (
Figure 7a) and unpleasant (
Figure 7b) emotion categories’ relevance measured in 2021.
The pleasant emotion relevance thematic map shows that in 2021 the municipalities of Brusciano and Castello di Cisterna are those in which pleasant emotional feelings prevailed, with the relevance of the pleasant emotion being Medium-high.
The unpleasant emotion relevance thematic map shows that in 2021 the municipalities in which unpleasant emotions prevail are Caivano, Afragola, Casoria, and Casandrino, with High relevance, and Sant’Antimo, Melito di Napoli, Cardito, Frattamaggiore, Casavatore, and Pomigliano d’Arco with Medium high relevance.
The detailed results for the pleasant and unpleasant emotion relevance are shown, respectively, in
Table 4 and
Table 5, where, for each municipality, are shown the membership degrees to each fuzzy set.
Table 4 shows that the municipalities Brusciano and Castello di Cisterna belong to the pleasant relevance class Medium high with membership degrees of 0.5 and 0.9, respectively. All other municipalities belong to the pleasant relevance classes Medium or Medium low.
Table 5 shows that the municipalities Afragola, Caivano, Casandrino, and Casoria belong to the unpleasant class High with membership degrees of 1.0, 1.0, 0.5, and 1.0, respectively. The municipalities Cardito, Casavatore, Frattamaggiore, Melito di Napoli, Pomigliano d’Arco, and Sant’Antimo, belong to the unpleasant class Medium high with membership degrees of 0.5, 0.9, 0.6, 0.8, 0.7, and 0.5, respectively. All other municipalities belong to the pleasant relevance classes Medium or Medium low.
Analyzing the results in
Table 4 and
Table 5, showing the pleasant and unpleasant emotion relevance assigned in the year 2021 to each municipality, detected the presence of the areas of comfort and discomfort felt by citizens observed in 2020.
Figure 8 shows the thematic maps of the pleasant (
Figure 8a) and unpleasant (
Figure 8b) emotion categories’ relevance measured in 2022.
The pleasant emotion relevance thematic map shows that in 2022 the municipalities of Brusciano and Castello di Cisterna are those in which pleasant emotional feelings prevailed, with the relevance of the pleasant emotion being Medium-high.
The unpleasant emotion relevance thematic map shows that in 2022 the municipalities in which unpleasant emotions prevail are Caivano, Afragola, Casoria, and Casandrino, with High relevance, and Acerra, Cardito, Casavatore, Frattamaggiore, Grumo Nevano, Melito di Napoli, and Pomigliano d’Arco with Medium high relevance.
The detailed results for the pleasant and unpleasant emotion relevance are shown, respectively, in
Table 6 and
Table 7, where, for each municipality, the membership degrees to each fuzzy set are shown.
Table 6 shows that the municipalities Brusciano and Castello di Cisterna belong to the pleasant relevance class Medium high with membership degrees of 0.6 and 1.0, respectively. All other municipalities are assigned a pleasant relevance class Low, Medium low, or Medium.
Table 7 shows that the municipalities Afragola, Caivano, Casandrino, and Casoria belong to the unpleasant relevance class High with membership degrees of 1.0, 1.0, 0.8, and 1.0, respectively. The municipalities Melito di Napoli, Grumo Nevano, Cardito, Frattamaggiore, Casavatore, Acerra, and Pomigliano d’Arco belong to the unpleasant relevance class Medium high with membership degrees of 0.5, 0.8, 0.8, 0.5, 0.6, 0.9, and 1.0, respectively. The unpleasant relevance class Medium is assigned to all other municipalities.
Also, in 2022, the two areas of high comfort and high discomfort felt by citizens are the areas including the municipalities of Brusciano and Castello di Cisterna, the central area, and the southwest area covering the municipalities of Casandrino, Grumo Nevano, and Melito di Napoli.
To detect hot and cold spots, we annotate as a cold or hot spot a municipality classified with pleasant or unpleasant relevance classes of Medium high, High, or Very high in all three consecutive years 2020, 2021, and 2022.
The thematic map in
Figure 9 shows the hot and cold spots detected in the study area.
The northeastern area of the province of Naples is covered by two hot spots. The largest is located along the central area of the study area and covers the extension of the municipalities Afragola, Caivano, Casavatore, and Casoria; the other is located to the southwest and covers the extension of the municipalities Casandrino, Grumo Nevano, and Melito di Napoli. The study area is also covered by a cold spot located to the southwest and including the municipalities of Brusciano and Castello di Cisterna.
To verify whether the presence of hot/cold spots is related to a high/low housing/building density, the values of housing densities were calculated by the number of inhabitants per square kilometer in the areas covered by hot and cold spots.
Table 8 shows the extension in km
2, the population, and the mean population density in the hot and cold spots. The mean population density is given by the average of the population densities measured in the subzone covered by the hot/cold spot.
The data in
Table 8 demonstrates that the mean population density in the two hot spots is higher than the mean population density in the research region. This is because the mean population density in all subzones of the study area is 5008.54 residents per km
2. In contrast, the cold spot’s mean population density is lower than the study area’s mean population density. This supports the hypothesis that the discomfort felt by the resident population in the summer months is felt more in areas with a high housing/building density. Conversely, this discomfort is less felt in areas with lower housing/building density.
These results highlight the usefulness of our approach for the decision maker in identifying the most critical areas and those less exposed to the phenomenon and in evaluating any correlations with the urban characteristics of these areas.