Crowdsourcing Urban Air Temperature Data for Estimating Urban Heat Island and Building Heating/Cooling Load in London

Urban heat island (UHI) effects significantly impact building energy. Traditional UHI investigation methods are often incapable of providing the high spatial density of observations required to distinguish small-scale temperature differences in the UHI. Crowdsourcing offers a solution. Building cooling/heating load in 2018 has been estimated in London, UK, using crowdsourced data from over 1300 Netatmo personal weather stations. The local climate zone (LCZ) scheme was used to classify the different urban environments of London (UK). Inter-LCZ temperature differences are found to be generally consistent with LCZ temperature definitions. Analysis of cooling degree hours in July shows LCZ 2 (the densest urban LCZ in London) had the highest cooling demand, with a total of 1550 cooling degree hours. The suburban related LCZs 5 and 6 and rural LCZs B and D all had about 80% of the demand of LCZ 2. In December, the rural LCZs A, B and D had the greatest heating demand, with all recording around 5750 heating degree hours. Urban LCZs 2, 5 and 6 had 91%, 86% and 95% of the heating demand of LCZ D, respectively. This study has highlighted both advantages and issues with using crowdsourced data for urban climate and building energy research.


Introduction
Energy resources in a building usually refer to electricity, natural gas and district heating supply [1]. It accounts for about 40% of the global energy consumption and contribute over 30% of the CO 2 emissions [2]. The building energy consumption over the world is increasing due to climate change, urbanization and other factors. The tonnes oil equivalent has tripled from 1970 to 2010. Meanwhile, fossil fuels (oil, gas and coal) account for 90% of global primary energy production [3], which is one of major contributors of carbon emissions. The steadily increasing amount of energy consumption and carbon emissions has raised great challenges for sustainable development such as greenhouse gas emissions, air pollution, and reliable energy supply.
It has been widely acknowledged that urban heat islands (UHIs) have a significant impact on building energy consumption. The warmer urban temperatures associated with UHIs could increase building energy required for cooling and decrease energy required for heating. For example, Santamouris et al. [4] noted that in Athens, where the mean UHI exceeds 10 • C, the UHI is capable of doubling the urban building cooling load and can triple the peak electricity load for cooling purposes. Conversely, during the winter, urban buildings' heating load can be reduced by up to 30%. Kolokotroni et al. [5] showed that an office located in a rural area had only 84% of the cooling demand of one in an urban area. In the UK heating (cooling) demand increases (decreases) with distance from London city centre, although this trend is not uniform with direction [6]. Kolokotroni et al. [6] discovered that, in addition to distance from the city centre, energy demand is determine building energy usage in each LCZ. This will advance knowledge of the impact of urban form on these factors, which can be applied to city planning. Moreover, it also aims to highlight some of the issues related to using crowdsourced data. Although the UHI has been studied for almost two centuries, both crowdsourcing meteorological data and the local climate zone scheme are relatively new techniques. They are both highly promising concepts and so must be applied and tested as much as possible to improve the methods of their usage and unlock their potential.
The rest of this paper is organized as follows: Section 2 briefly introduces the study data, including crowdsourced data, weather data, Local climate zone (LCZ) data and heating/cooling load, and presents the details of quality check of the crowdsourced data. Section 3 presents the main results. The discussion part is presented in Section 4. We conclude and discuss the future work in Section 5.

Study Area and Time Period
The city studied was Greater London, UK, which has a population of 8.25 million [25], making it one of the largest cities in Europe. London is located in the mid-latitudes and has a Kppen-Geiger climate classification of Cfb: warm temperature, fully humid with warm summers [26]. Data was collected for the approximate region bound by the M25 orbital motorway. This was considered the area within in a box with coordinates of 51.245 • latitude, −0.558 • longitude at its southwest corner, and 51.721 • latitude, 0.337 • longitude at its northeast corner. Therefore, data collected includes both Greater London and its surroundings.
Data was collected from the 5 of July 2018 until 31 of December 2018. This is inclusive of several weeks of a heatwave in the UK that brought above-average temperatures for much of the summer. These anomalously high temperatures persisted up from 22 of June until the 8 of August [27]. Over this period temperatures exceeded 30 • C in some places in the UK on numerous days [27]. These kinds of heatwave events could significantly increase building cooling demand [28]. Temperatures for the remaining studied months remained close to the mean temperature from 1981 to 2010, although November and December were milder than average, with positive temperature anomalies of 1.1 • C and 1.9 • C, respectively [27].

Crowdsourced Data, Data Acquisition and Quality Check
The crowdsourced data was obtained from 'Netatmo smart home weather stations' (Netatmo, Boulogne-Billancourt, France). These consist of two modules i.e., the indoor and outdoor ones. The outdoor module measures several variables including temperature, relative humidity, and pressure, with an anemometer and rain gauge also available as optional extras. The outdoor module transmits to the indoor module, which is connected to the internet whereby the users can view the data recorded by their station on a mobile app. Furthermore, the users can choose to publicly display the data on the 'Netatmo weather map' (https://weathermap.netatmo.com/, accessed on 2 January 2019). Netatmo stations are used extensively worldwide, especially in European cities. For example, data from a total of 1303 stations were obtained in this study. This high density of observations is important for the progression of the investigation of small-scale variations within UHI's and therefore its impact on building energy. Furthermore, the accuracy of the temperature sensor of Netatmo stations has been tested and verified by Meier et al. [24]. Meier et al. [24] placed Netatmo stations in a climate chamber and compared temperature measurements to that of a reference sensor. This revealed that the Netatmo sensor was within the manufacturer's accuracy specification of 0.3 • C. However, it should also be noted that a Netatmo station placed in a Stevenson screen next to a professionally managed station showed some considerable differences. Differences were greatest during the morning time when the Netatmo measured lower temperatures, likely due to its slower thermal response and differences in ventilation between the two apparatus [24]. Meier et al. [24] Energies 2021, 14, 5208 4 of 25 also noted that Netatmo had the additional advantage over several other crowdsourcing methods in that all stations are the same. Therefore, sensor errors should be consistent for all stations. Conversely, this is not true for other crowdsourcing methods such as Weather Underground [29], which use multiple station types.
Netatmo does not store any station data but data can be acquired in real-time through the Netatmo 'Application Programming Interface' (API). The Netatmo API provides numerous methods of data collection, and for this study, the 'get public data' method was used. This method returns instantaneous data from all stations within a specified area. As there is a limit on the number of stations that can be returned from a specified area, the study area was divided into 25 tiles, with a request made to the API for each tile on an hourly basis. The station ID, latitude, longitude, temperature, and relative humidity were obtained. Any stations that changed location throughout the study were considered a new station and assigned a new ID code. As previously mentioned in Section 2.1, the quality of each Netatmo measurement cannot be known, largely due to the lack of meta-data provided for each station. Misuse of stations can easily lead to errors in meteorological measurements. For example, radiation errors due to solar heating can arise from stations being placed in areas that are exposed to direct sunlight. Another potential misuse is that users can set up their outdoor modules indoors. Furthermore, failure of the Netatmo server, or the collecting server, can lead to chunks of missing data from all stations [24]. Hence, to ensure the analysed data has a good accuracy, a quality check (QC) must be implemented. For this study, the QC method devised by Napoly et al. [23] was used. This statistically-based method is divided into seven steps (briefly outlined in Table 1), which are designed to eliminate all the common issues associated with using CWS data.
In October, less than 80% of daily values are available, due to collection errors in this month. The option that to correct temperature readings by height was not carried out. This was because the correction presents an additional source of uncertainty as it corrects by the standard atmospheric lapse rate of 0.0065 K m −1 , which is not valid at all times. In addition, the topography of the studied area is relatively uniform, so the inclusion of this correction would not have led to dramatic changes in temperature.

Reference Weather Data
Reference weather data was acquired from UK Met Office operated weather stations, via the Met Office Integrated Data Archive Systems (MIDAS) dataset [30] in order to conduct verification of the Netatmo data. Seven stations are located in the study area (indicated in Figure 2). UK hourly weather observation data [31] was downloaded for these stations. The name, location, and LCZ classification (as shown in Figure 1) of these sites can be found in Table 2. All the stations take measurements automatically from instrumentation contained within a Stevenson screen, elevated 1.25 m off the ground [32]. The inter-LCZ temperature trends observed by crowdsourced data will be validated by the reference data obtained from professionally run stations.

Local Climate Zone (LCZ) Data
The local climate zone (LCZ) scheme categorises landscapes into zones of uniform surface cover, structure, material, and human activity [34]. The goal of the LCZ scheme is to improve the accuracy and consistency of urban climate reporting and aid in integrating urban climate knowledge with city planning. There are 17 LCZs (seen in Figure 1), with different zones having different thermal characteristics. Zones with very different characteristics can have temperature differences exceeding 5 • C [34].
Local climate zone data was obtained from the World Urban Database and Access Portal Tools (WUDAPT) [35]. Local city experts, with the aid of satellite imagery, generate such LCZ maps using a method that was first hypothesized by Bechtel and Daneke [36] and refined by Bechtel et al. [37]. The London LCZ map was downloaded from the WUDAPT website (http://www.wudapt.org/, accessed on 29 January 2019). The map is in the form of a GeoTiff file, containing a raster map, with 100 m x 100 m pixels, of LCZs. Figure 2 shows the LCZ map of London with the locations of every station collected throughout the study. Using tools in QGIS, each station was assigned an LCZ value based on the raster value of the pixel that it is within. The majority of Netatmo stations are classified as being in LCZs 5 (open mid-rise) and 6 (open low-rise) (as shown in Figure 3), which are associated with suburban areas. This is to be expected as most Netatmo owners will have disposable income and enough yard space for a CWS, both of which are typical for suburban areas. There are also stations present in the urban LCZs 2 (compact midrise), 8 (large low-rise), and 10 (heavy industry) and the rural LCZs A (dense trees), B (scattered trees), and D (low plants). There are some stations present in LCZ G (water), which may be installed in boats. As these are not our main interests, results from this LCZ are not mentioned further. Discussion of the possible reasons for and implications of this are considered further in Section 4. After this, stations that had heterogeneous land cover surrounding them were eliminated from the study, as ensuring temperature measurements were not influenced by different surrounding LCZs. This was achieved by using a similar approach to that used by Fenner et al. [22]. Any stations that 80% or more of the pixels within a 250 m radius were a different LCZ to the LCZ that the station was in were disregarded. This was also carried out for reference stations, but with a cut-off of 50% as opposed to 80%. The reason for this lower cut-off is that almost all the reference stations had high LCZ variability surrounding them, so the cut-off was reduced so that there were still several usable reference stations. An additional advantage of carrying this out is that it reduces the chance of a station being classified into the wrong LCZ. In theory, a station can be located within a pixel that contains two different land surfaces. In this case, the classification of the pixel may not cohere to the stations' underlying surface. Also, if the location of the station is marginally incorrect, or the precision of the co-ordinates is too low, it could be classified into the wrong LCZ. Eliminating stations with highly variable surroundings should mitigate this. This process reduced the number of available Netamo stations by 320, from the original 1303. Therefore, approximately a quarter of stations were eliminated even before the QC was carried out. Furthermore, London Heathrow, Kenley Airfield, and London St James's Park reference stations were eliminated.

Indicators for Determining Inter-LCZ Temperature Difference and Building Heating/Cooling Load
Data with the highest quality (Flag month if >80% of daily values available) after quality check [23] was analysed. For each station, the mean station temperature (T mean ), mean daily minimum temperature (T min ), and mean daily maximum temperature (T max ) was calculated for each month. The mean daily diurnal temperature range (DTR) was also evaluated by calculating the maximum temperature minus the minimum temperature for each day and finding the average over the month. The mean value of every station in each LCZ was then obtained. The same process was executed for the reference stations. Box plots for these variables were also computed for each LCZ in which ten or more stations were present. This allows for investigation into the thermal properties of different urban surfaces.
Stewart and Oke [34] suggest that, under the LCZ scheme, UHI magnitude could be defined as an LCZ temperature difference. Ideally, this would be defined as the temperature difference between LCZ 1 (compact high-rise) and LCZ D (low plants); however, according to the LCZ map from WUDAPT, LCZ 1 is not present in London. Therefore, for this study UHI magnitude has been defined as the temperature difference between LCZ 2 (compact midrise) and LCZ D.
To evaluate the UHI impact on building energy, the concept of heating and cooling degree hours (HDH/CDH), which is analogous to heating and cooling degree days, has been used. Degree hours are a simple and repeatable indicator of estimating and monitoring the energy consumption of buildings due to space heating and cooling. Degree hours can provide an approximation for building energy usage, although actual energy usage will vary depending on building-specific properties, such as casual gains, thermal capacity, and room moisture content [38]. The application of degree hours depends on the 'base temperature' (T b ). This is a balance point temperature at which outdoor ambient is sufficiently high (low) enough such that there is no need for additional heating (cooling). T b varies between regions with different climates, but for the UK 15.5 • C is commonly used as T b for HDH and 22 • C for CDH [39]. Degree hours are essentially an hourly summation of how many degrees the outdoor air temperature is above or below T b . However, an issue that is particularly problematic for crowdsourced data, is that the number of hours recorded varies between stations, due to intermittent failures in the transmitting of data to the Netatmo API. Therefore, the degree hours have been normalised by the number of hours the station recorded. Following this, it has been multiplied by the maximum number of available hours for the studied period. Consequently, the CDH and HDH are defined mathematically in Equations (1) and (2), respectively: where t start is the starting time for degree hours to be calculated, t end is the end time, T j is the temperature at time j and T b is the base temperature, n meas is the number of measurements made by the station between t start and t end , and n max is the maximum possible hourly measurements within this time frame.

Effect of the Quality Control
The quality check method was validated on the cities of Berlin (Germany) and Toulouse (France) by Napoly et al. [23], and the proportion of the eliminated data at each level were recorded. The similar process was done for London data, and the results are shown in Table 3. From level M1 to O2, a similar proportion of data was eliminated in this study to the other two cities. Levels M1 to M4 consist of a statistical test to eliminate erroneous measurement. The similar result indicates the intrinsic quality of data collected from London is comparable to that of Berlin and Toulouse. The remaining steps (O1-O3) are focused on the availability of the data of each station. Level O3 eliminated a considerable amount more data in this study compared to the cities studied by Napoly et al. [23]. At level O2, about 75% of data remained for all three cities, whereas at O3 London had just 49.56% remaining, when Berlin and Toulouse had 58.54% and 57.41% remaining, respectively. Level O3 omits stations that have less than 80% of data in the month. The large difference between O2 and O3 indicates that the data collection was more intermittent or inconsistent in London than that of Napoly et al. [23]. This is likely due to intermittent errors in the script collecting data, or problems related to the tile size submitted to the API being too large. Figure 4 draws the hourly temperatures over the study period for all Netatmo stations and stations that passed the O3 level of the quality check and shows a clear improvement in the quality of data. The time series of the raw Netatmo data ( Figure 4a) shows some unrealistic temperature readings exceeding 55 • C, likely obtained from stations that have been left in direct sunlight. Very little diurnal variation of the daily temperature can also be found, which may be contributed to the station being left indoors. Figure 4b shows the time series of Netatmo stations that passed level O3 of the QC. At this level, neither of these characteristic errors is visible, with much more realistic temperature readings and a spread much closer to the median temperature.

LCZ Temperature Characteristics
Monthly box plots of mean Tmean, Tmax, Tmin, and DTR in each LCZ are generally consistent with the definition of the characteristics of LCZs. The box plots for July and December are displayed as exemplars for a summer and winter month in Figures 5 and  6, respectively. Box plots are also displayed for LCZs with more than ten stations present. The method of error calculation of the means is shown in Appendix A. The remaining months are displayed in Appendix B, Figure A1, Figure A2 and Figure A3. In general, LCZs associated with more urban landscapes (LCZs 1-10) have a higher mean and minimum temperature than those associated with rural landscapes (LCZs A-G). This is especially prevalent in the summer months. In July, August, and September, LCZ 2 (compact mid-rise) displayed the highest mean temperature. LCZ 2 is the densest urban LCZ present in London and would be expected to have the highest temperatures. Differences in maximum temperature are generally smaller when compared to differences in minimum temperature. This is consistent with the understanding of urban heat islands, as urban areas have a higher thermal capacity so retain more heat during the night than rural areas. Alternatively, during the day, urban areas are often observed to be slightly cooler than rural areas. Because of this, rural LCZs would be expected to have a greater diurnal temperature range than urban LCZs. This is evident in July but is not obvious in other months. Analysis of the average diurnal cycle of each LCZ is shown in Figure 7. It shows a general trend of warmer temperatures in more built-up LCZs, especially at night. In July, maximum temperatures appear comparable for all LCZs, whereas in December there are still large differences between urban and rural in the middle of the day.
Although in many aspects, these results are consistent with the understanding of urban heat islands and the LCZ scheme, inconsistencies between the results and the expected temperature characteristics do exist, especially in cases where there are a low number of stations in an LCZ. One such inconsistency is that in December ( Figure 6) and November (Appendix B, Figure A3), LCZ 2 (compact midrise) was cooler than LCZ 5 (open midrise). In the average December diurnal cycle (shown in Figure 7b), LCZ 5 is hotter than LCZ 2 throughout the day, contrary to the thermal definition of these two urban classifications. This is likely related to the small number of stations in LCZ 2 in these months. There was only one qualified station in LCZ 2 in November and three in December. The sample cannot be representative of the entire LCZ and any station taking erroneous readings will play a considerable role in skewing the mean temperature of the

LCZ Temperature Characteristics
Monthly box plots of mean T mean , T max , T min , and DTR in each LCZ are generally consistent with the definition of the characteristics of LCZs. The box plots for July and December are displayed as exemplars for a summer and winter month in Figures 5 and 6, respectively. Box plots are also displayed for LCZs with more than ten stations present. The method of error calculation of the means is shown in Appendix A. The remaining months are displayed in Appendix B, Figures A1-A3. In general, LCZs associated with more urban landscapes (LCZs 1-10) have a higher mean and minimum temperature than those associated with rural landscapes (LCZs A-G). This is especially prevalent in the summer months. In July, August, and September, LCZ 2 (compact mid-rise) displayed the highest mean temperature. LCZ 2 is the densest urban LCZ present in London and would be expected to have the highest temperatures. Differences in maximum temperature are generally smaller when compared to differences in minimum temperature. This is consistent with the understanding of urban heat islands, as urban areas have a higher thermal capacity so retain more heat during the night than rural areas. Alternatively, during the day, urban areas are often observed to be slightly cooler than rural areas. Because of this, rural LCZs would be expected to have a greater diurnal temperature range than urban LCZs. This is evident in July but is not obvious in other months. Analysis of the average diurnal cycle of each LCZ is shown in Figure 7. It shows a general trend of warmer temperatures in more built-up LCZs, especially at night. In July, maximum temperatures appear comparable for all LCZs, whereas in December there are still large differences between urban and rural in the middle of the day.
Although in many aspects, these results are consistent with the understanding of urban heat islands and the LCZ scheme, inconsistencies between the results and the expected temperature characteristics do exist, especially in cases where there are a low number of stations in an LCZ. One such inconsistency is that in December ( Figure 6) and November (Appendix B, Figure A3), LCZ 2 (compact midrise) was cooler than LCZ 5 (open midrise). In the average December diurnal cycle (shown in Figure 7b), LCZ 5 is hotter than LCZ 2 throughout the day, contrary to the thermal definition of these two urban classifications. This is likely related to the small number of stations in LCZ 2 in these months. There was only one qualified station in LCZ 2 in November and three in December. The sample cannot be representative of the entire LCZ and any station taking erroneous readings will play a considerable role in skewing the mean temperature of the LCZ. Therefore, LCZ 2 values are more likely to be erroneous in these months.   Figure 7 shows the mean diurnal temperature cycle for each LCZ. Some unreasonable result appears. For example, LCZ 4 in July (Figure 7a) achieved maximum temperature earlier than others. These occur primarily in LCZs with few stations, which is probably due to the data loss at certain times. The reference stations show some temperature consistency with that expected from the LCZ scheme, but it's clear there is also dependency on the site itself. The 'London City' station, which is classed as being in LCZ D, consistently measures higher temperatures than both the other reference station in LCZ D (Northolt) and crowdsourced data. Other than London City, the other stations do show a slight trend of higher mean and minimum temperature in more built-up LCZs. It is noteworthy that in the summer months, crowdsourced temperatures are generally higher than reference temperatures in each LCZ except the 'London City' station. This is possible because crowdsourced sensors are heated not only by shortwave radiation, but also by longwave radiation from walls [23]. Interestingly, however, in the winter months, the reference mean temperature and maximum temperature were close to, or greater than, the crowdsourced. These differences between summer and winter may relate to the differences in incoming radiation between summer and winter. In the winter, when there's less solar radiation, the role of longwave heating of Netatmo stations by nearby walls will be considerably less. Having access to more reference data from stations with more homogeneous surroundings would benefit these comparisons. Figure 7. The mean diurnal temperature cycle for each LCZ (mean temperature of every station in the relevant LCZ at that hour) for July (a) and December (b). The number of stations in each LCZ is the same as that indicated in Figure 5 for July and Figure 6 for December. LCZs 2, 6, A, and D are highlighted to emphasise differences between LCZs with different forms, and that is common in London.

Urban Heat Island
Over the entire study period, the mean UHI magnitude was 1.07 • C ± 0.03 • C, which is remarkably similar to the average in London from 1901-2006 (1.1 • C) [40]. Jones and Lister found differences in 30-year mean temperatures between the stations in St James's Park and Rothamsted, noting no significant change in the difference in temperature between these sites over this time. London's UHI was then quantified by averaging over these 30-year periods. Although the method of data collection and UHI quantification varies greatly from this study, it still shows that the Netatmo estimated mean UHI is in the correct range at the very least. A maximum UHI magnitude of 4.89 • C ± 1.29 • C, was recorded, at midnight on the 30th of September. This is slightly less than that found by Chapman et al. [14] and Watkins et al. [41], who found maximum magnitudes of 5.5 • C and 7 • C, respectively. Again, this is not surprising due to the differences in techniques for UHI classification used in each study. This lack of consensus on UHI quantification is a major issue in UHI research [42] and was a key motivation for the development of the LCZ scheme.
A time series of the daily mean UHI over the study period is displayed in Figure 8. This shows the UHI fluctuating around 1 • C in July, August, and September. In the winter months, the UHI magnitude is noticeably lower, and rarely exceeds 1 • C. This trend of summer and autumn having a stronger UHI is consistent with previous observations of London's UHI [43]. However, it should also be noted that the number of stations in LCZs 2 and D that passed the QC is variable in different months. It may also impact results. It is possible that a station in LCZ 2 that was systematically measuring high temperatures in the summer, did not pass the QC in the winter months, and therefore did not contribute to UHI calculation in the winter.

Urban Heat Island
Over the entire study period, the mean UHI magnitude was 1.07 °C ± 0.03 °C, which is remarkably similar to the average in London from 1901-2006 (1.1 °C) [40]. Jones and Lister found differences in 30-year mean temperatures between the stations in St James's Park and Rothamsted, noting no significant change in the difference in temperature between these sites over this time. London's UHI was then quantified by averaging over these 30-year periods. Although the method of data collection and UHI quantification varies greatly from this study, it still shows that the Netatmo estimated mean UHI is in the correct range at the very least. A maximum UHI magnitude of 4.89 °C ± 1.29 °C, was recorded, at midnight on the 30th of September. This is slightly less than that found by Chapman et al. [14] and Watkins et al. [41], who found maximum magnitudes of 5.5 °C and 7 °C, respectively. Again, this is not surprising due to the differences in techniques for UHI classification used in each study. This lack of consensus on UHI quantification is a major issue in UHI research [42] and was a key motivation for the development of the LCZ scheme.
A time series of the daily mean UHI over the study period is displayed in Figure 8. This shows the UHI fluctuating around 1 °C in July, August, and September. In the winter months, the UHI magnitude is noticeably lower, and rarely exceeds 1 °C. This trend of summer and autumn having a stronger UHI is consistent with previous observations of London's UHI [43]. However, it should also be noted that the number of stations in LCZs 2 and D that passed the QC is variable in different months. It may also impact results. It is possible that a station in LCZ 2 that was systematically measuring high temperatures in the summer, did not pass the QC in the winter months, and therefore did not contribute to UHI calculation in the winter.  Figure 9 shows the building cooling demand in London in July 2018. The general spatial trend of cooling degree hours (CDH) was that CDH is highest in London city centre, with it decreasing further from the centre (Figure 9a). As CDH has a linear relationship with building energy usage for cooling [38], this indicates the highest energy usage in the city centre, with usage decreasing towards the outskirts. The decrease is not Figure 8. The daily mean urban heat island magnitude (defined as TLCZ2-TLCZD), indicated by the red line. n LCZ2 and n LCZD represent the most common number of stations for that month in LCZ 2 and LCZ D, respectively. The filled grey area shows the area between the mean plus one standard deviation and the mean minus one standard deviation. Figure 9 shows the building cooling demand in London in July 2018. The general spatial trend of cooling degree hours (CDH) was that CDH is highest in London city centre, with it decreasing further from the centre (Figure 9a). As CDH has a linear relationship with building energy usage for cooling [38], this indicates the highest energy usage in the city centre, with usage decreasing towards the outskirts. The decrease is not uniform in all directions. In addition, variability between stations is high, with some stations directly adjacent to each other measuring very different CDH. After classifying stations into their relevant LCZ and finding the mean CDH in each LCZ (Figure 9b), it's seen that LCZ 2 (compact midrise) requires the most energy for cooling purposes with a total CDH of 1550. This is to be expected because LCZ 2 is where the geometry, construction materials, and anthropogenic heat release typically lead to the warmest temperatures. LCZ 8 (large low-rise) has the least cooling demand, although there is only one valid station in this LCZ, making the accuracy of this value questionable. Figure 10 shows mean cooling degree hours in July 2018 for each LCZ expressed as a percentage of the mean cooling degree hours of LCZ 2. LCZ A (dense trees) had the second-lowest cooling demand with just 70% of the cooling demand of LCZ 2. Urban LCZs 5 (compact midrise) and 6 (open midrise) and rural LCZs B (scattered trees) and D (low plants) have similar cooling demands, all with between 80% and 85% of the cooling demand of LCZ 2. This is likely due to the similar maximum temperatures between these LCZs (Figure 5b). Temperatures in LCZs 5 and 6 only exceed the CDH base temperature of 22 • C from the late morning to the early evening. Therefore, temperatures during the hottest hours of the day will be the only temperatures included in the CDH calculation. Since the urban-rural temperature differences during the calculation hours are small (as seen in Figure 7), the cooling demands in these LCZs are similar. Overall this suggests that suburban areas have similar cooling demand to many rural landscapes, however, more compact urban environments have significantly higher demand. As LCZs 5 and 6 will be largely residential, this implies that the UHI may not have a great impact on housing cooling demand in London. However, the cooling demand of office buildings, which are more likely to be in LCZ 2, will be significantly affected by the UHI. The finding that rural LCZs B and D have between 80% and 85% of the cooling demand of LCZ 2 is comparable to the findings of previous research. Figure 11 draws the spatial distribution of heating degree hours (HDH) in December. It shows an inverse trend to that of the July CDH, with less heating demand in the city centre and increasing demand further out. As with Figure 11a, there is high variability in this trend. The LCZ box plot (Figure 11b) shows LCZ 5 has the least heating demand, even less than the more densely urban LCZ 2. The reason for this is ambiguous but may be related to the small number of stations in LCZ 2 (mentioned in Section 2.4. Rural LCZs have a considerably greater heating demand than urban LCZs with mean rural HDH is more than 350 higher than that of any urban LCZ. The differences in heating demand between urban and rural LCZ is clearer than that of cooling demand in July. This is because the greatest LCZ temperature differences are at night when temperatures are more likely to be further below the base temperature. Night temperatures drop in rural stations is further below the base temperature and last longer than in urban stations.  The mean cooling degree hours of each LCZ. Filled blue dots represent LCZs with more than one station inside; unfilled dots represent LCZs with only one station inside. Errors on the means are one standard deviation. Boxes are plotted for LCZs with more than ten stations within them. The meaning of the bounds and whiskers of the boxes are explained in Figure 5.

Building Energy Consumption
(b) Figure 9. Building cooling demand in London in July 2018: (a) the spatial distribution of cooling degree hours. Red colours indicate greater cooling demand than blue colours. (b) The mean cooling degree hours of each LCZ. Filled blue dots represent LCZs with more than one station inside; unfilled dots represent LCZs with only one station inside. Errors on the means are one standard deviation. Boxes are plotted for LCZs with more than ten stations within them. The meaning of the bounds and whiskers of the boxes are explained in Figure 5.  Figure 11 draws the spatial distribution of heating degree hours (HDH) in December. It shows an inverse trend to that of the July CDH, with less heating demand in the   Figure 12 shows the heating demand of each LCZ, expressed as a percentage of the heating demand of LCZ D. It is expressed as a percentage of LCZ D as opposed to LCZ 2 because of the low number of valid stations in LCZ 2 in December. It's seen that the heating demand of the rural LCZs A and B are identical to that of LCZ D, with all having around 5750 heating degree hours. The remaining, more urban, zones have between 85% and 95% of the heating demand. For example, LCZs 2, 5, and 6 had 91%, 86%, and 95% of the heating demand of LCZ D, respectively.

Discussion
In using crowdsourcing and the local climate zone scheme, this study has utilised two techniques that can be considered, relatively speaking, to still be in their infancy. Although both have proven exceedingly useful for producing the results displayed in this paper, the combined use of them has led to several difficulties. Overall, the inter-LCZ analysis of temperature showed many features that would be expected based on the understanding of the urban environment and the definitions of LCZs. However, in some cases, inter-LCZ differences were not consistent, such as LCZ 2 not being the hottest in the winter months (e.g., Figure 6). These inconsistencies almost exclusively arise in LCZs in which there are less than around five stations. It has been noted previously by Napoly et al. that even after a quality check, measurements from a single station should not be trusted. This can also be extended to a small number of stations when carrying out LCZ analysis. Adding to the uncertainty is the fact that in some cases, singular stations can show thermal characteristics consistent with the definition of the LCZ they are classified into. Therefore, it is difficult to know what results can be fully trusted. It may be beneficial to standardise a minimum amount of crowdsourced stations for an LCZ to be analysed. The difficulty with this is that citizen weather stations (CWS), especially in London, are found primarily in suburban areas, meaning these zones have high representation, whereas other LCZs don't. Once a QC is implemented and stations with heterogeneous surroundings eliminated, this often leaves very few stations in most LCZs. However, as CWS popularity grows over time, this may become less of an issue. The high number of stations located in LCZ 6 does generate a potential area of further research, as it can be investigated whether variability in this LCZ is dependent on the area of London the station is in. This would help determine the validity of the LCZ scheme in London as the temperature variability in LCZ 6 between areas of London could be compared to inter-LCZ variability.
The presence of stations classified in LCZ G (water) raises further concerns about combining the use of Netatmo stations and LCZs. There's a possibility that these stations could be legitimate and be located on boats on the River Thames. However, it seems more likely that this arises from errors in either the LCZ classification or Netatmo location. It could be that the LCZ map downloaded from the WUDAPT website is not perfectly accurate, or maybe stations located close to the banks of the Thames are classified into LCZ G as the pixel it is within is primarily water. Alternatively, this may arise from inaccuracies in the broadcasted location of these Netatmo stations. Regardless, the presence of stations in LCZ G does raise question marks whether other stations used in this study were classified into LCZs that, in reality, they were not located within. As the methods of urban classification become more refined and detailed, these kinds of errors should be mitigated. Ensuring that Netatmo location is always inputted to a high precision would also improve the applicability of using Netatmo stations for urban climate studies.
Discussed here is just a selection of a wide number of difficulties from using CWS data, coupled with the LCZ scheme, for scientific research. There still seems to be some way to go before CWS data can be used to initialise numerical weather prediction models, as has been suggested previously [21]. As has been mentioned in several cases [14,24]; the scientific use of Netatmo stations would be greatly improved by the stations having more metadata available. In that way, the quality of the data from the station will become clearer. Despite this, there is certainly great potential in the use of these techniques. The unparalleled high spatial density of CWSs provides the potential to unlock a wealth of knowledge of urban climates. In the future, it could be possible that city planners will know what the exact thermal properties of an urban area being developed will be. Then they can know what the characteristics of the building energy usage and heat-related risk to human health will be. The use of crowdsourced data and the LCZ scheme could well contribute to achieving this. Hence, further research into this topic should be carried out.
In London, the UHI may not have a great impact on housing cooling demand. However, the cooling demand of office buildings will be significantly affected by the UHI. The finding that rural LCZs have between 80% and 85% of the cooling demand of urban LCZ is comparable to the findings of previous research. Kolokotroni et al. [5] found that in a typical hot week, a rural office has 84% of the cooling demand of a similar office in urban London. This shows a reasonable degree of coherency between the findings of UHI impact on building energy, despite the methods used to obtain the findings being dramatically different.
Despite winter heating demand having a clearer difference between urban and rural LCZs, percentage differences in summer cooling demand are greater than percentage differences in winter heating demand. These results suggest the reduction in heating by the UHI is less than previous estimates. Watkins et al. [44] found that the annual heating load is 22% less in urban London compared to rural surroundings. This study suggests a reduction of the urban heating load of around 5-15%. This difference may simply be down to the differences in study period time and length.
It should be noted that the magnitude of HDH in December greatly exceeds that of CDH in July. HDH for a single station reached values of 6500, whereas CDH reached only 2000. This shows that even during heatwave conditions, summer cooling demand in London is not as significant as winter heating demand. For both summer cooling and winter heating demand, the standard deviation of the mean is high for all LCZs, suggesting high variability within each LCZ. This variability suggests that building cooling load may not depend only on LCZ. There may be dependence on site-specific parameters such as the amounts of anthropogenic heat release. Alternatively, the variability could be due to certain stations receiving longwave radiation from nearby walls or due to incorrect LCZ classification of certain stations (as discussed above).
Finally, it should be noted that the degree hours approach used in this study is only a proxy to real heating and cooling load for a specific building as this approach only considers the outdoor air temperature. Apart from outdoor air temperature, many other factors contribute to the variability of the building energy consumption in the cities e.g., solar radiation, wind speed and direction, and building features. More advanced building energy simulation tools e.g., EnergyPlus could be used to understand the energy consumption for a specific building located in different parts of the city which will lead to our future research. The datasets collected in this research could serve as a good database for future projects as well.

Conclusions
Crowdsourced data from Netatmo personal weather stations has been used to investigate the thermal characteristics and building energy consumption of each local climate zone (LCZ) in London. Data from over 1300 Netatmo stations was collected, from July 2018 to December 2018, which included a summer heatwave period. Each station was classified into an LCZ, using LCZ data obtained from World Urban Database and Access Portal Tools (WUDAPT). As the quality of data from Netatmo stations cannot be known, a quality check, devised by Napoly et al. [23], was implemented. This undoubtedly improved the quality of data used but at the cost of a 50% reduction in data availability.
For the most part, the temperature differences between LCZs are consistent with the definition of LCZs [34], with urban LCZs generally having higher mean temperatures than rural LCZs. Urban LCZs also have a trend of significantly warmer minimum temperatures, especially in the summer months. This is most likely due to the building materials used in urban areas retaining more heat over the nighttime. The urban heat island (UHI) has been defined as the temperature difference between LCZ 2 (compact midrise) and LCZ D (low plants). The mean UHI was found to be 1.07 • C ± 0.03 • C, which is comparable to previous estimations of London's heat island.
The building cooling demand was analysed in July and heating demand analysed in December, using cooling and heating degree hours (CDH/HDH). The magnitude of the heating degree hours in December far exceeded the magnitude of cooling degree hours in July, despite the heatwave. LCZ 2 had the highest July cooling demand, with a total CDH of 1550. The suburban-related LCZs 5 (open midrise) and 6 (open low-rise) had a similar cooling demand to the rural LCZs B (scattered trees) and D (low plants), with all these zones having between 80% and 85% of the demand of LCZ 2. LCZ A (dense trees) had just 70% of the cooling demand of LCZ 2. In December, rural LCZs had the highest heating demand, with LCZs A, B, and D all having a total HDH of around 5750. Urban LCZs 2, 5, and 6 had 91%, 86%, and 95% of the heating demand of LCZ D, respectively.
Additional refining of the use of crowdsourced weather data is still necessary for its potential to be fully utilised. Implementing a quality check greatly improves data quality but temperature readings from a single/small number of stations still often don't adhere to the understanding of the urban climate. It is difficult to know what results can be trusted and what can't. As has been stressed before, improved metadata of Netatmo stations would help address this issue. This study highlights the potential of crowdsourced data in distinguishing differences in building energy usage between LCZs, however, it has also accentuated some of the issues in using it. Once these issues are addressed, crowdsourced data may well become key to improving understanding of the urban climate and applying this understanding to city planning.