Next Article in Journal
Mobility and Dissemination of COVID-19 in Portugal: Correlations and Estimates from Google’s Mobility Data
Next Article in Special Issue
An Inventory of Large-Scale Landslides in Baoji City, Shaanxi Province, China
Previous Article in Journal
Populating the Data Space for Cultural Heritage with Heritage Digital Twins
Previous Article in Special Issue
Three-Dimensional, Km-Scale Hyperspectral Data of Well-Exposed Zn–Pb Mineralization at Black Angel Mountain, Greenland
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Evaluation of the OpenWeatherMap API versus INMET Using Weather Data from Two Brazilian Cities: Recife and Campina Grande

by
Anwar Musah
1,2,*,
Livia Màrcia Mosso Dutra
3,
Aisha Aldosery
2,
Ella Browning
4,
Tercio Ambrizzi
3,
Iuri Valerio Graciano Borges
3,
Merve Tunali
5,
Selma Başibüyük
5,
Orhan Yenigün
5,6,
Giselle Machado Magalhaes Moreno
3,
Ana Clara Gomes da Silva
7,
Wellington Pinheiro dos Santos
7,
Clarisse Lins de Lima
8,
Tiago Massoni
9,
Kate Elizabeth Jones
4,
Luiza Cintra Campos
10 and
Patty Kostkova
2
1
UCL Department of Geography, Geospatial Analytics and Computing Group (GSAC), University College London, London WC1E 6BT, UK
2
UCL Centre for Digital Public Health & Emergencies, University College London, London WC1E 6BT, UK
3
Department of Atmospheric Sciences, Institute of Astronomy, Geophysics and Atmospheric Sciences (IAG), University of São Paulo, São Paulo 05508-010, Brazil
4
Centre for Biodiversity and Environment Research, Department of Genetics, Evolution & Environment, University College London, London WC1E 6BT, UK
5
Institute of Environmental Sciences, Boğaziçi University, Bebek, Istanbul 34342, Turkey
6
School of Engineering, European University of Lefke, Lefke 99010, Northern Cyprus, Turkey
7
Department of Biomedical Engineering, Federal University of Pernambuco, Recife-PE 50740-550, Brazil
8
Polytechnic School of Pernambuco, University of Pernambuco (Poli-UPE), Recife-PE 50720-001, Brazil
9
Department Systems & Computing, Federal University of Campina Grande, Campina Grande-PB 58429-900, Brazil
10
Department of Civil, Environmental & Geomatic Engineering, University College London, London WC1E 6BT, UK
*
Author to whom correspondence should be addressed.
Data 2022, 7(8), 106; https://doi.org/10.3390/data7080106
Submission received: 30 June 2022 / Accepted: 22 July 2022 / Published: 30 July 2022

Abstract

:
Certain weather conditions are inadvertently related to increased population of various mosquitoes. In order to predict the burden of mosquito populations in the Global South, it is imperative to integrate weather-related risk factors into such predictive models. There are a lot of online open-source weather platforms that provide historical, current and future weather forecasts which can be utilised for general predictions, and these electronic sources serve as an alternate option for weather data when physical weather stations are inaccessible (or inactive). Before using data from such online source, it is important to assess the accuracy against some baseline measure. In this paper, we therefore evaluated the accuracy and suitability of weather forecasts of two parameters namely temperature and humidity from the OpenWeatherMap API (an online weather platform) and compared them with actual measurements collected from the Brazilian weather stations (INMET). The evaluation was focused on two Brazilian cites, namely, Recife and Campina Grande. The intention is to prepare an early warning model which will harness data from OpenWeatherMap API for mosquito prediction.
Dataset:https://figshare.com/s/08449337eb8194848c72 (accessed on 21 July 2022)
Dataset License: CC BY 4.0

1. Background and Summary

Certain meteorological parameters are risk factors for increased mosquito presence and, thus, are needed for integration into such early warning detection system. The use of statistical forecasts of meteorological indicators of the likes of temperature, humidity, atmospheric pressure and many more, in near- or real-time, are made by collecting as much data as possible about the existing state of the atmosphere. Generally, this information is often applied to many avenues of research for understanding their impacts on other ecosystems, environmental and anthropogenic processes [1,2,3,4,5]. This information can be pulled from online digital sources, such as automatic weather stations, satellite and remote sensing, and through open source weather channels that are online via Application Programming Interfaces (APIs) [6,7].
From an epidemiological perspective, with regional focus on the Global South where most tropical illnesses are endemic, these meteorological indicators play an immense role in the areas of neglected tropical diseases as risk factors for spreading parasitic illnesses such as soil-transmitted helminths, Onchocerciasis, Lymphatic Filariasis and Schistosomiasis [8,9,10]. These weather parameters play a substantial role in vector- or mosquito-borne diseases and are prominent risk factors for illnesses such as Malaria, West Nile Virus and Yellow Fever. The impact of these weather factor plays in two ways: (1) they create an environment or climate that is conducive and habitable for such vectors to thrive (i.e., environmental suitability) [11,12]; and (2) it allows vectors to breed in dwellings inhabited by humans and animals thereby posing a threat to humans [13,14].
In the context of mosquito surveillance and monitoring in vulnerable human populations in Brazil, one of the most sought-after tools is the early warning detection system and prediction of increased mosquito population to support the effort of environmental health officers to pre-emptively combat the burden of mosquito-borne arboviruses. The models developed for such tasks are usually either mechanistic (i.e., compartment or differential equations) models, or spatial spatiotemporal models for inferential purposes, which allows for adjustments for other risk factors, temporality and spatial structure. Such an early warning system can be integrated into a smartphone and web application. Therefore, and to this end, researchers from the University College London’s Centre for Digital Public Health and Emergencies (London, UK) are working together with Brazilian academics and stakeholders from the Federal University of Pernambuco (Recife, State of Pernambuco, Brazil), Federal University of Campina Grande (Campina Grande, State of Paraiba, Brazil), University of São Paulo (State of São Paulo, Brazil) and Brazilian environmental agencies to design and develop such application [15,16,17,18,19,20] for it to be piloted in two major cities in the Northeast Brazil that were hit hard by the 2015/16 Zika virus epidemic, namely, Recife [21] and Campina Grande [22]. In this context, considerations are taken for the selection of weather-related risk factors which will be integrated into the application (containing the GPS of properties as well as property-level physical, environmental characteristics and indicators for socioeconomic deprivation and anthropogenic activities collected by agents) to make spatial prediction of current and future lead-time occurrence of mosquitoes. The meteorological parameters obtained from local weather stations from the Brazilian National Institute of Meteorology (INMET) provide historical and current weather data to include in geostatistical models (e.g., spatiotemporal Bayesian or general additive models (GAMs)) for early warning detection of mosquito populations [23,24]. Weather forecast-based sources, such as the OpenWeatherMap API, also provide through an API large amounts of data for the current time (analysis) and for future days based on real-time generated forecasts [25]. Both sources, i.e., INMET and OpenWeatherMap API, are being considered for integration into the smartphone and web application that will be developed. Before doing so, we must evaluate the accuracy and suitability of the OpenWeatherMap API resource versus the INMET database, since the former is a recent resource.
The next section describes the methodology for assessing the accuracy for a selected set of meteorological parameters (namely, mean temperature and relative humidity only) by making comparisons between OpenWeatherMap API and INMET. Here, we will first see how representative the two sources are by assessing the residual difference between the current INMET observations and those of OpenWeatherMap API. A second analysis will also examine the forecast errors within OpenWeatherMap API by comparing those at 1-day, 2-day, 3-day and 4-day against the current measurements (i.e., 0-day) (so as to detect any evidence of forecast drift with the lead-times). Both analyses will take into consideration two broader seasons (i.e., Rainy and Dry). The analysis uses data from two cities, namely, Recife and Campina Grande. The main reasons why we have restricted the analysis to these two cities is the fact that Recife and Campina Grande are two study areas we are currently engage with now. We are conducting collaborative research with the environmental health agencies at these two study cities to combat mosquito-arborviruses, and we intend to develop an early warning system for mosquito population outbreaks. Hence, we are channelling various data streams, especially those that are weather related variables. Thus, we endeavour to compare how similar OpenWeatherMap is with some baseline (i.e., INMET) which the former provides lead time forecast and while the latter provides historical measure, which both, in turn, can be harnessed for such early warning predictions.

2. Methods

In this section, we describe in detail how the datasets were extracted from two electronic meteorological sources: OpenWeatherMap API (or OpenWeather Global Services) and the Brazilian National Institute of Meteorology (in Portuguese: Instituto Nacional de Meteorologia, INMET), for the two cities in Northeast Brazil. Here, we used data from 1 May 2020 to 28 March 2021 (as this is where we have data available at the time, when we carried out the API extraction from OpenWeatherMap). For Recife, it should be noted that March, April, May, June, July and August were defined as the period of Raining season; while the remaining months were defined as the period of Dryness [26]. Campina Grande, on the other hand, experiences shorter periods of rain during March, April, May and June (whereas the remaining months are dry) [27].

2.1. Data Records

2.1.1. Dataset 1: OpenWeatherMap API

The OpenWeatherMap API is an online meteorological service which provides weather data that includes forecasts and current analysis data to researchers and developers of web-based services and mobile applications. For data sources, it harnesses meteorological broadcast services particularly using raw data from airport weather stations as well as those from radar stations, and data from other official weather stations [25]. The OpenWeatherMap API, in particular, processes all the data using machine learning to enhance the numerical weather prediction models provided by several data sources e.g., NOAA, Met Office, ECMWF and Environmental Canada (see https://openweathermap.org/technology (accessed on 18 November 2021)). The idea behind the development of OpenWeatherMap API was inspired by the platforms known as OpenStreetMap and Wikipedia, which make information freely available to everyone; OpenWeatherMap API typically utilises the OpenStreetMap to visualise its spatially referenced weather predictions. It also provides an API with JSON endpoints to make free and unlimited calls which is updated on every hour, to get current values for weather indicators and 3-hourly forecast values stretching up to five days (i.e., these are observed (now) measures, with accompanied by 3-hourly “consecutive” predictions on lead time estimates (going up-to five days in the future) on what these weather variable measurements will be ahead of what was currently observed) and we have compiled these data from OpenWeatherMap API. Note that the current and forecasted values are updated and released every hourly. For our research, which is situated in Recife and Campina Grande, we automated the process via RStudio and Python for extracting the current and 3-hourly weather information for the following parameters: temperature, relative humidity, pressure, cloud cover and weather description, and automatically into MongoDB using the API address: http://api.openweathermap.org/data/2.5/forecast?id=[ID]&APPID=[KEY] (accessed on 21 July 2022).
Given the API key provided by the OpenWeatherMap API services and then setting the station IDs by inserting the values of 3390760 (longitude: −34.8811 and latitude: −8.0539) and 3403642 (longitude: −35.8811 and latitude: −7.2306) to the above connection, we are able to extract the weather parameters for Recife and Campina Grande, respectively, and continuously compile the records into MongoDB. The dataset from OpenWeatherMap API were downloaded as a JSON which have a nested structure (i.e., 3-hourly weather estimates nested within city-level information) (Figure 1).

2.1.2. Dataset 2: Brazilian National Institute of Meteorology (INMET)

The INMET database comprises an extended network of automatic and conventional weather stations since the year 1961 and it is the one most recognised [7,23,24]. As of May 2000, there are 584 stations in operation throughout the country and each station may provide hourly, daily, and/or monthly values for the measured meteorological parameters of temperature (average, minimum and maximum), relative humidity, pressure and many more (Figure 2). We downloaded the full available data for the INMET automatic weather stations located in Recife and Campina Grande with station IDs A301 (with longitude: −34.959239 and latitude: −8.05928) and A313 (with longitude: −35.904831 and latitude: −7.225574), respectively; using the following link: https://tempo.inmet.gov.br/TabelaEstacoes/# (accessed on 21 July 2022).

2.2. Statistical Analysis

In this work, we are mostly interested in analysing two weather parameters: temperature and humidity, as these variables are the most important predictors for mosquito presence, which we intend to use for future work, and so the analysis in this paper limited to these two variables only. To iterate, we begin the analysis by examining how representative the two sources are by assessing the residual difference in two steps:
First, by comparing the day-by-day (according to time stamp as well) OpenWeatherMap API current measurements with baseline measurements from INMET stratified by season type (i.e., dry or wet). Here, we measure the mean difference between samples and use a 2-sampled unpaired t-test to determine whether the two sources are significantly different from each other. A p-value exceeding the 0.05 is an indication the day-by-day differences are not zero, and, thus, the OpenWeatherMap being different from INMET.
A second analysis focuses on comparing OpenWeatherMap API’s future forecasts made for 1-day, 2-day, 3-day and 4-day lead times against its own current baseline measurement for 0-day (so as to detect any evidence of forecast drift with the lead-times) using Taylor diagrams without stratification.
To reiterate, in order to account for broader variations caused by seasonality, the analyses in outlined in 1 are stratified by “Wet” and “Dry” season. In step one, it will involve calculating the mean of the difference between the current and baseline measurements of OpenWeatherMap API and INMET and summarising this information using a series of time series based residual plots, and then computing their overall mean differences and standard deviations accordingly by season type (i.e., “Wet” and “Dry”) for each city. The analysis is performed on day-by-day and time stamp level (on a 6-h interval instead of a 3-h—we chose this for easy data management purposes) (e.g., 1 May 2020 00:00:00 UTC, 1 May 2020 06:00:00 UTC, 1 May 2020 12:00:00 UTC and so on) and stratified by season type, i.e., “dry” and “wet”, to account for any type of effect modification [4]. Normality for each of these differences was ensured through visual inspection of histograms, and by usage of the Kolmogorov–Smirnov test (where the p-value is greater than 0.05 to ensure that the distribution does not deviate from a normal distribution). Further analysis was conducted through the use of Taylor diagrams to compare each lead time forecast from OpenWeatherMap API against INMET and to ensure that the results are consistent. The results presented in these diagrams will graph out Root Mean Squared Error (RMSE), Correlation coefficient and Standard deviation. All statistical analysis and visualisations were carried out and generated using RStudio Desktop (version 2022.07.0+548 for MAC).

3. Technical Validation

By computing the mean difference and its variability by city and season between OpenWeatherMap’s and INMET (i.e., reference database), their summary statistics are presented in Table 1. The stratified plots show the point-to-point residual differences between the two sources, it gives an overarching description of the temporal patterns for temperature and humidity, and how the databases differ from each other (i.e., values approximate to zero is an indication that the readings are the same) (Figure 3 and Figure 4). The differences are marginal for temperature with p-values being above 0.05; conversely, the results indicate a significant difference when considering humidity, the mean difference and variability between OpenWeatherMap API and INMET are quite large with SD exceeding ±7 with most p-values being less than 0.01 (Table 1). Figure 5 and Figure 6, as well as results in Table 2 inclusive, provides a graphical (and statistical) summary of how closely the lead time forecasts of “what it will be” observations predicted by OpenWeatherMap API closely matches its own baseline data. Let us consider the output of Figure 6C; the lead time predictions (versus day-0) for humidity in Campina Grande are poor as they jointly show RSME increase from ±7.42 to ±8.15, and its correlation decreasing from 0.76 to 0.70.

4. Usage Notes

The two datasets described here, (1) INMET, which provides observed historical and current/near real-time measurements for weather, and (2) OpenWeatherMap API, which provides up to five days of future weather predictions, can be linked to city- and/or neighbourhood-level mosquito-borne arbovirus surveillance data to investigate the temporal patterns of mosquito populations in Recife and Campina Grande. It should be noted that such evaluation on accuracy presented in this manuscript is specifically focussed on these two Brazilian cities, and, therefore, the results are strictly representative to Recife and Campina Grande (and not applicable to other cities).
With that in mind, the overarching purpose for drawing weather-related information (in particular temperature and relative humidity) from INMET and OpenWeatherMap API is for their integration into a predictive model for making current and lead-time temporal predictions of mosquito population on a city- or neighbourhood(s) scale. INMET is an established source for weather data and is considered the reference standard to compare OpenWeatherMap API against. In terms of accuracy, we noticed that the overall residual difference for temperature between the two data sources, regardless of season type, are somewhat marginal, which is an indication that readings for temperature from OpenWeatherMap API are marginally similar to those automatically recorded in INMET. However, OpenWeatherMap API’s data for humidity differ substantially. The outcome of our examination showed that temperature as a measure from OpenWeatherMap has the least residual mean difference and variability (see Table 1) while humidity is opposite. However, when carrying out the day-by-day comparison, we find the forecasted readings in OpenWeatherMap API to be substantially different from the baseline measures—with this in mind, using OpenWeatherMap API for modelling and making predictions about mosquito population is viable option for temperature; however, for humidity, users should be wary and examine that error’s distribution on how far OpenWeatherMap API forecasts deviate from actual observed baseline weather data. This paper limits the comparisons to only two resources (because at the time, the authors were considering OpenWeatherMap API for making lead-time prediction for mosquito populations [15,16,17,18,19,20] with a specific focus on two cities—what would be an interesting possibility is to extend such analysis with other forecasts from other global weather models which might be better than OpenWeatherMap API, such examples include the Global Forecast System (GFS), which is one of many models produced by the US Government’s agency National Centres for Environmental Prediction (NCEP). On a final note, a small caution is raised when considering the use of relative humidity for mosquito prediction. It should be noted that it is a function of both water and temperature; hence, its usage in a statistical model should be “standalone” and not with temperature treated as an adjustment to avoid potential issues such as multicollinearity [28].

Author Contributions

Conceptualisation, A.M. and P.K.; Data management, extraction and analysis, A.M., L.M.M.D. and A.A.; Original draft and visualisation, A.M.; Substantial revision, L.M.M.D., T.A. and P.K.; Supervision and funding acquisition, P.K., K.E.J., W.P.d.S., T.M., L.C.C., O.Y. and T.A.; Project support, feedback and advice, I.V.G.B., M.T., S.B., G.M.M.M., A.C.G.d.S., C.L.d.L., E.B. and A.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Belmont Forum, which was supported in the United Kingdom by UKRI NERC under the grant NE/T013664/1, and in Turkey by TÜBİTAK under the grant 119N373. This work was supported in Brazil by FAPESP under the grants 2019/23553-1 and 2020/11567-5, and by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset used to produce the results for this paper are ready prepared and available through figshare (https://figshare.com/s/08449337eb8194848c72, accessed on 21 July 2022). Download is not restricted, and usage is regulated by CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/, accessed on 21 July 2022) license. Hence, the original sources i.e., OpenWeatherMap API (https://openweathermap.org/, accessed on 21 July 2022) and the Brazilian National Institute of Meteorology (INMET) (http://portal.inmet.gov.br/, accessed on 21 July 2022) must be given full attribution.

Acknowledgments

This research was conducted under the project titled: Mosquito populations modelling for early warning system and rapid public health response (MEWAR). This research through the Belmont Forum was supported in the United Kingdom by UKRI NERC under the grant NE/T013664/1, and in Turkey by TÜBİTAK under the grant 119N373. This work was supported in Brazil by FAPESP under the grants 2019/23553-1 and 2020/11567-5, and by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001. We take the opportunity to thank the Space and Aeronautics Research Institution (National Center for Satellite Technology, King Abdul-Aziz City for Science and Technology, Saudi Arabia) for their support in funding in PhD research conducted by author A.A. in the UK.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hussain, M.M.; Avci, B.; Trajcevski, G.; Scheuermann, P. Incorporating Weather Updates for Public Transportation Users of Recommendation Systems. In Proceedings of the 2016 17th IEEE International Conference on Mobile Data Management (MDM), Porto, Portugal, 13–16 June 2016; Volume 1, pp. 333–336. [Google Scholar]
  2. Makkar, G. Real-Time Disease Forecasting using Climatic Factors: Supervised Analytical Methodology. In Proceedings of the 2018 IEEE Punecon, Pune, India, 30 November–2 December 2018; pp. 1–5. [Google Scholar] [CrossRef]
  3. Marquez, P. Weather Prediction: Improving Accuracy Using Data Mining and Forecasting Techniques. Master’s Theses, University of Texas at El Paso, El Paso, TX, USA, 2020. [Google Scholar]
  4. Rao, V.B.; Franchito, S.H.; Santo CM, E.; Gan, M.A. An update on the rainfall characteristics of Brazil: Seasonal variations and trends in 1979–2011. Int. J. Clim. 2016, 36, 291–302. [Google Scholar] [CrossRef]
  5. Gómez-Martín, M.B.; Armesto-López, X.A.; Martínez-Ibarra, E. Tourists, Weather and Climate. Official Tourism Promotion Websites as a Source of Information. Atmosphere 2017, 8, 255. [Google Scholar] [CrossRef] [Green Version]
  6. Dewi, C.; Chen, R.-C. Integrating Real-Time Weather Forecasts Data Using OpenWeatherMap and Twitter. Int. J. Inf. Technol. Bus. 2019, 1, 48–52. [Google Scholar]
  7. Machado, R.D.; Bravo, G.; Starke, A.; Lemos, L.; Colle, S. Generation of 441 typical meteorological year from INMET stations-Brazil. In Proceedings of the IEA SHC International Conference on Solar Heating and Cooling for Buildings and Industry 2019, Santiago, Chile, 4–7 November 2019. [Google Scholar] [CrossRef]
  8. Brooker, S.; Clements, A.C.A.; Bundy, D.A.P. Global Epidemiology, Ecology and Control of Soil-Transmitted Helminth Infections. In Advances in Parasitology; Hay, S.I., Graham, A., Rogers, D.J., Eds.; Academic Press: Cambridge, MA, USA, 2006; Volume 62, pp. 221–261. [Google Scholar]
  9. O’Hanlon, S.J.; Slater, H.C.; Cheke, R.A.; Boatin, B.A.; Coffeng, L.E.; Pion, S.D.S.; Boussinesq, M.; Zouré, H.G.M.; Stolk, W.A.; Basáñez, M. Model-Based Geostatistical Mapping of the Prevalence of Onchocerca volvulus in West Africa. PLOS Negl. Trop. Dis. 2016, 10, e0004328. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Slater, H.; Michael, E. Mapping, Bayesian Geostatistical Analysis and Spatial Prediction of Lymphatic Filariasis Prevalence in Africa. PLoS ONE 2013, 8, e71574. [Google Scholar] [CrossRef] [PubMed]
  11. Lubinda, J.; Treviño C, A.T.; Walsh, M.R.; Moore, A.J.; Hanafi-Bojd, A.A.; Akgun, S.; Zhao, B.; Barro, A.S.; Begum, M.M.; Jamal, H.; et al. Environmental suitability for Aedes aegypti and Aedes albopictus and the spatial distribution of major arboviral infections in Mexico. Parasite Epidemiol. Control. 2019, 6, e00116. [Google Scholar] [CrossRef] [PubMed]
  12. Messina, J.P.; Kraemer, M.U.; Brady, O.J.; Pigott, D.M.; Shearer, F.M.; Weiss, D.J.; Golding, N.; Ruktanonchai, C.W.; Gething, P.W.; Cohn, E.; et al. Mapping global environmental suitability for Zika virus. eLife 2016, 5, e15272. [Google Scholar] [CrossRef] [PubMed]
  13. Caminade, C.; McIntyre, K.M.; Jones, A.E. Impact of recent and future climate change on vector-borne diseases. Ann. N. Y. Acad. Sci. 2018, 1436, 157–173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Cailly, P.; Tran, A.; Balenghien, T.; L’Ambert, G.; Toty, C.; Ezanno, P. A climate-driven abundance model to assess mosquito control strategies. Ecol. Model. 2012, 227, 7–17. [Google Scholar] [CrossRef]
  15. Beltrán, J.D.; Boscor, A.; dos Santos, W.P.; Massoni, T.; Kostkova, P. ZIKA: A New System to Empower Health Workers and Local Communities to Improve Surveillance Protocols by E-learning and to Forecast Zika Virus in Real Time in Brazil. In Proceedings of the 2018 International Conference on Digital Health, online, 23 April 2018; ACM: New York, NY, USA, 2018; pp. 90–94. [Google Scholar] [CrossRef]
  16. Musah, A.; Rubio-Solis, A.; Birjovanu, G.; dos Santos, W.P.; Massoni, T.; Kostkova, P. Assessing the Relationship between various Climatic Risk Factors & Mosquito Abundance in Recife, Brazil. In Proceedings of the 9th International Conference on Digital Public Health, online, 20 November 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 97–100. [Google Scholar] [CrossRef]
  17. Rubio-Solis, A.; Musah, A.; Dos Santos, W.P.; Massoni, T.; Birjovanu, G.; Kostkova, P. ZIKA Virus: Prediction of Aedes Mosquito Larvae Occurrence in Recife (Brazil) using Online Extreme Learning Machine and Neural Networks. In Proceedings of the 9th International Conference on Digital Public Health, online, 20 November 2019; Association for Computing Machinery: New York, NY, USA, 2019; pp. 101–110. [Google Scholar] [CrossRef]
  18. UCL-IRDR. ZIKA: A Deployment of ZIKA: A Mobile Mosquito Surveillance App to Combat Arboviruses in Brazil. UCL Institute for Risk and Disaster Reduction. 2018. Available online: https://www.ucl.ac.uk/risk-disaster-reduction/zika-deployment-zika-mobile-mosquito-surveillance-app-combat-arboviruses-brazil (accessed on 18 November 2021).
  19. UCL-IRDR. Mosquito Population Modelling for Early Warning System and Rapid Health Authority Response (MEWAR). UCL Institute for Risk and Disaster Reduction. 2020. Available online: https://www.ucl.ac.uk/risk-disaster-reduction/research-projects/2020/sep/mosquito-population-modelling-early-warning-system-and-rapid-health (accessed on 18 November 2021).
  20. Kostkova, P.; dos Santos, W.P.; Massoni, T. ZIKA: Improved surveillance and forecast of Zika virus in Brazil. Eur. J. Public Health 2019, 29 (Suppl. S4), 414–415. [Google Scholar] [CrossRef]
  21. De Souza, W.V.; Albuquerque, M.D.F.P.M.D.; Vazquez, E.; Bezerra, L.C.A.; Mendes, A.D.C.G.; Lyra, T.M.; De Araujo, T.V.B.; De Oliveira, A.L.S.; Braga, M.C.; Ximenes, R.A.D.A.; et al. Microcephaly epidemic related to the Zika virus and living conditions in Recife, Northeast Brazil. BMC Public Health 2018, 18, 130. [Google Scholar] [CrossRef] [PubMed]
  22. Cavalcante, A.C.P.; De Olinda, R.A.; Gomes, A.; Traxler, J.; Smith, M.; Santos, S. Spatial modelling of the infestation indices of Aedes aegypti: An innovative strategy for vector control actions in developing countries. Parasites Vectors 2020, 13, 197. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. INMET Information. Instituto Nacional de Meteorologia (INMET); 2020. Available online: http://portal.inmet.gov.br/ (accessed on 18 November 2021).
  24. INMET (All Weather Data). Instituto Nacional de Meteorologia (INMET); 2020. Available online: https://tempo.inmet.gov.br/TabelaEstacoes/# (accessed on 18 November 2021).
  25. OpenWeatherMap API. OpenWeather API. OpenWeather Mobile Application. 2021. Available online: https://openweathermap.org/api (accessed on 18 November 2021).
  26. De Oliveira, F.H.P.C.; Ara, A.L.S.C.E.; Moreira, C.H.P.; Lira, O.O.; Padilha, M.D.R.F.; Shinohara, N.K.S. Seasonal changes of water quality in a tropical shallow and eutrophic reservoir in the metropolitan region of Recife (Pernambuco-Brazil). Anais da Academia Brasileira de Ciências 2014, 86, 1863–1872. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Jale, J.D.S.; Júnior, S.F.A.X.; Xavier, F.M.; Stošić, T.; Stošić, B.; Ferreira, T.A.E. Application of Markov chain on daily rainfall data in Paraíba-Brazil from 1995–2015. Acta Sci. Technol. 2019, 41, 37186. [Google Scholar] [CrossRef]
  28. Pu, Z.; Kalnay, E. Numerical Weather Prediction Basics: Models, Numerical Methods, and Data Assimilation. In Handbook of Hydrometeorological Ensemble Forecasting; Duan, Q., Pappenberger, F., Wood, A., Cloke, H.L., Schaake, J.C., Eds.; Springer: Berlin/Heidelberg, Germany, 2019; pp. 67–97. [Google Scholar] [CrossRef]
Figure 1. Flow chart tables describes the structure of the nested JSON weather data extracted from the OpenWeatherMap API and the list of variables it contains.
Figure 1. Flow chart tables describes the structure of the nested JSON weather data extracted from the OpenWeatherMap API and the list of variables it contains.
Data 07 00106 g001
Figure 2. Describe the list of weather parameters drawn from Brazilian National Institute of Meteorology (Instituto Nacional de Meteorologia (INMET)).
Figure 2. Describe the list of weather parameters drawn from Brazilian National Institute of Meteorology (Instituto Nacional de Meteorologia (INMET)).
Data 07 00106 g002
Figure 3. Shows the extent of how OpenWeatherMap API deviates from baseline observations in INMET; Recife is represented by panels (A,B), corresponding to comparisons made for temperature by wet and dry, respectively; Campina Grande is represented by panels (C,D), corresponding to comparisons made for temperature by dry and wet season, respectively. The dashed lines provide an indication for how the points deviate from the centre 0.
Figure 3. Shows the extent of how OpenWeatherMap API deviates from baseline observations in INMET; Recife is represented by panels (A,B), corresponding to comparisons made for temperature by wet and dry, respectively; Campina Grande is represented by panels (C,D), corresponding to comparisons made for temperature by dry and wet season, respectively. The dashed lines provide an indication for how the points deviate from the centre 0.
Data 07 00106 g003
Figure 4. Shows the extent of how OpenWeatherMap API deviates from baseline observations in INMET; Recife is represented by panels (A,B), corresponding to comparisons made for humidity by wet and dry, respectively; Campina Grande is represented by panels (C,D), corresponding to comparisons made for humidity by dry and wet season, respectively. The dashed lines provide an indication for how the points deviate from the centre 0.
Figure 4. Shows the extent of how OpenWeatherMap API deviates from baseline observations in INMET; Recife is represented by panels (A,B), corresponding to comparisons made for humidity by wet and dry, respectively; Campina Grande is represented by panels (C,D), corresponding to comparisons made for humidity by dry and wet season, respectively. The dashed lines provide an indication for how the points deviate from the centre 0.
Data 07 00106 g004
Figure 5. Taylor diagram illustrates the statistical comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) against baseline measurements of OpenWeatherMap API (i.e., day-0). Recife is represented by panels (A,B), which show analysis for temperature stratified wet and dry, respectively; Campina Grande is represented by panels (C,D), which show analysis for temperature stratified wet and dry, respectively. The x–y axes are standard deviation, arc represents correlation between the lead time prediction and baseline, and circular contours are the root-mean-squared error differences.
Figure 5. Taylor diagram illustrates the statistical comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) against baseline measurements of OpenWeatherMap API (i.e., day-0). Recife is represented by panels (A,B), which show analysis for temperature stratified wet and dry, respectively; Campina Grande is represented by panels (C,D), which show analysis for temperature stratified wet and dry, respectively. The x–y axes are standard deviation, arc represents correlation between the lead time prediction and baseline, and circular contours are the root-mean-squared error differences.
Data 07 00106 g005
Figure 6. Taylor diagram illustrates the statistical comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) against baseline measurements of OpenWeatherMap API (i.e., day-0). Recife is represented by panels (A,B), which show analysis for humidity stratified wet and dry, respectively; Campina Grande is represented by panels (C,D), which show analysis for humidity stratified wet and dry, respectively. The x–y axes are standard deviation, arc represents correlation between the lead time prediction and baseline, and circular contours are the root-mean-squared error differences.
Figure 6. Taylor diagram illustrates the statistical comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) against baseline measurements of OpenWeatherMap API (i.e., day-0). Recife is represented by panels (A,B), which show analysis for humidity stratified wet and dry, respectively; Campina Grande is represented by panels (C,D), which show analysis for humidity stratified wet and dry, respectively. The x–y axes are standard deviation, arc represents correlation between the lead time prediction and baseline, and circular contours are the root-mean-squared error differences.
Data 07 00106 g006
Table 1. Shows the variability in the mean differences between the OpenWeatherMap API and INMET observations stratified by Wet and Dry seasons for Recife and Campina Grande.
Table 1. Shows the variability in the mean differences between the OpenWeatherMap API and INMET observations stratified by Wet and Dry seasons for Recife and Campina Grande.
Dry Season
Mean Difference (SD)
Wet Season
Mean Difference (SD)
Recife: Temperature−0.35 (±1.47) (p > 0.05)0.85 (±1.86) (p > 0.05)
Recife: Humidity1.03 (±7.73) (p < 0.001)−4.16 (±7.62) (p < 0.01)
Campina Grande: Temperature−0.67 (±1.58) (p > 0.05)−1.14 (±1.56) (p > 0.05)
Campina Grande: Humidity−0.87 (±9.23) (p < 0.05)3.19 (±7.52) (p < 0.01)
Table 2. Shows the reported standard deviation, correlation coefficient and RMSE differences derived from Taylor diagram analysis for the comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) forecasts against baseline measurements (day-0) of OpenWeatherMap API.
Table 2. Shows the reported standard deviation, correlation coefficient and RMSE differences derived from Taylor diagram analysis for the comparison between 1-day (purple), 2-day (green), 3-day (orange) and 4-day (blue) forecasts against baseline measurements (day-0) of OpenWeatherMap API.
Temperature aWet SeasonDry Season
Day-1
(Purple)
Day-2
(Green)
Day-3
(Orange)
Day-4
(Blue)
Day-1
(Purple)
Day-2
(Green)
Day-3
(Orange)
Day-4
(Blue)
Recife (See Figure 5A) (See Figure 5B)
 Correlation0.770.760.740.740.920.920.920.91
 RMSE0.970.991.011.020.540.550.540.55
Campina Grande (See Figure 5C) (See Figure 5D)
 Correlation0.790.800.780.780.930.920.910.90
 RMSE1.711.691.751.711.611.651.671.69
Humidity bWet SeasonDry Season
Day-1
(Purple)
Day-2
(Green)
Day-3
(Orange)
Day-4
(Blue)
Day-1
(Purple)
Day-2
(Green)
Day-3
(Orange)
Day-4
(Blue)
Recife (See Figure 6A) (See Figure 6B)
 Correlation0.710.670.590.580.860.830.810.82
 RMSE5.095.375.685.763.443.814.043.99
Campina Grande (See Figure 6C) (See Figure 6D)
 Correlation0.760.770.700.700.920.900.900.91
 RMSE7.427.348.178.158.448.859.129.14
a Standard deviation estimates for baseline temperature (Celsius) OWM (at day-0) are Recife ±1.46 (wet season) and ±1.31 (dry season); and in Campina Grande ±2.59 (wet season) ±4.13 (dry season). b Standard deviation estimates for baseline humidity (%) OWM (at day-0) are Recife ±6.74 (wet season) and ±6.64 (dry season); and in Campina Grande ±10.5 (wet season) ±20.1 (dry season).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Musah, A.; Dutra, L.M.M.; Aldosery, A.; Browning, E.; Ambrizzi, T.; Borges, I.V.G.; Tunali, M.; Başibüyük, S.; Yenigün, O.; Moreno, G.M.M.; et al. An Evaluation of the OpenWeatherMap API versus INMET Using Weather Data from Two Brazilian Cities: Recife and Campina Grande. Data 2022, 7, 106. https://doi.org/10.3390/data7080106

AMA Style

Musah A, Dutra LMM, Aldosery A, Browning E, Ambrizzi T, Borges IVG, Tunali M, Başibüyük S, Yenigün O, Moreno GMM, et al. An Evaluation of the OpenWeatherMap API versus INMET Using Weather Data from Two Brazilian Cities: Recife and Campina Grande. Data. 2022; 7(8):106. https://doi.org/10.3390/data7080106

Chicago/Turabian Style

Musah, Anwar, Livia Màrcia Mosso Dutra, Aisha Aldosery, Ella Browning, Tercio Ambrizzi, Iuri Valerio Graciano Borges, Merve Tunali, Selma Başibüyük, Orhan Yenigün, Giselle Machado Magalhaes Moreno, and et al. 2022. "An Evaluation of the OpenWeatherMap API versus INMET Using Weather Data from Two Brazilian Cities: Recife and Campina Grande" Data 7, no. 8: 106. https://doi.org/10.3390/data7080106

Article Metrics

Back to TopTop