Application of CCTV Methodology to Analyze COVID-19 Evolution in Italy
Abstract
:1. Introduction
2. Background
2.1. Background on Community Discovery
2.2. COVID-19 Spread in Italy
3. CCTV Methodology
- 1.
- Building of the similarity matrix: CCTV function takes as input the datasets consisting of the collected data for all regions. Then, the user can select: (i) the kind of aggregate data on which analysis will perform, i.e., Hospitalised with Symptom data, and (ii) the observation period on which he/she want to focus the analysis. After that, CCTV function applies the Wilcoxon test to compute the pair-wise similarity among the regions regarding the selected kind data, i.e., Hospitalised with Symptom data related to Abruzzo vs. Hospitalised with Symptom data related to Basilicata. Then, CCTV builds a similarity matrix that records the p-value resulting of the Wilcoxon test for each (i,j) region. At the end of this step, CCTV enable to save as output the built similarity matrix in a text format.
- 2.
- Mapping similarity matrices to networks-Network building: Starting from the similarity matrix, the CCTV function builds a network related the selected data.
- 3.
- Network analysis over time: CCTV function builds the networks related to the selected aggregate data in different time intervals. At the end of this step, CCTV function plots the built network. At the end of this step, CCTV enable to save as output the network in image format.
- 4.
- Community detection: CCTV applied a community detection algorithm to mine the communities on the network built in the previous step. At the end of this step, CCTV function plots the detected communities and it enables to save as output the comunities in image format.
Algorithm 1: CCTV Methodology Pseudocode |
Data: D // Dataset Data: w // Similarity Measure Result: C (Network Communities ) SimilarityMatrix (D,w); SimilarityNetwork (M); CommunityDetection (N) Return (C); |
3.1. Dataset
- Hospitalised with Symptoms, as regards the daily the number of COVID-19 patients in the hospital;
- Intensive Care, as regards the daily number of COVID-19 patients in Intensive Care Units;
- Total Hospitalised, as regards the daily sum of Hospitalised with Symptoms and Intensive Care measured;
- Home Isolation, as regards the daily number of subjects in quarantine at home;
- Total Currently Positive, relating to the daily number of COVID-19 positive subjects;
- New Currently Positive, as regards the daily number of COVID-19 positive subjects;
- Discharged/Healed, relating to the daily number healed or discharged from hospital subjects;
- Deceased, as regards the daily number of deaths;
- Total Cases, as regards the daily number of subjects affected by COVID-19;
- Swabs, as regards the daily number of test swab carried on COVID-19 positive subjects and on suspected COVID-19 positivity.
3.2. Building of Similarity Matrices
3.3. Converting Similarity Matrix to Network
3.4. Network Analysis over Time
3.5. Community Detection and Temporal Evolution
4. Results and Discussion
4.1. Trend of Hospitalised with Symptoms Network Communities
4.2. Trend of Intensive Care Network Communities
4.3. Trend of Total Hospitalised Network Communities
4.4. Trend of Home Isolation Network Communities
4.5. Trend of Total Currently Positive Network Communities
4.6. Trend of New Currently Positive Network Communities
4.7. Trend of Discarded or Healed Network Communities
4.8. Trend of Deceased Network Communities
4.9. Trend of Total Cases Network Communities
4.10. Trend of Swab Network Communities
4.11. Impact of Containment Measures and Vaccination Campaign
4.12. Comparison with State-of-Art COVID-19 Research
5. Conclusions and Future Work
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Turki, H.; Khrouf, K. Data Analysis of Coronavirus CoVID-19: Study of Spread and Vaccination in European Countries. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 1. [Google Scholar] [CrossRef]
- Milano, M.; Zucco, C.; Cannataro, M. COVID-19 Community Temporal Visualizer: A new methodology for the network-based analysis and visualization of COVID-19 data. Netw. Model. Anal. Health Inform. Bioinform. 2021, 10, 1–38. [Google Scholar] [CrossRef]
- Agapito, G.; Milano, M.; Cannataro, M. Parallel Network Analysis and Communities Detection (PANC) Pipeline for the Analysis and Visualization of COVID-19 Data. Parallel Process. Lett. 2021, 32, 2142002. [Google Scholar] [CrossRef]
- Fortunato, S.; Hric, D. Community detection in networks: A user guide. Phys. Rep. 2016, 659, 1–44. [Google Scholar] [CrossRef] [Green Version]
- Wang, Z.; Li, Z.; Yuan, G.; Sun, Y.; Rui, X.; Xiang, X. Tracking the evolution of overlapping communities in dynamic social networks. Knowl.-Based Syst. 2018, 157, 81–97. [Google Scholar] [CrossRef]
- Palla, G.; Barabási, A.L.; Vicsek, T. Quantifying social group evolution. Nature 2007, 446, 664–667. [Google Scholar] [CrossRef] [Green Version]
- Asur, S.; Parthasarathy, S.; Ucar, D. An event-based framework for characterizing the evolutionary behavior of interaction graphs. ACM Trans. Knowl. Discov. Data (TKDD) 2009, 3, 1–36. [Google Scholar] [CrossRef]
- Bródka, P.; Kazienko, P.; Kołoszczyk, B. Predicting group evolution in the social network. In Proceedings of the International Conference on Social Informatics, Alexandria, VA, USA, 14–16 December 2012; Springer: Berlin/Heidelberg, Germany, 2012; pp. 54–67. [Google Scholar]
- Clauset, A.; Newman, M.E.; Moore, C. Finding community structure in very large networks. Phys. Rev. E 2004, 70, 066111. [Google Scholar] [CrossRef] [Green Version]
- Blondel, V.D.; Guillaume, J.L.; Lambiotte, R.; Lefebvre, E. Fast unfolding of communities in large networks. J. Stat. Mech. Theory Exp. 2008, 2008, P10008. [Google Scholar] [CrossRef] [Green Version]
- Yang, Z.; Algesheimer, R.; Tessone, C.J. A comparative analysis of community detection algorithms on artificial networks. Sci. Rep. 2016, 6, 30750. [Google Scholar] [CrossRef] [Green Version]
- Pons, P.; Latapy, M. Computing communities in large networks using random walks. In Proceedings of the International Symposium on Computer and Information Sciences, Istanbul, Turkey, 26–28 October 2005; Springer: Berlin/Heidelberg, Germany, 2005; pp. 284–293. [Google Scholar]
- Rosvall, M.; Bergstrom, C.T. An information-theoretic framework for resolving community structure in complex networks. Proc. Natl. Acad. Sci. USA 2007, 104, 7327–7331. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Rosvall, M.; Bergstrom, C.T. Maps of random walks on complex networks reveal community structure. Proc. Natl. Acad. Sci. USA 2008, 105, 1118–1123. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Girvan, M.; Newman, M.E. Community structure in social and biological networks. Proc. Natl. Acad. Sci. USA 2002, 99, 7821–7826. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Freeman, L.C. Centrality in social networks conceptual clarification. Soc. Netw. 1978, 1, 215–239. [Google Scholar] [CrossRef] [Green Version]
- Reichardt, J.; Bornholdt, S. Statistical mechanics of community detection. Phys. Rev. E 2006, 74, 016110. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Raghavan, U.N.; Albert, R.; Kumara, S. Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 2007, 76, 036106. [Google Scholar] [CrossRef] [Green Version]
- Gregory, S. Finding overlapping communities in networks by label propagation. New J. Phys. 2010, 12, 103018. [Google Scholar] [CrossRef]
- Van Dongen, S. Graph clustering via a discrete uncoupling process. SIAM J. Matrix Anal. Appl. 2008, 30, 121–141. [Google Scholar] [CrossRef]
- Newman, M.E. Finding community structure in networks using the eigenvectors of matrices. Phys. Rev. E 2006, 74, 036104. [Google Scholar] [CrossRef] [Green Version]
- Lancichinetti, A.; Radicchi, F.; Ramasco, J.J.; Fortunato, S. Finding statistically significant communities in networks. PLoS ONE 2011, 6, e18961. [Google Scholar] [CrossRef]
- Chakrabarti, D.; Kumar, R.; Tomkins, A. Evolutionary clustering. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, 20–23 August 2006; pp. 554–560. [Google Scholar]
- Li, X.; Wu, B.; Guo, Q.; Zeng, X.; Shi, C. Dynamic community detection algorithm based on incremental identification. In Proceedings of the 2015 IEEE International Conference on Data Mining Workshop (ICDMW), Atlantic City, NJ, USA, 14–17 November 2015; pp. 900–907. [Google Scholar]
- Coccia, M. The relation between length of lockdown, numbers of infected people and deaths of Covid-19, and economic growth of countries: Lessons learned to cope with future pandemics similar to COVID-19 and to constrain the deterioration of economic system. Sci. Total Environ. 2021, 775, 145801. [Google Scholar] [CrossRef]
- Milano, M. Computing Languages for Bioinformatics: R. In Encyclopedia of Bioinformatics and Computational Biology; Gribskov, M., Nakai, K., Schonbach, C., Eds.; Elsevier: Amsterdam, The Netherlands, 2019; Volume 1, pp. 889–895. [Google Scholar]
- Csardi, G.; Nepusz, T. The igraph software package for complex network research. Interjournal Complex Syst. 2006, 1695, 1–9. [Google Scholar]
- Gehan, E.A. A generalized Wilcoxon test for comparing arbitrarily singly-censored samples. Biometrika 1965, 52, 203–224. [Google Scholar] [CrossRef] [PubMed]
- Agapito, G.; Guzzi, P.H.; Cannataro, M. Challenges and Opportunities for Visualization and Analysis of Graph-Modeled Medical Data. Math. Comput. Sci. 2017, 20171000182. [Google Scholar] [CrossRef] [Green Version]
- De Sousa, F.B.; Zhao, L. Evaluating and comparing the igraph community detection algorithms. In Proceedings of the 2014 Brazilian Conference on Intelligent Systems, Washington, DC, USA, 18–22 October 2014; pp. 408–413. [Google Scholar]
- Wang, P.; Lu, J.a.; Jin, Y.; Zhu, M.; Wang, L.; Chen, S. Statistical and network analysis of 1212 COVID-19 patients in Henan, China. Int. J. Infect. Dis. 2020, 95, 391–398. [Google Scholar] [CrossRef]
- Renardy, M.; Eisenberg, M.; Kirschner, D. Predicting the second wave of COVID-19 in Washtenaw County, MI. J. Theor. Biol. 2020, 507, 110461. [Google Scholar] [CrossRef]
- Kuzdeuov, A.; Baimukashev, D.; Karabay, A.; Ibragimov, B.; Mirzakhmetov, A.; Nurpeiissov, M.; Lewis, M.; Varol, H.A. A Network-Based Stochastic Epidemic Simulator: Controlling COVID-19 with Region-Specific Policies. medRxiv 2020. [Google Scholar] [CrossRef]
- Kumar, A. Modeling geographical spread of COVID-19 in India using network-based approach. medRxiv 2020. [Google Scholar] [CrossRef]
- Herrmann, H.A.; Schwartz, J.M. Using network science to propose strategies for effectively dealing with pandemics: The COVID-19 example. medRxiv 2020. [Google Scholar] [CrossRef] [Green Version]
- Reich, O.; Shalev, G.; Kalvari, T. Modeling COVID-19 on a network: Super-spreaders, testing and containment. medRxiv 2020. [Google Scholar] [CrossRef]
- Chaudhary, L.; Singh, B. Community detection using unsupervised machine learning techniques on COVID-19 dataset. Soc. Netw. Anal. Min. 2021, 11, 1–9. [Google Scholar] [CrossRef] [PubMed]
- Gibbs, H.; Nightingale, E.; Liu, Y.; Cheshire, J.; Danon, L.; Smeeth, L.; Pearson, C.A.B.; Grundy, C.; LSHTM CMMID COVID-19 Working Group; Kucharski, A.J.; et al. Detecting Behavioural changes in human movement to inform the spatial scale of interventions against COVID-19. PLoS Comput. Biol. 2021, 17, e1009162. [Google Scholar] [CrossRef] [PubMed]
- Coccia, M. The impact of first and second wave of the COVID-19 pandemic in society: Comparative analysis to support control measures to cope with negative effects of future infectious diseases. Environ. Res. 2021, 197, 111099. [Google Scholar] [CrossRef] [PubMed]
Abruzzo | Basilicata | Bolzano | Calabria | Campania | Emilia | Friuli | Lazio | Liguria | Lombardia | Marche | Molise | Piemonte | Puglia | Sardegna | Sicilia | Toscana | Trento | Umbria | ValleAosta | Veneto | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Abruzzo | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.1 | 0 | 0 | 0 | 0 | 0 |
Basilicata | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Bolzano | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Calabria | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.15 | 0 | 0 |
Campania | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0.39 | 0 | 0 | 0.73 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Emilia | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0.32 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Friuli | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.94 | 0 | 0 | 0 |
Lazio | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Liguria | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.27 | 0 | 0 | 0 | 0 |
Lombardia | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Marche | 0 | 0 | 0 | 0 | 0.39 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0.19 | 0 | 0.17 | 0 | 0 | 0 | 0 | 0 |
Molise | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Piemonte | 0 | 0 | 0 | 0 | 0 | 0.32 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Puglia | 0 | 0 | 0 | 0 | 0.73 | 0 | 0 | 0 | 0 | 0 | 0.19 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Sardegna | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
Sicilia | 0.1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.17 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
Toscana | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
Trento | 0 | 0 | 0 | 0 | 0 | 0 | 0.94 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
Umbria | 0 | 0 | 0 | 0.15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0.6 | 0 |
ValleAosta | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.6 | 1 | 0 |
Veneto | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Milano, M.; Agapito, G.; Cannataro, M. Application of CCTV Methodology to Analyze COVID-19 Evolution in Italy. BioTech 2022, 11, 33. https://doi.org/10.3390/biotech11030033
Milano M, Agapito G, Cannataro M. Application of CCTV Methodology to Analyze COVID-19 Evolution in Italy. BioTech. 2022; 11(3):33. https://doi.org/10.3390/biotech11030033
Chicago/Turabian StyleMilano, Marianna, Giuseppe Agapito, and Mario Cannataro. 2022. "Application of CCTV Methodology to Analyze COVID-19 Evolution in Italy" BioTech 11, no. 3: 33. https://doi.org/10.3390/biotech11030033
APA StyleMilano, M., Agapito, G., & Cannataro, M. (2022). Application of CCTV Methodology to Analyze COVID-19 Evolution in Italy. BioTech, 11(3), 33. https://doi.org/10.3390/biotech11030033