1. Introduction
Geographic Information Systems (GIS) are now essential for analyzing the complex interactions among health, environment, and urban security. By integrating spatial, temporal, and multivariate data, GIS drives major advances in spatial epidemiology, healthcare accessibility, disease surveillance, and crime prevention. However, the field remains fragmented, marked by heterogeneous data quality, diverse study designs, and uneven methodological standards.
Although public health and urban security address different problem structures, both depend on spatial determinants, territorial inequalities, and the distribution of risks across populations and infrastructures. The studies reviewed here show that both domains use GIS to identify vulnerable groups, map exposure patterns, and support decisions in geographically heterogeneous environments. Public health research typically operates at national and regional scales, focusing on healthcare accessibility and epidemiological surveillance, whereas security studies concentrate on localized risks, such as crime hotspots, surveillance technologies, and critical infrastructure protection. Examining these domains together provides an integrated understanding of how spatial analysis supports risk detection, resource allocation, and preventive interventions across territorial scales. It also reveals methodological asymmetries and shared challenges in data quality, reproducibility, and operational scalability.
Rationale for Integrating Public Health and Urban Security
The joint analysis of public health and urban security is grounded in their strong conceptual and methodological interdependence within spatial research. Both fields operate on the same territorial unit of analysis (the urban environment); share structural determinants, such as population density, mobility networks, infrastructure quality, segregation, and deprivation; and rely on overlapping GIS techniques, including hotspot detection, spatial autocorrelation, cluster analysis, and accessibility modeling. In both domains, spatial risks emerge from inequalities in exposure, vulnerability, and access to essential services. This convergence has been widely recognized by international frameworks led by the World Health Organization (WHO) through the Healthy Cities Program [
1], by UN-Habitat through the Safer Cities Programme [
2], and by the United Nations Office on Drugs and Crime (UNODC) through the Global Program on Safer Cities [
3]. These institutional frameworks consistently highlight that epidemiological and security risks interact within the same urban systems and therefore require integrated spatial responses.
Recent studies show that GIS effectively capture spatial patterns of health inequities and risk distribution across diverse populations [
4,
5,
6], but persistent methodological and contextual limitations continue to constrain the generalizability of these findings, especially in low- and middle-income settings where data completeness and comparability remain problematic [
7,
8]. In urban security, spatial modeling approaches have advanced predictive analytics [
9,
10], but they also face criticism for ethical concerns, methodological biases, and limited replicability [
11,
12].
Although several systematic reviews have examined GIS applications in specific sectors—such as healthcare accessibility, environmental health risks, or crime hotspot analysis—these evidence syntheses remain domain-specific and do not integrate public health and urban security within a unified analytical framework. Because the present study was designed as a primary SM + SR focusing exclusively on empirical GIS research, previous reviews were not included in the analytical corpus. Importantly, our comprehensive search strategy did not identify any systematic mapping or systematic review that jointly analyzed GIS applications across both domains, underscoring the novelty and necessity of the integrated approach adopted here.
In response to these challenges, this study introduces an integrated pipeline combining Systematic Mapping (SM) and Systematic Review (SR). The pipeline characterizes the available evidence, identifies recurring strengths and limitations, and highlights critical research gaps. As an added contribution, the study applies internationally recognized frameworks—PICOS (Population, Intervention, Comparison, Outcome, Study type), as recommended in PRISMA guidelines [
13,
14], for public health, and SPICE (Setting, Perspective, Intervention, Comparison, Evaluation), following its formal definition for context-driven evidence synthesis [
15], for urban security—to ensure the traceability, comparability, and reproducibility of the results.
The methodological framework is organized into three complementary levels: descriptive, intermediate, and advanced—corresponding to the needs of early-stage researchers (ESRs), mid-career researchers (MCRs), and senior researchers (SRs). This structure delivers a comprehensive synthesis while facilitating knowledge transfer among diverse professional and scientific audiences, ultimately strengthening the relevance of GIS-based evidence in both health and urban security domains.
Previous evidence synthesis studies rigorously applied PRISMA and PICOS to ensure methodological transparency and reproducibility in fields such as medical image processing and sensor-based cancer detection [
16,
17]. These works established a strong methodological foundation and demonstrated the value of systematic evidence synthesis in biomedical research. Building on this foundation, the present study expands the scope to the broader health domain, offering an extended analytical perspective and addressing research gaps that the current literature has yet to resolve.
In this study, GIS are framed as programmable analytical environments that integrate vector and raster operations, spatial statistics, and open-data workflows. Within the PRISMA–PICOS/SPICE framework, we explicitly document the software environments used (ArcGIS, QGIS, R, Python, and SaTScan) and their coordinate reference systems (CRSs) to ensure geocomputational reproducibility. This strategy builds a technological bridge between spatial modeling and its applications in public health and urban security, where transparency and replicability remain fundamental principles for long-term methodological synthesis.
2. Baseline Benchmark of the Systematic Mapping and Review Corpus
2.1. Public and Private Data Sources
The following summary presents the main public databases used in the publications included in this research. We report each dataset according to its producing agency, year of release, and official documentation to ensure transparency and traceability.
DBO49—The DHS Program. The program provides country-specific survey reports with their own bibliographic references; it does not assign a single DOI or generic citation to the dataset. Therefore, researchers must cite the country, year, and producing agency. (
https://www.dhsprogram.com/data/available-datasets.cfm, accessed on 1 December 2025). Ethiopia DHS 2016. Ethiopia Demographic and Health Survey 2016. Women aged 15–49 increased their use of antenatal care from skilled personnel from 27% in 2000 to 62% in 2016; although, only 32% completed four or more visits. Skilled providers frequently performed blood pressure checks (75%) and blood tests (73%), followed by urine analysis and nutritional counseling (66%). Institutional deliveries rose from 5% (2000) to 26% (2016), while home births declined from 95% to 74%. Within two days after delivery, 17% of women and 13% of newborns received postnatal care. Barriers to accessing medical care decreased from 96% in 2005 to 70% in 2016 [
18]. India NFHS-5 (2019–21). International Institute for Population Sciences (IIPS) and ICF. India National Family Health Survey (NFHS-5). Fieldwork collected data in two phases: Phase I (June 2019–January 2020), covering 17 states and 5 Union Territories, and Phase II (January 2020–April 2021), covering 11 states and 3 Union Territories. The survey collected information from 636,699 households, 724,115 women, and 101,839 men [
19]. Uganda DHS 2016. Uganda Bureau of Statistics (UBOS) and ICF. Uganda Demographic and Health Survey 2016. Kampala, Uganda, and Rockville, Maryland, USA: UBOS and ICF. The survey progress in family-planning indicators: modern contraceptive use among married women was increased from 14% (2000–2001) to 35% (2016), with injectables being identified as the most common method. However, discontinuation rates remained high (45% stopped within 12 months, mostly due to health concerns or side effects). Total demand for family planning rose from 54% to 67%, but modern methods met only 52% of this demand. Unmet need persisted (28% of married and 32% of sexually active single women). Among non-users, 64% expressed an intention to use contraceptives in the future [
20]. Cameroon DHS 2018. Cameroon Demographic and Health Survey 2018. The survey reports relatively high knowledge of HIV prevention, but reveals marked inequalities by education and gender. While 70% of individuals aged 15–49 acknowledged that condom use and monogamy reduce infection risk, only 42% of uneducated men knew this, compared to 85% with higher education. Regarding mother-to-child transmission, 71% of women and 68% of men recognized pregnancy as a risk period, and three-quarters of women reported awareness of antiretroviral use. Behavioral indicators show that 22% of women and 38% of men reported non-marital sex in the past year, with condom use limited (43% in women and 63% in men). Lifetime sexual partners differed by gender (4.2 in women vs. 9.7 in men). Although nearly 90% knew where to get tested, 29% of women and 43% of men had never been tested; among those who had been tested, 40% of women and 34% of men did so in the previous year. Coverage of HIV counseling during antenatal care reached 55% of pregnant women [
21].
DBO41—IBGE, Brazil Census 2010. Sinopse do Censo Demográfico 2010. Brazil reached 5565 municipalities in 2010, with higher concentration in the Northeast (32%) and Southeast (30%). Minas Gerais stood out as the state with the largest number of municipalities (853) (
https://www.ibge.gov.br/novo-portal-destaques.html, accessed on 1 December 2025) [
22].
DBO8—WorldPop Hub. Estimated total number of people per grid-cell. The dataset provides GeoTIFF files at a 3 arc-second resolution (approx. 100 m at the equator), using a Geographic Coordinate System (WGS84). Each pixel reports the estimated number of people, derived through Random Forest-based dasymetric redistribution (
https://hub.worldpop.org/project/list, accessed on 1 December 2025) [
23].
In contrast to these public datasets, 29 studies used private or licensed datasets, with access restricted to the producing institutions or the participating research teams. This restriction inherently limits independent replication and reduces methodological transparency. Within this group, some studies used proprietary experimental datasets generated by the authors, which strengthened the contextual validity of findings by ensuring direct control over data collection and processing, while simultaneously restricting external verification and complicating cross-study comparability.
Beyond the qualitative characterization of the most frequently used datasets, the systematic mapping quantified the distribution of public data sources across the 65 included articles.
Figure 1 summarizes the top-10 public datasets by number of unique articles. The distribution is dominated by large-scale demographic survey data from the DHS Program (DB049), which accounts for 24.2% of the studies, followed by DB06 with 15.2%. A second group of public datasets—including DB019, DB025, DB029, DB041, DB051, and DB08—each contributes approximately 9.1% of the articles, while DB01 represents 6.1%. This distribution highlights the central role of nationally and internationally harmonized survey and census-related datasets in GIS-based research on public health and urban security, reflecting both their accessibility and their suitability for comparative and reproducible spatial analyses.
GIS provide a methodological framework that integrates, manages, and transforms georeferenced data from multiple sources. In this systematic mapping and review, software platforms are reported as part of the technological ecosystems identified in the reviewed studies; therefore, specific software versions are not consistently available across the literature and are not reported. As shown in
Figure 2 and detailed in
Table 1, the framework links software ecosystems to analytical domains (Health and Urban Safety) and spatial techniques, establishing the technological baseline for the review.
In public health, researchers use GIS to conduct spatio-temporal pandemic analyses [
24], optimize pre-hospital coverage [
5], interpolate pollutants such as PM2.5 [
6], and assess vector-borne disease risk (e.g., dengue) [
25]. In the security domain, researchers apply GIS to map crime, identify hotspots, and develop predictive modeling [
9,
26,
27], as well as to protect critical infrastructures and strengthen territorial resilience [
28]. The decision-support potential of GIS depends on data quality and resolution, as well as the robustness of applied models, including the translation of analytical outputs into operational interventions [
12,
29,
30,
31,
32].
2.2. Technological Benchmark: Integration of Software and Spatial Techniques
This subsection establishes the technological baseline of the analytical ecosystem identified across the corpus of 65 studies. It characterizes the diversity of software environments and spatial analytical approaches used to address health and security-related problems through GIS. The analysis integrates three complementary dimensions: (i) the type of GIS and computational environments employed; (ii) the methodological domains in which they operate; and (iii) the spatial techniques most frequently applied within each ecosystem.
The corpus reveals a balanced yet heterogeneous technological landscape in which proprietary GIS and statistical packages (e.g., ArcGIS, IBM SPSS, Stata) coexist with open-source environments (QGIS, R, Python, GeoDa, SaTScan). Each platform is associated with specific analytical strengths: For instance, ArcGIS and QGIS dominate in cartographic visualization and network-based accessibility analysis, GeoDa and R support inferential spatial statistics (e.g., Moran’s I, LISA, GWR), and SaTScan specializes in cluster detection using Poisson or Bernoulli models. Python environments extend this ecosystem through automated processing, machine learning, and spatial data integration workflows.
Overall, this technological benchmark consolidates a descriptive foundation for understanding how GIS environments, computational tools, and spatial methodologies converge across domains. By clarifying these associations, it provides the conceptual groundwork for assessing reproducibility and methodological consistency in subsequent sections of this study. Importantly, this benchmark also establishes the structural context for the inferential analyses developed in
Section 4.2.4, where the spatial techniques identified here (e.g., Moran’s I, Getis–Ord Gi
∗, SaTScan, GWR, OLS, Poisson models) are quantitatively evaluated according to their statistical significance, effect strength, and robustness tiers (Effect–canonical vs. Performance domains).
3. Methodology: Fused Pipeline SM→SR
Based on the PICOS and SPICE frameworks, the methodological design formulated the research questions directly from the dataset fields (title, year, journal, thematic area, analytical technique, geographical area, and application scenario). These questions were organized by researcher levels (ESR, MCR, SR) and translated into operational variables directly extractable from the dataset, ensuring transparency and reproducibility (
Table 2).
Which software tools, platforms, and libraries are most frequently reported in GIS-based health and public-safety studies, and how concentrated is their usage?
3.1. Information Sources and Search Strategy
The systematic mapping protocol incorporated technical criteria derived from the PICOS framework for public health and the SPICE framework for urban security, which guided the operationalization of population, spatial context, outcomes, and evaluation dimensions during the search process. Based on these components, search strings were constructed using predefined keywords combined with the logical operator OR, maximizing sensitivity while maintaining specificity and ensuring full traceability of the bibliographic corpus.
The search was limited to established scientific repositories (Elsevier, IEEE, MDPI, Springer, ACM, Taylor & Francis, SciELO, PLOS, Wiley, SAGE, Frontiers Media, Springer Nature, BMJ, Hindawi, ASCE, Emerald, and JMIR) for the period 2019–first quarter of 2025, applied to titles, abstracts, and keywords.
Table 3 summarizes the distribution of records across publishers and SM phases, and the last column reports the first systematic review filter (
).
PICOS (Health). Example search string:
(“spatial analysis” OR “GIS” OR “geographic information system”) AND (“epidemiology” OR “mortality” OR “risk” OR “health” OR “COVID-19”) AND (“sensitivity” OR “accuracy” OR “correlation” OR “mse”) AND (“deep learning” OR “machine learning”) AND (“monitoring” OR “disease”)
SPICE (Security). Example search string:
(“spatial analysis” OR “GIS” OR “spatial distribution”) AND (“crime” OR “crime mapping” OR “urban security”) AND (“drones” OR “AI” OR “facial recognition”) AND (“accuracy” OR “sensitivity” OR “correlation”) AND (“effectiveness” OR “crime reduction”)
This search strategy ensures transparency and reproducibility in line with PRISMA guidelines. The retrieved records formed the initial SM evidence base. All records were exported to a reference manager, where duplicates were removed and metadata standardized. Previous systematic mapping and review studies in biomedical domains [
16,
17] established the methodological foundation extended in this study.
3.2. Eligibility Criteria and PRISMA Flow
The selection for the SM was based exclusively on metadata (title, abstract, and keywords). We use a compact notation in the eligibility tables, where
and
denote the PICOS and SPICE components, respectively, and “Cin/Cex” indicate inclusion or exclusion criteria. A master criterion was defined for the SR (
), and the literal criteria are reported in
Table 4,
Table 5 and
Table 6.
PRISMA 2020 Statement and Review Registration
The full process of identification, screening, and selection followed the PRISMA 2020 recommendations for systematic and scoping reviews, adapted to a fused methodological pipeline from SM to SR.
Figure 3 (PRISMA 2020 Flow Diagram for the fused SM→SR pipeline, 2019–first quarter of 2025) transparently documents the initial records identified (
), duplicate removal (10.3%), the successive thematic and documentary eligibility stages (
–
), and finally the master filter for the systematic review (
), which resulted in the final corpus of 65 included studies (59.1%) and 45 excluded studies (40.9%). This sequence is fully consistent with the figures reported in
Table 3 and with the criteria tables presented in the following subsections.
In accordance with international best-practice guidelines for evidence synthesis (PRISMA 2020, JBI, and COS–OSF), this review was preregistered in the Open Science Framework (OSF) under the official identifier DOI:
https://doi.org/10.17605/OSF.IO/MAQBH.
The preregistration includes the complete methodological protocol, search strings, literal PICOS/SPICE inclusion and exclusion criteria (
Table 4,
Table 5 and
Table 6), and the PRISMA flow diagram adapted to the fused SM→SR pipeline. This registration ensures full traceability, transparency, and reproducibility of the process in compliance with Open Science principles.
These compact inclusion and exclusion criteria, summarized in
Table 4,
Table 5 and
Table 6, established the operational definitions for each stage of the SM pipeline (
–
) and for the master filter of the systematic review (
). The detailed application of these criteria across filters, together with the quantitative results of included and excluded records, is presented in the following subsection.
3.3. Classification and Data Extraction: Filters –
The systematic mapping followed five consecutive filters (–), each addressing specific research questions aligned with ESR, MCR, and SR levels.
3.3.1. Filter 1 (): Technical Verification and Deduplication
A total of 7106 records were identified. After exact (DOI/PMID) and fuzzy (title–author–year) deduplication, metadata normalization, and title/abstract screening by two independent reviewers, 6371 records (89.7%) were retained and 735 (10.3%) were excluded. This stage ensured basic documentary validity and provided the input for thematic filtering.
3.3.2. Filter 2 (): Thematic and Methodological Eligibility
From the 6371 valid references, we selected studies that explicitly addressed public health or citizen security in a territorial context and were relevant to the population, setting, and policy dimensions defined by the PICOS and SPICE frameworks. In total, 6211 records (97.5%) were included and 160 (2.5%) were excluded. This stage informed ESR and MCR questions related to geographical contexts and thematic areas.
3.3.3. Filter 3 (): Full-Text Content Verification
At this stage, full texts were screened to confirm the population, objectives, and study setting, and to verify the presence of spatial comparison terms, metrics, and datasets consistent with the PICOS/SPICE components. A total of 2914 references (46.9%) were retained and 3297 (53.1%) were excluded.
3.3.4. Filter 4 (): Scoring Matrix (1–4)
The 2914 retained articles were evaluated using a 1–4 scoring matrix based on dataset availability, quantitative indicators, magnitude of effects, and evidence of validation or replicability. The results were as follows: 145 articles (4.98%) with Score 4, 902 (30.95%) with Score 3, 1317 (45.20%) with Score 2, and 550 (18.87%) with Score 1. Only the most informative articles progressed to the next stage.
3.3.5. Filter 5 (): Consistency Check and Snowballing
A manual consistency check verified completeness, replicability, and minimum reporting standards (indexed source, English language, and explicit methodology and discussion). Backward and forward citation tracking (snowballing) was then applied. From the 145 Score 4 articles, 97 were retained and 13 additional studies were identified, yielding a corpus of 110 articles.
3.3.6. Filter 1 (): Master Inclusion/Exclusion
The master criteria for the SR selected studies that reported concrete spatial techniques, quantitative metrics, replicable procedures, and potential impact on public health or citizen security policies. Domains outside the scope (e.g., agriculture, hydrology, energy) or studies without quantitative metrics were excluded. This reduced the 110 articles to 65 (59.1%), which define the final SR evidence base (
Table 7).
3.4. Software Identification and Reproducibility Verification
Building upon the technological benchmark described in
Section 2.2, this stage assessed the reproducibility and traceability of the computational environments reported across the SR corpus. Each included study was systematically coded for explicit mentions of GIS software tools, analytical platforms, programming languages, CRS, and—when available—dataset access routes (repositories, URLs, institutional portals). This coding scheme was designed to distinguish between generic software mentions and fully traceable technical descriptions, ensuring methodological transparency and auditability.
A total of 38 studies (58.5 % of the SR corpus; 38/65) met the criteria forreproducible version metadata. These studies explicitly reported at least the software name and version, together with either the CRS/projection used or a clearly identifiable dataset source. Most articles in this group employed combinations of ArcGIS [
4,
5,
11,
12,
29,
31,
32,
33,
34,
35,
36,
37,
38,
39,
40,
41,
42,
43,
44,
45,
46,
47], QGIS [
25,
48,
49,
50,
51,
52], R [
4,
12,
25,
36,
41,
42,
43,
53,
54,
55], and SaTScan [
7,
9,
10,
41,
48]. Python-based workflows were also documented [
26,
35,
38,
39,
46,
56,
57,
58,
59], often associated with automated geoprocessing and visualization routines.
This reproducible subset represents approximately 80 % of the studies that reported any GIS or computational environment (73.1 % of the total corpus, according to ). In practical terms, most studies that disclosed software usage also provided sufficient technical detail to reconstruct the analytical workflow. To our knowledge, few GIS-based evidence syntheses quantify reproducibility at this level of granularity; most prior reviews limit reporting to the software name without systematic extraction of versioning, CRS specifications, or data-access information. Thus, the present review contributes an additional layer of methodological rigor by explicitly documenting technical traceability across the corpus.
Importantly, the 58.5 % figure is presented as a descriptive transparency indicator, aligned with PRISMA and OSF best practices, rather than as a threshold for inclusion or exclusion. Its purpose is to enhance the auditability of the review by identifying how many studies provide fully traceable analytical environments.
Collectively, the integration of the technological benchmark (
Section 2.2) and the reproducibility verification (this subsection) provides the methodological infrastructure for the quantitative synthesis developed in
Section 4.2.3 and
Section 4.2.4. These subsequent sections analyze the concentration of software tools, libraries, and computational environments, and evaluate inferential spatial techniques (e.g., Moran’s I, Getis–Ord Gi
∗, SaTScan, GWR, OLS, Poisson models) in terms of effect strength, robustness, and statistical consistency across domains.
4. Results
4.1. Evidence Landscape (Mapping Layer—SM)
The results are presented according to the nine research questions derived from the fused pipeline, which combines SM and SR. To ensure transparency and traceability, each question is addressed sequentially, following the logic of the PICOS and SPICE frameworks. The answers are reported with quantitative evidence extracted from the dataset (percentages, frequencies, and distributions), complemented by figures and tables for visual support. This structure is consistent across the four SM-related questions (Q1SM–Q4SM) and the five SR-related questions (Q1SR–Q5SR), thus enabling both a descriptive overview and an analytical synthesis of GIS applications in health and urban security.
4.1.1. Early-Stage Researcher (ESR)—
In terms of cumulative study region distribution, the evidence shows a marked concentration of studies in Asia and North America, with 22 records (33.8%) and 11 (16.9%), respectively. Africa ranks third with eight studies (12.3%), followed by Europe with seven (10.8%) and South America with six (9.2%). Minor contributions correspond to Western Asia with four studies (6.2%), Continents with three (4.6%), and finally Global, Middle East, Republic of Korea, and Saudi Arabia, each with one study (1.5%). These results confirm a strong pattern of geographical concentration of study sites, where a limited number of regions dominate the corpus (
Figure 4).
The domain-based analysis reveals that most studies are oriented towards public health, accounting for approximately 86% of the total corpus. Citizen security represents about 11%, with visible contributions in Asia, Europe, and South America, while non-core or residual studies remain below 5%. In Asia, of the twenty-two studies identified, four (18.2%) address citizen security topics, whereas the remaining eighteen (81.8%) focus on public health. In North America, all eleven studies are devoted exclusively to public health. Africa shows a similar pattern, with 100% of its eight studies centered on health. Europe presents a dual profile: five of its seven studies focus on health, while two explicitly address citizen security. In South America, although public health remains predominant, citizen security represents a comparatively larger share than in other regions. These findings confirm that the surge in 2023–2024 is driven primarily by public health studies conducted in Asia, North America, and Africa, while citizen security remains in its early stages, though it is beginning to consolidate in Europe and South America (
Figure 5).
Taken together, these findings reveal a clear concentration of study site geography, with Asia, North America, and Africa dominating the distribution of cases, particularly in public health. By contrast, citizen security remains marginal, with only incipient contributions in Europe and South America. This asymmetry highlights opportunities and gaps for future agendas, especially regarding regional equity and thematic diversification. The next subsection () builds upon these results by examining application scenarios, specifically how urban, rural, and regional settings are distributed between health and security studies.
4.1.2. Early-Stage Researcher (ESR)—
The distribution of the analyzed settings shows clear differences between the health and security domains. In health-related studies (), national (38.5%) and regional (38.5%) contexts comprise the majority of articles, particularly in areas of healthcare access and epidemiological surveillance. In contrast, security studies () are primarily developed in urban and subnational/regional contexts, encompassing subdomains, such as crime prediction, surveillance technologies, occupational monitoring, and critical infrastructure (13.8% combined).
The analysis indicates that health studies tend to adopt broader and more scalable population-based approaches, whereas security research is oriented toward specific and localized territorial vulnerabilities. This contrast suggests that, while public health seeks solutions applicable at national and regional system levels, citizen security emphasizes the management of immediate risks in concrete territories.
Figure 6 and
Table 8 complement these results with the full breakdown of categories, percentages, and article identifiers, ensuring data traceability and reproducibility.
Table 8 summarizes the refined categories of health (HS-), security (SC-), and other domains, including illustrative descriptions that directly reflect the operationalization of study objectives. These descriptions exemplify the types of populations, infrastructures, datasets, and environmental factors analyzed across the literature, thus linking the conceptual categories of the research with the empirical evidence reported.
Overall, these findings highlight a structural divergence between the two domains: public health studies privilege national and regional strategies aimed at strengthening healthcare systems, whereas citizen security studies remain constrained to localized territorial risks. This divergence has methodological implications for replicability and policy design, since the scalability of health-focused GIS models contrasts with the case-specific and context-dependent nature of security-oriented applications.
An additional consideration concerns the continental distribution of the studies. Although our setting-based classification highlights differences between national–regional (health) and urban–subnational (security) orientations, the corpus is unevenly distributed across world regions, with Africa and South America markedly underrepresented. This geographical imbalance limits the global generalizability of the findings, as many health-system and security-risk dynamics observed in well-represented regions (e.g., North America, Europe, and parts of Asia) may not transfer to contexts with different structural conditions, data infrastructures, and socio-spatial vulnerabilities. Likewise, the scarcity of studies from Africa and South America constrains bias-mitigation efforts, since methodological patterns and model performance cannot be contrasted across diverse epidemiological, socio-economic, and environmental settings. These gaps represent a structural limitation of the available evidence rather than a methodological choice of the review protocol, and they underscore the need for increased regional coverage in future GIS-based research on health and security.
4.1.3. Mid-Career Researcher (MCR)—
To contextualize the findings, we first provide an overview of how the study type methodological design combinations are distributed and concentrated across the corpus.
Figure 7 presents a Pareto-based visualization in which bars represent the relative share of each combination within the 65 reviewed studies, and the black cumulative line quantifies concentration using 95% Wilson binomial confidence intervals. This dual representation transparently identifies the methodological core and the surrounding exploratory configurations.
At the level of specific combinations, five categories exceed the 7% threshold: C_E5 (Quantitative/Ecological/Observational, 13.8%), C_E4 Quantitative/Ecological / Epidemiological, 10.8%), C_O1 (Quantitative/Observational/Cross-sectional, 9.2%), C_M3 (Quantitative/Modeling/Simulation, 7.7%), and C_O2 (Quantitative/Observational / Epidemiological, 7.7%). Together, these five combinations account for 49.2% of the corpus, defining the methodological nucleus of GIS-based research in health and security contexts.
Expanding to the top ten combinations, including, among others, C_Ex2 (Experimental/Validation), C_L1 (Longitudinal), and C_M1 (Modeling), the cumulative coverage surpasses 70%. This pattern highlights the prevalence of empirical, model-based, and comparative frameworks that dominate applied GIS research across both domains.
Beyond this core, the long tail comprises more than twenty additional combinations with frequencies of 3.1% or less. Although individually marginal, these configurations reflect methodological experimentation and adaptation to diverse geographic, epidemiological, and social contexts.
Collectively, this tail represents about 30% of the corpus and supports a skewed 80/20 distribution consistent with Pareto dynamics. The quantitative indicators (HHI = 0.067, Gini = 0.367, Shannonn = 0.926) corroborate the moderate concentration and sustained methodological diversity of the field.
Overall, these results confirm the coexistence of a stable methodological core ensuring comparability and replicability across studies and an experimental peripheral front, where less frequent designs are tested, some of which may consolidate in future work. The explicit reporting of confidence intervals in
Figure 7, in conjunction with the structured synthesis in
Table 9, reinforces the statistical validity of these observations and provides a reproducible framework for the critical interpretation of evidence.
Taken together, these results highlight a strong methodological bias toward observational cross-sectional approaches in GIS-based research on health and security. While such designs enable broad descriptive mapping, they limit the capacity for causal inference and experimental validation. The next subsection () shifts the focus from methodological design to the publication channels, examining which journals and categories publish the greatest volume of studies and which concentrate the high-impact core of GIS-related health and security research.
4.1.4. Senior Researcher (SR)—
The dataset (
) reveals a dual structure in the publication patterns of GIS research applied to health and security. A limited set of journals and parent categories dominate the overall volume, while high impact (Q1 & Active JCR) contributions are selectively concentrated in more specialized outlets.
Journals (J1–J12). PLOS ONE (J1) is the leading outlet in absolute volume, accounting for approximately 18% of the dataset (
). However, only about 28% of its contributions qualify as core (Q1 & Active JCR).
IEEE Access (J2) follows with approximately 12% (
), mostly focused on applied computing and security. In contrast, the
International Journal of Public Health (J3) and
Environmental Science and Pollution Research (J4) publish smaller shares of the dataset (≤6% each) but exhibit much higher internal core proportions (≥60% of their articles). This pattern indicates that volume leaders ensure visibility, whereas high-impact contributions cluster in domain-focused outlets (
Figure 8;
Table 10).
Categories (C1–Ck). At the thematic level, Epidemiology and Surveillance (C1) emerges as the largest parent category, comprising approximately 20% of all articles. Public Health Infrastructure (C2) and Environment and Climate (C3) together add nearly 28%, with C2 standing out for a markedly higher internal core share (approximately 55%), compared with C1 (approximately 35%). Security-related categories (e.g., crime prediction, surveillance technologies, and critical infrastructure) contribute fewer articles (≤10% combined) but often place a substantial proportion in high-impact outlets (
Figure 9;
Table 10).
Overall, the results reveal a structural duality: (i) high-volume diffusion is concentrated in multidisciplinary journals such as PLOS ONE (J1) and broad thematic categories such as Epidemiology and Surveillance (C1), whereas (ii) high-impact concentration is anchored in domain-specific outlets (J3, J4) and thematic niches such as Public Health Infrastructure (C2). Consequently, although PLOS ONE and Epidemiology and Surveillance lead in overall production, the most influential GIS-based knowledge is concentrated in specialized journals, such as the International Journal of Public Health and Environmental Science and Pollution Research, together with focused categories like Public Health Infrastructure.
4.2. Critical Deepening (Systematic Review Layer—SR)
4.2.1. Early-Stage Researcher (ESR)—
To contextualize the results, we first provide a global view of how metrics are distributed and concentrated across the corpus. The associated
Figure 10 and
Table 11 offer, respectively, (i) a visual read of domain-specific concentration patterns and (ii) a family-structured synthesis, enabling identification of the methodological core and its peripheral diversity.
Table 11 organizes metrics by families and reports, for each family, the absolute count of unique articles (
N), the proportion relative to the corpus of 65 articles (
), the domain-level breakdown (Health, Public Safety), and article-level traceability through Ref. This version omits Rank and cumulative percentage, prioritizing family-wise and domain-wise comparisons while preserving explicit reproducibility.
At the level of individual metrics, Moran’s I value (MRV) leads with
(14.0%), followed by coefficient
(CFF) with
(21.5%) and prevalence ratio (PRC) with
(15.4%). In aggregate, the top five metrics (MRV, CFF, PRC, CNT, SMTR) account for 40.8% of all occurrences (
Figure 10), and the top ten (through
R2) exceed 60%, evidencing strong concentration around spatial autocorrelation, regression models, and determination measures.
The domain breakdown shows a predominance of the Health domain, which concentrates most mentions among leading metrics, whereas Public Safety contributes more selectively (e.g.,
CNT,
CLS,
ACC) consistent with the asymmetry observed in
Figure 10. Finally, the “long tail” of less frequent metrics (≥30 metrics with
) constitutes roughly 25% of total mentions, suggesting exploratory use without consolidation; this structure reinforces the Pareto-like (80/20) pattern and motivates the standardization of a reduced set of core metrics to ensure cross-study comparability.
4.2.2. Early-Stage Researcher (ESR)—
The temporal analysis of
Figure 11 shows the evolution in the use of standardized metrics, expressed as proportions of unique articles normalized across periods (≤2019, 2020–2021, 2022–2023, ≥2024).
Table 12 summarizes absolute and relative frequencies, normalized by the number of articles in each period, together with 95% confidence intervals (CIs) computed using Wilson’s method. This approach was chosen for its robustness when handling heterogeneous sample sizes and low-frequency events, thus avoiding the biases of the classical Wald approximation.
A marked intensification of accessibility related metrics is observed, particularly Travel time and Two-Step Floating Catchment Area (2SFCA), which together account for more than 40% of all articles since 2020 (95% CI: 35.2–45.1%). In contrast, classical measures such as Euclidean distance and Kernel Density Estimation (KDE) remain relatively stable at around 10% or less, while other metrics (e.g., network decay functions and accessibility coverage ratios) display more erratic trajectories with greater uncertainty. Overall, five metrics (Travel time, 2SFCA, Euclidean distance, Coverage ratios, KDE) form a canonical core consistently concentrating between 60% and 70% of studies per period, reflecting a strong methodological concentration. Less frequent metrics constitute a long tail of exploratory alternatives that emerge sporadically but without significant consolidation. This observed heterogeneity in peripheral metrics reinforces the need for careful statistical treatment when assessing temporal trends. According to the methodological design,
Figure 11b displays, for each period, the percentage of articles reporting each metric, together with 95% Wilson binomial confidence intervals. These intervals are computed using the denominator
, defined as the number of unique articles in the corresponding period. The 95% Wilson confidence interval is:
where
denotes the sample proportion,
x is the number of articles reporting the metric, and
corresponds to a 95% confidence level.
4.2.3. Mid-Career Researcher (MCR)—
Research question: Which software tools, platforms, and libraries are most frequently reported in GIS-based health and public safety studies, and how concentrated is their usage?
The quantitative synthesis of the sixty-five studies included in this review identified thirteen distinct software tools, platforms, and computational environments used in Geographic Information Systems (GIS) research applied to health and public-safety contexts. These tools were classified into six methodological categories—desktop GIS platforms, statistical environments, programming languages, geospatial portals, spreadsheets, and application programming interfaces (APIs), reflecting the diversity of analytical frameworks reported across the corpus.
Figure 12 and
Table 13 jointly present the empirical evidence and technical descriptions supporting this classification.
A clear pattern of methodological concentration emerges from these results. More than eighty percent of the studies rely on only six software tools, indicating a highly clustered analytical ecosystem. ArcGIS (55.4%) and GeoDa (24.6%) dominate this landscape, jointly accounting for nearly seventy percent of the corpus. Together, these tools constitute the core software environment supporting GIS-based analyses in health and urban safety research. A secondary cluster comprising Stata (18.5%), R (15.4%), Python (13.8%), and IBM SPSS (12.3%) reflects a gradual transition toward reproducible and script-oriented computational workflows. Additional tools, including Microsoft Excel (7.7%), Google Earth (4.6%), and specialized APIs or libraries (e.g., Leaflet, Earth Engine, MATLAB; each approximately 1.5%), play complementary roles in data handling and visualization.
From a quantitative perspective, the distribution shown in
Figure 12 follows a Pareto-type pattern, whereby fewer than five tools account for approximately 80% of reported implementations. The Herfindahl–Hirschman Index (HHI = 0.36) indicates a moderately high level of methodological concentration. Moreover, the non-overlapping 95% Wilson confidence intervals for ArcGIS (43.2–67.2%) and GeoDa (16.0–35.0%) statistically support their predominance. These indicators, documented in the corresponding audit spreadsheet, provide a reproducible baseline for assessing methodological centralization in future comparative studies.
The joint interpretation of
Figure 12 and
Table 13 highlights a dual structure within the analytical ecosystem. Proprietary GIS platforms, particularly ArcGIS, primarily support cartographic and institutionalized workflows characterized by licensed environments and standardized procedures. In contrast, open and semi-open tools such as GeoDa, R, and Python reflect an increasing emphasis on computational reproducibility, spatial inference, and transparent modeling practices. Stata and SPSS occupy an intermediate position, maintaining classical biostatistical workflows while interfacing with spatial analysis through external GIS components.
Overall, this distribution reflects a shift in the role of GIS from a predominantly cartographic technology toward a more computationally oriented analytical framework. Recent publications increasingly reference programming libraries such as GeoPandas, NumPy, and scikit-learn, suggesting a growing integration of statistical computing and geospatial analysis. This convergence reinforces reproducibility and algorithmic transparency as central elements of contemporary GIS-based research in health and public safety.
Beyond descriptive outcomes, the Q3SR analysis provides an auditable and quantitative lens on the technological dependency of current spatial practice. The combined evidence of the 80% Pareto threshold, the Herfindahl–Hirschman Index (0.36), and the 95% Wilson intervals establishes a transparent framework for measuring methodological concentration. Documenting these metrics within the audit sheet ensures reproducibility and enables future systematic reviews to evaluate the pace of methodological diversification across time and domains.
4.2.4. Mid-Career Researcher (MCR)—
To evaluate inferential capacity at the MCR level, we organized the 89 techniques identified across the 65 included studies into two complementary analytical axes: (i) Effect–canonical, representing the strength of statistical evidence (primarily through p-values and canonical statistics); and (ii) Robustness/Performance, representing consistency across repeated tests and model performance indicators. Of the 89 techniques, 18 appeared exclusively in Effect–canonical, 51 exclusively in Robustness/Performance, and 20 contributed to both axes. This results in 38 techniques informing the Effect–canonical axis and 71 informing Robustness/Performance. Within Effect–canonical, all techniques reported p-values, and 17 used canonical metrics (RR, LLR, , z, Moran’s I, NNI, Getis–Ord G). Non-canonical metrics were standardized when necessary for comparison. In the Robustness/Performance axis, 11 techniques also produced canonical evidence, while 7 reported auxiliary performance metrics (, AIC, BIC, RMSE/MAE/MSE, AUC).
Figure 13 synthesizes both perspectives.
Figure 13a highlights the top 20 Effect–canonical techniques. The coefficient of variation (CV), Moran’s I (v), and the
t-test exhibit the strongest effects (truncated at
). SaTScan and Moran’s I (t) form a second tier (
3–4), consistent with their use in cluster detection and temporal autocorrelation. A third tier—including ANOVA, connectivity and density analyses, Getis–Ord Gi
∗, GLM, OLS, Poisson models, and related techniques—exhibits moderate effect strength (
2–3). In the figure, filled markers denote techniques contributing to both analytical axes, whereas hollow markers denote techniques providing effect-only evidence.
Figure 13b summarizes robustness, displaying the proportion of significant results (
) with 95% Wilson confidence intervals. Line color and thickness encode robustness tier (R1 high, R2 moderate, R3 limited), enabling identification of techniques with consistent inferential support across studies.
Table 14 details the correspondence between techniques, related methods, evidence level, axis membership, and supporting references. Together, the figure and table indicate which techniques provide strong and reproducible inferential evidence (e.g., Moran’s I variants, SaTScan, OLS, Poisson, Spatial Lag, and Spatial Error Models) and which play more contextual or complementary roles within the MCR-level literature.
4.2.5. Senior Researcher (SR)—
Research question: Which methodological strengths, limitations, and research gaps predominate in the GIS health/public-safety literature, and which Limitation Gap pairs show non-random co-occurrence, indicating priorities to improve replicability and validity?
To answer the guiding question, we proceed as follows:
Figure 14a outlines the methodological strengths;
Figure 14b synthesizes the recurring limitations;
Figure 14c elevates these to field-level research needs; and
Figure 14d with
Table 15 operationalizes the links between limitations and gaps through article-level co-occurrences (
), effect sizes, and BH adjusted evidence, producing a coherent path from data to implications.
Figure 14a quantifies the methodological strengths reported in the corpus (
), establishing a baseline for interpreting subsequent results. Planning and multimap frameworks appear in 9/65 studies (13.8%), while recurrent advantages observed in 3/65 studies (4.6%) include Bayesian smoothing of rates, biomarker GPS integration, and low-cost scalable approaches. Additional strengths reported in 2/65 studies (3.1%) include environmental and clinical census integration, GIS statistical integration, large sample sizes, national coverage, and residual or network-based models, with automation and replicability reported in 1/65 studies (1.5%). Collectively, these strengths point toward standardized pipelines, multi-source data integration, and stabilized small-area estimates, for example through Bayesian smoothing. This baseline anticipates mechanisms supporting replicability (traceable procedures) and validity (internal and external), which are contrasted below against identified limitations and gaps.
Second,
Figure 14b identifies bottlenecks that counteract these strengths and guide interpretation. Missing variables or data gaps (11/65; 16.9%) and under-registration (7/65; 10.8%) are the most frequently reported limitations, followed by limited data availability and selection bias (5/65; 7.7% each), data quality dependency and outdated or proxy variables (4/65; 6.2%), and four limitations reported in 3/65 studies (4.6%), including heterogeneous data quality, uncertainty estimates, cross-sectional designs, and model-specific assumptions. Accordingly, incompleteness and under-registration threaten internal validity through measurement error and selection bias and hinder reproducibility; at the same time, data scarcity and source bias constrain external validity and cross-context generalization.
Third, to translate these risks into actionable needs,
Figure 14c prioritizes research gaps. Data quality and demographic integration lead with 17/65 studies each (26.2%), followed by institutional factors (11/65; 16.9%), temporal analysis (10/65; 15.4%), context replication (9/65; 13.8%), and spatial models (5/65; 7.7%), with additional targeted gaps reported in 1–2/65 studies (1.5–3.1%). Accordingly, the most urgent priorities include standardizing data quality and metadata, implementing reproducible demographic linkage, increasing transparency around institutional constraints, and strengthening spatial and spatiotemporal modeling.
Finally,
Figure 14a–d, together with
Table 15, operationalize the link between weaknesses and needs by reporting limitation–gap co-occurrences (
), together with effect sizes and evidence. Three pairs reach
(4.6%): under-registration → institutional factors (lift
,
,
,
; IDs: [
43,
65,
71]), limited data availability → data quality (lift
,
,
,
; IDs: [
31,
35,
59]), and limited data availability → demographic integration (same metrics; IDs: [
31,
63,
71]).
In addition, two
signals (3.1%) exhibit high enrichment: privacy and ethics constraints → institutional factors (lift
,
,
,
; IDs: [
30,
70]) and small sample size → institutional factors (same metrics; IDs: [
66,
70]). These associations are treated as exploratory due to their lower support. No top-10 pair meets the threshold
(BH), and they are therefore interpreted as auditable hypotheses. Article identifiers are reported directly in the table to ensure full traceability.
Statistical note for
Table 15. For each Limitation–Gap pair, a
contingency table is constructed over the article universe, with counts
a (both),
b (limitation only),
c (gap only), and
d (neither), such that
. Columns report
,
, the enrichment (lift),
and the binary effect size,
Two-sided Fisher exact tests are applied for inference, and multiplicity is controlled using the Benjamini–Hochberg procedure to obtain . In exploratory screening, pairs are flagged as robust when and ; otherwise, results should be interpreted cautiously and validated with additional evidence.
4.3. Integrated Synthesis (SM + SR)
The integrated synthesis connects the descriptive patterns identified through the SM with the quantitative findings derived from the SR. This cross-layer perspective reveals how global production trends and geographical concentrations are linked to specific methodological practices and outcomes, such as accessibility analysis, spatial autocorrelation, and predictive risk modeling. The comparative analysis further highlights relationships between thematic domains, application scenarios, technical parameters, and measurable outcomes, providing a coherent picture of both methodological strengths and persistent gaps. In practical terms, this integration consolidates evidence on accessibility inequities, epidemiological surveillance, and crime reduction, thereby reinforcing the translational value of GIS-based research for evidence-informed public health and urban security policies.
5. Discussion
This study represents the first systematic synthesis in which SM and SR of GIS applications in the domains of health and urban security are jointly integrated. A methodological ecosystem concentrated on accessibility metrics, spatial autocorrelation, and cluster detection, with a strong geographical bias toward Asia and North America. Such concentration is shown to reflect both the consolidation of GIS as a strategic analytical tool and the persistent underrepresentation of highly vulnerable regions where data availability and quality are unevenly distributed. The integrated analysis of public health and urban security is further supported by the fact that both domains are operate within the same territorial environment and are driven by overlapping spatial determinants—such as population density, mobility networks, infrastructure quality, segregation, and deprivation—as well as common GIS analytical techniques (e.g., hotspot detection, spatial autocorrelation, accessibility modeling, and cluster analysis). In this way, methodological coherence of examining both sectors within a unified SM + SR pipeline is reinforced, and the extent to which spatial inequalities jointly shape epidemiological and security risks across urban systems is highlighted.
From a methodological standpoint, this work extends the application of the PRISMA, PICOS, and SPICE frameworks by incorporating quality and reproducibility filters that reinforce evidence traceability [
16,
17]. The analytical pipeline (7106 → 65 articles) and the scoring matrix (1–4) provide a transparent and replicable pathway for future geospatial reviews. However, the dominance of cross-sectional observational designs and the limited use of advanced models—such as Bayesian inference, machine learning, or hybrid approaches—confirm persistent methodological gaps within the field.
At an applied level, the results underscore the potential of GIS to assess health inequalities and to inform evidence-based urban security policies. The high spatial resolution and modeling versatility of GIS confirm their suitability for territorial planning and public-policy design. Nevertheless, technological dependency on proprietary platforms (ArcGIS, QGIS) and the coexistence between open datasets, such as DHS, IBGE, or WorldPop, and institutionally restricted databases continue to constrain full replicability and scientific equity. Addressing these challenges requires regulatory frameworks that promote interoperability, data openness, and standardized protocols for spatial validation.
Looking forward, advancing the field will require the opening of multisectoral data sources, the adoption of computationally efficient models suitable for resource-limited environments, and the integration of GIS within reproducible ecosystems combining artificial intelligence and high-performance computing (HPC). Such integration will strengthen transparency, methodological comparability, and the long-term societal impact of geospatial science.
6. Conclusions
The results of this study confirm that GIS have consolidated as strategic analytical tools for addressing public health and urban security challenges, although their application remains characterized by significant geographical and thematic inequalities. Research output is heavily concentrated in Asia and North America, whereas Africa and South America remain underrepresented. This territorial imbalance limits the generalizability of evidence and highlights the need to strengthen research capacities in vulnerable regions.
Methodologically, the fused SM + SR pipeline applied in this study combined the PICOS and SPICE frameworks, systematically filtering an initial corpus of 7106 references down to 65 core articles. This structured process enhanced reproducibility and enabled the differentiation between robust and weak methodological contributions.
The analysis identified a predominance of cross-sectional observational designs and a methodological core centered on accessibility and spatial autocorrelation indicators. While these approaches offer comparability across contexts, the limited adoption of advanced models—such as Bayesian inference, machine learning, and hybrid AI–GIS pipelines constrains predictive and causal insights.
The software ecosystem remains highly dependent on ArcGIS and QGIS, complemented by emerging open-source environments such as R, Python, and SaTScan. This configuration promotes standardization but raises concerns about technological dependency and uneven access to computational resources. Expanding interoperability and promoting open geospatial infrastructures are, therefore, essential steps toward greater scientific equity.
From a scientific standpoint, the most consistent strengths observed across studies include high spatial and temporal resolution, scalability, and multivariate precision. Conversely, key limitations involve the scarcity of openly available data, the predominance of static designs, and the absence of reproducible validation protocols. Addressing these gaps requires methodological standardization, multisource data integration, and computationally efficient modeling suited for resource-limited settings.
In summary, this fused SM + SR framework provides a transparent, replicable, and scalable foundation for synthesizing GIS-based evidence in health and urban security. Advancing toward open data, reproducible validation workflows, and spatially explicit artificial intelligence will be critical to enhancing predictive capacity and policy relevance, particularly in high-vulnerability regions where GIS can directly inform equitable and data-driven decision making.