Highlights
What are the main findings ?
- Population attributable fraction (PAF) of lung cancer mortality due to groundwater arsenic is modeled.
- The PAF for India is calculated to be 1.4% (95% CI; ±1.8%).
- State-level PAFs vary widely, the highest being 12% for Assam and 8% for West Bengal.
What are the implications of the main findings ?
- Removing groundwater arsenic exposure could reduce lung cancer deaths in India by ~1000/year.
- Groundwater arsenic exposure is a relatively minor risk factor for lung cancer mortality in India.
- Groundwater arsenic is not the main driver of secular increases in lung cancer mortality in India.
Abstract
Chronic consumption of groundwater arsenic is a well-known risk factor for many cancers, notably lung, bladder, kidney and skin cancers, and is recognized as such in many countries, notably India. Indeed, increasing cancer incidence and mortality in India has been ascribed to such exposure. Notwithstanding this, there has hitherto been a dearth of quantitative data on the magnitude and spatial distribution of groundwater arsenic attributable cancer mortality in India. Here, we combined random forest model-derived data on the spatial distribution of groundwater arsenic in India with India census data on populations and groundwater usage and recently published dose–response relationships to address this knowledge gap through a population attributable fraction (PAF) approach. We show that around 1.4% (95% CI; ±1.8%) of all lung cancer mortality in India can plausibly be attributable to exposure to groundwater arsenic. Whilst this is a substantial range of values, it is too small to sensibly indicate any plausibility of the narrative of groundwater arsenic exposure being primarily responsible for the increased cancer incidence and mortality in India over the last few decades. Nevertheless, the modeled spatial distribution of groundwater arsenic exposure PAF of lung cancer mortality in India may inform public policy aimed at reducing environment-related detrimental health outcomes.
1. Introduction
Arsenic in groundwater utilized for drinking or cooking has long been identified as a major risk to human health with both carcinogenic (e.g., lung, bladder, kidney, skin cancers) [1,2,3,4,5,6,7] and non-carcinogenic (e.g., cardiovascular disease) detrimental health outcomes [5,7,8,9,10,11] being important aspects of this. After Bangladesh, India is arguably the country that is most impacted by groundwater arsenic with tens of millions of people exposed [12,13,14] to groundwater-derived drinking water with arsenic exceeding the WHO provisional guide value of 10 μg/L [15]. It is widely reported that exposure to groundwater arsenic is a major cause of cancer in India [16,17,18]; indeed, it is further reported or implied [19] that it is responsible for the substantive increased prevalence of many cancers in India at either a national or state level. However, there is a paucity of studies to quantitatively substantiate either of these claims or the extent to which they may be valid, including with respect to different parts of the country.
There are few detailed spatially resolved models of the public health impacts of groundwater arsenic in India. A notable exception is the work of Wu et al. (2021) [20] who determined the spatial distribution of groundwater arsenic attributable cardiovascular disease mortality in India. Wu et al. (2021) [20] used a population attributable fraction (PAF) approach [21], which is readily extended to modeling cancer mortality as an end-point provided that suitable dose–response data were available. Lung cancers are a major group of cancers (i) that cause over 50,000 deaths per year in India [22] and for which there is not only extensive evidence of a causal link of arising mortality with exposure to arsenic via drinking water [23,24,25,26] but for which dose–response relationships have recently been published in systematic reviews involving meta-analyses or other studies [25,26] enabling the PAF approach to be extended to these sequelae.
The aim of this study was to provide a detailed spatially resolved quantification of the model population attributable fraction (PAF) of lung mortality in India attributable to direct or indirect ingestion of groundwater arsenic. Calculation of a PAF readily enables quantification of the relative importance of groundwater arsenic exposure and (collectively) other risks to the spatial distribution of lung cancer health burdens in India. The work is intended as a scoping study, the limitations of which are discussed along with suggestions for further work to improve the models.
2. Materials and Methods
2.1. Overall Approach
Secondary data for (i) the spatial distribution of arsenic in groundwater in India; (ii) the spatial distribution of rural and urban population in India; and (iii) the spatial distribution of groundwater usage for drinking water in India were used to (iv) determine the spatial distribution of the magnitude of human exposure to groundwater arsenic across India and then combined with (v) dose–response relationships between lung cancer mortality and drinking water arsenic concentrations and (vi) theoretical equations for population attributable fractions, to (vii) calculate the spatial distribution of the fraction of lung cancer mortality in India attributable to human exposure to groundwater arsenic. In this section, in turn, are described (i) the theoretical basis and equations utilized in the calculations and (ii) the sources of the secondary data utilized in those equations. Unless otherwise stated, all calculations were carried out in Microsoft® Excel® for Microsoft 365 MSO (Version 2511 Build 16.0.19426.20260) 64-bit.
2.2. Theoretical Basis
The population attributable fraction, , for a groundwater contaminant, c, such as arsenic, to a detrimental health outcome, h, such as annual lung cancer deaths, in an administrative area, a, such as a state or district, is related to the relative risk, (x), of the detrimental health outcome, h, due to the contaminant, c, as a function of the contaminant concentration, x, and the administrative area population distribution, (x), and counterfactual (adopted reference) population distribution, (x), with respect to the concentration, x, of contaminant, c, by Equation (1) (with m being the maximum contaminant concentration; modified after [27]):
where the population distribution with respect to the contaminant concentration is not known as a continuous distribution but rather discretely with respect to a series of concentration classes or levels and for which = 1 for the lowest level, then Equation (1) may be simplified to Equation (2) modified after [20,21,28]. Thus, the population attributable fraction, , for a groundwater contaminant, c, such as arsenic, to a detrimental health outcome, h, such as annual lung cancer deaths, in an administrative area, a, such as a country, state or district, can be calculated by Equation (2):
where is the fraction of the total administrative area population exposed to groundwater contaminant level,; n is the number of such groundwater contaminant levels; and is the risk of the detrimental health outcome, h, at the contaminant exposure level, , relative to that, = 1, for the reference exposure level, 1. Implicit assumptions of this approach are that there is no non-zero population attributable fraction of the detrimental health outcome due to the contaminant in the reference exposure level and that is independent of age, sex, genetic and behavioral characteristics. It is noted that all of F, RR and may have, indeed likely do have, different values for different contaminants.
In heterogeneous administrative areas, where groundwater usage may be distinctly different between rural (r) and urban (u) areas, the fraction, (), of the total (t) area population exposed to groundwater contaminant level can be calculated by the following:
where , and are the total, rural, and urban population respectively of the administrative area, a, and and are the fractions of households drinking groundwater in rural and urban sub-areas respectively of the administrative area, a, and calculable by Equations (4) and (5):
where and are the fraction of the households drinking groundwater in rural and urban areas of a district respectively and is the fraction of the total area of the administrative area, a, for which the groundwater contaminant level is . These equations assume that the mean household occupancies are the same in rural and urban sub-areas within a given administrative area and that there is no difference within the given administrative area, a, between the fractional distribution of a given groundwater contaminant level, , of rural and urban areas.
Where groundwater contaminant classification levels are known on a grid of equidimensional pixels or cells, then these data can be converted into values for each contaminant level and administrative area by Equation (6):
where |S| refers to the number of members of the inscribed set; M is the set of all grid cells; is the set of all grid cells within M for which the groundwater contaminant level is and is the set of all grid cells within M for which the centroid is in an administrative area, a.
It follows that the rural, , urban, , and total,, populations exposed to a groundwater contaminant at a given contaminant level, , within a given administrative area, a, may be calculated by the following:
Finally, the total detrimental health outcomes, , attributable to a groundwater contaminant, c, in a given administrative area, a, can be calculated from the population attributable fraction, , by the following:
where is the total detrimental health outcomes, h, irrespective of their cause or risk factors, in the administrative area, a.
2.3. Secondary Data and Data Sources
The district-level distribution of groundwater arsenic concentrations across India was derived from the all-India country-wide 1 km × 1 km scale machine learning model of Wu et al. (2021) based on secondary groundwater arsenic data compiled between 2005 and 2020 [20]. In brief, ref. [20] followed the random forest approach of Podgorski et al. [13] but with a somewhat updated dataset. Predictive variables included those plausibly related to expert-identified processes leading to elevated groundwater arsenic concentrations [20,29,30,31,32]. More details on random forest modeling and justification of it as an effective modeling approach have already been detailed elsewhere [13,20,33,34,35]. Whilst there are also other machine learning models of groundwater arsenic including India as part of the modeling (e.g., [36,37]) or modeling specific areas of India (e.g., [13,33,38,39,40]), we preferred, here, the use of the Wu et al. (2021) [20] model because the adequacy of the model at a country scale was specifically demonstrated through cross-validation approaches and because both granular state-level and district-level concentration data were readily available from the novel pseudo-contouring carried out [20] and which enabled the classification of groundwater arsenic concentrations at 1 km × 1 km scale into 5 levels/classes, viz., level 1: As = 0 to <10 μg/L, level 2: As = ≥10 to <20 μg/L, level 3: As = ≥20 to <50 μg/L, level 4: As = ≥50 to <80 μg/L, and level 5: As = ≥80 μg/L. These 1 km × 1 km classifications were then converted into values of for each class/level for each state or district in India using Equation (6).
Population data, all of total, rural and urban, at all of country, state and district level, were obtained from the India Census of 2011 [41].
Data on groundwater usage as drinking water, and , for rural and urban sub-areas respectively at a district level were obtained from [42], considering hand pump and tubewells/boreholes sources—this corresponds to exposure model 2 of [20] (see Figure 1b of [20] for district-wise groundwater usage as drinking water distribution across India)—and excludes the following categories of drinking water supply: “tap water from treated source”, “tap water from un-treated source”, “covered well”, “un-covered well”, “spring”, “river/canal”, “tank/pond/lake” and “other sources”. None of the excluded sources correspond unequivocally to likely high arsenic groundwater sources and most correspond to sources, notably treated water and surface or near surface waters, that are generally, albeit not always, expected to be low in arsenic.
Secondary dose–response data for lung cancer mortality and groundwater arsenic concentrations were found by a systematic search of English-language systematic reviews with a meta-analysis published over the period 2015–2025 and indexed by Web of Science (All Collections). The search term combination used was as follows: TITLE = arsenic and “lung cancer” and (“systematic review” or “meta-analysis”) and YEARS = 2015 to 2025; and LANGUAGE = English. Of the two papers [25,26] identified by the search, only one paper [26] explicitly included dose–response data for mortality end-points so we have adopted the dose–response data from [26] as follows. Relative risks, , for lung cancer mortality were determined for the mid-point concentration of each groundwater arsenic level/class, viz., 5, 15, 35 and 65 µg/L for levels 1, 2, 3 and 4 respectively and, following the approach of [20], 131 µg/L for level 5. Since explicit dose–response data were not available for these specific concentrations, to enable their calculation over the range of 15–131 µg/L, relevant tabulated dose–response data were fitted using unweighted least squares regression to a second-order polynomial fit of the form:
with
where x is the value of the concentration of groundwater arsenic expressed in units of µg/L.
RR(x) = β0 + β1 x + β2 x2 + ε for x = 15–131 µg/L
RR(x) = 1.00 for x = 0–10 µg/L
Absolute values of lung cancer mortality for India by state or territory for the years 1980 to 2023 were obtained from [43] data for combined tracheal, bronchus and lung cancers—in tabulating those data, we combine the data for the now separate states of Andhra Pradesh and Telangana; we further note that the comparative rarity of tracheal and bronchus cancers in India means that these combined secondary data more than adequately represent lung cancer mortality data without any unacceptable loss of accuracy for the purposes of this paper.
3. Results
3.1. Groundwater Arsenic Hazard and Exposure Distribution
The state-level and district-level distribution of groundwater arsenic across India utilized in this study has previously been tabulated and otherwise presented by Wu et al. (2021) (see their Figure 2e) [20]. Given that, we only briefly summarize here some of the key features of those distributions—notably that relatively high (taken here to be greater than the WHO provisional guide value of 10 µg/L [15]) groundwater arsenic concentrations tend to be more commonly found in the north and north-east parts, notably West Bengal and Assam, of India; and which, coincidentally, tend also to be parts of the country in which hand pumps and tubewells/boreholes are more commonly used for drinking water, particularly in rural areas (for more details, see [20]). The total number of people exposed to high groundwater arsenic is estimated to be around 60 million, a value intermediate between that of Podgorski et al. [13] and Mukherjee et al. (2021) [14], and a value still [20] preferred by us since it is based on a more realistic and granular exposure model than those of the other estimates [13,14]. As noted by [20], the precision of both hazard and exposure data tends to be substantively better at country level and state level than at district level and for that reason, our further calculations of groundwater arsenic exposure PAF for lung cancer mortality is limited to the country level and state level.
3.2. Relative Risks of Lung Cancer Mortality as a Function of Drinking Water Arsenic
The relative risks of lung cancer mortality at the mid-point concentrations of the groundwater arsenic concentration classes/levels utilized in this study are presented in Table 1 together with the best-fit coefficients used to calculate those RR values. The estimated 95% confidence intervals are on the order of ±30% and for one class/level, the lower bound of the 95% confidence intervals is notably below 1.000.
Table 1.
Dose–response relationships between lung cancer mortality and arsenic concentration in drinking water. Pooled relative risks (95% CIs) of lung cancer mortality for the median concentration in each arsenic concentration class/level utilized in this relative to the median concentration in the reference class/level of 0–10 μg/L. Calculated from Equation (11) in the text with β0 = 9.220 × 10−1; β1 = 1.631 × 10−2; and β2 = −4.293 × 10−5; (valid over the range 10–131 μg/L;) and Equation (12) in the text, derived by fitting the tabulated data for As > 10 μg/L of [26] to Equation (11); R2 = 0.99.
3.3. Population Attributable Fraction (PAF) of Lung Cancer Mortality in India Linked to Groundwater Arsenic Exposure
The calculated PAF of lung cancer morality in India linked to groundwater arsenic exposure for each state and the country as a whole is presented in Table 2.
Table 2.
Calculated PAF (population attributable fraction) (with 95% CI) of lung cancer mortality in India and Indian states linked to exposure to groundwater arsenic. Estimates of groundwater arsenic exposure attributable lung cancer deaths in 2011 are uncertain values, calculated from the product of PAF (this study) and secondary published lung cancer mortality data [22].
For India, the PAF value is 1.4% (95% CI: −0.5%, 3.2%), which represents around 700 groundwater arsenic attributable deaths of the approximately 53,000 reported for 2011 by [22] and around 1000/annum for the current (2026) population if the PAF is assumed to remain constant between 2011 and 2026. Notably, the lower bound of the 95% CI is less than 0.0%—this reflects the magnitude of the 95% CIs for the drinking water arsenic–lung cancer mortality dose–response relationship [22] utilized here (see Table 1)—indeed, whilst there are also imprecisions and inaccuracies associated with the modeling of both groundwater arsenic hazard and human exposure to it [20], the negative PAF 95% CI lower bounds not just for India but for many states indicate that these errors are outweighed by those associated with the utilized dose–response relationship.
There is considerable variation in PAFs from state to state. Whilst for many states, the PAF is less than 1%, indeed close to 0.0%, for a number of states, the PAFs are considerably higher than that—notably for Assam (12.0% (95% CI: −3.6%, 23.5%)); West Bengal (7.8% (95% CI: −0.l4%, 14.8%)); Bihar (2.2% (95% CI: −2.0%, 6.0%)) and the Punjab (2.0% (−1.8%, 5.4%)). These elevated values broadly reflect a combination of the known elevated groundwater arsenic concentrations in many parts of those states and the known elevated levels of groundwater usage for drinking and cooking in many parts of those states, and notably in rural areas.
4. Discussion
In this section, limitations of the model and data are briefly discussed before outlining what are some of the implications of the results.
4.1. Limitations of the Study
The results presented here are subject to limitations related to both (a) errors and uncertainties in the input data; and (b) errors associated with the model utilized, in particular, the validity of key model assumptions. These are outlined below with respect to each of the hazard, exposure and dose–response models.
There are some important limitations to consider in our usage of these groundwater hazard models, notably (i) process heterogeneity [20,43] means that the weighting of optimal hazard predictor variables in one part of the modeled area may well not be the optimal weighting in other parts of the modeled area, where different processes controlling groundwater arsenic may be important; in particular, the relative importance of the reductive dissolution of arsenic-bearing Fe-O-H phases, oxidation of arsenian iron sulfides and pH-dependent competitive desorption may vary spatially across India (cf. [12,13,16,31,32]); and (ii) the hazard model adopted is a 2D model rather than a more realistic 3D model [13,20] but for which relevant secondary depth-dependent data are less available and for which modeling is more challenging because of the geological, hydrogeological and hydrobiogeochemical complexity of sub-surface aquifer systems in many parts of India [12,13,16,20,39].
With respect to the exposure model utilized, it is recognized that (i) the omission of springs and some shallow wells may give rise to an under-estimation of groundwater arsenic exposures; (iv) the current state of drinking water supplies (in 2026) will be different from those in 2011, not least of all due to groundwater arsenic mitigations implemented by governments and NGOs; (ii) granular consideration of age, gender, body weight and behavioral factors influencing individual water consumption rates has not been considered and, in particular, the spatial distribution of these across various states in India; and (iii) there have been substantial changes in population since 2011.
With respect to the dose–response model utilized, it is recognized that it is based on epidemiological studies not exclusively focused on India (cf. [28]). Further, the dose–response model utilized is agnostic to sex, age, genetics and behavioral characteristics (e.g., smoking, other forms of tobacco usage). Whilst the PAF approach to a large extent mitigates some of these issues, nevertheless, if there are non-zero interaction terms between arsenic exposure [44] and any of these factors AND there is a substantive spatial variation in these factors, then this might lead to systematic biases in the estimation of model PAF values—such biases would be expected to be much more evident at the smaller district level, so we have restricted our reported calculations to state level and country level. Irrespective of synergistic or antagonistic effects, numerical biases may be introduced by not modeling a continuous distribution of arsenic exposures but rather five categorical classes of exposure. Lastly, we note several studies that have questioned the accuracy of dose–response relationships for various sequelae at low to moderate drinking water arsenic concentrations below 100 μg/L, generally claiming lower (if any) demonstrable health impacts at those concentrations [45,46].
Nevertheless, notwithstanding these limitations, the hazard, exposure and dose–response models used here enable a useful first step in the model of groundwater arsenic exposure PAF of lung cancer mortality in India and which may form the theoretical basis for improved estimates in the future.
4.2. Implications
There are several important public health implications of the results of this study. Firstly, although exposure to groundwater arsenic is clearly a contributing factor to the development and mortality arising from lung cancers in India, its importance is much outweighed by other factors. This is a similar finding to that of [27], who found that for France, all of tobacco usage, alcohol consumption, diet, obesity and infections were substantively more important risk factors, albeit for all cancers, not specifically lung cancer, than was arsenic exposure. Notwithstanding the relatively higher groundwater arsenic concentrations in India than in France, the PAFs calculated here nevertheless also point to other factors being considerably more important than arsenic exposure to the overall burden of lung cancer disease. Whilst remediation of high groundwater arsenic-derived drinking water supplies is clearly indicated for other reasons, for the sole purpose of reducing lung cancer burden, it is evident that efforts might be more cost-effectively placed in other directions (e.g., for effective programs aimed at reducing tobacco usage or reducing obesity).
Further, the magnitude of groundwater arsenic exposure as a risk factor for lung cancer mortality is simply too small, 1.4% (95% CI; ±1.8%), for this exposure to be a major driver of secular increases in lung cancer incidence and mortality in India. The reported incidence of cancer, including lung cancer, has been increasing by several tens of percent over the last decade and is projected to continue to increase by a similar magnitude over the next decade [43,47,48]—instead, this can be largely ascribed to a combination of (i) an aging population; (ii) improved quality of drinking water supplies, particularly with regard to infectious agents, and improved public health services and systems addressing other causes of mortality; and (iii) improved rates of reporting and detection [47,48,49]. Other risk factors for lung cancer in India include tobacco usage [50], occupational exposures, outdoor air pollution [51], indoor air pollution (including through the burning of biofuels) and socioeconomic factors.
Lastly, whilst the PAFs calculated here are relatively small at a country scale, there are regions in India, notably Assam and West Bengal, but also further areas, where the combination of the demonstrated or modeling impacts of groundwater arsenic exposure on public health, not just in relation to lung cancer but also to other detrimental health outcomes, such as cardiovascular disease (CVD) [20], still indicates the importance of remediating or providing mitigations in relation to groundwater arsenic hazard.
5. Conclusions
This study presents, for the first time, a quantitative hazard–exposure–receptor model that shows that exposure to groundwater arsenic plausibly gives rise to around 1.4% of all lung cancer mortality in India. We note the limitations of the model, including with respect to input data accuracy and representativeness and with respect to model assumptions, particularly in relation to the dose–response model adopted. Lastly, we note that the model is reported here at a country and state level rather than at a district level (for which model uncertainties are inevitably higher) and should not—as is typical of such population-level models—be used to estimate health risks for individuals. We highlight that the model could be used to inform public health policy decisions particularly in relation to the management of detrimental health outcomes arising from the use of groundwater as drinking or cooking water.
Author Contributions
Conceptualization, D.A.P.; methodology, software, validation, R.W. and D.A.P.; investigation, D.A.P. and R.W.; data curation, D.A.P.; writing—original draft preparation, D.A.P.; writing—review and editing, R.W. and D.A.P.; supervision and project administration, D.A.P.; funding acquisition, D.A.P. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded in part by the Newton Bhabha Fund, Natural Environmental Research Council (NERC, UK), Engineering and Physical Sciences Research (EPSRC, UK) (NE/R003386/1), and the Department of Science and Technology (DST, India) (DST/TM/INDO-UK/2K17/55(C) and 55(G)) and by an EPSRC IAA Impact Support Fund Award to D.A.P.
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.
Acknowledgments
We thank Qinchuan Xin for his generosity in supporting time for R.W. to further contribute to this work during the tenure of R.W.’s postdoctoral fellowship at SYSU. We thank Michael Berg, DRS Middleton, Abhijit Mukherjee, Joel Podgorski and Dipankar Saha for discussions. We are grateful to members of the Indo-UK Water Quality project, FAR-GANGA (https://www.farganga.org/), for sharing their wisdom and insights. The opinions expressed in this paper, however, do not necessarily reflect those of any of the organizations or individuals whom we acknowledge here. We dedicate this paper to Albert Camus’s Bernard Rieux and the inspiration that “the only means of fighting a plague is … common decency … I don’t know what it means for other people … but in my case, I know that it consists in doing my job.” A draft version of this work was presented [49] at the Geological Society of America CONNECTS conference in San Antonio, Texas, in October 2025—we thank the session organizers, notably Prosun Bhattacharya and Saugata Datta, for the invitation to do so.
Conflicts of Interest
The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.
References
- Currie, A. The role of arsenic in carcinogenesis. Br. Med. Bull. 1947, 4, 402–405. [Google Scholar] [CrossRef] [PubMed]
- Smith, A.H.; Lingas, E.O.; Rahman, M. Contamination of Drinking Water by Arsenic in Bangladesh: A Public Health Emergency. Bull. WHO 2000, 78, 1093–1103. [Google Scholar] [PubMed]
- Ferreccio, C.; Smith, A.H.; Duran, V.; Barlaro, T.; Benitez, H.; Valdes, R.; Aguirre, J.J.; Moore, L.E.; Acevedo, J.; Vasquez, M.I.; et al. Case-Control Study of Arsenic in Drinking Water and Kidney Cancer in Uniquely Exposed Northern Chile. Am. J. Epidemiol. 2013, 178, 813–818. [Google Scholar] [CrossRef] [PubMed]
- Rossman, T.G.; Uddin, A.; Burns, F.J. Arsenic and human health effects: New molecular mechanisms for arsenic-induced cancers. Toxicol. Appl. Pharmacol. 2024, 456, 116. [Google Scholar] [CrossRef]
- National Research Council. Critical Aspects of EPA’s IRIS Assessment of Inorganic Arsenic: Interim Report; The National Academies Press: Washington, DC, USA, 2013; p. 127. [Google Scholar]
- IARC Monographs on the Evaluation of Carcinogenic Risks to Humans, Volume 100C: Arsenic, Metals, Fibres and Dusts; IARC: Lyon, France, 2012; pp. 11–465.
- Smith, A.H.; Marshall, G.; Roh, T.; Ferrecio, C.; Liaw, J.; Steinmaus, C. Lung, Bladder and Kidney Cancer Mortality 40 Years After Arsenic Exposure Reduction. J. Natl. Cancer Inst. 2018, 110, djx201. [Google Scholar] [CrossRef]
- Chen, Y.; Graziano, J.H.; Parvez, F.; Liu, M.; Slavkovich, V.; Kalra, T.; Argos, M.; Islam, T.; Ahmed, A.; Rakibuzzaman, M. Arsenic exposure from drinking water and mortality from cardiovascular disease in Bangladesh: Prospective cohort study. Brit. Med. J. (Overseas Retired Dr. Ed.) 2011, 342, d2431. [Google Scholar] [CrossRef]
- Farzan, S.F.; Chen, Y.; Rees, J.R.; Zens, M.S.; Karagas, M.R. Risk of death from cardiovascular disease associated with low-level arsenic exposure among long-term smokers in a US population-based study. Toxicol. Appl. Pharmcol. 2015, 287, 93–97. [Google Scholar] [CrossRef]
- Xu, L.; Mondal, D.; Polya, D.A. Positive association of Cardiovascular Disease (CVD) with chronic exposure to drinking water Arsenic (As) at concentrations below the WHO provisional guideline value: A systematic review and meta-analysis. Int. J. Environ. Res. Public Health 2020, 17, 2536. [Google Scholar] [CrossRef]
- Rahman, M.; Sohel, N.; Yunus, M.; Chowdhury, M.E.; Hore, S.K.; Zaman, K.; Bhuiya, A.; Streatfield, P.K. A prospective cohort study of stroke mortality and arsenic in drinking water in Bangladeshi adults. BMC Public Health 2014, 14, 174. [Google Scholar] [CrossRef]
- Mukherjee, A.; Coomar, P.; Sarkar, S.; Johannesson, K.H.; Fryar, A.E.; Schreiber, M.E.; Ahmed, K.M.; Alam, M.A.; Bhattacharya, P.; Bundschuh, J.; et al. Arsenic and other geogenic contaminants in global groundwater. Nat. Rev. Earth Environ. 2024, 5, 312–328. [Google Scholar] [CrossRef]
- Podgorski, J.E.; Wu, R.; Chakravorty, B.; Polya, D.A. Groundwater Arsenic Distribution in India by Machine Learning Geospatial Modeling. Int. J. Environ. Res. Public Health 2020, 17, 7119. [Google Scholar] [CrossRef]
- Mukherjee, A.; Sarkar, S.; Chakraborty, M.; Duttagupta, S.; Bhattacharya, A.; Saha, D.; Bhattacharya, P.; Mitra, A.; Gupta, S. Occurrence, predictors and hazards of elevated groundwater arsenic across India through field observations and regional-scale AI-based modeling. Sci. Total Environ. 2021, 759, 143511. [Google Scholar] [CrossRef]
- World Health Organization (WHO). Guidelines for Drinking-Water Quality, 3rd Edition, Vol. 1: Recommendations; World Health Organization: Geneva, Switzerland, 2004. [Google Scholar]
- Chakraborti, D.; Mukherjee, S.C.; Pati, S.; Sengupta, M.K.; Rahman, M.M.; Chowdhury, U.K.; Hossain, M.A.; Mukherjee, A.; Das, B.; Kalman, D.; et al. Arsenic groundwater contamination in the middle Ganga Plain, Bihar, India: A future danger? Environ. Health Perspect. 2003, 111, 1194–1201. [Google Scholar] [CrossRef]
- Bates, M.N.; Smith, A.H.; Hopenhayn-Rich, C. Arsenic ingestion and internal cancers: A review. Am. J. Epidemiol. 1992, 135, 462–476. [Google Scholar] [CrossRef] [PubMed]
- Smith, A.H.; Steinmaus, C.M. Health effects of arsenic and chromium in drinking water: Recent human findings. Annu. Rev. Public Health 2009, 30, 107–122. [Google Scholar] [CrossRef] [PubMed]
- Kumar, A.; Ali, M.; Kumar, R.; Kumar, M.; Sagar, P.; Pandey, R.K.; Akhouri, V.; Kumar, V.; Anand, G.; Niraj, P.K.; et al. Arsenic exposure in Indo Gangetic plains of Bihar causing increased cancer risk. Sci. Rep. 2021, 11, 2376. [Google Scholar] [CrossRef] [PubMed]
- Wu, R.; Xu, L.; Polya, D.A. Groundwater Arsenic-Attributable Cardiovascular Disease (CVD) Mortality Risks in India. Water 2021, 13, 2232. [Google Scholar] [CrossRef]
- Rockhill, B.; Newman, B.; Weinberg, C. Use and misuse of population attributable fractions. Am. J. Public Health 1998, 88, 15–19. [Google Scholar] [CrossRef]
- Institute for Health Metrics and Evaluation. Global Burden of Disease Study 2019. Available online: http://vizhub.healthdata.org/gbd-compare/ (accessed on 4 January 2026).
- Chen, C.J.; Chaung, Y.C.; You, S.L.; Lin, T.M.; Wu, H.Y. A retrospective study on malignant neoplasms of bladder, lung and liver in Blackfoot disease endemic area in Taiwan. Br. J. Cancer 1986, 53, 399–405. [Google Scholar] [CrossRef]
- Yang, T.Y.; Hsu, L.I.; Chen, H.C.; Chiou, H.Y.; Hsueh, Y.M.; Wu, M.M.; Chen, C.L.; Wang, Y.H.; Liao, Y.T.; Chen, C.J. Lifetime risk of urothelial carcinoma and lung cancer in the arseniasis-endemic area of Northeastern Taiwan. J. Asian Earth Sci. 2013, 77, 332–337. [Google Scholar] [CrossRef]
- Yang, T.W.; Zhang, H.B.; Chen, B.; Zhang, H.; Tao, S.S. Association between lung cancer risk and inorganic arsenic concentration in drinking water: A dose-response meta-analysis. Toxicol. Res. 2018, 7, 1257–1266. [Google Scholar] [CrossRef]
- Issanov, A.; Adewusi, B.; Saint-Jacques, N.; Dummer, T.J.B. Arsenic in drinking water and lung cancer: A systematic review of 35 years of evidence. Toxicol. Appl. Pharmacol. 2024, 483, 116808. [Google Scholar] [CrossRef]
- Soerjomataram, I.; Shield, K.; Marant-Micallef, C.; Vignat, J.; Hill, C.; Rogel, A.; Menvielle, G.; Dossus, L.; Ormsby, J.-N.; Rehm, J.; et al. Cancers related to lifestyle and environmental factors in France in 2015. Eur. J. Cancer 2018, 105, 103–113. [Google Scholar] [CrossRef]
- Mondal, D.; Adamson, G.C.; Nickson, R.; Polya, D.A. A comparison of two techniques for calculating groundwater arsenic-related lung, bladder and liver cancer disease burden using data from Chakdha block, West Bengal. Appl. Geochem. 2008, 23, 2999–3009. [Google Scholar] [CrossRef]
- Ahmed, K.M.; Bhattacharya, P.; Hasan, M.A.; Akhter, S.H.; Alam, S.M.; Bhuyian, M.H.; Iman, M.B.; Khan, A.A.; Sracek, O. Arsenic enrichment in groundwater of the alluvial aquifers in Bangladesh: An overview. Appl. Geochem. 2004, 19, 181–200. [Google Scholar] [CrossRef]
- Chakraborti, D.; Rahman, M.M.; Das, B.; Nayak, B.; Pal, A.; Sengupta, M.K.; Hossain, M.A.; Ahamed, S.; Sahu, M.; Saha, K.C.; et al. Groundwater arsenic contamination in Ganga–Meghna–Brahmaputra plain, its health effects and an approach for mitigation. Environ. Earth Sci. 2013, 70, 1993–2008. [Google Scholar] [CrossRef]
- Islam, F.S.; Gault, A.G.; Boothman, C.; Polya, D.A.; Charnock, J.M.; Chatterjee, D.; Lloyd, J.R. Role of metal-reducing bacteria in arsenic release from Bengal delta sediments. Nature 2004, 430, 68–71. [Google Scholar] [CrossRef] [PubMed]
- Berg, M.; Stengel, C.; Trang, P.T.K.; Viet, P.H.; Sampson, M.L.; Leng, M.; Samreth, S.; Fredericks, D. Magnitude of arsenic pollution in the Mekong and Red River Deltas—Cambodia and Vietnam. Sci. Total Environ. 2007, 372, 413–425. [Google Scholar] [CrossRef] [PubMed]
- Wu, R.; Podgorski, J.; Berg, M.; Polya, D.A. Geostatistical model of the spatial distribution of arsenic in groundwaters in Gujarat State, India. Environ. Geochem. Health 2020, 43, 2649–2664. [Google Scholar] [CrossRef]
- Bindal, S.; Singh, C.K. Predicting groundwater arsenic contamination: Regions at risk in highest populated state of India. Water Res. 2019, 159, 65–76. [Google Scholar] [CrossRef]
- Mehrabi, M.; Han, Y.; Polya, D.A. Machine Learning Models of the Geospatial Distribution of Groundwater Quality: A Systematic Review. Water 2025, 17, 2861. [Google Scholar] [CrossRef]
- Amini, M.; Abbaspour, K.; Berg, M.; Winkel, L.; Hug, S.; Hoehn, E.; Yang, H.; Johnson, C. Statistical modeling of global geogenic arsenic contamination in groundwater. Environ. Sci. Technol. 2008, 42, 3669–3675. [Google Scholar] [CrossRef] [PubMed]
- Podgorski, J.; Berg, M. Global threat of arsenic in groundwater. Science 2020, 368, 845–850. [Google Scholar] [CrossRef] [PubMed]
- Dhamija, S.; Joshi, H. Prediction of Groundwater Arsenic Hazard Employing Geostatistical Modelling for the Ganga Basin, India. Water 2022, 14, 2440. [Google Scholar] [CrossRef]
- Chakraborty, M.; Sarkar, S.; Mukherjee, A.; Shamsuddhuha, M.; Ahmed, M.K.; Bhattacharya, A.; Mitra, A. Modeling regional-scale groundwater arsenic hazard in the transboundary Ganges River Delta, India and Bangladesh: Infusing physically based model with machine learning. Sci. Total Environ. 2020, 748, 141107. [Google Scholar] [CrossRef]
- Nath, B.; Chowdhury, R.; Ni-Meister, W.; Mahanta, C. Predicting the Distribution of Arsenic in Groundwater by a Geospatial Machine Learning Technique in the Two Most Affected Districts of Assam, India: The Public Health Implications. GeoHealth 2022, 6, e2021GH000585. [Google Scholar] [CrossRef]
- Government of India, Census of India: Population Enumeration Data. Available online: https://censusindia.gov.in/nada/index.php/home (accessed on 16 February 2026).
- Government of India. Census of India: HL-06: Households (Excluding Institutional Households) by Main Source of Drinking Water and Location. 2001. Available online: https://censusindia.gov.in/census.website/ (accessed on 16 February 2026).
- Wu, R.; Alvareda, E.M.; Polya, D.A.; Blanco, G.; Gamazo, P. Distribution of Groundwater Arsenic in Uruguay Using Hybrid Machine Learning and Expert System Approaches. Water 2021, 13, 527. [Google Scholar] [CrossRef]
- Su, Z.; Wei, M.-N.; Jia, X.-H.; Fan, Y.-G.; Zhao, F.H.; Zhou, Q.-H.; Taylor, P.R.; Qiao, Y.-L. Arsenic, tobacco use, and lung cancer: An occupational cohort with 27 follow-up years. Environ. Res. 2022, 206, 112611. [Google Scholar] [CrossRef]
- Lamm, S.H.; Ferdosi, H.; Dissen, E.K.; Li, J.; Ahn, J. A Systematic Review and Meta-Regression Analysis of Lung Cancer Risk and Inorganic Arsenic in Drinking Water. Int. J. Environ. Res. Public Health 2015, 12, 15498–15515. [Google Scholar] [CrossRef]
- Baastrup, R.; Sørensen, M.; Balstrøm, T.; Frederiksen, K.; Larsen, C.L.; Tjønneland, A.; Overvad, K.; Raaschou-Nielsen, O. Arsenic in drinking-water and risk for cancer in Denmark. Environ. Health Perspect. 2008, 116, 231–237. [Google Scholar] [CrossRef]
- Mallath, H.; Taylor, D.G.; Badwe, R.A.; Rath, G.K.; Shanta, V.; Pramesh, C.S.; Digumarti, R.; Sebastian, P.; Borthakur, B.B.; Kalwar, A.; et al. The growing burden of cancer in India: Epidemiology and social context. Lancet Oncol. 2014, 15, e205–e212. [Google Scholar] [CrossRef]
- Singh, K.; Grover, A.; Dhanasekaran, D. Unveiling the cancer epidemic in India: A glimpse into GLOBOCAN 2022 and past patterns. Lancet Reg. Health—Southeast Asia 2025, 34, 100546. [Google Scholar] [CrossRef]
- Polya, D.A. Groundwater arsenic attributable cancer burden in India—Plausible model population attributable fraction (PAF) using AI/machine learning. In Proceedings of the GSA CONNECTS 2025, San Antonio, TX, USA, 19–22 October 2025. [Google Scholar]
- Paharai, S.; Barman, D.; Talukdar, R. Tobacco usage in India: A meta-analysis of evidence drawn from regional studies between 2010 and 2022. Trop. Med. Int. Health 2023, 28, 699–709. [Google Scholar] [CrossRef]
- Suvvari, T.K.; Arigapudi, N.; Veggalam, S. Air pollution and lung cancer in India: An escalating public health crisis. Int. Health 2025, 0, ihaf154. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.