1. Introduction
Halogenated polycyclic aromatic hydrocarbons (HPAHs), formed through the substitution of hydrogen atoms in parent PAHs by halogen elements such as chlorine or bromine, have recently emerged as a class of contaminants of increasing environmental and toxicological concern [
1,
2,
3]. Compared with their parent PAHs, HPAHs generally exhibit higher chemical stability, stronger lipophilicity, and enhanced resistance to environmental degradation, which may promote their persistence in food matrices and biological tissues [
3,
4,
5]. Toxicological evidence further suggests that certain HPAHs possess dioxin-like activities, endocrine-disrupting potential, and carcinogenicity that may equal or exceed those of unsubstituted PAHs [
6,
7,
8]. Despite these concerns, HPAHs remain largely absent from routine environmental monitoring programs and food safety regulations, resulting in substantial uncertainty regarding their exposure pathways and associated health risks.
Dietary intake has been recognized as one of the dominant exposure routes for hydrophobic organic contaminants in the general population [
9,
10,
11]. For conventional PAHs, numerous studies have demonstrated that food consumption—particularly of meat, cereals, vegetables, and processed foods—contributes more substantially to human exposure than inhalation or dermal contact (e.g., [
11,
12,
13,
14,
15]). In contrast, knowledge regarding dietary exposure to HPAHs remains fragmented. Existing studies have primarily focused on the occurrence of HPAHs in environmental matrices such as atmospheric particles, soils, sediments, and industrial emissions (e.g., [
2,
5,
8]), while systematic investigations targeting foodstuffs are still scarce. Moreover, most available dietary studies are limited to a small number of compounds or food categories, hampering a comprehensive understanding of exposure profiles across different dietary habits. As a result, the translation of environmental HPAH contamination into actual human dietary exposure remains poorly understood, representing a critical bottleneck in evaluating their real-world health relevance.
Industrial activities involving high-temperature combustion and chlorination or bromination processes are considered major anthropogenic sources of HPAHs [
2,
8,
16]. Among these, the coking industry has been identified as a particularly important contributor due to the combined effects of coal pyrolysis, halogen-containing additives, and complex emission pathways [
1,
2,
17]. Previous research has documented elevated concentrations of parent PAHs in environmental media and agricultural products surrounding coking facilities, suggesting a heightened exposure risk for nearby populations [
18,
19,
20]. However, whether similar spatial patterns apply to HPAHs, and to what extent coking-related emissions influence population-level dietary exposure in surrounding communities, remain poorly characterized. Population-based assessments of dietary exposure to HPAHs in industrially impacted regions are extremely limited, representing a critical gap in current environmental health research. Addressing this gap requires integrating food-borne HPAH contamination with population-oriented dietary exposure frameworks that explicitly consider industrial emission contexts.
Another major challenge in assessing HPAH-related health risks lies in the pronounced structural diversity and congener-specific toxicity of these compounds. Different halogen substitution patterns and ring structures can result in substantial variability in bioaccumulation potential and toxic potency [
2,
21,
22]. While toxic equivalency factors (TEFs) have been proposed for a limited number of HPAHs, their application in dietary risk assessment remains rare. Consequently, most previous studies have relied on concentration-based evaluations without adequately addressing differences in toxicological relevance among individual congeners. This limitation may lead to significant under- or overestimation of actual health risks associated with dietary exposure. A congener-resolved risk perspective is therefore essential for identifying risk-dominant HPAHs and for avoiding misleading conclusions based solely on total concentrations.
Food processing and preparation practices further complicate the assessment of dietary HPAHs [
23,
24,
25]. Thermal treatment, curing, and seasoning processes have been shown to influence the formation and accumulation of parent PAHs in foods [
12,
23,
24,
25,
26], yet their effects on halogenated analogues are poorly understood. In addition, environmental contamination during cultivation, livestock feeding, and atmospheric deposition may play a critical role in determining HPAH levels in raw and processed foods [
2,
4,
27]. Distinguishing the relative contributions of environmental background contamination and food processing–related inputs is therefore essential for identifying effective risk mitigation strategies, but has rarely been addressed for HPAHs. Integrating food-type characteristics, processing practices, and environmental source influences remains an underdeveloped but necessary step in dietary HPAH exposure assessment.
Against this background, a comprehensive dietary exposure assessment of HPAHs in industrially influenced regions is urgently needed. Such an assessment should integrate a wide spectrum of food categories representative of local consumption patterns, incorporate congener-specific analysis of HPAHs because differences in halogen substitution pattern and molecular structure can substantially affect environmental behavior and toxic potency, and apply health risk metrics that account for differences in toxic potency. This is particularly important because total-concentration-based evaluation may obscure the contribution of low-abundance but highly potent congeners to overall dietary risk. Importantly, comparisons among populations with varying degrees of industrial influence can provide valuable insights into the role of emission sources in shaping dietary exposure profiles.
In this study, we develop an integrated framework to characterize dietary exposure to HPAHs and evaluate associated health risks among populations residing in and around a typical coking industrial area. By combining extensive food sampling across multiple categories with congener-resolved chemical analysis and toxic equivalency-based risk assessment, this work aims to address key knowledge gaps regarding the occurrence, exposure pathways, and potential health implications of HPAHs in human diets. By explicitly linking industrial emission contexts, food contamination profiles, and congener-specific health risk metrics within a population-based dietary exposure framework, this study provides new insight into how HPAHs are transferred from industrial sources into human diets. The findings are expected to support improved interpretation of HPAH behavior in food systems, contribute to more accurate dietary risk assessments, and provide a scientific basis for future monitoring and regulatory considerations targeting halogenated PAHs in industrial regions.
2. Materials and Methods
2.1. Study Area and Sample Collection
Shanxi Province is the largest coke-producing region in China, with an annual output of 98.6 million tons in 2021, accounting for approximately 21% of national coke production, highlighting the regional significance of coking-related emissions. The study was conducted in Lüliang City, Shanxi Province, northern China, a region characterized by a temperate semi-arid continental monsoon climate. Lüliang City represents a typical coal-based industrial region in northern China, with intensive coking activities accompanied by related industries such as steel production, coal-fired power generation, petroleum processing, and chemical manufacturing.
Industrial facilities are primarily concentrated in the southeastern part of the city, whereas the western and northern areas are dominated by residential communities and agricultural land, forming a clear spatial gradient of industrial influence. According to the most recent population census, the city has approximately 226,800 permanent residents, providing a representative population for assessing industrially influenced dietary exposure.
Based on differences in proximity to industrial activities, the study area was categorized into three zones: a coking plant area, an exposed residential area, and a control area. The approximate distances between the coking plant area and the exposed residential area and control area were ~1 km and ~50 km, respectively (
Figure S1). To protect sensitive information related to industrial infrastructure, specific sampling coordinates are not disclosed, and only schematic maps are provided to illustrate relative spatial relationships among the three areas. Dietary samples were collected using the duplicate diet method, which involves collecting identical portions of all foods consumed by participants over a 24 h period. This method accounts for contamination introduced during food preparation, cooking, and processing, as well as food losses during consumption, thereby providing an accurate representation of actual dietary intake [
11,
28].
Participants were recruited from the three study zones using a field-based voluntary sampling approach. Eligible participants were adults who lived and/or worked in the corresponding study area and were willing to provide a complete 24 h duplicate diet sample together with questionnaire information on demographic characteristics, dietary habits, and cooking practices. Individuals with incomplete dietary records or insufficient sample collection were excluded from the study. All participant-based sampling procedures were conducted in accordance with relevant ethical guidelines, and informed consent was obtained from all participants prior to sample collection. The study protocol was reviewed and approved by the Medical Ethics Committee of Tongji Medical College, Huazhong University of Science and Technology.
To ensure representative sampling, structured questionnaires were administered to participants to obtain information on demographic characteristics, dietary habits, and cooking practices across the coking plant area, exposed residential area, and control area (
Table S1). A total of 87 duplicate diet samples were collected (
Table S2), including 31 samples from participants associated with the coking plant area, 18 samples from residents in the exposed residential area, and 38 samples from residents in the control area. The exposure grouping was defined primarily according to spatial proximity to the coking industry and the corresponding industrial influence gradient, rather than occupation alone. Specifically, the coking plant area represented a high-exposure scenario and included participants linked to the coking plant setting, whereas the exposed residential area and control area represented residential exposure scenarios with intermediate and low industrial influence, respectively. Thus, the present study was designed to compare real-world dietary exposure under different industrial influence contexts, while occupation and residence were not fully disentangled in the highest-exposure group. Immediately after collection, all dietary samples were sealed in clean low-density polyethylene (LDPE) food storage bags and placed in cooled containers with ice packs for transport to the laboratory. Upon arrival at the laboratory, all food samples were carefully weighed, homogenized, transferred into 250 mL amber glass bottles, and then stored at −20 °C until further processing and analysis. Samples were carefully handled, separated, and prepared individually to minimize the risk of cross-contamination throughout the sampling and analytical procedures.
2.2. Sample Preparation and Instrumental Analysis
The study employed the duplicate diet method, widely regarded as a “gold standard” for assessing dietary exposure due to its accuracy [
11,
28]. Although resource-intensive, this approach provides precise individual exposure data [
11,
29]. The standard solution containing 31 HPAHs and the fluorinated polycyclic aromatic hydrocarbon (F-PAH) standards were purchased from AccuStandard (New Haven, CT, USA). Ethyl acetate, hexane, acetonitrile, dichloromethane, and isooctane were all of high-performance liquid chromatography (HPLC) grade and were obtained from CNW Technologies GmbH (Düsseldorf, Germany). Dietary samples collected by this method comprised complex cooked food matrices containing multiple food items, condiments, and seasonings, with high lipid content and abundant pigments. Such complexity poses significant challenges for the determination of HPAHs, particularly in terms of efficient extraction and lipid removal. To address these challenges, an optimized sample preparation procedure was developed for reliable quantification of multiple HPAHs.
Briefly, 1.0 g of homogenized food sample was weighed into a 50 mL polypropylene centrifuge tube (Tube A). A mixture of isotopically labeled internal standards (Nap-d
8, Ace-d
10, Phe-d
10, Chr-d
12, and Per-d
12) was added and allowed to equilibrate for 30 min. These deuterated parent-PAH standards were used as surrogate internal standards because a complete set of isotopically labeled H-PAH analogues was not commercially available. They were selected to span a broad range of molecular weights and chromatographic retention behavior, thereby providing correction across different classes of target H-PAHs. Their practical suitability for the present method was supported by the overall method-validation results, including acceptable recoveries, precision, and linearity (
Table S3), although compound-specific differences among H-PAHs cannot be entirely excluded. Subsequently, 2 mL of ultrapure water and 5 mL of acetonitrile were added, followed by vortex mixing for 3 min and ultrasonic extraction for 10 min. The extract was stored at 4 °C for 2 h to facilitate phase separation. After cooling, 2 g anhydrous MgSO
4 and 0.5 g NaCl (QuEChERS salt mixture) were added, vortexed for 1 min, and centrifuged at 10,000 rpm and 4 °C for 5 min.
The supernatant was transferred to a 15 mL centrifuge tube (Tube B) containing SupelTM QuE Z-Sep sorbent and anhydrous MgSO4 for enhanced lipid removal. After vortexing for 3 min and centrifugation at 10,000 rpm and 4 °C for 5 min, the resulting supernatant was transferred to a 10 mL centrifuge tube (Tube C). To improve analyte recovery, an additional 2 mL of acetonitrile was added to Tube B, and the extraction, vortexing, and centrifugation steps were repeated. Combined extracts were gently evaporated under nitrogen and reconstituted to a final volume of 1.5 mL in autosampler vials, to which 50 μL of fluorinated PAHs (F-PAHs, 50 ng/mL) was added as injection standards, corresponding to 2.5 ng per extract. Prepared samples were stored at −20 °C until analysis.
Quantitative determination of 31 HPAHs was performed using a Shimadzu GCMS-TQ8040 (Shimadzu Corporation, Kyoto, Japan) gas chromatograph–triple quadrupole mass spectrometer equipped with a DB-5MS capillary column (30 m × 0.25 mm × 0.25 μm). One microliter of each sample was injected in splitless mode at 280 °C. Helium (99.999%) was used as the carrier gas at 1.0 mL/min. The oven program was: 80 °C for 0.5 min, ramped to 280 °C at 5 °C/min, then to 300 °C at 15 °C/min (held 15 min), followed by a final ramp to 315 °C at 20 °C/min (held 0.5 min). The transfer line and ion source temperatures were 300 °C and 230 °C, respectively, with electron ionization (EI) and multiple reaction monitoring (MRM) used for target compound quantification.
2.3. Quality Assurance and Quality Control (QA/QC)
Method reliability was evaluated using procedural blanks, spiked samples, and replicate analyses. Method recoveries, calculated by subtracting concentrations in non-spiked samples from those in spiked samples, ranged from 79.24% to 107.46% for all HPAHs, indicating acceptable method accuracy (
Table S3). The QA/QC parameters reported in
Table S3 cover all target HPAHs analyzed in this study, including the seven previously unreported congeners identified in dietary samples. Precision, expressed as relative standard deviation (RSD, %), ranged from 0.68% to 10.94% across five replicate analyses of spiked samples, demonstrating good reproducibility (
Table S3).
Limits of detection (LODs) and quantification (LOQs) were 0.0010–0.0478 ng/g and 0.0054–0.1222 ng/g, respectively (
Table S3). Seven-point external calibration curves (5–500 ng/mL) exhibited excellent linearity for all compounds (R
2 > 0.99), confirming reliable quantification. All solvents were of chromatographic grade (CNW Technologies), and ultrapure water was used throughout. Glassware was baked at 450 °C for 8 h prior to use to remove potential organic contaminants. Procedural blanks confirmed negligible background contamination during sample preparation and analysis.
2.4. Dietary Exposure and Health Risk Assessment
The potential carcinogenic risk associated with dietary exposure to HPAHs was assessed using the incremental lifetime cancer risk (ILCR) model recommended by the United States Environmental Protection Agency [
30], which has been widely applied to organic contaminants such as PAHs, HPAHs, and polychlorinated biphenyls (PCBs). This approach integrates compound-specific toxicity with long-term dietary intake to estimate population-level cancer risk. Because duplicate diet samples represent the actual foods consumed within a 24 h period, the measured HPAH concentrations provide a direct estimate of short-term dietary intake on the sampling day. In the present study, these measured concentrations were further used as a screening-level proxy for longer-term dietary exposure under the assumption that the sampled meals broadly reflect habitual dietary patterns and local exposure contexts within each study group. Therefore, the ILCR values reported here should be interpreted as indicative estimates of chronic dietary cancer risk at the group level, rather than precise lifetime risk predictions for individual participants.
Given the pronounced congener-specific toxicity of HPAHs, carcinogenic risk was evaluated using a toxic equivalency–based framework. The toxic equivalency (TEQ
i) of HPAHs relative to benzo[a]pyrene (BaP) was calculated according to Equation (1):
where
represents the concentration of an individual HPAH congener in food, and
is its relative effect potency with respect to BaP. For interpretive purposes, the TEQ-derived BaP-equivalent dietary exposure term represents the toxicity-weighted dietary intake component used in the ILCR framework prior to application of the slope factor and lifetime adjustment terms. The ILCR associated with dietary exposure to HPAHs was then calculated using Equation (2):
where IR is the dietary ingestion rate (kg d
−1), ED is the exposure duration (years),
is the oral cancer slope factor of BaP (7.3 mg/kg/d), BW is body weight (kg), and AT is the averaging lifetime (years). Based on established risk criteria, ILCR values < 10
−6, between 10
−6 and 10
−4, and >10
−4 were interpreted as negligible, potential, and unacceptable cancer risks, respectively.
Parameter values were selected to represent typical adult exposure conditions in China. Dietary ingestion rates were set at 1.033 kg/d for rural residents and 1.1177 kg/day for urban residents, assuming daily food consumption throughout the year (365 days/year). The exposure duration was set to 45 years, corresponding to the average age of the study participants, while body weight and lifetime were assumed to be 64 kg and 70 years, respectively. These parameter settings represent a simplified adult exposure scenario and do not explicitly capture inter-individual variability in food intake, body size, or long-term exposure duration. Accordingly, they may influence the absolute magnitude of ILCR estimates, although they remain useful for comparative risk characterization across exposure groups.
Due to the limited availability of experimentally derived toxicological data for many HPAH congeners, relative effect potency factors (
) were derived using a combined literature-based and quantitative structure–activity relationship (QSAR) approach. Carcinogenicity and mutagenicity of HPAHs were first predicted using the OECD QSAR Toolbox (version 4.5), and reference compounds with high structural similarity (i.e., similar ring number, halogen substitution pattern, and molecular framework) were selected accordingly (
Table S4). Subsequently, carcinogenic potency values (CPVs) for target HPAHs and their reference analogues were estimated using the CTV model. For HPAHs without available experimental REP data but with suitable structural analogues,
values for HPAHs without available experimental data were then calculated using Equation (3):
where
and
represent the predicted carcinogenic potency values of the target HPAH and its reference analogue, respectively, and
is the known relative potency of the reference compound. The average prediction error of the CTV model is approximately 0.97 log
10 units, and its applicability domain covers more than 80% of environmental organic chemicals. Using this approach,
values were obtained for 18 HPAH congeners, either directly from the literature or through structural extrapolation (
Table S4). This combined strategy expands the congener coverage of cumulative risk assessment, but it also introduces uncertainty because QSAR-assisted REP_BaP estimates may deviate from the true biological potency of individual HPAHs, particularly for congeners lacking direct toxicological validation. Therefore, the resulting TEQ and ILCR values should be interpreted as screening-level, toxicity-weighted estimates that are suitable for comparative assessment, rather than exact toxicity-equivalent or lifetime cancer-risk values for specific individuals.
2.5. Data Analysis
Data analysis and visualization were performed using R software (version 4.4.3). For dietary exposure and health risk assessment, concentrations of HPAHs below the limit of detection (LOD) were treated as zero in the primary analysis as a lower-bound assumption. This approach was adopted because the duplicate diet samples were large composite food samples analyzed using optimized extraction and enrichment procedures, such that non-detects were considered more likely to reflect very low concentrations than clear analytical limitations. However, we acknowledge that alternative substitution approaches (e.g., LOD/2 or LOD) could lead to higher absolute estimates of HPAH concentration, TEQ, and ILCR, particularly for congeners with low detection frequencies.
ILCR values were calculated for individual dietary samples using a toxicity equivalency–based framework. To characterize population-level cancer risk patterns, ILCR values were log10-transformed to reduce right-skewness and facilitate comparison across exposure groups. Formal statistical comparisons of log10-transformed ILCR among the coking plant area, exposed residential area, and control area were performed using the Kruskal–Wallis test, followed by pairwise Wilcoxon rank-sum tests with Holm adjustment for multiple comparisons. These nonparametric methods were selected because ILCR data showed evident right-skewness and unequal dispersion among groups. Probability density distributions of log10-transformed ILCR were estimated using kernel density estimation (KDE) with a Gaussian kernel, providing a non-parametric representation of population-level cancer risk distributions, including central tendency, dispersion, and high-risk tails. Accordingly, comparisons among the coking plant area, exposed residential area, and control area were based on both formal nonparametric statistical testing and descriptive evaluation of distributional shifts, overlap, and tail behavior relative to established cancer risk benchmarks (10−6 and 10−4). Because only one duplicate diet sample was collected per participant, temporal variability in food consumption, cooking practices, and HPAH concentrations across days or seasons was not explicitly characterized. Accordingly, the exposure and ILCR estimates derived in this study should be regarded as snapshot-based measurements anchored to the sampling day, while their interpretation for chronic risk relies on the assumption that these samples broadly reflect typical local exposure conditions at the population level.
2.6. Sensitivity Analysis
To evaluate the robustness of the toxicity-weighted dietary cancer-risk estimates, a scenario-based sensitivity analysis was performed for the fixed exposure assumptions used in the ILCR model. Because ILCR is proportional to TEQ × IR × ED/BW, lower- and upper-bound exposure scenarios were generated by varying dietary ingestion rate (IR), body weight (BW), and exposure duration (ED) around the base-case assumptions. Specifically, the lower-bound scenario was defined as IR −20%, BW +20%, and ED = 30 years, whereas the upper-bound scenario was defined as IR +20%, BW −20%, and ED = 60 years. ILCR values were recalculated under each scenario, and the stability of both the absolute risk magnitude and the between-group risk pattern was evaluated across the coking plant area, exposed residential area, and control area.