Molecular Fingerprinting of the Biodegradation of Petroleum Organic Pollutants in Groundwater and under Site-Specific Environmental Impacts

: A quantitative and qualitative assessment using molecular markers derived from compound-specific indices for indicating groundwater impacted by petroleum spills in an oil field was recently undertaken and demonstrated serious contamination, with both high total petroleum hydrocarbons (TPH) (3.68–7.32 mg/L) and hazardous compounds in the groundwater. A petroleum source was identified, and the analysis revealed a decreasing trend of fresh petroleum input, along with groundwater advection and an increasing trend of biodegradation potential at locations farther from the source. This was confirmed via microbial analysis with both biodegrading microorganisms and diversity indices (Shannon, Simpson, and Pielou) and the principal component analysis (PCA) modeling approach, which classified the field samples into three types according to the distribution correlations between different organic compounds. Biodegradation was believed to be the dominant sink of hydrocarbons due to the increasing Pr/ C 17 and Ph/ C 18 values with seasonal changes. Raised temperatures activated the microbial degradation process; specifically, low-weight hydrocarbons degraded more rapidly than high-weight hydrocarbons, resulting in the accumulation of an unresolved complex mixture of bioproducts at locations that were farther away. Spatially, the Pr/ C 17 and Ph/ C 18 values increased from the upstream to the downstream areas, showing substantial biodegradation. The relationships between the molecular markers and chemical indices were quantified via canonical correlation analysis (CCA) to visually explain the interactive reaction processes. It was also demonstrated that the biodegradation of petroleum organics can be characterized by the consumption of dissolved oxygen and a decreasing Pr/Ph ratio, due to system reduction. These results demonstrate that compound-specific molecular markers, coupled with biochemical parameters, can effectively support a better understanding and effective fingerprinting of the fate and transport of petroleum organic contaminants, thus offering valuable technical support for a cost-effective remediation strategy.


Introduction
Groundwater contamination by petroleum hydrocarbons is a widely recognized and serious environmental risk, due to the widespread accidental release of petroleum hydrocarbons [1][2][3][4][5] and their carcinogenic and mutagenic effects on human health [6][7][8].Such release accidents frequently occur in oilfields [9,10].Petroleum is a complex mixture that contains thousands of different organic compounds with various physical-chemicalbiological features [11][12][13], which differentiate their fate and migration pathways and make in situ remediation complicated [14,15].Once hydrocarbon contaminants infiltrate into the subsurface and groundwater, they are subjected to various natural attenuation processes [16,17], which can be divided into abiotic processes and biodegradation [18,19].Because of this environmentally friendly feature, biodegradation is considered a promising approach and has been widely used in groundwater petroleum remediation [20].It is important to note that the biodegradation of petroleum organic pollutants is a dynamic and intricate process that is influenced by various environmental factors, the composition of the pollutant mixture, and the diversity of microbial populations present at the contaminated site [21,22].However, these changes remain unpredictable, especially in the subsurface environment, where contaminants and microbes are mutually influenced by groundwater, soil, and air, leading to a great challenge in determining their distribution and communities [23].The main limitation of oil pollution evaluation is that analyzing the pollution process based solely on pollutant concentration may be insufficient and inconsistent, which may lead to uncertainty or even errors in the evaluation outcomes.More importantly, different petroleum organic components have distinct chemical structures, resulting in varied migration processes in the soil and groundwater.This is a primary reason why a single remediation method is often insufficient to completely remove complex oil pollution.Therefore, an effective and accurate assessment of the biodegradation process, based on compound classification, is a prerequisite for the remediation of pollution remediation projects [24,25].Another challenge in remediating oilfield leakage-type petroleum pollution is repeated contamination caused by prolonged oil extraction, which makes it difficult to accurately locate specific pollution sources in the subsurface environment [26,27].
Although petroleum components undergo transformation during the transportation process, the molecular structures of these components have the capability to retain certain stable chemical structures and characteristic decomposition patterns [28,29].Therefore, the molecular structural features (as revealed by the GC-MS (gas chromatography-mass spectrometry) technique) of the components can be used to analyze the transformation patterns of the components [30][31][32][33][34].It has been reported that hydrocarbons from different biological origins have different odd/even carbon number ratios [35,36].For example, the carbon preference index (CPI) reflects the relative abundance of odd-numbered and even-numbered carbon atoms in the alkanes contained in organic compounds.According to statistics, the CPI value of petroleum compounds should be less than 1 [37].Thus, the CPI is often considered as a good indicator of oil pollution [38].
Pristane (Pr) and phytane (Ph) are isoprenoids that are frequently found in petroleum compounds, usually as the major constituents of a wide range of isoprenoid alkanes.They are considered to originate primarily from phytol during diagenesis [39,40].Shi et al. showed that under oxidizing conditions, phytol may be degraded preferably into pristane; conversely, this alteration is thought to result in a predominance of phytane over pristane under reducing conditions [41].On the basis of the reaction pathways, a Pr/Ph ratio of >1 would represent an oxidizing condition, whereas a Pr/Ph ratio of < 1 is an indication of a reducing environment [42,43].These distinct accompanying molecular reaction pathways offer us a special indicator (Pr/Ph, the ratio of pristane to phytane) to determine the redox environment.
According to the biodegradation sequence of different compounds, it is clear that lowmolecular-weight hydrocarbons are preferentially consumed, while high-molecular-weight hydrocarbons generally have a relatively stable structure that is more resistant to microorganism metabolism.Therefore, the ratio of low-molecular-weight to high-molecular-weight hydrocarbons (L/H) can be used as a criterion to assess the extent of biodegradation.Similarly, Pr and Ph are barely degradable, while alkanes C17 and C18 degrade more easily; hence, the ratio of Pr/C17 and Ph/C18 could demonstrate the degree of biodegradation [44].There is a special type of compound, named unresolved complex compounds (UCM), which commonly appear as a bump in the GC-MS spectrum.It has been widely reported that UCM is composed of petroleum-derived products such as ester and phenols (usually non-hydrocarbons), providing a good indicator of the extent of biodegradation.Hence, an increasing concentration of UCM suggests that hydrocarbons are degrading [45].
The biodegradation of hydrocarbons is accompanied by oxidation-reduction reactions that involve breaking chemical bonds and transferring electrons away from the contaminant [46].Subsurface environments offer electron acceptors such as iron, manganese, sulfate, nitrate, and oxygen to complete the transfer and, consequently, convert original compounds into intermediates.Biodegradation is classified into aerobic and anaerobic reactions according to the characteristics of the electron acceptors, which can distinguish the basic biodegradation pathway.In most cases, electron balances show the complete anaerobic oxidation of these aromatic compounds into CO 2 [47].Furthermore, some intermediates, such as benzylsuccinic acid and methylbenzylsuccicic acid isomers, have been proposed as distinctive indicators for use in the monitoring of anaerobic toluene and xylene degradation in fuel-contaminated aquifers [48,49].
Oxygen is the main participator in aerobic metabolism.It is incorporated by oxygenase and, finally, becomes part of the intermediates.During this process, O 2 is reduced.The major byproducts of aerobic respiration are carbon dioxide, water, and an increased population of microorganisms [50].Dissolved oxygen (DO) concentrations provide estimates for the relative rate of hydrocarbon biodegradation and oxygen drawdown within the plume.Anaerobic respiration uses nitrate (NO 3 − ), sulfate (SO 4 2− ), metals such as iron (Fe 2+ ) and manganese (Mn 4+ ), or even CO 2 as accepting electrons [51].
Oilfields in the northeast of China are critical energy resources and significant industrial centers that have witnessed decades of oil exploitation and are still operating.Consequently, petroleum contaminants are prevalent, both in the surface soil and in groundwater, which poses a significant threat to local residents and the eco-environment.This work presents a comprehensive investigation of petroleum-contaminated groundwater at an oil field in northeast China.The main objectives are to (1) effectively evaluate the composition and distribution of petroleum contamination in groundwater; (2) assess the spatial and temporal features of biodegradation by using molecular markers; and (3) identify the related chemical alterations and their inter-reactions with the environment.The results achieved demonstrate that a combination of molecular markers and chemical indices offers robust indices for the measurement of petroleum hydrocarbon contamination.

Site Setting and Sampling Strategy
The study area is located in the northeast of China, a typically cold and arid region.Winter here can last for five months, and the annual average temperature is 4.7 • C. The topographic features of this area characterize it as an alluvial-proluvial plain.The main body of surface water is a local river with 1.8-4 m average water-table fluctuation.This area's dry season usually occurs in winter each year, while the wet season occurs during July.The elevation in the study area is 130.2-132.5 m, and the contaminated site lies at the junction between the first terrace and a hillock (15-20 m high) that offers a hydraulic gradient for the movement of groundwater.After decades of continuous extraction, the groundwater pollution problem at this oilfield has become very severe.Despite multiple pollution remediation efforts, the complexity of contamination points and the long history of pollution have prevented complete restoration.The main contamination source is an oil-water pit produced by a faulty oil drilling well, located in the southeast corner.Quaternary fine, medium, and coarse sands and gravels are the principal components of the lithology in this area.The shallow aquifer is composed of these media and is 15-20 m thick.The groundwater table lies at 2.0-4.0 m below the surface and fluctuates by 0.5 m annually.The general direction of groundwater flow is from southeast (up-gradient) to northwest (down-gradient).For the purpose of accurate investigation regarding both source identification and migration pathways in the groundwater of different types of organic petroleum compounds, 13 monitoring wells were established within the contaminated site, and groundwater samples were collected every month from those wells.During the Water 2024, 16, 1773 4 of 20 sampling process, groundwater from the monitoring well was initially extracted using a bailer.To prevent degradation from exposure to sunlight and air, a 500 mL amber sampling bottle was filled with groundwater samples.Then, 2 mL of hydrochloric acid was added to adjust the pH of the samples to less than 2. The sampling bottles were sealed with aluminum foil to prevent air from entering, and the samples were stored at 4 • C indoors.The petroleum organic hydrocarbons and water chemical properties of all those samples were analyzed.In addition, to study the spatial migration characteristics of petroleum pollutants in groundwater, 3 sections were set up along the direction of the water flow (Section 1) and perpendicular to the direction of the water flow (Sections 2 and 3), respectively, to analyze the impact of groundwater flow on the migration of petroleum pollutants (Figure 1).
lithology in this area.The shallow aquifer is composed of these media and is 15-20 m thick.The groundwater table lies at 2.0-4.0 m below the surface and fluctuates by 0.5 m annually.The general direction of groundwater flow is from southeast (up-gradient) to northwest (down-gradient).For the purpose of accurate investigation regarding both source identification and migration pathways in the groundwater of different types of organic petroleum compounds, 13 monitoring wells were established within the contaminated site, and groundwater samples were collected every month from those wells.During the sampling process, groundwater from the monitoring well was initially extracted using a bailer.To prevent degradation from exposure to sunlight and air, a 500 mL amber sampling bottle was filled with groundwater samples.Then, 2 mL of hydrochloric acid was added to adjust the pH of the samples to less than 2. The sampling bottles were sealed with aluminum foil to prevent air from entering, and the samples were stored at 4 °C indoors.The petroleum organic hydrocarbons and water chemical properties of all those samples were analyzed.In addition, to study the spatial migration characteristics of petroleum pollutants in groundwater, 3 sections were set up along the direction of the water flow (section 1) and perpendicular to the direction of the water flow (section 2 and section 3), respectively, to analyze the impact of groundwater flow on the migration of petroleum pollutants (Figure 1).

Hydrocarbon Analysis 2.2.1. Extraction of Hydrocarbons
We used the dichloromethane liquid-liquid extraction method to extract samples.Then, we transferred the entire 500 mL water sample to a separating funnel, rinsed the sample bottle with dichloromethane, and added a rinsing solution to the separating funnel.Then, 20 mL of dichloromethane was added to the separating funnel, the sample was shaken for Water 2024, 16, 1773 5 of 20 10 min and allowed to settle into layers, and then the dichloromethane extract was collected in a brown bottle.These steps were repeated three times.Finally, we evaporated the collected extract to 1 mL using a rotary evaporator at 45 • C and used GC-MS to determine the organic components.

GC-MS Conditions
The hydrocarbon analyses were performed using an Agilent 6890N-5975 GC-MS system.GC-MS was employed to determine the concentration under the following test conditions: (1) chromatographic conditions involved a capillary column (50 m × 0.25 mm × 0.25 µm).The gas chromatography inlet temperature was set at 250 • C, with split injection (split ratio 10:1).High-purity helium (99.999% purity) was used as the carrier gas, with a constant current mode and a column flow of 10 mL/min.The temperature-programming process began at 35 • C, was maintained for 3 min, and then was ramped up to 150 • C at a rate of 10 • C/min, held for 2 min, and, finally, increased to 200 • C at a rate of 20 • C/min for 2 min.(2) The mass spectrometry conditions included an electron multiplier voltage of 2108 eV, a GC-MS interface temperature of 250 • C, an ion source temperature of 230 • C, and electron energy set at 70 eV.The full scan mass range was 50-400 m/z.Hydrocarbon identification was achieved by comparing retention times with internal standards.

Inorganic Analysis
Inorganic materials, such as Fe 3+ , Mn 2+ , and SO 4 2− , were detected to indicate the oxidation-reduction conditions in the groundwater system.Samples for Fe/Mn were acidified with hydrochloric acid, while samples for sulfide were treated with 1 mol/L of sodium hydroxide and 1 mol/L of zinc acetate as stabilizers.The sampling bottles were filled to the top with no headspace, sealed with aluminum foil, protected from sunlight during shipping, and stored at 4 • C for subsequent laboratory analysis.Total petroleum hydrocarbons were analyzed using infrared spectrophotometry (JDS-108U+).K + and Na + were measured using flame atomic absorption spectrophotometry; Ca 2+ , Mg 2+ , NO 3 − , Cl − , and Fe 3+ were determined through titration; SO 4 2− was quantified using the turbidity method.The analytical methods followed the Quality Standards for Groundwater (GB/T 14848-2017) in China [52].Additionally, a multiparameter water quality analyzer (Hach-HQ40) was used to assess other groundwater properties.For rapid sample analysis, a HANNA handheld multiparameter portable water quality meter was employed to measure the dissolved oxygen (DO), pH, temperature, electrical conductivity (EC), oxidation-reduction potential (ORP), and other parameters in groundwater.

Microbial Analysis 1. Microbial Incubation
Biodegradation is a process in which microbes consume hydrocarbons to obtain carbon and energy resources for their growth and reproduction.The population of microbes is crucial for determining the extent of biodegradation.In this study, we utilized the plate culture method to quantify the number of microbes in water samples, measured in CFU/mL.Upon the collection of water samples, the microbes underwent incubation using three different types of cultured medium: (1) beef extract-peptone medium consisting of 3 g beef extract, 10 g peptone, 5 g NaCl, 15-20 g agar, 1000 mL water, with solution pH set at 7, and sterilized at 121 • C for 30 min.(2) Starch agar medium containing KNO 3 1 g; NaCl 0.5 g; (NH 4 ) 2 SO 4 1 g; K 2 HPO 4 5 g; MgSO 4 0.5 g; CaCl 2 0.1 g; with trace elements FeSO 4 0.01 g; MnSO4 0.003; ZnSO 4 0.003; solution pH set at 7. (3) PDA medium composed of 200 g potato, 20 g sucrose (or glucose), 15-20 g agar, 1000 mL water, and the solution pH set at 7. The potatoes were peeled, cut into pieces, boiled for half an hour, and filtered through gauze, then the sugar and agar were added, and the water was adjusted to 1000 mL after dissolving, with the solution pH set at 7, and finally sterilized at 121 • C for 20 min.For the sugar in the formula, sucrose was added for mold culture, while glucose was added for yeast, actinomycetes, and bacillus culture.

Screening of Dominant Strains
After culturing the microorganisms from the site on different media, most of the site's microorganisms were obtained.To identify microorganisms with growth advantages for future degradation of the contaminated site, well-growing single colonies are selected and picked from numerous colonies.These are initially streaked on fresh solid medium plates for isolation and cultivation.After 2 days, well-growing single colonies are selected again for streaking and isolation.The purified single colonies are then preserved in solid slant culture tubes.

Molecular Testing of Microbes
Microorganisms present in the samples were collected and their genomic DNA was extracted using the BS423 column genomic DNA isolation kit (Sangon Biotech, Shanghai, China), following the manufacturer's instructions.The variable region V3 of 16S rDNA was amplified with the forward primer F27 and the reverse primer R1492.F27 was designed with a 40-base GC clamp at the 5 ′ end to aid in the separation of DNA fragments.PCR was carried out in a 50 µL reaction mixture containing 0.2 mM of each primer, 0.2 mM of deoxynucleoside triphosphates (dNTP), 2 U of Taq DNA polymerase, and 1 µL of undiluted DNA.The PCR thermocycling program consisted of initial denaturation at 94 • C for 5 min, followed by 30 cycles of denaturation at 94 • C for 1 min, annealing at 55 • C for 1 min, extension at 72 • C for 45 s, and a final extension at 72 • C for 10 min.Subsequently, the amplified products were sequenced by a commercial DNA sequencing service (BGI Co., Beijing, China) to identify the microbial species.

Microbial Community Structure Analysis
Shannon, Simpson, and Pielou are three diversity indices that are used to calculate the carbon source utilization diversity of environmental microbial communities, assessing the species richness, the dominance of the most common species, and species in the environmental microbial community, respectively.
The Shannon index (H) is a function that combines the diversity of species and the number of species.The higher the value, the richer the community structure and diversity.The calculation formula for the Shannon index is: where P i = ni/N, ni is the relative luminosity of band i, N is the total relative absorbance of the lane, and S is the number of operational taxonomic units (OTUs) in the calculated sample.The Simpson index (D), also known as ecological dominance, reflects the role and status of dominant species in the community.For the same species, the higher the index, the simpler the community structure and diversity.Its calculation formula is: The Pielou index (E) is used to evaluate the evenness of species in a given sample.The closer its value is to 1, the smaller the difference in the number of species in the sample microbial community.The Pielou index is calculated as follows:

Determination of Molecular Markers
Molecular markers can be applied to fingerprint the distribution features of petroleum hydrocarbons and to provide additional information on the redox environment and the extent of degradation of the spilled oil.These indicators are the CPI, odd/even predominance (OEP), the ratio of low-molecular-weight hydrocarbon to high-molecular-weight hydrocarbon (L/H), unresolved complex mixture (UCM), the ratio of pristane to phystane (Pr/Ph), the ratio of pristane to n-C17 (hydrocarbon with 17 carbon atoms) (Pr/C17), and the ratio of phytane to n-C18 (hydrocarbon with 18 carbon atoms) (Ph/C18).The calculation equations for these markers are presented below:

Statistical Analysis
Principal component analysis (PCA) was employed to distinguish the distribution similarity and dissimilarity of sampling locations to reveal the contamination level and source of the different compounds.A data matrix comprising 13 sampling sites and 10 variables was used.The relationships between molecular markers and chemical indices were examined using canonical correlation analysis (CCA) to comprehensively explain the extent of biodegradation and redox condition.Statistical processing was performed with the CANOCO program for Windows 4.5.

Distribution of Hydrocarbons at the Site
The main source of petroleum hydrocarbon organic compounds in the groundwater at the research site is primarily various oil leaks that occurred during the oil drilling process, resulting in significant pollution after long-term accumulation.The main source of pollution at the site is an oil pit in the southeast corner, where petroleum contaminants collect and continuously infiltrate into the soil and groundwater through rainfall leaching and gravity.Moreover, this oil pit happens to be located upstream of the groundwater flow at the site, causing abandoned crude oil to migrate with the water flow, thereby leading to a larger area of pollution.When tested, the total petroleum hydrocarbons (TPHs) in the shallow groundwater ranged from 3.68 to 7.32 mg/L, with an average value of about 4.75 mg/L, which significantly exceeds the acceptable national water quality standard in China, demonstrating serious petroleum contamination in the study area (Table 1).The highest TPH concentration in the groundwater appeared at S1, with an average TPH value reaching 6.23 mg/L, while the lowest appeared at Z20, with an average TPH value reaching 4.26 mg/L.The average TPH concentration contours in different wells are shown in Figure 2. It is clearly shown that a contamination plume originated in the southeast of the site and then developed, along the path of groundwater flow, to the northwest.In order to discuss the specific distribution diversities of the different compounds, hydrocarbons were divided into three types, based on the results of GC-MS analysis and their different functional groups: alkane hydrocarbon (ALH), aromatic hydrocarbon (ARH), and unresolved complex mixture (UCM).

Alkane Hydrocarbons
Alkane hydrocarbons are a principal component of crude oil and contain various molecular formations with different carbon numbers.The individual ALH concentration in groundwater ranged from 1.26 to 4.79 mg/L, with an average of 65.6% in TPH, which displayed a distribution trend similar to that of TPH.The highest and lowest ALH concentrations were detected in S1 (4.79 mg/L, March) and Z10 (1.26 mg/L, July), with a carbon number from C6 to C30, including cycloalkanes (e.g., hexamethylene and heptamethylene), straight-chain hydrocarbons and their homologous compounds, and isoprenoid hydrocarbons (such as Pr and Ph).According to the analysis results, low-weight hydrocarbons played a dominant role in shallow groundwater, and their percentage of ALH ranged from 56.2% to 73.5%.This demonstrates the intrusion of a fresh petroleum source.In order to discuss the specific distribution diversities of the different compounds, hydrocarbons were divided into three types, based on the results of GC-MS analysis and their different functional groups: alkane hydrocarbon (ALH), aromatic hydrocarbon (ARH), and unresolved complex mixture (UCM).

Alkane Hydrocarbons
Alkane hydrocarbons are a principal component of crude oil and contain various molecular formations with different carbon numbers.The individual ALH concentration in groundwater ranged from 1.26 to 4.79 mg/L, with an average of 65.6% in TPH, which displayed a distribution trend similar to that of TPH.The highest and lowest ALH concentrations were detected in S1 (4.79 mg/L, March) and Z10 (1.26 mg/L, July), with a carbon number from C6 to C30, including cycloalkanes (e.g., hexamethylene and heptamethylene), straight-chain hydrocarbons and their homologous compounds, and isoprenoid hydrocarbons (such as Pr and Ph).According to the analysis results, low-weight hydrocarbons played a dominant role in shallow groundwater, and their percentage of ALH ranged from 56.2% to 73.5%.This demonstrates the intrusion of a fresh petroleum source.This may be due to the relatively high solubility in water that is frequently demonstrated by ALHs, which enables their migration with groundwater.While migrating in the aquifer, the ALH underwent various weathering progresses such as adsorption, evaporation, and biodegradation (discussed below), which explains the decreasing amount of ALH.

Aromatic Hydrocarbons
Aromatic hydrocarbons have a unique molecular structure that possesses a benzene ring.On the basis of ring numbers, aromatic hydrocarbons can be divided into two main categories: BTEX (one ring) and PAH (two or more rings) [53].The toxicity of petroleum contaminants mainly arises from aromatic compounds such as benzene, toluene, ethylbenzene, naphthalene, and anthracene, which are highly poisonous materials that were detected in every well.The concentration of aromatic hydrocarbons in the groundwater varied from 0.13 to 2.71 mg/L (more specifically, benzene, 0.01-0.31mg/L; ethylbenzene, 0.08-0.98mg/L).Aromatic compounds are highly volatile but demonstrate low dissolubility.This may be the reason for their rare occurrence around Z9, which was far removed from the contamination source.These intrinsic properties, weakening their migration ability in groundwater, resulted in a centralized region in S1 and Z1, indicating that these were the sources of petroleum.However, in regions of downward groundwater flow, such as Z10, the concentrations ranged from 0.39 to 0.78 mg/L, suggesting strong contamination.This may result from long-term submergence in a crude oil environment.

Unresolved Complex Mixture
Alongside ALH and ARH, the links between UCM and degraded petroleum are well known.UCM was also present in all samples from the shallow groundwater [54].Compared to alkanes, UCM is more resistant to biodegradation and, thus, has a greater tendency to remain and accumulate in the environment.UCM is generally considered to be composed of many intermediate products (such as ester and alcoholic hydrocarbons) that are produced during various weathering processes.Thus, the occurrence of high concentrations of UCM in the samples indicates that this area was polluted by the weathered products of petroleum hydrocarbons, while areas affected by fresh oil contamination tend to have relatively higher concentrations of n-alkanes.This was demonstrated by the diversity in their concentrations among different wells.For instance, wells near the oil-water pit had low UCM concentrations (S1, 0.60 mg/L; Z1, 0.66 mg/L), suggesting moderate microbial activities around the source area because new fresh compounds may not be able to fully come into contact with microbes.Conversely, after a long period of migration and reaction with many other chemicals, the UCM levels increased in the downstream areas of groundwater flow, such as at Z9 (1.72 mg/L) and Z10 (1.88 mg/L).

Molecular Marker Analysis
The CPI and OEP values of organic compounds in the water represent the dominance of odd/even carbon atom numbers in n-alkanes.The range of CPI values was 0.53-0.97,with the highest value observed at S1, the lowest at Z9, and an average of 0.78.The range of OEP values was 0.61-1.30,with the highest value occurring at S1, the lowest at Z21, and an average of 0.87 (Table 2).The CPI and OEP values in all the groundwater samples were both close to 1, which further proved the existence of petroleum contamination.In addition, there was a decreasing trend in CPI and OEP values along the groundwater flow direction; this finding was in perfect agreement with the TPH distribution.The detected results were statistically analyzed to describe the distribution characteristics of compounds and the similarity/dissimilarity of sampling locations.The PCA results showed that the first and second axes explained 86.0% and 6.5% (the cumulative percentage variance explained by the two axes was 92.5%) of the total variance, respectively (Figure 3).The positive value of PC1 on the horizontal axis indicated an association with Pr, Ph, BTEX, and PAH, which suggests fresh petroleum contamination, while the UCM, as defined by the negative value on the axis, suggests that the hydrocarbons that were present were mainly from biodegradation.The sampling locations had three main compositional patterns: First, wells in the first and second quadrants showed a positive relation with TPH, C16, Pr, Ph, and BTEX (high concentrations of all those chemicals suggest a strong influence from the nearby oil-water pit).Second, wells in the third quadrant (Z16, Z19, Z21, and Z22) demonstrated a medium contamination level, which may indicate a weathered area that is critically controlled by physical processes (such as volatilization and diffusion).Third, wells in the fourth quadrant revealed a positive relationship with UCM, which may be interpreted as the biodegraded component from fresh hydrocarbon inputs.

Biodegradation of Petroleum Hydrocarbons
Consumption by microbes as an energy and carbon source is a common sink pathway for petroleum hydrocarbons.However, the potential for biodegradation is quite variable, depending on the different molecular structures of the compounds.For example, isoprenoid hydrocarbons are more resistant to biodegradation than n-alkanes (such as C17 and C18).Thus, the Pr/C17 and Ph/C18 ratios in isoprenoid hydrocarbons are much higher than 1, indicating deeply degraded hydrocarbons [48].These ratios are used to evaluate the extent of biodegradation in groundwater.Chemical properties during biodegradation are also utilized to further explain material alterations.

Biodegradation of Petroleum Hydrocarbons
Consumption by microbes as an energy and carbon source is a common sink pathway for petroleum hydrocarbons.However, the potential for biodegradation is quite variable, depending on the different molecular structures of the compounds.For example, isoprenoid hydrocarbons are more resistant to biodegradation than n-alkanes (such as C17 and C18).Thus, the Pr/C17 and Ph/C18 ratios in isoprenoid hydrocarbons are much higher than 1, indicating deeply degraded hydrocarbons [48].These ratios are used to evaluate the extent of biodegradation in groundwater.Chemical properties during biodegradation are also utilized to further explain material alterations.

Biodegrading Microorganism Analysis
After cultivating the microorganisms in the groundwater at the site, 67 strains of oil-contaminated bacteria were extracted.Eight strains with excellent growth were selected, according to the culture medium type and growth characteristics.These eight strains included: three strains of bacteria (SX1, SX2, and SX3), three strains of actinomycetes (SF1, SF2, and SF3), and two strains of fungi (SZ1 and SZ2).We extracted DNA from the eight microorganisms and performed gel electrophoresis and PCR, followed by electrophoresis detection, and finally conducted gene sequencing on the samples.The obtained sequences were submitted to GenBank, and the online BLAST tool provided by NCBI was used to compare the sample gene sequences with those in the database for similarity.The 16S rDNA gene sequences of the eight strains and their closest species with high similarity in GenBank were shown in Table 3.These strains have been reported to possess strong organic matter degradation capabilities [55][56][57][58].After plate culture, we used an optical microscope to observe the typical characteristics of each strain, as shown in Figure 4. SX1 is Gram-negative, appearing in long strips.Characteristics such as flagella, spores, and capsules can be observed.SX2 is Gram-positive; sporulation can be observed, with no flagella and capsule, showing a needle shape and sparse distribution.Comparing the characteristics of the three bacterial strains, it can be seen that SX3 (hook-shaped) has the greatest number, and the strain size is between the other two strains, which means that the bacteria can more easily attach to TPH.Although SX1 has a small number of bacteria, the bacteria are neatly arranged and elongated, giving more reaction space than with SX2.Among the actinomycetes, SF3 is the most numerous, and its surface is partly wrinkled.However, SF2 has many wrinkles on its surface and has spore filament branches growing on it.Its bacterial body distribution is relatively loose, which can provide a larger space for the degradation of TPH.In contrast, the bacterial cells of SF1 are smaller and more scattered, which is not conducive to the degradation of TPH.Fungi can be observed with septate hyphae, non-septate hyphae, and conidia.SZ1 has the largest number, a smaller size, and a wrinkled bacterial surface, and finds it easier to react with TPH than SZ2.To sum up, SX3, SF2, and SZ1 are all smaller in size and have a larger number of bacteria, so they have a greater specific surface area and a relatively favorable reaction space, which provides good conditions for the degradation of TPH.
Figure 5 compares the diversity index (Shannon, Simpson, and Pielou) of January (winter and dry season) and July (summer and wet season) in groundwater at the study sites.There is a good negative correlation between the Shannon index and the Simpon index, indicating that the richer the microbial community population in the groundwater, the lower the ecological dominance, and these findings were consistent with those reported in [18,59].The Pielou index and the Simpon index are negatively correlated, but the degree of correlation is not as high as for the Shannon index, mainly because the Pielou index depends largely on the number of species in the system; under the same conditions the difference in ecological dominance between species-rich and species-less systems is not obvious.
SX1 has a small number of bacteria, the bacteria are neatly arranged and elongated, giving more reaction space than with SX2.Among the actinomycetes, SF3 is the most numerous, and its surface is partly wrinkled.However, SF2 has many wrinkles on its surface and has spore filament branches growing on it.Its bacterial body distribution is relatively loose, which can provide a larger space for the degradation of TPH.In contrast, the bacterial cells of SF1 are smaller and more scattered, which is not conducive to the degradation of TPH.Fungi can be observed with septate hyphae, non-septate hyphae, and conidia.SZ1 has the largest number, a smaller size, and a wrinkled bacterial surface, and finds it easier to react with TPH than SZ2.To sum up, SX3, SF2, and SZ1 are all smaller in size and have a larger number of bacteria, so they have a greater specific surface area and a relatively favorable reaction space, which provides good conditions for the degradation of TPH.index, indicating that the richer the microbial community population in the groundwater, the lower the ecological dominance, and these findings were consistent with those reported in [18,59].The Pielou index and the Simpon index are negatively correlated, but the degree of correlation is not as high as for the Shannon index, mainly because the Pielou index depends largely on the number of species in the system; under the same conditions the difference in ecological dominance between species-rich and species-less systems is not obvious.
In the direction of groundwater flow from S1 to Z9 (section 1), as the concentration of TPH decreases, the Shannon index shows a gradually increasing trend, indicating that the diversity of microorganisms was increasing, which is consistent with the change trend of the measured number of microorganisms (Figure 6).This result shows that as pollutants migrate in the groundwater, they are continuously degraded by microorganisms, causing their number and diversity to increase.Similarly, the Simpson index gradually decreased, indicating that the microbial structure and diversity in groundwater became increasingly abundant.The spatiotemporal difference of the Pielou index did not change much, which is related to the low microbial content at the site.
Seasonal comparison shows that the Shannon index in S1, Z1, and Z7 are low in winter and high in summer, while the Simpson index is high in winter and low in summer, both indicating that the microbial diversity is higher in summer than in winter.It is clearly demonstrated that rising temperature is conducive to the growth and reproduction of microorganisms.However, the opposite situation occurs in Z16 and Z9 (the Shannon index is higher in winter than in summer, while the Simpson index is lower in winter than in summer).The reasons for this phenomenon may be that Z16 and Z9 are far away from the pollution leakage point and are downstream of the site, which means that they can be affected by dilution from upstream water.In the direction of groundwater flow from S1 to Z9 (Section 1), as the concentration of TPH decreases, the Shannon index shows a gradually increasing trend, indicating that the diversity of microorganisms was increasing, which is consistent with the change trend of the measured number of microorganisms (Figure 6).This result shows that as pollutants migrate in the groundwater, they are continuously degraded by microorganisms, causing their number and diversity to increase.Similarly, the Simpson index gradually decreased, indicating that the microbial structure and diversity in groundwater became increasingly abundant.The spatiotemporal difference of the Pielou index did not change much, which is related to the low microbial content at the site.
Seasonal comparison shows that the Shannon index in S1, Z1, and Z7 are low in winter and high in summer, while the Simpson index is high in winter and low in summer, both indicating that the microbial diversity is higher in summer than in winter.It is clearly demonstrated that rising temperature is conducive to the growth and reproduction of microorganisms.However, the opposite situation occurs in Z16 and Z9 (the Shannon index is higher in winter than in summer, while the Simpson index is lower in winter than in summer).The reasons for this phenomenon may be that Z16 and Z9 are far away from the pollution leakage point and are downstream of the site, which means that they can be affected by dilution from upstream water.

Spatial Features
The biodegradation of petroleum hydrocarbons can vary significantly in terms of the reaction pathways, reaction extents, and produced intermediates, due to environmental influences.According to the PCA results, petroleum contamination can be divided into three sections on the basis of contamination levels: section 1, including S1, Z1, Z7, Z16, and Z9 wells, exhibits the effects of groundwater flow and a diversity of microbial activities; section 2 characterizes the central region of the area; and section 3 is characterized by degraded contamination.The correlation analysis results of PCA at the sampling sites clearly indicated that the classification (grouping Z16, Z19, Z21, and Z22 as similar sites and Z9, Z10, Z11, and Z20 as similar sites) was consistent with the positional classification results of the sampling wells in the study area, which were related to the distance from the pollution source.These results also represented the gradient positional classification of groundwater petroleum pollutants forming pollution plumes.The sample analysis results, depicting the spatial features of biodegradation, are shown in Figure 6.
Figure 6 clearly demonstrates that along the path of groundwater flow (section 1), the levels of short-chain compounds decreased, while heavy-chain compounds accumulated, resulting in a decrease in the L/H ratio from 5.43 (S1) to 0.98 (Z20) in winter and from 2.59 (S1) to 2.14 (Z20) in summer.This trend matches well with the conventional hierarchies of petroleum biodegradation, which begin with the loss of straight-chain alkanes, followed sequentially by acyclic isoprenoids and highly branched and cyclic saturated hydrocarbons.This conclusion is supported by the decreasing L/H ratio due to the continuous loss of low-weight compounds.In addition, the increasing ratio values of Pr/C17 and Ph/C18 explain the finding that while petroleum hydrocarbons were migrating with the groundwater, they were constantly consumed by microbes, leading to the growth of the microbial population (S1: 1.70 × 10 6 CFU/mL in winter and 2.15 × 10 6 CFU/mL in summer, Z9: 3.18 × 10 6 CFU/mL in winter and 4.14 × 10 6 CFU/mL in summer) and the compositional alteration of compounds.
The hydrocarbon composition in groundwater samples from wells in section 2 and section 3 changed only slightly.This may be because both section 2 and section 3 were not located in the direction of groundwater flow and were not affected by the flushing action

Spatial Features
The biodegradation of petroleum hydrocarbons can vary significantly in terms of the reaction pathways, reaction extents, and produced intermediates, due to environmental influences.According to the PCA results, petroleum contamination can be divided into three sections on the basis of contamination levels: Section 1, including S1, Z1, Z7, Z16, and Z9 wells, exhibits the effects of groundwater flow and a diversity of microbial activities; Section 2 characterizes the central region of the area; and Section 3 is characterized by degraded contamination.The correlation analysis results of PCA at the sampling sites clearly indicated that the classification (grouping Z16, Z19, Z21, and Z22 as similar sites and Z9, Z10, Z11, and Z20 as similar sites) was consistent with the positional classification results of the sampling wells in the study area, which were related to the distance from the pollution source.These results also represented the gradient positional classification of groundwater petroleum pollutants forming pollution plumes.The sample analysis results, depicting the spatial features of biodegradation, are shown in Figure 6.
Figure 6 clearly demonstrates that along the path of groundwater flow (Section 1), the levels of short-chain compounds decreased, while heavy-chain compounds accumulated, resulting in a decrease in the L/H ratio from 5.43 (S1) to 0.98 (Z20) in winter and from 2.59 (S1) to 2.14 (Z20) in summer.This trend matches well with the conventional hierarchies of petroleum biodegradation, which begin with the loss of straight-chain alkanes, followed sequentially by acyclic isoprenoids and highly branched and cyclic saturated hydrocarbons.This conclusion is supported by the decreasing L/H ratio due to the continuous loss of lowweight compounds.In addition, the increasing ratio values of Pr/C17 and Ph/C18 explain the finding that while petroleum hydrocarbons were migrating with the groundwater, they were constantly consumed by microbes, leading to the growth of the microbial population (S1: 1.70 × 10 6 CFU/mL in winter and 2.15 × 10 6 CFU/mL in summer, Z9: 3.18 × 10 6 CFU/mL in winter and 4.14 × 10 6 CFU/mL in summer) and the compositional alteration of compounds.
The hydrocarbon composition in groundwater samples from wells in Sections 2 and 3 changed only slightly.This may be because both Sections 2 and 3 were not located in the direction of groundwater flow and were not affected by the flushing action and migration of groundwater.Instead, they only experienced processes such as dispersion and degradation.However, the Pr/C17 and Ph/C18 values were also maintained at around 1, indicating that biodegradation was occurring.

Temporal Features
Seasonal changes result in dramatic variations in environmental factors such as temperature, atmospheric pressure, and microbial activity.More importantly, the groundwater table varies due to precipitation.The results of the groundwater compositional analysis for Z1 from January to July are shown in Figure 7 these demonstrate the clear influence of seasonal changes.(1) Winter conditions: The study area is a cold region in China, where 1-2 m of surface soil is frozen during the winter, and contaminants in the soil cannot penetrate, leading to a decrease in TPH.In addition, microbial activity decreases in cold conditions; therefore, the Pr/C17 and Ph/C18 values were low.(2) Spring conditions: The rising temperature from March to April melted the frozen soil and increased the amount of infiltration contaminants that were originally retained in the soil.Then, the concentration of low-weight hydrocarbons rose from 4.50 mg/L to 4.81 mg/L, which suggested fresh petroleum input.(3) Summer conditions: Precipitation in this area commonly happens in June and July; these months are also a period of dramatic increases in temperature.These climate changes resulted in a dilution of contaminant concentration.However, the increasing values of Pr/C17 and Ph/C18 ratios indicate intense biodegradation, which results in microbial accumulation.Other factors, such as the temperature and water table elevation, also validate this conclusion.and migration of groundwater.Instead, they only experienced processes such as dispersion and degradation.However, the Pr/C17 and Ph/C18 values were also maintained at around 1, indicating that biodegradation was occurring.

Temporal Features
Seasonal changes result in dramatic variations in environmental factors such as temperature, atmospheric pressure, and microbial activity.More importantly, the groundwater table varies due to precipitation.The results of the groundwater compositional analysis for Z1 from January to July are shown in Figure 7 these demonstrate the clear influence of seasonal changes.(1) Winter conditions: The study area is a cold region in China, where 1-2 m of surface soil is frozen during the winter, and contaminants in the soil cannot penetrate, leading to a decrease in TPH.In addition, microbial activity decreases in cold conditions; therefore, the Pr/C17 and Ph/C18 values were low.(2) Spring conditions: The rising temperature from March to April melted the frozen soil and increased the amount of infiltration contaminants that were originally retained in the soil.Then, the concentration of low-weight hydrocarbons rose from 4.50 mg/L to 4.81 mg/L, which suggested fresh petroleum input.(3) Summer conditions: Precipitation in this area commonly happens in June and July; these months are also a period of dramatic increases in temperature.These climate changes resulted in a dilution of contaminant concentration.However, the increasing values of Pr/C17 and Ph/C18 ratios indicate intense biodegradation, which results in microbial accumulation.Other factors, such as the temperature and water table elevation, also validate this conclusion.

Canonical Correspondence Analysis (CCA) of Indicators
Molecular markers can be used to provide a comprehensive explanation of compound reaction processes that are strongly controlled by environmental chemicals.Therefore, these two types of parameters should have an intrinsic relationship that can be combined to interpret changeable environmental conditions.The relationships between molecular markers and chemical indices were examined using canonical correspondence analysis (CCA).The molecular indices considered were Pr/Ph, Pr/C17, Ph/C18, and L/H, while the chemical markers were dissolved oxygen (DO), electrical conductivity (EC), the oxidation-reduction potential (ORP), pH, SO 4 2− , Fe 2+ , Mn 2+ , and the microbial population.A summary of statistics and a CCA ordination biplot are shown in Table 4 and Figure 8. lecular markers and chemical indices were examined using canonical correspondence analysis (CCA).The molecular indices considered were Pr/Ph, Pr/C17, Ph/C18, and L/H, while the chemical markers were dissolved oxygen (DO), electrical conductivity (EC), the oxidation-reduction potential (ORP), pH, SO4 2− , Fe 2+ , Mn 2+ , and the microbial population.A summary of statistics and a CCA ordination biplot are shown in Table 4 and Figure 8.  4 shows that the correlation index between the molecular markers and chemical indices was high (0.962 and 0.916), demonstrating that these indicators are closely related and showing the high likelihood of their efficacy as environmental indicators.Table 4 shows that the correlation index between the molecular markers and chemical indices was high (0.962 and 0.916), demonstrating that these indicators are closely related and showing the high likelihood of their efficacy as environmental indicators.
Figure 8 clearly demonstrates that DO was positively related to Pr/C17 and Ph/C18 but negatively related to L/H, explaining the consumption of DO during the biodegradation of low-weight hydrocarbons such as C17 and C18, which increased the value of Pr/C17 and Ph/C18 while it decreased the value of L/H.For the same reason, the environment changed to reducing conditions, as proven by the lowered value of Pr/Ph.Furthermore, as biodegradation was accompanied by the growth and breeding of microbes, the microbial population rose dramatically.Fe 2+ , Mn 2+ , and SO 4 2− are signals of reducing conditions, and their negative relationship with DO is clearly shown.

Conclusions
This article comprehensively explains the levels and sources of petroleum contamination in an oilfield and discusses the extent of biodegradation by coupling molecular markers, microbial indices (population of microbes, types of stains, diversity index of microbes), and chemical indices.Quantitative and qualitative detections made via GC-MS, as well as the deduced molecular markers (CPI, OEP, Pr/Ph, etc.), revealed that the petroleum contamination was mainly caused by the oil-water pit, and the contamination plume was strongly controlled by the groundwater flow and biodegradation process.S1 and Z1 were the closest monitoring wells to the oil-water pit and demonstrated the highest TPH concentrations, which were primarily dominated by ALH.Groundwater flow was a key factor in the transportation of petroleum pollution.The data from the monitoring wells in Section 1 (S1→Z1→Z7→Z16→Z9, along the direction of groundwater flow) showed that the pollution concentration of ALH gradually decreased but the UCM was enriched.Those conclusions agreed with the results of the PCA and CCA, which suggested that different well sections had different compositional patterns due to their location and unique redox conditions.CCA revealed that molecular alterations were accompanied by chemical changes in terms of oxygen consumption and the accumulation of bioproducts during the microbial degradation of hydrocarbons.The spatial and temporal features of biodegradation were proven by the Pr/C17, Ph/C18, and L/H ratios, acting as biomarkers based on the biodegradation-resistant dissimilarities of different compounds.More specifically, isoprenoids were more resistant to biodegradation than the low-weight hydrocarbons, and seasonal factors such as temperature and precipitation influenced this process in terms of microbial activity; thus, the increased Pr/C17 and Ph/C18 and decreased L/H values were indicators of biodegradation.According to the diversity analysis of microorganisms, the Shannon index and Simpson index both indicated that the microbial diversity is higher in summer than in winter, illustrating that the microbial degradation of pollutants was stronger during the wet season (summer) than in the dry season (winter); furthermore, during the wet season, due to rainfall infiltration, the groundwater pollution plume was more pronounced in the direction of groundwater flow.
Through this study, it can be observed that although groundwater petroleum pollution poses challenges such as diverse pollutant types, significant differences in properties, and difficulty in remediation, the organic components of petroleum pollution can serve as analytical markers for tracing pollution sources, tracking pollutant migration pathways, and assessing the degree of biodegradation.This can also serve as an important scientific basis for selecting differential treatment methods, such as using air sparging (AS) in volatilesubstance-rich areas, permeable reactive barriers (PRB) in areas along the direction of groundwater flow, and biological degradation in areas with refractory UCM.Ultimately, this research provides a scientific basis for efficient and cost-effective groundwater petroleum pollution remediation schemes.

Figure 1 .
Figure 1.Location of sampling wells and a typical hydrogeological profile (A-A1) of the contaminated oil field.

Figure 2 .
Figure 2. Average TPH contours and the CPI and OEP values of groundwater at each sampling well.

Figure 2 .
Figure 2. Average TPH contours and the CPI and OEP values of groundwater at each sampling well.

Figure 3 .
Figure 3. Scatter plot of hydrocarbon concentrations and molecular markers from PCA analysis.

Figure 3 .
Figure 3. Scatter plot of hydrocarbon concentrations and molecular markers from PCA analysis.

Figure 4 .
Figure 4. Observation of the morphologies of different strains.

Figure 5
Figure 5 compares the diversity index (Shannon, Simpson, and Pielou) of January (winter and dry season) and July (summer and wet season) in groundwater at the study sites.There is a good negative correlation between the Shannon index and the Simpon

Figure 4 .
Figure 4. Observation of the morphologies of different strains.

Figure 5 .
Figure 5. Seasonal comparison of microbial diversity indices in groundwater along the flow direction at the site.

Figure 5 .
Figure 5. Seasonal comparison of microbial diversity indices in groundwater along the flow direction at the site.

Figure 6 .
Figure 6.The distribution of hydrocarbons and molecular markers in different sections.

Figure 6 .
Figure 6.The distribution of hydrocarbons and molecular markers in different sections.

Figure 7 .
Figure 7. Distribution and variation characteristics of petroleum contaminants in groundwater at Z1.

Figure 7 .
Figure 7. Distribution and variation characteristics of petroleum contaminants in groundwater at Z1.

Figure 8 .
Figure 8. Relation between molecular markers and chemical indices from CCA analysis.

Figure 8
Figure 8  clearly demonstrates that DO was positively related to Pr/C17 and Ph/C18 but negatively related to L/H, explaining the consumption of DO during the biodegradation of low-weight hydrocarbons such as C17 and C18, which increased the value of Pr/C17 and Ph/C18 while it decreased the value of L/H.For the same reason, the environment changed to reducing conditions, as proven by the lowered value of Pr/Ph.Furthermore, as biodegradation was accompanied by the growth and breeding of microbes, the

Figure 8 .
Figure 8. Relation between molecular markers and chemical indices from CCA analysis.

Table 1 .
Concentrations of TPH and different types of compounds at the sampling wells.

Table 2 .
Average concentrations of petroleum hydrocarbons, inorganic parameters, and molecular marker values.

Table 3 .
Gene sequence analysis of microbes.

Table 4 .
Summary of the statistical results of the CCA.

Table 4 .
Summary of the statistical results of the CCA.