Determination and Similarity Analysis of PM2.5 Emission Source Profiles Based on Organic Markers for Monterrey, Mexico

Source attribution of airborne particulate matter (PM) relies on a host of different chemical species. Organic molecular markers are particularly useful for estimating source contributions to the fine PM fraction (i.e., PM2.5). Although there are many source apportionment studies based on organic markers, they rely heavily on the few studies that report region-specific emission profiles. Source attribution efforts, particularly those conducted in countries with emerging economies, benefit from ad hoc information to conduct the corresponding analyses. In this study, we report organic molecular marker source profiles for PM2.5 emitted from 12 major source types from five general source categories (meat cooking operations, vehicle exhausts, industries, biomass and trash burning, and urban background) for the Monterrey Metropolitan Area (Mexico). Source emission samples were obtained using a ground-based source-dominated sampling approach. Filter-based instruments were utilized, and the loaded filters were chemically characterized for organic markers by GC-MS. Levoglucosan and cholesterol dominate charbroiled-cooking operation sources while methoxyphenols, PAHs and hopanes dominate open-waste burning, vehicle exhaust and industrial emissions, respectively. A statistical analysis showed values of the Pearson distance < 0.4 and the similarity identity distance > 0.8 in all cases, indicating dissimilar source profiles. This was supported by the coefficient of divergence average values that ranged from 0.62 to 0.72. These profiles could further be utilized in receptor models to conduct source apportionment in regions with similar characteristics and can also be used to develop air pollution abatement strategies.


Introduction
Fine organic carbon (OC) aerosol plays a major role in environmental impacts and health risks derived from air pollution exposure [1][2][3]. As part of the urban air pollution by PM2.5 (fine particles with aerodynamic diameters less than 2.5 µm), the levels of OC have become a central issue in decision making for improving air quality in urban areas [4,5]. Fine OC is frequently the major fraction of the ambient PM2.5 mass in urban environments [6,7]. Fine OC is a complex mixture of hundreds of organic compounds that are directly emitted or generated by atmospheric chemical processes [8][9][10]. Although some of these organic compounds are toxic and carcinogenic [11], some can be useful as markers to perform source attribution studies at specific sites. These chemical markers are present in relatively high concentrations in emissions from specific sources and in low concentrations in other sources, and they react slowly enough in the atmosphere that their origins can be tracked during their transport from the sources to the receptor sites [12][13][14]. Organic markers that have been used in the past include organic compounds from the families of sugars, steroids, methoxyphenols, resin acids, n-alkanes, n-alkanoic acids, hopanes (pentacyclic triterpenes), and polycyclic aromatic hydrocarbons (PAHs) [15].
Much research related to source apportionment studies has focused on using trace elements, particularly metals, as chemical markers (e.g., [16]); however, most metals can be emitted from multiple types of sources, and thus source apportionment using only elemental data has become difficult [14]. The use of organic compounds in source apportionment studies has the advantage that some of them are exclusively characteristic of specific chemical source fingerprints and can be used in the source attribution process [14]. Thus, the use of organic markers has been helpful in distinguishing types of source emissions in source apportionment studies around the world (e.g., [17]). In particular, the use of organic markers to construct source profiles and their application to source attribution using a chemical mass balance approach has proven fruitful [4].
Despite the proven usefulness of source profiles constructed from organic markers, few studies have developed these for emission sources specific to regions that experience severe air pollution episodes in countries with emerging economies [17,18]. Although several source apportionment approaches do not explicitly require chemical source fingerprints to perform the analysis (e.g., positive matrix factorization, PMF), they do benefit from a benchmark to support the interpretation of the results. If a chemical mass balance approach is followed, source profiles are a fundamental part of the attribution process. A normal practice has been to assume that a set of source profiles derived for a given region or regions can be used in a source attribution study for another region; however, it has been recognized that the exact nature of similar sources can vary substantially from region to region, and thus the use of different sets of non-specific regional source profiles in a given attribution study can lead to significant differences among receptor model applications (e.g., [19]). Therefore, in an international context, the availability of organic source profile data must be extended to areas where no previous information is available.
This work provides an initial data set of organic source profiles based on the chemical characterization of fine OC for 12 major emission source types from five general source categories in the Monterrey Metropolitan Area (MMA). The MMA is the second-most populous urban center in Mexico and has air quality problems due to high concentrations of PM10 and PM2.5. It has been estimated that fine OC represents the major fraction of the PM2.5 found in the ambient air of the MMA, accounting for 36-71% of the average PM2.5 mass [20,21]. The chemical characterization of the ambient fine OC in the MMA has been scarcely studied and has focused mainly on PAHs [22,23]. Only one study has included a wide chemical characterization of organic compounds [24]; it is also the only source apportionment study based on the organic markers approach conducted in the MMA and the second of its kind in the country after the one reported by Stone et al. [25]. In addition to deriving necessary region-specific organic source profile data for countries with emerging economies, statistical tests were performed to provide insight into the uniqueness of the obtained profiles, thus ensuring source representativeness. The obtained source profiles can be used to conduct source apportionment studies in the MMA or other locations with similar emission sources.

Study Area and Sampling
The MMA, the second largest urban center of Mexico, was selected for this study (Figure 1). The potential types of sources that contribute to PM2.5 concentrations in the MMA were identified from previous studies [24,26] and official emission inventories [27]. Source samples were collected in June and July of 2014 to represent five general source categories: (1) meat cooking operations, (2) vehicle exhausts, (3) industries, (4) biomass and trash burning, and (5) urban background.
A general scheme of the methodology for this study is shown in Figure 2. All the samples were collected on Teflon and quartz fiber filters following a ground-based source-dominated approach for source profile characterization (e.g., [28,29]). Before sampling, Teflon filters were weighed by gravimetry while quartz fiber filters were pre-fired for 8 h at 600 °C in a furnace to remove residual carbon. Teflon and quartz fiber filters were stored in petri-slides and in baked aluminum foil within sealed plastic bags, respectively, until they were used. During the sampling, two filter-based instruments were used in parallel to collect the samples from each source emission. The first one was a MiniVol low-volume sampler (Airmetrics, Springfield, OR, USA; flow rate of 5-7 L/min) to collect samples on 47 mm Teflon filters (Whatman, Maidstone, UK), and the second one was a high-volume sampler (TE-6070-2.5, Tisch Environmental, flow rate of 1100-1200 L/min; Cleves, OH, USA) to collect samples on 20 × 25 cm micro-quartz filters (Whatman QM-A; Maidstone, UK). The two samplers were calibrated before each test by using NIST-traceable certified orifice calibrators MNF-1236 and TE-5040 for the MiniVol and Hi-Vol samplers (Airmetrics, Springfield, OR, United States), respectively. After sampling, collected filters were placed in a cooler with blue ice for immediate transport from the sampling site to the laboratory. All collected filters were stored at −20 °C in the dark until they were analyzed. The sampling sites for each source category and their descriptions are shown in Table 1.

Sampling Sites Description
Each sampling site corresponded to a hot spot highly impacted by each source emission type. Preliminary sampling tests were conducted to estimate the proper location and sampling time for each source emission (Table 1) based on the saturation of the filters. A total of 30 individual profile samples were collected and depending on particle loading, the sampling duration varied from 0.9 h for open waste burning to 15.2 h for the suburban area at nighttime.
Meat cooking emissions were collected based on the most common forms of cooking operations in the MMA. These samples included emissions from restaurants that use natural gas for heating (hot plate) purposes and seed oils for food cooking as well as charbroiled meat services at supermarkets and residential charbroiled meat activities. More specifically, the residential cooking test was conducted by cooking a specific combination of meat types and other food complements (e.g., sausage, onions, cheese) using an outdoor charbroiling system. Two samples were obtained for each type of cooking source; all tests involved the cooking of meat from beef, pork, sausage, and other typical food dishes associated with the traditional charbroiled cooking operations in the MMA. Based on both previous studies [28,29] and the infrastructure of the cooking operation system, the sampling equipment was positioned approximately 1.5 m downwind from each source.
Motor vehicle exhaust samples were collected for gasoline- and diesel-powered vehicles from a road tunnel and through side-road sampling, respectively. In both cases, the samplers were placed on a sidewalk close to the traffic lanes, with the sampling inlet placed at 1.5 m above ground level. For the gasoline-powered vehicle emissions, the same site and general procedure described by Mancilla and Mendoza [30] were followed. These samples were collected from a two-bore urban tunnel (532 m long) in which the traffic is composed of 98% gasoline-powered vehicles, most of them light-duty passenger cars. The samplers were located approximately 200 m downstream from the inlet of the north-south bore, which has a positive slope of 3.5%. The diesel-powered vehicle exhausts were divided into two categories: urban (public and private buses) and freight transport (at least four-axle single units and trailers). Information about the main transit routes in the MMA with high urban and freight transport traffic was provided by the Monterrey Department of Public Safety and Roads. From these data, two street locations were selected for sampling these source emissions. Samples were collected during the hours in which the fleet of diesel-powered vehicles was significantly larger (400 buses/h and 200 trailers/h) than other means of transportation (mainly gasoline-powered vehicles); this was determined from vehicle counts previously conducted in the selected avenues.
The biomass and trash burning profile was obtained by ground-based source-dominated sampling in the plumes of real open-waste burning emissions. The waste burned consisted of wood, grass, leaves, plastics, cardboard, paper, textiles, and household materials. The samples were collected at 2 m downwind from the open fire, where the smoke particles directly impacted the sampler. This procedure was conducted in the suburbs of the MMA, where these types of burnings occur, randomly tracking burning events as they occurred.
The urban background profile was obtained by sampling in a country villa (approximately 60,000 m²) located in the suburban area of the MMA, 30 km southeast of the downtown area and upwind from the urban core with respect to the typical prevailing wind patterns in the region (i.e., east-west). This villa is far from the urban area, thus allowing for the isolation of environmental background emissions. This site is a natural area for recreational purposes (uninhabited), and it is surrounded by shrubs, grass, and woody trees. Sampling was conducted when these activities were not occurring to minimize their influence on the profile. Secondary organic aerosol (SOA) formation due to the photochemical activity of particles produced over natural landscapes can be expected [31]. Samples were collected for daytime and nighttime to identify a possible SOA contribution in the collected samples. The samplers were placed at 4.5 m above the ground on a small structure rooftop.
Industrial profiles were developed for oil refinery and general manufacturing industry emissions. For the oil refinery, two samples were collected from a sampling site located 3 km downwind from the source. The main emissions at this sampling site come from an oil refinery of 7.7 km² that is composed of more than 30 facilities that process around 110 thousand barrels of petroleum per day to produce mainly gasoline and diesel fuels and other petrochemical products. The other industrial sector was a typical industrial park of the MMA with several manufacturing companies in the fields of automotive assembly, electrical appliances, cosmetics, and petrochemical products. Unlike the case for the oil refinery, the samples were collected from a sampling site located within 1 km downwind from these industrial sources within the industrial park premises.
The urban construction profile was developed by setting up the samplers close to an urban construction area. This area is influenced by particles from paved roads, concrete, and other construction operations of a new subway line for local transportation. Significant fugitive dust emissions occurred from the construction activities: material handling and road-resuspension by local and close-by traffic.

Gravimetry and OC Determination
The loaded Teflon filters were analyzed for mass by gravimetry using a Sartorius ME5 microbalance (Goettingen, Germany) (±1 µg) in a weighing chamber with a controlled temperature (21 ± 2 °C) and relative humidity (35 ± 5%) for 24 h to obtain the total collected particulate matter.
The micro-quartz filters were analyzed for OC and elemental carbon (EC) by thermal-optical transmittance (TOT) using a Sunset Laboratory thermo-optical carbon analyzer (Model 4L; Tigard, USA) following the NIOSH 5040 guidelines from the US EPA. A standard punch (1.5 cm²) was extracted from a loaded quartz microfiber filter and placed in an oven using a quartz boat. The oven was initially purged with He. OC was then thermally desorbed in an inert atmosphere (pure He) over time steps of between 45 s and 300 s. The evolved OC was catalytically oxidized to CO2 in a manganese dioxide (MnO2) oxidizing oven; subsequently, the CO2 was swept out of the oxidizing oven with the He stream, reduced to CH4 in a Ni/firebrick methanator, and quantified as CH4 by a flame ionization detector (FID). OC was tracked at four temperature steps during the specified time frame: 310 °C, 475 °C, 615 °C, and 870 °C. The EC analysis was conducted using temperature steps with hold times of 45 s and a final hold time of 120 s at 870 °C in an oxidizing atmosphere (He:O2 90:10 v/v). EC was oxidized from the filter into the oxidation oven, converted into CO2, reduced to CH4, and detected by FID as CH4. During this stage, a pyrolysis correction was made. A split point must be defined to separate OC from EC and quantify these components. The split point is defined as the point at which the light transmittance of the sample returns to its initial value. The carbon that evolved before or after the split point was considered OC or EC, respectively [32].
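The split-point rule described above can be illustrated with a short sketch. It is a simplified illustration, not the analyzer's algorithm: the function names, the per-step data layout, and the example numbers are all hypothetical, and the search simply looks for the first point after the transmittance minimum at which the signal is back at or above its initial value.

```python
def split_index(transmittance):
    """Index of the first point, after the transmittance minimum, at which the
    signal is back at or above its initial value (assumed split-point rule)."""
    initial = transmittance[0]
    # Position of the darkest point (maximum charring of the filter).
    i_min = min(range(len(transmittance)), key=transmittance.__getitem__)
    for i in range(i_min + 1, len(transmittance)):
        if transmittance[i] >= initial:
            return i
    return None  # transmittance never recovered


def oc_ec(carbon, transmittance):
    """Sum the evolved carbon before (OC) and from (EC) the split point."""
    s = split_index(transmittance)
    if s is None:
        raise ValueError("no split point found")
    return sum(carbon[:s]), sum(carbon[s:])


# Hypothetical per-step carbon masses (µg) and laser transmittance readings.
carbon = [2.0, 3.0, 1.0, 1.0, 2.0, 4.0, 1.0]
transmittance = [1.0, 0.9, 0.7, 0.6, 0.8, 1.0, 1.2]
oc, ec = oc_ec(carbon, transmittance)
print(oc, ec)
```

In this made-up trace the transmittance recovers at the sixth step, so the first five steps are counted as OC and the remainder as EC.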

Organic Markers Characterization
Solvent-extractable organic markers were quantified by GC/MS. The solvents used were dichloromethane (DCM) and methanol (MeOH), both at high purity of ≥99.9% (Fisher Scientific Optima grade). Individual filters were cut into pieces and spiked with 50 µL of the following deuterated internal standards (isotopic purity of 98-99 atom % D, Sigma Aldrich): n-hexadecane-d34, n-hexatriacontane-d74, n-eicosane-d42, n-triacontane-d62, n-tetracosane-d50, vanillin-d3, benzophenone-d5, dibenzo(a,h)anthracene-d14, naphthalene-d8, chrysene-d12, benzo(e)pyrene-d12, coronene-d12, 1,4-dichlorobenzene-d4, acenaphthylene-d10, perylene-d12, benzaldehyde-2,3,4,5,6-d5, decanoic acid-d19, palmitic acid-d31, stearic acid-d35, terephthalic acid-d4, levoglucosan-13C6, cholesterol-d6, and 1,4-benzoquinone-d4. Thereafter, a DCM extraction was conducted three times. During each extraction, enough DCM was added to fully cover the filters, and then ultrasonic agitation was applied for 20 min using a sonicator (Bransonic®, 5510R-DTH; Brookfield, CT, USA). The extracts were combined and then concentrated by evaporation under a low flow of ultra-high-purity nitrogen until the extract reached a volume of ~5 mL. The extracts were filtered through a pre-fired quartz filter and subsequently reduced in volume using the same flow of high-purity nitrogen to obtain a final extract of 250 µL. The extracts were then separated into three fractions: one fraction was analyzed directly by GC/MS, and the other two were chemically derivatized before being introduced to the GC/MS. One derivatization was methylation using diazomethane (CH2N2) to convert carboxylic acids to their respective methyl esters; 50 µL of a CH2N2 solution was combined in a vial with 50 µL of extract. The other derivatization was silylation using a combination of N,O-bis(trimethylsilyl)trifluoroacetamide (BSTFA) and trimethylchlorosilane (TMCS) to convert sterols and sugars to their respective trimethylsilyl ethers; 50 µL of BSTFA+TMCS (molar ratio 99:1) was combined with 50 µL of extract. For each derivatization, the mixtures were allowed to react for 3 h at 70 °C.
Finally, the extracts were analyzed by gas chromatography (Agilent 6890N equipment) coupled with a selective mass detector (Agilent 5973 inert). The separation was performed with a non-polar capillary column of 30 m × 250 µm × 0.25 µm with 5% of methyl-phenylsiloxane. For each sample, 3 µL of the extract was supplied to the GC through an automated injector (Agilent 7683 equipment). The operating conditions of the GC/MS began with a temperature of 65 °C for the first 5 min, and then the temperature increased at a rate of 10 °C/min for 25 min until the GC oven reached a temperature of 300 °C. The quantification and identification of the organic compounds were based on comparisons with authentic standards, retention times, literature mass spectra, and fragmentation patterns using the HP ChemStation software. The full list of organic compounds to be identified and quantified is shown in Table 2. A scheme of the GC/MS methodology is shown in Figure 3, and more details of the extraction and chemical analysis process of the organic markers can be found elsewhere [33].

Source Profile Representativeness
Near-field source sampling has the benefit of providing an average source composition of evolved emissions from a target upwind source. As can be seen in other studies, representativeness can be achieved with a low number of samples by following a ground-based source-dominated approach for source profile characterization [28,29]. A well-known example of this is the use of roadway tunnels to characterize overall emissions from a vehicle fleet. This is particularly useful when the goal is not to characterize each individual vehicle but rather to obtain a representation of the fleet; however, a known drawback of this approach is that other upwind or background sources will be part of the sampled air masses. In addition, the sample will be a mix of emissions from the major source being targeted and a set of minor sources that occur at the same time and are difficult to eliminate under real-world sampling conditions. Thus, it is useful to know how well the derived emission profiles represent the target sources. To do so, in this study, we estimated diagnostic ratios to indicate the most probable origin of the emissions and similarity metrics to establish a degree of uniqueness. In addition, Table 3 summarizes the main organic markers used to distinguish between source profiles.
Table 3. Key organic markers for every source profile.

Emission Source Profile | Organic Marker
Restaurant charbroiled and grilled meat | Cholesterol and levoglucosan
Residential charbroiled meat | Cholesterol and levoglucosan
Supermarket charbroiled meat service | Cholesterol and levoglucosan
Gasoline-powered vehicles | High molecular weight PAHs and n-alkanes
Urban transport | Low molecular weight PAHs, n-alkanes and hopanes
Freight transport | Low molecular weight PAHs, n-alkanes and hopanes
Oil refinery | Hopanes and n-alkanes
Manufacturing industry | Hopanes and n-alkanes
Open waste burning | Levoglucosan, methoxyphenols and diterpenoids
Suburban area in daytime | Cis-pinonic acid (secondary organic marker)
Suburban area in nighttime | Cis-pinonic acid (secondary organic marker)
Urban construction site | Hopanes and n-alkanes

Carbon Number Indicators
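Carbon number indicators are useful to identify whether a given emission profile is from natural or anthropogenic sources. The carbon preference index (CPI) measures the relative abundance of the odd- and even-carbon homologues of a series of organic compounds within a sample. Based on several studies [34][35][36][37][38], the CPI for n-alkanes (odd to even ratio) was calculated as:
$$\mathrm{CPI}_{n\text{-alkanes}} = \frac{\sum C_{\mathrm{odd}}}{\sum C_{\mathrm{even}}} \qquad (1)$$
and that for n-alkanoic acids (even to odd ratio) as:
$$\mathrm{CPI}_{n\text{-alkanoic acids}} = \frac{\sum C_{\mathrm{even}}}{\sum C_{\mathrm{odd}}} \qquad (2)$$
where $\sum C_{\mathrm{odd}}$ and $\sum C_{\mathrm{even}}$ are the summed concentrations of the odd- and even-carbon-number homologues, respectively. For both n-alkanes and n-alkanoic acids, high values of CPI (> 5) indicate that they are emitted from natural sources (e.g., plant waxes), whereas values of CPI close to unity indicate that they are emitted from anthropogenic sources [38][39][40][41][42][43][44].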
Another useful indicator of the origin of the emissions is the carbon number of the most abundant homologue in the series (Cmax). For n-alkanes and n-alkanoic acids, high molecular weight homologues (>C25 and >C20, respectively) are emitted from biogenic sources, while low molecular weight homologues (≤C25 and ≤C20, respectively) are mainly emitted from fossil fuel combustion processes and meat cooking operations [37,39,41,45]. The information provided by these indicators is valuable as it gives an initial insight into how well the sampling process was able to capture the target emission sources.
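The two carbon number indicators can be computed as in the following minimal sketch. It assumes the CPI is a plain ratio of summed odd and even homologue concentrations and that Cmax is the carbon number of the most abundant homologue; the function names and the example concentrations are hypothetical and are not data from this study.

```python
def cpi(conc, numerator="odd"):
    """Carbon preference index from a {carbon number: concentration} mapping.

    numerator="odd"  gives the odd-to-even ratio (used here for n-alkanes);
    numerator="even" gives the even-to-odd ratio (used here for n-alkanoic acids).
    """
    odd = sum(v for c, v in conc.items() if c % 2 == 1)
    even = sum(v for c, v in conc.items() if c % 2 == 0)
    if numerator == "odd":
        return odd / even
    if numerator == "even":
        return even / odd
    raise ValueError("numerator must be 'odd' or 'even'")


def c_max(conc):
    """Carbon number of the most abundant homologue in the series."""
    return max(conc, key=conc.get)


# Hypothetical n-alkane concentrations (ng/mg), keyed by carbon number.
alkanes = {19: 1.0, 20: 1.0, 21: 2.0, 22: 1.0, 23: 3.0, 24: 2.0, 25: 2.0, 26: 1.0}
print(cpi(alkanes), c_max(alkanes))
```

For this made-up series the odd homologues sum to 8 and the even ones to 5, so the CPI is 1.6 and Cmax is C23, a pattern that the criteria above would read as predominantly anthropogenic.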

Test of Similarity between Organic Source Profiles
Similarities between different organic source profiles can be identified using the Pearson distance (PD) and the similarity identity distance (SID) in accordance with Belis and Pernigotti [59]. PD is equal to 1 − r², where r² is the squared Pearson correlation coefficient, and SID is defined by Equation (3) [60]:
$$\mathrm{SID} = \frac{\sqrt{2}}{m} \sum_{j=1}^{m} \frac{\left| x_j - y_j \right|}{x_j + y_j} \qquad (3)$$
where x_j and y_j are the relative masses of organic marker j to the PM2.5 of two different organic source profiles (x and y), and m is the number of common organic compounds between these profiles. These two metrics (PD and SID) compare two profiles based on their common chemical relative mass composition. PD is highly sensitive to variations in the major components of PM2.5, while SID is evenly sensitive to all components [60]. PD < 0.4 and SID < 0.8 are acceptable criteria for profile similarity [60]. Several researchers have successfully used this approach to assess the similarity among source emission profiles [59,61].
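A minimal sketch of the two similarity metrics follows. It assumes PD = 1 − r² and the form SID = (√2/m) Σ |x_j − y_j| / (x_j + y_j) over the m species common to both profiles; the function names, species labels, and values are hypothetical, and species whose relative masses sum to zero are simply skipped.

```python
import math


def _aligned(px, py):
    """Relative masses of the species present in both profiles (x + y > 0)."""
    keys = sorted(k for k in set(px) & set(py) if px[k] + py[k] > 0)
    return [px[k] for k in keys], [py[k] for k in keys]


def pearson_distance(px, py):
    """PD = 1 - r^2 computed over the common species."""
    x, y = _aligned(px, py)
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant profile has no defined correlation")
    return 1.0 - sxy ** 2 / (sxx * syy)


def sid(px, py):
    """SID = (sqrt(2)/m) * sum(|x_j - y_j| / (x_j + y_j)) over common species."""
    x, y = _aligned(px, py)
    return math.sqrt(2) / len(x) * sum(abs(a - b) / (a + b) for a, b in zip(x, y))


def is_similar(px, py, pd_max=0.4, sid_max=0.8):
    """Apply the PD < 0.4 and SID < 0.8 similarity criteria quoted in the text."""
    return pearson_distance(px, py) < pd_max and sid(px, py) < sid_max


# Hypothetical relative masses of three markers in two profiles.
profile_a = {"LEV": 1.0, "CHOL": 2.0, "HOP5": 3.0}
profile_b = {"LEV": 2.0, "CHOL": 4.0, "HOP5": 6.0}
print(pearson_distance(profile_a, profile_b), sid(profile_a, profile_b))
```

The example shows why the two metrics are used together: the second profile is an exact multiple of the first, so PD is zero, while SID still registers the difference in absolute relative masses.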
To complement this statistical analysis, a coefficient of divergence (COD) analysis was conducted. The COD has been used as a relative measure of homogeneity or spatial variability in the concentration fields of any pollutant [62], and to find similarities between source profiles [58]. The COD is defined by Equation (4):
$$\mathrm{COD}_{jk} = \sqrt{\frac{1}{p} \sum_{i=1}^{p} \left( \frac{z_{ij} - z_{ik}}{z_{ij} + z_{ik}} \right)^{2}} \qquad (4)$$
where z_ij and z_ik are in this case the average mass concentration or fraction of chemical species i for source emission profiles j and k, and p is the number of chemical species. COD values near unity indicate that profiles j and k are different, whereas values approaching zero indicate that profiles j and k are similar.
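The COD can be sketched in a few lines, assuming the usual form COD_jk = sqrt((1/p) Σ ((z_ij − z_ik)/(z_ij + z_ik))²). The function name and the example values are hypothetical; species missing from either profile, or with a zero sum, are left out of the calculation.

```python
import math


def cod(zj, zk):
    """Coefficient of divergence between two {species: value} profiles."""
    species = [s for s in zj if s in zk and zj[s] + zk[s] > 0]
    p = len(species)
    if p == 0:
        raise ValueError("profiles share no usable species")
    total = sum(((zj[s] - zk[s]) / (zj[s] + zk[s])) ** 2 for s in species)
    return math.sqrt(total / p)


# Hypothetical profiles: identical ones give 0, fully disjoint ones give 1.
same = cod({"A": 1.0, "B": 2.0}, {"A": 1.0, "B": 2.0})
disjoint = cod({"A": 1.0, "B": 0.0}, {"A": 0.0, "B": 1.0})
print(same, disjoint)
```

The two limiting cases match the interpretation given above: zero for identical profiles and unity for profiles with no species in common at non-zero levels.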

General Results
Based on the detailed organic characterization for each sample obtained, 12 average source profiles were derived (Table S1 of the Supplementary Material). The total extractable organic matter (EOM) accounted for 6% to 32% of the OC (Figure 4). The distribution of the EOM among compound families varied with the type of emission source. The n-alkanes contribution to the EOM varied from 0.5% to 1.0% for meat-cooking operations profiles, from 3.4% to 5.0% for diesel vehicle exhaust, and from 1.4% to 4.7% for industry profiles, and was about 47% for gasoline vehicle exhaust. The n-alkanes contribution to the total EOM was the highest in the gasoline-powered vehicles profile (TLL), with a contribution that was 9-13 times higher than that of the diesel-powered vehicle exhaust profiles (TP, CC). The hopanes comprised from not detectable to 1.3% of the total EOM; the TLL, CC and TP profiles again had the highest relative hopanes contributions to the EOM, with values of 0.2%, 0.3% and 1.3%, respectively. The hopanes contribution to the total EOM was also 4-5 times higher for the gasoline-powered vehicles than that observed for diesel-powered vehicles. The PAHs comprised from 0.1% to 1.5% of the EOM. As in the case of the hopanes, the contribution of the PAHs to the EOM was the highest in the gasoline vehicles profile and was 5-6 times higher than that of the diesel vehicles. The total n-alkanoic acids and the sugars (levoglucosan) were the major components among source profiles, accounting for 6.3% to 76% and 18% to 80% of the EOM, respectively, with large variations among profiles. Five methoxyphenols were quantified, comprising 2.5% of the EOM, while sterols (cholesterol) comprised 0.01% to 0.53% of the EOM.
The biomass/trash burning profile (QB) had the highest levoglucosan, methoxyphenols, and resin acids relative contribution to the EOM, while the meat cooking operations profiles had the highest sterols (cholesterol) relative contribution to the EOM among all profiles (>0.1%).
The profiles were further processed to obtain the corresponding diagnostic ratios (Table 5). The profiles and diagnostic ratios are discussed in the following sections by major chemical families.

n-alkanes and Hopanes
As can be observed in Figure 5a, the highest n-alkanes levels were exhibited on average by the vehicle exhaust profiles (TLL, CC, TP) followed by industry emission profiles (PSN, PMX), which ranged from 1400 to 6700 ng/mg and 880 to 1070 ng/mg, respectively. This is consistent with the fact that n-alkanes are derived from petroleum lubricants and fossil fuel combustion processes [15,[63][64][65]. Within the vehicle exhaust profiles, the n-alkanes levels for the gasoline-powered vehicle profile (TLL) were two and four times higher than those from diesel-powered vehicles of urban (TP) and freight (CC) transport, respectively. For the manufacturing industry profile (PSN), the n-alkanes mass levels were 20% higher than those of the oil-refinery profile (PMX). This could suggest that manufacturing processes are highly dominated by fossil fuel combustion processes. The meat cooking operations profiles exhibited the lowest levels of n-alkanes, while the background profiles had n-alkanes levels that were on average 20% lower than the industry profiles.
To assess whether the organic source profiles were anthropogenic or biogenic in origin, a CPI was calculated for C19 to C33 for each profile (Table 4). The n-alkanes had CPI values ranging from 1.0 to 1.7. Overall, the profiles exhibited values close to unity, indicating a dominance of anthropogenic sources [41,42]. The restaurant charbroiled grilled meat (CAR) and residential charbroiled meat (CAF) profiles had the highest values (1.7 each), consistent with the origin of the fuel and suggesting a slight contribution from natural vegetation waxes [45,66], which consist of long chain plant lipids (>C20), mainly n-alkanes, n-alkanoic acids, and n-alkanols [67,68]. Similarities in the carbon contributions of the vehicle, industry, and background profiles can be observed in Figure 5b, along with differences between the meat cooking operations and all other profiles. CPI values reported by other researchers were 1.0 [69], 1.1 [69], 1.5 [70] and 2 [58] for gasoline, diesel, wood burning and cooking operations, respectively. All CPI values are close to unity, although small differences exist depending on the particular conditions.
High molecular weight n-alkanes (HMW; >C25) primarily originate from plant waxes, for which C27, C29, C31, and C33 are major species [71]. Lower molecular weight n-alkanes (LMW; ≤C25) are usually related to anthropogenic activities, such as fossil fuel combustion [42]. The Cmax for n-alkanes varied among source profiles (Table 4). For the restaurant charbroiled grilled meat (CAR), residential charbroiled meat (CAF), open-waste burning (QB) and suburban background (RTD and RTN) profiles, the Cmax values ranged from C27 to C29; this indicates that these profiles are influenced by biomass burning processes, including some plant waxes, which can emit HMW n-alkanes [70,72]. For all other developed source profiles, the Cmax was dominated by a LMW n-alkanes series, suggesting anthropogenic emission sources. For instance, C24, C25, and C28 were the Cmax for TLL, CC, and TP, respectively; this is different from Cai et al. [69], who found that C17 was the Cmax for both gasoline and diesel motor vehicles.
Similarly, the hopanes were detectable in the vehicle exhaust and industry emissions profiles, with levels that ranged from 80 to 240 ng/mg and from 31 to 130 ng/mg, respectively (Figure 6a). Hopanes, which are markers of petroleum components, are found in lubricant oils employed in motor vehicles and petrochemical industries [15,65]. The total hopanes levels in this study were 83% (TLL) and 66% (TP) lower than those for gasoline- and diesel-powered motor vehicles reported by Cai et al. [69], respectively.
The vehicle profiles showed a full distribution of the hopanes series (HOP1 to HOP9 as defined in Table 2); the hopane with the highest mass contribution was HOP5 (Figure 6b), which has been used as a tracer for vehicle emissions [73]. For the industrial emissions, the PMX profile contained hopanes in the series HOP1 to HOP5, whereas the PSN profile only exhibited HOP1 (Figure 4); the emissions for PMX were 4.3 times higher than those from the manufacturing industries. Other source profiles had low levels of hopanes. The observed levels in the CAF profile can be associated with the lubricant oils used to burn the charcoal, while those in the RTN and RTD profiles could be associated with the motor vehicle exhausts from suburban highways and roadways near the site.

Polycyclic Aromatic Hydrocarbons (PAHs)
PAHs are semi-volatile and ubiquitous organic compounds formed by the incomplete combustion of organic material. Biogenic sources include forest fires [74,75], and anthropogenic sources comprise emissions from industries, vehicle exhausts, and the combustion of fossil and biomass fuels [46,64,76]. In addition, some are well-known human carcinogens [77], and they can be classified as low molecular weight (LMW; ≤4 rings) or high molecular weight (HMW; >4 rings) PAHs. The levels of 13 measured PAHs for each organic source profile are shown in Figure 7a. Retene (RET), an organic marker of conifer wood combustion [78], was detected only in the open waste burning (QB) profile, whereas all other PAHs were detected in several profiles (Figure 7b). The individual PAHs ranged from 0.02 to 318 ng/mg, with chrysene (CHR) having the highest levels among profiles. The total PAHs levels (∑PAHs) ranged from 1.3 to 522 ng/mg; the suburban profiles showed the lowest ∑PAHs, while the open waste burning and meat cooking operations profiles had the highest levels.
For the gasoline-powered vehicle profile, the LMW-∑PAHs were 25% lower than HMW-∑PAHs. Conversely, for the diesel-powered vehicle profiles, LMW-∑PAHs were 1.4-2.3 times higher than HMW-∑PAHs. This is characteristic in differentiating between gasoline-from diesel-powered vehicles [52,69,79]. For both, gasoline-and diesel-powered vehicle profiles, the major component for LMW-∑PAHs was chrysene, while for HMW-∑PAHs, it was coronene. The latter is partially different from that found by other studies in which pyrene and coronene were reported as the major components for gasoline-powered vehicles [69,80], while for diesel-powered vehicles, the major components that were reported are pyrene and benzo(ghi)fluoranthene [69,81].
The meat cooking operations and open-waste burning profiles were dominated by LMW-∑PAHs, which were 3, 6, 26, and 7 times higher than those of HMW-∑PAHs in CAR, CAF, CAS, and QB, respectively. The CAF, CAS, and QB profiles had chrysene as the major component, while for the CAR profile, fluoranthene and pyrene were major components. This suggests that CAF, CAS, and QB operations utilized charcoal and organic matter derived from hardwoods [70], while CAR operations can be associated with operations that use seed oils for cooking food [79] and natural gas for heating [82].
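The LMW/HMW comparison used above can be sketched in a few lines of Python. This is only an illustration: the ring counts are standard structural facts, but the abbreviations BaP and COR, the function name, and the example levels are assumptions introduced here and are not measured values from this study.

```python
# Number of fused rings per PAH (structural facts).
# FLT, PYR, BaA, CHR, IPY, BPE, and RET follow the abbreviations used in the text;
# BaP (benzo[a]pyrene) and COR (coronene) are labels assumed for this sketch.
RING_COUNT = {
    "RET": 3, "FLT": 4, "PYR": 4, "BaA": 4, "CHR": 4,
    "BaP": 5, "IPY": 6, "BPE": 6, "COR": 7,
}

def lmw_hmw_split(profile):
    """Return (LMW sum, HMW sum, LMW/HMW ratio) for PAH levels in ng/mg.

    LMW means <= 4 rings and HMW means > 4 rings, as defined in the text.
    """
    lmw = sum(v for k, v in profile.items() if RING_COUNT[k] <= 4)
    hmw = sum(v for k, v in profile.items() if RING_COUNT[k] > 4)
    ratio = lmw / hmw if hmw > 0 else float("inf")
    return lmw, hmw, ratio

# Hypothetical levels (ng/mg), chosen only to illustrate the arithmetic.
example = {"FLT": 10.0, "PYR": 8.0, "CHR": 12.0, "BaP": 3.0, "BPE": 4.0, "COR": 3.0}
lmw, hmw, ratio = lmw_hmw_split(example)
print(f"LMW = {lmw} ng/mg, HMW = {hmw} ng/mg, LMW/HMW = {ratio}")
```

A ratio above 1 corresponds to an LMW-dominated profile (as reported here for the diesel, meat cooking, and open waste burning profiles), and a ratio below 1 to an HMW-dominated profile.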
Characteristic diagnostic ratios for PAHs are shown in Table 4. The FLT/(FLT+PYR) values ranged from 0.50 to 1.0, the IPY/(IPY+BPE) values from 0.19 to 0.60, and the BaA/(BaA+CHR) values from 0.04 to 1.0. These ratios can be used to suggest the source origin of the organic profiles. The values of these ratios for meat cooking operations (CAF, CAR, and CAS) and vehicle exhaust (TLL, CC, and TP) indicate pyrogenic sources, such as charcoal combustion and fossil fuel combustion, respectively. For the industry emissions (PMX and PSN), the ratios could be associated with the presence of petcoke processes (pyrogenic source) within the oil-refinery profile, whereas for the manufacturing industry, they indicate a dominance of fuel combustion processes. The latter is in line with the low levels of hopanes exhibited by the manufacturing industry compared with oil-refinery emissions. For open waste burning (QB), the ratios indicate grass and wood combustion. Finally, for suburban profiles, the values of these ratios suggest a mixture of biomass burning, fossil fuel, and fuel oil combustion processes based on the results obtained in other studies [43,48,53,56]. For instance, Chen et al. [49] reported BaA/(BaA+CHR) ratios of 0.22-0.55 and 0.38-0.65 for gasoline and diesel motor vehicles, respectively, while Alves et al. [50] reported FLT/(FLT+PYR), IPY/(IPY+BPE), and BaA/(BaA+CHR) values of 0.23, 0.37, and 0.75, respectively. In this study, the corresponding ratios were different from those reported in these studies, suggesting differences in traffic fleet characteristics (e.g., age, car model, maintenance, fuel quality) among regions.
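As an illustration of how the three diagnostic ratios are formed, the following minimal Python sketch computes them from a dictionary of PAH levels. The function names and the example concentrations are assumptions made for this sketch; the measured ratios are those in Table 4.

```python
def diagnostic_ratio(a, b):
    """Return a / (a + b), or None when neither compound was detected."""
    total = a + b
    return a / total if total > 0 else None

def pah_ratios(profile):
    """Compute the three diagnostic ratios discussed in the text.

    `profile` maps PAH abbreviations to levels in ng/mg; a missing
    compound is treated as not detected (0.0), an assumption of this sketch.
    """
    get = lambda name: profile.get(name, 0.0)
    return {
        "FLT/(FLT+PYR)": diagnostic_ratio(get("FLT"), get("PYR")),
        "IPY/(IPY+BPE)": diagnostic_ratio(get("IPY"), get("BPE")),
        "BaA/(BaA+CHR)": diagnostic_ratio(get("BaA"), get("CHR")),
    }

# Hypothetical levels (ng/mg), not values from this study.
example = {"FLT": 6.0, "PYR": 4.0, "IPY": 1.0, "BPE": 3.0, "BaA": 1.0, "CHR": 4.0}
for name, value in pah_ratios(example).items():
    print(name, round(value, 2))
```

Note that a ratio equals 1.0 whenever the second compound of the pair is absent, so values at the upper end of a range may reflect a non-detected compound rather than a distinct source signature.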

n-alkanoic Acids
The n-alkanoic acids are ubiquitous, and they are commonly used as complementary organic markers. Overall, the source profiles in this study are acid-enriched. The total levels of these acids ranged from 4156 ng/mg to 75,311 ng/mg (Figure 8a). Most of the relative contribution of n-alkanoic acids comes from the C14-C18 acids (Figure 8b), suggesting an attribution to anthropogenic sources [83]. The CPI for n-alkanoic acids ranged from 2.7 to 17, and the Cmax for n-alkanoic acids was in the range from C14 to C18, indicating an anthropogenic origin [83]. For some organic source profiles, such as the meat cooking operations and environmental background profiles, a low contribution of n-alkanoic acids > C20 indicated a contribution from plant waxes [83]. Another marker detected, particularly in the meat cooking operations profiles, was oleic acid, which is related to the oils used in cooking; the concentrations of this acid were 84-98% lower than the levels reported by Zhao et al. [84] and Schauer et al. [85]. These low levels suggest the use of low-quality oils given that oleic acid is found in natural oils of high quality (e.g., olive oil, avocado oil).
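The CPI and Cmax values quoted above can be illustrated with a short Python sketch. It assumes the common convention for n-alkanoic acids, in which the CPI is the sum of the even-carbon homologues divided by the sum of the odd-carbon homologues; the carbon range actually used in this study is not stated in this section, and the example levels are hypothetical.

```python
def cpi_alkanoic(levels):
    """Carbon preference index for n-alkanoic acids (assumed even/odd form).

    `levels` maps carbon number -> concentration (ng/mg).
    """
    even = sum(v for c, v in levels.items() if c % 2 == 0)
    odd = sum(v for c, v in levels.items() if c % 2 == 1)
    if odd == 0:
        raise ValueError("CPI is undefined without odd-carbon homologues")
    return even / odd

def c_max(levels):
    """Return the carbon number of the most abundant homologue (Cmax)."""
    return max(levels, key=levels.get)

# Hypothetical homologue levels (ng/mg), not values from this study.
example = {14: 300.0, 15: 100.0, 16: 900.0, 17: 100.0, 18: 500.0}
print("CPI =", cpi_alkanoic(example))
print("Cmax = C%d" % c_max(example))
```

Under this convention, a strong even-carbon predominance gives a CPI well above 1, and the position of Cmax is then used together with the CPI to interpret the origin of the acids.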

Levoglucosan and Cholesterol
The major organic marker for smoke particles from biomass burning is levoglucosan, which is derived from the thermal decomposition of cellulose and lignin [70,[86][87][88][89][90]. For steroids, cholesterol has been used as a good marker for meat-cooking operations, specifically for particles emitted directly from meat when being cooked [85,91,92].
The average abundances for levoglucosan and cholesterol can be observed in Figure 9. Their levels ranged from 2920 to 130,000 ng/mg for levoglucosan and from 3.7 to 81 ng/mg for cholesterol. The open waste burning profile (QB) is clearly dominated by levoglucosan, whose levels are 3 to 45 times higher than those from the other profiles. In addition, the levoglucosan concentration was 7.5 and 94 times higher than the concentrations reported by Wang et al. [93] and Schauer et al. [70], respectively. These differences can be attributed to the uniqueness of the profiles. For example, for the QB profile, the biomass burning was an uncontrolled open fire with a combination of different waste materials, whereas the profiles of both Wang et al. [93] and Schauer et al. [70] were obtained from a controlled domestic fireplace burning only wood. For the meat cooking operations profiles, cholesterol was the major component, with levels 4 to 20 times greater than those from the other profiles. In these profiles, the relatively low levels of levoglucosan came from the combustion of charcoal [94], while the cholesterol came from meat cooking [85]. The levoglucosan levels for CAS and CAF were 3 and 2 times higher than those in the CAR profile, respectively; this could be associated with the fact that restaurants use less charcoal and also use natural gas for cooking operations. The cholesterol level for the CAR profile was 21% lower than in the CAS and CAF profiles; this could suggest that restaurants cook not only meat but also other meals with a low content of cholesterol. The cholesterol concentrations in this study were ~90% and ~98% lower than those reported by Zhao et al. [83] and Schauer et al. [85], respectively. Finally, the CC, TP, and PMX profiles exhibited a presence of levoglucosan, suggesting a slight influence from background biomass burning emissions.

Methoxyphenols
Lignin is a key organic polymer present in vascular plants, particularly in wood and bark. It produces phenolic markers when these biomasses are burnt [66,[95][96][97]. The open waste burning profile (QB) was the only profile in which methoxyphenols were detectable, with a total level of 4115 ng/mg. Five methoxyphenols were quantified. Vanillin, isovanillin, and guaiacol constituted 25% of the total methoxyphenols identified, suggesting the combustion of softwood, such as pine wood or other conifers, while 4-methylsyringol and syringaldehyde constituted the remaining 75%, indicating the combustion of hardwoods [98,99].

Resin Acids (Diterpenoids)
Resin acids are produced by some wood species (e.g., conifers), and they can be released when the wood is burned or degraded [100]. These organic markers are typically used to distinguish between softwood and hardwood combustion [70,101]. Pimaric and isopimaric acids are natural product resin acids along with abietic acid, whereas dehydroabietic acid and retene are secondary organic markers. All are used as organic markers for softwood burning [100,102]. These organic compounds occurred in the open waste burning profile (QB) with a contribution of 16,467 ng/mg, comprising 10% of the EOM. The presence of these organic markers suggests that most of the smoke particles in this profile were from softwood burning. In addition, the relatively high level of dehydroabietic acid (4497 ng/mg) compared with that of abietic acid (6906 ng/mg) indicates that smoke particles are rapidly degraded. The abietic, dehydroabietic, pimaric, and isopimaric acid levels in this study were 37, 31, 189, and 19 times higher, respectively, than the levels reported by Schauer et al. [70].

Secondary Organic Markers
The presence of cis-pinonic acid in the manufacturing emissions (PSN), suburban area (RTD and RTN), and urban construction (RP) profiles suggests the presence of SOA in the samples [103]. The cis-pinonic acid is involved in SOA formation from α-pinene and ozone photoreaction [104,105]. For the RTD and RTN profiles, the presence of cis-pinonic acid could be associated with a transport effect because the samples were collected while there was no anthropogenic activity on site. The SOA contribution appears to be higher at nighttime (the cis-pinonic acid contribution was twice that found in the daytime samples). This could be explained by a transport effect enhanced by a lower mixing height at night, as was found in a previous study by Mancilla et al. [20]. For the PSN, PMX, and RP profiles, the influence of SOA can be explained by the fact that the sampling was carried out in an open area; therefore, background ambient air masses could contribute to SOA [83,106].

Source Profile Similarity Analysis
All organic source profiles showed distinguishable organic markers. The meat cooking operations were distinguished by cholesterol and levoglucosan. Cholesterol was a characteristic organic marker for emissions from the meat itself when cooked, and levoglucosan for the smoke particles from charcoal combustion. Vehicle exhausts had hopanes (HOP1 to HOP9) as characteristic organic markers. In addition, heavy-duty diesel truck emissions (urban and freight transport) exhibited higher levels of hopanes and LMW-∑PAHs than gasoline motor vehicles. The open waste burning was dominated by methoxyphenols and levoglucosan, whereas the urban background was influenced by an SOA precursor (α-pinene), as indicated by the presence of cis-pinonic acid. These differences were supported by the results from the similarity tests between the source profiles in this study, which can be observed in Figure 10.
The profiles were compared using both the PD and SID similarity parameters. The average source profiles presented low PD but high SID, so the overall values fell outside the similarity acceptance box (PD < 0.4 and SID < 0.8), indicating that the source profiles of this study differ from one another. For all profiles, the variability in the PD-SID space did not reach the acceptance box, which suggests that the organic source profiles may be different even though most organic compounds are emitted from multiple types of sources, as is expected when the molecular organic marker approach is used to distinguish between source profiles [14]. It is important to note that enrichment in some organic compounds contributes to the distinction among source profiles of the same category. A similar behavior was observed in the COD analysis. The COD values ranged from 0.63 to 0.72, suggesting a slight dissimilarity among source profiles (values tend to be closer to 1 than to 0) [58].
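The similarity test can be illustrated with a minimal Python sketch. The exact PD, SID, and COD definitions used in this study are not reproduced in this section, so the code assumes commonly used forms: PD = 1 - r (some authors use 1 - r² instead), SID = (√2/m) Σ |x_j - y_j|/(x_j + y_j), and COD = sqrt((1/m) Σ ((x_j - y_j)/(x_j + y_j))²), where x_j and y_j are the levels of compound j in the two profiles and m is the number of compounds present in at least one of them. The function names and the example profiles are hypothetical.

```python
import math

def pearson_distance(x, y):
    """PD = 1 - r, with r the Pearson correlation (assumed definition)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("PD is undefined for a constant profile")
    return 1.0 - sxy / math.sqrt(sxx * syy)

def _shared_pairs(x, y):
    # Skip compounds absent from both profiles to avoid dividing by zero.
    return [(a, b) for a, b in zip(x, y) if a + b > 0]

def standardized_identity_distance(x, y):
    """SID under the assumed form; ranges from 0 to sqrt(2)."""
    pairs = _shared_pairs(x, y)
    return math.sqrt(2) / len(pairs) * sum(abs(a - b) / (a + b) for a, b in pairs)

def coefficient_of_divergence(x, y):
    """COD under the assumed form; 0 = identical, 1 = fully divergent."""
    pairs = _shared_pairs(x, y)
    return math.sqrt(sum(((a - b) / (a + b)) ** 2 for a, b in pairs) / len(pairs))

def in_acceptance_box(pd, sid, pd_max=0.4, sid_max=0.8):
    """Similarity criterion quoted in the text: PD < 0.4 and SID < 0.8."""
    return pd < pd_max and sid < sid_max

# Hypothetical profiles (ng/mg) over the same ordered list of compounds.
a = [120.0, 40.0, 5.0, 0.0, 15.0]
b = [100.0, 2.0, 60.0, 8.0, 1.0]
pd_ab = pearson_distance(a, b)
sid_ab = standardized_identity_distance(a, b)
print(pd_ab, sid_ab, coefficient_of_divergence(a, b), in_acceptance_box(pd_ab, sid_ab))
```

Two profiles count as similar only when both criteria hold, which is why a low PD combined with a high SID, as reported here, still places a pair outside the acceptance box.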
For individual organic compound quantification, an uncertainty of ±20% of the measured concentration was used for all source profiles [13,24,[107][108][109]. In addition, an error propagation technique was followed to estimate the final uncertainties [109]. The estimated final uncertainties for each source profile can be observed in Table S2 of the Supplementary Material.
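A minimal Python sketch of this kind of uncertainty estimate follows. It assumes independent errors combined in quadrature, which is a standard propagation rule for sums but is not confirmed here as the exact procedure of [109]; the function names and the example concentrations are hypothetical.

```python
import math

REL_UNCERTAINTY = 0.20  # +/-20% of the measured concentration, as stated in the text

def compound_uncertainty(conc, rel=REL_UNCERTAINTY):
    """Absolute uncertainty (ng/mg) of a single compound."""
    return rel * conc

def sum_uncertainty(concs, rel=REL_UNCERTAINTY):
    """Uncertainty of a sum of compounds (e.g., total PAHs).

    Assumes independent errors, so the individual uncertainties
    add in quadrature (root sum of squares).
    """
    return math.sqrt(sum(compound_uncertainty(c, rel) ** 2 for c in concs))

# Hypothetical concentrations (ng/mg), not values from this study.
levels = [30.0, 40.0]
total = sum(levels)
print(f"total = {total:.1f} +/- {sum_uncertainty(levels):.1f} ng/mg")
```

Under this assumption, the relative uncertainty of a summed group is smaller than 20% whenever more than one compound contributes to the sum.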

Conclusions
The organic composition of PM 2.5 samples collected from 12 anthropogenic sources in the MMA was obtained, quantifying 72 organic compounds. Combining a ground-based source-dominated sampling approach with the molecular organic marker approach ensures representativeness with a low number of samples. The organic source profiles of these sources were clearly distinguishable, as determined by the PD, SID, and COD values for each pair of organic source profiles. The diesel and gasoline vehicle traffic profiles were dominated by LMW-PAHs and HMW-PAHs, respectively, which are considered carcinogenic and mutagenic organic compounds. There was a difference between the industrial parks and the oil refinery: the former are dominated by n-alkanes, indicating the use of fossil fuels in their processes, while the latter is dominated by hopanes. Meat charbroiling operation profiles were dominated by cholesterol and levoglucosan, while other cooking activities, not involving charcoal, showed very low levels of levoglucosan, highlighting the importance of fuel in cooking emissions. The urban background profiles in this study are a combination of secondary organic markers, n-alkanoic acids, and hopanes. The biomass burning profiles are dominated by methoxyphenols and resin acids; all fires derived from trash or forest burning represent a potential source of these organic compounds. The relevance of these profiles lies in their potential application to further source apportionment studies in Mexico, a region for which, to date, no local source profiles exist. These profiles can be used as receptor model input or for model output validation. The source profile data can also be used to validate emissions inventories derived from bottom-up approaches. Results from these applications can be used to outline strategies to reduce fine PM air pollution from specific emission sources as well as to quantify health impacts from specific air toxics.
Supplementary Materials: The following are available online at https://www.mdpi.com/article/10.3390/atmos12050554/s1, Table S1: organic composition of organic markers (ng/mg) for each source emission profile. Table S2: organic composition uncertainty of organic markers (ng/mg) for each source emission profile.

Conflicts of Interest:
The authors declare no conflict of interest.