Geochemical Background and Baseline Values Determination and Spatial Distribution of Heavy Metal Pollution in Soils of the Andes Mountain Range (Cajamarca-Huancavelica, Peru)

Concentrations of seven heavy metals (Cd, Cr, Cu, Hg, Ni, Pb, and Zn) and one metalloid (As), together with several soil parameters (pH, organic carbon, granulometric analysis, and cation exchange capacity), were analyzed in 77 soil samples collected in the mining areas of La Zanja and Colquirrumi (Department of Cajamarca) and Julcani (Department of Huancavelica). Our study proposes geochemical baseline values for heavy metals in a natural region (La Zanja) from samples collected during the environmental impact study (2006), that is, before exploitation of the current gold mine began. The baseline values obtained were as follows: 8.26 mg·kg−1 for Cr; 56.97 mg·kg−1 for Ni; 22.20 mg·kg−1 for Cu; 47.42 mg·kg−1 for Zn; 27.50 mg·kg−1 for As; 4.36 mg·kg−1 for Cd; 4.89 mg·kg−1 for Hg; and 44.87 mg·kg−1 for Pb. Several indices of heavy metal contamination (geo-accumulation index (Igeo), improved Nemerow index (IIN), and potential ecological risk index (RI)) were used to determine the degree of pollution caused by mining activities in two areas, Colquirrumi and Julcani, which have a high density of mining sites in operation. The values obtained from these indices indicated that the Colquirrumi region was the most contaminated, followed by Julcani. The area of La Zanja, despite being free of mining operations, presented slight diffuse pollution. Several positive correlations with a high level of significance were obtained between pH, organic carbon content, cation exchange capacity, and the Cr, Pb, and Ni concentrations of the soils. The spatial distribution of the heavy metals was mapped by ordinary kriging interpolation.
The results obtained and the experience gained in this work were necessary to facilitate the identification of soil contamination processes in high altitude areas of the Andes Western Cordillera (Peru) as a basis for taking appropriate measures when restoring soils, during mine closure processes, and to protect the quality of soil resources.


Introduction
Heavy metals in soils have been identified as essential components of the environment and the food chain and as important factors in human health [1][2][3][4][5][6]. The term geochemical baseline (officially presented in 1993 under the International Geological Correlation Program as the Global Geochemical Baselines) refers to the natural variation in the concentration of an element in the surface environment at a given place and time. This concept includes natural geogenic concentrations (background level) and the diffuse anthropogenic contribution in soils.
Background level is a measure that is used to differentiate between the concentration of the natural compound and the concentrations with an anthropogenic influence in a given environmental sample [7,8]. The concentrations of heavy metals in the natural soil background depend on the geological substrates and the processes that form the soils [9,10]. Rocks have a large influence on the content of heavy metals in soils, with concentrations sometimes above critical values [11]. However, it is almost impossible to establish natural background levels, i.e., the geochemical composition of virgin soils, since atmospheric deposition can contaminate soils with certain trace elements [12,13]. The calculation of the geochemical baseline is therefore more useful, since it represents conditions where a certain human impact on the environment already exists [14][15][16]. The soil compartment receives significant amounts of contaminants from different sources each year. Therefore, this compartment acts as a sink for a wide variety of emissions comprising several heavy metals, some of which are toxic. Sources of heavy metals in the environment include aerial deposition of particles emitted by different human activities, including mining.
The calculation of environmental geochemical baselines is necessary to assess the current state of the environment and to provide guidelines and quality standards in environmental legislation and policy-making, especially in the evaluation of contaminated soils and in environmental risk assessment [17]. Numerous studies have been carried out in different regions worldwide to estimate the geochemical background concentration and the baseline of heavy metals, and there are two main methods used to estimate the background level: direct (geochemical), and indirect (statistical). In this study, we followed the direct method, which uses samples not affected by industrial or mining activities, or samples from relatively pristine sites. This method generally uses simple statistical values, such as the median or the mean, to estimate background concentrations [18,19]. A geochemical baseline should be determined separately for each heavy metal in geologically different regions, otherwise the limit values for contaminated soils may be lower than the natural concentrations (background levels) calculated for an extensive area [20]. As a result, the soil trace element content is very variable, which makes the use of normative values of environmental legislation of other countries or regions inappropriate, as they must be determined locally. Currently, published studies that have determined the concentrations and patterns of spatial distribution of heavy metals in soils located at high altitudes in the Peruvian Andes are extremely limited.
The analysis of the environmental impacts of a mining activity is made through the difference between the situation of the environment before the activity was conducted, and after the development and cessation of mining activity. The investigation of the baseline of a territory represents a measure of the geochemical variations of its surface formations (rocks and soils) and is considered of great interest, not only from a scientific and mining point of view, but also constitutes a very important tool for environmental planning, environmental health, and sustainable development policies worldwide [21]. If an exploration campaign is successful (i.e., the location of an economic mineralized body has been discovered), at that time, the mining company must begin an investigation of the environmental baseline. This allows the development of a frame of reference to be able to properly control the environmental changes generated during and after the mining activity. To do this, baseline research has to be conducted before the activity in question has significantly affected the environment. This was the case of our study of the mining project of La Zanja (Cajamarca), which was carried out in 2006, during the period of the Environmental Impact Study and prior to the beginning of mineral extraction.
The first step in a baseline study is to define the toxicity thresholds for the different pollutants. This requires calculating the normal values of potentially polluting substances present in natural soils without human influence (the background level), which correspond to the normal value of an element in a given environment [22,23]. The calculation of the background level is often a complicated task, since soils without any type of contamination are almost impossible to find due to long-distance atmospheric deposition of trace elements and human activity [19]. Given these difficulties, the baseline values should show an average value and a range of concentrations of heavy metals for a specific area and at a specific time, and should also consider the diffuse entry of these elements into soils [24]. The baseline value should correspond to the arithmetic or geometric mean plus twice the standard deviation for each element studied [25]. Once the geochemical baseline values of a region have been identified, soil quality standards can be established (e.g., reference and intervention levels). The current methodology for assessing environmental quality through the content of metalloids and heavy metals in the surface horizons of soils includes the calculation of several pollution indices, such as the geo-accumulation index (Igeo), the potential ecological risk index (RI), and the improved Nemerow index (IIN).
In this study, a geostatistical approach was adopted to calculate the spatial distribution of the geochemical anomalies of this region of the Peruvian Andes, specifically, the spatial distribution of the concentrations of seven heavy metals and one metalloid (Cu, Zn, Pb, Cr, Cd, Ni, Hg, and As) in soils using the Inverse Distance Weighting (IDW) and ordinary kriging interpolation methods [26]. The spatial variability of heavy metal concentrations in soils provides basic information used to identify possible sources of contamination [27], to control and evaluate environmental risks [28], and to outline remediation strategies for the site. Because extensive and repeated soil sampling and analysis is impractical on cost grounds, mapping the spatial distribution of soil contamination requires spatial interpolation methods; consequently, interpolation techniques such as kriging are widely used in the investigation of contaminated soils [29][30][31].
Studies of soil contamination by heavy metals need to focus on identifying areas with a high risk of contamination. Samples from these types of areas usually have atypical spatial values at the local level [32]. The fundamental pollution problems (natural and/or anthropic) in the watersheds studied in Cajamarca and Huancavelica (Peru) are mainly derived from the removal of heavy metals from alteration zones and mining operations located at high altitude. During the rainy season, these materials go directly to the lower agricultural valleys and to the main river flows of this region. Therefore, it was considered necessary to conduct a heavy metal investigation, both in the soils and in the surface waters of these mining areas to provide baseline information on the impact of anthropogenic environmental pollution. Furthermore, the generation of acidic waters has been recognized as an important problem of environmental pollution in the last three decades [33].
The main objectives of this study were: (1) to analyze the concentrations of eight heavy metals (Cr, Ni, Zn, Cu, Pb, As, Cd, and Hg) to calculate the background level in three mining areas of the provinces of Cajamarca and Huancavelica (La Zanja, Colquirrumi, and Julcani) and to calculate the geochemical baseline in La Zanja; (2) for purposes of comparison, to calculate several indices of heavy metal contamination (Igeo, IIN, and RI) in the surface horizons of soils; (3) to obtain geochemical maps revealing the degree of contamination of soils by heavy metals; and (4) to evaluate the correlations between heavy metal concentrations and soil properties (pH, organic carbon, cation exchange capacity, etc.). This environmental research study aimed to understand the influence of mining activity on the environmental contamination of high altitude areas within the provinces of Cajamarca (3200-3900 m) and Huancavelica (4100-4400 m) in the Western Cordillera de Los Andes (Peru). The results of this study will contribute to the creation of an environmental database of the soils and waters of the Cajamarca and Huancavelica regions (Peru), which will provide the authorities with monitoring data on the pollution caused by mining activities and will assist in the development of appropriate management strategies to control heavy metal contamination of soils and waters.

Study Area
The Department of Cajamarca is in the extreme north of Peru, in the northern zone of the Western Cordillera de los Andes. It is bordered to the north by Ecuador, to the south by the Department of La Libertad, to the east by the Department of Amazonas, and to the west by the Departments of Piura and Lambayeque (Figure 1). The most important border of the Department of Cajamarca is marked towards the east by the basin of the Marañón River, which separates it from the Department of Amazonas. The Departments of Cajamarca and Huancavelica are located at high altitudes above sea level (between 3200-4400 m), with very rainy climates (between 800-1300 mm) and cold temperatures (mean annual temperature between 5-13 °C), with an evapotranspiration between 900-1140 mm. The climate is sub-humid, with seasonal rains and frequent periods of drought. Seasons of intense rain occur between the months of November and April. In the upper parts of the mountain range, temperatures drop below 0 °C.
From a geological perspective, the mining area of Colquirrumi-Sinchao consists of folded sedimentary rocks of Cretaceous age that have been intruded by stocks, dykes, and sills composed of diorite and granodiorite of Middle-Upper Miocene age. All these bodies are located within a NW-SE regional trend that includes the well-known mineral deposits of Colquirrumi, Hualgayoc, San Agustín, Shinchao, Constancia, Cerro San José, Cerro Corona, Minas Congas, Galeno, Michiquillay-Lambayeque, Mansita, Tres Cruces, etc.
The mineral deposits of the Hualgáyoc district are of different types: veins, mantles, bodies, and porphyries with disseminated mineralization, and stockworks, which contain complex mesothermal to epithermal ores of Ag-Zn-Pb (Colquirrumi) and Cu-Ag-Au-Zn (Sinchao). The Colquirrumi-Sinchao deposit is a replacement deposit hosted in limestones and sandstones, which are covered with volcanic materials. Towards the NE there is a stock of medium- to fine-grained diorite and quartz-diorite porphyries. The igneous activity continued with the emplacement of dacite domes and volcanic rhyolites and/or dacites that partially cover the other units. The final phase consists of rhyolitic dikes that cut the epithermal mineralization. The ore is composed of chalcopyrite, tetrahedrite, galena, and sphalerite, with pyrite, quartz, chalcedony, and iron oxides as gangue [34,35]. In Hualgayoc and Julcani, decades of past mining have left clearings, dumps, and other environmental liabilities that have not yet been remediated.
The La Zanja deposit is hosted in rocks of volcanoclastic origin, consisting of a sequence of tuffs and lavas of andesitic, dacitic, and rhyolitic nature, belonging to the Llama, Porculla, and Volcanic Huambo formations (Figure 2). The geological ages of these rocks range from the Upper Eocene to the Upper Miocene and Late Pliocene. Near the project area, there are also subvolcanic bodies associated with a volcanic-magmatic event contemporaneous with the pyroclastic deposits. Cretaceous sedimentary rocks belonging to the Goyllarisquizga group, strongly folded and faulted, are in an inward position and discordant to the previous sequences. In the volcanoclastic sequence, and influenced by the subvolcanic bodies, minerals of economic value (Au and Ag) have been identified, as in the cases of San Pedro Sur and Pampa Verde, which correspond to high-sulfidation epithermal processes. This type of deposit is characterized by a clearly zoned hydrothermal alteration, with silicification in the central part grading to argillic rocks (quartz-alunite-dickite, quartz-kaolinite, and illite-smectite-kaolinite-sericite) towards the edges.
The Julcani study area is underlain by a series of sedimentary rocks of Paleozoic-Mesozoic age including: (1) phyllites and sandstones of Lower Devonian age (Excelsior Group), which are strongly deformed into synclinal and anticlinal folds oriented NW; (2) a Permian sequence of conglomerates, sandstones, and shales known as red beds (Mitu Group), which overlies, with an erosional unconformity, the metamorphosed and folded rocks of the Excelsior Group; (3) limestones and sandstones of Triassic-Jurassic age (Pucará Group) that usually contain abundant chert; and (4) a volcanic sequence in the surroundings of Julcani composed of dacites to rhyolites in a cluster of volcanic centers oriented WNW-ESE (Julcani Formation), dated to 10.4 Ma (upper Miocene).
In Julcani, deposits of economic value have been identified (including Au, Ag, and polymetallic Cu and Pb) in mineralization located in and genetically related to the Julcani volcanic center, which comprises a cluster of dacitic volcanic centers oriented WNW-ESE, composed of pyroclastic rocks, lavas, endogenous domes, and dikes. It is a vein-type deposit of fracture fillings with metallic content of Ag, Pb, and Cu in the form of a system of irregular veins with bonanza bodies and gaps hosted in pyroclastic rocks of the Julcani Formation. Most deposits in this mining district lie within a volcanic sequence (Miocene), including the following mines: Herminia, Mimosa, Sacramento, Estela, Temtadora, Nuestra Señora del Carmen, Rita, Achillia, etc. Another mineralized zone lies in phyllites (Lower Devonian), where the mines are Pucará, Bernabé, and Contaglapampa [36].

Soil Samples and Analysis
The morphological inventory and soil sampling considered all soil units developed on the different rock types present in this territory. The sampling points of the soil profiles were therefore selected on a lithological criterion, since in natural soils the heavy metals present are inherited directly from the parent rock. In addition, the content of heavy metals (Pb, Cu, Zn, Ni, Cd, As, Hg, and Cr) produced by primary dispersion from the mineralization of the parent rock (background level) was characterized independently of that produced by mining activity. For this purpose, all soil horizons were sampled: the surface horizons (A and B) and the deep horizons or bedrock (C or R), the latter being unlikely to be contaminated by atmospheric deposition.
Four replicate sub-samples were randomly collected at each sampling point within a 1.5 m × 1.5 m grid and mixed to obtain a composite sample of about 750 g in weight. In this study, a total of 77 soil samples were collected (77 × 4 subsamples = 308). Samples were air-dried and sieved through a 2 mm mesh. The physical and chemical properties were determined by traditional methods of soil analysis: organic matter by oxidation with potassium dichromate, granulometric analysis using the Robinson pipette method, pH (water 1:1), and cation exchange capacity (CEC) using the ammonium acetate method. The total contents of heavy metals in soil were analyzed as per the procedure recommended by the European Union International Organization for Standardization (ISO) standard 11466. Extraction was performed with a mixture of nitric acid and hydrochloric acid in a microwave oven, with determination by inductively coupled plasma mass spectrometry (ICP-MS) (Elan 6000, Perkin-Elmer, Waltham, MA, USA). The analysis was carried out by the Chemical Analysis Service of the University of Salamanca through the digestion of samples in a microwave oven (Ethos Plus Microwave Labstation, Milestone Inc., Shelton, CT, USA), using the standard USEPA method 3052. To calibrate the equipment, standard solutions (Panreac) of 1000 mg/L of all metals were used, with calibration over the range 10-100 ppb.

Statistical Analysis Samples
SPSS software v.23.0 (IBM, Armonk, NY, USA) was used for statistical analysis. The following values of interest were obtained: arithmetic mean, geometric mean, median, range, standard deviation, coefficient of variation, kurtosis, and correlation analysis between heavy metals and soil properties. In addition, various pollution indices were calculated in order to determine the levels of heavy metal contamination in soils. A geographic information system (GIS) (ArcGis v10.4, Esri, Redlands, CA, USA) with the Spatial Analyst and Geostatistical Analyst extensions was used to calculate the degree of spatial variability of each heavy metal using ordinary kriging [37]. Kriging is an advanced geostatistical procedure that generates an estimated surface from a set of scattered values and allows an interactive investigation of the spatial behavior of a variable, in our case soil contamination by heavy metals. Unlike the deterministic interpolation methods IDW and Spline (which depend directly on the surrounding measured values, which determine the smoothness of the resulting surface), the geostatistical kriging interpolation method is based on statistical models that include autocorrelation, i.e., the statistical relationships between the measured points. In the evaluation of soil contamination, this geostatistical technique generates a prediction surface of the distribution of heavy metals and also provides a measure of the certainty or precision of the prediction. To create the prediction surface map, it uses an equation (Equation (1)) that captures the rules of dependence between points and then predicts the spatial or geographical distribution:
$$\hat{Z}(S_0) = \sum_{i=1}^{N} \lambda_i \, Z(S_i) \quad (1)$$
where $Z(S_i)$ = the measured value at location i; $\lambda_i$ = an unknown weight for the measured value at location i; $S_0$ = the location of the prediction; and N = the number of measured values. In this work, the kriging method applied was ordinary kriging, which assumes that the distance or direction between the sample points reflects a spatial correlation that can be used to explain the surface variation.
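The weighted-sum prediction described above can be illustrated with a minimal sketch of ordinary kriging in plain Python. This is not the ArcGIS workflow used in the study: the spherical semivariogram, its parameters (nugget, sill, range), the sample coordinates, and the Pb values are all hypothetical and serve only to show how the weights λ_i are obtained (by solving the kriging system with a Lagrange multiplier that forces the weights to sum to 1) and then applied to the measured values.

```python
import math

def spherical(h, nugget=0.0, sill=1.0, range_=100.0):
    """Spherical semivariogram model; parameter values are illustrative assumptions."""
    if h == 0.0:
        return 0.0
    if h >= range_:
        return sill
    r = h / range_
    return nugget + (sill - nugget) * (1.5 * r - 0.5 * r ** 3)

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x

def ordinary_kriging(points, values, target, variogram=spherical):
    """Return (estimate, weights) at `target` from measured `values` at `points`."""
    n = len(points)
    # Kriging matrix: semivariances between samples, bordered by the
    # unbiasedness constraint (weights sum to 1).
    a = [[variogram(math.dist(p, q)) for q in points] + [1.0] for p in points]
    a.append([1.0] * n + [0.0])
    b = [variogram(math.dist(p, target)) for p in points] + [1.0]
    solution = solve(a, b)
    weights = solution[:n]  # the last entry is the Lagrange multiplier
    estimate = sum(w * z for w, z in zip(weights, values))
    return estimate, weights

# Hypothetical sample locations (m) and Pb concentrations (mg/kg).
points = [(0.0, 0.0), (60.0, 0.0), (0.0, 60.0), (60.0, 60.0), (30.0, 25.0)]
pb = [40.0, 55.0, 35.0, 120.0, 60.0]
estimate, weights = ordinary_kriging(points, pb, (20.0, 20.0))
```

With a zero nugget, the estimate at a sampled location reproduces the measured value there, which is a quick sanity check on any implementation. In practice the semivariogram model would be fitted to the empirical semivariogram of the data rather than assumed.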

Soil Study
The predominant soils in these high altitude sectors of the Andes Mountains are poorly developed and are located in steep areas with strong slopes (Figure 3). Most soils have been classified as Haplic Umbrisols (Hyperdystric and Andic), which are acidic soils (average pH = 4.7) that have developed on volcanic rocks and have a surface horizon (Horizon A) that is very dark to black and high in organic matter (average content of 16.44%), with a low degree of base saturation (mean value = 9.5%). Some of these Umbrisols have andic properties (low bulk density and thixotropy) and can even be classified as Umbric Andosols.
The Andes mountain range is subject to intense erosive processes that increase the transport of sediment and soil to the bottoms of valleys or depressions. In the zones where erosion processes dominate over edaphic processes, soils are poorly developed (Dystric Regosols and Gleyic Cambisols) and form on landslide deposits on slopes. These soils differ from the previous ones in that the surface horizon has a low or moderate content of organic matter (4.65%) and, therefore, a brown color. Cambisols are characterized by a subsurface alteration horizon (Bw), the cambic diagnostic horizon, and are brownish-gray or yellowish-brown in color.
In the studied area, rocky outcrops associated with thin soils classified as Umbric Leptosols are also frequent, and Rendzic Leptosols and Calcic Kastanozems have been observed on limestone rocks. In some areas, there are small depressions where water accumulates (bofedal), and the reducing environment favors the accumulation of organic matter, giving rise to Fibric, Hemic, and Sapric Histosols (peats). In the valley bottoms of the rivers and streams that run through the studied area, recent alluvial sediments have been deposited, consisting of gravel and sand with a thickness on the order of two meters. The soils that have developed in the current channels of streams and rivers are classified as Dystric Fluvisols.
These soils were classified from their morphology and analytical data according to the taxonomy of the World Reference Base (WRB). The statistical data of the main properties studied in the soils of La Zanja, Colquirrumi, and Julcani are presented in Table 1. For the analysis of physical and chemical properties, the compact and hard samples from some C and R horizons were discarded, giving a total of 59 samples analyzed.

Concentration of Heavy Metals in Soils of La Zanja, Colquirrumi, and Julcani
In this study, our aim was to understand the natural contents of the soils (pedogeochemical background) to detect the intensity of the contamination of the soils and to compare the results with norms or regulations of other countries worldwide [38,39].
In the statistical analysis of this study, the data of the most superficial horizons (A and B) and underlying (C) horizons were treated separately. The more superficial horizons or solum were those that provided information on the levels of pollution caused by the processes of soil formation and by anthropogenic sources, whereas the underlying horizon (bedrock) exclusively represents the lithogenic contributions, since there is little probability of contamination through atmospheric deposition [40]. It is for this reason that samples of soil bedrock were used to determine the level of the natural geological background.
The average values of the heavy metals obtained from the statistical analysis of the soils of La Zanja can be seen in Table 2, where other bibliographical data of levels obtained in other countries have also been attached for purposes of comparison.
Background levels of Ni, Cu, Hg, As, and Pb were above the world average. The Zn content was equivalent to the world average, and the concentration of Cr was lower than the world average. The most notable feature of the mining area of La Zanja was the high Cd content of the soils, which was higher than the highest value reported worldwide. [10], although the range has reached as high as 1500 mg·kg−1 [42]. Such high Pb values were so unexpected that the sample analysis was repeated several times. In contrast, Cr had values lower than the background levels reported in other parts of the world. The background level of Ni (12.05 mg·kg−1) was also lower than the world average. Furthermore, in numerous soil samples, the Cu content exceeded the world average (12 mg·kg−1), and in some soils Cu values reached between 100 and 500 mg·kg−1. As for Zn, the background level (165.6 mg·kg−1) obtained in this study exceeded the world average of 40 mg·kg−1, and in some soil samples values higher than 3000 mg·kg−1 were reached. Of the soil samples collected in Colquirrumi, only three did not exceed the world average for As (20 mg·kg−1); the background level (218.1 mg·kg−1) was about 11 times higher than in most other countries in the world, although the reported range reaches as high as 250 mg·kg−1. Large As anomalies were observed in some soil samples, with values on the order of 1000 mg·kg−1. Finally, the background levels of Cd and Hg were high, on the order of 15 and 70 times the world averages. To summarize, the study of heavy metals in soils in the mining area of Colquirrumi highlights the geochemical anomalies in Pb, As, Cd, Hg, Cu, and Zn.
Regarding the values obtained in the soils of Julcani, Cr, Ni, and Cu presented background and reference levels lower than those described in the literature for other parts of the world. None of our soil samples exceeded the world average for Cr content. For Ni content, only two soil samples exceeded the world average background level (25 mg·kg−1). However, the concentrations of Zn and As were slightly higher than those obtained in other countries. The background levels of Pb, Cd, and Hg obtained from Julcani were 13, 14, and 34 times higher, respectively, than the levels obtained in soils from other parts of the world.

Box and Whisker Plots of Heavy Metal Concentrations in La Zanja, Colquirrumi and Julcani
The box and whisker plots in Figure 4 summarize the basic statistics of the concentrations of heavy metals studied in the soils of the Western Cordillera of the Peruvian Andes and allow the three areas studied to be compared graphically. In La Zanja the contents were lower than in Colquirrumi and Julcani, which is explained by the null or weak contamination of the soils of La Zanja, given the lack of mining activities in the sampling year (2006), compared to areas with extensive and ancient mining activities such as Colquirrumi and Julcani. Only the area of Julcani (Huancavelica) had lower Ni and Cu contents than La Zanja (Cajamarca), as the two zones belong to different metallogenic belts.
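As an illustration of what each box and whisker plot encodes, the following sketch computes the quartiles, whisker ends, and outliers for one metal in one area. It assumes the common Tukey convention (whiskers extend to the most extreme values within 1.5 times the interquartile range of the box); the convention actually used for Figure 4 is not stated in the text, and the concentrations below are hypothetical.

```python
import statistics

def box_plot_summary(values):
    """Return the statistics displayed by a box-and-whisker plot (Tukey convention)."""
    data = sorted(values)
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "q1": q1,
        "median": median,
        "q3": q3,
        "whisker_low": inside[0],    # smallest value inside the fences
        "whisker_high": inside[-1],  # largest value inside the fences
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }

# Hypothetical concentrations (mg/kg), not values from the study.
summary = box_plot_summary([1, 2, 3, 4, 5, 100])
```

A single anomalous sample, such as the value 100 here, is reported as an outlier and does not stretch the box, which is why box plots are convenient for comparing strongly skewed geochemical data between areas.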

Geochemical Baseline Study of the Soils of La Zanja
To calculate the geochemical baseline, the geometric mean was used when the heavy metal concentrations followed a log-normal distribution; where the variables were strongly asymmetric and could not be transformed to fit a normal distribution, the median was used [43]. The normality of the metal concentrations was investigated with the Kolmogorov-Smirnov test, although departure from normality can also be inferred from high kurtosis values.
In the case of the La Zanja soils, concentrations of heavy metals from the surface horizons were used to calculate the baseline, which was obtained by adding twice the standard deviation to the geometric mean [25] (Table 3). Soils studied from La Zanja were characterized by high baseline values for Ni, Cu, Pb, Cd, and Hg. The application of the regional geochemical baseline values proposed in this study will allow the rapid identification of sites that could be affected by pollution processes due to current mining exploitation in the area of La Zanja. In addition, this information will also be very useful in the future, when it is time to close the mine and implement a restoration plan to try to reproduce the environmental conditions that existed prior to mining.
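The baseline rule described above (geometric mean plus twice the standard deviation) can be sketched in a few lines. The text does not specify whether the standard deviation is taken on raw or log-transformed concentrations, so this sketch assumes the sample standard deviation of the raw values; the function name and the concentrations are hypothetical.

```python
import statistics

def geochemical_baseline(concentrations):
    """Geometric mean plus twice the sample standard deviation.

    Assumption: the standard deviation is computed on the raw
    (untransformed) concentrations.
    """
    geometric_mean = statistics.geometric_mean(concentrations)
    standard_deviation = statistics.stdev(concentrations)
    return geometric_mean + 2 * standard_deviation

# Hypothetical surface-horizon concentrations (mg/kg), not values from the study.
baseline = geochemical_baseline([10.0, 20.0, 40.0])
```

For these three values the geometric mean is 20 mg/kg and the sample standard deviation is about 15.28 mg/kg, giving a baseline of roughly 50.55 mg/kg. For strongly asymmetric variables, the median would replace the geometric mean, as the text describes.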

Assessment of Environmental Risks: Pollution Rates
The state of heavy metal contamination in the soils of the areas studied in this work was evaluated using several quantitative contamination indices.

Pollution Factor and Nemerow Index
Applying the pollution factor and the Nemerow index to the La Zanja area indicated that most soil samples (70%) were contaminated, 20% were moderately contaminated, and 10% were strongly contaminated, mainly by Cr, As, and Pb. These data do not reflect the environmental conservation status of the territory observed in the field, since La Zanja was a natural area without any type of mining when the samples were collected in 2006. The soil samples from this area were therefore expected to score low or very low on the contamination indices. These results call into question the validity of these indices for calculating the degree of soil contamination. In agricultural soils and in less industrialized areas of the Northern Plateau of Spain [44], these pollution indices classified the soils as contaminated to heavily contaminated. That is, those results were similar to the ones obtained in the Peruvian Andes and likewise did not match the actual degree of contamination. From this experience, we verified that the assessment of environmental risks reflected reality better when other, similar pollution indices were used, such as the geo-accumulation index (Igeo) and the improved Nemerow index (IIN).
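The text does not give the formulas for these two indices. As a hedged sketch, the code below assumes their commonly used definitions: the pollution factor of a metal is its measured concentration divided by its background level, and the Nemerow index is the square root of the mean of the squared maximum and the squared average pollution factors. The function names, the metals chosen, and all numbers are hypothetical.

```python
import math

def pollution_factors(concentrations, backgrounds):
    """Pollution factor per metal: measured concentration / background level."""
    return {metal: concentrations[metal] / backgrounds[metal]
            for metal in concentrations}

def nemerow_index(factors):
    """Nemerow index from single-metal pollution factors (assumed standard form)."""
    values = list(factors.values())
    highest = max(values)
    average = sum(values) / len(values)
    return math.sqrt((highest ** 2 + average ** 2) / 2)

# Hypothetical concentrations and background levels (mg/kg).
sample = {"Cr": 20.0, "Pb": 90.0}
background = {"Cr": 10.0, "Pb": 30.0}
factors = pollution_factors(sample, background)
index = nemerow_index(factors)
```

Because the maximum pollution factor enters the index directly, one elevated metal is enough to raise the overall score, which may help explain why a naturally mineralized but unmined area can be rated as contaminated.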

Geo-Accumulation Index and the Improved Nemerow Index
The Igeo is calculated from Equation (2) [45]:

$$I_{geo} = \log_2\left(\frac{C_i}{1.5\,B_i}\right) \qquad (2)$$

where $C_i$ is the measured concentration of metal i in the soil, and $B_i$ is the background level of metal i. The factor 1.5 corrects for possible variations in the background values of a particular metal in the environment. The Igeo values for the eight heavy metals studied are summarized in Table 4. The Igeo is a single-factor contamination index: it evaluates the presence of each individual metal and its level of contamination in the study area. The IIN, in contrast, shows the degree of general pollution caused by the simultaneous presence of the eight heavy metals. In La Zanja, considering the Igeo, most soils were included in Class 0 (Igeo ≤ 0) and Class 1 (0 < Igeo ≤ 1), that is, uncontaminated to slightly contaminated, with 67.7% and 17.8% of the samples, respectively, and only some soils fell in Class 2 (1 < Igeo ≤ 2), moderately contaminated by Cr, As, and Pb, in 5.45% of the samples. No soils were included in Classes 3, 4, 5, and 6 (from moderately contaminated to extremely contaminated). In Colquirrumi, the soil samples studied were included in all classes ranging from uncontaminated to extremely contaminated soils: Class 0 (48.1% of the samples), Class 1 (16.3%), Class 2 (20.9%), Class 3 (5.4%), Class 4 (9.3%), Class 5 (0%), and Class 6 (16.3%). In Julcani, the soils were classified as Class 0 (54.2%), Class 1 (30.5%), Class 2 (6.9%), Class 3 (5.6%), and Class 4 (1.4%), which ranged from uncontaminated to heavily contaminated.
Considering the Igeo, the levels of contamination (from low to high) by heavy metals in the surface horizons of the soils were as follows: La Zanja < Julcani < Colquirrumi.
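As a worked illustration of the geo-accumulation index and its classes, the sketch below computes Igeo for one metal and assigns a class. The function names and input values are hypothetical. The text states the bounds for Classes 0 to 2 only; the unit-wide bounds used here for Classes 3 to 6 are an assumption based on the usual seven-class scheme.

```python
import math


def igeo(concentration, background):
    """Geo-accumulation index: log2(C_i / (1.5 * B_i))."""
    return math.log2(concentration / (1.5 * background))


def igeo_class(value):
    """Class 0 for Igeo <= 0, then one class per unit of Igeo.

    Assumption: Classes 3 to 6 continue in unit steps (2-3, 3-4, 4-5, > 5).
    """
    if value <= 0:
        return 0
    return min(math.ceil(value), 6)


# Hypothetical example: a concentration three times the background level.
value = igeo(30.0, 10.0)
print(value, igeo_class(value))
```

A sample at exactly 1.5 times the background gives Igeo = 0 and therefore falls in Class 0, which shows how the 1.5 factor absorbs natural variation in the background.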
The IIN was calculated by Equation (3):

$$I_{IN} = \sqrt{\frac{I_{geo\,max}^{2} + I_{geo\,ave}^{2}}{2}} \qquad (3)$$

where $I_{geo\,max}$ is the maximum Igeo value of all metals in a sample, and $I_{geo\,ave}$ is the arithmetic mean of the Igeo values. In La Zanja, according to the IIN, the soils ranged from non-contaminated to moderately contaminated: 20% of samples were in Class 0 (IIN < 0.5), 50% in Class 1 (0.5 ≤ IIN < 1), and 30% in Class 2 (1 ≤ IIN < 2). However, the small degree of contamination indicated by this index may be due to anthropogenic processes (atmospheric deposition of metals over medium or long distances), which increased the geological concentrations in the soils of La Zanja.
The main external sources of heavy metals in the soils of this area are diffuse pollution and wet and dry deposition caused by metal mining and mineral processing in bordering areas, which have led to the long-term accumulation of heavy metals. In Colquirrumi, all soil samples were classified as moderately contaminated
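The improved Nemerow index can be sketched from the two quantities the text defines, the maximum and the arithmetic mean of the Igeo values in a sample. The sketch assumes the commonly used form IIN = sqrt((Igeo_max² + Igeo_ave²) / 2). The function names and the example values are hypothetical, and only the three classes quoted for La Zanja are encoded.

```python
import math


def improved_nemerow(igeo_values):
    """Combine the maximum and mean Igeo of one sample into a single index."""
    i_max = max(igeo_values)
    i_ave = sum(igeo_values) / len(igeo_values)
    return math.sqrt((i_max ** 2 + i_ave ** 2) / 2.0)


def iin_class(value):
    """Return Class 0, 1 or 2 using the bounds quoted in the text.

    Bounds for higher classes are not given in this section, so None is
    returned for IIN >= 2 rather than guessing them.
    """
    if value < 0.5:
        return 0
    if value < 1.0:
        return 1
    if value < 2.0:
        return 2
    return None


# Hypothetical Igeo values for the eight elements in one sample.
sample_igeo = [-0.8, -0.3, 0.1, 0.4, -1.2, 0.9, -0.5, 0.2]
sample_iin = improved_nemerow(sample_igeo)
print(round(sample_iin, 3), iin_class(sample_iin))
```

Because the maximum enters the formula directly, a single strongly enriched metal raises the index even when the mean Igeo is negative.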

Potential Ecological Risk Index
The ecological risk index (Er) evaluates the toxicity of trace elements in sediments and has been extensively applied to soils [46]. Soils contaminated by heavy metals can pose serious ecological risks and negatively affect human health through various forms of interaction (agriculture, livestock, etc.) by which highly toxic heavy metals can enter the food chain. Excessive accumulation of heavy metals in agricultural soils can affect the quality and safety of food and further increase the risk of serious diseases (cancer, kidney and liver damage, etc.), as well as impact ecosystems. The index thus combines environmental chemistry with biological toxicology and ecology [47].
To calculate the Er for individual metals, we used Equation (4):

$$E_r^i = T_r^i \times C_f^i \qquad (4)$$

where $T_r^i$ is the toxicity coefficient of each metal, whose standard values are Hg = 40, Cd = 30, As = 10, Co = 5, Cu = 5, Ni = 5, Pb = 5, Cr = 2, and Zn = 1 [48,49]; and $C_f^i$ is the contamination factor ($C_f^i = C_i / B_i$), where $C_i$ is the measured concentration of the pollutant and $B_i$ is the geological background level.
To calculate the potential response rate to the toxicity of all the studied heavy metals (RI), we used Equation (5):

$$RI = \sum_{i=1}^{n} E_r^i \qquad (5)$$

The potential ecological risk index (RI) reflects the general situation of pollution caused by the simultaneous presence of the eight heavy metals (Table 5).
Considering the individual ecological risk index (Er), the heavy metal levels in the surface horizons of La Zanja soils represented a low contamination risk (Er < 40), with only one soil sample (14.3%) showing a considerable risk of contamination (80 ≤ Er < 160) from Hg.
In Colquirrumi, the majority of the samples (87%) were considered to have a low risk of contamination (Er < 40); a small percentage of samples (4.2%) had a moderate contamination risk (40 ≤ Er < 80) from Cr, Ni, As, and Hg; 8.2% of the samples presented a considerable risk of contamination (80 ≤ Er < 160) from Cd and Hg; and only 0.6% of the soil samples had a high risk of contamination (160 ≤ Er < 320) from Hg.
In Julcani, the majority of the samples (86.1%) were likewise at low risk of contamination (Er < 40); a small percentage (6.9%) had a moderate contamination risk (40 ≤ Er < 80) from As, Cd, Hg, and Pb; 2.8% of the samples presented a considerable risk of contamination (80 ≤ Er < 160) from Hg; and 4.2% of the soil samples had a high risk of contamination (160 ≤ Er < 320) from As and Hg.
If we consider the individual ecological risk (Er) index, the levels of contamination (from lowest to highest) by heavy metals from the surface horizons of soils were as follows: La Zanja < Julcani < Colquirrumi.
When considering the RI of all the metals studied, the contamination levels were as follows. In La Zanja, all soil samples (100%) presented a low potential ecological risk (RI < 150). In Colquirrumi, 38.1% of the samples had a low potential ecological risk (RI < 150); 57.1% presented a moderate potential ecological risk (150 ≤ RI < 300); and 4.8% faced a considerable potential ecological risk (300 ≤ RI < 600). In Julcani, 44.4% of the soil samples presented a low potential ecological risk (RI < 150); 44.4% had a moderate potential ecological risk (150 ≤ RI < 300); and soils with a considerable potential ecological risk (300 ≤ RI < 600) were in the minority (11.1%).
Furthermore, when considering the potential ecological risk index (RI), the levels of contamination (from low to high) from heavy metals from the superficial horizons of soils were as follows: La Zanja < Julcani < Colquirrumi.
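The two ecological risk measures used in this section can be sketched as follows: the individual index is the toxicity coefficient multiplied by the contamination factor, and RI is the sum over the eight elements. The toxicity coefficients and class bounds are those quoted in the text. The function names and the example data are hypothetical, and the labels for Er ≥ 320 and RI ≥ 600 are assumptions, since no sample in the text reaches those ranges.

```python
# Toxicity coefficients quoted in the text for the eight elements studied.
TOXICITY = {"Hg": 40, "Cd": 30, "As": 10, "Cu": 5, "Ni": 5, "Pb": 5, "Cr": 2, "Zn": 1}


def individual_risk(metal, concentration, background):
    """Er = toxicity coefficient * contamination factor (C_i / B_i)."""
    return TOXICITY[metal] * (concentration / background)


def risk_index(concentrations, backgrounds):
    """RI = sum of Er over all metals present in `concentrations`."""
    return sum(
        individual_risk(metal, value, backgrounds[metal])
        for metal, value in concentrations.items()
    )


def er_grade(er):
    """Grade an individual Er value using the bounds quoted in the text."""
    if er < 40:
        return "low"
    if er < 80:
        return "moderate"
    if er < 160:
        return "considerable"
    if er < 320:
        return "high"
    return "very high"  # assumption: label for Er >= 320 is not given in the text


def ri_grade(ri):
    """Grade an RI value using the bounds quoted in the text."""
    if ri < 150:
        return "low"
    if ri < 300:
        return "moderate"
    if ri < 600:
        return "considerable"
    return "very high"  # assumption: label for RI >= 600 is not given in the text


# Hypothetical sample: every metal at background except Hg at twice background.
backgrounds = {metal: 1.0 for metal in TOXICITY}
sample = dict(backgrounds, Hg=2.0)
print(er_grade(individual_risk("Hg", sample["Hg"], backgrounds["Hg"])))
print(risk_index(sample, backgrounds), ri_grade(risk_index(sample, backgrounds)))
```

The example shows why Hg dominates the risk grades reported above: with a coefficient of 40, doubling its concentration alone moves a sample into the considerable Er range.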

Spatial Distribution of Heavy Metal Content in Soil
Ordinary kriging, a geostatistical spatial interpolation method, allowed us to predict the spatial distribution of the concentrations of each metal in the areas not sampled (Figure 5) from the values of the samples collected in the field campaign. The method rests on the fact that samples closer to each other present better correlations and similarity than more distant ones.
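To make the interpolation step concrete, the following is a minimal pure-Python sketch of ordinary kriging at a single unsampled location. It is not the workflow used in the study: the exponential semivariogram, its parameters (`sill`, `range_`, `nugget`), the function names and the sample coordinates are all assumptions chosen for illustration. In practice the variogram is fitted to the data.

```python
import math


def exponential_variogram(h, sill=1.0, range_=1.0, nugget=0.0):
    """Assumed exponential semivariogram model, with gamma(0) = 0."""
    if h == 0.0:
        return 0.0
    return nugget + sill * (1.0 - math.exp(-3.0 * h / range_))


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def ordinary_kriging(points, values, target, variogram=exponential_variogram):
    """Return (estimate, weights) at `target` from sampled points and values."""
    n = len(points)
    # Semivariances between samples, bordered by the unbiasedness constraint
    # that forces the kriging weights to sum to one.
    a = [
        [variogram(math.dist(points[i], points[j])) for j in range(n)] + [1.0]
        for i in range(n)
    ]
    a.append([1.0] * n + [0.0])
    b = [variogram(math.dist(p, target)) for p in points] + [1.0]
    weights = solve(a, b)[:n]  # the last unknown is the Lagrange multiplier
    estimate = sum(w * v for w, v in zip(weights, values))
    return estimate, weights


# Hypothetical sampling sites (arbitrary units) and concentrations (mg/kg).
sites = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
conc = [20.0, 35.0, 25.0, 50.0]
estimate, weights = ordinary_kriging(sites, conc, (0.4, 0.6))
print(round(estimate, 2), [round(w, 3) for w in weights])
```

Repeating the call over a regular grid of target locations would give a map comparable in kind to an interpolated concentration surface. With a zero nugget the estimate reproduces the measured value at each sampled site.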
In the area of La Zanja, the distributions of Ni, Cd, and Pb were similar: their highest concentrations occurred in the NW and W of the studied area, in the surface horizons of Profiles 13 and 17-19, which developed on altered rocks (quartz-alunite, quartz-kaolin, etc.) and on a porphyritic rhyolitic dome. These anomalies were located in two areas currently exploited by Au and Ag mines (the San Pedro Sur and Pampa Verde fields). The highest concentration of Cu was in the SW quadrant of the zone, in Sample 14. The Cr content was abnormally high in the NE quadrant of the zone (Soil 15, developed on porphyritic lavas). The highest concentration of Zn occurred as a concentric circle in the SW quadrant of the studied area, in Sample 14, and As dominated in the
In the Colquirrumi zone, the Zn and Cd distributions were quite similar, since both presented their highest concentrations in the NE and N corners of the studied area: in the surface horizon of Profile 6 (developed on sandstones and intercalated shales with sills in the Inca Formation) in the case of Zn, and in Profile 9 (developed on limestones and marls of the Pariatambo Formation) in the case of Cd. The distributions of As and Hg were also similar, with abnormally high concentrations in the SE quadrant of the studied area, in soils developed on marls, shales and limestones of the Chulec Formation. The highest Pb content was in the NW quadrant, in Sample 5 (soil developed on monzonites and granodiorites). The Cu values identified a geochemical anomaly in the form of two concentric circles located in the N (Soil 6) and SE (Soil 8). Ni was present in the SW quadrant (Soils 4 and 7), and finally, the highest concentrations of Cr were in the NW quadrant (Soil 4) and in the eastern half of the area (Soils 8 and 9).
In the Julcani zone, the distributions of Ni, Cd, Hg, and Pb were similar and decreased in the form of concentric circles from the center of the zone towards the periphery (the highest concentrations were in Soils 22 and 23, developed on dacitic rocks and rhyodacites). The contents of Cu, Zn, and As were abnormally high in the SE part of the studied region (Soil samples 27, 26 and 22, developed on dacites and rhyodacites), which suggests that contamination may have come from several point sources. The Cr values showed higher concentrations in the form of two concentric circles located in the SE and E of the investigated area (Soil samples 26, on dacites and rhyodacites; and 20, on Triassic-Jurassic limestones).
In addition, using both the ordinary kriging interpolation method and ArcGIS 10.4 (Esri), soil contamination indices (IIN and RI) were depicted that reflected the overall pollution situation caused by the simultaneous presence of the eight heavy metals in the three areas studied (Figure 6).

Conclusions
Soils studied in the Andes Mountains (Peru) were characterized by geochemical baselines with high concentrations of potentially toxic heavy metals. The average concentrations of some elements were much higher than the values obtained from other parts of the world, particularly for Pb, Cu, Zn, As, Cd, and Hg.
Several pollution indices (Igeo, IIN, and RI) were used as tools to diagnose pollution. The data revealed that the soils were uncontaminated or slightly contaminated at La Zanja (Cajamarca, Peru), significantly contaminated at Colquirrumi (Cajamarca, Peru), and moderately contaminated at Julcani (Huancavelica, Peru).
When the environmental risk assessment was conducted using the Nemerow index, the soils of La Zanja (a virgin zone without mining operations) had values ranging between contaminated and heavily contaminated; using the improved Nemerow index (IIN), however, most soils fit within Classes 0 and 1 (not contaminated to moderately contaminated). The percentage and degree of soil contamination therefore differed greatly depending on the contamination index used. It is thus our belief that the IIN was much more accurate in assessing the environmental risks of heavy metal contamination.
The distribution patterns of heavy metal concentrations were mainly influenced by the lithology and geochemistry of the parent rock and by the existence of metallogenic belts in the studied area.
It is our goal that the results of this study contribute to creating, or be added to, an environmental database of soils in this mining region of the Peruvian Andes, and that they facilitate the development of management strategies and the remediation of soils contaminated by heavy metals.